Category: Tasks

29/05/08

Permalink 01:44:32 pm, by pat, 41 words, 232 views   English (CA)
Categories: Tasks; Mins. worked: 0

Documents needing editing

Attached is an excel file listing the documents already in the database that require editing.

I suppose they will have to be isolated from the dbase so that Kim and I can edit them.

List of files needing to be extruded.

27/05/08

Permalink 03:55:37 pm, by pat, 95 words, 364 views   English (CA)
Categories: Tasks; Mins. worked: 360

Trench Warfare

We continue to wade through the unedited documents, adding every superscript, period and circumflex that was missed in the first encoding offensive. We are hopeful tomorrow will conclude this task.

Martin, you mentioned being able to maybe correct a repetitive error I have made. In the bibl tag under "date when" we have been using the date the office registered the file, rather than the date given to it by its original author. The date given by the author needs replace the incorrect date in the bibl section.

Do you think you can remedy this?

26/09/07

Permalink 12:16:25 pm, by clifton, 314 words, 301 views   English (CA)
Categories: Tasks; Mins. worked: 0

tag extraction

I created a php page (Located here: http://web.uvic.ca/~lang02/fall_07/despatches/crawl.php) to parse through the BC and VI folders of the ColDesp project and remove all waterloo script tags starting with a period.

The php page was able to extract 912 "tags" (and other items) in total. While creating crawl.php I noticed two interesting things to note about these tags that might make it difficult to parse through to capture the remaining data correctly. There are no closing tags (even if what should be enclosed within the tag spans multiple lines. Tags seem to be able to end with either a space, semi-colon or asterisk or exclamation mark.

I noticed from the results of crawl.php that individual names and reference files were also captured because they began with periods and I am not sure if these are special tags or not. Here are some examples:

Tag 55: ...Allan,
Tag 56: ...Anderson,
Tag 57: ...Auckland,
Tag 58: ...B00101
Tag 59: ...B00102
Tag 60: ...B00103
Tag 749: ...V03419
Tag 750: ...V03501
Tag 751: ...V03502
Tag 752: ...V03503

I also noticed that there were some tags that look the same but either end or begin differently: Here are some examples:

Tag 25: .'us
Tag 908: .us
Tag 909: .us!

Tag 1: $$cdes
Tag 592: ...DES
Tag 824: .Des
Tag 850: .des

Tag 880: .pa
Tag 881: .par
Tag 882: .par'7.{{California
Tag 883: .par'[Personal?]
Tag 884: .par2

There are also tags that I was able to pull out that I could not find reference for in the manual. Here are some examples:

Tag 3: .!
Tag 4: .!9399-10199-11307
Tag 5: ."
Tag 6: .$50,000
Tag 7: .'+32
Tag 8: .'+41
Tag 9: .'+50
Tag 10: .'2c
Tag 11: .'35
Tag 12: .'45
Tag 13: .'47
Tag 14: .'55
Tag 27: .*
Tag 28: .-›(c&S't
Tag 29: ..
Tag 30: ...
Tag 31: ..."
Tag 32: ....
Tag 33: .....
Tag 34: ......
Tag 35: .......
Tag 36: ........
Tag 37: ........!
Tag 38: .........
Tag 39: ..........
Tag 40: ...........
Tag 41: ............
Tag 42: .............
Tag 43: ..............
Tag 44: ...............
Tag 45: ................
Tag 46: .................
Tag 47: ..................
Tag 48: ....................
Tag 49: .........................................................
Tag 50: ..................25,550.-.--
Tag 51: ...............€3000
Tag 52: ......€4750
Tag 822: .12.
Tag 823: .2c
Tag 882: .par'7.{{California
Tag 883: .par'[Personal?]

Tag 864: .key
Tag 865: .keyx

Tag 862: .ix

<< Previous Page ::

Colonial Despatches

The Colonial Despatches is an XML database project which is creating a digital archive containing the original correspondence between the British Colonial Office and the colonies of Vancouver Island and British Columbia. The project lives at http://bcgenesis.uvic.ca, and the web application runs on the Pear dev Tomcat. The XML data is managed in SVN at http://revision.tapor.uvic.ca/svn/coldesp/.

Reports

Categories

September 2014
Sun Mon Tue Wed Thu Fri Sat
 << <   > >>
  1 2 3 4 5 6
7 8 9 10 11 12 13
14 15 16 17 18 19 20
21 22 23 24 25 26 27
28 29 30        

XML Feeds