Archives for: May 2012

30/05/12

Permalink 08:55:55 am, by mholmes, 33 words, 307 views   English (CA)
Categories: Activity log, Announcements; Mins. worked: 30

CO 60 Vol 10 page images added to the Colonial Despatches collection

767 page images for CO 60 Vol 10 (in three different sizes) have been added to the collection. These cover the 1861 Despatches from London, January to August. These will now be linked into the transcription documents.

29/05/12

Permalink 05:37:25 pm, by mholmes, 83 words, 78 views   English (CA)
Categories: Activity log; Mins. worked: 240

ContentDM metadata now imported

Today's progress:

  • Finished the XSLT, and ran it against the whole collection.
  • Went through those files with no matches in the ContentDM repo, and manually ported over info from similar files, and elaborated what was there based on map legends etc.
  • Tweaked the XSLT to add a link to the ContentDM repo.

Still to do: rework the processMapBibl template so that it really uses all of the info that's now there (author, publisher, etc. etc.). This should probably be done with regular templates.

28/05/12

Permalink 10:00:43 am, by mholmes, 189 words, 150 views   English (CA)
Categories: Activity log; Mins. worked: 210

Mapping between ContentDM metadata and TEI

This is the complete mapping for copying metadata over from the ContentDM records to our TEI files:

  • dc:title (multiple): titleStmt/title, bibl/title.
  • dc:description[1]: notesStmt/note (replace the first one).
  • dc:description[preceding-sibling::dc:description][string-length(.) gt 50]: notesStmt/note (add new ones). These are the textual descriptions; the shorter ones are various scale and coordinate details.
  • dc:description[matches(., "^[0-9]+[ 0-9'NW\-\./]+$") and string-length(.) gt 3]: bibl/geo. These one-line expressions of geo locations will have to be further processed into something we can use to map to Google. They're not really in a consistent format.
  • dc:subject (multiple) = notesStmt/note type="subject".
  • dc:creator = bibl/author.
  • dc:contributor[not(preceding-sibling::dc:creator)][not(starts-with(., 'Fund'))] = bibl/author.
  • dc:language == 'eng' : bibl/@xml:lang = 'en'
  • dc:language == 'spa' : bibl/@xml:lang = 'es'
  • dc:contributor[starts-with(., 'Fund')] = funder.
  • dc:publisher = bibl/publisher
  • dc:relation = bibl/publisher (really should be repository, but we don't want to get into having a full msIdentifier).
  • dc:identifier[starts-with(., 'http://contentdm')] = idno type="contentdm".
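
The dc:description rules are the fiddly part of this mapping, so here is a minimal Python sketch of how they sort a record's descriptions. It is only an illustration: the real work is done in XSLT, and the function name, the result keys and the sample strings are all made up for the example. The rules are applied independently, as written above, so a value could in principle land in more than one bucket.

```python
import re

# Same character class as the XPath pattern in the mapping; fullmatch()
# stands in for the ^...$ anchors.
GEO_PATTERN = re.compile(r"[0-9]+[ 0-9'NW\-./]+")


def classify_descriptions(descriptions):
    """Sort the dc:description values of one record (in document order).

    Hypothetical helper: the keys of the returned dict are illustrative
    labels, not TEI element names.
    """
    result = {"first_note": None, "extra_notes": [], "geo": []}
    for position, text in enumerate(descriptions):
        if position == 0:
            # dc:description[1] replaces the first notesStmt/note.
            result["first_note"] = text
        elif len(text) > 50:
            # Later descriptions longer than 50 characters become new notes.
            result["extra_notes"].append(text)
        if GEO_PATTERN.fullmatch(text) and len(text) > 3:
            # One-line coordinate expressions go to bibl/geo.
            result["geo"].append(text)
    return result


# Invented sample values, not taken from any ContentDM record.
sample = [
    "Chart of the harbour.",
    "49 15 N 123 07 W",
    "Shows soundings, anchorages and settlements along the coast in some detail.",
    "1:63360",
]
classified = classify_descriptions(sample)
```

In this sample the scale value "1:63360" falls through all three rules: it is short, and the colon is not in the geo character class.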

I'm now halfway through the XSLT which will integrate the metadata into the TEI files. Should be done tomorrow.

25/05/12

Permalink 09:47:00 am, by mholmes, 54 words, 113 views   English (CA)
Categories: Activity log; Mins. worked: 60

Mapping between ContentDM metadata and TEI

This is my preliminary mapping:

  • dc:title (multiple): titleStmt/title, bibl/title.
  • dc:description[1]: notesStmt/note (replace the first one).
  • dc:subject (multiple) = notesStmt/note type="subject".
  • dc:creator = bibl/author.
  • dc:language == 'eng' : bibl/@xml:lang = 'en'
  • dc:language == 'spa' : bibl/@xml:lang = 'es'
  • dc:contributor[starts-with(., 'Fund')] = funder.
  • [ more to come later... ]
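
As a rough illustration of the two value-based rules in this list (language codes and funders), here is a small Python sketch. The actual port will be XSLT; the names LANGUAGE_MAP, map_language and is_funder are hypothetical, and only the 'eng'/'spa' pairs and the 'Fund' prefix come from the mapping itself.

```python
# Only these two pairs appear in the mapping; anything else is left unmapped.
LANGUAGE_MAP = {"eng": "en", "spa": "es"}


def map_language(dc_language):
    """Return the bibl/@xml:lang value for a dc:language code, or None."""
    return LANGUAGE_MAP.get(dc_language)


def is_funder(dc_contributor):
    """Mirror starts-with(., 'Fund'): such contributors map to funder."""
    return dc_contributor.startswith("Fund")
```

Returning None for an unknown code leaves it to the caller to decide whether to skip the attribute or flag the record for checking.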

24/05/12

Permalink 03:00:26 pm, by mholmes, 113 words, 106 views   English (CA)
Categories: Activity log; Mins. worked: 240

Matching part of the process finished

Spent most of the day manually aligning records between ContentDM and ColDesp, so this is where we're at:

  • DONE: Manually edit the XHTML file to fix bad matches among the candidates.
  • DONE: Search for matches for the unmatched items manually.
  • DONE: Add matches found back into the XHTML.
  • Generate from the XHTML a list of pairings from which metadata can be brought over.
  • Map desired metadata fields in ContentDM OAI file to TEI.
  • Write XSLT to port the metadata into the TEI files.
  • Update the map gallery rendering code to include the new metadata.

Also wrote to CP with a list of 7 maps that we have, but which are apparently missing from ContentDM.

23/05/12

Permalink 03:14:27 pm, by mholmes, 150 words, 98 views   English (CA)
Categories: Activity log; Mins. worked: 240

Matching with ContentDM records

More progress on matching with ContentDM. I've now generated an XHTML file with two tables, one of candidate matches (186 maps) with links to both ColDesp and ContentDM, for human checking, and one of failed matches (33 maps from ColDesp), with ColDesp links and enough metadata for a manual search. I've manually verified the 186 candidate matches and found that most match; I reported one map apparently missing from ContentDM to CP, and found a dupe in ColDesp.
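
For anyone wanting to reproduce the two-table split, here is one way it could be sketched in Python. This is an assumption-laden illustration, not a record of how the candidates above were generated: it assumes matching on title similarity using difflib from the standard library, and the function names, the 0.8 threshold and the sample titles are all invented.

```python
from difflib import SequenceMatcher


def normalise(title):
    """Lower-case and collapse whitespace so trivial differences are ignored."""
    return " ".join(title.lower().split())


def split_matches(coldesp_titles, contentdm_titles, threshold=0.8):
    """Split ColDesp titles into candidate matches and failed matches.

    Hypothetical approach: pick the most similar ContentDM title and
    accept it as a candidate only if the similarity reaches the threshold.
    """
    candidates, failed = [], []
    for title in coldesp_titles:
        best, best_score = None, 0.0
        for other in contentdm_titles:
            score = SequenceMatcher(
                None, normalise(title), normalise(other)
            ).ratio()
            if score > best_score:
                best, best_score = other, score
        if best is not None and best_score >= threshold:
            candidates.append((title, best))
        else:
            failed.append(title)
    return candidates, failed


# Invented titles for illustration only.
candidates, failed = split_matches(
    ["Plan of Victoria Harbour", "Sketch of Fort Langley"],
    ["Plan of  Victoria harbour", "Chart of Esquimalt"],
)
```

Whatever the scoring method, the candidates still need human checking, which is why both tables carry links for manual verification.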

Next steps:

  • Manually edit the XHTML file to fix bad matches among the candidates.
  • Search for matches for the unmatched items manually.
  • Add matches found back into the XHTML.
  • Generate from the XHTML a list of pairings from which metadata can be brought over.
  • Map desired metadata fields in ContentDM OAI file to TEI.
  • Write XSLT to port the metadata into the TEI files.
  • Update the map gallery rendering code to include the new metadata.

22/05/12

Permalink 11:33:43 am, by mholmes, 34 words, 218 views   English (CA)
Categories: Activity log, Announcements; Mins. worked: 20

CO 305 Vol 18 page images added to the Colonial Despatches collection

910 page images for CO 305 Vol 18 (in three different sizes) have been added to the collection. These cover the 1861 Vancouver Island Public Offices and Miscellaneous Correspondence. These will now be linked into the transcription documents.

10/05/12

Permalink 02:55:36 pm, by kim, 42 words, 133 views   English (CA)
Categories: Tasks; Mins. worked: 15

Apostrophe rendering glitch?

EDIT: Fixed 2012-05-23. In this file, hover over the word "Majesties," which has sic/corr tags around it, the intention being to correct it to "Majesty's." In the hover-over pop-up, the apostrophe renders as the hex-code for an apostrophe. Very strange!

09/05/12

Permalink 01:23:28 pm, by mholmes, 36 words, 269 views   English (CA)
Categories: Activity log, Announcements; Mins. worked: 60

Complete set of CO 305 Vol 17 page images added to the Colonial Despatches collection

The complete collection of 1208 page images for CO 305 Vol 17 (in three different sizes) has been added to the collection. These cover the 1861 Vancouver Island Despatches to London. These will now be linked into the transcription documents.

Colonial Despatches

The Colonial Despatches is an XML database project which is creating a digital archive containing the original correspondence between the British Colonial Office and the colonies of Vancouver Island and British Columbia. The project lives at http://bcgenesis.uvic.ca, and the web application runs on the Pear dev Tomcat. The XML data is managed in SVN at http://revision.tapor.uvic.ca/svn/coldesp/.
