plan data normalizing for TimesColonist db
Reviewed files and met with John Lutz to work out how to normalize dates in the Times Colonist transcript database. The main problem is publication date vs. event date: how to distinguish them, what to do if one or both are missing, and to what degree instances of each can be automatically culled from the text.
If a date appears at the start of a record, treat it as the publication date, and assume it is also the event date unless another date can be found elsewhere in the record. If no date appears at the start of the record, leave the publication date field empty and try to find the event date in the text of the record.
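The rule above could be sketched roughly as follows. This is only an illustration under assumptions: the date format ("January 5, 1901"), the function name `split_dates`, and the sample records are all hypothetical, and the actual transcripts may write dates quite differently.

```python
import re
from datetime import date
from typing import Optional, Tuple

MONTHS = ["January", "February", "March", "April", "May", "June", "July",
          "August", "September", "October", "November", "December"]

# ASSUMPTION: dates look like "January 5, 1901". Adjust to the real data.
DATE_RE = re.compile(r"\b(" + "|".join(MONTHS) + r") (\d{1,2}), (\d{4})\b")


def _to_date(match: "re.Match[str]") -> date:
    """Convert a DATE_RE match to a date (ValueError if the date is impossible)."""
    month, day, year = match.groups()
    return date(int(year), MONTHS.index(month) + 1, int(day))


def split_dates(record: str) -> Tuple[Optional[date], Optional[date]]:
    """Return (publication_date, event_date) for one record (hypothetical helper)."""
    text = record.lstrip()
    lead = DATE_RE.match(text)            # a date at the very start of the record
    publication = _to_date(lead) if lead else None
    rest = text[lead.end():] if lead else text
    later = DATE_RE.search(rest)          # first date found elsewhere in the record
    # With no other date, fall back to the publication date (which may be None).
    event = _to_date(later) if later else publication
    return publication, event
```

For a made-up record such as "January 5, 1901. The ship sank on December 30, 1900." this gives a publication date of 1901-01-05 and an event date of 1900-12-30. A record with no leading date and no date in its text gives `(None, None)`, which would need manual review.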
Each record will include a "publicationDate" field and an "eventDate" field. In addition, each record will include a "publicationName" field, as other datasets include material drawn from more than one publication.
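One possible shape for such a record is shown here as a small Python dataclass. The three field names come from the plan. The `text` field, the ISO 8601 date strings, and the sample values are assumptions made for illustration only, not a decided schema.

```python
from dataclasses import dataclass, asdict
from typing import Optional


@dataclass
class Record:
    publicationName: str            # e.g. "Times Colonist"
    publicationDate: Optional[str]  # ASSUMPTION: "YYYY-MM-DD", None if missing
    eventDate: Optional[str]        # ASSUMPTION: "YYYY-MM-DD", None if missing
    text: str                       # hypothetical field holding the transcript


# Made-up example: no leading date, so publicationDate stays empty.
example = Record(
    publicationName="Times Colonist",
    publicationDate=None,
    eventDate="1900-12-30",
    text="The ship sank on December 30, 1900.",
)
row = asdict(example)  # plain dict, ready to load into a table
```

Keeping both date fields nullable lets a record carry only an event date, only a publication date, or neither, without inventing a placeholder value.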
Will eventually incorporate tables for smaller db's on Architecture, Boer War, and the West Coast of Vancouver Island.