Posted report on progress to end of May at
https://basecamp.com/2690203/projects/6501222/messages/58643365
We continue to work on the 1949 person index for Vancouver. There are 604 pages in the person directory and we have gone through 240 of them so far consuming about 15 person-days. This is going very slowly as the documents are hard to read. I notice that in my last posting I reversed the references to the 1949 person and 1949 street indexes. I.e. I said we'd completed the person index and had begun on the street index when in fact we had completed the street index and had begun on the person index.
In collaboration with Jordan's group, we have assembled a spreadsheet for about 300 letters of protest collected by the Custodian and identified the presence in each of about 16 types of claim (e.g. lack of consent, violation of rights, explicit mention of fishing assets), author names and other details.
We have also OCR'd, transcribed and proofed 3 of the 4 lists of unsold properties held by the custodian. The first list has about 480 records. The second and third are sublists of the first (i.e. no new properties have been added), so rather than transcribe all that again, we're adding "appears on list 2" and "appears on list 3" columns to the first list and putting a 1 or 0 in those as appropriate for each record.
The fourth list of about 100 records consists of some properties on the first list and some new lists, and the OCR isn't helpful, so we'll manually transcribe the new records and put a "1" in the "appears on list 4" column for the records that appear on lists 1 through 3. It looks like we may have to add some columns to the tables to accommodate the structure of the entries in the fourth list, which are far less consistent than the first list.
Next week :
- we will begin analyzing some of the protest letters and some of the Campbell fonds letters to figure out what modifications we have to make to the schema file (e.g. elements for the opening and closing material of letters), and markup a sample of documents to test the modifications and estimate time needed to transcribe 1 to 2 page letters.
- continue work on the 1949 person directory.
- we still have all the directory files listed last time in the job hopper.
- talk with Martin about adding ethnicity attribution to names in our XML files.