Read and annotated one of the JAEH articles, and also Skyped with JSR to prep for the interviews tomorrow; created a diagram to support one of the questions.
The RBCM will be working on a public-facing edition of our land titles data, so I've put together an overview of the db and the data structure along with an EER diagram for them; waiting to hear back as to what they'd like us to do in terms of pre-processing or transforming the data.
After using the Maple Ridge db as the case study for the article work, I've now done a final pass through all the results from the algorithm and generated SQL commands which have been run to add ethnicity for owners. These are the steps:
- Check all name lists manually. There were a couple of Indian names which were not picked up by the Lauderdale and Kestenbaum list process. Researched those, and a couple of other oddities.
- Generated SQL statements from the name lists. These are in three files in svn for the record.
- Copied live to dev db, and backed up live db.
- Uncommented the ethnicity field in the local_classes.php file in both databases and tested that the field is available.
- Ran SQL on dev db and tested results.
- Ran SQL on live db and tested results.
- Made a second backup of the live db.
Sent it to JSR as a docx.
Lots of back and forth with SK at NNM to complete retrospective report on activities (Research, Dissemination, Training) in both parts of our cluster, and to set major milestones and budget for upcoming year. Also worked out location of student labour (almost all at NNM) and workflow and tasks for stuff project needs for integration with material developed by other clusters.
Submitted 2 documents to JSR for review by Executive Committee.
Much polishing to do, and formatting/footnoting/referencing, but the bulk is done, and I've added a flowchart diagram which helps clarify things. I think it'll come in at about 25 pages at 12pt double-spaced.
As part of work on the article, ran the algorithm against the Maple Ridge land titles db data, and did some background research to reduce the number of ja-xProvs. I have the output counts for the initial phase and the phase after research; some interesting/problematic cases emerged, which I'm discussing with MA and JSR, but in general everything worked as expected (virtually zero genuine false positives; all provisionals are "interesting" and need research; no false negatives).
This is the first part of the last section. I think we're going to end up with about twelve single-spaced letter pages (there are nine right now) plus images, assuming those are allowed. Made some tweaks to documentation and to the regex in the process of writing; it's help;ful to be re-examining all this in the process of writing the article.
Talked with SK about five major individuals :
Kagetsu (Logging), Kimura (fishing & committee), Saito (shoolmaster), Tonomura (business), Nakatani (?fishing)
She's put in for one big grant and one small grant to pay for work on these in addition to what the LofI can pay for.
Also, worth a look through the Campbell fonds to see if there are documents there that provide individual details esp. on chattels
She's also directing two UVic students in a course to go through the protest letters and identify dramatic stories or recurrent themes that might inform future work on products for the project.
Spent a lot of time today on the name "Ide", which is a good illustration of the wild variation in phonological representation versus kanji for surnames; that's made a para for the paper, but there is one individual case that's proving a thorny one to resolve due to the fact that surname and forename are apparently reversed in the one instance of the name in a city directory, and the fact that the person doesn't show up in either of our community sources for the same year.