Work on ethnicities in directories and land titles data
Rewrote the diagnostics file so that it processes the existing directories files rather than those created in a temp directory when the algorithm was being developed; build the diagnostics into the main build process and directed its output to the products directory which is preserved. Reported on the results and the work required to complete this task to JSR and MT, who will presumably be doing it.
Then, per discussions with JSR, changed the Other East Asian ethnicity to Other Asian, and ported this change to the original Land Titles db. Abstracted the ethnicity assignment code into a separate module so it can be applied to any source, and then created a new library which uses it to process the Land Titles (Powell St) db, XML version, and creates a diagnostic output that shows where the current algorithm would differ from the original assignments which were made by humans. Analyzed these results and reported on the work which will be involved in revisiting the 600+ names for which this process disagrees with the original one.