This is now working well, and the output looks reasonable. Along the way we found and fixed many minor data and encoding issues (building these tables is a good way to reveal them). Some questions remain outstanding, but it's remarkable how much this one set of tables reveals.