HCMC Journal

Ben Jonson 2026-02-23 to 2026-02-27

to : Martin Holmes
Minutes: 280

On Wednesday, I focused on mechanisms for discovering plausible dates for all the documents and applying them as staticSearch meta tags for the search page. I wrote one tool that builds a lookup from document ids to dates, harvested from the TEI source data, and then supplemented it with a second route that discovers dates within the HTML files themselves (required specifically for the performance records, which apparently have no TEI source). By the end of the process, over 2,400 pages from the site have working dates, and most of those which don’t are not really eligible for them anyway, being About pages, listings pages, and so on. A handful of documents should have dates but have never had them, and there’s not much we can do about that. We’re now ready to start building the staticSearch page and creating the genre-specific links to it.
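For the record, the two-stage date discovery described above could be sketched roughly as follows. This is a minimal illustration in Python, not the actual tooling. The function names are hypothetical, and so are the assumptions that each TEI root carries an xml:id, that the first date/@when in a file is the one wanted, and that the performance-record HTML exposes its date in a <time datetime="..."> element. The meta tag follows the staticSearch date-filter convention (class="staticSearchDate"); the filter name "Date" is a placeholder.

```python
import re
import xml.etree.ElementTree as ET

TEI_NS = "{http://www.tei-c.org/ns/1.0}"
XML_ID = "{http://www.w3.org/XML/1998/namespace}id"


def build_date_lookup(tei_sources):
    """Map each TEI document's xml:id to the first date/@when found in it.

    Assumption: the root element carries the document id and the first
    dated <date> element is the relevant one.
    """
    lookup = {}
    for source in tei_sources:
        root = ET.fromstring(source)
        doc_id = root.get(XML_ID)
        if doc_id is None:
            continue
        for date_el in root.iter(TEI_NS + "date"):
            when = date_el.get("when")
            if when:
                lookup[doc_id] = when
                break
    return lookup


def date_from_html(html):
    """Fallback for pages with no TEI source.

    Assumption: the date sits in a <time datetime="..."> element.
    """
    match = re.search(r'<time[^>]*\bdatetime="(\d{4}(?:-\d{2}){0,2})"', html)
    return match.group(1) if match else None


def add_date_meta(html, doc_id, lookup):
    """Insert a staticSearch date meta tag, or return the page unchanged."""
    date = lookup.get(doc_id) or date_from_html(html)
    if date is None:
        return html  # e.g. About pages and listings pages
    meta = f'<meta name="Date" class="staticSearchDate" content="{date}"/>'
    return html.replace("</head>", meta + "\n</head>", 1)
```

Pages for which neither route yields a date are passed through untouched, which matches the treatment of About and listings pages.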

On Friday, I did some cleanup of document titles and in the process discovered a number of documents lacking useful titles; these will have to be retrieved from the XML sources.