HCMC Journal

DVPP 2024-05-13 to 2024-05-17

to : Martin Holmes
Minutes: 420

Over the weekend, converted Temple Bar 21-30 into XML and OCRed a decade-year. On Monday, replaced the old documentation on running the SQL-to-TEI process with new documentation for the spreadsheet-to-TEI process, and tweaked the documentation CSS. Also on Monday, discovered that 765 images had been created with the wrong file extension, so I spent a while fixing those and all the references to them.

On Tuesday morning, worked through the risky process of renaming all the Temple Bar files and fixing all the links to them, to introduce the leading zero which we now know we’ll need. Also discussed the chapter with AC, and watched a video of JP talking about his annotated run of AYR.
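
For reference, a minimal sketch of the zero-padding step, not the script actually used. The file-name pattern (`tb_` followed by a volume number), the padding width, and the function names are all assumptions made for illustration; the real Temple Bar names may differ. Running one regular expression over both file names and link targets keeps the two in step, and the collision check is there because two distinct old names could pad to the same new name.

```python
import re

# Hypothetical naming pattern: "tb_" followed by a volume number,
# e.g. tb_21_p045.xml. The real project file names may differ.
VOLUME_RE = re.compile(r"(?<![A-Za-z0-9])(tb_)(\d+)(?!\d)")


def pad_volume(text, width=3):
    """Zero-pad every volume number in text (a file name or a whole document)."""
    return VOLUME_RE.sub(lambda m: m.group(1) + m.group(2).zfill(width), text)


def plan_renames(names, width=3):
    """Map old names to new names, refusing any plan that would create duplicates."""
    plan = {}
    for old in names:
        new = pad_volume(old, width)
        if new != old:
            plan[old] = new
    # Names after the rename: changed ones plus those left alone.
    targets = list(plan.values()) + [n for n in names if n not in plan]
    if len(set(targets)) != len(targets):
        raise ValueError("rename would produce duplicate file names")
    return plan
```

Because already-padded numbers are left unchanged, the same function can be run again over files and links without doing further damage.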

On Wednesday, met with AC to discuss the Dickens work, and wrote code to generate new analytical spreadsheets showing the counts of poems by poet in each periodical. Then wrote some documentation on how to encode titles and links, and then some Schematron to constrain leading and trailing spaces, which then threw up dozens of errors to fix.
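
The poems-by-poet counts can be sketched roughly as below. This is an illustration only, assuming each poem is available as a (periodical, poet) pair; the function names and the sort order are hypothetical and are not the project's actual code.

```python
from collections import Counter


def count_poems(records):
    """Count poems per (periodical, poet); records holds one pair per poem."""
    return Counter(records)


def as_rows(counts):
    """Flatten counts into spreadsheet rows: periodical, poet, number of poems.

    Rows are sorted by periodical, then by descending count, then by poet.
    """
    return sorted(
        ((periodical, poet, n) for (periodical, poet), n in counts.items()),
        key=lambda row: (row[0], -row[2], row[1]),
    )
```

The rows can then be written out with the standard `csv` module, one sheet per periodical or one combined sheet.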

On Thursday, processed the next batch of Temple Bar poems into XML, OCRed some more decade-year poems, and discussed genre tagging with AC.

On Friday, picked a good long complicated poem to encode, to keep my hand in and test all our shortcuts and encoding mechanisms. Then fixed a bunch of badly-encoded links in notes, and added a diagnostic that catches 298 more, which the RAs can work on.