Log in

HCMC Journal

Jonson XML to TEI remediation progress

to : Illya Nokhrin
Minutes: 360

Continued working on converting CEWBJ XML files for modernized texts to TEI-XML. There is a fair bit of variety in the encoding for cast lists. Have opted to account for the two most popular formats (divs with p elements and tables with rows/cells). Others will likely have to be encoded manually.

After discussion with MH, used an accumulator for the insertion of to anchors for collation and annotation notes. This requires a second pass, which I built into the main XSLT file using XSLT’s transform() function. There are some places where it fails (in which case I’ve had it insert a comment saying that the to anchor will have to be moved manually), but for the most part it works quite well.

Getting close to finishing what is possible with XSLT-based conversion/remediation for this.