Log in

HCMC Journal

BNF startup

: Illya Nokhrin
Minutes: 360

Downloaded BNF files from their Github repo. All files are encoded in TEI-XML. Generally pretty similar to LEMDO, with some exceptions. There are only two completed texts: The Sea Voyage and The Spanish Curate. Each has an edited (ie. modernized) and a base text (ie. semi-dip) edition. I’ve written XSLT to process both types of files to get them closer to LEMDO formatting. There are still some decisions to be made (for example, what to do with the student-create seg attributes that BNF used to tag terms that matched a hand-picked number of categories). Unsurprisingly, the most complex part of the transformation was dealing with quotation marks, since BNF uses straight quotes for their texts. There is also a 22-item Works Cited list for The Sea Voyage, which we can hand convert to LEMDO’s bibliography format. The reference links will have to be updated after that conversion is complete. There will also be other aspects that will have to be remediated by hand. With only 4 play text files, my instinct is to not spend too much time trying to write complex XSLT for things that can be done fairly easily using hand remediation.