Log in

HCMC Journal

Duplicate ids and a new diagnostic

: Martin Holmes
Minutes: 55

The addition of some new documents from old ISE supplementary texts resulted in many duplicate id errors, and when we had diagnosed and fixed those, we realized that LEMDO does not have Schematron which enforces our normal requirement that all ids in a document must begin with the id of the root element. This simple requirement is sufficient to reduce the duplicate id problem substantially in other large projects. However, some quick XPathing revealed that there are over a thousand problem cases already, so we can’t just add Schematron and break everything, so I’ve ended up adding a new diagnostic which lists them, and when they’re finally cleared, we can add the Schematron and remove the diagnostic.