markup for phonemic representations
To clarify which phonemic transcriptions in the database were recorded by MDK, and which have been derived by ECH from MDK's narrow transcriptions, we are adding subtype="unattested" to the markup of derived forms.
A phonemic form transcribed by MDK will still be marked up like:
<pron>
<seg type="phonemic">ṣə̣́nṣə̣nt</seg><bibl>JM3.20.11</bibl>
<seg type="narrow">sə́nsə̀nt</seg><bibl>Y24.40</bibl>
</pron>
(An appropriate disclaimer can be programmatically added to all of MDK's phonemic forms in the final output.)
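As a rough sketch of how such a disclaimer might be added at output time, the Python below appends a note to every phonemic <seg> that carries no subtype attribute. The function name render_phonemic and the disclaimer wording are hypothetical, invented for illustration; the actual output pipeline and text are not decided here.

```python
import xml.etree.ElementTree as ET

# Hypothetical wording; the real disclaimer text has not been specified.
DISCLAIMER = "phonemic form as transcribed by MDK"


def render_phonemic(pron_xml, disclaimer=DISCLAIMER):
    """Return the phonemic forms in a <pron>, adding a note to attested ones.

    Assumes a phonemic <seg> with no subtype attribute is one that MDK
    transcribed directly, as in the example above.
    """
    pron = ET.fromstring(pron_xml)
    forms = []
    for seg in pron.iter("seg"):
        if seg.get("type") != "phonemic":
            continue  # narrow transcriptions are left alone
        form = (seg.text or "").strip()
        if seg.get("subtype") is None:
            form = f"{form} [{disclaimer}]"
        forms.append(form)
    return forms
```

Because the note is generated from the markup, nothing extra needs to be stored in the files themselves.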
A phonemic form derived by ECH will be marked up like:
<pron>
<seg type="phonemic" subtype="unattested">ṣə̣́nṣə̣nt lx</seg><bibl>ECH</bibl>
<seg type="narrow">ṣə̣́nṣə̣nt ləx</seg><bibl>JM3.21.1</bibl>
</pron>
A phonemic form derived by MDK (one with no source noted in the file cards or lexware database) will be marked up like:
<pron>
<seg type="phonemic" subtype="unattested">ṣə̣́nṣə̣nt lx</seg><bibl>MDK</bibl>
<seg type="narrow">ṣə̣́nṣə̣nt ləx</seg><bibl>JM3.21.1</bibl>
</pron>
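The three cases above can be told apart mechanically from the subtype attribute and the sister <bibl>. The following is a minimal Python sketch, assuming the <bibl> immediately follows its <seg> as in the examples; the function name phonemic_status and the labels "attested" and "derived" are my own, not part of the schema.

```python
import xml.etree.ElementTree as ET


def phonemic_status(pron_xml):
    """List (form, status, source) for each phonemic <seg> in a <pron>.

    status is "derived" when subtype="unattested" is present, otherwise
    "attested"; source is the text of the <bibl> that immediately follows
    the <seg>, or None if there is none.
    """
    pron = ET.fromstring(pron_xml)
    children = list(pron)
    results = []
    for i, el in enumerate(children):
        if el.tag != "seg" or el.get("type") != "phonemic":
            continue
        nxt = children[i + 1] if i + 1 < len(children) else None
        source = nxt.text if nxt is not None and nxt.tag == "bibl" else None
        status = "derived" if el.get("subtype") == "unattested" else "attested"
        results.append((el.text, status, source))
    return results
```

A derived form whose source is ECH or MDK then identifies who did the deriving, while an attested form's source points to the place where MDK recorded it.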
This applies to <phr>s as well as <seg>s. Martin has rewritten the schema to allow for subtypes of <phr>s.
Every <seg> and <phr> should now have a sister <bibl> showing its source.
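A rule like this is easy to check by script. Below is a minimal Python sketch that reports every <seg> or <phr> not immediately followed by a <bibl>; it assumes the elements carry no XML namespace and that "sister" means the next sibling, as in the examples above. The function name missing_bibl is hypothetical.

```python
import xml.etree.ElementTree as ET


def missing_bibl(xml_text):
    """Return (tag, text) for each <seg>/<phr> with no following sister <bibl>."""
    root = ET.fromstring(xml_text)
    problems = []
    # Visit every element and inspect its children in document order.
    for parent in root.iter():
        children = list(parent)
        for i, child in enumerate(children):
            if child.tag not in ("seg", "phr"):
                continue
            nxt = children[i + 1] if i + 1 < len(children) else None
            if nxt is None or nxt.tag != "bibl":
                problems.append((child.tag, (child.text or "").strip()))
    return problems
```

Note that this simple check would also flag entries whose source is recorded by a single <bibl> higher up the tree, so it would need extending before being run over files that use that shortcut.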
I have carried these changes through the s-rtr, c-rtr, and phar-w files, and will implement them in the affix file next.
It is also possible to put <bibl> tags higher up the tree; e.g., if everything in an entry came from the same source, only one <bibl> tag would be needed, as a sister to the whole <entry>.