Autophonemicizer update
I remembered something today that will make it much easier to autophonemicize pron:segs.
In Lexware, all the entries were hyphed. MDH took the same input to generate both the pron:segs (removing the morpheme delimiters) and the hyphs (adding the m-tags).
So we need use the contents of the hyphs, minus their m-tags, to generate the pron:<seg type="p" subtype="i">s. Then the morpheme delimiters will be included, and we can use the same autophonemicizer transformation that we used for the <phr>s. Then we simply remove + - = √ from the pron:segs again. Then we can collapse duplicate pron:segs, the same as we're doing for <phr>s.
So we don't have to worry about writing complicated rules using a long list of prefixes to find the "root-initial" context in pron:segs without morpheme delimiters.
We will also autophonemicize the contents of the hyphs, by removing the m-tags, autophonemicizing, and then adding the m-tags back in based on the morpheme delimiters.
Autophonemicizer2.doc contains detailed instructions for all this.
MDH has also implemented a transformation to collapse identical segs within the same pron. This happens before the autophonemicization.
Visions of Autohyphenators dance in my head! Maybe next year ...