Added wordlist output
Created simple wordlist output formats, so we can get a handle on how best to deal with <gloss> tags. They're drawing from the complete set of entries, edited or not. They show a couple of things that need fixing, most of which we know about, but they also show that there's a tendency to include the same gloss multiple times in an entry. For instance, in an edited entry we have this:
ṣə̣́nṣə̣nt: tame, gentle, quiet, tame, tame, gentle
The entry shows why: there are multiple <gloss> tags containing the same words sprinkled through the <entry>.
It's possible for me to write the code so that it ignores these duplicate entries, but there are a couple of problems with that: first, SMK and ECH would be doing lots of extra tagging that we're ignoring, making the entries more complicated, and second, the generation of wordlists will take much longer because it'll have to check every <gloss> to see if it's a duplicate. So I think a good policy would be to make sure we only tag a particular word or phrase once as a gloss in any given entry.