Moses 2024-10-07 to 2024-10-11
to : Martin Holmes
Minutes: 275
On Monday, got some very useful feedback from EH on the online dictionary interface; as a result, I put in an explicit exclusion for the examples listing page from the search indexing, and also modified the pre-processing so that all internal notes are removed before the phase of creating the website, since there were a couple of contexts in which their content had ended up showing up even though it’s not supposed to.
On Wednesday, had some discussion on the tokenization problem affecting the undertie character, which in Unicode is a punctuation character and therefore is a tokenization point, but in Nx seems to be used as a word character. Then worked on a build break caused by my wiping out the workspace on Jenkins; it turned out that a copy operation which provided a jar for sorting in the utilities folder had been commented out at some point. Then, following discussion with SK and further research, discovered that the undertie is actually in the subclass of Connector Punctuation, which PERL includes in its word character (\w) class. That’s a strong argument for its being handled as a word character, so I raised an issue on staticSearch and created a PR with a fix and a test. Then fixed a couple of remaining issues on the print dictionary per SK.
On Thursday, continued work on the PDF, adding the final bibliography section, which required some work to style the entries, and also to handle long URLs which need to be able to wrap.