Posting hours from isolation log 2021-01-04 to 2021-01-08.
Just a note to self for future reference: One of our source documents came in the form of full-page-spread images, and rather than manually crop them (there are 370), I wrote a bit of JS to generate a bash script which uses ImageMagick to do it. First I renamed all the input files so that they were -0001, -0003, -0005 etc. to allow room for the new ones, then used this:
function writeCropLines(num){
  // Input spread and left-hand output page share the same odd number.
  let padNum = num.toString().padStart(4, '0');
  // Left-hand part of the spread.
  let output = 'convert mgchau-' + padNum + '.jpg -crop ';
  output += '1377x1936+0+0 out/mgchau-' + padNum + '.jpg ';
  output += '\u000a';
  // Right-hand part of the spread goes to the next (even) number.
  let padNum2 = (num + 1).toString().padStart(4, '0');
  output += 'convert mgchau-' + padNum + '.jpg -crop ';
  output += '1497x1936+1096+0 out/mgchau-' + padNum2 + '.jpg ';
  output += '\u000a';
  return output;
}
function doAll(){
  let output = '';
  // Odd numbers 1 to 739: one pass per spread, 370 in all.
  for (let i=1; i<740; i+=2){
    output += writeCropLines(i);
  }
  return output;
}
console.log(doAll());
in the browser dev tools to generate the script.
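The renaming step mentioned above could be scripted the same way. This is only a rough sketch, and it assumes the spreads started out numbered sequentially (mgchau-0001.jpg to mgchau-0370.jpg), which the note does not actually say; the function names pad and writeRenameLines are made up for illustration.

```javascript
// Hypothetical sketch: assumes the original spreads were numbered
// sequentially, mgchau-0001.jpg to mgchau-0370.jpg (not stated in the note).
function pad(num){
  return num.toString().padStart(4, '0');
}

function writeRenameLines(count){
  let output = '';
  // Spread n becomes odd-numbered file 2n - 1. Working from the highest
  // number down means each target name is already free when it is used.
  // Spread 1 keeps its name, so the loop stops at 2.
  for (let n = count; n >= 2; n--){
    output += 'mv mgchau-' + pad(n) + '.jpg mgchau-' + pad(2 * n - 1) + '.jpg\n';
  }
  return output;
}

console.log(writeRenameLines(370));
```

Going in descending order matters: renaming upwards from 0002 would overwrite 0003 before it had been moved.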
As of today, the Scandinavian-Canadian Studies Journal, and the Robert Graves Diary project, both maintained by HCMC, have migrated to pure static websites built according to Endings principles and using the staticSearch engine.
Isolation log week 42
Work done from home and in the office 2021-01-04 to 2021-01-08.
Total hours: 37.00
G&T hours: +2.00
Monday 2021-01-04
0.25 hours: Update Jenkins servers.
0.75 hours: Maint: Investigation/testing of dart-sass install to replace our current sassc, which is now deprecated; after next reboot of Jenkins I'll switch MyNDIR and LEMDO over to that, and we'll see how well it works.
6.00 hours: LEMDO: Complete rewrite of document type taxonomy, update of all XML documents and schemas, and fixes to processing required as a result. This will leave us in a better position to provide good search filter options. Lots of cleanup was needed, and weaknesses in our XSLT are evident.
Total: 7.00 hours
Tuesday 2021-01-05
0.25 hours: Update Jenkins servers.
0.50 hours: LEMDO: Figure out and fix overnight build break.
0.50 hours: Maint: Update work desktop, lab machines, and HCMC laptop.
1.50 hours: HCMC staff meeting.
0.50 hours: MoEML: Discuss solution to encoding five-period figure; implement schema and schematron changes.
3.00 hours: Scancan: Static site launched by LW; rewrote the build process to remove all eXist content, fixed a couple of bugs, updated some info pages, moved obsolete files, added publication tasks, tested, and published latest build.
0.25 hours: Endings: Added Winter 2020 update to site.
0.25 hours: MoEML: Find and fix reported bug in diagnostic.
1.00 hours: Wendat: Process MS 60 into facsimile document ready for encoding; examine McGill-Chaumonot page-images and try various approaches to splitting the spreads into individual pages. Will need to be scripted using ImageMagick, I think.
Total: 7.25 hours
Wednesday 2021-01-06
0.25 hours: Update Jenkins servers.
1.75 hours: Wendat: Split and remediate page-images to create 740 individual page-images for McGill-Chaumonot MS; create TEI file for the MS, add new doctype category for grammars, and update stats.
3.00 hours: LEMDO: Fix menus and listings pages broken by changes to taxonomies; fix bugs in rendering linebreaks; work out and test encoding of verse with rhyme; remediate FV_Q1 to fix hundreds of bad entity references.
1.25 hours: Endings: Project meeting and tweaks on bulletin board.
0.75 hours: Endings/staticSearch: Added new What's New section to documentation, and did other prep for the upcoming release.
0.50 hours: LEMDO: Track down and fix spacing issue in QME toolbox.
Total: 7.50 hours
Thursday 2021-01-07
0.25 hours: Update Jenkins servers.
2.50 hours: Maint: With GN, testing of Tomcat 9, eXist apps, and different JDKs to determine what we need on the replacement for Peach. Long discussions of staticization strategies and search functionality for CGWP, Francotoile, VIHistory and other sites.
2.50 hours: LEMDO: Working on overnight build break, discussions of tracking for document history versus document status and Schematron, fixes to many documents and to Schematron rules.
1.00 hours: MoEML: Project meeting and
1.00 hours: Endings: Update to website to add Symposium page; started remediating the horrible old HTML to create something processable, so I can more easily maintain the site with a build process.
Total: 7.25 hours
Friday 2021-01-08
0.25 hours: Update Jenkins servers.
0.75 hours: LEMDO: New taxonomy for document history.
1.50 hours: TEI: Weekly meeting with SB: working on the Japanese ruby proposal.
1.00 hours: MoEML: Fix for diagnostics and other tweaks to complete all my outstanding tickets.
2.25 hours: LEMDO: Write basic documentation for new taxonomies; update conversion processing chains to add in new categories when generating documents; addition of new categories to existing documents.
2.25 hours: Endings: Plans for tech paper; created build process for site, removing piles of unwanted JS and abstracting core components into XSLT variables; added new pages and tweaked existing pages; got everything to validate; published latest version.
Total: 8.00 hours
Posting 21 hours of G & T used to cover the three non-vacation days between Christmas and New Year.
Posting hours from isolation log 2020-12-21 to 2020-12-23 (short week, taking vacation on the 24th).
Isolation log week 41
Work done from home and in the office 2020-12-21 to 2020-12-23 (short week, taking vacation on the 24th).
Total hours: 21.50
G&T hours: +0.50
Monday 2020-12-21
0.25 hours: Update Jenkins servers.
0.50 hours: LEMDO: Add Schematron rule to try to prevent build break from last night. Took a few shots to get it right.
2.00 hours: Endings/staticSearch: Cleaned up and pruned some old branches; merged JT's branch for apostrophe handling; fixed a bug in scroll-to-text-fragment URL construction; tested it in Graves and MoEML; created a new branch for issue 51, document sort keys, which we need for the Despatches; implemented and tested it.
1.00 hours: ColDesp: Configured new metatags for all documents consisting of sort keys as required by JT for search result returns.
3.25 hours: LEMDO: Handled CSS margin ticket; discussed with JT and JJ the current behaviour of popups on the site, agreed on a more elaborate multiple-popup implementation, planned out the details and created a ticket for it; fixed broken behaviour whereby when annotations and collations were turned off in the toolbox, their signals/symbols remained confusingly visible; discussion with JJ on peer-reviewed status versus catRef; decided on status, implemented and re-edited existing documents accordingly.
0.50 hours: Wendat: Integrated translations from ML into project stats page; added new stat for unclear passages, and integrated that into diagnostics chart.
Total: 7.50 hours
Tuesday 2020-12-22
0.25 hours: Update Jenkins servers.
0.50 hours: ColDesp/Endings/staticSearch: Test results of new sort key implementation in Despatches build; inform Despatches team; create pull requests on staticSearch repo to land the code in the dev branch.
1.00 hours: Despatches: Switch back to dev branch of staticSearch following merge; create icons for different doc types to use in search results, and integrate into site build.
1.00 hours: HCMC Staff meeting.
2.00 hours: LEMDO: Project meeting; fixes for and Schematron to prevent two types of nested links; add harvesting of required XML source documents into anthology trees; add document status row to page metadata.
0.50 hours: Endings/staticSearch: Write documentation for new sort key feature; triage tickets and create new milestone ready for a decision on 1.1 release.
1.75 hours: MyNDIR: Setting up initial Author Mode CSS for easier direct proofing in Oxygen per discussion with PB.
Total: 7.00 hours
Wednesday 2020-12-23
0.25 hours: Update Jenkins servers.
4.00 hours: Endings/staticSearch/Graves/ColDesp: Raise and complete ticket for format consistency for meta tags; bring Graves and ColDesp into alignment with the new forms and test. Triage all tickets assigned to Release 1.1, and assign some to 1.2, summarize others and raise questions, and start dialog with JT on the ones which may be implemented for the upcoming 1.2. Do some code-polishing and documentation, particularly in the JS for the search.
1.25 hours: MoEML: More fixes for broken and redirected links throughout the collection, based on W3C linkchecker results (page #23).
0.25 hours: ColDesp: Tweak size of search page hit doc icons.
1.00 hours: Scancan: Set up vol 28 master with what we have so far; test-built PDF and made tweaks to processing to handle odd characters in headings.
0.25 hours: MyNDIR: More tweaks to Author Mode CSS.
Total: 7.00 hours
Posting hours from isolation log 2020-12-14 to 2020-12-18.
Isolation log week 40
Work done from home and in the office 2020-12-14 to 2020-12-18.
Total hours: 40.00
G&T hours: +5.00
Monday 2020-12-14
0.25 hours: Update Jenkins servers.
0.50 hours: Endings/staticSearch: Fix a couple of issues discovered as a result of Scancan work this weekend.
2.00 hours: Scancan: Set up Jenkins job; integrate staticSearch; test and debug default search behaviour; start implementing custom overrides to provide links to old volumes and to current PDFs.
0.25 hours: LEMDO: Add new Schematron to prevent egXMLs in headings.
1.00 hours: Endings: Library archiving meeting.
1.00 hours: Mariage: Unsuccessful attempts to get current eXist configured as existing Mariage webapp is configured. May have to go for full-blown staticization instead.
2.50 hours: Scancan: More testing, debugging, and refining the integration of staticSearch. Almost done.
Total: 7.50 hours
Tuesday 2020-12-15
0.25 hours: Update Jenkins servers.
1.00 hours: LEMDO: Fixing a couple of build breaks introduced by unexpected encoding patterns in documentation. Found workaround encodings and added Schematron to prevent unwanted patterns.
1.00 hours: Scancan: Tweaks to search weighting to push old full-volume PDFs down the list in relevance scoring (still not properly tuned).
2.00 hours: HCMC Staff Meeting.
3.00 hours: LEMDO/MoEML: MoEML Project meeting, LEMDO QME state-of-play meeting, implementing four changes arising from that, testing and debugging.
0.50 hours: Endings/staticSearch: Did a documentation ticket, followed up on a couple of other issues.
0.50 hours: Mariage: Debugging XQuery code no longer working on new eXist. Determined it was due to using early implementation of XQuery maps code, subsequently removed from the spec and no longer supported by eXist. Could be rewritten, but probably best to put the time into staticizing Mariage.
0.25 hours: LEMDO: Diagnosing and fixing build break due to double link.
Total: 8.50 hours
Wednesday 2020-12-16
0.25 hours: Update Jenkins servers.
0.25 hours: MoEML: Diagnose overnight build break and change colliding id.
1.50 hours: Endings/staticSearch: first implementation of KWIC pointers to document fragment ids (issue 88) along with scroll-to-text-fragment, which now seems to be working.
1.00 hours: Endings/diagnostics: debug issues raised on ticket, check and test problem with Nashorn JS engine, which is no longer supported in Oxygen's JDK; with JT, implement partial solution. More to do here.
0.50 hours: Keats: Fix JS bug I noticed while testing staticSearch. Test switch to new fragment-supporting staticSearch, but page-load JavaScript undermines it; needs more investigation.
1.00 hours: Scancan: Add fragment ids to major article sections; update vnu validator version and fix new warnings resulting; test with new staticSearch branch, but still need to rewrite custom override JS to have this work properly.
2.00 hours: Endings/staticSearch: add feature detection to allow regular fragment navigation in non-Chrome browsers; test in other browsers; discussions of the best way forward with scroll-to-text-fragment. Switched MoEML to the test branch to see the results tomorrow.
0.50 hours: Emails on Moses, Endings meeting poll, ticket triage.
Total: 7.00 hours
Thursday 2020-12-17
0.25 hours: Update Jenkins servers.
0.50 hours: Endings/staticSearch: find scrolling bug when search string is empty, raise ticket, commit fix (hopefully), and kick off MoEML build to test. Following test with Keats, switched it back to dev branch, since page-load JS interferes with it anyway. More discussion on ticket.
0.25 hours: Scancan: fix for obsolete @type on style tags.
0.25 hours: LEMDO: Re-encoded bibliography using proper tagging.
0.50 hours: Endings/staticSearch: Confirm bugfix worked, cherry-pick commit into dev, close ticket; read background to Mozilla's position on scroll-to-text-fragment.
0.50 hours: Scancan: Port recent staticSearch changes into override JS, build, bugfix and test; add more robust fragment id generation that will not change with every build; add lang attribute to root element of all files; fix broken ids in old volumes.
2.00 hours: MoEML: Test scroll-to-fragment and fragment-link features in MoEML search; add document status descriptor to search page; implement special styling for semi-diplomatic texts, and fix XSLT bug in handling of lgs in the process.
1.00 hours: Wendat: Implement tagging protocol for "interesting things", and add documentation and harvesting process.
2.25 hours: ColDesp: Working on errant spaces, and discovered a lot of erroneous tagging that needed fixing, as well as lots of XSLT sloppiness of my own making, now fixed; not clear yet whether all problems are solved.
0.50 hours: Wendat: Project meeting.
Total: 8.00 hours
Friday 2020-12-18
0.25 hours: Update Jenkins servers.
2.00 hours: ColDesp: Deep dive into remaining issues with unwanted whitespace around quotes and footnote numbers; various fixes committed at different stages of the build process. Now apparently all fixed.
0.25 hours: DVPP: Working on getting an RA paid for work in the fall. Not yet resolved.
1.25 hours: TEI: Weekly meeting with SB and NC to work on Stylesheets issue for adding anchors to examples. Worked through a CORS problem on my Jenkins at the same time (blocking of inline styles).
4.00 hours: Endings/staticSearch/Scancan/MoEML: Building on JT's work on XSLT yesterday, adding new scrollToTextFragment switch, documentation, and implementation; tested in the various other projects; worked through some plans on tickets; after JT merged changes, switched projects back to dev branch; created schema patching stage in build process so we can get rng interleave output (working around TEI bug); updates to documentation.
0.25 hours: Keats: Added Schematron to help avoid bad image links.
1.00 hours: LEMDO: Discussion with JT on video issues; researched adding poster frames to videos, and scripted addition of poster frames to 21 scene videos.
Total: 9.00 hours
Edition 3.7 of Dr. Kim Blank's excellent site Mapping Keats's Progress has been released.