Today I went through the process of pulling a batch of OCRed docs from the repo (already organized by AM) and turning it into TEI, then doing the name markup. As expected, the name markup takes only a couple of minutes per page; the other stuff is the difficult bit. Rebuilt the site, and then we had a project Hangout meeting and discussed CSS etc. with the designer.
PM has been working assiduously on Saga.local to rewrite the Wheelock units, and today we built the first six and posted them for MN to OK before we proceed. They're in hrd/latin/new_wheelock. The data is all on Saga, but PM is backing up for himself, and I've got the first six units locally.
Received request from BAK to update site with new student blog entry.
Task completed; confirmation email sent to BAK.
Dealt with some tickets, closed some obsolete ones, discussed others in meeting. Also added XSLT handling for tables in the XML, tested and tweaked it. Existing tables now working well.
Using the code I rewrote yesterday, I've now integrated the treaty spreadsheet into the main personography. I had to resolve a few duplicate ids, so I've now written an additional output page for the web site which lists all existing ids (except for purely local ones) so that assignment is easier.
Both PS and I worked up the new pages for gallery and tweets, and reworked the menu structure. I think this should be done now.
The encoder package build script was not expecting packages without images. Now it is.
At AB's request, I've done new versions of the WordCloud from tweets during the conference. One includes all retweets and the content is relatively un-curated; in the other two, I stripped out all the retweets, got rid of &, URLs, "th" and "st" after numerals, hash tags, and handles, and expanded "St" to "Saint" (some of these requested by AB). The sites I used were: https://tagul.com/create and http://www.wordclouds.com/
Turned these into basic TEI with a lot of regexps and crafty style-based search-and-replaces in LibreOffice.
Meeting with GL and DH; following that, I used a range of hacky approaches to convert Word doc content from the English and French Convention of 40 docs to basic TEI, for GL to split out into debate days.
:: Next Page >>