Archives for: August 2012

30/08/12

Permalink 03:30:51 pm, by mholmes, 18 words, 37 views   English (CA)
Categories: Activity log; Mins. worked: 60

Working on contributor roles

With JJ, have been working on aligning our byline attribution phraseology with Marc Relator terms. We're almost there.

29/08/12

Permalink 04:15:30 pm, by mholmes, 58 words, 31 views   English (CA)
Categories: Activity log; Mins. worked: 45

Added eventography and sample event

Added <eventList>, <event> and <rs> into the schema, then created the eventography file, with one sample event, and marked the event up in two files using <rs>, as a demonstration for CB. Once we have some real data in the eventography, I'll add the requisite XSLT to make it functional.

Permalink 04:13:53 pm, by mholmes, 95 words, 32 views   English (CA)
Categories: Activity log; Mins. worked: 120

Lots of fixes for name markup

Fixed several hundred encoding oddities and errors related to name tags and docAuthor tags. Also updated Schematron to try to catch some of these. We're now in a position where we have almost all names marked up as either @type="person" or @type="pageant", and the question now is whether we should simply make "person" the default, and therefore optional, and save a bunch of space in files by deleting all instances of it. I see no reason not to do that. There are some name tags which lack it already, and they're working properly.

28/08/12

Permalink 03:29:04 pm, by mholmes, 56 words, 38 views   English (CA)
Categories: Activity log; Mins. worked: 180

Byline processing

Working on standardizing the current byline markup so that I can use it as the basis for generating more formal <respStmt> elements using our taxonomy of responsibilities. I have all names now marked up as names, but the informal descriptions of their roles are going to be very difficult to parse into clear categories.

Permalink 12:36:09 pm, by mholmes, 27 words, 52 views   English (CA)
Categories: Activity log; Mins. worked: 60

list/@type values normalized and documented

Standardized our usage of @type on <list>, removing a couple of obsolete usages, adding documentation to the schema, and tweaking the XSLT. Many files changed.

27/08/12

Permalink 08:00:20 am, by mholmes, 4 words, 35 views   English (CA)
Categories: Activity log; Mins. worked: 20

Timesheets done

Timesheets for Aug/2 done.

24/08/12

Permalink 10:41:55 am, by mholmes, 23 words, 50 views   English (CA)
Categories: Activity log; Mins. worked: 180

Date code change done

Processed all the files, tested locally, made some fixes, and migrated the changes to the live db. Also updated documentation on date encoding.

23/08/12

Permalink 04:59:23 pm, by mholmes, 112 words, 47 views   English (CA)
Categories: Activity log; Mins. worked: 420

Preparation for date code change

Lots of changes, most of them relating to the date coding changeover tomorrow, but also others:

  • Added Twitter link to footer.
  • Finished and tested all changes to XSLT for dates.
  • Centralized ajax storage div in includes file.
  • Finished writing and correcting date encoding instructions.
  • Wrote comparison XSLT (which runs against itself) to check the boilerplate.xml caption segments against the backups in captions.xsl, and report on discrepancies. Migrated changes back from boilerplate to XSLT file.
  • Fixed display of some captions which happen to have tags in them (which I wasn't expecting).
  • Tested new date code on my local app with a converted personography.

Good to go for the big conversion tomorrow.

22/08/12

Permalink 04:00:36 pm, by mholmes, 47 words, 47 views   English (CA)
Categories: Activity log; Mins. worked: 60

Bugfix for ID page

Links to biblio items on the ids page weren't working. I've now fixed it so that they show a little popup with the full biblio item. In the process, I identified a slight stupidity in XHTML page construction, so I've set myself a task to fix it.

21/08/12

Permalink 04:26:42 pm, by mholmes, 90 words, 37 views   English (CA)
Categories: Activity log; Mins. worked: 120

Preparation for date encoding conversion

On Friday I'll be converting all our existing date encoding to make proper use of the custom dating and calendar attributes, as well as updating the XSLT to take account of this, and providing new documentation. Today I wrote and tested the XSLT conversion code for the source documents, and wrote the first half of the documentation for encoders. What remains:

  • Complete documentation for encoders.
  • Rewrite XSLT to handle appropriate attributes.
  • Make all changes in local test db, and ensure everything is working.
  • Commit changes to SVN.
  • Update live db.
Permalink 11:31:45 am, by Janelle Jenstad Email , 103 words, 66 views   English (CA)
Categories: Encoding Notes; Mins. worked: 0

Principles for tagging dates

We don’t need to mark-up dates in Contributor bios. They function as strings of characters in the bios.

1700 would be a reasonable cut-off for other dates. After 1700, we’re into secondary sources, not primary sources. And whatever happened on a location after 1700 isn’t really “early modern” by the conventions of the discipline of English studies. (Early modern is more capacious for historians.)

The guiding principle needs to be: are we going to do anything with this date? Is it harvestable data, or information that we need to manage the site effectively? If not, then it’s just a string of characters.

20/08/12

Permalink 04:02:56 pm, by mholmes, 188 words, 65 views   English (CA)
Categories: Activity log; Mins. worked: 150

Java sort comparator for MoEML

I've rewritten my original TitleSortComparator class so that it not only handles leading articles but also decomposes the ash character into an ae sequence before doing the comparison. This handles the outstanding issues with sorting of both the historical personography and the bibliography.

This is what had to be done:

  • The new class was written with Eclipse, and is called MolSortComparator. It should be used as a model for future comparators and collators for eXist-only (no Cocoon) projects.
  • There is also a test class.
  • The class is assigned to the ca.uvic.hcmc.mol package.
  • The jar file is created using File / Export..., and is named identically to the class (don't know if this matters or not, but it's what I've always done in the past).
  • It is uploaded into WEB-INF/lib.
  • It is called as follows:
    <xsl:sort select="persName/reg" order="ascending" collation="http://saxon.sf.net/collation?class=ca.uvic.hcmc.mol.MolSortComparator"/>
    

After deployment, eXist needs to be restarted, but I was able to do that from the Tomcat Manager interface (although it took several minutes for the app to shut down).

17/08/12

Permalink 02:10:22 pm, by mholmes, 46 words, 40 views   English (CA)
Categories: Activity log; Mins. worked: 120

Date conversion code is now live

I think it's all basically working correctly, except that now we will have to convert from using @*-custom attributes, and I'll then have to rewrite the mouseover code accordingly. The XML and XSLT will have to be changed at the same time. That's for next week.

16/08/12

Permalink 03:29:52 pm, by Cameron, 80 words, 288 views   English (CA)
Categories: Announcements; Mins. worked: 0

Marking up reigns

When marking up dates, tag monarchical reigns with notBefore and notAfter as follows:

When Stow says "yet then called the riuer of the Wels, which name of Ryuer continued: and it was ſo called in the raign of Edwarde the firſt"

tag the reign:

<date notBefore="1272-11-20" notAfter="1306-07-07">raign of <hi>Edwarde</hi> the firſt</date>

This post based on an email exchange between JJ and SM.

15/08/12

Permalink 05:31:03 pm, by mholmes, 153 words, 43 views   English (CA)
Categories: Activity log; Mins. worked: 120

First date conversion popups now working

I have Julian to Gregorian date conversion popups working for simple dates (@when only). This is not yet on the live site because date ranges are not yet working, but I hope to have them in place soon.

There is a lot of unnecessary conversion between string representations and actual xs:date values all over the code, due to the way it's grown by accretion, and the way we've changed our minds about rendering rules. I should really look at this again, and see if I can take strings on input, convert to xs:date, do all manipulation with xs:date, and then render out right at the end. The problem here is that when we have partial dates (e.g. year-only dates), constructing an xs:date object requires the addition of month and day values, otherwise it fails; this introduces a spurious precision to a date which consists only of a year.

Permalink 04:03:03 pm, by mholmes, 43 words, 32 views   English (CA)
Categories: Activity log; Mins. worked: 60

Baptism dates now working correctly

Baptism dates are now rendering as prescribed by CB. This took longer than expected, because it's a special case, but still needs to be part of the regular rendering pipeline so that projected handling of non-gregorian calendar dates works as expected with it.

14/08/12

Permalink 03:31:11 pm, by mholmes, 95 words, 79 views   English (CA)
Categories: Activity log, Tasks; Mins. worked: 20

Automatic TOC generation must be revised

At present, TOCs are only generated for documents which have multiple <text> elements. This is resulting in documents which should never have multiple texts being created that way, just so they get an automated TOC. Obviously this is silly, and I need to revise the system so that it continues to support the existing documents (while we change them -- although perhaps not all will change), but supports TOC generation for single-text documents based on some measure of complexity. (Or perhaps all documents with multiple div elements should get a TOC by default?)

Permalink 03:27:00 pm, by mholmes, 117 words, 44 views   English (CA)
Categories: Activity log; Mins. worked: 120

Julian to Gregorian date conversion working

I now have a working and tested fuction which, given a Julian date at any level of specificity (YYYY, YYYY-MM, or YYYY-MM-DD) will return either a single Gregorian equivalent, or (more likely) a sequence of two Gregorian dates representing a corresponding range, accounting for the leap day offset and the March New Year issues.

Meanwhile, we've determined that in the context of the personography, birth and death dates will be rendered as en-dash-separated years only where both are precise and certain (i.e. @when). In other cases, the renderings will be split into b. xxx d. xxx clauses, so that the ranges, precision, certainty etc. on the two components (which may be different) can be expressed unambiguously.

Permalink 01:41:57 pm, by Cameron, 83 words, 34 views   English (CA)
Categories: Activity log; Mins. worked: 1

Stow: Second Pass Encoding

The following from SM (examples in this file):

  1. Check page break
  2. Check fw
    Is header (running title) correct? Is page number correct?
    Is signature correct?
    Is catchword correct?
    Remove any empty form works elements
  3. Add comments for printer's ornaments
  4. Proof text itself – insert comments to editor if anything is unclear, in doubt, or needs to be glossed - Check hyphens and insert comments for double hyphens
  5. Identify any foreign languages (Consult http://mapoflondon.uvic.ca/tei_markup.htm if necessary)
  6. Mark up dates

13/08/12

Permalink 05:38:15 pm, by mholmes, 261 words, 38 views   English (CA)
Categories: Activity log; Mins. worked: 360

Beginning some serious work on dates

I've started work on the date conversion code for handling Julian vs Gregorian proleptic. It's astoundingly complicated, but for now I've decided to handle two core features: the increasing day offset caused by the Julian leap year miscalculation, and the fact that between 1155 and 1751 the year was generally viewed as starting on March 25. There are dozens of other potential gotchas, but this will at least give us a basic way of handling conversion that will be approximately right nearly all the time.

The core ideas are:

  • Where @calendar="mol:julian", the encoder will either do a calculation to create gregorian date values (which will only really happen if there's solid evidence to enable this), or they'll encode the date string using @when-custom and friends from att.datable.custom.
  • If this is a transcription of a source, the original text will go into the date element, and that will be what's rendered. If there's no source text, then the date element may be empty.
  • In the former case, the original text will be rendered faithfully, and a mouseover will be created explaining the date in gregorian.
  • In the latter case, a rendering will be created according to our styleguide (based on MLA and on practice from elsewhere that CB is collecting), and again, a mouseover will give expanded detail.

I've spun off all the XSLT relating to dates into a separate module, and created a testing module that I'm building as I go along, so that we can verify everything is working without too much extra work whenever we make a change.

10/08/12

Permalink 02:31:20 pm, by mholmes, 68 words, 42 views   English (CA)
Categories: Activity log; Mins. worked: 120

New fictional personography, bugfixes and cleanup

  • Added Fictional Personography page.
  • Changed index_moeml to encyclopedia.xml, which makes more sense and fits with its menu item.
  • Updated index_site.xql so that links to the old URL still work, just in case.
  • Fixed a bug which was introducing an extra comma after nested analytic titles. I had not been paying enough attention when using the preceding:: axis, and had to change self:: to ancestor-or-self::.

09/08/12

Permalink 04:43:46 pm, by mholmes, 171 words, 32 views   English (CA)
Categories: Activity log; Mins. worked: 120

Fixes to personography rendering and implementation instance

The personography has been enhanced with codes for student contributors, and their page is now equipped with an auto-harvested table; CB will be checking the existing information from that page is all in the personography entries before deleting the old list.

The personography of historical figures has now been regenerated using the new system, at a new URL, and its table captions are now in boilerplate.xml; the table is now part of a static page which can be edited.

I investigated the use of <respons> instead of <occupation> for the person entries, but it has a required attribute @locus, which is supposed to specify precisely what part of the element concerned the person is responsible for. This is not what we need to do at all, so <occupation> is our only option, but I'll continue to raise this issue on TEI lists, because it appears there's no useful way to assign responsibility according to a formal scheme without using the inappropriate <occupation>.

Permalink 11:11:00 am, by mholmes, 230 words, 67 views   English (CA)
Categories: Activity log, Encoding Notes; Mins. worked: 60

Fixes to TRIU1 and MIDD17, and implementation of linking to fragments in other docs

As part of fixing the missing text in links in MIDD17, I tested and fixed the system for implementing links from one document to a fragment of another. This is basically how it works:

If you want to link from one document (say MIDD17) to another (say TRIU1), you can link to the whole document using mol:TRIU. However, sometimes you might want to link to a specific part of the TRIU1 document (which is a multi-text document). You can link to a specific <div> in the target document like this:

  • First, give the <div> in the target document its own @xml:id. For instance, we might want to link to the "Grocer's Company" section of the TRIU1 document, so we find the div that contains that section, and give it an @xml:id. Our convention is that an @xml:id for a section in a target document should be created with a prefix that consists of the main document id (TRIU1) and an underscore. So we do this: <div xml:id="TRIU1_grocers">.
  • In the document where you want to put the link, construct the link as follows: <ref target="mol:TRIU1#TRIU1_grocers">. In other words, mol: + the document id + # + the div id.

Links constructed like this are now working on the site (see MIDD17 links to TRIU1 for examples).

08/08/12

Permalink 04:24:19 pm, by mholmes, 52 words, 40 views   English (CA)
Categories: Activity log; Mins. worked: 15

Nested titles fixed

Monographic and journal titles need to be italicized, but when nested inside another of the same, they need to revert to roman type. That is now working.

Note to self: there are a lot of old CSS classes that we're no longer using, which could be cleared out to simplify the CSS.

07/08/12

Permalink 05:30:26 pm, by mholmes, 256 words, 43 views   English (CA)
Categories: Activity log; Mins. worked: 300

First day back: progress with personography and stubs

Edited the personography and associated code as follows:

  • The <persName>/@type "tech" is now obsolete (and should be removed from the schema -- note to self). All "tech" people are now "cont" (contributors).
  • Two taxonomies have been added to the header of the file, providing codes for various roles played by "cont" people. One is a short one, created by us, with some basic roles; another is a selection from the MarcRelator codes.
  • People who were "tech" are now "cont", with <occupation>/@code="#prog".
  • All other people who are "cont" should also be assigned occupation codes from the appropriate scheme.
  • The index_site.xql file can now display a table of people based on both the person type and the role (type and subtype respectively in the page params). This is probably pointless, because we will actually create sublists of people through XInclude in info pages, but it's useful for testing.

Changed the revisionDesc/@status values to remove "incomplete", and substitute "stub" and "empty" for the two types of incomplete pages. Suitable boilerplate is now added to pages with these two values from boilerplate.xml, and CB has updated all pages so their values are presumably correct. Some pages with other status values still have a stub message hard-coded into them, but CB will fix this.

The <term> and <mentioned> elements are now handled correctly (rendered as italics).

The Chrome bug (rendering <q> with straight instead of curly quotes) has been reported to the Chrome project.

Permalink 03:38:11 pm, by Cameron, 182 words, 48 views   English (CA)
Categories: Activity log; Mins. worked: 0

How To Type A Curly Apostrophe in oXygen

These instructions were originally posted by MH in Flow.

We're going to be moving over to using a curly apostrophe (Unicode character U+2019, which is the same as the right [closing] "smart" single quote). This means you'll need to be able to type this character in Oxygen when you need it. You can do this using a Code Template.

First, find that character in your Character Map, and copy it to the clipboard so you can paste it when you need it. Here it is, in case you want to copy it from here: ’

Click on Options / Preferences, then type "Code Templates" into the filter box at the top. You'll see a list of the existing Code Templates. Click on New, then fill in the following details:

Name: Apostrophe Description: Curly apostrophe (U+2019)
Associate with: XML Editor
Content: [Paste the apostrophe in here]

Then press OK.

Now you can do this to insert a curly apostrophe:

Press Control + Space. You should see the Code Template selector appear, and there is only one template, which is this one. Press OK to select it.

Permalink 03:33:36 pm, by Cameron, 62 words, 39 views   English (CA)
Categories: Activity log; Mins. worked: 0

How to Access MoEML's email account

These instructions are adapted from the instructions posted to Flow by JJ on 3 July 2012.

Netlink ID: london
Full address: london@uvic.ca
Obtain password from JJ

All of the emails sent to this account are automatically forwarded to mapoflondon@gmail.com. Obtain password from JJ and see documentation/emails.odt in the SVN repository for more information on using the Gmail account.

Permalink 03:25:21 pm, by Cameron, 36 words, 41 views   English (CA)
Categories: Activity log; Mins. worked: 0

How To Access MLA Handbook Online

These intructions were originally written by JJ and posted to Flow on July 4 2012.

Go to MLA website: mlahandbook.org
[Get user id and password from JJ.]

Search function works fairly well if you know MLA already.

Permalink 03:23:02 pm, by Cameron, 64 words, 41 views   English (CA)
Categories: Activity log; Mins. worked: 0

How To Check Our Web Stats

These instructions were created by JJ and were originally posted to Flow on July 4 2012

Go to http://webstats.uvic.ca/.

Username = jenstad
PW = [ask Janelle]

Once you are in, you can view sessions, page views, hits, referrals, domains, entrance pages, and exit pages ... and sort by date. Normally, you won't need to check these stats unless Janelle asks you to perform a specific task.

Permalink 02:57:38 pm, by Cameron, 111 words, 57 views   English (CA)
Categories: Activity log; Mins. worked: 0

How To Set Up Schematron To Do Validation

These instructions were created by MH and posted to Flow on July 6 2012.
Our requirements for validation have been getting more and more complicated, and I'm now using a fairly large Schematron schema which requires XPath 2.0 and foreign elements. This means that you will need to make a couple of changes to the Oxygen preferences so that Schematron functions correctly. If you don't do this, validation of XML files will fail. Here's what you do:
Click on Options / Preferences.
In the filter box at the top left, type XML Parser and press return. XML Parser [P] should be selected.
Under "Schematron / Schematron XPath Version", select "2.0".
Under "ISO Schematron", check "Allow foreign elements".

Map Of London

This project allows literary and scholarly works (primary and secondary) to be associated with locations in London, providing the reader with a richer understanding of the works.

Reports

August 2012
Sun Mon Tue Wed Thu Fri Sat
 << < Current> >>
      1 2 3 4
5 6 7 8 9 10 11
12 13 14 15 16 17 18
19 20 21 22 23 24 25
26 27 28 29 30 31  

XML Feeds