Archives for: 2011

22/12/11

Permalink 11:43:53 am, by mholmes, 160 words, 89 views   English (CA)
Categories: Activity log; Mins. worked: 40

Implemented columns for lists in Forest Nuptiale

This task has been outstanding for a while, but I've managed to solve it in a very simple way using CSS columns. The solution is specialized to lists at the moment, but it should work identically for any other element on which we want to implement it. Notes/limitations:

  • The implementation is triggered by the list having a child::cb, but it uses descendant::cb for counting the number of columns, on the assumption that some column breaks may occur within list items. It would fail to work if there were no <cb> that was a direct child of the <list>, though. If we were implementing a more general solution, we would need to figure out what level the columnar layout should be implemented on, and we'd probably have to require @type="columnar" or something like that on a block-level element, to trigger the appropriate CSS.
  • The CSS still has to have -moz- and -webkit- prefixes.
Permalink 10:48:49 am, by mholmes, 77 words, 93 views   English (CA)
Categories: Activity log; Mins. worked: 60

Page number linking bug fixed

Fixed this bug, which was caused by the XSLT not expecting that notes would be placed inside page number <fw> tags. In the process of fixing it, I realized that links from TOC page numbers to the pages concerned would not work in the continuous view because we're not showing page numbers in that view, so I added code to create empty anchors in the text; this allows links from TOCs to work as expected.

21/12/11

Permalink 03:38:03 pm, by mholmes, 30 words, 87 views   English (CA)
Categories: Activity log; Mins. worked: 120

Worked through some outstanding tasks

Four tasks have been cleared off the list -- two had already been fixed, and just needed checking, but others had to be implemented. Fixes detailed on the original posts.

20/12/11

Permalink 03:32:50 pm, by mholmes, 214 words, 77 views   English (CA)
Categories: Activity log; Mins. worked: 120

Notes versus choice/sic/corr

The cleanup of notes resulting from input by site reviewers last year resulted in many <note>s being reduced to a single word, which is often a simple correction for the word or phrase being annotated. Such cases are not actually <note>s at all; they're better represented by <choice>/<sic>/<corr>, in the same way as we already use <choice>/<orig>/<reg>. Noticing this, we also noticed that the <mentioned> tag has been rather misused in some cases, when either <quote>, <soCalled> or nothing at all would be more appropriate. I've also added a handler for <choice>/<sic>/<corr> which is parallel with the existing one for <choice>/<orig>/<reg>

I've worked through the Sonnet 1609 text to change all such <note>s to <choice>/<sic>/<corr>; some should be <choice>/<orig>/<reg>, but I can't determine that, so LSPW will make another pass through the file and fix those. This markup can then be used as a model where similar situations pertain in other files.

16/12/11

Permalink 04:06:35 pm, by lspwong, 148 words, 114 views   English (CA)
Categories: Academic; Mins. worked: 0

Le Blanc t.o.c. issues

EDIT: Bug fixed by MDH 2011-12-22.

I found the following problems with the Le Blanc table of contents that need to be fixed:

1) When you click on the following page number links, they don't jump to the corresponding page in the body of the text:

<ref type="pageNum" target="#p_1" rend="float: right;">page I</ref>
<ref type="pageNum" target="#p_564" rend="float: right;">564</ref>
<ref type="pageNum" target="#p_621" rend="float: right;"><choice><sic>611</sic><corr>621</corr></choice></ref>
<ref type="pageNum" target="#p_644" rend="float: right;">644</ref>

2) Page number 76 should be in line with the chapter heading in <rs> tag but it appears on the following line even though there is no <lb/>.

07/12/11

Permalink 03:28:44 pm, by lspwong, 24 words, 95 views   English (CA)
Categories: Academic; Mins. worked: 0

Fix page number alignment in Varin text

Discovered a problem with alignment of page numbers 54, 90, 132, 136 and 158 in the Varin text. The page numbers appear centred when they should be floating left.

24/11/11

Permalink 10:33:29 am, by mholmes, 81 words, 67 views   English (CA)
Categories: Activity log; Mins. worked: 20

Capturing abbreviations

Following much discussion of the way we handle abbreviations, we're now thinking that it would be desirable to compile some example images, one for each generic type of abbreviation, from our source pages, and link them into the abbreviation definitions. For the first stage in this, MDH should use XQuery to extract a list of all abbreviations marked up in the text, along with their page numbers, so we can look through and pick out the best examples in each case.

01/11/11

Permalink 03:33:13 pm, by lspwong, 102 words, 152 views   English (CA)
Categories: Tasks, Academic; Mins. worked: 5

"Maladies des femmes" t.o.c. issues

There's a problem with the display of the chapter heading "Le traictement de la femme nouuellement accouchee, & tout le temps de ſa couche" (c.50.p.845) in the Maladies des femmes table of contents. There is a break between "tout" and "le" but there shouldn't be one. EDIT by MDH: This appears to have solved itself, or to have been solved by a previous fix. There's no break I can see. It's possible to make lines break by setting very large text in a narrow browser window, but that's preferable to having the text disappear or intrude into another block of text.
Permalink 03:11:04 pm, by mholmes, 26 words, 91 views   English (CA)
Categories: Activity log; Mins. worked: 30

Fixed a page-number positioning bug

Fixed a bug with page number positioning by re-working some of the margins on the various types of <fw>. Changes documented in CSS file.

Permalink 02:38:19 pm, by mholmes, 13 words, 84 views   English (CA)
Categories: Activity log; Mins. worked: 40

Investigated and dismissed an outstanding bug

This bug proved to be a bug in Firefox so it's now closed.

Permalink 11:27:16 am, by mholmes, 88 words, 78 views   English (CA)
Categories: Activity log; Mins. worked: 20

Gaiji elements inside and outside abbreviations are now handled correctly

<g> elements were previously only handled when they occurred within <abbr> tags, but they do occur elsewhere, so I've now added a new handler which shows their character description as a mouseover popup wherever they show up. The original handler for <xsl:template match="choice[child::abbr[descendant::g] and child::expan]"> has also been tweaked so that it more clearly distinguishes between the expansion of the abbreviation, and the explanation of the one or more gaiji characters which appear in it.

Permalink 10:52:26 am, by mholmes, 192 words, 75 views   English (CA)
Categories: Activity log; Mins. worked: 60

Index now split into medical and nominal

The original References page has now been split into two indexes, "Noms propres" and "Terminologie médicale". These were the required changes:

  • LSPW split off the medical items into a separate file called botanical.xml, also in the references subcollection.
  • The XQuery now looks for a parameter called "index", and if it's set to "med", uses the botanical.xml file; if it's set to "nom" uses the regular references.xml file; and otherwise collects all the references. It includes the parameter in its XML output in the form of an @subtype attribute on the parent <list> element that contains the reference <item>s.
  • The XSLT checks the @subtype parameter and retrieves the appropriate page title and introductory text for the reference list.
  • The original Index menu item has been split into two.

There is currently no way to retrieve the full page with all references from a menu item or a link, although of course it's still there at its original location. Should we include a link on each of the reference pages enabling the user to see the complete merged list? If so, what should it say?

31/10/11

Permalink 09:17:22 am, by mholmes, 49 words, 51 views   English (CA)
Categories: Activity log; Mins. worked: 15

References to be split

On instructions from CC, references are to be split into two groups: botanical references and other references. I've created a new file, botanical.xml, which will hold the new references, and LSPW will separate them out. Then we'll need to decide on how to show them on the site.

18/10/11

Permalink 09:17:09 am, by lspwong, 25 words, 123 views   English (CA)
Categories: Tasks, Academic; Mins. worked: 0

<g> tags in the Maladies des femmes

Handler needs to be written for <g> tags that are not in context of an abbreviation. EDIT by MDH: This has been fixed.

12/10/11

Permalink 09:08:12 am, by lspwong, 80 words, 129 views   English (CA)
Categories: Tasks, Academic; Mins. worked: 0

"Margin-top/bottom: 0" not inherited in some paragraphs

Find out why paragraphs that begin with drop-caps don't inherit "margin-top: 0" and "margin-bottom: 0" from TEI header in the Maladies des femmes. EDIT by MDH: The simple answer to this is that I was explicitly excluding the paraStyle CSS in the case of a drop-cap paragraph because any text-indent setting screwed up the drop-cap. However, I've now done something more sophisticated, by stripping out the text-indent setting with a replace() using this regex:
(text-indent[\s]*:[\s]*[^;]*[;])|(text-indent[\s]*:[\s*][\d][^\s;]*$)

29/09/11

Permalink 10:22:57 am, by mholmes, 79 words, 60 views   English (CA)
Categories: Activity log; Mins. worked: 15

Centred headings

LSPW pointed out that headings in chapters which are nested in <div>s to the point where they become <h3> elements were not centred, whereas those which become <h2>s are centred by default. I added text-align: center to the h3 ruleset in the stylesheet, but then we discovered that reference items have <h3> headings, and those should be left-aligned, so I added another ruleset to set them back to left-aligned.

27/09/11

Permalink 02:16:49 pm, by mholmes, 67 words, 63 views   English (CA)
Categories: Activity log; Mins. worked: 60

Welcome to GM

GM has joined the team, and will be working 6 hours per week on the Ville-Thierry. Slots have been booked on Spartan, and he's had a general intro to the project; LSPW will direct him to the TEI Guidelines initially, and then show him the V-T text and get him started. The V-T has little or no CSS code in it yet, so it's straightforward transcription and markup.

21/09/11

Permalink 03:42:54 pm, by lspwong, 182 words, 128 views   English (CA)
Categories: Tasks; Mins. worked: 20

"Maladies des femmes" t.o.c. issues

Task for MH: Please find out why note tags are interfering with alignment of chapter and page numbers in table of contents in Maladies des femmes text. Chapter and page numbers should be floating right.

EDIT by MDH: This task seems to have been obviated by two things: First, most of the notes included in the TOC should actually be <choice>/<sic>/<corr> or <choice>/<orig>/<reg> blocks, so there will be a lot fewer note buttons in the TOCs in any case. Second, including a note inside a linked page reference is impractical, because it's not clear what clicking on it could or should do -- show the note, or go to the page reference? So on balance, it makes more sense to attach notes to the text preceding floating page numbers, if there's actually a need for a note, but when the note is simply (for instance) noting an incorrect page number in the original, we can just use <choice>/<sic>/<corr>.

16/09/11

Permalink 09:37:14 am, by lspwong, 36 words, 168 views   English (CA)
Categories: Tasks, Academic; Mins. worked: 0

Left-aligned page numbers to be fixed

[EDIT BY MDH: Fixed this in the CSS. Look out for unwanted side-effects.] Task for MH: Figure out why page numbers for even-numbered pages float far left for both Le Blanc and Maladies des Femmes texts.

01/09/11

Permalink 03:13:17 pm, by mholmes, 102 words, 88 views   English (CA)
Categories: Activity log; Mins. worked: 120

More work on Le Blanc

I've made substantial changes to the Le Blanc, adding many linebreaks, moving all right-floated labels further into the margin, and reducing the page size. There are still problems: one issue is that the <quote type="italics"> inline quotes often actually have a smaller font size than the surrounding text, but @rend attributes on <quote> are not carried through to the XHTML yet. That needs to be fixed in the XSLT, and then the relevant <quote> elements (there are 394 of them) need to be examined to see if they need the @rend to reduce their font size.

Permalink 12:56:53 pm, by mholmes, 143 words, 96 views   English (CA)
Categories: Activity log; Mins. worked: 240

All files using new schema; Le Blanc refactored

I've now switched all files over to using the new schema, and standardized them all on the xml-model processing instruction, replacing the oxygen and oasis-schema PIs wherever they occurred. I've also trimmed out a bunch of old unnecessary namespace declarations from root elements.

Following that, I've worked extensively on the Le Blanc, which had a disastrous document model; some chapters were identified as such and some not; chapters were embedded within chapters within other chapters; other divs were left hanging outside chapters; and the structure took no account at all of the volume's organization into four books. That's all now fixed, along with many typos or layout issues I came across while working on it, but it will need proofing again eventually. I'm going to spend a little more time going through LCC's list of issues while I'm focused on that text, though.

31/08/11

Permalink 04:49:56 pm, by mholmes, 219 words, 76 views   English (CA)
Categories: Activity log; Mins. worked: 360

Work on Le Blanc and new schema

I've done some major refactoring of Le Blanc, which is my pilot text for using <label> instead of <argument> for the marginal headings. That feature is working well, but I've had to do a huge amount of work to fix wildly proliferating <div>s in the original markup; they're still a bit of a mess, with chapter <div>s interspersed with other unidentified <div>s that have no apparent purpose, while some subsections in chapters are nested <div>s and others aren't. But a few more hours should sort it.

I've also generated a new schema, which includes <argument> in all the right places as before, but also includes <label> and adds the @type attribute to it (so we can distinguish <label>s used for marginal headings from those in e.g. TOCs.

In the process of this work, I discovered some aberrant behaviour in oXygen which threw me for a while. The old oasis-schema PI, linking to an RNG file, was not actually working, now that oXygen has switched to xml-model; instead, it was silently switching to validation with tei_all, which meant that many things would validate under the oasis-schema heading but fail to validate when xml-model was used.

26/08/11

Permalink 02:16:28 pm, by mholmes, 151 words, 156 views   English (CA)
Categories: Activity log, Tasks; Mins. worked: 30

Arguments inside paragraphs

EDIT by MDH 2011-12-22: I believe this task has been obviated by the proposed (and partially-implemented) change from using <argument> to using <label>. Setting this task to Completed.

Note to self: I thought I had made substantial amendments to the schema that would permit us to use <argument> inside e.g. <p>, because that's where our marginal arguments most often show up. However, in the Le Blanc text, that seems to be failing, so I need to revisit it. The text has been marked up in such a way that open ps and divs are being closed before and argument and opened again afterwards, and when I try to refactor appropriately, the code turns out not to be valid. Either the validation process is failing somehow, or the schema is not the most recent, or perhaps my modifications didn't go far enough.

Permalink 12:39:13 pm, by mholmes, 240 words, 76 views   English (CA)
Categories: Activity log; Mins. worked: 300

Le Blanc reveals more issues involving other documents

I've spent quite a while this morning fixing page sizing and margin issues with the Le Blanc; this arises out of the fact that (as far as I can tell) the original page dimensions were measured from the PDF page width rather than the width of the page-image sitting inside the PDF. I think I have that problem basically solved (I had to recalculate about 400 margin-right settings).

However, we have another transformational nesting issue which is exemplified 45 times in Le Blanc, but also shows up in three other texts in the collection: <cit>/<quote> elements cross page boundaries, and therefore contain <fw> and <pb>, but in this context, <quote type="italic"> was converted into an XHTML <i> tag. Initially, I thought this was another case where we needed to split out the contents before the main transformation, but I began to re-think, because there's such a range of related phenomena, including the use of <blockquote>.

I've decided to go for even more divs and spans, eschewing more specialized XHTML elements, because it's easier to get valid structures in highly complicated documents this way; I can then control the display with CSS, making <div>s inline and <span>s block where required. This makes for valid documents (most of them validate now), which ensures that the rest of the CSS gets applied correctly.

25/08/11

Permalink 03:30:39 pm, by mholmes, 96 words, 90 views   English (CA)
Categories: Activity log; Mins. worked: 45

Beginning work on Le Blanc

LCC left a detailed list of issues to address in the Le Blanc text, and I've started on the first one, which involves the markup and rendering of the title page. Some fixes will be required in XSLT, and also the markup needs a bit of reconfiguring. <titlePart> elements do not seem to be processed as expected; instead of getting <h2> and <h3> tags, they're getting block-display <span>s, and I can't yet figure out how that's happening. It means their rend attributes are not being passed through.

Permalink 01:48:18 pm, by mholmes, 48 words, 95 views   English (CA)
Categories: Activity log; Mins. worked: 45

Created documentation for marking up odd glyphs in abbreviations

Five of the engravings contain unusual glyphs used for abbreviation, and now the same has appeared in one of the regular texts, so I've documented the markup process for these glyphs, and passed the document on to LSPW so she can merge it into her growing markup documentation.

24/08/11

Permalink 05:34:55 pm, by mholmes, 29 words, 80 views   English (CA)
Categories: Activity log; Mins. worked: 60

le_bon_mariage.xml XHTML output now validates

Tracked down a bunch of bugs, which EGB fixed, and the output now validates, after coming through our two-stage transform. Yay! From 406 errors to zero in a few days.

Permalink 11:45:26 am, by mholmes, 291 words, 71 views   English (CA)
Categories: Activity log; Mins. worked: 75

Fix for non-item content embedded in lists

I've written an intermediary stylesheet to split out <list> elements containing <pb> and <fw> children into separate lists with the <pb>s and <fw>s outside them. This is processed prior to the main transformation, and has solved the problem for lists. I may revisit it for lg/l at some point. For the record, here's the template (the only non-pass-through template in an identity transform):

<!-- Split out interrupted lists into a sequence of lists with the interruptions
      between them. -->

  <xsl:template match="list[child::fw or child::pb]">
<!-- Stash a reference to the current list so we can easily refer to it in later nested code. -->
    <xsl:variable name="thisList" select="."/>
    <xsl:for-each-group select="child::item" group-adjacent="count(preceding-sibling::fw) + count(preceding-sibling::pb)">
        <list>
          <xsl:for-each select="$thisList/@*">
            <xsl:copy/>
          </xsl:for-each><xsl:text>
</xsl:text>
          <xsl:for-each select="current-group()">
            <xsl:copy-of select="."/><xsl:text>
</xsl:text>
          </xsl:for-each>
        </list><xsl:text>
</xsl:text>
      <xsl:variable name="lastItem" select="current-group()[position() = last()]"/>
          <xsl:for-each select="$lastItem/following-sibling::fw[preceding-sibling::item[1] = $lastItem] | $lastItem/following-sibling::pb[preceding-sibling::item[1] = $lastItem]">
            <xsl:copy-of select="."/><xsl:text>
</xsl:text>
          </xsl:for-each><xsl:text>
</xsl:text>
      </xsl:for-each-group>    
      
    
  </xsl:template>

22/08/11

Permalink 10:32:32 am, by mholmes, 12 words, 79 views   English (CA)
Categories: Activity log; Mins. worked: 10

Processed a thumbnail

Created a thumbnail image for the title page of Maladies des Femmes.

18/08/11

Permalink 08:57:07 am, by mholmes, 380 words, 132 views   English (CA)
Categories: Activity log, Tasks; Mins. worked: 90

Validation issues to tackle

EDIT by MDH 2011-12-22: I believe both of these issues are now fixed. I wrote a template to split lists and it's now in use, and I think LSPW has fixed the erroneous @target attributes. Setting this task to Completed.

Several XHTML validation issues have arisen recently. They fall into the following categories:

  • Unexpected elements inside lists. le_bon_mariage.xml has many instances of page breaks, and forme works which occur inside lists. The XSLT doesn't expect this, and outputs XHTML elements inside <ul> and <ol> which are of course invalid (only <li> is allowed).
  • Duplicate ids in reference links. Reference links inside the text are provided with unique ids so that you can navigate to them, and these ids are constructed from the target id of the reference itself, followed by an underscore and a number; numbers are provided in sequence by counting preceding references. This is going wrong somehow, resulting in more than one reference link with the same id attribute in the XHTML.

The first requires a policy decision (do we allow other elements than item inside of list?), followed by fixes to the XML and/or the XSLT to deal with the consequences of the policy. The second seems to be caused by typos in the @target attributes of ref elements in the references file; I've identified a set of these and asked LSPW to fix them. The same kind of typos also show up in anthology documents, so I've identified a stack of them using XQuery and sent them to LSPW too.

NOTE: With regard to the first issue, I wrote to the TEI list, and had a number of useful responses; I think the most appropriate was to keep the current markup (which is, after all, "correct"), but use a two-stage transformation to get the document to XHTML. In the first stage, any lists bisected by page breaks, forme works etc. would be split into two lists, allowing the second transform to proceed appropriately. This is a really good idea, and could also be used to handle bisected lgs, which are currently output as <ul>/<li> sets if they're not bisected, or <div>/<span> sets if they are.

16/08/11

Permalink 04:42:06 pm, by mholmes, 69 words, 91 views   English (CA)
Categories: Activity log; Mins. worked: 30

LSPW's markup documentation added to the SVN repo

Following a meeting discussing contracts, working hours and future plans, we added LSPW's word-processor file, in which she's documenting the markup practices of the project, to the repo, in a new directory called documentation. It looks pretty comprehensive and extremely useful. I might just store it in the exist db so it could be made available on the site (although a PDF version would be more appropriate for that).

11/08/11

Permalink 03:39:38 pm, by mholmes, 192 words, 67 views   English (CA)
Categories: Activity log; Mins. worked: 30

Another approach to original vs. modern transcriptions

I've always been unhappy with the way original and modern transcriptions in the engravings are stored in separate annotations and linked to separate zones. I think a better approach would be to combine the two within one annotation, inside special divs that mark them as alternates:

<div type="transcription_alternates">
  <div type="transcription_orig">
    <p>[...]</p>
  </div>
  <div type="transcription_modern">
    <p>[...]</p>
  </div>
</div>

The XSLT could then provide a single pop-up annotation window with a choice of two tabs at the top or the bottom, so you could switch between them. It could even remember your preference (the last tab you looked at) in a cookie.

It might be possible to automate the conversion of the existing code to this new approach using XSLT; you'd have to identify zones which were equivalent (mostly overlapping), and rewrite the contents of the Transcription category so it contains both, then delete the modernized ones. I think this would definitely be a cleaner approach to providing both transcriptions.

Permalink 03:29:50 pm, by mholmes, 134 words, 85 views   English (CA)
Categories: Activity log; Mins. worked: 60

More on regularizations

More research on the regularizations suggests that those which handle i/j and u/v variation are also unnecessary in the original transcription field. Using this XQuery, I was able to identify 573 choice elements which have 340 distinct variants:

xquery version "1.0";

declare default element namespace "http://www.tei-c.org/ns/1.0";

let $list :=
(
for $c in //choice[orig]
where translate($c/orig, 'sſuvUVijIJ', '') = translate($c/reg,  'sſuvUVijIJ', '')
return concat($c/orig, '  :  ', $c/reg)
),
$distList := distinct-values($list)
return 
(concat(count($list), ' items, of which there are ', count($distList), ' distinct values: '), 
$distList)

If CC agrees that all these are irrelevant, it would be easy to remove them. That would leave 247 other regularizations, which mainly seem to consist of supplying missing spacing and apostrophes. These could presumably be kept.

Permalink 03:20:27 pm, by mholmes, 65 words, 98 views   English (CA)
Categories: Academic; Mins. worked: 60

Fixes to image markup files

I discovered when validating documents that some had information missing from their headers (part of the Image Markup Tool header info, perhaps dating back to a version of the IMT that didn't handle that data properly under Wine on Linux). Fixed those, and also fixed some errors in il_cherche.xml, which had one annotation that was classed as an Objet but actually contained transcription.

Permalink 01:26:19 pm, by mholmes, 62 words, 75 views   English (CA)
Categories: Activity log; Mins. worked: 120

Latinate abbreviations in five engravings now marked up

I've completed the markup of all the latinate abbreviations identified by CC in the five engravings which contain them; I've also found what I think is one more instance of apostrophe-with-dot-below used for -(m)ent, which I've marked up without a description, waiting for word from CC.

I tweaked the title attribute so it begins with the expansion, for additional clarification.

Permalink 11:43:54 am, by mholmes, 177 words, 148 views   English (CA)
Categories: Activity log, Tasks; Mins. worked: 30

Normalizing the use of choice/orig/reg

Arising out of a discussion with CC: The intended use of <choice>/<orig>/<reg> in the original transcriptions of the engravings is to restore normal contemporary usage, while the modern transcription is a full modernization. However, there are instances of <choice>/<orig>/<reg> which are used to replace long s with short s (dating back to when we had no modernizations), so these are inconsistent with our policy, and we should get rid of them. I might be able to do this entirely mechanically -- in the case of any regularization where the only difference is between long and short s, I can simply replace the entire <choice> element with the contents of <orig>; while for every regularization where a long s exists in the <orig> but other things are also regularized in the <reg>, I can add an attribute we can use temporarily to find and manually fix them. This is now on my task list.

10/08/11

Permalink 02:01:20 pm, by mholmes, 211 words, 71 views   English (CA)
Categories: Activity log; Mins. worked: 150

Implemented gaiji markup plan

I've added the required tagging to billet_doux.xml, which has five odd abbreviations of two different types, and implemented the XSLT and CSS to render it as planned. I'm now waiting for textual descriptions in French from CC of the individual symbols, then I'll make the same changes to the other four files which contain the abbreviations.

In the process, I had to modify the schema, which I did using Roma 2, starting from the previous ODD file (mariage_2011_02.odd), and simply adding the gaiji module and a couple of other things. That seemed to work fine, except for the file forest_nuptiale.xml, which was showing validation errors on the @rend attribute when it appears in argument. This was weird, because exactly the same structures validated fine in other files. After some work comparing files, I discovered that the forest_nuptiale.xml file had this declaration:

<?oxygen RNGSchema="../../mariage.rng" type="xml"?>

while other files such as le_bon_mariage.xml had this:

<?oasis-schema href="../../mariage.rng" type="application/xml"?>

Changing to the oasis-schema declaration made the file valid for some reason. Weird, but worth noting in case it crops up again. Other files have combinations of both schema declarations. I should clean up those inconsistencies.

Permalink 09:32:14 am, by mholmes, 72 words, 88 views   English (CA)
Categories: Activity log; Mins. worked: 10

Minor bugfix

When adding XSLT processing for the <head> tag, I had neglected to suppress its processing inside <lg> during both the <lg> scenarios (one of which uses <ul>, while the other uses <div>s and <span>s because <fw> tags appear inside <lg>s). <head> is now rendered first, and suppressed later, in both scenarios.

09/08/11

Permalink 01:04:33 pm, by mholmes, 175 words, 110 views   English (CA)
Categories: Activity log, Tasks; Mins. worked: 60

Plan for abbreviations in engravings

The abbreviations deriving from Latin, listed in CC's post, will be handled like this:

  • Each distinct abbreviation will have a <glyph> element in the <charDecl> section of the file header, explaining it briefly.
  • Each instance of the abbreviation will be marked up in the original transcription following this pattern:
    <choice>
      <abbr>Vo<hi rend="vertical-align: super;"><g ref="#theGlyphTag">9</g></hi></abbr>
      <expan>Vous</expan>
    </choice>
    
  • The XSLT will display ONLY the rendered <abbr> element, and will give it a @title attribute for mouseover which is constructed from the information in the <glyph> element(s) referenced inside the <abbr>. This differs from a regular <choice>/<abbr>/<expan>, in which the <expan> is used as the @title attribute. The actual expansion will normally be available to readers in the modernized transcription, since these all occur in the engravings.

08/08/11

Permalink 04:15:29 pm, by mholmes, 247 words, 72 views   English (CA)
Categories: Activity log; Mins. worked: 150

Bug fixes and enhancements

The following updates and changes were made in response to problems and new feature requests, in consultation with LSPW:

  • A bug was fixed whereby the wrong document title was being used when showing the list of "Liens à cette référence dans les documents". I was using the first trunc title in the monogr, which was wrong when the document was an analytic.
  • A new value ("éloge") for text class has been added, and four documents use it, so we've added a new menu item for these documents to give them their own TOC. In the process, I noticed that the sort-by-type function was not using the TitleSortComparator, so it was sorting éloge after everything else; that's now been remedied.
  • There's a need to allow <head> inside <lg>, so I've added processing for that.
  • Various documents had become duplicated under different names in SVN and in the exist db, so we went through the process of cleaning that up.

Finally, we looked at the possibility of using OCR to handle the Tableau de l'Amour Conjugal. I installed the Tesseract back-end along with a sourceforge front-end called gImageReader, and also the Hunspell and MySpell French dictionaries, and we tried it out, using Simple Scan to get the image, the GIMP to clean it up a bit, and gImageReader to OCR it. The results aren't great, because the quality of the original document is not good, but it might be slightly quicker than straight transcription.

07/08/11

Permalink 04:49:24 pm, by ccarlin, 372 words, 97 views   English (CA)
Categories: Academic; Mins. worked: 0

Abbreviations in engravings

Alphonse Chassant, Dictionnaire des abréviations latines et françaises, explains the transition from Latin abbreviations used in manuscripts (from Antiquity through the Middle Ages) to print culture in the 16th century, and how versions of these Latin abbreviations slipped into French printed texts. The phenomenon of such abbreviation disappeared over the course of the 17th century, but one of its latest manifestations was on engravings, where there was at times little room for the desired text.

There are not really named glyphs for these abbreviations when used in French, but we can say that they follow the tradition used in Latin of contractions and ellipsis, for example; these are two the forms of abbreviation we commonly find in the engravings in our corpus. Building on that Latin tradition, Chassant shows specific examples that cover all of the instances listed below. The page number in his book is given in each case.

(N.B.: Chassant also shows how a horizontal line could replace an "m" or an "n" in Latin abbreviations, the only abbreviation that remained common in print in 17th-century France, usually marked by a tilde. There are six such examples in the "mariage" engravings, but I have not listed them as this abbreviation is not only common throughout the corpus, it also is qualitatively different from the others, replacing a letter rather than indicating a specific word.)

For the following abbreviations, I think the best strategy will be to include a note in the transcriptions, but I'll talk with MH about this before making any moves. (AC = Chassant. Full reference: Alphonse Chassant, Dictionnaire des abréviations latines et françaisesusitées dans les inscriptions lapidaires et métallique, les manuscrits et chartes, 2e édition, Paris, Auguste Aubry, 1862. Google Livres, 6 août 2011.)

0619645 Dame recevant un billet doux "nous" x 4 (AC, p. 132); "qu'il" (AC, p. 136)

72-B-59070 La femme qui fouette son mary "vous" x 11 (AC, p. xxxii--a form of the Latin abbreviation for "us"); "pour" (AC, p. 134); "votre" x 2 (AC, p. 139)

G-153297 Le massacre de Lustucru (Larmessin) "pour" x 2 (AC, p. 134); "vous" x 4 (AC, p. xxxii--a form of the Latin abbreviation for "us" = 9); "votre" (AC, p. 139)

G-157562 Bonheur des uns "que" (AC, p. 136)

P123-IFFNo.190 Opérateur céphalique "pour" (AC, p. 134)

26/07/11

Permalink 01:29:21 pm, by lauren, 585 words, 67 views   English (CA)
Categories: Academic; Mins. worked: 270

Final proofing tasks for Le blanc 07-26-2011

Today is my last day, so I am revising my comprehensive list of issues still needing to be worked out on Le blanc and creating an updated one. There is still a lot of editing to be done on this text, and due to its gargantuan size, the edits may take a while. I'm sad to leave it, though, since it's a great read, and I hope that the next person working on it enjoys it as much as I have.

  1. <titlePage>: I had used a “text-indent: 7em” on “& de quelques Sainctes, qui ont beaucoup” but the margin was very off. On 07-19-2011 I used a "text-indent: -1em" in the same place and it made it too far left. At this point, I tried taking out the "text-indent" altogether and made the "padding-left" a -0.5em instead of 0.5em. This also hasn't worked and it is currently 3:33, so I am handing this task over to mholmes, because lspwong and I have spent a good half-hour trying to fix it. :)
  2. There is also a strange space between two lines of the “Avant-propos” (“Dieu vangeur & ennemi” / “des blasphemes...”, as though I had put two or three line breaks between the two lines of text. lspwong checked my code and we're not sure why this strange line break keeps appearing.
  3. Chapter headings and section (of chapter) headings:
    All <head> tags are automatically left-aligning. lspwong looked at this and seems to think my coding is correct, so we're not sure why they aren't centering.
    - i. pgs. 1 to 429: font sizes and font styles (italicized or not) are inconsistent in <head> tags. There is also white space that appears before and after these <head> tags automatically, without me having to put in <lb/> tags. mholmes you'll notice that in instances where the .pdf has no space between a given header and the surrounding paragraphs, I have used a "margin-top: 0; margin-bottom: 0;" to compensate.
    - ii. pgs. 430 to end: font sizes and styles are fine. mholmes you'll notice that I've had to add in <lb/> tags in cases where the .pdf shows white space between the headers and the paragraphs.
  4. Each of the title pages corresponding to the different "livres" (Livre deuxieme, etc.) is also left-aligned, despite being wrapped in a <head> tag as well.
  5. p. 1: the “1.” should be to the left of the drop-cap “I”, as it is in the .pdf, and in the .xml, the drop-cap is to the left of the page number. This also occurs on pgs. 14 and 23.
  6. Please refer to my post from 07-04-2011 for outstanding “Table des chapitres” problems for mholmes to look at.
  7. In the arguments, I have sometimes used a [?] to signal that I was not able to read the text. I was planning on proofing all of my arguments (not just those with the [?]s) by entering the text into sites like Biblos and seeing what I came up with. I will not get a chance to do this before I go.
  8. References need to be done.
  9. Right-aligned arguments (appearing in the right margin of the text) were spaced too far away from the text. I tried changing the number of ems to get them to line up closer with the body, just as the left-aligned arguments do. Most of the adjustments look great now, but the ones on the very first few pages of the "Avant-propos" may still need a bit of tweaking.

20/07/11

Permalink 08:18:33 pm, by ccarlin, 127 words, 54 views   English (CA)
Categories: Academic; Mins. worked: 0

Categories in table of contents

To follow up on Leanna's post about naming of categories, "Eloges du mariage" would work for the side bar. Another issue are the categories in the table of contents under "Tous les documents". We can add "éloge, vers" as the counterpoint to "satire, vers" and "satire, prose," although this makes me a little uneasy because there is a type of poem, "l'épithalame," in praise of marriage, but it applies to celebrating a particular marriage -- so I guess "éloge, vers" works, and indeed it seems like the logical choice.

Thinking about the t of c also brings to my attention the designation "roman, fiction, prose"; I don't think we need both "roman" and "fiction"; "fiction" on its own should suffice. The novel is fiction by definition.

14/07/11

Permalink 10:17:39 am, by lspwong, 62 words, 84 views   English (CA)
Categories: Activity log, Tasks, Academic; Mins. worked: 10

Need to create new category for Blanchon's Stances du mariage

Transcription and annotation of Blanchon's Stances du mariage has been completed. The text is now published on the site, however, it appears under the category of "Satire" which is a problem because it is not a satire. CC suggests that a new category should be created, possibly "Éloges du mariage". We'll have to discuss this with MH when he returns from holiday.

07/07/11

Permalink 03:38:03 pm, by mholmes, 17 words, 1083 views   English (CA)
Categories: Activity log; Mins. worked: 60

Posting time...

...spent over the last few days on moving EG-B to Spartan, timesheets, and discussing various markup issues.

Permalink 03:06:43 pm, by lspwong, 211 words, 54 views   English (CA)
Categories: Academic; Mins. worked: 0

Espines du Mariage complete; Blanchon's "Stances du Mariage" next

I have more or less finished annotating the Espines du mariage. I'm just waiting for CC to get back to a few of my e-mails before I can add the final changes and references. One particular interesting issue that needs to be addressed is the display of abbreviated latin characters. In the Varin text, there are 2 instances, "Vereque" (p. 120) and "Horaque" (p.139), in which the "que" ending is abbreviated by what looks like the letter "q" with an acute accent and a subscript "3". MH suggests that we should just ignore the abbreviation and transcribe the "que" ending in both words. We're running this by CC first to get her opinion on the matter.

In waiting for CC's reply, I've begun transcribing and encoding a short poem called "Stances du Mariage par antithèse à celles de Ph. Des Portes" by Joachim Blanchon. I'll also be doing the same for a few other short poems that belong to the same anthology.

Another project I plan to work on is reviewing previous texts that I have annotated (sonnet_1609.xml, sonnet_1621.xml, forest_nuptiale.xml) as I have realized, after working on the Varin text, that I failed to write notes/references that identified the source and author of all foreign citations in those texts.

05/07/11

Permalink 11:32:59 am, by lauren, 628 words, 86 views   English (CA)
Categories: Academic; Mins. worked: 120

Le blanc proofing tasks 07-04-2011

  1. Spacing/alignment issues:
    a. Title page: I used a “text-indent: 7em” on “& de quelques Sainctes, qui ont beaucoup” and then put a line break before the rest of the sentence, which I had assumed would be aligned with “& de quelques Sainctes” because it, too, is wrapped in the <hi rend=”text-indent: 7em”> tag, but the margin is very off.

    b.There is also a strange space between two lines of the “Avant-propos” (“Dieu vangeur & ennemi” / “des blasphemes...”, as though I had put two or three line breaks between the two lines of text.

    c. p. 554, the header “La direction …” is still left-aligned, despite my use of “text-align: center; margin-left: auto; margin-right: auto;” after the <head>. I did the same thing for page 1 when its heading was left-aligned and it appears to have worked because on page 1, the header is now centered appropriately. I have checked the code and once again, I have no idea why there is this discrepancy.
  2. “Table des chapitres” problems for mholmes to look at:
    a. The “Table des chapitres” heading needs to be centered. Right now it’s showing up as being left-aligned, which is strange because I thought that any time a <head> tag is used, the text is automatically centered. I’ve added in some additional css (“text-align: center; margin-left: auto; margin-right: auto;”) to see if that will fix the problem.

    b. There is also a double space between the “Chapitre (x)” <label> and the chapter description in the <rs>, and once again I’m not entirely sure why. I’m wondering if the <item><label> mark-up automatically formats the heading as being one line, so the <lb> I added after it would then act as a double-space. I’ve tested this theory by deleting the line break after “Chapitre I” of the “Livre premier”.

    c. There are lots of little text indents in the chapter descriptions where in the .pdf, the text is indented. Although I have added in what I thought were the appropriate <hi rend=”text-indent: (x)em;”> tags, the indents are not showing up on the site at the moment. I’m wondering if this has something to do with the <rs> tag that’s wrapping the descriptions...

    d. The page numbers corresponding to each chapter need to appear on the same line as the last line of the chapter description. For some reason, the “page 1” for “Chapitre 1” for the “Livre premier” is formatted correctly; that is, it is on the same line (but right-aligned) as the last line of the chapter description (“vie, & tres-agreable à Dieu”), but none of the others are. I have checked my code and I cannot find a reason for this discrepancy.

    e. The “Chapitre” headings in “Livre III” and “Livre IV” need to be left-aligned and on the same line as the chapter summary. I’m wondering if my hypothesis is correct that the <item><label> element combination automatically puts a line break before the <rs>.

    f. The text wrapped in the <list type=”toc”><head> for "Livres Deuxiesme", III and IV and the summary beneath them (i.e. “La loüange des femmes vertueuses”) is not currently showing up. While the “Chapitre premier” is showing up on the website, the summary underneath it seems to have disappeared as well, although I have coded all summaries and chapter headings the same way in the .xml.

    g. Some of the text appears to be in bold, and I cannot find a reason for this. It seems to start on the second page of the “Table des chapitres” after “Chapitre[e] 1”.

28/06/11

Permalink 04:59:06 pm, by mholmes, 102 words, 64 views   English (CA)
Categories: Activity log; Mins. worked: 120

Working on text output for text analysis

I've started work on a text-output format suitable for text-analysis work. The first stage is normalizing spacing and re-connecting hyphenated words, which I seem to have working OK, although I need to do some more testing. Next we need to look at what normalization strategies we're going to undertake, from the obvious (replacing long s) to the less obvious (such as correcting all variant spellings). I've started learning a little about Mallet, and I've run some of their sample data, but it's obvious that I'm going to have to do a lot of learning before I can make worthwhile use of it.

15/06/11

Permalink 09:21:44 am, by mholmes, 96 words, 54 views   English (CA)
Categories: Activity log; Mins. worked: 20

Quotes and cits -- need to do a review and standardize

An issue arose today due to a collision of XSLT templates which are handling <cit> and <quote> elements. Since our practice has evolved rather than been planned, we've ended up with a complex range of different encoding strategies, involving the use of @type on both cit and quote. As it stands, everything seems to work, but we really should take a close look at the relevant templates in teiGeneral.xsl, as well as the encoding practice, so that we can at least document where and when the various strategies are being used.

01/06/11

Permalink 02:45:14 pm, by mholmes, 18 words, 75 views   English (CA)
Categories: Activity log; Mins. worked: 20

Replacement of abbrévié with abrégé

28 instances in two documents, done on instructions from the French team. Checked them all before making the change.

30/05/11

Permalink 03:50:03 pm, by mholmes, 2 words, 65 views   English (CA)
Categories: Activity log; Mins. worked: 20

Fixed a couple of typos I noticed in the references.xml file

Minor issues.

27/05/11

Permalink 11:07:01 am, by mholmes, 40 words, 63 views   English (CA)
Categories: Activity log; Mins. worked: 60

Alphabet list for references page now complete

The references list is now divided into alphabetical groups by first letter, and there is a list of the letters at the top of the page which you can click on to jump to the appropriate place in the file.

19/05/11

Permalink 10:32:06 am, by lspwong, 367 words, 89 views   English (CA)
Categories: Activity log; Mins. worked: 60

Issues with block quotes

EDIT: SEE COMMENTS BY MH BELOW.

Working on the varin.xml, I discovered a problem when trying to use <cit type="blockquote"><quote type="noMarks">. Although the document validated, it caused an error when trying to view the XSLT. The error was related to XML lines 3218 to 3226 (/TEI/text[1]/body[1]/div[35]/p[4]/cit[1]/quote[1]). I tried to fix the error by changing <cit type="blockquote" rend="margin: 0"><quote type="noMarks"><foreign xml:lang="la"> to simply <cit type="blockquote"><quote><foreign xml:lang="la">, however, this displayed "guillemets" around the quoted text. I ended up having to use <cit type="blockquote" rend="margin: 0"><quote type="italics"><foreign xml:lang="la"> to fix the error and make the text display properly without "guillemets" and only in italics.

MH will probably need to do some more tweaking of block quote handlers.

Investigation by MH

I think this issue arises out of a basic misunderstanding. The handling for blockquotes depends on <cit type="block">, NOT type="blockquote". If you use <cit type="block">, then no quotes (guillemets) will be applied anyway; by default, blockquotes do not have quotation marks. In fact, the @type attribute on the <quote> element is ignored in the case of <cit type="block">.

So if the objective is to have an italicized blockquote, with the italicization due to its being in a foreign language, you can do this:

<cit type="block" rend="margin: 0; margin-top: 0; margin-bottom: 0;"><quote><foreign xml:lang="la">
    I’ay pour le dot beaucoup d’argent receu,<lb/>
    Et mon pouuoir ce faiſant i’ay vendu.</foreign></quote></cit>

There are 24 instances of the erroneous cit type="blockquote" in the following documents: forest_nuptiale, sonnet_1609, and varin. There are 14 instances of the correct formulation, <cit type="block">, which occur in sonnet_1609 and varin. I think we need to look at all the erroneous ones and fix them, making any other adjustments required to make them display properly.

05/05/11

Permalink 11:40:31 am, by mholmes, 37 words, 55 views   English (CA)
Categories: Academic; Mins. worked: 5

New feature request: alphabet links at the top of the index page

We want A through Z links at the top of the Index page, and anchors at the beginning of each letter segment in the page. This would be useful because searching is hard due to spelling anomalies.

02/05/11

Permalink 03:58:56 pm, by mholmes, 34 words, 62 views   English (CA)
Categories: Activity log; Mins. worked: 40

Automated the addition of required xml:ids

I found a way to automate the addition of @xml:id attributes where these are required because of linking from the TOC, so that's been done in Le Bon Mariage, Le Blanc and Ville-Thierry.

Permalink 03:28:07 pm, by mholmes, 162 words, 72 views   English (CA)
Categories: Activity log; Mins. worked: 120

Work on TOC rendering

Started working on rendering of TOC code, as described here. These are the changes I've made:

  • I made substantial edits to Le Bon Mariage, Ville-Thierry and Le Blanc to fix minor inconsistencies and missing attributes.
  • I added a special handler for <ref type="pageNum"> which processes a TOC page number into a link to the <fw> containing that page number. Note, though, that none of the <fw> tags in question actually has an @xml:id attribute yet, with the exception of one that I added to Le Bon Mariage for testing. I'll probably try to automate the addition of those attributes.
  • I added an appropriate CSS ruleset for the page numbers.

Basically everything seems to be working, although the page width of the Ville-Thierry is still wrong; I need to work on that a little. But someone else will take over the markup of that document at some point, so it could be left to them.

26/04/11

Permalink 02:06:16 pm, by mholmes, 109 words, 63 views   English (CA)
Categories: Activity log; Mins. worked: 60

Site changes based on advice from France

Made the following changes to some site content based on advice from the French team:

  • "Allusions/Notes" is now simply "Index".
  • The default style is "Pelouse".
  • Superscripts in editorial introductions have been reconfigured using CSS so that they no longer have any impact on line-spacing. "vertical-align: superscript;" was replaced with "position: relative; bottom: 0.8ex;". I have not made this change in any anthology documents, although we may do that if it proves worthwhile. There are 31 instances of the use of "vertical-align: superscript" in the text_original collection.
  • Mentions of genres on the home page have been linked to their respective TOC pages.

Other changes are pending after some discussion.

20/04/11

Permalink 01:53:38 pm, by mholmes, 104 words, 63 views   English (CA)
Categories: Activity log; Mins. worked: 60

Versioning, SVN and history

LSPW reported that an entry in the references file was missing (Chypre). We checked back through SVN, and confirmed that it had never been present during the time we've been using SVN, but I found an old version of the references file from Mariage version 4, and we retrieved it from there.

Meanwhile, there were conflicts with the le_bon_mariage.xml file, arising I think when EGB forgot to do an svn update before starting work, after I had edited the file on the 14th. I think I've resolved these, but we'll know for sure tomorrow when she comes in and does an update.

19/04/11

Permalink 01:36:10 pm, by mholmes, 64 words, 70 views   English (CA)
Categories: Activity log; Mins. worked: 15

Fix to XSLT issue

My previous changes caused a change to the way paragraphs were displayed in the popup references. I've now refined the XSLT a little so that paragraphs in references no longer get the host document's default paragraph styles added to them (as they shouldn't, because they're editorial content and should be consistent across the collection, rather than varying with the style of their host documents).

18/04/11

Permalink 12:29:41 pm, by lspwong, 70 words, 56 views   English (CA)
Categories: Academic; Mins. worked: 0

"A.C.N./P.C.N." changed back to "av./ap. J.-C."

CC and I decided to revert to our old dating system using "av./ap. J.-C." instead of the more modern dating system "A.C.N./P.C.N." in our references because it is more widely used. I have made these changes to all applicable references in the references.xml file. All dates written in references from now on will also use the "av./ap. J.-C." dating system.

14/04/11

Permalink 04:29:35 pm, by mholmes, 193 words, 72 views   English (CA)
Categories: Activity log; Mins. worked: 120

Fixes to CSS and XSLT

Fixed some bugs in CSS and XSLT shown up by Le Bon Mariage:

  • Unordered lists had no default text-align, so they were inheriting it from (eg) a parent text front div, meaning they'd end up centred. It's almost always the case that they should be left-aligned, so I made that the default in the CSS.
  • Some text paragraphs (those with no @rend attribute) were failing to inherit the default paragraph style from the <teiHeader>; this turned out to be a bug in the XSLT, which I fixed.
  • Marginal (authorial) notes (<note type="marginal">) were being treated as editorial notes, and turned into buttons with popups, so I suppressed that (and see below).

Also added handling for the marginal notes, grouping them in with the arguments, because that's the way to handle them IMHO; in some cases too, judging by EGB's markup, it's hard to tell the difference between them. So the argument markup now handles editorial marginal notes.

In the process, I fixed a few typos in the Bon Mariage XML file too.

Note to self: TOCs are still not being handled at all. That's a must-do sometime soon.

04/04/11

Permalink 11:13:01 am, by mholmes, 16 words, 50 views   English (CA)
Categories: Activity log; Mins. worked: 15

Updates to the team page

Added in the new members of the team and more details about tasks, on CC's instructions.

24/03/11

Permalink 05:21:01 pm, by mholmes, 14 words, 59 views   English (CA)
Categories: Activity log; Mins. worked: 60

Meeting w/ CC

Discussion of plans for the summer, finances, text-analysis, the IMT 2.0, and future conference presentations.

22/03/11

Permalink 03:33:07 pm, by mholmes, 141 words, 75 views   English (CA)
Categories: Activity log; Mins. worked: 20

Marginal notes in Le Bon Mariage

Le Bon Mariage has marginal notes which are linked by superscript letters in the text. This is our plan for dealing with them:

  • The footnote letter in the text should be marked up like this:
    <hi rend="vertical-align: super;">a</hi>
    
  • The marginal note should immediately follow it (assuming the note content appears level with it in the margin), and have a special @type attribute, like this:
    <note type="marginal" rend="float: right; width: 5em; [...etc.]"><ab>[content of the note]</ab></note>
    
    Note that the content appears inside an <ab> element, as in the case of a marginal <argument>.
  • I'll have to write XSLT to distinguish between this and editorial types of note, and render this one just like marginal <argument>s.

17/03/11

Permalink 03:59:42 pm, by mholmes, 169 words, 65 views   English (CA)
Categories: Activity log; Mins. worked: 240

Documentation of CSS

Spent most of the day on a document intended to be a gentle introduction as well as a serious attempt to document our practice with regard to the use of CSS code in our markup. This took a while but was really overdue. It will serve as the intro for EGB and GBS as they start adding CSS to their documents, so any errors or omissions will be corrected pretty quickly.

One of the tools I used to get a good overview of what we're doing was to run this against Mariage documents in oXygen:

distinct-values(tokenize(string-join(distinct-values(//@rend), ';'), ';'))

to get a set of all the unique property-value pairs. This found a couple of errors, of course, which we've fixed. It would be a good idea to refine this into an XSLT stylesheet, perhaps, which could be run against the whole collection, and bring back a list of all the unique property names, and for each name, a list of the values we've used.

28/02/11

Permalink 08:47:54 am, by mholmes, 2634 words, 78 views   English (CA)
Categories: Activity log; Mins. worked: 40

Survey of db images

Four new images have been marked up by CC and entered into the db; five are waiting to be worked on. This is a quick survey of the images already in the db, which will perhaps help us to identify those which have only been partially annotated so far:



Total images in db: 66



Présage malheureux (agresseure.xml):

30 annotations: 

 5  Transcription de texte

 0  Commentaire sur la transcription

 5  Commentaire sur les gestes

 1  Liens aux textes

 2  Graveur ou bibliothèque

 8  Objets

 2  Créatures

 2  Personnages

 5  Transcription en français moderne

 



Ambition de la femme (ambition_de_la_femme.xml):

19 annotations: 

 2  Transcription de texte

 0  Commentaire sur la transcription

 3  Commentaire sur les gestes

 0  Liens aux textes

 4  Graveur ou bibliothèque

 4  Objets

 1  Créatures

 3  Personnages

 2  Transcription en français moderne

 



L’âne et les femmes (ane_des_femmes.xml):

14 annotations: 

 3  Transcription de texte

 1  Commentaire sur la transcription

 1  Commentaire sur les gestes

 0  Liens aux textes

 2  Graveur ou bibliothèque

 0  Objets

 1  Créatures

 3  Personnages

 3  Transcription en français moderne

 



Aujourd’huy d’une façon, demain de l’autre (aujourdhuy_dune_facon.xml):

75 annotations: 

 31  Transcription de texte

 0  Commentaire sur la transcription

 0  Commentaire sur les gestes

 0  Liens aux textes

 3  Graveur ou bibliothèque

 4  Objets

 3  Créatures

 3  Personnages

 31  Transcription en français moderne

 



Argent fait tout (balance.xml):

34 annotations: 

 9  Transcription de texte

 3  Commentaire sur les gestes

 2  Liens aux textes

 2  Graveur ou bibliothèque

 0  Commentaire sur la transcription

 5  Objets

 0  Créatures

 7  Personnages

 6  Transcription en français moderne

 



La besace (besace.xml):

21 annotations: 

 5  Transcription de texte

 0  Commentaire sur la transcription

 2  Commentaire sur les gestes

 0  Liens aux textes

 2  Graveur ou bibliothèque

 2  Objets

 0  Créatures

 5  Personnages

 5  Transcription en français moderne

 



Dame recevant un billet doux (billet_doux.xml):

10 annotations: 

 2  Transcription de texte

 0  Commentaire sur la transcription

 0  Liens aux textes

 0  Commentaire sur les gestes

 1  Graveur ou bibliothèque

 3  Objets

 0  Créatures

 2  Personnages

 2  Transcription en français moderne

 



Blaise. Femme criarde mari patient (blaise_femme_criarde.xml):

23 annotations: 

 8  Transcription de texte

 0  Commentaire sur la transcription

 0  Commentaire sur les gestes

 0  Liens aux textes

 2  Graveur ou bibliothèque

 2  Objets

 0  Créatures

 3  Personnages

 8  Transcription en français moderne

 



Bonheur des uns (bonheur_des_uns.xml):

28 annotations: 

 8  Transcription de texte

 0  Commentaire sur la transcription

 2  Commentaire sur les gestes

 1  Liens aux textes

 3  Graveur ou bibliothèque

 2  Objets

 1  Créatures

 3  Personnages

 8  Transcription en français moderne

 



Ce pauvre savatier pour contenter sa femme (ce_pauvre_savetier.xml):

12 annotations: 

 1  Transcription de texte

 2  Commentaire sur la transcription

 0  Commentaire sur les gestes

 0  Liens aux textes

 0  Graveur ou bibliothèque

 5  Objets

 0  Créatures

 3  Personnages

 1  Transcription en français moderne

 



Chacun a le sien (Le fardeau du menage) (chacun_a_le_sien.xml):

39 annotations: 

 13  Transcription de texte

 3  Commentaire sur la transcription

 0  Liens aux textes

 0  Commentaire sur les gestes

 2  Graveur ou bibliothèque

 3  Objets

 0  Créatures

 6  Personnages

 12  Transcription en français moderne

 



Elle laisse aller le chat au fromage (chat.xml):

28 annotations: 

 7  Transcription de texte

 3  Commentaire sur la transcription

 1  Liens aux textes

 3  Commentaire sur les gestes

 1  Graveur ou bibliothèque

 2  Objets

 2  Créatures

 2  Personnages

 7  Transcription en français moderne

 



Cocus que voſtre joye eſt de courte durée (cocus_que_vostre_joie.xml):

13 annotations: 

 2  Transcription de texte

 0  Commentaire sur la transcription

 2  Commentaire sur les gestes

 0  Liens aux textes

 1  Graveur ou bibliothèque

 4  Objets

 0  Créatures

 2  Personnages

 2  Transcription en français moderne

 



A la corne jenins (corne_jenins.xml):

14 annotations: 

 3  Transcription de texte

 0  Commentaire sur la transcription

 3  Commentaire sur les gestes

 1  Liens aux textes

 1  Graveur ou bibliothèque

 2  Objets

 0  Créatures

 1  Personnages

 3  Transcription en français moderne

 



Corrige si tu peux (corrige_si_tu_peux.xml):

21 annotations: 

 3  Transcription de texte

 1  Commentaire sur la transcription

 2  Liens aux textes

 2  Graveur ou bibliothèque

 3  Commentaire sur les gestes

 3  Objets

 0  Créatures

 4  Personnages

 3  Transcription en français moderne

 



Dame Florence, dame Perrine, dame Jacqueline... (dame_florence.xml):

35 annotations: 

 8  Transcription de texte

 0  Commentaire sur la transcription

 4  Commentaire sur les gestes

 2  Liens aux textes

 1  Graveur ou bibliothèque

 10  Objets

 1  Créatures

 1  Personnages

 8  Transcription en français moderne

 



Le débat pour la culotte (debat_pour_la_culotte.xml):

12 annotations: 

 2  Transcription de texte

 0  Commentaire sur la transcription

 0  Commentaire sur les gestes

 1  Liens aux textes

 3  Graveur ou bibliothèque

 2  Objets

 0  Créatures

 2  Personnages

 2  Transcription en français moderne

 



Dialogue de Dame Alison et de Lubin son mary (dialogue_de_dame.xml):

42 annotations: 

 7  Transcription de texte

 1  Commentaire sur la transcription

 11  Commentaire sur les gestes

 0  Liens aux textes

 2  Graveur ou bibliothèque

 8  Objets

 0  Créatures

 6  Personnages

 7  Transcription en français moderne

 



Le fardeau des pères (fardeau_des_peres.xml):

27 annotations: 

 9  Transcription de texte

 0  Commentaire sur la transcription

 3  Commentaire sur les gestes

 0  Liens aux textes

 2  Graveur ou bibliothèque

 1  Objets

 0  Créatures

 3  Personnages

 9  Transcription en français moderne

 



Femme à plaindre, mary à corriger (femme_a_plaindre.xml):

28 annotations: 

 6  Transcription de texte

 0  Commentaire sur la transcription

 1  Commentaire sur les gestes

 0  Liens aux textes

 3  Graveur ou bibliothèque

 8  Objets

 0  Créatures

 4  Personnages

 6  Transcription en français moderne

 



La femme de mesnage (femme_de_mesnage.xml):

22 annotations: 

 2  Transcription de texte

 0  Commentaire sur la transcription

 0  Commentaire sur les gestes

 0  Liens aux textes

 1  Graveur ou bibliothèque

 15  Objets

 2  Créatures

 0  Personnages

 2  Transcription en français moderne

 



La femme d’esprit (femme_desprit.xml):

21 annotations: 

 4  Transcription de texte

 1  Commentaire sur la transcription

 2  Commentaire sur les gestes

 0  Liens aux textes

 2  Graveur ou bibliothèque

 3  Objets

 0  Créatures

 5  Personnages

 4  Transcription en français moderne

 



La femme en menage (femme_en_menage.xml):

30 annotations: 

 8  Transcription de texte

 2  Commentaire sur la transcription

 1  Commentaire sur les gestes

 0  Liens aux textes

 3  Graveur ou bibliothèque

 4  Objets

 0  Créatures

 5  Personnages

 7  Transcription en français moderne

 



Femmes à table en l’absence de leurs maris (femmes_a_table.xml):

12 annotations: 

 1  Transcription de texte

 0  Commentaire sur la transcription

 0  Commentaire sur les gestes

 0  Liens aux textes

 3  Graveur ou bibliothèque

 3  Objets

 2  Créatures

 2  Personnages

 1  Transcription en français moderne

 



L'homme de mesnage (homme_de_mesnage.xml):

17 annotations: 

 2  Transcription de texte

 0  Commentaire sur la transcription

 0  Commentaire sur les gestes

 0  Liens aux textes

 1  Graveur ou bibliothèque

 12  Objets

 0  Créatures

 0  Personnages

 2  Transcription en français moderne

 



I’ay les crochets de mon mary (iai_les_crochets.xml):

14 annotations: 

 1  Transcription de texte

 0  Commentaire sur la transcription

 1  Commentaire sur les gestes

 0  Liens aux textes

 3  Graveur ou bibliothèque

 7  Objets

 0  Créatures

 1  Personnages

 1  Transcription en français moderne

 



Il cherche ce qu'il ne voudroit pas trouver (il_cherche.xml):

19 annotations: 

 4  Transcription de texte

 1  Liens aux textes

 2  Graveur ou bibliothèque

 0  Commentaire sur la transcription

 2  Commentaire sur les gestes

 3  Objets

 0  Créatures

 3  Personnages

 4  Transcription en français moderne

 



Il est bien sage, il met de l’eau dans son vin (il_est_bien_sage.xml):

28 annotations: 

 8  Transcription de texte

 0  Commentaire sur la transcription

 3  Commentaire sur les gestes

 0  Liens aux textes

 2  Graveur ou bibliothèque

 3  Objets

 1  Créatures

 3  Personnages

 8  Transcription en français moderne

 



Il n’est pas maistre en sa maison (il_nest_pas_maistre.xml):

13 annotations: 

 1  Transcription de texte

 0  Commentaire sur la transcription

 3  Commentaire sur les gestes

 0  Liens aux textes

 1  Graveur ou bibliothèque

 4  Objets

 0  Créatures

 3  Personnages

 1  Transcription en français moderne

 



L'invention des Femmes (1) (invention_femmes_1.xml):

20 annotations: 

 4  Transcription de texte

 0  Commentaire sur la transcription

 0  Commentaire sur les gestes

 4  Liens aux textes

 2  Graveur ou bibliothèque

 2  Objets

 0  Créatures

 4  Personnages

 4  Transcription en français moderne

 



L'invention des femmes (2) (invention_femmes_2.xml):

28 annotations: 

 6  Transcription de texte

 0  Commentaire sur la transcription

 4  Liens aux textes

 0  Commentaire sur les gestes

 1  Graveur ou bibliothèque

 3  Objets

 1  Créatures

 7  Personnages

 6  Transcription en français moderne

 



La femme battant son mari (la_femme_battant.xml):

28 annotations: 

 1  Transcription de texte

 1  Commentaire sur la transcription

 4  Commentaire sur les gestes

 2  Liens aux textes

 4  Graveur ou bibliothèque

 8  Objets

 2  Créatures

 5  Personnages

 1  Transcription en français moderne

 



La femme qui fouette son mary (la_femme_qui_fouette.xml):

28 annotations: 

 5  Transcription de texte

 0  Commentaire sur la transcription

 0  Commentaire sur les gestes

 0  Liens aux textes

 1  Graveur ou bibliothèque

 8  Objets

 0  Créatures

 9  Personnages

 5  Transcription en français moderne

 



La grande destruction de Lustucru par les femmes fortes (la_grande_destruction.xml):

26 annotations: 

 2  Transcription de texte

 0  Commentaire sur la transcription

 0  Commentaire sur les gestes

 1  Liens aux textes

 2  Graveur ou bibliothèque

 13  Objets

 0  Créatures

 6  Personnages

 2  Transcription en français moderne

 



La rasine d’olà nouvellement inventé pour purger les hommes de leur ivrognerie (la_rasine.xml):

33 annotations: 

 8  Transcription de texte

 0  Commentaire sur la transcription

 0  Commentaire sur les gestes

 0  Liens aux textes

 1  Graveur ou bibliothèque

 7  Objets

 1  Créatures

 8  Personnages

 8  Transcription en français moderne

 



Le bon menage (le_bon_menage.xml):

34 annotations: 

 11  Transcription de texte

 1  Commentaire sur la transcription

 1  Liens aux textes

 2  Commentaire sur les gestes

 2  Graveur ou bibliothèque

 3  Objets

 1  Créatures

 2  Personnages

 11  Transcription en français moderne

 



Le fournox de Jean Tangous (le_fournaux.xml):

42 annotations: 

 16  Transcription de texte

 0  Commentaire sur la transcription

 0  Commentaire sur les gestes

 0  Liens aux textes

 1  Graveur ou bibliothèque

 9  Objets

 0  Créatures

 0  Personnages

 16  Transcription en français moderne

 



Le mari complaisant (le_mari_compliasant.xml):

16 annotations: 

 1  Transcription de texte

 0  Commentaire sur la transcription

 2  Commentaire sur les gestes

 0  Liens aux textes

 2  Graveur ou bibliothèque

 7  Objets

 0  Créatures

 3  Personnages

 1  Transcription en français moderne

 



Le mari impuissant (le_mari_impuissant.xml):

27 annotations: 

 8  Transcription de texte

 0  Commentaire sur la transcription

 1  Commentaire sur les gestes

 0  Liens aux textes

 2  Graveur ou bibliothèque

 4  Objets

 0  Créatures

 4  Personnages

 8  Transcription en français moderne

 



Le pauvre badin (le_pauvre_badin.xml):

13 annotations: 

 2  Transcription de texte

 0  Commentaire sur la transcription

 0  Commentaire sur les gestes

 0  Liens aux textes

 3  Graveur ou bibliothèque

 2  Objets

 1  Créatures

 3  Personnages

 2  Transcription en français moderne

 



Le remouleur a la mode des langues piquante (le_remouleur.xml):

24 annotations: 

 3  Transcription de texte

 1  Commentaire sur la transcription

 5  Commentaire sur les gestes

 0  Liens aux textes

 2  Graveur ou bibliothèque

 8  Objets

 0  Créatures

 2  Personnages

 3  Transcription en français moderne

 



Le Taste-Poule (le_taste_poule.xml):

13 annotations: 

 2  Transcription de texte

 0  Commentaire sur la transcription

 0  Commentaire sur les gestes

 0  Liens aux textes

 3  Graveur ou bibliothèque

 1  Objets

 2  Créatures

 3  Personnages

 2  Transcription en français moderne

 



Les dolens et mal mariez (les_dolens.xml):

44 annotations: 

 17  Transcription de texte

 1  Commentaire sur la transcription

 0  Commentaire sur les gestes

 0  Liens aux textes

 2  Graveur ou bibliothèque

 2  Objets

 0  Créatures

 5  Personnages

 17  Transcription en français moderne

 



L’estocade du gousset (lestocade.xml):

22 annotations: 

 7  Transcription de texte

 0  Commentaire sur la transcription

 3  Commentaire sur les gestes

 0  Liens aux textes

 2  Graveur ou bibliothèque

 1  Objets

 0  Créatures

 2  Personnages

 7  Transcription en français moderne

 



Maitre Miché ou la cornette commande (maitre_miche.xml):

17 annotations: 

 4  Transcription de texte

 0  Commentaire sur la transcription

 0  Commentaire sur les gestes

 0  Liens aux textes

 2  Graveur ou bibliothèque

 4  Objets

 0  Créatures

 3  Personnages

 4  Transcription en français moderne

 



Le mari qui bat sa femme (mari_bat_femme.xml):

16 annotations: 

 2  Transcription de texte

 0  Commentaire sur la transcription

 0  Commentaire sur les gestes

 0  Liens aux textes

 3  Graveur ou bibliothèque

 4  Objets

 1  Créatures

 4  Personnages

 2  Transcription en français moderne

 



Marie ta fille ou elle se mariera (marie_ta_fille.xml):

16 annotations: 

 6  Transcription de texte

 0  Commentaire sur la transcription

 0  Commentaire sur les gestes

 0  Liens aux textes

 2  Graveur ou bibliothèque

 1  Objets

 0  Créatures

 3  Personnages

 4  Transcription en français moderne

 



La confrerie des martyrs (martires.xml):

38 annotations: 

 11  Transcription de texte

 3  Commentaire sur la transcription

 1  Liens aux textes

 2  Commentaire sur les gestes

 3  Graveur ou bibliothèque

 7  Objets

 0  Créatures

 0  Personnages

 11  Transcription en français moderne

 



Le massacre de Lustucru par les femmes (Lagniet) (massacre_lagniet.xml):

38 annotations: 

 11  Transcription de texte

 0  Commentaire sur la transcription

 0  Commentaire sur les gestes

 1  Liens aux textes

 1  Graveur ou bibliothèque

 8  Objets

 0  Créatures

 6  Personnages

 11  Transcription en français moderne

 



Le massacre de Lustucru (Larmessin) (massacre_larmessin.xml):

51 annotations: 

 15  Transcription de texte

 0  Commentaire sur la transcription

 0  Commentaire sur les gestes

 1  Liens aux textes

 1  Graveur ou bibliothèque

 14  Objets

 0  Créatures

 8  Personnages

 12  Transcription en français moderne

 



Migrainne perpetuelle (Le jaloux discret) (migrainne_perpetuelle.xml):

24 annotations: 

 8  Transcription de texte

 1  Commentaire sur la transcription

 0  Liens aux textes

 0  Commentaire sur les gestes

 2  Graveur ou bibliothèque

 4  Objets

 0  Créatures

 1  Personnages

 8  Transcription en français moderne

 



Miroir des dupes (La Coquette) (miroir_des_dupes.xml):

29 annotations: 

 10  Transcription de texte

 0  Commentaire sur la transcription

 0  Liens aux textes

 2  Commentaire sur les gestes

 2  Graveur ou bibliothèque

 2  Objets

 0  Créatures

 3  Personnages

 10  Transcription en français moderne

 



Ce ne sont mye des femmes (mye_des_femmes.xml):

9 annotations: 

 1  Transcription de texte

 0  Commentaire sur la transcription

 0  Commentaire sur les gestes

 0  Liens aux textes

 2  Graveur ou bibliothèque

 3  Objets

 0  Créatures

 2  Personnages

 1  Transcription en français moderne

 



Opérateur céphalique (operateur_cephalique.xml):

63 annotations: 

 18  Transcription de texte

 0  Commentaire sur la transcription

 2  Commentaire sur les gestes

 2  Liens aux textes

 1  Graveur ou bibliothèque

 14  Objets

 4  Créatures

 6  Personnages

 16  Transcription en français moderne

 



La poulle qui chante devant le coq (poule_qui_chante.xml):

23 annotations: 

 6  Transcription de texte

 1  Commentaire sur la transcription

 1  Commentaire sur les gestes

 0  Liens aux textes

 2  Graveur ou bibliothèque

 2  Objets

 2  Créatures

 3  Personnages

 6  Transcription en français moderne

 



Quand avec cinq ou six, parmy les bonnes cheres (quand_avec_cinq.xml):

19 annotations: 

 1  Transcription de texte

 0  Commentaire sur la transcription

 1  Commentaire sur les gestes

 0  Liens aux textes

 2  Graveur ou bibliothèque

 10  Objets

 2  Créatures

 2  Personnages

 1  Transcription en français moderne

 



Tout d’amitié (tout_damitie.xml):

14 annotations: 

 4  Transcription de texte

 0  Commentaire sur la transcription

 0  Commentaire sur les gestes

 1  Liens aux textes

 2  Graveur ou bibliothèque

 1  Objets

 0  Créatures

 2  Personnages

 4  Transcription en français moderne

 



Un homme porte sa femme sur son dos
   (un_homme_porte.xml):

8 annotations: 

 2  Transcription de texte

 0  Commentaire sur la transcription

 0  Commentaire sur les gestes

 0  Liens aux textes

 1  Graveur ou bibliothèque

 1  Objets

 0  Créatures

 2  Personnages

 2  Transcription en français moderne

 



Un homme pousse sa femme dans une brouette (un_homme_pousse.xml):

8 annotations: 

 2  Transcription de texte

 0  Commentaire sur la transcription

 0  Commentaire sur les gestes

 0  Liens aux textes

 1  Graveur ou bibliothèque

 1  Objets

 0  Créatures

 2  Personnages

 2  Transcription en français moderne

 



La visse sans fin. Gorgibus, la dent seiche (visse_sans_fin.xml):

27 annotations: 

 7  Transcription de texte

 1  Commentaire sur la transcription

 0  Commentaire sur les gestes

 0  Liens aux textes

 2  Graveur ou bibliothèque

 7  Objets

 0  Créatures

 3  Personnages

 7  Transcription en français moderne

 



A femme désolée mari ioyeux (a_femme_desolee.xml):

19 annotations: 

 5  Transcription de texte

 2  Commentaire sur la transcription

 0  Commentaire sur les gestes

 0  Liens aux textes

 2  Graveur ou bibliothèque

 2  Objets

 0  Créatures

 3  Personnages

 5  Transcription en français moderne

 



Ce n’est ce que c’estoit (ce_nest_ce_que_cestoit.xml):

8 annotations: 

 1  Transcription de texte

 1  Commentaire sur la transcription

 1  Commentaire sur les gestes

 0  Liens aux textes

 1  Graveur ou bibliothèque

 1  Objets

 0  Créatures

 2  Personnages

 1  Transcription en français moderne

 



Conception des cornes (conception_des_cornes.xml):

15 annotations: 

 2  Transcription de texte

 1  Commentaire sur la transcription

 2  Commentaire sur les gestes

 0  Liens aux textes

 1  Graveur ou bibliothèque

 4  Objets

 0  Créatures

 3  Personnages

 2  Transcription en français moderne

 



Elle est revenue Denise (elle_est_revenue.xml):

8 annotations: 

 2  Transcription de texte

 0  Commentaire sur la transcription

 0  Commentaire sur les gestes

 0  Liens aux textes

 1  Graveur ou bibliothèque

 0  Objets

 0  Créatures

 3  Personnages

 2  Transcription en français moderne

 



Elle ploye la thoilette (elle_ploye_la_thoilette.xml):

6 annotations: 

 2  Transcription de texte

 1  Commentaire sur la transcription

 0  Commentaire sur les gestes

 0  Liens aux textes

 0  Graveur ou bibliothèque

 0  Objets

 0  Créatures

 1  Personnages

 2  Transcription en français moderne

 



Les fames ne tombent jamais le nez en terre (les_fames_ne_tombent.xml):

11 annotations: 

 2  Transcription de texte

 1  Commentaire sur la transcription

 1  Commentaire sur les gestes

 0  Liens aux textes

 0  Graveur ou bibliothèque

 3  Objets

 0  Créatures

 2  Personnages

 2  Transcription en français moderne


This was generated with this code (then I removed the tags):

xquery version "1.0";

declare default element namespace "http://www.tei-c.org/ns/1.0";

let $ret := '
'

return
<results>
<total>
Total images in db: {count(collection('/db/mariage/anthology/images/')//TEI)}
</total>
{
	for $t in collection('/db/mariage/anthology/images/')/TEI
	
	return
		(
			concat($ret, $ret, $t/teiHeader/fileDesc/titleStmt/title[1]/text(), ' (', $t/@xml:id, '.xml):', $ret, count($t//zone), ' annotations: ', $ret),
			for $r in $t//rendition
				
					return 
						(
						concat(count($t//zone[@rendition = $r/@xml:id]), '  ', $r/label/text(), $ret)
						)
		)
}

</results>

27/02/11

Permalink 10:00:35 am, by mholmes, 11 words, 51 views   English (CA)
Categories: Activity log; Mins. worked: 60

Added final image from the current batch

Processed and added Doctus est in calicem image to the repo.

Permalink 07:09:47 am, by mholmes, 15 words, 40 views   English (CA)
Categories: Activity log; Mins. worked: 60

Another IMT image added

Processed and added Le Cornar Content. Only one more to do from the latest batch.

25/02/11

Permalink 12:53:35 pm, by mholmes, 10 words, 45 views   English (CA)
Categories: Activity log; Mins. worked: 60

Another IMT image added

Les Fruits Malheureux cleaned up, processed and added to SVN.

24/02/11

Permalink 10:30:55 am, by mholmes, 11 words, 54 views   English (CA)
Categories: Activity log; Mins. worked: 60

Another IMT image added

La Cornologie cleaned up, processed and turned into an IMT file.

18/02/11

Permalink 02:29:25 pm, by mholmes, 9 words, 52 views   English (CA)
Categories: Activity log; Mins. worked: 60

One more IMT image added

Le Jenin Parfait now cleaned up, processed and XML'ed.

17/02/11

Permalink 01:22:14 pm, by mholmes, 53 words, 53 views   English (CA)
Categories: Activity log; Mins. worked: 75

Added another IMT document

Added Conception des Cornes. This image took a lot of cleaning up. The approach I eventually took with some success was to increase both contrast and brightness (32% and 33% respectively), then I was able to do select-similar stuff to get wide blocks of the noise; using Despeckle carefully on selected areas worked well too.

Permalink 09:17:19 am, by mholmes, 16 words, 71 views   English (CA)
Categories: Academic; Mins. worked: 30

Added another image markup document

New doc elle_ploye_la_thoilette.xml (RES-Z-1746-P-115.jpg) set up and added to SVN.

11/02/11

Permalink 12:12:05 pm, by mholmes, 40 words, 68 views   English (CA)
Categories: Activity log; Mins. worked: 60

Finished preparation of new engraving RES-Z-1746-P-108.jpg

As with the previous one, cleaned up the image, created the two smaller images, and built the Image Markup file, although I haven't entered any text yet. Added everything to SVN, but I haven't uploaded it to the site yet.

08/02/11

Permalink 03:15:30 pm, by mholmes, 52 words, 53 views   English (CA)
Categories: Activity log; Mins. worked: 45

Finished preparation of new engraving RES-Z-1746-P-114.jpg

Finished cleaning up the image, created the two smaller images, and built the Image Markup file, although I haven't entered any text yet. Added everything to SVN, but I haven't uploaded it to the site yet.

Note to self: when cleaning up images, select by colour in the Gimp is your friend.

Permalink 02:37:55 pm, by mholmes, 157 words, 47 views   English (CA)
Categories: Activity log; Mins. worked: 120

Schema changes

Up to now, we've been using a schema which was generated using oddbyexample.xsl from our document collection last year, and then slightly modified. I've now substantially modified the ODD file to make the following changes:

  • Allow <argument> to appear in various locations in which it's not normally allowed, because we have marginal-style arguments throughout our new prose texts, and many of the appear in locations where there's no sentence or division break.
  • Fix some previous errors in attribute values that were available because of errors in the original documents.
  • Added back the <rs> element, which we use in TOCs.
  • Added lots of new attribute values for e.g. @type, which are required for the new documents.

In the process, I fixed some issues in a couple of the actual documents, and I now have the two new documents linking to the updated schema instead of the tei_all they were temporarily using.

07/02/11

Permalink 04:22:05 pm, by mholmes, 56 words, 419 views   English (CA)
Categories: Activity log, Tasks; Mins. worked: 15

TODO: fix problem with arguments not allowed in quotes

In our texts, arguments frequently occur in marginal position, without interrupting the flow of the text. We now have an example of one which occurs inside one of our weird extended quote forms, so I need to make <argument> an acceptable child of <quote> in the ODD, and generate a new schema.

01/02/11

Permalink 03:36:23 pm, by mholmes, 88 words, 57 views   English (CA)
Categories: Activity log; Mins. worked: 30

Partial fix for reference list popup issue

Adding id attributes to the reference elements themselves was enough to make the popup function, in that the reference element is copied to the popup and displayed; so that's one issue fixed. However, because the default display in the reference list shows each reference collapsed, it's also collapsed when it's copied to the popup. I need to elaborate the showReference() function a little so that it can auto-expand that stuff if necessary (without breaking the way the same function works in the context of anthology documents and articles).

31/01/11

Permalink 04:07:19 pm, by mholmes, 43 words, 51 views   English (CA)
Categories: Activity log; Mins. worked: 20

Eight new images waiting to be processed

That makes nine with last week's half-finished one. They'll need quite a lot of cleanup too. That's a priority for next week. They're appropriately named, and in the "incoming" folder. Three are pairs in the same image, and will need to be split.

Permalink 04:05:55 pm, by mholmes, 30 words, 60 views   English (CA)
Categories: Activity log; Mins. worked: 30

GBS and CC now on SVN

The Subversion rollout is working well. GBS is now using it, and CC has been set up on Carrot with it (although isn't editing XML right now). Only LCC left.

Permalink 02:07:56 pm, by mholmes, 59 words, 54 views   English (CA)
Categories: Activity log; Mins. worked: 30

Fixed a bug in ref_by_ajax.xsl

This was generating sporadic errors because it was attempting to use a variable defined in another stylesheet, which wasn't included.

This reminded me, though, that references to other references (nested refs) don't work in the references page itself. That needs fixing. It may be as simple as including the popup on that page, but it might be more complicated.

27/01/11

Permalink 03:41:01 pm, by mholmes, 22 words, 54 views   English (CA)
Categories: Activity log; Mins. worked: 20

New engraving image arrived

RES-Z-1746-P-114.jpg arrived. I've made a start on cleaning it up, then I'll build the other sizes and the XML file.

Permalink 01:44:07 pm, by mholmes, 80 words, 59 views   English (CA)
Categories: Activity log; Mins. worked: 60

Move to Subversion

We're now working out of a subversion repository instead of editing XML directly in home1t; this is basically essential since we have five or more people editing files in the same collection. I've set up LSPW and EGB on their editing machines with subversion and instructions, and all seems to be going well. I've also set up Radish so that it's ready for GBS when she comes in on Monday. I haven't yet set up the Mac for LCC.

26/01/11

Permalink 02:58:40 pm, by mholmes, 52 words, 60 views   English (CA)
Categories: Activity log; Mins. worked: 20

Unsignalled quotes

Some texts (such as Varin) have quotes in them which are not signalled with quote marks; it's only clear they're quotes because of the surrounding text. These are being tagged with <quote type="noMarks">, which I've added to the schema alongside @type="italics", and added a handler to the XSLT.

Permalink 09:27:06 am, by mholmes, 770 words, 94 views   English (CA)
Categories: Activity log; Mins. worked: 60

Encoding tables of contents

The two latest texts have long and detailed tables of contents which need to be encoded. This is the recommended approach, which I've based on the Brown Women Writers Project guidelines, but which use examples from our own texts.

  • The TOC as a whole is encoded as <div type="contents">.
  • The main heading for the whole TOC is encoded as a <head> element.
  • The TOC may then be divided into several sections (as in Le Bon Mariage), or it may be a single section (as in the case of the Ville-Thierry text). Each section should be encoded as a <list type="toc">.
  • If a section of the table has its own subheader (as in the case of Le Bon Mariage), this should be included in a <head> tag inside the <list> tag.
  • Each item in the TOC will be an <item> element inside the <list> element.
  • Each item may contain three elements:
    1. <label> (containing a label such as "Chap. XXXII" in the Ville-Thierry, if there is one).
    2. <rs> ("referring string"), which contains the chapter title, heading or description.
    3. <ref type="pageNum">, containing the page number. This will have two other attributes:
      • target="p_1" (which points, in this example, to page 1 of the text). It is important that you later ensure that an element with the xml:id "p_1" exists in the document; normally this would be the <fw type="pageNum">1</fw> -- in other words, the <fw> tag which contains the page number "1".
      • rend="float:right". This designates that the page number appears on the right of the page, which is normally the case.

Here are some example encodings. This is from Le Bon Mariage:

<div type="contents">
  
  <head>
    <figure><figDesc>Motif décoratif</figDesc></figure>
    <hi>TABLE</hi><lb/>
    <hi>DES CHAPITRES</hi><lb/>
    <hi>CONTENVS EN CE LIVRE</hi><lb/>
  </head>
  
  <list type="toc">
    <head>
      <figure><figDesc>Ligne droite ray/e</figDesc></figure>
      <hi>LIVRE PREMIER.</hi><lb/>
      <hi>TRAITE PREMIER.</hi><lb/>
      <hi>De l'excellence & des fins du Mariage</hi><lb/>
    </head>
    <item>
      <rs>Des trois diverſes eſtats qui ſe retrouuent en l'Egliſe, page </rs>
      <ref type="pagenum" target="#p_1" rend="float: right;">1</ref>
    </item>
    <item>
      <rs>Combien le mariage eſt honorable. </rs>
      <ref type="pagenum" target="#p_6" rend="float: right;">6</ref>
    </item>
    <item>
      <rs>Qu'au mariage & en toutes autres actions ſe ſaut propoſer vne bonne fin.</rs>
      <ref type="pagenum" target="#p_10" rend="float: right;">10</ref>
    </item>
  </list>
  
</div>

This example comes from the Ville-Thierry:

<div type="contents"> 
  <head>
    <figure>
      <figDesc>Motif décoratif</figDesc>
    </figure>
    <hi>TABLE</hi><lb/>
    <hi>DES CHAPITRES.</hi><lb/>
    <figure><figDesc>Ligne droite foncée</figDesc></figure>
    <hi>CHAPITRE PREMIER.</hi><lb/></head>
  <list type="toc">
    <item>
      <rs>De la grandeur & de l'excellence du Mariage. pag.</rs> 
      <ref target="#p_1">1</ref>
    </item>
    
    <item>
      <label>Chap. II.</label> 
      <rs>Qu'il n'y a rien de plus mal-
        heureux que l'état de ceux qui entrent
        mal dans le Mariage , & qui ne s'y con-
        duisent pas par les regles de la charité
        & de la pieté chrétienne.</rs> 
      <ref target="#p_19">19</ref>
    </item>
    
    <item>
      <label>Chap. III.</label> 
      <rs>Quelles sont les fins que les
        Chrétiens doivent se proposer , lorsqu'ils
        s'engagent dans le Mariage.</rs> 
      <ref target="#p_26">26</ref></item>
  </list>
</div>

25/01/11

Permalink 02:57:31 pm, by lspwong, 59 words, 71 views   English (CA)
Categories: Academic; Mins. worked: 0

New project: Varin text

As asked by CC, I have begun to work on the Espines du Mariage text by Varin, which will involve the following:

1) proofreading transcribed text
2) tweaking formatting of text where needed
3) proofreading, writing, and linking references
4) writing notes
5) making list of all Greek texts that appear so that they can be sent to Greek Masters student for transcription/proofreading

24/01/11

Permalink 11:45:13 am, by lspwong, 80 words, 71 views   English (CA)
Categories: Tasks, Academic; Mins. worked: 0

Handler to be written for Bon Mariage and Ville Thierry table of contents

In both the original texts of Bon Mariage and Ville Thierry, the table of contents appears in such a way that the chapter headings/descriptions are aligned to the left while the page numbers on which the chapters begin are aligned to the right. Because of this complex formatting, we cannot simply encode the table of contents as a list as we normally do. I discussed this with MH and he will reflect on how to deal with those texts.

20/01/11

Permalink 02:32:53 pm, by mholmes, 218 words, 55 views   English (CA)
Categories: Activity log; Mins. worked: 90

Reference display bugs fixed

There were some oddities in the display of references linked from inside references -- see my prior post for details. I've now fixed those. The problems arose because:

  1. When a reference was linked from inside another reference, it wasn't necessarily already in the page, because only references linked directly from the text are pulled into the document initially. That meant it had to be retrieved by AJAX.
  2. When it was retrieved by AJAX, it was a fragment rather than a full document; that meant that a lot of the context which was used by XSL matching templates was absent, so a lot of the styling was not applied.
  3. When the reference was retrieved, it was supposed to be added to the list of existing references. However, it replaced them instead, meaning that all subsequent reference links had to retrieve their references anew -- and they then inherited the style problems described above.

I've now fixed this so that the transformation XSL is more sensitive to fragments (it uses a formula local-name(/*) ne 'TEI' to identify a fragment, which is typically rooted on <div> rather than <TEI>). I've also tweaked the AJAX code so that it can append to the content of the target element as well as replace it, using a switch called clearTargetFirst.

Permalink 12:29:10 pm, by lspwong, 92 words, 58 views   English (CA)
Categories: Academic; Mins. worked: 0

Superscript/notes in Bon Mariage text

While working with EGB on encoding Le Bon Mariage, we found that superscripts "a", "b", "c" and "d" appeared within the body of the text of the dedication. These letters correspond to notes "a" to "d" that appear in the margin. MH suggested that I enclose the notes in note tags and that he write a new handler for those particular notes so that they will be numbered by letter. We also enclosed all superscript letters in the text in a comment tag and explained that they corresponded with the marginal notes.
Permalink 10:27:16 am, by lspwong, 47 words, 56 views   English (CA)
Categories: Academic; Mins. worked: 0

Forest Nuptiale complete; Greek transcriptions

I have finished proofreading the Forest Nuptiale. All references, linguistic notes and arguments for the text have also been added.

Currently, I'm working on inputting and formatting the Greek transcriptions that hadn't been added in the Sonnet 1609.

Will be teaching new RA XML encoding this morning/afternoon.

19/01/11

Permalink 08:49:19 am, by mholmes, 24 words, 52 views   English (CA)
Categories: Activity log; Mins. worked: 15

Added new engraving into db

CC sent the last of the metadata, plus a transcription. It's now up on the site, but it will be annotated further by CC.

18/01/11

Permalink 10:15:30 am, by mholmes, 104 words, 65 views   English (CA)
Categories: Activity log, Tasks; Mins. worked: 15

Bug with referencing -- needs fixing

A bug has emerged in the handling of references which reference other references. This is how to reproduce it:

  • Go to the Forest Nuptiale.
  • Search for Braga, and click on the reference link.
  • Inside that reference link, click on the link to Concile de Trente.
  • Click on any other reference link in the document.

When the embedded Trente reference is shown, it's unformatted (paragraphs and links are missing). Thereafter, formatting is missing from any reference which shows up in the box. No idea why this is happening (yet), but we never predicted there would be references which are only linked from inside other references.

Permalink 09:39:50 am, by mholmes, 44 words, 60 views   English (CA)
Categories: Activity log; Mins. worked: 40

XML file created for new engraving

Got some metadata from CC, and created the XML file. Did the initial transcription of the legend (it'll have to be checked, of course), and added the metadata. Still missing a modernization of the legend (which I'm not equipped to do) and the teaser.

Permalink 08:59:02 am, by mholmes, 335 words, 78 views   English (CA)
Categories: Activity log; Mins. worked: 75

New engraving: cleanup/prep process

A new engraving has arrived, so I've gone through the process of cleaning it up and prepping it using the Gimp instead of Photoshop. This is the procedure, updated from the previous PS procedure:

  • Archive the original image in a backup folder (backup, parallel to mariage_5).
  • Open in Gimp. Previously we would have converted to RGB, but I see no reason to do that; I left this at grayscale. Save into the scans folder.
  • Using the magic wand, select some background grey or speckle, and delete. Play around with this step until you've got a relatively clean image, without losing detail. You might also use Select / By Colour if that helps without deleting anything important.
  • Clean up the edges of the image, along with any egregious surface marks in the picture.
  • If there's a BN mark with a ruler, it can be moved in closer to the image sometimes, to keep the size down.
  • Crop the image to remove excessive whitespace around the outside.
  • Resize the image using Bicubic to create one 800px wide. Save this image at around 90% quality, for an image size of two to three hundred K, with the wv_ prefix, into the scans folder.
  • Undo the resize to get back to the original image, then start selecting a thumbnail area; once the selection has started, press the Shift key to constrain it to a square. Copy the selection to the clipboard.
  • Click on File / Create / From Clipboard to create a new thumbnail image.
  • Resize this image to 200 x 200, then save to the images folder with the thumb_ prefix.
  • Copy the main image file and the web view )wv_) file from scans to exist_db/mariage/anthology/images.
  • Create a new XML file in exist_db/mariage/anthology/images linking to the full size image file in the same directory, and add all the metadata.
  • Update the server directories and db.

I haven't created the XML file in this case yet, because I don't have any metadata at all about this image.

06/01/11

Permalink 10:03:45 am, by lspwong, 461 words, 145 views   English (CA)
Categories: Activity log, Tasks; Mins. worked: 40

4 arguments with extra line breaks to be fixed

[EDIT BY MDH: I've looked at these and can't find anything wrong in any of the programming code (XSLT, CSS or XHTML output); furthermore, the extra line breaks don't occur in Opera or Google Chrome, so I conclude this is caused by a Firefox bug, and I'm marking the task as Completed.]

Just a reminder to MH to fix the following lines contained in argument tags that include an extra line break when viewed online: Lines 10327, 10358, 10396 and 10534. Thank you!

[This is in the FOREST NUPTIALE.]

1) Lines 10328-10338 (extra line break occurs between "phes" and "In-" in XSLT):

<div>
<argument rend="float: left; margin-left: -6em; text-align: left;">
<ab rend="font-style: italic; font-size: 70%; text-align: left;">
Philoſo-<lb/>
phes <ref target="../../back_matter/references/references.xml#indien">In-<lb/>
diens</ref>.<lb/>
</ab>
</argument>
<p rend="margin-top: 0; margin-bottom: 0; text-indent: 0;">
ſans la liſte des Philoſophes <ref target="../../back_matter/references/references.xml#indien">In-<lb/>
diens</ref> en font pluſieurs cantons de<lb/>

2) Lines 10359-10368 (extra line break also occurs between "phes" and "In-" in XSLT):

<div>
<argument rend="float: right; margin-right: 5em; text-align: left;">
<ab rend="font-style: italic; font-size: 70%; text-align: left;">
Philoſo-<lb/>
phes <ref target="../../back_matter/references/references.xml#indien">In-<lb/>
diens</ref>.<lb/>
</ab>
</argument>
<p rend="margin-top: 0; margin-bottom: 0; text-indent: 0;">
deſquels ſont rentés, és autres, il faut<lb/>

3) Lines 10394-10406 (extra line break occurs between "des" and "Bra-" in XSLT):

<div>
<argument rend="float: left; margin-left: -5em; text-align: left;">
<ab rend="font-style: italic; font-size: 70%; text-align: left;">
Solenni<lb/>
tez au<lb/>
iour des<lb/>
nopces<lb/>
des <ref target="../../back_matter/references/references.xml#bracmane">Bra-<lb/>
mins</ref>.<lb/>
</ab>
</argument>
<p rend="margin-top: 0; margin-bottom: 0; text-indent: 0;">
ieunes,hommes & femmes: le iour<lb/>

4) Lines 10535-10544 (extra line break also occurs between "des" and "Bra-" in XSLT):

<div>
<argument rend="float: right; margin-right: 5em; text-align: left;">
<ab rend="font-style: italic; font-size: 70%; text-align: left;">
Femmes<lb/>
des <ref target="../../back_matter/references/references.xml#bracmane">Bra-<lb/>
mins</ref>.<lb/>
</ab>
</argument>
<p rend="margin-top: 0; margin-bottom: 0; text-indent: 0;">
quels couchent auec les ſœurs du<lb/>

05/01/11

Permalink 03:03:37 pm, by lspwong, 96 words, 98 views   English (CA)
Categories: Academic; Mins. worked: 0

"av./ap. J.-C." change to "A.C.N./P.C.N."

Discussed with Claire whether or not to replace all instances of "avant/après Jésus-Christ" used to date years and centuries in our references with the more modern dating system "A.C.N./P.C.N." as suggested by the Presses Universitaires de Laval. We both agreed that we should use the modern dating system. I have made these changes to all applicable references in the references.xml file by using the "Find/Replace" tool. All dates written in references from now on will also use the "A.C.N./P.C.N." dating system.
Permalink 09:31:02 am, by mholmes, 57 words, 69 views   English (CA)
Categories: Activity log; Mins. worked: 20

User set up on db for LSPW

I've set up a user on the db for LSPW to log on with, so that she can upload her documents while I'm away. I'll document it in the usual place. All docs are now owned by admin but with group "ras", so both admin and LSPW can update any document. The system collection remains admin/dba.

Mariage

Faut-il se marier? La question de Panurge s’avère incontournable en Occident, surtout à partir de la contre-réforme. Des débuts de la Concile de Trente en 1545 jusqu’à la fin du règne de Louis XIV, la tentative de renouveler le mariage se heurte en France à l’intervention croissante de la monarchie dans cette institution dominée auparavent par l’Église. La rencontre entre ces deux autorités fut tumultueuse mais propice au foisonnement des documents qui font l’objet de ce site : « l’imaginaire nuptial » se compose de divers genres textuels, chacun ayant son caractère propre, mais tous traitant des peurs, des désirs et des fantasmes de plus en plus visibles dans la société d’Ancien Régime grâce aux débats soulevés par la nouvelle problématique de l’union conjugale. L’accent pour le moment est sur les textes et images misogames qui font partie d’un renouveau de la Querelle des femmes pendant les 25 premières années du XVIIe siècle.

Reports

Categories

2011
 << Current>>
Jan Feb Mar Apr
May Jun Jul Aug
Sep Oct Nov Dec

XML Feeds