This task has been outstanding for a while, but I've managed to solve it in a very simple way using CSS columns. The solution is specialized to lists at the moment, but it should work identically for any other element on which we want to implement it. Notes/limitations:
child::cb, but it uses descendant::cb for counting the number of columns, on the assumption that some column breaks may occur within list items. It would fail to work if there were no <cb> that was a direct child of the <list>, though. If we were implementing a more general solution, we would need to figure out what level the columnar layout should be implemented on, and we'd probably have to require @type="columnar" or something like that on a block-level element, to trigger the appropriate CSS.-moz- and -webkit- prefixes.Fixed this bug, which was caused by the XSLT not expecting that notes would be placed inside page number <fw> tags. In the process of fixing it, I realized that links from TOC page numbers to the pages concerned would not work in the continuous view because we're not showing page numbers in that view, so I added code to create empty anchors in the text; this allows links from TOCs to work as expected.
Four tasks have been cleared off the list -- two had already been fixed, and just needed checking, but others had to be implemented. Fixes detailed on the original posts.
The cleanup of notes resulting from input by site reviewers last year resulted in many <note>s being reduced to a single word, which is often a simple correction for the word or phrase being annotated. Such cases are not actually <note>s at all; they're better represented by <choice>/<sic>/<corr>, in the same way as we already use <choice>/<orig>/<reg>. Noticing this, we also noticed that the <mentioned> tag has been rather misused in some cases, when either <quote>, <soCalled> or nothing at all would be more appropriate. I've also added a handler for <choice>/<sic>/<corr> which is parallel with the existing one for <choice>/<orig>/<reg>
I've worked through the Sonnet 1609 text to change all such <note>s to <choice>/<sic>/<corr>; some should be <choice>/<orig>/<reg>, but I can't determine that, so LSPW will make another pass through the file and fix those. This markup can then be used as a model where similar situations pertain in other files.
EDIT: Bug fixed by MDH 2011-12-22.
I found the following problems with the Le Blanc table of contents that need to be fixed:
1) When you click on the following page number links, they don't jump to the corresponding page in the body of the text:
<ref type="pageNum" target="#p_1" rend="float: right;">page I</ref>
<ref type="pageNum" target="#p_564" rend="float: right;">564</ref>
<ref type="pageNum" target="#p_621" rend="float: right;"><choice><sic>611</sic><corr>621</corr></choice></ref>
<ref type="pageNum" target="#p_644" rend="float: right;">644</ref>
2) Page number 76 should be in line with the chapter heading in <rs> tag but it appears on the following line even though there is no <lb/>.
Following much discussion of the way we handle abbreviations, we're now thinking that it would be desirable to compile some example images, one for each generic type of abbreviation, from our source pages, and link them into the abbreviation definitions. For the first stage in this, MDH should use XQuery to extract a list of all abbreviations marked up in the text, along with their page numbers, so we can look through and pick out the best examples in each case.
Fixed a bug with page number positioning by re-working some of the margins on the various types of <fw>. Changes documented in CSS file.
This bug proved to be a bug in Firefox so it's now closed.
<g> elements were previously only handled when they occurred within <abbr> tags, but they do occur elsewhere, so I've now added a new handler which shows their character description as a mouseover popup wherever they show up. The original handler for <xsl:template match="choice[child::abbr[descendant::g] and child::expan]"> has also been tweaked so that it more clearly distinguishes between the expansion of the abbreviation, and the explanation of the one or more gaiji characters which appear in it.
The original References page has now been split into two indexes, "Noms propres" and "Terminologie médicale". These were the required changes:
@subtype attribute on the parent <list> element that contains the reference <item>s.@subtype parameter and retrieves the appropriate page title and introductory text for the reference list.There is currently no way to retrieve the full page with all references from a menu item or a link, although of course it's still there at its original location. Should we include a link on each of the reference pages enabling the user to see the complete merged list? If so, what should it say?
On instructions from CC, references are to be split into two groups: botanical references and other references. I've created a new file, botanical.xml, which will hold the new references, and LSPW will separate them out. Then we'll need to decide on how to show them on the site.
<g> tags in the Maladies des femmes<g> tags that are not in context of an abbreviation.
EDIT by MDH: This has been fixed. replace() using this regex:
(text-indent[\s]*:[\s]*[^;]*[;])|(text-indent[\s]*:[\s*][\d][^\s;]*$)
LSPW pointed out that headings in chapters which are nested in <div>s to the point where they become <h3> elements were not centred, whereas those which become <h2>s are centred by default. I added text-align: center to the h3 ruleset in the stylesheet, but then we discovered that reference items have <h3> headings, and those should be left-aligned, so I added another ruleset to set them back to left-aligned.
GM has joined the team, and will be working 6 hours per week on the Ville-Thierry. Slots have been booked on Spartan, and he's had a general intro to the project; LSPW will direct him to the TEI Guidelines initially, and then show him the V-T text and get him started. The V-T has little or no CSS code in it yet, so it's straightforward transcription and markup.
Task for MH: Please find out why note tags are interfering with alignment of chapter and page numbers in table of contents in Maladies des femmes text. Chapter and page numbers should be floating right.
EDIT by MDH: This task seems to have been obviated by two things: First, most of the notes included in the TOC should actually be <choice>/<sic>/<corr> or <choice>/<orig>/<reg> blocks, so there will be a lot fewer note buttons in the TOCs in any case. Second, including a note inside a linked page reference is impractical, because it's not clear what clicking on it could or should do -- show the note, or go to the page reference? So on balance, it makes more sense to attach notes to the text preceding floating page numbers, if there's actually a need for a note, but when the note is simply (for instance) noting an incorrect page number in the original, we can just use <choice>/<sic>/<corr>.
I've made substantial changes to the Le Blanc, adding many linebreaks, moving all right-floated labels further into the margin, and reducing the page size. There are still problems: one issue is that the <quote type="italics"> inline quotes often actually have a smaller font size than the surrounding text, but @rend attributes on <quote> are not carried through to the XHTML yet. That needs to be fixed in the XSLT, and then the relevant <quote> elements (there are 394 of them) need to be examined to see if they need the @rend to reduce their font size.
I've now switched all files over to using the new schema, and standardized them all on the xml-model processing instruction, replacing the oxygen and oasis-schema PIs wherever they occurred. I've also trimmed out a bunch of old unnecessary namespace declarations from root elements.
Following that, I've worked extensively on the Le Blanc, which had a disastrous document model; some chapters were identified as such and some not; chapters were embedded within chapters within other chapters; other divs were left hanging outside chapters; and the structure took no account at all of the volume's organization into four books. That's all now fixed, along with many typos or layout issues I came across while working on it, but it will need proofing again eventually. I'm going to spend a little more time going through LCC's list of issues while I'm focused on that text, though.
I've done some major refactoring of Le Blanc, which is my pilot text for using <label> instead of <argument> for the marginal headings. That feature is working well, but I've had to do a huge amount of work to fix wildly proliferating <div>s in the original markup; they're still a bit of a mess, with chapter <div>s interspersed with other unidentified <div>s that have no apparent purpose, while some subsections in chapters are nested <div>s and others aren't. But a few more hours should sort it.
I've also generated a new schema, which includes <argument> in all the right places as before, but also includes <label> and adds the @type attribute to it (so we can distinguish <label>s used for marginal headings from those in e.g. TOCs.
In the process of this work, I discovered some aberrant behaviour in oXygen which threw me for a while. The old oasis-schema PI, linking to an RNG file, was not actually working, now that oXygen has switched to xml-model; instead, it was silently switching to validation with tei_all, which meant that many things would validate under the oasis-schema heading but fail to validate when xml-model was used.
EDIT by MDH 2011-12-22: I believe this task has been obviated by the proposed (and partially-implemented) change from using <argument> to using <label>. Setting this task to Completed.
Note to self: I thought I had made substantial amendments to the schema that would permit us to use <argument> inside e.g. <p>, because that's where our marginal arguments most often show up. However, in the Le Blanc text, that seems to be failing, so I need to revisit it. The text has been marked up in such a way that open ps and divs are being closed before and argument and opened again afterwards, and when I try to refactor appropriately, the code turns out not to be valid. Either the validation process is failing somehow, or the schema is not the most recent, or perhaps my modifications didn't go far enough.
I've spent quite a while this morning fixing page sizing and margin issues with the Le Blanc; this arises out of the fact that (as far as I can tell) the original page dimensions were measured from the PDF page width rather than the width of the page-image sitting inside the PDF. I think I have that problem basically solved (I had to recalculate about 400 margin-right settings).
However, we have another transformational nesting issue which is exemplified 45 times in Le Blanc, but also shows up in three other texts in the collection: <cit>/<quote> elements cross page boundaries, and therefore contain <fw> and <pb>, but in this context, <quote type="italic"> was converted into an XHTML <i> tag. Initially, I thought this was another case where we needed to split out the contents before the main transformation, but I began to re-think, because there's such a range of related phenomena, including the use of <blockquote>.
I've decided to go for even more divs and spans, eschewing more specialized XHTML elements, because it's easier to get valid structures in highly complicated documents this way; I can then control the display with CSS, making <div>s inline and <span>s block where required. This makes for valid documents (most of them validate now), which ensures that the rest of the CSS gets applied correctly.
LCC left a detailed list of issues to address in the Le Blanc text, and I've started on the first one, which involves the markup and rendering of the title page. Some fixes will be required in XSLT, and also the markup needs a bit of reconfiguring. <titlePart> elements do not seem to be processed as expected; instead of getting <h2> and <h3> tags, they're getting block-display <span>s, and I can't yet figure out how that's happening. It means their rend attributes are not being passed through.
Five of the engravings contain unusual glyphs used for abbreviation, and now the same has appeared in one of the regular texts, so I've documented the markup process for these glyphs, and passed the document on to LSPW so she can merge it into her growing markup documentation.
Tracked down a bunch of bugs, which EGB fixed, and the output now validates, after coming through our two-stage transform. Yay! From 406 errors to zero in a few days.
I've written an intermediary stylesheet to split out <list> elements containing <pb> and <fw> children into separate lists with the <pb>s and <fw>s outside them. This is processed prior to the main transformation, and has solved the problem for lists. I may revisit it for lg/l at some point. For the record, here's the template (the only non-pass-through template in an identity transform):
<!-- Split out interrupted lists into a sequence of lists with the interruptions
between them. -->
<xsl:template match="list[child::fw or child::pb]">
<!-- Stash a reference to the current list so we can easily refer to it in later nested code. -->
<xsl:variable name="thisList" select="."/>
<xsl:for-each-group select="child::item" group-adjacent="count(preceding-sibling::fw) + count(preceding-sibling::pb)">
<list>
<xsl:for-each select="$thisList/@*">
<xsl:copy/>
</xsl:for-each><xsl:text>
</xsl:text>
<xsl:for-each select="current-group()">
<xsl:copy-of select="."/><xsl:text>
</xsl:text>
</xsl:for-each>
</list><xsl:text>
</xsl:text>
<xsl:variable name="lastItem" select="current-group()[position() = last()]"/>
<xsl:for-each select="$lastItem/following-sibling::fw[preceding-sibling::item[1] = $lastItem] | $lastItem/following-sibling::pb[preceding-sibling::item[1] = $lastItem]">
<xsl:copy-of select="."/><xsl:text>
</xsl:text>
</xsl:for-each><xsl:text>
</xsl:text>
</xsl:for-each-group>
</xsl:template>
Created a thumbnail image for the title page of Maladies des Femmes.
EDIT by MDH 2011-12-22: I believe both of these issues are now fixed. I wrote a template to split lists and it's now in use, and I think LSPW has fixed the erroneous @target attributes. Setting this task to Completed.
Several XHTML validation issues have arisen recently. They fall into the following categories:
le_bon_mariage.xml has many instances of page breaks, and forme works which occur inside lists. The XSLT doesn't expect this, and outputs XHTML elements inside <ul> and <ol> which are of course invalid (only <li> is allowed).The first requires a policy decision (do we allow other elements than item inside of list?), followed by fixes to the XML and/or the XSLT to deal with the consequences of the policy. The second seems to be caused by typos in the @target attributes of ref elements in the references file; I've identified a set of these and asked LSPW to fix them. The same kind of typos also show up in anthology documents, so I've identified a stack of them using XQuery and sent them to LSPW too.
NOTE: With regard to the first issue, I wrote to the TEI list, and had a number of useful responses; I think the most appropriate was to keep the current markup (which is, after all, "correct"), but use a two-stage transformation to get the document to XHTML. In the first stage, any lists bisected by page breaks, forme works etc. would be split into two lists, allowing the second transform to proceed appropriately. This is a really good idea, and could also be used to handle bisected lgs, which are currently output as <ul>/<li> sets if they're not bisected, or <div>/<span> sets if they are.
Following a meeting discussing contracts, working hours and future plans, we added LSPW's word-processor file, in which she's documenting the markup practices of the project, to the repo, in a new directory called documentation. It looks pretty comprehensive and extremely useful. I might just store it in the exist db so it could be made available on the site (although a PDF version would be more appropriate for that).
I've always been unhappy with the way original and modern transcriptions in the engravings are stored in separate annotations and linked to separate zones. I think a better approach would be to combine the two within one annotation, inside special divs that mark them as alternates:
<div type="transcription_alternates">
<div type="transcription_orig">
<p>[...]</p>
</div>
<div type="transcription_modern">
<p>[...]</p>
</div>
</div>
The XSLT could then provide a single pop-up annotation window with a choice of two tabs at the top or the bottom, so you could switch between them. It could even remember your preference (the last tab you looked at) in a cookie.
It might be possible to automate the conversion of the existing code to this new approach using XSLT; you'd have to identify zones which were equivalent (mostly overlapping), and rewrite the contents of the Transcription category so it contains both, then delete the modernized ones. I think this would definitely be a cleaner approach to providing both transcriptions.
More research on the regularizations suggests that those which handle i/j and u/v variation are also unnecessary in the original transcription field. Using this XQuery, I was able to identify 573 choice elements which have 340 distinct variants:
xquery version "1.0"; declare default element namespace "http://www.tei-c.org/ns/1.0"; let $list := ( for $c in //choice[orig] where translate($c/orig, 'sſuvUVijIJ', '') = translate($c/reg, 'sſuvUVijIJ', '') return concat($c/orig, ' : ', $c/reg) ), $distList := distinct-values($list) return (concat(count($list), ' items, of which there are ', count($distList), ' distinct values: '), $distList)
If CC agrees that all these are irrelevant, it would be easy to remove them. That would leave 247 other regularizations, which mainly seem to consist of supplying missing spacing and apostrophes. These could presumably be kept.
I discovered when validating documents that some had information missing from their headers (part of the Image Markup Tool header info, perhaps dating back to a version of the IMT that didn't handle that data properly under Wine on Linux). Fixed those, and also fixed some errors in il_cherche.xml, which had one annotation that was classed as an Objet but actually contained transcription.
I've completed the markup of all the latinate abbreviations identified by CC in the five engravings which contain them; I've also found what I think is one more instance of apostrophe-with-dot-below used for -(m)ent, which I've marked up without a description, waiting for word from CC.
I tweaked the title attribute so it begins with the expansion, for additional clarification.
Arising out of a discussion with CC: The intended use of <choice>/<orig>/<reg> in the original transcriptions of the engravings is to restore normal contemporary usage, while the modern transcription is a full modernization. However, there are instances of <choice>/<orig>/<reg> which are used to replace long s with short s (dating back to when we had no modernizations), so these are inconsistent with our policy, and we should get rid of them. I might be able to do this entirely mechanically -- in the case of any regularization where the only difference is between long and short s, I can simply replace the entire <choice> element with the contents of <orig>; while for every regularization where a long s exists in the <orig> but other things are also regularized in the <reg>, I can add an attribute we can use temporarily to find and manually fix them. This is now on my task list.
I've added the required tagging to billet_doux.xml, which has five odd abbreviations of two different types, and implemented the XSLT and CSS to render it as planned. I'm now waiting for textual descriptions in French from CC of the individual symbols, then I'll make the same changes to the other four files which contain the abbreviations.
In the process, I had to modify the schema, which I did using Roma 2, starting from the previous ODD file (mariage_2011_02.odd), and simply adding the gaiji module and a couple of other things. That seemed to work fine, except for the file forest_nuptiale.xml, which was showing validation errors on the @rend attribute when it appears in argument. This was weird, because exactly the same structures validated fine in other files. After some work comparing files, I discovered that the forest_nuptiale.xml file had this declaration:
<?oxygen RNGSchema="../../mariage.rng" type="xml"?>
while other files such as le_bon_mariage.xml had this:
<?oasis-schema href="../../mariage.rng" type="application/xml"?>
Changing to the oasis-schema declaration made the file valid for some reason. Weird, but worth noting in case it crops up again. Other files have combinations of both schema declarations. I should clean up those inconsistencies.
When adding XSLT processing for the <head> tag, I had neglected to suppress its processing inside <lg> during both the <lg> scenarios (one of which uses <ul>, while the other uses <div>s and <span>s because <fw> tags appear inside <lg>s). <head> is now rendered first, and suppressed later, in both scenarios.
The abbreviations deriving from Latin, listed in CC's post, will be handled like this:
<glyph> element in the <charDecl> section of the file header, explaining it briefly.<choice> <abbr>Vo<hi rend="vertical-align: super;"><g ref="#theGlyphTag">9</g></hi></abbr> <expan>Vous</expan> </choice>
<abbr> element, and will give it a @title attribute for mouseover which is constructed from the information in the <glyph> element(s) referenced inside the <abbr>. This differs from a regular <choice>/<abbr>/<expan>, in which the <expan> is used as the @title attribute. The actual expansion will normally be available to readers in the modernized transcription, since these all occur in the engravings.The following updates and changes were made in response to problems and new feature requests, in consultation with LSPW:
<head> inside <lg>, so I've added processing for that.Finally, we looked at the possibility of using OCR to handle the Tableau de l'Amour Conjugal. I installed the Tesseract back-end along with a sourceforge front-end called gImageReader, and also the Hunspell and MySpell French dictionaries, and we tried it out, using Simple Scan to get the image, the GIMP to clean it up a bit, and gImageReader to OCR it. The results aren't great, because the quality of the original document is not good, but it might be slightly quicker than straight transcription.
Alphonse Chassant, Dictionnaire des abréviations latines et françaises, explains the transition from Latin abbreviations used in manuscripts (from Antiquity through the Middle Ages) to print culture in the 16th century, and how versions of these Latin abbreviations slipped into French printed texts. The phenomenon of such abbreviation disappeared over the course of the 17th century, but one of its latest manifestations was on engravings, where there was at times little room for the desired text.
There are not really named glyphs for these abbreviations when used in French, but we can say that they follow the tradition used in Latin of contractions and ellipsis, for example; these are two the forms of abbreviation we commonly find in the engravings in our corpus. Building on that Latin tradition, Chassant shows specific examples that cover all of the instances listed below. The page number in his book is given in each case.
(N.B.: Chassant also shows how a horizontal line could replace an "m" or an "n" in Latin abbreviations, the only abbreviation that remained common in print in 17th-century France, usually marked by a tilde. There are six such examples in the "mariage" engravings, but I have not listed them as this abbreviation is not only common throughout the corpus, it also is qualitatively different from the others, replacing a letter rather than indicating a specific word.)
For the following abbreviations, I think the best strategy will be to include a note in the transcriptions, but I'll talk with MH about this before making any moves. (AC = Chassant. Full reference: Alphonse Chassant, Dictionnaire des abréviations latines et françaisesusitées dans les inscriptions lapidaires et métallique, les manuscrits et chartes, 2e édition, Paris, Auguste Aubry, 1862. Google Livres, 6 août 2011.)
0619645 Dame recevant un billet doux "nous" x 4 (AC, p. 132); "qu'il" (AC, p. 136)
72-B-59070 La femme qui fouette son mary "vous" x 11 (AC, p. xxxii--a form of the Latin abbreviation for "us"); "pour" (AC, p. 134); "votre" x 2 (AC, p. 139)
G-153297 Le massacre de Lustucru (Larmessin) "pour" x 2 (AC, p. 134); "vous" x 4 (AC, p. xxxii--a form of the Latin abbreviation for "us" = 9); "votre" (AC, p. 139)
G-157562 Bonheur des uns "que" (AC, p. 136)
P123-IFFNo.190 Opérateur céphalique "pour" (AC, p. 134)
Today is my last day, so I am revising my comprehensive list of issues still needing to be worked out on Le blanc and creating an updated one. There is still a lot of editing to be done on this text, and due to its gargantuan size, the edits may take a while. I'm sad to leave it, though, since it's a great read, and I hope that the next person working on it enjoys it as much as I have.
<titlePage>: I had used a “text-indent: 7em” on “& de quelques Sainctes, qui ont beaucoup” but the margin was very off. On 07-19-2011 I used a "text-indent: -1em" in the same place and it made it too far left. At this point, I tried taking out the "text-indent" altogether and made the "padding-left" a -0.5em instead of 0.5em. This also hasn't worked and it is currently 3:33, so I am handing this task over to mholmes, because lspwong and I have spent a good half-hour trying to fix it. :)<head> tags are automatically left-aligning. lspwong looked at this and seems to think my coding is correct, so we're not sure why they aren't centering.<head> tags. There is also white space that appears before and after these <head> tags automatically, without me having to put in <lb/> tags. mholmes you'll notice that in instances where the .pdf has no space between a given header and the surrounding paragraphs, I have used a "margin-top: 0; margin-bottom: 0;" to compensate.<lb/> tags in cases where the .pdf shows white space between the headers and the paragraphs.<head> tag as well.To follow up on Leanna's post about naming of categories, "Eloges du mariage" would work for the side bar. Another issue are the categories in the table of contents under "Tous les documents". We can add "éloge, vers" as the counterpoint to "satire, vers" and "satire, prose," although this makes me a little uneasy because there is a type of poem, "l'épithalame," in praise of marriage, but it applies to celebrating a particular marriage -- so I guess "éloge, vers" works, and indeed it seems like the logical choice.
Thinking about the t of c also brings to my attention the designation "roman, fiction, prose"; I don't think we need both "roman" and "fiction"; "fiction" on its own should suffice. The novel is fiction by definition.
Transcription and annotation of Blanchon's Stances du mariage has been completed. The text is now published on the site, however, it appears under the category of "Satire" which is a problem because it is not a satire. CC suggests that a new category should be created, possibly "Éloges du mariage". We'll have to discuss this with MH when he returns from holiday.
...spent over the last few days on moving EG-B to Spartan, timesheets, and discussing various markup issues.
I have more or less finished annotating the Espines du mariage. I'm just waiting for CC to get back to a few of my e-mails before I can add the final changes and references. One particular interesting issue that needs to be addressed is the display of abbreviated latin characters. In the Varin text, there are 2 instances, "Vereque" (p. 120) and "Horaque" (p.139), in which the "que" ending is abbreviated by what looks like the letter "q" with an acute accent and a subscript "3". MH suggests that we should just ignore the abbreviation and transcribe the "que" ending in both words. We're running this by CC first to get her opinion on the matter.
In waiting for CC's reply, I've begun transcribing and encoding a short poem called "Stances du Mariage par antithèse à celles de Ph. Des Portes" by Joachim Blanchon. I'll also be doing the same for a few other short poems that belong to the same anthology.
Another project I plan to work on is reviewing previous texts that I have annotated (sonnet_1609.xml, sonnet_1621.xml, forest_nuptiale.xml) as I have realized, after working on the Varin text, that I failed to write notes/references that identified the source and author of all foreign citations in those texts.
“text-indent: 7em” on “& de quelques Sainctes, qui ont beaucoup” and then put a line break before the rest of the sentence, which I had assumed would be aligned with “& de quelques Sainctes” because it, too, is wrapped in the <hi rend=”text-indent: 7em”> tag, but the margin is very off.“text-align: center; margin-left: auto; margin-right: auto;” after the <head>. I did the same thing for page 1 when its heading was left-aligned and it appears to have worked because on page 1, the header is now centered appropriately. I have checked the code and once again, I have no idea why there is this discrepancy.<head> tag is used, the text is automatically centered. I’ve added in some additional css (“text-align: center; margin-left: auto; margin-right: auto;”) to see if that will fix the problem.<label> and the chapter description in the <rs>, and once again I’m not entirely sure why. I’m wondering if the <item><label> mark-up automatically formats the heading as being one line, so the <lb> I added after it would then act as a double-space. I’ve tested this theory by deleting the line break after “Chapitre I” of the “Livre premier”.<hi rend=”text-indent: (x)em;”> tags, the indents are not showing up on the site at the moment. I’m wondering if this has something to do with the <rs> tag that’s wrapping the descriptions...<item><label> element combination automatically puts a line break before the <rs>.<list type=”toc”><head> for "Livres Deuxiesme", III and IV and the summary beneath them (i.e. “La loüange des femmes vertueuses”) is not currently showing up. While the “Chapitre premier” is showing up on the website, the summary underneath it seems to have disappeared as well, although I have coded all summaries and chapter headings the same way in the .xml.I've started work on a text-output format suitable for text-analysis work. The first stage is normalizing spacing and re-connecting hyphenated words, which I seem to have working OK, although I need to do some more testing. Next we need to look at what normalization strategies we're going to undertake, from the obvious (replacing long s) to the less obvious (such as correcting all variant spellings). I've started learning a little about Mallet, and I've run some of their sample data, but it's obvious that I'm going to have to do a lot of learning before I can make worthwhile use of it.
An issue arose today due to a collision of XSLT templates which are handling <cit> and <quote> elements. Since our practice has evolved rather than been planned, we've ended up with a complex range of different encoding strategies, involving the use of @type on both cit and quote. As it stands, everything seems to work, but we really should take a close look at the relevant templates in teiGeneral.xsl, as well as the encoding practice, so that we can at least document where and when the various strategies are being used.
28 instances in two documents, done on instructions from the French team. Checked them all before making the change.
Minor issues.
The references list is now divided into alphabetical groups by first letter, and there is a list of the letters at the top of the page which you can click on to jump to the appropriate place in the file.
EDIT: SEE COMMENTS BY MH BELOW.
Working on the varin.xml, I discovered a problem when trying to use <cit type="blockquote"><quote type="noMarks">. Although the document validated, it caused an error when trying to view the XSLT. The error was related to XML lines 3218 to 3226 (/TEI/text[1]/body[1]/div[35]/p[4]/cit[1]/quote[1]). I tried to fix the error by changing <cit type="blockquote" rend="margin: 0"><quote type="noMarks"><foreign xml:lang="la"> to simply <cit type="blockquote"><quote><foreign xml:lang="la">, however, this displayed "guillemets" around the quoted text. I ended up having to use <cit type="blockquote" rend="margin: 0"><quote type="italics"><foreign xml:lang="la"> to fix the error and make the text display properly without "guillemets" and only in italics.
MH will probably need to do some more tweaking of block quote handlers.
I think this issue arises out of a basic misunderstanding. The handling for blockquotes depends on <cit type="block">, NOT type="blockquote". If you use <cit type="block">, then no quotes (guillemets) will be applied anyway; by default, blockquotes do not have quotation marks. In fact, the @type attribute on the <quote> element is ignored in the case of <cit type="block">.
So if the objective is to have an italicized blockquote, with the italicization due to its being in a foreign language, you can do this:
<cit type="block" rend="margin: 0; margin-top: 0; margin-bottom: 0;"><quote><foreign xml:lang="la">
I’ay pour le dot beaucoup d’argent receu,<lb/>
Et mon pouuoir ce faiſant i’ay vendu.</foreign></quote></cit>
There are 24 instances of the erroneous cit type="blockquote" in the following documents: forest_nuptiale,
sonnet_1609, and varin. There are 14 instances of the correct formulation, <cit type="block">, which occur in sonnet_1609 and varin. I think we need to look at all the erroneous ones and fix them, making any other adjustments required to make them display properly.
We want A through Z links at the top of the Index page, and anchors at the beginning of each letter segment in the page. This would be useful because searching is hard due to spelling anomalies.
I found a way to automate the addition of @xml:id attributes where these are required because of linking from the TOC, so that's been done in Le Bon Mariage, Le Blanc and Ville-Thierry.
Started working on rendering of TOC code, as described here. These are the changes I've made:
<ref type="pageNum"> which processes a TOC page number into a link to the <fw> containing that page number. Note, though, that none of the <fw> tags in question actually has an @xml:id attribute yet, with the exception of one that I added to Le Bon Mariage for testing. I'll probably try to automate the addition of those attributes.Basically everything seems to be working, although the page width of the Ville-Thierry is still wrong; I need to work on that a little. But someone else will take over the markup of that document at some point, so it could be left to them.
Made the following changes to some site content based on advice from the French team:
Other changes are pending after some discussion.
LSPW reported that an entry in the references file was missing (Chypre). We checked back through SVN, and confirmed that it had never been present during the time we've been using SVN, but I found an old version of the references file from Mariage version 4, and we retrieved it from there.
Meanwhile, there were conflicts with the le_bon_mariage.xml file, arising I think when EGB forgot to do an svn update before starting work, after I had edited the file on the 14th. I think I've resolved these, but we'll know for sure tomorrow when she comes in and does an update.
My previous changes caused a change to the way paragraphs were displayed in the popup references. I've now refined the XSLT a little so that paragraphs in references no longer get the host document's default paragraph styles added to them (as they shouldn't, because they're editorial content and should be consistent across the collection, rather than varying with the style of their host documents).
Fixed some bugs in CSS and XSLT shown up by Le Bon Mariage:
<teiHeader>; this turned out to be a bug in the XSLT, which I fixed.<note type="marginal">) were being treated as editorial notes, and turned into buttons with popups, so I suppressed that (and see below).Also added handling for the marginal notes, grouping them in with the arguments, because that's the way to handle them IMHO; in some cases too, judging by EGB's markup, it's hard to tell the difference between them. So the argument markup now handles editorial marginal notes.
In the process, I fixed a few typos in the Bon Mariage XML file too.
Note to self: TOCs are still not being handled at all. That's a must-do sometime soon.
Added in the new members of the team and more details about tasks, on CC's instructions.
Discussion of plans for the summer, finances, text-analysis, the IMT 2.0, and future conference presentations.
Le Bon Mariage has marginal notes which are linked by superscript letters in the text. This is our plan for dealing with them:
<hi rend="vertical-align: super;">a</hi>
<note type="marginal" rend="float: right; width: 5em; [...etc.]"><ab>[content of the note]</ab></note>Note that the content appears inside an
<ab> element, as in the case of a marginal <argument>.
<argument>s. Spent most of the day on a document intended to be a gentle introduction as well as a serious attempt to document our practice with regard to the use of CSS code in our markup. This took a while but was really overdue. It will serve as the intro for EGB and GBS as they start adding CSS to their documents, so any errors or omissions will be corrected pretty quickly.
One of the tools I used to get a good overview of what we're doing was to run this against Mariage documents in oXygen:
distinct-values(tokenize(string-join(distinct-values(//@rend), ';'), ';'))
to get a set of all the unique property-value pairs. This found a couple of errors, of course, which we've fixed. It would be a good idea to refine this into an XSLT stylesheet, perhaps, which could be run against the whole collection, and bring back a list of all the unique property names, and for each name, a list of the values we've used.
Four new images have been marked up by CC and entered into the db; five are waiting to be worked on. This is a quick survey of the images already in the db, which will perhaps help us to identify those which have only been partially annotated so far:
Total images in db: 66 Présage malheureux (agresseure.xml): 30 annotations: 5 Transcription de texte 0 Commentaire sur la transcription 5 Commentaire sur les gestes 1 Liens aux textes 2 Graveur ou bibliothèque 8 Objets 2 Créatures 2 Personnages 5 Transcription en français moderne Ambition de la femme (ambition_de_la_femme.xml): 19 annotations: 2 Transcription de texte 0 Commentaire sur la transcription 3 Commentaire sur les gestes 0 Liens aux textes 4 Graveur ou bibliothèque 4 Objets 1 Créatures 3 Personnages 2 Transcription en français moderne L’âne et les femmes (ane_des_femmes.xml): 14 annotations: 3 Transcription de texte 1 Commentaire sur la transcription 1 Commentaire sur les gestes 0 Liens aux textes 2 Graveur ou bibliothèque 0 Objets 1 Créatures 3 Personnages 3 Transcription en français moderne Aujourd’huy d’une façon, demain de l’autre (aujourdhuy_dune_facon.xml): 75 annotations: 31 Transcription de texte 0 Commentaire sur la transcription 0 Commentaire sur les gestes 0 Liens aux textes 3 Graveur ou bibliothèque 4 Objets 3 Créatures 3 Personnages 31 Transcription en français moderne Argent fait tout (balance.xml): 34 annotations: 9 Transcription de texte 3 Commentaire sur les gestes 2 Liens aux textes 2 Graveur ou bibliothèque 0 Commentaire sur la transcription 5 Objets 0 Créatures 7 Personnages 6 Transcription en français moderne La besace (besace.xml): 21 annotations: 5 Transcription de texte 0 Commentaire sur la transcription 2 Commentaire sur les gestes 0 Liens aux textes 2 Graveur ou bibliothèque 2 Objets 0 Créatures 5 Personnages 5 Transcription en français moderne Dame recevant un billet doux (billet_doux.xml): 10 annotations: 2 Transcription de texte 0 Commentaire sur la transcription 0 Liens aux textes 0 Commentaire sur les gestes 1 Graveur ou bibliothèque 3 Objets 0 Créatures 2 Personnages 2 Transcription en français moderne Blaise. Femme criarde mari patient (blaise_femme_criarde.xml): 23 annotations: 8 Transcription de texte 0 Commentaire sur la transcription 0 Commentaire sur les gestes 0 Liens aux textes 2 Graveur ou bibliothèque 2 Objets 0 Créatures 3 Personnages 8 Transcription en français moderne Bonheur des uns (bonheur_des_uns.xml): 28 annotations: 8 Transcription de texte 0 Commentaire sur la transcription 2 Commentaire sur les gestes 1 Liens aux textes 3 Graveur ou bibliothèque 2 Objets 1 Créatures 3 Personnages 8 Transcription en français moderne Ce pauvre savatier pour contenter sa femme (ce_pauvre_savetier.xml): 12 annotations: 1 Transcription de texte 2 Commentaire sur la transcription 0 Commentaire sur les gestes 0 Liens aux textes 0 Graveur ou bibliothèque 5 Objets 0 Créatures 3 Personnages 1 Transcription en français moderne Chacun a le sien (Le fardeau du menage) (chacun_a_le_sien.xml): 39 annotations: 13 Transcription de texte 3 Commentaire sur la transcription 0 Liens aux textes 0 Commentaire sur les gestes 2 Graveur ou bibliothèque 3 Objets 0 Créatures 6 Personnages 12 Transcription en français moderne Elle laisse aller le chat au fromage (chat.xml): 28 annotations: 7 Transcription de texte 3 Commentaire sur la transcription 1 Liens aux textes 3 Commentaire sur les gestes 1 Graveur ou bibliothèque 2 Objets 2 Créatures 2 Personnages 7 Transcription en français moderne Cocus que voſtre joye eſt de courte durée (cocus_que_vostre_joie.xml): 13 annotations: 2 Transcription de texte 0 Commentaire sur la transcription 2 Commentaire sur les gestes 0 Liens aux textes 1 Graveur ou bibliothèque 4 Objets 0 Créatures 2 Personnages 2 Transcription en français moderne A la corne jenins (corne_jenins.xml): 14 annotations: 3 Transcription de texte 0 Commentaire sur la transcription 3 Commentaire sur les gestes 1 Liens aux textes 1 Graveur ou bibliothèque 2 Objets 0 Créatures 1 Personnages 3 Transcription en français moderne Corrige si tu peux (corrige_si_tu_peux.xml): 21 annotations: 3 Transcription de texte 1 Commentaire sur la transcription 2 Liens aux textes 2 Graveur ou bibliothèque 3 Commentaire sur les gestes 3 Objets 0 Créatures 4 Personnages 3 Transcription en français moderne Dame Florence, dame Perrine, dame Jacqueline... (dame_florence.xml): 35 annotations: 8 Transcription de texte 0 Commentaire sur la transcription 4 Commentaire sur les gestes 2 Liens aux textes 1 Graveur ou bibliothèque 10 Objets 1 Créatures 1 Personnages 8 Transcription en français moderne Le débat pour la culotte (debat_pour_la_culotte.xml): 12 annotations: 2 Transcription de texte 0 Commentaire sur la transcription 0 Commentaire sur les gestes 1 Liens aux textes 3 Graveur ou bibliothèque 2 Objets 0 Créatures 2 Personnages 2 Transcription en français moderne Dialogue de Dame Alison et de Lubin son mary (dialogue_de_dame.xml): 42 annotations: 7 Transcription de texte 1 Commentaire sur la transcription 11 Commentaire sur les gestes 0 Liens aux textes 2 Graveur ou bibliothèque 8 Objets 0 Créatures 6 Personnages 7 Transcription en français moderne Le fardeau des pères (fardeau_des_peres.xml): 27 annotations: 9 Transcription de texte 0 Commentaire sur la transcription 3 Commentaire sur les gestes 0 Liens aux textes 2 Graveur ou bibliothèque 1 Objets 0 Créatures 3 Personnages 9 Transcription en français moderne Femme à plaindre, mary à corriger (femme_a_plaindre.xml): 28 annotations: 6 Transcription de texte 0 Commentaire sur la transcription 1 Commentaire sur les gestes 0 Liens aux textes 3 Graveur ou bibliothèque 8 Objets 0 Créatures 4 Personnages 6 Transcription en français moderne La femme de mesnage (femme_de_mesnage.xml): 22 annotations: 2 Transcription de texte 0 Commentaire sur la transcription 0 Commentaire sur les gestes 0 Liens aux textes 1 Graveur ou bibliothèque 15 Objets 2 Créatures 0 Personnages 2 Transcription en français moderne La femme d’esprit (femme_desprit.xml): 21 annotations: 4 Transcription de texte 1 Commentaire sur la transcription 2 Commentaire sur les gestes 0 Liens aux textes 2 Graveur ou bibliothèque 3 Objets 0 Créatures 5 Personnages 4 Transcription en français moderne La femme en menage (femme_en_menage.xml): 30 annotations: 8 Transcription de texte 2 Commentaire sur la transcription 1 Commentaire sur les gestes 0 Liens aux textes 3 Graveur ou bibliothèque 4 Objets 0 Créatures 5 Personnages 7 Transcription en français moderne Femmes à table en l’absence de leurs maris (femmes_a_table.xml): 12 annotations: 1 Transcription de texte 0 Commentaire sur la transcription 0 Commentaire sur les gestes 0 Liens aux textes 3 Graveur ou bibliothèque 3 Objets 2 Créatures 2 Personnages 1 Transcription en français moderne L'homme de mesnage (homme_de_mesnage.xml): 17 annotations: 2 Transcription de texte 0 Commentaire sur la transcription 0 Commentaire sur les gestes 0 Liens aux textes 1 Graveur ou bibliothèque 12 Objets 0 Créatures 0 Personnages 2 Transcription en français moderne I’ay les crochets de mon mary (iai_les_crochets.xml): 14 annotations: 1 Transcription de texte 0 Commentaire sur la transcription 1 Commentaire sur les gestes 0 Liens aux textes 3 Graveur ou bibliothèque 7 Objets 0 Créatures 1 Personnages 1 Transcription en français moderne Il cherche ce qu'il ne voudroit pas trouver (il_cherche.xml): 19 annotations: 4 Transcription de texte 1 Liens aux textes 2 Graveur ou bibliothèque 0 Commentaire sur la transcription 2 Commentaire sur les gestes 3 Objets 0 Créatures 3 Personnages 4 Transcription en français moderne Il est bien sage, il met de l’eau dans son vin (il_est_bien_sage.xml): 28 annotations: 8 Transcription de texte 0 Commentaire sur la transcription 3 Commentaire sur les gestes 0 Liens aux textes 2 Graveur ou bibliothèque 3 Objets 1 Créatures 3 Personnages 8 Transcription en français moderne Il n’est pas maistre en sa maison (il_nest_pas_maistre.xml): 13 annotations: 1 Transcription de texte 0 Commentaire sur la transcription 3 Commentaire sur les gestes 0 Liens aux textes 1 Graveur ou bibliothèque 4 Objets 0 Créatures 3 Personnages 1 Transcription en français moderne L'invention des Femmes (1) (invention_femmes_1.xml): 20 annotations: 4 Transcription de texte 0 Commentaire sur la transcription 0 Commentaire sur les gestes 4 Liens aux textes 2 Graveur ou bibliothèque 2 Objets 0 Créatures 4 Personnages 4 Transcription en français moderne L'invention des femmes (2) (invention_femmes_2.xml): 28 annotations: 6 Transcription de texte 0 Commentaire sur la transcription 4 Liens aux textes 0 Commentaire sur les gestes 1 Graveur ou bibliothèque 3 Objets 1 Créatures 7 Personnages 6 Transcription en français moderne La femme battant son mari (la_femme_battant.xml): 28 annotations: 1 Transcription de texte 1 Commentaire sur la transcription 4 Commentaire sur les gestes 2 Liens aux textes 4 Graveur ou bibliothèque 8 Objets 2 Créatures 5 Personnages 1 Transcription en français moderne La femme qui fouette son mary (la_femme_qui_fouette.xml): 28 annotations: 5 Transcription de texte 0 Commentaire sur la transcription 0 Commentaire sur les gestes 0 Liens aux textes 1 Graveur ou bibliothèque 8 Objets 0 Créatures 9 Personnages 5 Transcription en français moderne La grande destruction de Lustucru par les femmes fortes (la_grande_destruction.xml): 26 annotations: 2 Transcription de texte 0 Commentaire sur la transcription 0 Commentaire sur les gestes 1 Liens aux textes 2 Graveur ou bibliothèque 13 Objets 0 Créatures 6 Personnages 2 Transcription en français moderne La rasine d’olà nouvellement inventé pour purger les hommes de leur ivrognerie (la_rasine.xml): 33 annotations: 8 Transcription de texte 0 Commentaire sur la transcription 0 Commentaire sur les gestes 0 Liens aux textes 1 Graveur ou bibliothèque 7 Objets 1 Créatures 8 Personnages 8 Transcription en français moderne Le bon menage (le_bon_menage.xml): 34 annotations: 11 Transcription de texte 1 Commentaire sur la transcription 1 Liens aux textes 2 Commentaire sur les gestes 2 Graveur ou bibliothèque 3 Objets 1 Créatures 2 Personnages 11 Transcription en français moderne Le fournox de Jean Tangous (le_fournaux.xml): 42 annotations: 16 Transcription de texte 0 Commentaire sur la transcription 0 Commentaire sur les gestes 0 Liens aux textes 1 Graveur ou bibliothèque 9 Objets 0 Créatures 0 Personnages 16 Transcription en français moderne Le mari complaisant (le_mari_compliasant.xml): 16 annotations: 1 Transcription de texte 0 Commentaire sur la transcription 2 Commentaire sur les gestes 0 Liens aux textes 2 Graveur ou bibliothèque 7 Objets 0 Créatures 3 Personnages 1 Transcription en français moderne Le mari impuissant (le_mari_impuissant.xml): 27 annotations: 8 Transcription de texte 0 Commentaire sur la transcription 1 Commentaire sur les gestes 0 Liens aux textes 2 Graveur ou bibliothèque 4 Objets 0 Créatures 4 Personnages 8 Transcription en français moderne Le pauvre badin (le_pauvre_badin.xml): 13 annotations: 2 Transcription de texte 0 Commentaire sur la transcription 0 Commentaire sur les gestes 0 Liens aux textes 3 Graveur ou bibliothèque 2 Objets 1 Créatures 3 Personnages 2 Transcription en français moderne Le remouleur a la mode des langues piquante (le_remouleur.xml): 24 annotations: 3 Transcription de texte 1 Commentaire sur la transcription 5 Commentaire sur les gestes 0 Liens aux textes 2 Graveur ou bibliothèque 8 Objets 0 Créatures 2 Personnages 3 Transcription en français moderne Le Taste-Poule (le_taste_poule.xml): 13 annotations: 2 Transcription de texte 0 Commentaire sur la transcription 0 Commentaire sur les gestes 0 Liens aux textes 3 Graveur ou bibliothèque 1 Objets 2 Créatures 3 Personnages 2 Transcription en français moderne Les dolens et mal mariez (les_dolens.xml): 44 annotations: 17 Transcription de texte 1 Commentaire sur la transcription 0 Commentaire sur les gestes 0 Liens aux textes 2 Graveur ou bibliothèque 2 Objets 0 Créatures 5 Personnages 17 Transcription en français moderne L’estocade du gousset (lestocade.xml): 22 annotations: 7 Transcription de texte 0 Commentaire sur la transcription 3 Commentaire sur les gestes 0 Liens aux textes 2 Graveur ou bibliothèque 1 Objets 0 Créatures 2 Personnages 7 Transcription en français moderne Maitre Miché ou la cornette commande (maitre_miche.xml): 17 annotations: 4 Transcription de texte 0 Commentaire sur la transcription 0 Commentaire sur les gestes 0 Liens aux textes 2 Graveur ou bibliothèque 4 Objets 0 Créatures 3 Personnages 4 Transcription en français moderne Le mari qui bat sa femme (mari_bat_femme.xml): 16 annotations: 2 Transcription de texte 0 Commentaire sur la transcription 0 Commentaire sur les gestes 0 Liens aux textes 3 Graveur ou bibliothèque 4 Objets 1 Créatures 4 Personnages 2 Transcription en français moderne Marie ta fille ou elle se mariera (marie_ta_fille.xml): 16 annotations: 6 Transcription de texte 0 Commentaire sur la transcription 0 Commentaire sur les gestes 0 Liens aux textes 2 Graveur ou bibliothèque 1 Objets 0 Créatures 3 Personnages 4 Transcription en français moderne La confrerie des martyrs (martires.xml): 38 annotations: 11 Transcription de texte 3 Commentaire sur la transcription 1 Liens aux textes 2 Commentaire sur les gestes 3 Graveur ou bibliothèque 7 Objets 0 Créatures 0 Personnages 11 Transcription en français moderne Le massacre de Lustucru par les femmes (Lagniet) (massacre_lagniet.xml): 38 annotations: 11 Transcription de texte 0 Commentaire sur la transcription 0 Commentaire sur les gestes 1 Liens aux textes 1 Graveur ou bibliothèque 8 Objets 0 Créatures 6 Personnages 11 Transcription en français moderne Le massacre de Lustucru (Larmessin) (massacre_larmessin.xml): 51 annotations: 15 Transcription de texte 0 Commentaire sur la transcription 0 Commentaire sur les gestes 1 Liens aux textes 1 Graveur ou bibliothèque 14 Objets 0 Créatures 8 Personnages 12 Transcription en français moderne Migrainne perpetuelle (Le jaloux discret) (migrainne_perpetuelle.xml): 24 annotations: 8 Transcription de texte 1 Commentaire sur la transcription 0 Liens aux textes 0 Commentaire sur les gestes 2 Graveur ou bibliothèque 4 Objets 0 Créatures 1 Personnages 8 Transcription en français moderne Miroir des dupes (La Coquette) (miroir_des_dupes.xml): 29 annotations: 10 Transcription de texte 0 Commentaire sur la transcription 0 Liens aux textes 2 Commentaire sur les gestes 2 Graveur ou bibliothèque 2 Objets 0 Créatures 3 Personnages 10 Transcription en français moderne Ce ne sont mye des femmes (mye_des_femmes.xml): 9 annotations: 1 Transcription de texte 0 Commentaire sur la transcription 0 Commentaire sur les gestes 0 Liens aux textes 2 Graveur ou bibliothèque 3 Objets 0 Créatures 2 Personnages 1 Transcription en français moderne Opérateur céphalique (operateur_cephalique.xml): 63 annotations: 18 Transcription de texte 0 Commentaire sur la transcription 2 Commentaire sur les gestes 2 Liens aux textes 1 Graveur ou bibliothèque 14 Objets 4 Créatures 6 Personnages 16 Transcription en français moderne La poulle qui chante devant le coq (poule_qui_chante.xml): 23 annotations: 6 Transcription de texte 1 Commentaire sur la transcription 1 Commentaire sur les gestes 0 Liens aux textes 2 Graveur ou bibliothèque 2 Objets 2 Créatures 3 Personnages 6 Transcription en français moderne Quand avec cinq ou six, parmy les bonnes cheres (quand_avec_cinq.xml): 19 annotations: 1 Transcription de texte 0 Commentaire sur la transcription 1 Commentaire sur les gestes 0 Liens aux textes 2 Graveur ou bibliothèque 10 Objets 2 Créatures 2 Personnages 1 Transcription en français moderne Tout d’amitié (tout_damitie.xml): 14 annotations: 4 Transcription de texte 0 Commentaire sur la transcription 0 Commentaire sur les gestes 1 Liens aux textes 2 Graveur ou bibliothèque 1 Objets 0 Créatures 2 Personnages 4 Transcription en français moderne Un homme porte sa femme sur son dos (un_homme_porte.xml): 8 annotations: 2 Transcription de texte 0 Commentaire sur la transcription 0 Commentaire sur les gestes 0 Liens aux textes 1 Graveur ou bibliothèque 1 Objets 0 Créatures 2 Personnages 2 Transcription en français moderne Un homme pousse sa femme dans une brouette (un_homme_pousse.xml): 8 annotations: 2 Transcription de texte 0 Commentaire sur la transcription 0 Commentaire sur les gestes 0 Liens aux textes 1 Graveur ou bibliothèque 1 Objets 0 Créatures 2 Personnages 2 Transcription en français moderne La visse sans fin. Gorgibus, la dent seiche (visse_sans_fin.xml): 27 annotations: 7 Transcription de texte 1 Commentaire sur la transcription 0 Commentaire sur les gestes 0 Liens aux textes 2 Graveur ou bibliothèque 7 Objets 0 Créatures 3 Personnages 7 Transcription en français moderne A femme désolée mari ioyeux (a_femme_desolee.xml): 19 annotations: 5 Transcription de texte 2 Commentaire sur la transcription 0 Commentaire sur les gestes 0 Liens aux textes 2 Graveur ou bibliothèque 2 Objets 0 Créatures 3 Personnages 5 Transcription en français moderne Ce n’est ce que c’estoit (ce_nest_ce_que_cestoit.xml): 8 annotations: 1 Transcription de texte 1 Commentaire sur la transcription 1 Commentaire sur les gestes 0 Liens aux textes 1 Graveur ou bibliothèque 1 Objets 0 Créatures 2 Personnages 1 Transcription en français moderne Conception des cornes (conception_des_cornes.xml): 15 annotations: 2 Transcription de texte 1 Commentaire sur la transcription 2 Commentaire sur les gestes 0 Liens aux textes 1 Graveur ou bibliothèque 4 Objets 0 Créatures 3 Personnages 2 Transcription en français moderne Elle est revenue Denise (elle_est_revenue.xml): 8 annotations: 2 Transcription de texte 0 Commentaire sur la transcription 0 Commentaire sur les gestes 0 Liens aux textes 1 Graveur ou bibliothèque 0 Objets 0 Créatures 3 Personnages 2 Transcription en français moderne Elle ploye la thoilette (elle_ploye_la_thoilette.xml): 6 annotations: 2 Transcription de texte 1 Commentaire sur la transcription 0 Commentaire sur les gestes 0 Liens aux textes 0 Graveur ou bibliothèque 0 Objets 0 Créatures 1 Personnages 2 Transcription en français moderne Les fames ne tombent jamais le nez en terre (les_fames_ne_tombent.xml): 11 annotations: 2 Transcription de texte 1 Commentaire sur la transcription 1 Commentaire sur les gestes 0 Liens aux textes 0 Graveur ou bibliothèque 3 Objets 0 Créatures 2 Personnages 2 Transcription en français moderne
This was generated with this code (then I removed the tags):
xquery version "1.0";
declare default element namespace "http://www.tei-c.org/ns/1.0";
let $ret := '
'
return
<results>
<total>
Total images in db: {count(collection('/db/mariage/anthology/images/')//TEI)}
</total>
{
for $t in collection('/db/mariage/anthology/images/')/TEI
return
(
concat($ret, $ret, $t/teiHeader/fileDesc/titleStmt/title[1]/text(), ' (', $t/@xml:id, '.xml):', $ret, count($t//zone), ' annotations: ', $ret),
for $r in $t//rendition
return
(
concat(count($t//zone[@rendition = $r/@xml:id]), ' ', $r/label/text(), $ret)
)
)
}
</results>
Processed and added Doctus est in calicem image to the repo.
Processed and added Le Cornar Content. Only one more to do from the latest batch.
Les Fruits Malheureux cleaned up, processed and added to SVN.
La Cornologie cleaned up, processed and turned into an IMT file.
Le Jenin Parfait now cleaned up, processed and XML'ed.
Added Conception des Cornes. This image took a lot of cleaning up. The approach I eventually took with some success was to increase both contrast and brightness (32% and 33% respectively), then I was able to do select-similar stuff to get wide blocks of the noise; using Despeckle carefully on selected areas worked well too.
New doc elle_ploye_la_thoilette.xml (RES-Z-1746-P-115.jpg) set up and added to SVN.
As with the previous one, cleaned up the image, created the two smaller images, and built the Image Markup file, although I haven't entered any text yet. Added everything to SVN, but I haven't uploaded it to the site yet.
Finished cleaning up the image, created the two smaller images, and built the Image Markup file, although I haven't entered any text yet. Added everything to SVN, but I haven't uploaded it to the site yet.
Note to self: when cleaning up images, select by colour in the Gimp is your friend.
Up to now, we've been using a schema which was generated using oddbyexample.xsl from our document collection last year, and then slightly modified. I've now substantially modified the ODD file to make the following changes:
<argument> to appear in various locations in which it's not normally allowed, because we have marginal-style arguments throughout our new prose texts, and many of the appear in locations where there's no sentence or division break.<rs> element, which we use in TOCs.In the process, I fixed some issues in a couple of the actual documents, and I now have the two new documents linking to the updated schema instead of the tei_all they were temporarily using.
In our texts, arguments frequently occur in marginal position, without interrupting the flow of the text. We now have an example of one which occurs inside one of our weird extended quote forms, so I need to make <argument> an acceptable child of <quote> in the ODD, and generate a new schema.
Adding id attributes to the reference elements themselves was enough to make the popup function, in that the reference element is copied to the popup and displayed; so that's one issue fixed. However, because the default display in the reference list shows each reference collapsed, it's also collapsed when it's copied to the popup. I need to elaborate the showReference() function a little so that it can auto-expand that stuff if necessary (without breaking the way the same function works in the context of anthology documents and articles).
That makes nine with last week's half-finished one. They'll need quite a lot of cleanup too. That's a priority for next week. They're appropriately named, and in the "incoming" folder. Three are pairs in the same image, and will need to be split.
The Subversion rollout is working well. GBS is now using it, and CC has been set up on Carrot with it (although isn't editing XML right now). Only LCC left.
This was generating sporadic errors because it was attempting to use a variable defined in another stylesheet, which wasn't included.
This reminded me, though, that references to other references (nested refs) don't work in the references page itself. That needs fixing. It may be as simple as including the popup on that page, but it might be more complicated.
RES-Z-1746-P-114.jpg arrived. I've made a start on cleaning it up, then I'll build the other sizes and the XML file.
We're now working out of a subversion repository instead of editing XML directly in home1t; this is basically essential since we have five or more people editing files in the same collection. I've set up LSPW and EGB on their editing machines with subversion and instructions, and all seems to be going well. I've also set up Radish so that it's ready for GBS when she comes in on Monday. I haven't yet set up the Mac for LCC.
Some texts (such as Varin) have quotes in them which are not signalled with quote marks; it's only clear they're quotes because of the surrounding text. These are being tagged with <quote type="noMarks">, which I've added to the schema alongside @type="italics", and added a handler to the XSLT.
The two latest texts have long and detailed tables of contents which need to be encoded. This is the recommended approach, which I've based on the Brown Women Writers Project guidelines, but which use examples from our own texts.
<div type="contents">.<head> element.<list type="toc">.<head> tag inside the <list> tag.<item> element inside the <list> element.<label> (containing a label such as "Chap. XXXII" in the Ville-Thierry, if there is one).<rs> ("referring string"), which contains the chapter title, heading or description.<ref type="pageNum">, containing the page number. This will have two other attributes:
target="p_1" (which points, in this example, to page 1 of the text). It is important that you later ensure that an element with the xml:id "p_1" exists in the document; normally this would be the <fw type="pageNum">1</fw> -- in other words, the <fw> tag which contains the page number "1".rend="float:right". This designates that the page number appears on the right of the page, which is normally the case.Here are some example encodings. This is from Le Bon Mariage:
<div type="contents"> <head> <figure><figDesc>Motif décoratif</figDesc></figure> <hi>TABLE</hi><lb/> <hi>DES CHAPITRES</hi><lb/> <hi>CONTENVS EN CE LIVRE</hi><lb/> </head> <list type="toc"> <head> <figure><figDesc>Ligne droite ray/e</figDesc></figure> <hi>LIVRE PREMIER.</hi><lb/> <hi>TRAITE PREMIER.</hi><lb/> <hi>De l'excellence & des fins du Mariage</hi><lb/> </head> <item> <rs>Des trois diverſes eſtats qui ſe retrouuent en l'Egliſe, page </rs> <ref type="pagenum" target="#p_1" rend="float: right;">1</ref> </item> <item> <rs>Combien le mariage eſt honorable. </rs> <ref type="pagenum" target="#p_6" rend="float: right;">6</ref> </item> <item> <rs>Qu'au mariage & en toutes autres actions ſe ſaut propoſer vne bonne fin.</rs> <ref type="pagenum" target="#p_10" rend="float: right;">10</ref> </item> </list> </div>
This example comes from the Ville-Thierry:
<div type="contents"> <head> <figure> <figDesc>Motif décoratif</figDesc> </figure> <hi>TABLE</hi><lb/> <hi>DES CHAPITRES.</hi><lb/> <figure><figDesc>Ligne droite foncée</figDesc></figure> <hi>CHAPITRE PREMIER.</hi><lb/></head> <list type="toc"> <item> <rs>De la grandeur & de l'excellence du Mariage. pag.</rs> <ref target="#p_1">1</ref> </item> <item> <label>Chap. II.</label> <rs>Qu'il n'y a rien de plus mal- heureux que l'état de ceux qui entrent mal dans le Mariage , & qui ne s'y con- duisent pas par les regles de la charité & de la pieté chrétienne.</rs> <ref target="#p_19">19</ref> </item> <item> <label>Chap. III.</label> <rs>Quelles sont les fins que les Chrétiens doivent se proposer , lorsqu'ils s'engagent dans le Mariage.</rs> <ref target="#p_26">26</ref></item> </list> </div>
As asked by CC, I have begun to work on the Espines du Mariage text by Varin, which will involve the following:
1) proofreading transcribed text
2) tweaking formatting of text where needed
3) proofreading, writing, and linking references
4) writing notes
5) making list of all Greek texts that appear so that they can be sent to Greek Masters student for transcription/proofreading
In both the original texts of Bon Mariage and Ville Thierry, the table of contents appears in such a way that the chapter headings/descriptions are aligned to the left while the page numbers on which the chapters begin are aligned to the right. Because of this complex formatting, we cannot simply encode the table of contents as a list as we normally do. I discussed this with MH and he will reflect on how to deal with those texts.
There were some oddities in the display of references linked from inside references -- see my prior post for details. I've now fixed those. The problems arose because:
I've now fixed this so that the transformation XSL is more sensitive to fragments (it uses a formula local-name(/*) ne 'TEI' to identify a fragment, which is typically rooted on <div> rather than <TEI>). I've also tweaked the AJAX code so that it can append to the content of the target element as well as replace it, using a switch called clearTargetFirst.
I have finished proofreading the Forest Nuptiale. All references, linguistic notes and arguments for the text have also been added.
Currently, I'm working on inputting and formatting the Greek transcriptions that hadn't been added in the Sonnet 1609.
Will be teaching new RA XML encoding this morning/afternoon.
CC sent the last of the metadata, plus a transcription. It's now up on the site, but it will be annotated further by CC.
A bug has emerged in the handling of references which reference other references. This is how to reproduce it:
When the embedded Trente reference is shown, it's unformatted (paragraphs and links are missing). Thereafter, formatting is missing from any reference which shows up in the box. No idea why this is happening (yet), but we never predicted there would be references which are only linked from inside other references.
Got some metadata from CC, and created the XML file. Did the initial transcription of the legend (it'll have to be checked, of course), and added the metadata. Still missing a modernization of the legend (which I'm not equipped to do) and the teaser.
A new engraving has arrived, so I've gone through the process of cleaning it up and prepping it using the Gimp instead of Photoshop. This is the procedure, updated from the previous PS procedure:
backup, parallel to mariage_5).scans folder.wv_ prefix, into the scans folder.images folder with the thumb_ prefix.scans to exist_db/mariage/anthology/images.exist_db/mariage/anthology/images linking to the full size image file in the same directory, and add all the metadata.I haven't created the XML file in this case yet, because I don't have any metadata at all about this image.
[EDIT BY MDH: I've looked at these and can't find anything wrong in any of the programming code (XSLT, CSS or XHTML output); furthermore, the extra line breaks don't occur in Opera or Google Chrome, so I conclude this is caused by a Firefox bug, and I'm marking the task as Completed.]
Just a reminder to MH to fix the following lines contained in argument tags that include an extra line break when viewed online: Lines 10327, 10358, 10396 and 10534. Thank you!
[This is in the FOREST NUPTIALE.]
1) Lines 10328-10338 (extra line break occurs between "phes" and "In-" in XSLT):
<div>
<argument rend="float: left; margin-left: -6em; text-align: left;">
<ab rend="font-style: italic; font-size: 70%; text-align: left;">
Philoſo-<lb/>
phes <ref target="../../back_matter/references/references.xml#indien">In-<lb/>
diens</ref>.<lb/>
</ab>
</argument>
<p rend="margin-top: 0; margin-bottom: 0; text-indent: 0;">
ſans la liſte des Philoſophes <ref target="../../back_matter/references/references.xml#indien">In-<lb/>
diens</ref> en font pluſieurs cantons de<lb/>
2) Lines 10359-10368 (extra line break also occurs between "phes" and "In-" in XSLT):
<div>
<argument rend="float: right; margin-right: 5em; text-align: left;">
<ab rend="font-style: italic; font-size: 70%; text-align: left;">
Philoſo-<lb/>
phes <ref target="../../back_matter/references/references.xml#indien">In-<lb/>
diens</ref>.<lb/>
</ab>
</argument>
<p rend="margin-top: 0; margin-bottom: 0; text-indent: 0;">
deſquels ſont rentés, és autres, il faut<lb/>
3) Lines 10394-10406 (extra line break occurs between "des" and "Bra-" in XSLT):
<div>
<argument rend="float: left; margin-left: -5em; text-align: left;">
<ab rend="font-style: italic; font-size: 70%; text-align: left;">
Solenni<lb/>
tez au<lb/>
iour des<lb/>
nopces<lb/>
des <ref target="../../back_matter/references/references.xml#bracmane">Bra-<lb/>
mins</ref>.<lb/>
</ab>
</argument>
<p rend="margin-top: 0; margin-bottom: 0; text-indent: 0;">
ieunes,hommes & femmes: le iour<lb/>
4) Lines 10535-10544 (extra line break also occurs between "des" and "Bra-" in XSLT):
<div>
<argument rend="float: right; margin-right: 5em; text-align: left;">
<ab rend="font-style: italic; font-size: 70%; text-align: left;">
Femmes<lb/>
des <ref target="../../back_matter/references/references.xml#bracmane">Bra-<lb/>
mins</ref>.<lb/>
</ab>
</argument>
<p rend="margin-top: 0; margin-bottom: 0; text-indent: 0;">
quels couchent auec les ſœurs du<lb/>
I've set up a user on the db for LSPW to log on with, so that she can upload her documents while I'm away. I'll document it in the usual place. All docs are now owned by admin but with group "ras", so both admin and LSPW can update any document. The system collection remains admin/dba.
Faut-il se marier? La question de Panurge s’avère incontournable en Occident, surtout à partir de la contre-réforme. Des débuts de la Concile de Trente en 1545 jusqu’à la fin du règne de Louis XIV, la tentative de renouveler le mariage se heurte en France à l’intervention croissante de la monarchie dans cette institution dominée auparavent par l’Église. La rencontre entre ces deux autorités fut tumultueuse mais propice au foisonnement des documents qui font l’objet de ce site : « l’imaginaire nuptial » se compose de divers genres textuels, chacun ayant son caractère propre, mais tous traitant des peurs, des désirs et des fantasmes de plus en plus visibles dans la société d’Ancien Régime grâce aux débats soulevés par la nouvelle problématique de l’union conjugale. L’accent pour le moment est sur les textes et images misogames qui font partie d’un renouveau de la Querelle des femmes pendant les 25 premières années du XVIIe siècle.
| << | Current | >> | |
| Jan | Feb | Mar | Apr |
| May | Jun | Jul | Aug |
| Sep | Oct | Nov | Dec |