Books of the Bible, I learned today, will take a "title" tag with nothing inside it - this way we're still acknowledging that it is a title of something but they will not come up in Italics or guillemets, which the Chicago style disapproves of.
Hello again, I just wanted to update you on CC's email regarding direct citation of the Petit Robert. I am copy-pasting what she has said:
(Protocal as suggested by me):
"Most of the verbatim references are between 100 and 150 words, so unless I hear otherwise, I am leaving these as they are and not touching them because they're safe. The ones that are over 250 words, I have written my own entry and incorporated another source (often times I'll use the Petit Robert and Wikipedia -- provided that the article corroborates)."
CC responded: "This sounds good to me."
I've now got the main bibliography page rendering appropriately as part of the site; still waiting for an OK from CC to add it to the menu. In the process, I decided to disentangle some potential sources of confusion in the file names and site structure: we had one set of documents and pipelines labelled "biblio" which were actually related to the TOC for the anthology, and another set which were actually the bibliography (list of works referenced). I've now renamed files and pipeline triggers to use "toc" for anything related to the TOC, and "works_list" for anything related to the list of works referenced.
Next is markup of the first of the articles.
Hello again,
- For instances where the meaning of a foreign name is being given, I had been putting:
<head>Homère en gr. (<foreign xml:lang="gr">Homêros</foreign> <cit><quote>l'aveugle</quote></cit>)</head>
Actually, I should be using a<soCalled>l'aveugle</soCalled>
tag: mholmes will make it so that the XSLT will render it as guillemets. (Thank you, mholmes!)
Hello everyone, I have decided to summarize our decisions regarding citations and mark-up on the blog for easy reference. Please feel free to let me know if anything needs to be updated, as it likely will.
- When working with Saints, Anges, etc: The header should be as follows: Saint Autel. The XML id, however, needs to have the proper name before the Saint (e.g. autelSaint instead of saintAutel).
- Mholmes: "Duplicated the Pyrame entry to create a separate Thisbe entry next to it; someone will need to write the two entries to be complementary, and make sure any ref tags point at the appropriate one."
- ccarlin: in agreement with parentheses () and with the foreign tag for headings (no need for an italic attribute as Martin has fixed the XSLT so that all foreign is rendered italic. Here's my template (based off lspwong's beautiful entry):
<head>Alcmène en gr. (<foreign xml:lang="gr">Alkmênê</foreign>)</head>
- ccarlin: "I really don’t think a paraphrase absolves one of accusations of plagiarism unless more than one source is being amalgamated. I suggest a word limit for direct quotations from the Robert Illustré — that is the most clear and honest path, especially if it’s within limits. How about (I suggest arbitrarily) 250 words? It is perfectly OK to take all of the Robert’s reference, with attribution, if it’s not enormous."
- I note that the “Grand Robert en ligne” gives the following definition of “nippe”: Objet servant à l'ajustement et à la parure, « comme hardes, linges, bagues et autres pareilles choses » (Richelet, 1680). Richelet is one of the 17th-century dictionaries we have in the departmental library; I’m sure the word is in Furetière, and probably the Dico de l’Académie too. In an instance where the Robert is citing a 17th-century dictionary, then I think it makes sense for us to give a 17th-century dictionary citation. (Remember that the wonderful Furetière dictionary is now on Gallica.)
- ccarlin: (for my work on the Agremens) “Écu” is another interesting example, in that its meaning depends on the 17th-century context. How much was an “écu” worth? Please consult me in such instances, since I can sometimes quickly come up with information, and other times, with a bit of work. I thought I had a link to a site discussing monetary values, but all I have is a long discussion thread about the complexity of equivalencies, this from the H-France discussion group. Bottom line for our site: additional contextual information, going beyond mere definition, makes this kind of reference particularly worthwhile.
- ccarlin: "Please use Wikipedia judiciously. A brief direct quotation from the Robert for identification purposes is fine. Taking into account the context (how much information is necessary), use of a variety of sources is a good idea, but don’t give up completely on brief verbatim quotations, as I said in my opening paragraph."
My current attempt, based on some back-and-forth with WM, goes like this:
- Replace the
build.xml
,exist-gump.xsl
andexist-jars.xsl
with the ones from here:
http://exist.svn.sourceforge.net/viewvc/exist?view=rev&revision=10205 - Download a fresh Cocoon 2.1.11.
- Patch the Cocoon blocks.properties with a local.blocks.properties which disables Lucene and querybean, per WM's instructions.
- Check the build/scripts/cocoon/build.properties file in $EXIST_HOME to make sure it points to your local $COCOON_HOME instead of WM's
- Sign the eXist jars: in $EXIST_HOME
./build.sh -f build/scripts/jarsigner.xml
- Build eXist: in $EXIST_HOME
./build.sh -f build/scripts/cocoon/build.xml
- Clean up the Cocoon tree: in $COCOON_HOME
./build.sh clean
- Rebuild the Cocoon webapp: in $COCOON_HOME
./build.sh
- Copy
ws-commons-util-1.0.2.jar
andsunxacml-1.2.jar
fromEXIST_HOME/lib/core
into the generatedCOCOON/build/webapp/WEB-INF/lib
- Copy my own TitleSortComparator.jar and xqSearchUtils.jar into the same folder (optional for the moment)
- Copy the generated
webapp
folder to [Tomcat]/webapps, and rename itcocoon
. - Start Tomcat.
- Confirm that Cocoon and eXist are running.
- Try the JNLP: It should startup OK.
- Connect using the client: $EXIST_HOME/bin,
./client.sh
, then connect toxmldb:exist://localhost:8080/cocoon/xmlrpc
- Using either the client or the JNLP, upload documents, then create a backup to make the process simpler in future.
- Shut down Tomcat.
- Add
saxon9he.jar
to[cocoon]/WEB-INF/lib
. - Change
web.xml
to set all encodings to UTF-8 (some are still 8859-1 in the default setup). This assumes that the Tomcat process is itself being launched with a UTF-8 flag in the VM, as part of the Java launch command. I do this using a startup_as_utf8.sh file in [Tomcat]/bin:#!/bin/bash ./startup.sh dFile.encoding="UTF-8"
- Configure Cocoon so that Saxon can be called. First, open cocoon/WEB-INF/cocoon.xconf, and find the bit that refers to Saxon XSLT, which is commented out by default. Uncomment the code and change it according to the instructions in the file, so that it enables Saxon 9:
<component logger="core.xslt" role="org.apache.excalibur.xml.xslt.XSLTProcessor/saxon" class="org.apache.cocoon.components.xslt.TraxProcessor"> <parameter name="use-store" value="true"/> <parameter name="transformer-factory" value="net.sf.saxon.TransformerFactoryImpl"/> </component>
- Now we need to edit
cocoon/sitemap.xmap
to enable the Saxon transformer. In the<map:transformers>
section, add this below the other XSLT transformers:<map:transformer name="saxon" pool-grow="2" pool-max="32" pool-min="8" src="org.apache.cocoon.transformation.TraxTransformer"> <use-request-parameters>false</use-request-parameters> <use-browser-capabilities-db>false</use-browser-capabilities-db> <xslt-processor-role>saxon</xslt-processor-role> </map:transformer>
- Add the following in the
<map:serializers>
section, to enable a couple more useful output formats:<!-- Customization: compatibility setting for IE6 --> <map:serializer logger="sitemap.serializer.xhtml" mime-type="text/html" name="xhtml11_compat" pool-grow="2" pool-max="64" pool-min="2" src="org.apache.cocoon.serialization.XMLSerializer"> <doctype-public>-//W3C//DTD XHTML 1.1//EN</doctype-public> <doctype-system>http://www.w3.org/TR/xhtml11/DTD/xhtml11.dtd</doctype-system> <encoding>UTF-8</encoding> </map:serializer> <!-- Customization: set text output to UTF-8 --> <map:serializer logger="sitemap.serializer.text" mime-type="text/plain" name="text" src="org.apache.cocoon.serialization.TextSerializer"> <encoding>UTF-8</encoding> </map:serializer>
- In eXist's configuration (
WEB-INF/conf.xml
), make change the indexing settings from this:<indexer caseSensitive="yes" index-depth="5" preserve-whitespace-mixed-content="no" stemming="no" suppress-whitespace="none" tokenizer="org.exist.storage.analysis.SimpleTokenizer" track-term-freq="yes">
to this:<indexer caseSensitive="no" index-depth="8" preserve-whitespace-mixed-content="yes" stemming="no" suppress-whitespace="none" tokenizer="org.exist.storage.analysis.SimpleTokenizer" track-term-freq="yes">
- In eXist's
<conf.xml>
, change this:<serializer add-exist-id="none" compress-output="no" enable-xinclude="yes" enable-xsl="no" indent="yes" match-tagging-attributes="no" match-tagging-elements="no">
to this:<serializer add-exist-id="none" compress-output="no" enable-xinclude="yes" enable-xsl="no" indent="yes" match-tagging-attributes="no" match-tagging-elements="yes">
(just the final attribute changes) to turn on exist:match tagging. - In eXist's
<conf.xml>
, change this:<index> <fulltext attributes="false" default="none"> <exclude path="/auth"/> </fulltext> </index>
to this:<index> <fulltext attributes="false" default="all"> <exclude path="/auth"/> </fulltext> </index>
to turn on fulltext indexing. You can also do this by collection in the admin client. - Restart Tomcat. Check that things work.
I've now made all those changes in the actual files themselves, so the right @target
attributes should be pointing at the right reference items. These are things that still need to be cleaned up:
The references for Pyramus and Thisbe are now separate, but they're duplicated. LCC will need to write separate content for each of them.
There are multiple "Humeur" entries (humeurSpermatique, humeurSanguine etc.). These will need to be consolidated so that the single "Humeur" entry covers all the varieties (it might already do that), and the individual sub-entries get deleted. The references in the Sonnet 1621 are already all pointing to "#humeur", the main entry.
Biblio markup is now completed; I just need to add the mechanism for sorting it by various parameters on the actual page.
These are changes I've made to xml:ids
and <head>
elements:
- Removed all initial and final spaces from
<head>
tags. - Normalized all saints' names in head tags to: Saint Paul, Saint Autel, etc.
- Changed the following
xml:ids
:aecus -> eaque aeolus -> eole aeneas -> enee aegysthus -> egisthe aeta -> aetes aeneides -> eneide iapetus -> japet iason -> jason ieanneDeFrance -> jeanneDeFranceSaint isleDeCornouaille -> cornouaille liureDeJuges -> juges laCiteDeDieu -> citeDeDieu royCharlesVIII -> charlesVIII sainctAutel -> autelSaint saintPrix -> prixSaint saintAngeRaphael-> raphaelAnge sainctPaul -> paulSaint timothee -> timotheeSaint sainctIeanBaptiste-> jeanBaptisteSaint augustin -> augustinSaint saintCene -> ceneSaint cocus -> martial libra -> balance
- Removed "(Cocus)" from the Martial entry -- seems very odd that it was there in the first place.
- Fixed some oddities in
<head>
contents. - Deleted the "example reference".
- Found the blank entries, which turned out to be more "Humeur" entries (there are now several of these) which had no
<head>
contents. I've added the appropriate<head>
contents so that we can now see what they are; ultimately, Lauren or Leanna will need to check that the overall "humeur" entry covers all these subtopics, then delete the subtopics, and fix any ref tags pointing at them. - Duplicated the Pyrame entry to create a separate Thisbe entry next to it; someone will need to write the two entries to be complementary, and make sure any ref tags point at the appropriate one.
- Deleted the "cocu" entry. Any ref tags pointing to it will also need to be deleted.
- Deleted the "moliere" entry. Any ref tags will have to be removed in the documents.
- Fixed the entry for Tartare, which had no title but all of its content in the
<head>
tag.
I've uploaded these changes to references.xml, but I haven't fixed any of the documents which might be pointing to old xml:id
s yet. I'll try that tomorrow. In the meantime, I notice there are some titles which have periods at the end, which looks odd...
Just to be absolutely clear about the citation of Le Robert des noms propres, since it's a dictionary, you don't need the page number. Including the date is appropriate, but the year, not the month. So (with the title italicized):
Le Petit Robert: Dictionnaire illustré des noms propres, Paris, Dictionnaires le Robert, 1994. would be the preferred format.