<?xml version="1.0" encoding="UTF-8"?>
<TEI.2 id="paper_183_van_zundert">
   <teiHeader>
      <fileDesc>
         <titleStmt>
            <title>The e-Laborate Project and the Usability of Another Textual Paradigm</title>
            <author>
               <name reg="van Zundert, Joris">Joris van Zundert</name>
            </author>
            <author>
               <name reg="van Dalen-Oskam, Karina">Karina van Dalen-Oskam</name>
            </author>
            <respStmt>
               <resp>Marked up by </resp>
               <name reg="Holmes, Martin">Martin Holmes</name>
               <lb/>
               <name reg="Baer, Patricia">Patricia Baer</name>
            </respStmt>
         </titleStmt>
         <publicationStmt>
            <p>Marked up to be included in the ACH/ALLC 2005 Conference Abstracts book.</p>
         </publicationStmt>
         <sourceDesc>
            <p>None</p>
         </sourceDesc>
      </fileDesc>
      <profileDesc>
         <textClass>
            <classCode>paper</classCode>
            <keywords>
               <list>
                  <item>digital editing</item>
                  <item>theory of text</item>
                  <item>annotation</item>
               </list>
            </keywords>
         </textClass>
      </profileDesc>
      <revisionDesc>
         <list>
            <item>MDH: Created from John Bradley's XML <date value="2005-03">March 2005</date>
            </item>
            <item>MDH: RS proofed and signed off without changes <date value="2005-05-18">18 May 2005</date>.</item>
         </list>
      </revisionDesc>
   </teiHeader>
   <text>
      <front>
         <docTitle n="The e-Laborate Project and the Usability of Another Textual Paradigm">
            <titlePart>The <title level="m">e-Laborate</title> Project and the Usability of Another Textual Paradigm</titlePart>
         </docTitle>
         <docAuthor>
            <name reg="van Zundert, Joris">Joris van Zundert</name>
            <address>
               <addrLine>joris.van.zundert@niwi.knaw.nl</addrLine>
            </address>
         </docAuthor>
         <titlePart type="affil">Dept. Dutch Linguistics and Literary Studies</titlePart>
         <docAuthor>
            <name reg="van Dalen-Oskam, Karina">Karina van Dalen-Oskam</name>
            <address>
               <addrLine>karina.van.dalen@niwi.knaw.nl</addrLine>
            </address>
         </docAuthor>
         <titlePart type="affil">Dept. Dutch Linguistics and Literary Studies</titlePart>
      </front>
      <body>
         <p>In 2003 we embarked upon the project <title level="m">e-Laborate: a digital platform for partnerships in the humanities and social sciences</title>. The web application (at <xptr to="http://www.e-laborate.nl/"/>) resulting from this project is intended as a virtual workplace for researchers in the humanities and social sciences. The <title level="m">e-Laborate</title> collaboratory contains text collections, collections of statistical data and basic content management tools for sharing and working on text material and datasets. The project allows individual researchers as well as research groups to explore the potential of the collaboratory and to generate feedback. The tools enable users to expand the collection of material continuously and to improve its quality. In our paper we will present <title level="m">e-Laborate</title> as an online research collaboratory and as a web-enabled tool for editing and analysing textual content. We will also show how <title level="m">e-Laborate</title> provides a research environment in which we can explore the usefulness and usability of a specific text paradigm.</p>
         <p>The text material we used in our project comes from the historical cultural journal <title level="a">Vaderlandsche Letteroefeningen</title>. The title means <gloss>National Literary Exercises</gloss> and in academic writing is usually abbreviated as <title level="a">VLO</title>. Published between 1761 and 1876, the <title level="a">VLO</title> is of great importance for every research discipline concerned with the study of culture in the Netherlands during that period. There has long been a widely held desire to see a complete set of the journal available in digital form. However, because of its huge size and the enormous costs that digitisation would entail, this has not been possible before now. The approach we have chosen differs fundamentally from the way in which textual material has usually been digitised and published in the past. The <title level="a">VLO</title> component of the <title level="m">e-Laborate</title> project uses a bottom-up collaborative approach, drawing upon the assistance of researchers, to produce a continuously developing and evolving digital version of the publication. Using this approach NIWI will now be able to publish facsimiles (scans) of the first 50,000 pages of <title level="a">VLO</title> editions by the first quarter of 2005.</p>
         <p>We will describe the development process used in building <title level="m">e-Laborate</title>. The <term>eXtreme Programming</term> protocol (XP) was closely followed. Researchers' demands concerning the texts were closely monitored during the project and used to drive the development of the electronic tools for joint working on text and textual material. Every two weeks new elements were delivered, tested and either approved or commented on. Criticism and additional wishes were also communicated to the developers. In this way we made sure that the tools would really be what researchers collaboratively working on text wanted and needed. The participating researchers are enthusiastic about this development approach and about the tools delivered so far. Formal evaluation and retrospection showed particular appreciation for the pragmatic, forward-looking vision of the project (i.e. building the collaboratory brick by brick, feature by feature).</p>
         <p>The paper will provide a functional and architectural overview of <title level="m">e-Laborate</title> as a collaborative tool for supporting the production of digital editions. At the core of <title level="m">e-Laborate</title> is the <term>transcription object</term>. The transcription object is a container object holding the scanned image of a page from an original publication and a transcription field. Each transcription object's authorisation may be tailored by its creator or owner. Depending on the user's authorisation, the transcription field of a transcription object is shown either as a text edit box or as rendered text. Arbitrary additional metadata may be added. In the case of the <title level="a">VLO</title> a standard id field is added to hold the year, volume and page number that the scan shows. Standard content management utilities available within the <title level="m">e-Laborate</title> platform allow for the arbitrary placing and grouping of individual transcription objects into a page or folder hierarchy. Every transcription object is automatically indexed, so an authorised user or editor can search through the text base and present the search results in a comprehensive way. A fuzzy matching algorithm accommodates spelling variants in both the search input and the indexed material. In the future, tools to further process or statistically analyse those results may be added. Adding modules, tools, or components to <title level="m">e-Laborate</title> is made easy by its simple plugin architecture and open source nature. An addition currently under development is the integration of an open source OCR engine to provide text recognition on demand for uploaded scans.</p>
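         <p>As an illustration only, the transcription object described above can be pictured as a small data structure. The following Python sketch is not taken from the e-Laborate code base: the class name TranscriptionObject, its field names, the vlo_id helper, the layout of the id string and the file path are all assumptions made for the example. It shows a container holding a scan reference, a transcription field and arbitrary metadata, and how a user's authorisation could decide between an edit box and rendered text.</p>
         <eg><![CDATA[
```python
from dataclasses import dataclass, field


@dataclass
class TranscriptionObject:
    """Container for one page scan plus its transcription (hypothetical model)."""
    scan_path: str                                  # location of the scanned page image
    transcription: str = ""                         # text entered by researchers
    owner: str = ""                                 # creator / owner who tailors authorisation
    editors: set = field(default_factory=set)       # users the owner allows to edit
    metadata: dict = field(default_factory=dict)    # arbitrary additional metadata

    def can_edit(self, user: str) -> bool:
        """The owner and explicitly authorised users may edit."""
        return user == self.owner or user in self.editors

    def view_mode(self, user: str) -> str:
        """Edit box for authorised users, rendered text for everyone else."""
        return "edit_box" if self.can_edit(user) else "rendered_text"


def vlo_id(year: int, volume: int, page: int) -> str:
    """Assumed layout for the VLO id field: year, volume and page."""
    return f"{year}-{volume}-{page:04d}"


page = TranscriptionObject(
    scan_path="scans/vlo_1761_1_0001.png",   # made-up path, for illustration only
    owner="editor_a",
    metadata={"id": vlo_id(1761, 1, 1)},
)
page.editors.add("researcher_b")

print(page.view_mode("researcher_b"))
print(page.view_mode("guest"))
print(page.metadata["id"])
```
]]></eg>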
         <p>Current work in the project is focused on the development of a flexible annotation tool. This tool will enable researchers to annotate any part of a scan or of the transcription text of a transcription object, simply by pointing to and highlighting the image part or text range they wish to annotate. Researchers will also be able to respond to annotations by annotating the annotation (<emph>ad infinitum</emph>). A researcher may choose to categorise his or her annotation using a standard or personalised typology of annotations. The standard annotation typologies that will be provided concern, among others, basic formatting (italic, bold, capitalization etc.), ranges of interpretation (word, part of the text etc.) and information type (background historical information, biographical etc.). Any annotation may be categorised in multiple typologies.</p>
         <p>The annotation tool will be as WYSIWYG as possible. This means that a researcher wanting to add annotations will not be burdened with laborious tagging and will need no prior knowledge of any particular mark up language. This is a design choice fundamental to our view of text and textual research. We think that it is not a researcher's concern to produce or validate XML or any other marked up form of text. Knowing about mark up is not fundamental to the task of a text researcher; making inferences about the meaning, structure and form of a text and putting such inferences into annotations is. Therefore tools for the production and enrichment of digital editions should focus on that research related task and not on mark up particulars. As a consequence the digital editing tools of <title level="m">e-Laborate</title> will take care of the creation of valid mark up <soCalled>in the background</soCalled>, recording the name of the user who created the annotation, the date and time of creation, the part of the text or scan the annotation belongs to and, of course, the annotation text itself and any additional metadata provided by the user.</p>
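         <p>To make the idea of mark up created in the background more concrete, the following Python sketch shows one way an application might record an annotation and serialise it to XML without the researcher typing a tag. It is a hedged illustration under our own assumptions, not the e-Laborate format: the Annotation class, its field names and the element and attribute names in the output are all hypothetical. It covers the items listed above (user, creation time, target, annotation text, categories in several typologies) and lets an annotation point at another annotation.</p>
         <eg><![CDATA[
```python
from dataclasses import dataclass, field
from datetime import datetime, timezone
from typing import Dict, Optional
from xml.etree import ElementTree as ET


@dataclass
class Annotation:
    """Stand-off annotation record; every name here is hypothetical."""
    ann_id: str
    user: str                      # who created the annotation
    target: str                    # id of a transcription object or of another annotation
    body: str                      # the annotation text itself
    start: Optional[int] = None    # character range in the transcription, if any
    end: Optional[int] = None
    categories: Dict[str, str] = field(default_factory=dict)   # typology -> category
    created: datetime = field(default_factory=lambda: datetime.now(timezone.utc))

    def to_xml(self) -> ET.Element:
        """Build the XML record on the researcher's behalf."""
        el = ET.Element("annotation", {
            "id": self.ann_id,
            "user": self.user,
            "created": self.created.isoformat(),
            "target": self.target,
        })
        if self.start is not None and self.end is not None:
            el.set("start", str(self.start))
            el.set("end", str(self.end))
        for typology, category in self.categories.items():
            ET.SubElement(el, "category", {"typology": typology}).text = category
        ET.SubElement(el, "body").text = self.body
        return el


# An annotation on a highlighted text range, categorised in two typologies.
note = Annotation(
    ann_id="a1", user="researcher_a", target="vlo-page-1",
    body="Probably a printer's error.", start=10, end=18,
    categories={"information type": "background historical information",
                "range of interpretation": "word"},
)
# A reaction: an annotation whose target is the first annotation.
reply = Annotation(ann_id="a2", user="researcher_b", target=note.ann_id, body="Agreed.")

print(ET.tostring(note.to_xml(), encoding="unicode"))
print(ET.tostring(reply.to_xml(), encoding="unicode"))
```
]]></eg>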
         <p>Fundamental to our project are the guiding principles behind the design choices described in the preceding paragraph: the choice not to define yet another mark up solution, but to concentrate on the researcher's interactions with the textual material, leaving the description of these interactions in the form of XML to the application. We will show that these principles define another textual paradigm, meaning <emph>another</emph> textual paradigm than the text paradigm implicitly emanating from the concepts of <title level="m">TEI</title>.</p>
         <p>At present a powerful surge of <title level="m">TEI</title>-driven edition projects seems to have turned <title level="m">TEI</title> into a de facto standard. Although <title level="m">TEI</title> is undeniably useful as a means of marking up texts for editorial use, its apparent all-round applicability and efficiency need to be contested. We will argue that <title level="m">TEI</title> in its form of explicit mark up is not a very efficient means of editorial mark up. We will also argue that <title level="m">TEI</title> is neither efficient nor very useful when computer supported textual analysis is the focus of research. We will show that the use of <title level="m">TEI</title> forces an a priori, top-down view of text onto a researcher trying to model a text using <title level="m">TEI</title>-tagging. <title level="m">TEI</title>'s particular use of XML and its DTD implicitly present a vision of a text as a flat hierarchy of meaningful text elements. To a researcher wanting to express and analyse overlapping interpretations, associative relations or layered narratives (to name but a few common textual constructs <title level="m">TEI</title> has difficulty expressing), <title level="m">TEI</title> does not provide effective or efficient solutions. We will argue that such a researcher would be better off considering the use of lightly embedded mark up solutions and layered cross tagged mark up as provided for example by the <title level="m">JITT</title> and <title level="m">LMNL</title> models. Although problematic in themselves, these models do address the non-linear, non-hierarchical nature of texts more adequately than <title level="m">TEI</title>. We will also show how these models can be combined to provide an intuitive way of structuring and annotating text, resulting in a dynamic layered model of text that can be represented by proper XML.
We will show how within the context of <title level="m">e-Laborate</title> a graphical user interface enables structuring and annotating texts according to this dynamic model of text representation. We are convinced that this interface enables a researcher to interact with a text on a research and interpretative level rather than a mark up level. We will also show that in such a dynamic research environment it is still possible to provide backward compatibility with <title level="m">TEI</title> mark up using transformation languages.</p>
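         <p>The difficulty with overlapping interpretations can be illustrated with a small, hedged sketch. The Python code below is neither <title level="m">JITT</title> nor <title level="m">LMNL</title>; it is a generic range model of our own devising, and the Range class, the layer names and the sample sentence are assumptions made for the example. It treats annotations as character ranges kept in separate layers and reports which pairs of ranges cross, that is, overlap without one containing the other. Such pairs cannot both be written as properly nested elements in a single XML hierarchy, whereas a layered model can simply store them side by side.</p>
         <eg><![CDATA[
```python
from dataclasses import dataclass
from itertools import combinations
from typing import List, Tuple


@dataclass(frozen=True)
class Range:
    """A labelled character range in one annotation layer (hypothetical model)."""
    layer: str
    label: str
    start: int
    end: int    # exclusive


def crosses(a: Range, b: Range) -> bool:
    """True when two ranges overlap without one containing the other."""
    overlap = a.start < b.end and b.start < a.end
    a_in_b = b.start <= a.start and a.end <= b.end
    b_in_a = a.start <= b.start and b.end <= a.end
    return overlap and not (a_in_b or b_in_a)


def crossing_pairs(ranges: List[Range]) -> List[Tuple[Range, Range]]:
    """All pairs that could not be nested inside one element hierarchy."""
    return [(a, b) for a, b in combinations(ranges, 2) if crosses(a, b)]


text = "He said yes. Then she left the room."   # made-up sample text
sentence = Range("syntax", "sentence", 0, 12)   # "He said yes."
word = Range("syntax", "word", 3, 7)            # "said", nested in the sentence
theme = Range("interpretation", "theme", 8, 21) # "yes. Then she", spans a sentence boundary

for a, b in crossing_pairs([sentence, word, theme]):
    print(a.label, repr(text[a.start:a.end]), "crosses", b.label, repr(text[b.start:b.end]))
```
]]></eg>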
      </body>
      <back>
         <div type="Bibliography">
            <head>Bibliography</head>
            <listBibl>
               <biblStruct>
                  <monogr>
                     <title level="m" type="WWW document">
                        <name reg="DARE, Digital Academic Repositories">DARE, Digital Academic Repositories</name>
                     </title>
                     <imprint/>
                  </monogr>
                  <note>
                     <xptr crdate="2004-11-20" to="http://www.surf.nl/en/themas/index2.php?oid=7"/>
                  </note>
               </biblStruct>
               <biblStruct>
                  <monogr>
                     <title level="m" type="WWW document">
                        <name reg="e-Laborate">e-Laborate</name>
                     </title>
                     <imprint/>
                  </monogr>
                  <note>
                     <xptr crdate="2004-11-20" to="http://www.e-laborate.nl/"/>
                  </note>
               </biblStruct>
               <biblStruct>
                  <monogr>
                     <title level="m" type="WWW document">
                        <name reg="JITT">JITT</name>
                     </title>
                     <imprint/>
                  </monogr>
                  <note>
                     <xptr crdate="2004-11-20" to="http://www.sbl-site2.org/Extreme2002/"/> and <xptr to="http://www.idealliance.org/papers/xml02/dx_xml02/index/title/e93017c13fc3874332dee40367.html"/>
                  </note>
               </biblStruct>
               <biblStruct>
                  <monogr>
                     <title level="m" type="WWW document">
                        <name reg="LMNL">LMNL</name>
                     </title>
                     <imprint/>
                  </monogr>
                  <note>
                     <xptr crdate="2004-11-20" to="http://lmnl.net/"/>
                  </note>
               </biblStruct>
               <biblStruct>
                  <monogr>
                     <title level="m" type="WWW document">
                        <name reg="NHDA">NHDA</name>
                     </title>
                     <imprint/>
                  </monogr>
                  <note>
                     <xptr crdate="2004-11-20" to="http://www.niwi.knaw.nl/en/geschiedenis/collecties/"/>
                  </note>
               </biblStruct>
               <biblStruct>
                  <monogr>
                     <title level="m" type="WWW document">
                        <name reg="NIWI-KNAW">NIWI-KNAW</name>
                     </title>
                     <imprint/>
                  </monogr>
                  <note>
                     <xptr crdate="2004-11-20" to="http://www.niwi.knaw.nl"/>
                  </note>
               </biblStruct>
               <biblStruct>
                  <monogr>
                     <title level="m" type="WWW document">
                        <name reg="SURF">SURF</name>
                     </title>
                     <imprint/>
                  </monogr>
                  <note>
                     <xptr crdate="2004-11-20" to="http://www.surf.nl/en/home/index.php"/>
                  </note>
               </biblStruct>
               <biblStruct>
                  <monogr>
                     <title level="m" type="WWW document">
                        <name reg="TEI and TEI-Consortium">TEI and TEI-Consortium</name>
                     </title>
                     <imprint/>
                  </monogr>
                  <note>
                     <xptr crdate="2004-11-20" to="http://www.tei-c.org/"/>
                  </note>
               </biblStruct>
               <biblStruct>
                  <monogr>
                     <title level="m" type="WWW document">
                        <name reg="Women Writers">Women Writers</name>
                     </title>
                     <imprint/>
                  </monogr>
                  <note>
                     <xptr crdate="2004-11-20" to="http://www.roquade.nl/womenwriters/"/>
                  </note>
               </biblStruct>
               <biblStruct>
                  <monogr>
                     <title level="m" type="WWW document">
                        <name reg="XML and the World Wide Web Consortium">XML and the World Wide Web Consortium</name>
                     </title>
                     <imprint/>
                  </monogr>
                  <note>
                     <xptr crdate="2004-11-20" to="http://www.w3c.org/XML"/>
                  </note>
               </biblStruct>
               <biblStruct>
                  <monogr>
                     <title level="m" type="WWW document">
                        <name reg="Xpast">Xpast</name>
                     </title>
                     <imprint/>
                  </monogr>
                  <note>
                     <xptr crdate="2004-11-20" to="http://www.e-laborate.nl/nl/new_2/toon"/>
                  </note>
               </biblStruct>
               <biblStruct>
                  <analytic>
                     <author>
                        <name reg="Agosti, M.">M. Agosti</name>
                     </author>
                     <author>
                        <name reg="Ferro, I.">I. Ferro</name>
                     </author>
                     <author>
                        <name reg="Frommholz, I.">I. Frommholz</name>
                     </author>
                     <author>
                        <name reg="Thiel, U.">U. Thiel</name>
                     </author>
                     <title level="a">Annotations in Digital Libraries and Collaboratories</title>
                  </analytic>
                  <monogr>
                     <editor>
                        <name reg="Heery, R.">R. Heery</name>
                     </editor>
                     <editor>
                        <name reg="Lyon, L.">L. Lyon</name>
                     </editor>
                     <title level="m">Proceedings of the 8th European Conference, ECDL 2004. Bath, UK, September 12-17, 2004</title>
                     <imprint>
                        <publisher>Springer</publisher>
                        <pubPlace>Berlin</pubPlace>
                        <date value="2004">2004</date>
                        <biblScope type="pages">244-255</biblScope>
                     </imprint>
                  </monogr>
               </biblStruct>
               <biblStruct>
                  <analytic>
                     <author>
                        <name reg="Buzzetti, D.">D. Buzzetti</name>
                     </author>
                     <title level="a">Digital Representation and the Text Model</title>
                  </analytic>
                  <monogr>
                     <title level="j">New Literary History</title>
                     <imprint>
                        <biblScope type="vol">33</biblScope>
                        <biblScope type="pages">61-88</biblScope>
                        <date value="2002">2002</date>
                     </imprint>
                  </monogr>
               </biblStruct>
               <biblStruct>
                  <analytic>
                     <author>
                        <name reg="McGann, J.P.">J.P. McGann</name>
                     </author>
                     <title level="a">Dialogue and interpretation at the interface of man and machine, reflections on textuality and a proposal for an experiment in machine reading</title>
                  </analytic>
                  <monogr>
                     <title level="j">Computers and the Humanities</title>
                     <imprint>
                        <biblScope type="vol">36</biblScope>
                        <biblScope type="pages">95-107</biblScope>
                        <date value="2002">2002</date>
                     </imprint>
                  </monogr>
               </biblStruct>
               <biblStruct>
                  <analytic>
                     <author>
                        <name reg="van Dijk, S.">S. van Dijk</name>
                     </author>
                     <title level="a">Introduction</title>
                  </analytic>
                  <monogr>
                     <editor>
                        <name reg="van Dijk, S.">S. van Dijk</name>
                     </editor>
                     <editor>
                        <name reg="Broomans, P.">P. Broomans</name>
                     </editor>
                     <editor>
                        <name reg="van der Meulen, J.F.">J.F. van der Meulen</name>
                     </editor>
                     <editor>
                        <name reg="van Oostrum, W.R.D.">W.R.D. van Oostrum</name>
                     </editor>
                     <title level="m">'I have heard about you'. Women's writing crossing borders</title>
                     <imprint>
                        <publisher>Verloren</publisher>
                        <pubPlace>Hilversum</pubPlace>
                        <date value="2004">2004</date>
                     </imprint>
                  </monogr>
                  <note>[Information about the <title level="a">VLO</title>.]</note>
               </biblStruct>
            </listBibl>
         </div>
      </back>
   </text>
</TEI.2>