<?xml version="1.0" encoding="UTF-8"?>
<TEI.2 id="paper_203_schreibman">
   <teiHeader>
      <fileDesc>
         <titleStmt>
            <title>Letters and Lacunae: Editing an Electronic Scholarly Edition of Correspondence</title>
            <author>
               <name reg="Schreibman, Susan">Susan Schreibman</name>
            </author>
            <author>
               <name reg="Gueguen, Gretchen">Gretchen Gueguen</name>
            </author>
            <author>
               <name reg="Kumar, Amit">Amit Kumar</name>
            </author>
            <author>
               <name reg="Saddlemyer, Ann">Ann Saddlemyer</name>
            </author>
            <respStmt>
               <resp>Marked up by </resp>
               <name reg="Holmes, Martin">Martin Holmes</name>
               <lb/>
               <name reg="Baer, Patricia">Patricia Baer</name>
            </respStmt>
         </titleStmt>
         <publicationStmt>
            <p>Marked up to be included in the ACH/ALLC 2005 Conference Abstracts book.</p>
         </publicationStmt>
         <sourceDesc>
            <p>None</p>
         </sourceDesc>
      </fileDesc>
      <profileDesc>
         <textClass>
            <classCode>paper</classCode>
            <keywords>
               <list>
                  <item>TEI</item>
                  <item>scholarly editing</item>
                  <item>text encoding</item>
               </list>
            </keywords>
         </textClass>
      </profileDesc>
      <revisionDesc>
         <list>
            <item>MDH: Created from John Bradley's XML <date value="2005-03">March 2005</date>
            </item>
            <item>MDH: Proofed by Ray Siemens <date value="2005-04-03">3 April 2005</date>
            </item>
         </list>
      </revisionDesc>
   </teiHeader>
   <text>
      <front>
         <docTitle n="Letters and Lacunae: Editing an Electronic Scholarly Edition of Correspondence">
            <titlePart>Letters and Lacunae: Editing an Electronic Scholarly Edition of Correspondence</titlePart>
         </docTitle>
         <docAuthor>
            <name reg="Schreibman, Susan">Susan Schreibman</name>
            <address>
               <addrLine>sschreib@umd.edu</addrLine>
            </address>
         </docAuthor>
         <titlePart type="affil">University of Maryland</titlePart>
         <docAuthor>
            <name reg="Gueguen, Gretchen">Gretchen Gueguen</name>
            <address>
               <addrLine>ggueguen@wam.umd.edu</addrLine>
            </address>
         </docAuthor>
         <titlePart type="affil">University of Maryland</titlePart>
         <docAuthor>
            <name reg="Kumar, Amit">Amit Kumar</name>
            <address>
               <addrLine>amitku@uiuc.edu</addrLine>
            </address>
         </docAuthor>
         <titlePart type="affil">University of Illinois at Urbana Champaign</titlePart>
         <docAuthor>
            <name reg="Saddlemyer, Ann">Ann Saddlemyer</name>
            <address>
               <addrLine>sadlemy@uvic.ca</addrLine>
            </address>
         </docAuthor>
         <titlePart type="affil">University of Victoria</titlePart>
      </front>
      <body>
         <div0>
            <p>Encoding editions of documentary texts, particularly editions of correspondence,
               within the <title level="m">Text Encoding Initiative (TEI) Guidelines</title> raises
               special challenges not encountered when editing previously published works. The
               challenges fall into three broad categories: 1) difficulties in capturing
               bibliographic meta-information describing the physical object and its transmission
               history; 2) challenges in developing a controlled vocabulary suitable to the informal
               nature of texts which were never intended for publication; and 3) difficulties in
               encoding both physical characteristics of the documentary texts, as well as their
               intellectual content, i.e. adopting a principle of encoding the text either as a
               physical artifact or as a conceptual work. These challenges, particularly as they
               relate to encoding letters, will be explored by through an edition currently being
               edited entitled <title level="m">Thomas MacGreevy and George Yeats: A Friendship in
                  Letters</title>.</p>
            <p>During the next two years members of <title level="m">The Thomas MacGreevy
               Archive</title> team will be creating for online publication an edition of the
               correspondence between George Yeats (1893-1968), wife of the Irish poet W.B. Yeats,
               and Thomas MacGreevy (1893-1967), Irish poet, art and literary critic, and Director
               of the National Gallery of Ireland (1950-63). It is a collection spanning 41 years,
               comprising 148 letters. The letters are fascinating documentary records which provide
               a window not only into the personal lives of the authors, but into the artistic and
               political circles in which they moved, providing a unique insight into the new Irish
               Free State and the cultural climate of Europe during the first half of the twentieth
               century. The letters are being encoded using Extensible Markup Language (XML)
               according to newly released <title level="m">P5 TEI Guidelines</title> to take
               advantage of the TEI’s new chapter on Manuscript Description.</p>
            <p>Although the <title level="m">TEI Guidelines</title> were not developed specifically
               to encode previously published texts, many of the rules built into the syntax of the
               Document Type Definitions (DTDs) favor this document type. To cite but one example,
               the content model of <hi rend="code">tei.divbot</hi> does not allow for a paragraph
                  <hi rend="code">&lt;p&gt;</hi> element after the closer element <hi rend="code">&lt;closer&gt;</hi>. While the need for additional paragraphs
               after closing material in published texts may be uncommon, letters frequently have a
               closing salutation, followed by a postscript. Moreover, it has proved difficult
               within the TEI header to detail the type of descriptive information that editors,
               scholars, and bibliographers require when engaging with handwritten documents.</p>
            <p>Individual projects (such as <title level="m">DALF: Digital Archive of Letters in
                  Flanders Project</title>) and subject- area consortiums (such as <title level="m">The Model Editions Partnership</title>) have developed their own extensions to
               the <title level="m">TEI Guidelines</title> to accommodate the needs of electronic
               editions of correspondence. After a brief survey of the strategies employed by these
               and other editions, we will discuss how TEI’s new chapter on manuscript description
               alleviates some of the problems previous projects solved with local solutions. The
               chapter on Manuscript Description builds on the work of two separate initiatives
               which have been recently combined: <title level="m">MASTER</title> project
               (1999-2001), an EU-funded project headed by Peter Robinson, and the work of the
                  <title level="m">TEI Medieval Manuscripts Description Work Group</title>
               (1998-2000), headed by Consuelo Dutschke and Ambrogio Piazzoni . The new elements
               available in this tagset provide for detailed description of primary texts including
               transmission, physical description, the relationship between parts of the manuscript
               (for example, when a poem is enclosed with a letter), dimensions, location,
               manuscript identification, provenance, and history of ownership.</p>
            <p> Another area to be discussed is the difficulties in developing an ontology or
               controlled vocabulary for a correspondence. The ontology, the backbone for the search
               page, is more difficult to develop for a collection of letters than other document
               types. Subject headings, such as the <title level="m">Library of Congress Subject
                  Headings</title> (<title level="m">LCSH</title>), which are used to describe
               entire collections or self-contained bodies of information, are not suitable for this
               project which describes each letter individually. The problem with using schemes such
               as <title level="m">LCSH</title> is twofold: one, the letters cover many subjects and
               follow no formal organization pattern, making it difficult to make a faceted indexing
               schema like <title level="m">LCSH</title> worthwhile; secondly, the subject headings
               were meant to be used in the cataloging of cohesive works or collections, and were
               not designed to be brief entries in the index for a specific work or collection.</p>
            <p>The indexing done for this edition more closely resembles back-of-the-book style
               indexing in terms of its description of the details of the text. Standard controlled
               vocabularies that might be used in this type of indexing, like the <title level="m">Getty Art and Architecture Thesaurus</title>, on the other hand, are too specific
               and terms do not sufficiently summarize or categorize the topics discussed.
               Capturing, representing, and, indeed, interpreting a multitude of topics present in
               any given letter — from general subjects to more intimate personal details
               — is of paramount importance. If ontology is defined as a <cit>
                  <q>formal, explicit specification of a shared conceptualization</q>
                  <bibl>Fensel 11</bibl>
               </cit>, the burden of interpreting by a third party what a <cit>
                  <q>shared conceptualization</q>
               </cit> of a text written for an intended audience of one is immense. Indeed, as the
               correspondence itself often indicates, meaning is often misconstrued by the intended
               recipient. Given these difficulties, other types of structured data, such as
               annotation and abstracts, may be used to mitigate issues of keywords conveying
               different meanings when taken out of textual context. </p>
            <p> Another challenge when editing documentary texts for electronic publication is
               choosing a philosophy by which to encode. This is particularly true in the case of
               editing modern correspondence. Editors have had to traditionally decide whether the
               purpose of the encoding is to capture the physical appearance of the page (regardless
               of the text's logical sequence), or whether it is to record the textual/ontological
               flow (regardless of the text's physical appearance). In traditional print
               publications, editions (except for facsimiles) reflect a logical sequencing of the
               text. For example, text which appears in the margins is placed where the editor feels
               it belongs logically, even when the writing crosses page boundaries (such as
               finishing a letter in the margins of the first page when the author ran out of room
               on the last). </p>
            <p>This edition is exploring methods of encoding both the physical appearance of the
               page, as well as the letter’s logic. This is particularly challenging when encoding,
               for example, marginalia. To represent the marginalia within the logical sequence of
               the text, the editor must decide where it is to be anchored within the textual flow.
               To represent it in a physical representation, the editor must provide coordinates
               that will anchor the text vertically and horizontally in relation to the main body of
               the work. While some of this positioning is absolute, for example, anchoring text at
               the top of the page, other positioning is relative, for example, anchoring marginalia
               relative to the paragraph it appears next to. While the encoding must take into
               account, in some measure, the technologies available to us today, XSLT, CSS, and
               JavaScript, for example, at the same time it must also be encoded with a view to
               future presentations, independent of current technologies. </p>
            <p>These are a sampling of issues that will be discussed.</p>
         </div0>
      </body>
      <back>
         <div type="Bibliography">
            <head>Bibliography</head>
            <listBibl>
               <biblStruct>
                  <analytic>
                     <author>
                        <name reg="Chestnutt, R. David">R. David Chestnutt</name>
                     </author>
                     <title level="a">The e Model Editions Partnership: 'Smart Text' and
                     Beyond</title>
                  </analytic>
                  <monogr>
                     <title level="j">DLib Magazine</title>
                     <imprint>
                        <date value="1997">July/August 1997</date>
                     </imprint>
                  </monogr>
                  <note>
                     <xptr to="http://www.dlib.org/dlib/july97/07chesnutt.html"/>
                  </note>
               </biblStruct>
               <biblStruct>
                  <monogr>
                     <title level="m" type="WWW document">
                        <name reg="DALF: Digital Archive of Letters in Flanders Project">DALF:
                           Digital Archive of Letters in Flanders Project</name>
                     </title>
                     <imprint>
                        <publisher>Centrum voor Teksteditie en Bronnenstudie (KANTL)</publisher>
                     </imprint>
                  </monogr>
                  <note>
                     <xptr crdate="2005-03-21" to="http://www.kantl.be/ctb/project/dalf/"/>
                  </note>
               </biblStruct>
               <biblStruct>
                  <analytic>
                     <author>
                        <name reg="DeRose, Steven J.">Steven J. DeRose</name>
                     </author>
                     <author>
                        <name reg="Durand, David G.">David G. Durand</name>
                     </author>
                     <author>
                        <name reg="Mylonas, Elli">Elli Mylonas</name>
                     </author>
                     <author>
                        <name reg="Renear, Allen H.">Allen H. Renear</name>
                     </author>
                     <title level="a">What is Text, Really?</title>
                  </analytic>
                  <monogr>
                     <title level="j">Journal of Computing in Higher Education</title>
                     <imprint>
                        <biblScope type="vol">2.1 (Winter)</biblScope>
                        <biblScope type="pages">3-26</biblScope>
                        <date value="1990">1990</date>
                     </imprint>
                  </monogr>
               </biblStruct>
               <biblStruct>
                  <analytic>
                     <author>
                        <name reg="Farrow, John">John Farrow</name>
                     </author>
                     <title level="a">All in the Mind: Concept Analysis in Indexing</title>
                  </analytic>
                  <monogr>
                     <title level="j">The Indexer</title>
                     <imprint>
                        <biblScope type="vol">19.4</biblScope>
                        <biblScope type="pages">243-247</biblScope>
                        <date value="1995">1995</date>
                     </imprint>
                  </monogr>
               </biblStruct>
               <biblStruct>
                  <monogr>
                     <author>
                        <name reg="Fensel, Dieter et al.">Dieter Fensel et al.</name>
                     </author>
                     <title level="m">Spinning the Semantic Web</title>
                     <imprint>
                        <publisher>MIT Press</publisher>
                        <pubPlace>Cambridge, Massachusetts</pubPlace>
                        <date value="2005">2005</date>
                     </imprint>
                  </monogr>
               </biblStruct>
               <biblStruct>
                  <analytic>
                     <author>
                        <name reg="Matthews, Douglas">Douglas Matthews</name>
                     </author>
                     <title level="a">Indexing Published Letters</title>
                  </analytic>
                  <monogr>
                     <title level="j">The Indexer</title>
                     <imprint>
                        <biblScope type="vol">22.3</biblScope>
                        <biblScope type="pages">135-141</biblScope>
                        <date value="2001">2001</date>
                     </imprint>
                  </monogr>
               </biblStruct>
               <biblStruct>
                  <analytic>
                     <author>
                        <name reg="Renear, Allen H.">Allen H. Renear</name>
                     </author>
                     <author>
                        <name reg="Mylonas, Elli">Elli Mylonas</name>
                     </author>
                     <author>
                        <name reg="Durand, David G.">David G. Durand</name>
                     </author>
                     <title level="a">Refining Our Notion of What Text Really Is: The Problem of
                        Overlapping Hierarchies</title>
                  </analytic>
                  <monogr>
                     <editor>
                        <name reg="Hockey, Susan">Susan Hockey</name>
                     </editor>
                     <editor>
                        <name reg="Idle, Nancy">Nancy Idle</name>
                     </editor>
                     <title level="m">Research in Humanities Computing 4: Selected Papers from the
                        ALLC/ACH Conference, Christ Church Oxford, April 1992</title>
                     <imprint>
                        <publisher>Oxford University Press</publisher>
                        <pubPlace>Oxford</pubPlace>
                        <biblScope type="pages">263-280</biblScope>
                        <date value="1996">1996</date>
                     </imprint>
                  </monogr>
               </biblStruct>
               <biblStruct>
                  <monogr>
                     <author>
                        <name reg="Schreibman, Susan">Susan Schreibman</name>
                     </author>
                     <title level="m" type="WWW document">The Thomas MacGreevy Archive</title>
                     <imprint/>
                  </monogr>
                  <note>
                     <xptr crdate="2005-03-21" to="http://macgreevy.org"/>
                  </note>
               </biblStruct>
               <biblStruct>
                  <monogr>
                     <title level="m" type="WWW document">
                        <name reg="TEI Guidelines P4">TEI Guidelines P4</name>
                     </title>
                     <imprint/>
                  </monogr>
                  <note>
                     <xptr crdate="2005-03-15" to="http://www.tei-c.org/Guidelines2/index.html"/>
                  </note>
               </biblStruct>
               <biblStruct>
                  <monogr>
                     <title level="m" type="WWW document">
                        <name reg="TEI Guidelines P5, Manuscript Description Chapter">TEI Guidelines
                           P5, Manuscript Description Chapter</name>
                     </title>
                     <imprint/>
                  </monogr>
                  <note>
                     <xptr crdate="2005-03-15" to="http://www.tei-c.org/Activities/MS/FASC-ms.pdf"/>
                  </note>
               </biblStruct>
            </listBibl>
         </div>
      </back>
   </text>
</TEI.2>