<?xml version="1.0" encoding="UTF-8"?>
<TEI.2 id="paper_141_cantara">
   <teiHeader>
      <fileDesc>
         <titleStmt>
            <title>The Tibet Oral History Archive Project and Digital Preservation</title>
            <author>
               <name reg="Cantara, Linda">Linda Cantara</name>
            </author>
            <respStmt>
               <resp>Marked up by </resp>
               <name reg="Holmes, Martin">Martin Holmes</name>
               <lb/>
               <name reg="Baer, Patricia">Patricia Baer</name>
            </respStmt>
         </titleStmt>
         <publicationStmt>
            <p>Marked up to be included in the ACH/ALLC 2005 Conference Abstracts book.</p>
         </publicationStmt>
         <sourceDesc>
            <p>None</p>
         </sourceDesc>
      </fileDesc>
      <profileDesc>
         <textClass>
            <classCode>paper</classCode>
            <keywords>
               <list>
                  <item>oral history</item>
                  <item>digital preservation</item>
                  <item>metadata</item>
               </list>
            </keywords>
         </textClass>
      </profileDesc>
      <revisionDesc>
         <list>
            <item>MDH: Created from John Bradley's XML <date value="2005-03">March 2005</date>
            </item>
            <item>PAB: Marked up <date value="2005-04-04">4 April 2005</date>
            </item>
            <item>MDH: Merged author's changes <date value="2005-04-28">28 April 2005</date>
            </item>
         </list>
      </revisionDesc>
   </teiHeader>
   <text>
      <front>
         <docTitle n="The Tibet Oral History Archive Project and Digital Preservation">
            <titlePart>The <title level="m">Tibet Oral History Archive Project</title> and Digital Preservation</titlePart>
         </docTitle>
         <docAuthor>
            <name reg="Cantara, Linda">Linda Cantara</name>
            <address>
               <addrLine>linda.cantara@case.edu</addrLine>
            </address>
         </docAuthor>
         <titlePart type="affil">Case Western Reserve University</titlePart>
      </front>
      <body>
         <div0>
            <p>The <title level="m">Tibet Oral History Archive Project</title>
               <note n="1">This project is sponsored by the <title>Henry Luce Foundation</title> with additional support from the <title>National Endowment for the Humanities</title> (grant no. RZ-20585-00) and the <title>National Geographic Society</title>.</note> (<title level="m">TOHAP</title>) is part of the research and education program of the <title>Center for Research on Tibet</title> in the Department of Anthropology at Case Western Reserve University.<note n="2">The <title>Center for Research on Tibet</title> Web Site is <xptr to="http://www.case.edu/affil/tibet/index.htm"/>.</note> The Center was created in 1987 by Melvyn Goldstein, John Reynold Harkness Professor of Anthropology, and Cynthia Beall, Sarah Idell Pyle Professor of Anthropology, to generate and disseminate new knowledge about Tibetan culture, society, and history, and was the academic pioneer in opening Tibet to in-depth anthropological and historical research. The <title level="m">TOHAP</title> builds on a series of fieldwork-based studies that have examined the adaptation of Tibetans to high altitude, and the changes that have occurred since Tibet's incorporation into the People’s Republic of China in 1951.</p>
            <p>The <title level="m">Tibet Oral History Archive</title> includes three primary collections:
<list type="unordered">
                  <item>
                     <title level="m">The Common Folk Oral History Collection</title>: nearly 2,000 hours of interviews with hundreds of ordinary rural and urban Tibetans about their life experiences. Since the number of individuals in Tibet who were adults in 1959 -- the end of the traditional era -- is rapidly dwindling, there is particular urgency to document the voices of ordinary Tibetans in order understand the diversity of life as it was lived in Tibet as well as the way the salient historical events played out among the different strata of society.</item>
                  <item>
                     <title level="m">The Political History Collection</title>: approximately 400 hours of historical interviews with former Tibetan government officials who played important roles in modern Tibetan history, including His Holiness the Dalai Lama. These interviews cover the traditional period before Tibet was incorporated into the People's Republic of China (1913-1951) and the subsequent period up to the end of the Cultural Revolution in 1976.</item>
                  <item>
                     <title level="m">The Drepung Monastery Collection</title>: approximately 350 hours of interviews with about one hundred monks who were members of Drupung Monastery, Tibet's largest monastery, at the end of the traditional era. These interviews are unique in that they provide the only in-depth window into large-scale monasticism in traditional Tibetan society.</item>
               </list>
            </p>
            <p>Conducted primarily in the Tibetan language, the interviews were taped on audio cassettes which have subsequently been digitized in three formats: archival WAVE files, medium format QuickTime files, and compressed delivery MP3 (MPEG) files. The interviews have been transcribed and translated into English and were initially saved as Microsoft Word documents. Professor Goldstein, Editor of the Archive, has partnered with Kelvin Smith Library to prepare the audio files and transcripts for online dissemination and long-term preservation.  For online dissemination via the World Wide Web, we are converting the Word documents to plain text and encoding them in XML using the <title>Text Encoding Initiative (TEI) Document Type Definition (DTD) for Transcriptions of Speech</title>.<note n="3">Chapter 11 of the <title level="m">TEI Guidelines</title> (P4); see <xptr to="http://www.tei-c.org/P4X/TS.html"/>.</note> To facilitate understanding, the Archive will also include a glossary of terms, encoded in XML using the <title>TEI-DTD for Printed Dictionaries</title>.<note n="4">Chapter 12 of the <title level="m">TEI Guidelines</title> (P4); see <xptr to="http://www.tei-c.org/P4X/DI.html"/>.</note> A programmer has been hired to create a Web-based tool for creating the glossary and an application for automatically encoding extended pointer notation to link terms in the transcripts to their definitions in the glossary. Work is also underway to design an end user interface which will include browse and search functions. In the meantime, we are temporarily transforming the XML files to XHTML and using the <title level="m">Greenstone Digital Library Software</title> to facilitate local access.<note n="5">
                  <title level="m">Greenstone</title> is open source software for building and distributing digital library collections, produced by the <title level="m">New Zealand Digital Library Project</title> at the University of Waikato, and developed and distributed in cooperation with <title level="m">UNESCO</title> and the <title level="m">Human Info NGO</title>. See <xptr to="http://www.greenstone.org"/>.</note>
            </p>
            <p>A larger concern, however, is how to ensure long-term preservation of and access to the Archive. In 1996, the <title>Commission on Preservation and Access (CPA) and Research Library Group (RLG) Task Force on Archiving of Digital Information</title> published a seminal report on the long-term preservation of digital resources.<note n="6">
                  <title>Commission on Preservation and Access (CPA) and Research Library Group (RLG)</title>. <title level="m">Preserving Digital Information: Report of the Task Force on Archiving of Digital Information</title>. May 1996. Online at <xptr to="http://www.rlg.org/legacy/ftpd/pub/archtf/final-report.pdf"/>.</note> Since then, virtually every significant publication about digital preservation has indicated that primary responsibility for initiation and management of the metadata necessary to ensure long-term access to digital resources begins with the creator of the resource. Traditionally, it has been the role of librarians and archivists to ensure long-term viability of and access to cultural heritage materials, but this is not within the realm of expertise of the majority of scholars in the humanities and social sciences. Thus, if the creators of digital resources are responsible for initiating lifecycle documentation of the descriptive, administrative, and structural metadata necessary to migrate, emulate, or otherwise translate existing resources to future hardware and software configurations -- a task foreign to most discipline-based scholars -- close collaboration with information technology professionals early in a project is imperative.
   </p>
            <p>Protocols and standards for digital preservation are now under vigorous development, yet there are still many unknowns. For the short-term, multiple copies of the audio and XML files will be maintained in multiple locations at Case Western Reserve University, both at the <title level="m">Center for Research on Tibet</title> as well as in <title level="m">Digital Case</title>, Kelvin Smith Library's <title level="m">Fedora</title> repository.<note n="7">
                  <title level="m">Fedora™ Flexible and Extensible Digital Object Repository Architecture</title> -- is an open source digital repository management system, developed by Cornell University and the University of Virginia, available at <xptr to="http://www.fedora.info"/>.</note> For the long-term, the Asian Division of the Library of Congress has expressed interest in hosting the completed Archive. To prepare the <title level="m">Tibet Oral History Archive</title> for deposit with the Library of Congress, we are creating a <title>Submission Information Package</title> (<title>SIP</title>) in compliance with the <title level="m">Reference Model for an Open Archival Information System</title> (<title level="m">OAIS</title>),<note n="8">A <title>SIP</title> is <cit>
                     <q>an information package that is delivered by the producer [of a digital object] to the <title level="m">OAIS</title> for use in the construction of one or more <title>AIP</title>s [<title>Archival Information Packages</title>].</q>
                  </cit> See <title level="a">OAIS Terms</title>. <title level="m">Digital Preservation Management: Implementing Short-term Strategies for Long-term Problems</title>. Cornell University Library. 2003. Online at <xptr to="http://www.library.cornell.edu/iris/dpworkshop/working/terminology/oais.html"/>.  See also, <title level="m">Consultative Committee for Space Data Systems (CCSDS). Reference Model for an Open Archival Information System OAIS</title>). CCSDS 650.0-B-1. ISO 14721:2003. January 2002. Online at <xptr to="http://ssdoo.gsfc.nasa.gov/nost/wwwclassic/documents/pdf/CCSDS-650.0-B-1.pdf"/>.</note> using the Metadata Encoding and Transmission Standard (<title level="m">METS</title>), a metadata standard for encoding descriptive, administrative, and structural metadata regarding objects within a digital library.<note n="9">
                  <title level="m">METS</title> is maintained in the Network Development and MARC Standards Office of the Library of Congress, and is being developed as an initiative of the <title level="m">Digital Library Federation</title>. See <xptr to="http://www.loc.gov/standards/mets"/>.</note> This paper will present a prototype for scholar-librarian collaboration in the digital preservation of multimedia resources, including a discussion of the practical aspects of constructing a <title level="m">METS</title> document for the <title level="m">Tibet Oral History Archive</title>, with particular attention to the multiple metadata standards that must be bundled with the digital files to create a robust Submission Information Package.</p>
         </div0>
      </body>
      <back>
         <div type="Bibliography">
            <head>Bibliography</head>
            <listBibl>
               <biblStruct>
                  <monogr>
                     <title level="m" type="WWW document">
                        <name reg="Center for Research on Tibet">The Center for Research on Tibet's Web Site</name>
                     </title>
                     <imprint/>
                  </monogr>
                  <note>
                     <xptr crdate="2005-03-29" to="http://www.case.edu/affil/tibet/index.htm"/>
                  </note>
               </biblStruct>
               <biblStruct>
                  <analytic>
                     <title level="a" type="WWW document">
                        <name reg="Chapter 11: Transcriptions of Speech">Chapter 11: Transcriptions of Speech</name>
                     </title>
                  </analytic>
                  <monogr>
                     <title level="m">TEI Guidelines (P4)</title>
                     <imprint>
                        <publisher>Text-Encoding Initiative</publisher>
                     </imprint>
                  </monogr>
                  <note>
                     <xptr crdate="2005-03-29" to="http://www.tei-c.org/P4X/TS.html"/>
                  </note>
               </biblStruct>
               <biblStruct>
                  <analytic>
                     <title level="a" type="WWW document">
                        <name reg="Chapter 12: Print Dictionaries">Chapter 12: Print Dictionaries</name>
                     </title>
                  </analytic>
                  <monogr>
                     <title level="m">TEI Guidelines (P4)</title>
                     <imprint>
                        <publisher>Text-Encoding Initiative</publisher>
                     </imprint>
                  </monogr>
                  <note>
                     <xptr crdate="2005-03-29" to="http://www.tei-c.org/P4X/DI.html"/>
                  </note>
               </biblStruct>
               <biblStruct>
                  <monogr>
                     <title level="m" type="WWW document">
                        <name reg="Greenstone">Greenstone</name>
                     </title>
                     <imprint>
                        <publisher>University of Waikato</publisher>
                     </imprint>
                  </monogr>
                  <note>
                     <xptr crdate="2004-07-16" to="http://www.greenstone.org"/>
                  </note>
               </biblStruct>
               <biblStruct>
                  <monogr>
                     <author>
                        <name reg="Waters, Donald">Donald Waters</name>
                     </author>
                     <author>
                        <name reg="Garrett, John">John Garrett</name>
                     </author>
                     <title level="m" type="WWW document">
                        <name reg="Preserving Digital Information: Report of the Task Force on Archiving of Digital Information">Preserving Digital Information: Report of the Task Force on Archiving of Digital Information</name>
                     </title>
                     <imprint/>
                  </monogr>
                  <note>
                     <xptr crdate="2005-03-29"
                           to="http://www.rlg.org/legacy/ftpd/pub/archtf/final-report.pdf"/>
                  </note>
               </biblStruct>
               <biblStruct>
                  <monogr>
                     <title level="m" type="WWW document">
                        <name reg="Fedora">Fedora</name>
                     </title>
                     <imprint>
                        <publisher>Cornell University and the University of Virginia</publisher>
                     </imprint>
                  </monogr>
                  <note>
                     <xptr crdate="2005-03-29" to="http://www.fedora.info"/>
                  </note>
               </biblStruct>
               <biblStruct>
                  <analytic>
                     <title level="a" type="WWW document">
                        <name reg="OAIS Terms">OAIS Terms</name>
                     </title>
                  </analytic>
                  <monogr>
                     <title level="m">Digital Preservation Management: Implementing Short-term Strategies for Long-term Problems</title>
                     <imprint>
                        <publisher>Cornell University Library</publisher>
                     </imprint>
                  </monogr>
                  <note>
                     <xptr crdate="2005-03-29"
                           to="http://www.library.cornell.edu/iris/dpworkshop/working/terminology/oais.html"/>
                  </note>
               </biblStruct>
               <biblStruct>
                  <monogr>
                     <title level="m" type="WWW document">
                        <name reg="Reference Model for an Open Archival Information System">Reference Model for an Open Archival Information System (OAIS)</name>
                     </title>
                     <imprint>
                        <publisher>CCSDS Secretariat</publisher>
                     </imprint>
                  </monogr>
                  <note>
                     <xptr crdate="2002-01"
                           to="http://ssdoo.gsfc.nasa.gov/nost/wwwclassic/documents/pdf/CCSDS-650.0-B-1.pdf"/>
                  </note>
               </biblStruct>
               <biblStruct>
                  <monogr>
                     <title level="m" type="WWW document">
                        <name reg="METS">METS</name>
                     </title>
                     <imprint>
                        <publisher>Digital Library Federation</publisher>
                     </imprint>
                  </monogr>
                  <note>
                     <xptr crdate="2005-01-25" to="http://www.loc.gov/standards/mets"/>
                  </note>
               </biblStruct>
            </listBibl>
         </div>
      </back>
   </text>
</TEI.2>