<?xml version="1.0" encoding="UTF-8"?>
<TEI.2 id="session_161_fegan">
   <teiHeader>
      <fileDesc>
         <titleStmt>
            <title>User Generated Metadata: Creating Personalized Web Experiences</title>
            <author>
               <name reg="Fegan, Michael">Michael Fegan</name>
            </author>
            <author>
               <name reg="Hart-Davidson, Bill">Bill Hart-Davidson</name>
            </author>
            <author>
               <name reg="Palmer, Joy">Joy Palmer</name>
            </author>
            <author>
               <name reg="Rehberger, Dean">Dean Rehberger</name>
            </author>
            <respStmt>
               <resp>Marked up by </resp>
               <name reg="Holmes, Martin">Martin Holmes</name>
               <lb/>
               <name reg="Baer, Patricia">Patricia Baer</name>
            </respStmt>
         </titleStmt>
         <publicationStmt>
            <p>Marked up to be included in the ACH/ALLC 2005 Conference Abstracts book.</p>
         </publicationStmt>
         <sourceDesc>
            <p>None</p>
         </sourceDesc>
      </fileDesc>
      <profileDesc>
         <textClass>
            <classCode>3-paper session</classCode>
            <keywords>
               <list>
                  <item>user-performance data</item>
                  <item>secondary repositories</item>
                  <item>digital libraries</item>
               </list>
            </keywords>
         </textClass>
      </profileDesc>
      <revisionDesc>
         <list>
            <item>MDH: Created from John Bradley's XML <date value="2005-04">April 2005</date>
            </item>
            <item>MDH: Wrote to authors to ask about missing bibliography <date value="2005-04-06">6 April 2005</date>
            </item>
            <item>MDH: RS proofed and signed off without changes <date value="2005-05-18">18 May 2005</date>.</item>
         </list>
      </revisionDesc>
   </teiHeader>
   <text>
      <front>
         <docTitle n="User Generated Metadata: Creating Personalized Web Experiences">
            <titlePart>User Generated Metadata: Creating Personalized Web Experiences</titlePart>
         </docTitle>
         <docAuthor>
            <name reg="Fegan, Michael">Michael Fegan</name>
            <address>
               <addrLine>mfegan@msu.edu</addrLine>
            </address>
         </docAuthor>
         <titlePart type="affil">Matrix, Michigan State University</titlePart>
         <docAuthor>
            <name reg="Hart-Davidson, Bill">Bill Hart-Davidson</name>
            <address>
               <addrLine>hartdav2@msu.edu</addrLine>
            </address>
         </docAuthor>
         <titlePart type="affil">WIDE Center, Michigan State University</titlePart>
         <docAuthor>
            <name reg="Palmer, Joy">Joy Palmer</name>
            <address>
               <addrLine>palmerjo@msu.edu</addrLine>
            </address>
         </docAuthor>
         <titlePart type="affil">Matrix, Michigan State University</titlePart>
         <docAuthor>
            <name reg="Rehberger, Dean">Dean Rehberger</name>
            <address>
               <addrLine>rehberge@msu.edu</addrLine>
            </address>
         </docAuthor>
         <titlePart type="affil">Matrix, Michigan State University</titlePart>
      </front>
      <body>
         <div0>
            <head>Abstract</head>
            <p> This session will focus on the importance of measuring how communities of users interact with digital objects. By drawing on user-performance data and metadata generated for secondary repositories, we explore ways to enhance use and access of documents and digital libraries..</p>
            <p>Speaker 1 proposes an approach to representing the structure of a document based on the way readers or users interact with it in the context of a deliberative task. This approach contrasts with other ways to model the structure of documents including approaches which map authorial intention and those which rely upon a well-known information model or genre. This presentation will highlight the benefits of understanding the structure of documents based on the rhetorical reading/using strategies of those who interact with them. These structures can be rendered as <soCalled>paths</soCalled> through a given set of information resources, offering insight into the way that objects and relationships that make up a document can mediate, or complicate, activity. Speaker 1 will conclude by showing some examples which make use of user-performance data to create task-appropriate views of complex (multiscale) documents. </p>
            <p>Speakers 2 and 3 will examine the role of secondary repositories can play in enhancing access and interaction for students and scholars in the humanities. The most entrenched <hi rend="foreign">a priori</hi> models for information structuring and delivery online are derived from library and archival cataloguing practices. In line with digital library best practices, digitized sources are typically cataloged to describe their bibliographic information, along with technical, administrative, and rights metadata. While these practices are essential for preserving the digital object and making it available to users, unfortunately they do so in a language and guise often difficult to understand within the context of use. In addition, materials in digital libraries do not literally <soCalled>speak</soCalled> for themselves and impart wisdom; they require interpretation and analysis within a context of use. Access and use of digital objects can no longer be thought of in terms of stand alone files or individual digital objects, but rather must directly impact the ways in which users reuse, repurpose, combine and build complex digital objects. This assumption relies on a more complex meaning for the term <term>access</term> that will be detailed and explained in this paper. </p>
            <p>Following the examples in the first paper, speaker 4 will demonstrate an application that can be used to collect user generated metadata. Following the concepts developed in the second paper, speaker 4 will develop the argument in practice that one way we can enhance access to online digital objects  is to facilitate the creation of secondary repositories. These repositories will provide discipline/community specific metadata and applications and will allow users to find, use, manipulate and analyze digital objects more easily. To this end, Speaker 4 has developed <title level="m">Media Matrix</title>  1.0 — an online, server-side suite of tools that allows users to locate specific media and streaming media files found in digital repositories and segment, annotate and organize this media online. This application provides users with an environment both to work with and personalize digital media, and also to share and discuss their findings with a community of users. This paper will explore if the creation of secondary repositories of usage statistics and user-generated materials/metadata (to supplement both traditional cataloging records and discipline-specific online indexes) can help scholars and students in the humanities gain better access to online materials. </p>
         </div0>
         <div0 type="EmbeddedDoc">
            <div1>
               <head>Modeling Documents Based on User Performance: An Alternative to Author Intention and a priori Information Model Approaches</head>
               <p rend="Presenter">Bill Hart-Davidson, Ph.D.</p>
               <p>This paper proposes an approach to representing the structure of a document based on the way readers or users interact with it in the context of a deliberative task. This approach contrasts with other ways to model the structure of documents including approaches which map authorial intention and those which rely upon a well-known information model or genre. This presentation will highlight the benefits of understanding the structure of documents based on the rhetorical reading/using strategies of those who interact with them. These structures can be rendered as <soCalled>paths</soCalled> through a given set of information resources, offering insight into the way that objects and relationships that make up a document can mediate, or complicate, activity. The paper will conclude by showing some examples which make use of user-performance data to create task-appropriate views of complex (multiscale) documents. </p>
               <p>There is a great deal of interest today in the idea of building models of texts. One reason is that, with the growth of the Web as a way to reach a wide and diverse audience, publishers of information of many types are now interested in building information structures that support multiple-audience adaptation. Another reason is to maximize the value of content by delivering information that is tailored to a particular task. </p>
               <p>For example, imagine the day-to-day work of a claims processing agent for a large insurance company. The agent is responsible for making decisions based on information in documents of various types - claim forms, telephone call records, police and adjustor reports, medical records, even photographs - all stored in electronic policyholder files. There may be discernable patterns in these types of workflows which can be documented and used as the basis for information models that structure information at or below the level of an individual document. The model could allow information contained in all the documents associated with the policyholder file to transform to suit the decision-making needs of the specific users who interact with it. </p>
               <p>Creating a system like the one described above would require analyzing and modeling fundamental patterns of document use, defining a basic modeling language for document-mediated interaction that can capture recurrent patterns of user performance. As Moser &amp; Moore (1996) point out, most semantic modeling approaches construct formal text structures based on either author intention or, alternatively, on an information structure presumed to be instantiated in the text. Neither of these approaches is entirely appropriate for creating effective displays of information for potential users. Creating such displays requires a model of the text-mediated interaction between writers and readers which can then be used to define display conditions for a range of information "views" that a given document might support. </p>
               <p>Performance-based text structure models differ from other types in that they are not primarily representations of a stable <soCalled>core</soCalled> semantic structure that is assumed to be either domain independent (e.g, Mann and Thompson, 1987), or genre specific, as suggested by the work of Bazerman(1988) and others. Nor are these structures maps of author intentionality and/or struggles in creating intentional relationships similar to analyses by Van Wijk and Sanders (1999). Rather, the models emphasize structures that constitute the resources authors and the specific users or readers of a document share in order to come to some kind of agreement about an issue or question that all parties have a stake in. What gains status as a <soCalled>unit</soCalled> or <soCalled>object</soCalled> in user-performance based models depends upon the deliberative activity that the document is meant to support. Relationships among objects are similarly defined by how the objects mediate a given decision. In this way, we can expect the model to account for both the regularities in text structures which correspond with similar texts doing similar mediational work, as well as quite specific and arbitrary text structures associated with any given deliberative activity as it unfolds in a social context. This modeling approach comes very close to a process described by Phelps (1985) who articulated an approach to structural analysis drawing on and responding to work by Faigley &amp; Witte (1981) and Van de Kopple (1985) in composition studies, as well as Halliday &amp; Hassan (1976) and Van Dijk (1976) in linguistics. The process, broadly, understands texts as objects with histories, requiring us to study them<soCalled> in process</soCalled> if we are to understand how they shape the experiences of a reader. </p>
            </div1>
            <div1>
               <head>Bibliography</head>
               <listBibl>
                  <biblStruct>
                     <analytic>
                        <author>
                           <name reg="Moser, M.">M. Moser</name>
                        </author>
                        <author>
                           <name reg="Moore, J.D.">J.D. Moore</name>
                        </author>
                        <title level="a">Toward a synthesis of two accounts of discourse structure</title>
                     </analytic>
                     <monogr>
                        <title level="j">Computational Linguistics</title>
                        <imprint>
                           <biblScope type="vol">22.3</biblScope>
                           <biblScope type="pages">409-419</biblScope>
                           <date value="1996">1996</date>
                        </imprint>
                     </monogr>
                  </biblStruct>
                  <biblStruct>
                     <analytic>
                        <author>
                           <name reg="Mann, W.C.">W.C. Mannr</name>
                        </author>
                        <author>
                           <name reg="Thompson, S.A.">S.A. Thompson</name>
                        </author>
                        <title level="a">Rhetorical structure theory: Toward a functional theory of text organization.</title>
                     </analytic>
                     <monogr>
                        <title level="j">Text</title>
                        <imprint>
                           <biblScope type="vol">8.3</biblScope>
                           <biblScope type="pages">243-281</biblScope>
                           <date value="1988">1988</date>
                        </imprint>
                     </monogr>
                  </biblStruct>
                  <biblStruct>
                     <monogr>
                        <author>
                           <name reg="Bazerman, C.">C. Bazerman</name>
                        </author>
                        <title level="m">Shaping written knowledge</title>
                        <imprint>
                           <publisher>University of Wisxonsin Press</publisher>
                           <pubPlace>Madison</pubPlace>
                           <date value="1988">1988</date>
                        </imprint>
                     </monogr>
                  </biblStruct>
                  <biblStruct>
                     <analytic>
                        <author>
                           <name reg="Van Wijk, C.">C. Van Wijk</name>
                        </author>
                        <author>
                           <name reg="Sanders, T.">T. Sanders</name>
                        </author>
                        <title level="a">Identify writing strategies through text analysis</title>
                     </analytic>
                     <monogr>
                        <title level="j">Written Communication</title>
                        <imprint>
                           <biblScope type="vol">16.1</biblScope>
                           <biblScope type="pages">51-75</biblScope>
                           <date value="1999">1999</date>
                        </imprint>
                     </monogr>
                  </biblStruct>
                  <biblStruct>
                     <analytic>
                        <author>
                           <name reg="Phelps, L.W.">L.W. Phelps</name>
                        </author>
                        <title level="a">Dialectics of coherence: Toward an integrative theory</title>
                     </analytic>
                     <monogr>
                        <title level="j">College English</title>
                        <imprint>
                           <biblScope type="vol">47.1</biblScope>
                           <biblScope type="pages">12-31</biblScope>
                           <date value="1985">1985</date>
                        </imprint>
                     </monogr>
                  </biblStruct>
                  <biblStruct>
                     <analytic>
                        <author>
                           <name reg="Faigley, L.">L. Faigley</name>
                        </author>
                        <author>
                           <name reg="Witte, L.S.">L.S. Witte</name>
                        </author>
                        <title level="a">Coherence, cohesion, and writing quality</title>
                     </analytic>
                     <monogr>
                        <title level="j">College Composition and Communication</title>
                        <imprint>
                           <biblScope type="vol">32</biblScope>
                           <biblScope type="pages">189-204</biblScope>
                           <date value="1981">1981</date>
                        </imprint>
                     </monogr>
                  </biblStruct>
                  <biblStruct>
                     <analytic>
                        <author>
                           <name reg="Vande Kopple, W.">W. Vande Kopple</name>
                        </author>
                        <title level="a">Given and new information and some aspects of the structures, semantics, and pragmatics of written texts</title>
                     </analytic>
                     <monogr>
                        <editor>
                           <name reg="Cooper, C.R.">C.R. Cooper</name>
                        </editor>
                        <editor>
                           <name reg="Greenbaum, S.">S. Greenbaum</name>
                        </editor>
                        <title level="m">Studying writing: Linguistic perspectives</title>
                        <imprint>
                           <publisher>Sage</publisher>
                           <pubPlace>Beverly Hills, CA</pubPlace>
                           <date value="1986">1986</date>
                           <biblScope type="pages">72-111</biblScope>
                        </imprint>
                     </monogr>
                  </biblStruct>
                  <biblStruct>
                     <monogr>
                        <author>
                           <name reg="Halliday, M.A.K.">M.A.K. Halliday</name>
                        </author>
                        <author>
                           <name reg="Hassan, J.R.">J.R. Hassan</name>
                        </author>
                        <title level="m">Cohesion in English</title>
                        <imprint>
                           <publisher>Longman</publisher>
                           <pubPlace>London</pubPlace>
                           <date value="1976">1976</date>
                        </imprint>
                     </monogr>
                  </biblStruct>
                  <biblStruct>
                     <monogr>
                        <author>
                           <name reg="Van Dijk, T.">T. Van Dijk</name>
                        </author>
                        <title level="m">Text and context: Explorations in semantics and pragmatics of discourse</title>
                        <imprint>
                           <publisher>Longman</publisher>
                           <pubPlace>London</pubPlace>
                           <date value="1976">1976</date>
                        </imprint>
                     </monogr>
                  </biblStruct>
               </listBibl>
            </div1>
         </div0>
         <div0 type="EmbeddedDoc">
            <div1>
               <head>Enhancing Access to Online Digital Objects through Reciprocity between Primary and Secondary Repositories </head>
               <p rend="Presenter">Dean Rehberger and Joy Palmer</p>
               <p>This paper will examine the role of secondary repositories can play in enhancing access and interaction for students and scholars in the humanities. While access to online resources has steadily improved in the last decade, online archives and digital libraries still remain difficult to use, particularly for students and novice users (Arms). In some cases, a good deal of resources have been put into massive digitization initiatives that have opened rich archives of sources to a wide range of users. Yet, the traditional cataloging and dissemination practices of libraries and archives make it difficult for these users to locate and use effectively these sources, especially within scholarly and educational contexts of the humanities. Many digital libraries around the country, large and small, have made admirable efforts toward creating user portals and galleries to enhance the usability of their holdings, but these results are often expensive and labor intensive, often speaking only directly to a small segment of users. </p>
               <p>To address these problems, we begin with the assumption that access and preservation are mutually dependent concepts. Preservation and access can no longer be thought of in terms of stand alone files or individual digital objects, but rather must directly impact the ways in which users reuse, repurpose, combine and build complex digital objects. This assumption relies on a more complex meaning for the term <term>access</term>. Many scholars in the field have called for a definition of access that goes beyond search interfaces to the ability of users to retrieve information <cit>
                     <q>in some form in which it can be read, viewed, or otherwise employed constructively</q>
                     <bibl>(Borgman 57)</bibl>
                  </cit>. Access thus implies four related conditions that go beyond the ability to link to a network: </p>
               <list type="ordered">
                  <item>equity ߞ the ability of <soCalled>every citizen</soCalled> and not simply technical specialists to use the resources;</item>
                  <item>usability ߞ the ability of users to easily locate, retrieve, use, and navigate resources;</item>
                  <item>context ߞ the conveyance of meaning from stored information to users, so that it makes sense to them;</item>
                  <item>interactivity ߞ the capacity for users to be both consumers and producers of information.</item>
               </list>
               <p>The keys to enhancing access for specific user groups, contexts, and disciplines are to build secondary repositories with resources and tools that allow users to enhance and augment materials (Shabajee), share their work with a community of users (Waller), and easily manipulate the media with simple and intuitive tools (or at least build interfaces that match existing, well-known applications). Users will also need portal spaces that escape the genre of links indexes and become flexible work environments that allow users to become interactive producers (Miller). </p>
               <p>Herbert Van de Sompel has proposed a successful system (OpenURL/SFX framework for context sensitive reference linking) for disaggregating reference linking services from e-publishing. In his framework, the service of providing links between references and across e-publisher's digital repositories is separated from the services provided by the e-publishers. In so doing, the service provides <cit>
                     <q>seamless interconnectivity between ever-increasing collections of heterogeneous resources</q>
                  </cit>, freeing primary repositories from the difficult and expensive task of ensuring links to references while giving users greater access to resources and increasing the value of the digital object (Van de Sompel). Similarly, we propose the concept of secondary repositories that would be responsible for handling secondary metadata, extended materials and resources, interactive tools and application services. This information is cataloged, stored, and maintained in a repository outside of the primary repository that holds the digital object. The comments and observations generated by users in this context are usually highly specialized because such metadata is created from discipline-specific, scholarly perspectives (as an historian, social scientist, teacher, student, enthusiast, etc.) and for a specific purpose (research, publishing, teaching, etc.). Even though the information generated by a secondary repository directly relates to digital objects in primary repositories, secondary repositories remain distinctly separate from the traditional repository. The information gathered in secondary repositories would rarely be used in the primary cataloging and maintenance of the object, and primary repositories would continue to be responsible for preservation, management, and long-term access but would be freed from creating time-consuming and expensive materials, resources, services, and extended metadata for particular user groups. </p>
               <p>In line with digital library best practices, digitized sources are typically cataloged to describe their bibliographic information, along with technical, administrative, and rights metadata. While these practices are essential for preserving the digital object and making it available to users, unfortunately they do so in a language and guise often difficult to understand within the context of use (Lynch 2003). Even though the author's name, the title of the work, and keywords are essential for describing and locating a digital object, this kind of information is not always the most utilized information for ascertaining the relevance of a digital object. For instance, K-I2 teachers often do not have specific authors or titles in mind when searching for materials for their classes. Teachers more frequently search in terms of grade level, the state and national standards that form the basis of their teaching, or broad overarching topics derived from the required content and benchmark standards (e.g., core democratic values or textbook topics) that tend to display too many search returns to make the information of value. </p>
               <p>While cursory studies have indicated these access issues, still very little is known about archival use or how these users express their information needs (Duff, Duff &amp; Johnnson). For digital libraries to begin to fulfill their potential, much research is needed to better understand the processes by which primary repositories are accessed and how information needs are expressed. For example, research needs to address the ways in which teachers integrate content into their pedagogy so that bridges can be built from digital repositories to the educational process, bridges that greatly facilitate the ability of teachers and students to access specific information within the pedagogical process. Recent research strongly suggests that students need conceptual knowledge of information spaces that allow them to create mental models to do strategic and successful searches. As with any primary source, the materials in digital libraries do not literally <soCalled>speak</soCalled> for themselves and impart wisdom; they require interpretation and analysis lysis (Bowker &amp; Star; Duff; Duff &amp; Johnson). Allowing communities of users to enhance metadata and actively use, reuse, repurpose, combine and build complex digital objects can help users to contextualize the information they find, draw from deeper resources within the digital library, and find more meaningful relationships between digital objects and their needs. Thinking in terms of a distributed model (similar to the open source software community) that allows users both easier access to materials and a greater range of search criteria and also provides opportunity for active engagement in the generation of metadata and complex digital objects, promises to help us rethink our most basic assumptions about user access and long-term preservation. </p>
               <p>Collections can also benefit by defining communities of users. For example, with the recent release of secret White House tapes (<xptr to="http://millercenter.virginia.edu/"/>), the sheer number of tapes and hours make it impossible for adequate cataloging of content as well as the difficulty of determining the context and people involved (or even what is said given the poor quality of many tapes). Those historians and scholars (a more regulated and highly defined set of experts) allowed access to the collections could supply information about content and context as well as set terms for debates over more questionable areas of interpretation (e.g., when sound quality makes passages inaudible). While metadata gathered in these ways would need to be qualified (maintained in a secondary repository) because of lack of quality control, the processes could make large quantities of data that is key to many disciplines in the humanities more available and usable. </p>
            </div1>
            <div1>
               <head>Bibliography</head>
               <listBibl>
                  <biblStruct>
                     <monogr>
                        <author>
                           <name reg="Arms, William">William Arms</name>
                        </author>
                        <title level="m">Digital Libraries</title>
                        <imprint>
                           <publisher>MIT Press</publisher>
                           <pubPlace>Cambridge, MA</pubPlace>
                           <date value="2001">2001</date>
                        </imprint>
                     </monogr>
                  </biblStruct>
                  <biblStruct>
                     <monogr>
                        <author>
                           <name reg="Borgman, C.">C. Borgman</name>
                        </author>
                        <title level="m">From Gutenbeg to the Global Information Infrastructure: Access to Information in the Networked World</title>
                        <imprint>
                           <publisher>MIT Press</publisher>
                           <pubPlace>Cambridge, MA</pubPlace>
                           <date value="2000">2000</date>
                        </imprint>
                     </monogr>
                  </biblStruct>
                  <biblStruct>
                     <monogr>
                        <author>
                           <name reg="Bowker, Geoffrey C.">Geoffrey C. Bowker</name>
                        </author>
                        <author>
                           <name reg="Star, Susan Leigh">Susan Leigh Star</name>
                        </author>
                        <title level="m">From Gutenbeg to the Global Information Infrastructure: Access to Information in the Networked World</title>
                        <imprint>
                           <publisher>MIT Press</publisher>
                           <pubPlace>Cambridge, MA</pubPlace>
                           <date value="1999">1999</date>
                        </imprint>
                     </monogr>
                  </biblStruct>
                  <biblStruct>
                     <analytic>
                        <author>
                           <name reg="Cole, Charles">Charles Cole</name>
                        </author>
                        <title level="a">Name Collection by Ph.D History Students: Inducing Expertise</title>
                     </analytic>
                     <monogr>
                        <title level="j">Journal of the American Society for Information Science</title>
                        <imprint>
                           <biblScope type="vol">51.5</biblScope>
                           <biblScope type="pages">444-455</biblScope>
                           <date value="2000">2000</date>
                        </imprint>
                     </monogr>
                  </biblStruct>
                  <biblStruct>
                     <analytic>
                        <author>
                           <name reg="Cooperstock, J.R.">J.R. Cooperstock</name>
                        </author>
                        <title level="a">Classroom of the Future: Enhancing Education through Augmented Reality</title>
                     </analytic>
                     <monogr>
                        <title level="m">Proceedings of the HCI International, Conference on Human-Computer Interaction, New Orleans</title>
                        <imprint>
                           <date value="2001">2001</date>
                        </imprint>
                     </monogr>
                     <note>
                        <xptr crdate="2005-04-15" to="http://www.cim.mcgill.ca/~jer/pub/hcii01.pdf"/>
                     </note>
                  </biblStruct>
                  <biblStruct>
                     <analytic>
                        <author>
                           <name reg="Duff, Wendy">Wendy Duff</name>
                        </author>
                        <title level="a">Evaluating Metadata at a Metalevel</title>
                     </analytic>
                     <monogr>
                        <title level="j">Archival Science</title>
                        <imprint>
                           <biblScope type="vol">1</biblScope>
                           <biblScope type="pages">285-294</biblScope>
                           <date value="2001">2001</date>
                        </imprint>
                     </monogr>
                  </biblStruct>
                  <biblStruct>
                     <analytic>
                        <author>
                           <name reg="Duff, Wendy">Wendy Duff</name>
                        </author>
                        <author>
                           <name reg="Johnson, Catherine A.">Catherine A. Johnson</name>
                        </author>
                        <title level="a">A Virtual Expression of Need</title>
                     </analytic>
                     <monogr>
                        <title level="j">American Archivist</title>
                        <imprint>
                           <biblScope type="vol">64</biblScope>
                           <biblScope type="pages">43-60</biblScope>
                           <date value="2001">2001</date>
                        </imprint>
                     </monogr>
                  </biblStruct>
                  <biblStruct>
                     <analytic>
                        <author>
                           <name reg="Hedstrom, Margaret">Margaret Hedstrom</name>
                        </author>
                        <title level="a">Research Challenges in Digital Archiving and Preservation</title>
                     </analytic>
                     <monogr>
                        <title level="u">NSF Post Digital Libraries Futures Workshop</title>
                        <imprint>
                           <date value="2003-06-15">15-17 June 2003</date>
                        </imprint>
                     </monogr>
                  </biblStruct>
                  <biblStruct>
                     <analytic>
                        <author>
                           <name reg="Kornbluh, Mark Lawrence">Mark Lawrence Kornbluh</name>
                        </author>
                        <author>
                           <name reg="Rehberger, Dean">Dean Rehberger</name>
                        </author>
                        <author>
                           <name reg="Fegan, Michael">Michael Fegan</name>
                        </author>
                        <title level="a">Media MATRIX: Creating Secondary Repositories. in Research and Technology for Digital Libraries</title>
                     </analytic>
                     <monogr>
                        <title level="m">Proceedings of the 8th European Conference, ECDL2004</title>
                        <imprint>
                           <publisher>Springer</publisher>
                           <pubPlace>Berlin</pubPlace>
                           <date value="2004">2004</date>
                           <biblScope type="pages">329-340</biblScope>
                        </imprint>
                     </monogr>
                  </biblStruct>
                  <biblStruct>
                     <analytic>
                        <author>
                           <name reg="Lynch, Clifford">Clifford Lynch</name>
                        </author>
                        <title level="a">Interoperability: the standards challenge for the 90s</title>
                     </analytic>
                     <monogr>
                        <title level="j">Wilson Library Bulletin</title>
                        <imprint>
                           <biblScope type="vol">March</biblScope>
                           <biblScope type="pages">38-42</biblScope>
                           <date value="1995">1995</date>
                        </imprint>
                     </monogr>
                  </biblStruct>
                  <biblStruct>
                     <analytic>
                        <author>
                           <name reg="Lynch, Clifford">Clifford Lynch</name>
                        </author>
                        <title level="a">Reflections toward the Development of a ‘Post D-L Research Agenda</title>
                     </analytic>
                     <monogr>
                        <title level="u">NSF Post Digital Libraries Futures Workshop</title>
                        <imprint>
                           <date value="2003-06-15">15-17 June 2003</date>
                        </imprint>
                     </monogr>
                  </biblStruct>
                  <biblStruct>
                     <analytic>
                        <author>
                           <name reg="Lynch, Clifford">Clifford Lynch</name>
                        </author>
                        <title level="a">Colliding with the Real World: Heresies and Unexplored Questions about Audience, Economics, and Control of Digital Libraries</title>
                     </analytic>
                     <monogr>
                        <editor>
                           <name reg="Bishop, Ann">Ann Bishop</name>
                        </editor>
                        <editor>
                           <name reg="Butterfield, Barbara">Barbara Butterfield</name>
                        </editor>
                        <editor>
                           <name reg="Van House, Nancy">Nancy Van House</name>
                        </editor>
                        <title level="m">Digital Library Use: Social Practice in Design and Evaluation</title>
                        <imprint>
                           <publisher>MIT Press</publisher>
                           <pubPlace>Cambridge, MA</pubPlace>
                           <date value="2001">2001</date>
                           <biblScope type="pages">191-218</biblScope>
                        </imprint>
                     </monogr>
                  </biblStruct>
                  <biblStruct>
                     <analytic>
                        <author>
                           <name reg="Marshall, Catherine">Catherine Marshall</name>
                        </author>
                        <title level="a">Annotation: from Paper Books to the Digital Library</title>
                     </analytic>
                     <monogr>
                        <title level="m">Proceedings of the ACM Digital Libraries ‘97 Conference, Philadelphia, PA</title>
                        <imprint>
                           <date value="1997-07-23">July 23-26, 1997</date>
                        </imprint>
                     </monogr>
                     <note>
                        <xptr crdate="2005-04015" to="http://www.csdl.tamu.edu/~marshall/dl97.pdf"/>
                     </note>
                  </biblStruct>
                  <biblStruct>
                     <analytic>
                        <author>
                           <name reg="Miller, Paul">Paul Miller</name>
                        </author>
                        <title level="a">The Concept of the Portal</title>
                     </analytic>
                     <monogr>
                        <title level="j">Ariadne</title>
                        <imprint>
                           <biblScope type="vol">30</biblScope>
                           <date value="2001-12-20">20 December 2001</date>
                        </imprint>
                     </monogr>
                     <note>
                        <xptr crdate="2005-04-15" to="http://www.ariadne.ac.uk/issue30/portal/intro.html"/>
                     </note>
                  </biblStruct>
                  <biblStruct>
                     <analytic>
                        <author>
                           <name reg="Page, K.R.">K.R. Page</name>
                        </author>
                        <author>
                           <name reg="Cruickshank, D.">D. Cruickshank</name>
                        </author>
                        <author>
                           <name reg="Roure, D.D.">D.D. Roure</name>
                        </author>
                        <title level="a">Its About Time: Link Streams as Continuous Metadata</title>
                     </analytic>
                     <monogr>
                        <title level="m">Proceedings of the Twelfth ACM Conference on Hypertext and Hypermedia (Hypertext ‘01)</title>
                        <imprint>
                           <date value="2001">2001</date>
                           <biblScope type="pages">93-102</biblScope>
                        </imprint>
                     </monogr>
                  </biblStruct>
                  <biblStruct>
                     <analytic>
                        <author>
                           <name reg="Shabajee, Paul">Paul Shabajee</name>
                        </author>
                        <title level="a">Primary Multimedia Objects and  'Educational Metadata': a Fundamental Dilemma for Developers of Multimedia Archives</title>
                     </analytic>
                     <monogr>
                        <title level="j">D-Lib Magazine</title>
                        <imprint>
                           <date value="2000-03">March 2000</date>
                        </imprint>
                     </monogr>
                     <note>
                        <xptr crdate="2005-04-15"
                              to="http://www.dlib.org/dlib/june02/shabajee/06shabajee.html"/>
                     </note>
                  </biblStruct>
                  <biblStruct>
                     <analytic>
                        <author>
                           <name reg="Rydberg-Cox, Jeffrey">Jeffrey Rydberg-Cox</name>
                        </author>
                        <title level="a">Cultural Heritage Language Technologies: Building an Infrastructure for Collaborative Digital Libraries in the Humanities</title>
                     </analytic>
                     <monogr>
                        <title level="j">Ariadne</title>
                        <imprint>
                           <biblScope type="vol">34</biblScope>
                           <date value="2003-01-14">14 January 2003</date>
                        </imprint>
                     </monogr>
                     <note>
                        <xptr crdate="2005-04-15"
                              to="http://www.ariadne.ac.uk/issue34/rydberg-cox/intro.html"/>
                     </note>
                  </biblStruct>
                  <biblStruct>
                     <analytic>
                        <author>
                           <name reg="Waller, Richard">Richard Walker</name>
                        </author>
                        <title level="a">Functionality of Digital Annotation: Imitating and Supporting Real-World Annotation</title>
                     </analytic>
                     <monogr>
                        <title level="j">Ariadne</title>
                        <imprint>
                           <biblScope type="vol">35</biblScope>
                           <date value="2003-04-30">30 April 2003</date>
                        </imprint>
                     </monogr>
                     <note>
                        <xptr crdate="2005-04-15" to="http://www.ariadne.ac.uk/issue35/waller/"/>
                     </note>
                  </biblStruct>
                  <biblStruct>
                     <analytic>
                        <author>
                           <name reg="Van de Sompel, Herbert">Herbert Van de Sompel</name>
                        </author>
                        <author>
                           <name reg="Oren Beit-Arie, O.">O. Oren Beit-Arie</name>
                        </author>
                        <title level="a">Open linking in the scholarly Information Environment Using the Open URL Framework</title>
                     </analytic>
                     <monogr>
                        <title level="j">D-Lib Magazine</title>
                        <imprint>
                           <date value="2001-03">March 2001</date>
                        </imprint>
                     </monogr>
                     <note>
                        <xptr crdate="2005-04-15"
                              to="http://www.dlib.org/dlib/march01/vandesompel/03vandesompel.html"/>
                     </note>
                  </biblStruct>
               </listBibl>
            </div1>
         </div0>
         <div0 type="EmbeddedDoc">
            <div1>
               <head>Developing <title level="m">MediaMatrix</title>: A Secondary Repository Tool </head>
               <p rend="Presenter">Michael Fegan</p>
               <p>Speaker Two will argue that one way we can enhance access to online digital objects (particularly in the humanities) is to facilitate the creation of secondary repositories. These repositories will provide discipline/community specific metadata and applications and will allow users to find, use, manipulate and analyze digital objects more easily. </p>
               <p>Even though access by specialist scholars and educators to digital objects has grown at an exponential rate, tangible factors have prevented them from fully taking advantage of these resources in the classroom, where they could provide the conceptual and contextual knowledge of primary objects for their students. When educators do find the materials they need, using objects from various primary repositories to put together presentations and resources for their students and research can be challenging. Beyond merely creating lists of links to primary and secondary resources, assembling galleries of images, segmenting and annotating long audio and video files require far more technical expertise and time than can realistically be expected in the educational context. In addition, even though scholars have a long history of researching archives and are comfortable sifting through records, locating items, and making annotations, comparisons, summaries, and quotations, these processes do not yet translate into online tools. Contemporary bibliographic tools have expanded to allow these users to catalogue and keep notes about media, but they do not allow users to mark specific passages and moments in multimedia, segment it, and return to specific places at a later time. Multimedia and digital repository collections thus remain underutilized in education and research because the tools to manipulate the various formats often <soCalled>frustrate would be users</soCalled> and take too much cognitive effort and time to learn. </p>
               <p>To this end, Speaker Two has developed <title level="m">Media Matrix</title> 1.0 — an online, server-side suite of tools that allows users to locate specific media and streaming media files found in digital repositories and segment, annotate and organize this media online. The application has been developed as part of the <title level="m">Spoken Word Project</title> funded by <title>Digital Libraries Initiative II: Digital Libraries in the Classroom Program</title>, <title>National Science Foundation</title> in conjunction with UK's <title>Joint Information Systems Committee</title>.</p>
               <p>This application is an online tool that allows users to easily find, segment, annotate and organize text, image, and streaming media found in traditional online repositories. <title level="m">MediaMatrix</title> works within a web browser, using the browser's bookmark feature, a familiar tool for most users. When users find a digital object at a digital library or repository, they simply click the <title level="m">MediaMatrix</title> bookmark and it searches through the page, finds the appropriate digital media, and loads it into an editor. Once this object is loaded, portions of the media can be isolated for closer and more detailed work — portions of an audio or video clip may be edited into a time-segment, images may be cropped then enlarged to highlight specific details. <title level="m">MediaMatrix</title> provides tools so that these media can be placed in juxtaposition, for instance, two related images, a segment of audio alongside related images and audio, and so forth. This can be particularly effective for students and researchers who need to fit images into a presentation or would like to demonstrate specific nuances and details about portions of images or artwork. Most importantly, textual annotations can be easily added to the media, and all this information is then submitted and stored on a personal portal page. </p>
               <p>A portal page might be created by a scholar-educator who wishes to provide specific and contextualized resources for classroom use, and/or by a student creating a multimedia-rich essay for a class assignment. While these users have the immediate sense that they are working directly with primary objects, it is important to emphasize that primary repository objects are not actually being downloaded and manipulated. <title level="m">MediaMatrix</title> does not store the digital object, rather, it stores a pointer to the digital object (URI) along with time or dimension offsets the user specified for the particular object and the user's annotation for that particular object. This use of URI pointing as opposed to downloading is especially significant because it removes the possibility that items may be edited and critiqued in contexts divorced from their original repositories, which hold the primary and crucial metadata for such objects. </p>
               <p>As long as primary repositories maintain persistent URIs for their holdings the pointer to the original digital object will always remain within the secondary repository, which acts as a portal to both the primary collection and contextualizing and interpretive information generated by individuals on items in those collections. This information can be stored in a relational database along with valuable information about the individual, who supplies a profile regarding their scholarly/educational background, and provides information of the specific purposes for this work and the user-group (a class, for example) accessing the materials. <title level="m">Media Matrix</title> is a PHP based server side application that stores information in a <title level="m">mySQL</title> database and exports that information into XML for display. The development of the tool and programming environment have been designed to keep it library and archive independent so that it can work with almost any site on the internet. It can also work easily with any of the standard courseware packages. The tool is also search independent because it relies on traditional internet search tools and a site's discovery tools to find an object. Once objects are found, <title level="m">Media Matrix</title> is deployed by the user. Because <title level="m">Media Matrix</title> does not actually copy the digital object from the site (it only stores a pointer to the object in the form of a URI and whatever time offsets are created by the user), it avoids some of the copyright and fair use pitfalls that often keep users from working with digital objects (although there are issues of deep linking to be addressed). The secondary repository can thus be searched and utilized in any number of ways. </p>
               <p>Historians, for example, can browse the portals of other historians working specifically in their research areas or K-12 teachers can browse grade appropriate sections defined by specific grade levels and subjects to see what digital objects other teachers are using or, more important, for time challenged teachers, they can find specific presentations created around standard topics and curriculum frameworks. Users can also perform keyword searches over the annotations created by all users or specific groups of users. A teacher, for instance, can choose to search through only the information in eleventh-grade Civics groups in hopes of finding information that speaks directly to his/her needs. Because users have gathered content from across the Internet and from a variety of digital repositories, searching <title level="m">Media Matrix</title> is equivalent to searching multiple repositories at once. Once users find an object from a particular digital library, they can jump to that repository to find what other objects are available. </p>
               <p>Going beyond demonstration, this paper will also dive the latest findings and evaluations based on initial user testing in several classrooms as <title>Tufts University</title> and <title>Michigan State University</title>.</p>
            </div1>
         </div0>
      </body>
   </text>
</TEI.2>