<?xml version="1.0" encoding="UTF-8"?>
<TEI.2 id="paper_206_downie">
   <teiHeader>
      <fileDesc>
         <titleStmt>
            <title>Modelling Complex Multimedia Relationships in the Humanities Computing Context: Are Dublin Core and FRBR up to the Task?</title>
            <author>
               <name reg="Downie, J. Stephen">J. Stephen Downie</name>
            </author>
            <author>
               <name reg="Renear, Allen">Allen Renear</name>
            </author>
            <author>
               <name reg="Mathes, Adam">Adam Mathes</name>
            </author>
            <author>
               <name reg="Medina, Karen">Karen Medina</name>
            </author>
            <author>
               <name reg="Dubin, David">David Dubin</name>
            </author>
            <author>
               <name reg="Lee, Jin Ha">Jin Ha Lee</name>
            </author>
            <respStmt>
               <resp>Marked up by </resp>
               <name reg="Holmes, Martin">Martin Holmes</name>
               <lb/>
               <name reg="Baer, Patricia">Patricia Baer</name>
            </respStmt>
         </titleStmt>
         <publicationStmt>
            <p>Marked up to be included in the ACH/ALLC 2005 Conference Abstracts book.</p>
         </publicationStmt>
         <sourceDesc>
            <p>None</p>
         </sourceDesc>
      </fileDesc>
      <profileDesc>
         <textClass>
            <classCode>paper</classCode>
            <keywords>
               <list>
                  <item>multimedia modelling</item>
                  <item>Dublin Core</item>
                  <item>FRBR</item>
               </list>
            </keywords>
         </textClass>
      </profileDesc>
      <revisionDesc>
         <list>
            <item>MDH: Created from John Bradley's XML <date value="2005-03-10">10 March 2005</date>
            </item>
            <item>MDH: Merged author's revisions <date value="2005-03-10">10 March 2005</date>
            </item>
            <item>MDH: PGL's editorial revisions merged <date value="2005-05-17">17 May 2005</date>
            </item>
         </list>
      </revisionDesc>
   </teiHeader>
   <text>
      <front>
         <docTitle n="Modelling Complex Multimedia Relationships in the Humanities Computing Context: Are Dublin Core and FRBR up to the Task?">
            <titlePart>Modelling Complex Multimedia Relationships in the Humanities Computing Context: Are Dublin Core and FRBR up to the Task?</titlePart>
         </docTitle>
         <docAuthor>
            <name reg="Downie, J. Stephen">J. Stephen Downie</name>
            <address>
               <addrLine>jdownie@uiuc.edu</addrLine>
            </address>
         </docAuthor>
         <titlePart type="affil">University of Illinois at Urbana-Champaign</titlePart>
         <docAuthor>
            <name reg="Renear, Allen">Allen Renear</name>
            <address>
               <addrLine>renear@uiuc.edu</addrLine>
            </address>
         </docAuthor>
         <titlePart type="affil">University of Illinois at Urbana-Champaign</titlePart>
         <docAuthor>
            <name reg="Mathes, Adam">Adam Mathes</name>
            <address>
               <addrLine>adam@adammathes.com</addrLine>
            </address>
         </docAuthor>
         <titlePart type="affil">University of Illinois at Urbana-Champaign</titlePart>
         <docAuthor>
            <name reg="Medina, Karen">Karen Medina</name>
            <address>
               <addrLine>kmedina@alexia.lis.uiuc.edu</addrLine>
            </address>
         </docAuthor>
         <titlePart type="affil">University of Illinois at Urbana-Champaign</titlePart>
         <docAuthor>
            <name reg="Dubin, David">David Dubin</name>
            <address>
               <addrLine>ddubin@uiuc.edu</addrLine>
            </address>
         </docAuthor>
         <titlePart type="affil">University of Illinois at Urbana-Champaign</titlePart>
         <docAuthor>
            <name reg="Lee, Jin Ha">Jin Ha Lee</name>
            <address>
               <addrLine>jinlee1@uiuc.edu</addrLine>
            </address>
         </docAuthor>
         <titlePart type="affil">University of Illinois at Urbana-Champaign</titlePart>
      </front>
      <body>
         <div0>
            <head>Introduction</head>
            <p>It is now widely recognized that the creation, management, and
            analysis of content other than text is extremely important if the
            digital humanities are to deliver access to, and provide an analytical
            purchase on, the full range of human culture. However it is not clear
            to us whether the cataloguing and classification systems for digital
            content are up to the task. Difficulties in this area threaten to
            impede both the development of tools and techniques — and the
            production of sound theoretical results. In our paper we discuss some
            of these problems, focusing on <emph>relationships</emph> amongst the
            various cultural modes of expression. With the intention of convening a
            larger discussion of how these confusions might be remedied, we then
            propose directions for some clarification and improvement. However, the
            larger issues here are not merely terminological and resist any easy
            resolution. </p>
         </div0>
         <div0>
            <head>The Problem</head>
            <p>Within the humanities computing community it has been a commonplace
               that while the emphasis on representing and analyzing textual content
               may be understandable, it is important to support the other kinds of
               content as well. We agree. The <soCalled>digital humanities</soCalled> must support the
               full range of human cultural products: text, music, images, dance,
               cinema, architecture, design, and so on. At present there are many
               different research communities looking into the organization of, and
               enhanced access to, these various modes of cultural expression. There
               is a text retrieval community (see Baez-Yates &amp; Ribeiro-Neto), a growing music information
               retrieval community (see Futrelle &amp; Downie), an image retrieval community (see Hsin-liang &amp; Rasmussen),
               and so on. Notwithstanding the real progress being made by each of these,
               very astonishingly little work has yet been done to comprehensively
               address the issue that each of these individual modes of expression
               interact with each other in the ordinary course of production,
               management and use, as well as how formats at varying level of
               abstraction interact within a single modality.</p>
            <p>First, to illustrate how the modes of expression interact with each
               other, let us consider the <title level="m">Othello</title>
               corpus. An incomplete inventory of the <title level="m">Othello</title> corpus includes the novella
               by Giraldi Cinthio (1565)
               <cit>
                  <q>upon which Shakespeare based his play</q>
                  <bibl>Hunt</bibl>
               </cit>, Shakespeare's play (1604), the operas by Rossini (1816)                   and Verdi (1887), Dvorak's concert overture, Op.
               93 (1892), and the ballet
               by Lubovitch (2002). If we
               are going to create a digital humanities repository worthy of use by
               humanities scholars and their students, it is incumbent on us to build
               a system that can <soCalled>collocate</soCalled>, or gather up, all extant digital
               representations of <title level="m">Othello</title>:
               all recordings, all scores, all movies, all choreographies, all
               libretti, all scripts, all set and costume designs, all critiques, and
               so on. To aid in this collocation, we need to clearly express the
               relationships between each of these things at both the specific and
               generic levels. On the specific level, we need to indicate that, for
               example, <title level="m">Othello</title>
               choreographic labanotation <hi rend="bold">
                  <hi rend="italics">W</hi>
               </hi> is directly
               based on <title level="m">Othello</title> score <hi rend="bold">
                  <hi rend="italics">X</hi>
               </hi>, which was
                  specifically used in <title level="m">Othello</title>
               movie <hi rend="bold">
                  <hi rend="italics">Y</hi>
               </hi>,
                  and also released in <title level="m">Othello</title>
               soundtrack recording <hi rend="bold">
                  <hi rend="italics">Z</hi>
               </hi>. On the
               generic level, we need to indicate that all <title level="m">Othello</title> scores have some generic
               relationship to all <title level="m">Othello</title>
               recordings, to all <title level="m">Othello</title>
               movies, etc. in such a way that explicates that the works are all
               members of the <title level="m">Othello</title>
               corpus. </p>
            <p>Second, to illustrate interactions between formats within a single
               mode, consider only the music mode of the <title level="m">Othello</title> corpus. For each musical
               realization there usually exists a symbolic score and its individual
               parts. These symbolic representations can, in turn, be
               represented in a variety of digital formats: MusicXML, TIFF, <title level="m">Finale</title>,
               etc. The aural aspect of the music is represented in another variety
               of digital formats: WAV, MP3, Ogg Vorbis, etc. Again, complex
               relationships exist between the <soCalled>symbolic</soCalled> and <soCalled>aural</soCalled>
            representations at both the specific (e.g., recording <hi rend="italics">X</hi> used score <hi rend="italics">Y</hi>) and generic levels (e.g.,
            a <soCalled>fakebook</soCalled> score used to generate different recordings of
               improvised renditions). Other potentially complex relationships exist
               because many of these formats can be used to generate the others. For
               example, a TIFF scan of the <soCalled>original</soCalled> score can be fed through an
               Optical Music Recognition (OMR) system to create a MusicXML score file
               which can generate a MIDI file which then can generate any of the audio
               file formats. Further complicating matters, research is also underway
               to <soCalled>backwards</soCalled> create scores from audio recordings which would capture,
               symbolically, the nuances of a given performance (e.g., Plumbley et al.). </p>
         </div0>
         <div0>
            <head>Standards for Expressing Relationships Among and Within Modes</head>
            <p>There is, of course, a body of work — standards and related research
               — within the cataloguing and classification communities that holds some
               promise for supporting the relationships described above. The <title>Dublin
               Core</title> (<title>DC</title>) is perhaps the most widely used within the digital
               humanities. IFLA's <title level="m">Functional Requirements for Bibliographic Records</title>
               (FRBR) is becoming
               increasingly important. Work by organizations devoted to specific
               modalities such as the Federation Internationale des Archives du Film
               (FIAF)<note n="1">
                  <xptr to="http://www.fiafnet.org/uk/"/>
               </note>, and the
               International Association of Sound and Audiovisual Archives (IASA)<note n="2">
                  <xptr to="http://www.iasa-web.org/index.htm"/>
               </note>, as well as work by such
               researchers as Martha M. Yee (moving pictures — see Yee), and Richard Smiraglia (music — see Smiraglia), etc., are also
               contributing insights and theory to this research domain.</p>
         </div0>
         <div0>
            <head>Are We There Yet?</head>
            <p>We have reviewed results from projects and analyses that suggest
                  there is still much work to do before the functionality envisaged above
                  is a reality. Here we describe one such project that attempts to use
                  <title>FRBR</title> and the <title>DC</title> to support inter- and intra-modal
                  relationships. The <title>DC</title> does in fact hold the most promise for
                  representing these relationships in a way that enables computer
                  supported exploitation for retrieval, navigation, analysis, and so on.</p>
            <p>Ayres describes a project at MusicAustralia to use <title>FRBR</title> and
                  <title>DC</title> to create a digital repository that explicates the
                  complex relationships between the works, expressions, manifestations
                  and items of a collection of music and lyrics found that: <cit>
                  <q>The <hi rend="code">DC.Relation</hi> element can be used to display and support
                     navigation between items with flat, horizontal relationships [i.e.,
                     inter-modal relationships like those between some music and its text].
                     However, the kinds of relationships MusicAustralia wants to expose are
                     a combination of vertical [i.e., intra-modal relationships like those
                     between a score and its recording] and horizontal relationships, and
                     rely heavily on abstract but well understood and demonstrable concepts
                     of the Work and the Expression or version. At this stage, <title>DC</title> does not
                     offer support exposure of navigational pathways that explicitly
                     acknowledge both vertical and horizontal relationships. [Bracketed
                     injections are ours.]</q>
               </cit>
            </p>
            <p>Indeed, a close look at <title>Dublin Core</title> format and type elements suggests
               that the level of precision, and subtlety required is probably not yet
               available there. For instance the <title>DC</title> type vocabulary includes such
               disparate things as <soCalled>
                  <hi rend="code">sound</hi>
               </soCalled>, <soCalled>
                  <hi rend="code">text</hi>
               </soCalled> and <soCalled>
                  <hi rend="code">physical object</hi>
               </soCalled>, and examples
               for <soCalled>
                  <hi rend="code">sound</hi>
               </soCalled> include <soCalled>
                  <hi rend="code">music playback file format</hi>
               </soCalled> and <soCalled>
                  <hi rend="code">an audio compact
               disc</hi>
               </soCalled> (DCMI Usage Board).</p>
         </div0>
         <div0>
            <head>Next Steps: Exploring Ayres' Open Questions</head>
            <p>Because the work of Ayres and her colleagues represents the most
               thorough examination of the combination of <title>FRBR</title> modelling and <title>Dublin
               Core</title> encoding to build a comprehensive multimodal repository, we are
               taking it as the starting point for our present work. The Ayres study
               uncovers a series of unresolved open questions associated with <title>FRBR</title> and
               the modelling of real-world multimodal information. In the Ayres case,
               the two modes are music (i.e., scores, recordings, etc.) and text
               (i.e., lyrics, poems, etc.). These two modes come together to create
               what we commonly consider to be <soCalled>songs</soCalled>. To paraphrase Ayre's first
               open question: 
              
              <list type="force-numbering">
                  <item n="1">Should we model as the primary work: 
               <list type="lower-alpha">
                        <item>the music;</item>
                        <item>the text; or,</item>
                        <item>the combination of text and music?</item>
                     </list>
                  </item>
               </list>
            </p>
            <p> Ayres clearly illustrates that each modelling approach above
               clarifies a specific set of relationships between the music
               compositions and the texts while at the same time obscuring other
               relationships. The examination of this question has implications beyond
               the simpler music-text modelling case. For example, what are the
               implications when we attempt to model more complex cases (e.g., the
               <title level="m">Othello</title> corpus, a Hollywood musical, etc.) with their exponentially
               growing relationships between text (novellas, plays, libretti,
               etc), music (i.e., notations, recordings, etc.), choreography (i.e.,
               notations, video), and so on? Our paper examines this very question. We
               also explore the broader ramifications of Ayre's three related
               subsidiary open questions:

            <list type="force-numbering">
                  <item n="2">Should all notated and performed expressions of music [or dance,
                  or text, etc.] be modelled as a single expression category?</item>
                  <item n="3">Should expressions themselves be further modelled to include
                  sub-categories for notated and performed expressions?</item>
                  <item n="4">Should performed expressions based on particular notated
                  expressions be modelled as expressions of expressions?</item>
               </list>
            </p>
            <p>By examining these fundamental questions, we intend to
               encourage a long-overdue conversation within the humanities computing
               community. Unless our representation schemes do justice to the
               multidimensional complexity of cultural content in all its modes of
               expression, we will not realize the full potential of digital
               humanities repositories.
            </p>
         </div0>
      </body>
      <back>
         <div type="Bibliography">
            <head>Bibliography</head>
            <listBibl>
               <biblStruct>
                  <analytic>
                     <author>
                        <name reg="Ayres, Marie-Louise">Marie-Louise Ayres</name>
                     </author>
                     <title level="a">MusicAustralia: Experiments with DC.Relation</title>
                  </analytic>
                  <monogr>
                     <title level="u">Presented at DC-ANZ (Dublin Core in Australia and New Zealand) Conference in Canberra</title>
                     <imprint>
                        <date value="2003-02">February 2003</date>
                     </imprint>
                  </monogr>
                  <note>
                     <xptr crdate="2004-11-17" to="http://www.nla.gov.au/nla/staffpaper/2003/ayres1.html"/>
                  </note>
               </biblStruct>
               <biblStruct>
                  <monogr>
                     <author>
                        <name reg="Baez-Yates, R.">R. Baez-Yates</name>
                     </author>
                     <author>
                        <name reg="Ribeiro-Neto, B.">B. Ribeiro-Neto</name>
                     </author>
                     <title level="m">Modern information retrieval</title>
                     <edition>1st ed.</edition>
                     <imprint>
                        <publisher>Addison-Wesley</publisher>
                        <pubPlace>Reading, MA</pubPlace>
                        <date value="1999">1999</date>
                     </imprint>
                  </monogr>
               </biblStruct>
               <biblStruct>
                  <monogr>
                     <author>
                        <name reg="DCMI Usage Board">DCMI Usage Board</name>
                     </author>
                     <title level="m" type="WWW document">DCMI Type Vocabulary</title>
                     <imprint>
                        <date value="2004">2004</date>
                     </imprint>
                  </monogr>
                  <note>
                     <xptr crdate="2004-11-27"
                           to="http://dublincore.org/documents/2004/06/14/dcmi-type-vocabulary/"/>
                  </note>
               </biblStruct>
               <biblStruct>
                  <analytic>
                     <author>
                        <name reg="Futrelle, Joe">Joe Futrelle</name>
                     </author>
                     <author>
                        <name reg="Downie, J. Stephen">J. Stephen Downie</name>
                     </author>
                     <title level="a">Interdisciplinary
                        Research Issues in Music Information Retrieval: ISMIR 2000-2002</title>
                  </analytic>
                  <monogr>
                     <title level="j">Journal of New Music Research</title>
                     <imprint>
                        <biblScope type="vol">32.2</biblScope>
                        <biblScope type="pages">121-131</biblScope>
                        <date value="2003">2003</date>
                     </imprint>
                  </monogr>
               </biblStruct>
               <biblStruct>
                  <analytic>
                     <author>
                        <name reg="Hsin-liang, Chen">Chen Hsin-liang</name>
                     </author>
                     <author>
                        <name reg="Rasmussen, Edie M.">Edie M. Rasmussen</name>
                     </author>
                     <title level="a">Intellectual access to images</title>
                  </analytic>
                  <monogr>
                     <title level="j">Library Trends</title>
                     <imprint>
                        <biblScope type="vol">48.2</biblScope>
                        <biblScope type="pages">291-302</biblScope>
                        <date value="1999">1999</date>
                     </imprint>
                  </monogr>
               </biblStruct>
               <biblStruct>
                  <monogr>
                     <author>
                        <name reg="Hunt, Mary Ellen">Mary Ellen Hunt</name>
                     </author>
                     <title level="m" type="WWW document">Review of San Francisco Ballet,
                        "Othello". War Memorial Opera House, San Francisco, CA</title>
                     <imprint>
                        <publisher>criticaldance.com</publisher>
                        <date value="2002">2002</date>
                     </imprint>
                  </monogr>
                  <note>
                     <xptr crdate="2004-11-27"
                           to="http://www.criticaldance.com/reviews/2002/sfb-othello_020301.html"/>
                  </note>
               </biblStruct>
               <biblStruct>
                  <monogr>
                     <title level="m" type="WWW document">
                        <name reg="Functional Requirements for Bibliographic Records (FRBR)">Functional Requirements for Bibliographic Records (FRBR)</name>
                     </title>
                     <imprint>
                        <publisher>UBCIM Publications, 19</publisher>
                     </imprint>
                  </monogr>
                  <note>
                     <xptr crdate="2004-11-27" to="http://www.ifla.org/VII/s13/frbr/frbr.htm"/>
                  </note>
               </biblStruct>
               <biblStruct>
                  <analytic>
                     <author>
                        <name reg="Plumbley, M.D.">M.D. Plumbley</name>
                     </author>
                     <author>
                        <name reg="Abdallah, S.A.">S.A. Abdallah</name>
                     </author>
                     <author>
                        <name reg="Bello, J.P.">J.P. Bello</name>
                     </author>
                     <author>
                        <name reg="Davies, M.E.">M.E. Davies</name>
                     </author>
                     <author>
                        <name reg="Monti, G.">G. Monti</name>
                     </author>
                     <author>
                        <name reg="Sandler, M.B.">M.B. Sandler</name>
                     </author>
                     <title level="a">Automatic Music Transcription and  Audio Source Separation</title>
                  </analytic>
                  <monogr>
                     <title level="j">Cybernetics &amp; Systems</title>
                     <imprint>
                        <biblScope type="vol">33.6</biblScope>
                        <biblScope type="pages">603-627</biblScope>
                        <date value="2002">2002</date>
                     </imprint>
                  </monogr>
               </biblStruct>
               <biblStruct>
                  <monogr>
                     <author>
                        <name reg="Smiraglia, Richard">Richard Smiraglia</name>
                     </author>
                     <title level="m">The Nature of "a work": implications for the organization of knowledge</title>
                     <imprint>
                        <publisher>Scarecrow Press</publisher>
                        <pubPlace>Lanham, MD</pubPlace>
                        <date value="2001">2001</date>
                     </imprint>
                  </monogr>
               </biblStruct>
               <biblStruct>
                  <analytic>
                     <author>
                        <name reg="Yee, Martha M.">Martha M. Yee</name>
                     </author>
                     <title level="a">What is a Work?</title>
                  </analytic>
                  <monogr>
                     <editor>
                        <name reg="Weihs, Jean">Jean Weihs</name>
                     </editor>
                     <title level="m">The Principles and Future of AACR: Proceedings of the International Conference on the Principles and Future Development of AACR, Toronto, Ontario, Canada, October 23-25, 1997</title>
                     <imprint>
                        <publisher>Canadian Library Association</publisher>
                        <pubPlace>Ottawa</pubPlace>
                        <date value="1998">1998</date>
                        <biblScope type="pages">62-104</biblScope>
                     </imprint>
                  </monogr>
               </biblStruct>
            </listBibl>
         </div>
      </back>
   </text>
</TEI.2>