<?xml version="1.0" encoding="UTF-8"?>
<TEI.2 id="paper_110_janssen">
   <teiHeader>
      <fileDesc>
         <titleStmt>
            <title>Animated Dynamic Highlighting</title>
            <author>
               <name reg="Janssen, Bill">Bill Janssen</name>
            </author>
            <author>
               <name reg="Gurevich, Olga">Olga Gurevich</name>
            </author>
            <author>
               <name reg="Karttunen, Lauri">Lauri Karttunen</name>
            </author>
            <respStmt>
               <resp>Marked up by </resp>
               <name reg="Holmes, Martin">Martin Holmes</name>
               <lb/>
               <name reg="Baer, Patricia">Patricia Baer</name>
            </respStmt>
         </titleStmt>
         <publicationStmt>
            <p>Marked up to be included in the ACH/ALLC 2005 Conference Abstracts book.</p>
         </publicationStmt>
         <sourceDesc>
            <p>None</p>
         </sourceDesc>
      </fileDesc>
      <profileDesc>
         <textClass>
            <classCode>paper</classCode>
            <keywords>
               <list>
                  <item>linguisitically-directed reading</item>
                  <item>computationally augmented reading</item>
               </list>
            </keywords>
         </textClass>
      </profileDesc>
      <revisionDesc>
         <list>
            <item>MDH: Created from John Bradley's XML <date value="2005-03">March 2005</date>
            </item>
            <item>MDH: Proofed by RS <date value="2005-05-25">25 May 2005</date>
            </item>
            <item>MDH: Proofed by PGK; entered minor corrections <date value="2005-05-26">26 May 2005</date>
            </item>
         </list>
      </revisionDesc>
   </teiHeader>
   <text>
      <front>
         <docTitle n="Animated Dynamic Highlighting">
            <titlePart>Animated Dynamic Highlighting</titlePart>
         </docTitle>
         <docAuthor>
            <name reg="Janssen, Bill">Bill Janssen</name>
            <address>
               <addrLine>janssen@parc.com</addrLine>
            </address>
         </docAuthor>
         <titlePart type="affil">Palo Alto Research Center</titlePart>
         <docAuthor>
            <name reg="Gurevich, Olga">Olga Gurevich</name>
            <address>
               <addrLine>olya@socrates.berkeley.edu</addrLine>
            </address>
         </docAuthor>
         <titlePart type="affil">University of California, Berkeley</titlePart>
         <docAuthor>
            <name reg="Karttunen, Lauri">Lauri Karttunen</name>
            <address>
               <addrLine>karttunen@parc.com</addrLine>
            </address>
         </docAuthor>
         <titlePart type="affil">Palo Alto Research Center</titlePart>
      </front>
      <body>
         <div0>
            <head>1. Introduction</head>
            <p>The recent years have seen an exponential increase in the amount of
information available through the Internet on any given topic. 
Information retrieval techniques have been steadily improving and can
provide a mass of relevant results, but those results still have to be
processed and digested by a human reader.  Information professionals need technology that helps people absorb large amounts of text quickly.  We introduce <title level="m">Animated 
Dynamic Highlighting (ADH)</title>, an interactive, user-controlled technology to improve
presentational aspects of the reading task.  We present the research underlying the ideas of <title level="m">ADH</title>, the <title level="m">ADH</title> technology 
itself, and some results from an initial user study evaluating its effectiveness and 
   usability.</p>
         </div0>
         <div0>
            <head>2. Background</head>
            <p>The study described in this paper is part of a larger effort at
PARC called <title level="m">Productive Reading</title>. We are looking at ways in which
computation can be applied to the reading process, in two major ways:
to enhance document content, and to enhance the user experience of
reading.</p>
            <p>The current model of the reading interface is heavily based on the
static experience of words imaged on paper. This model has been
carried over directly to the presentation of text to the computer
screen. Some attention has been given to using computation to modify
the presentation structure of documents (Beveret al. 75-87; Walker et al.), but
with certain exceptions (Chang et al.). These presentations are
inherently static.</p>
            <p>The major exception to this is the presentation technique commonly
known as <title level="m">rapid serial visual presentation (RSVP)</title>. The overview of
studies in <title level="m">RSVP</title> given in Sicheritz suggest that a dynamically
altered presentation of text may be able to enhance comprehension
   without negatively affecting reading speeds. However, <title level="m">RSVP</title> is often
found to suffer from some serious disadvantages, notably eyestrain,
usually attributed to the fact that the user's eyes do not move from
a fixed position, and user anxiety, due to the inability to look back
at previously-read text.  Other studies such as Castelhano et al. have demonstrated ways to alleviate some of these issues.</p>
         </div0>
         <div0>
            <head>3. <title level="m">ADH</title>
            </head>
            <div1>
               <head>3.1. What <title level="m">ADH</title> does</head>
               <p>The goal of <title level="m">ADH</title> is to preserve the apparent advantages of <title level="m">RSVP</title>,
while mitigating the apparent disadvantages. It paces the user
through an electronic document, sequentially highlighting parts of the
text, each a few words long, without modifying the spatial layout of the original page, so that
the reader's eyes move in a normal reading fashion. The speed with which the
highlighting moves depends on properties of the chunks and on a base speed set by
   the user. The reader can adjust the speed, and also restart <title level="m">ADH</title> from any point in the
document. The reading speed may be at a speed somewhat faster than the
user's habitual reading speed.</p>
            </div1>
            <div1>
               <head>3.2. The viewing technology</head>
               <p>The <title level="m">ADH</title> presentation system is part of a larger system at PARC for
archiving and reading documents, called <title level="m">UpLib</title> (Chang; Mackinlay; Zellweger). The <title level="m">UpLib</title> system
includes a document reader, called <title level="m">ReadUp</title>, which normally supports a conventional
page-oriented document display. <title level="m">ReadUp</title> was modified to present documents
in both <title level="m">RSVP</title> and <title level="m">ADH</title> mode.</p>
               <figure rend="ImageLink">
                  <head>Figure 1: A document page shown with <title level="m">ADH</title> highlighting</head>
                  <p>
                     <xref>paper_110_janssen_1.jpg</xref>
                  </p>
                  <figDesc>Figure 1: A document page shown with <title level="m">ADH</title> highlighting</figDesc>
               </figure>
            </div1>
            <div1>
               <head>3.3. Phrase-breaking technology</head>
               <p>The text of a document is first annotated with part-of-speech tags
   using the <title level="m">Inxight</title> tagger. In
contrast to most taggers, the <title level="m">Inxight</title> tool has a large inventory
of labels to distinguish between different types of determiners,
adverbs,  and pronouns.  While the information is less
detailed than a syntactic parser could produce, the markup makes
it possible to divide the text into semantically coherent pieces. We
have defined a large set of phrasal patterns and compiled them into
   finite-state transducers (Beesley; Karttunen). The transducers are applied in a cascade
taking the output of one pattern matching step as input to the next
one. This process splits the input text into phrases proceeding from
larger constituents (sentences and clauses) to smaller constituents
(NPs, VPs, PPs) and their components. Each phrase should contain 
between 2 and 4 content words (such as nouns, verbs,
adjectives, and adverbs); the boundaries of syntactic constituents are
in most cases preserved.  An example of a partitioned sentence is below:</p>
               <ab>
                  <hi rend="code">&lt;phrase&gt;The Marine Corps
band&lt;/phrase&gt; &lt;phrase&gt;played the national
anthem&lt;/phrase&gt; &lt;phrase&gt;as Dailey unveiled a
space-suited Glenn&lt;/phrase&gt; &lt;phrase&gt;in his
new
place of honor,&lt;/phrase&gt; &lt;phrase&gt;suspended
40 feet
above the floor&lt;/phrase&gt; &lt;phrase&gt;of the
museum's
breathtaking Gallery 100.&lt;/phrase&gt;</hi>
               </ab>
               <p>Finally, the established phrase boundaries are projected back to the
original source text to enable the dynamic highlighting in presenting
the text to the user.</p>
            </div1>
            <div1>
               <head>3.4. Display timing</head>
               <p>Each phrase is allocated an initial display time based on the
user-selected speed. This base span is then modified in a number of
ways: shorter phrases get somewhat less time, longer ones more time.
The timespan is further modified to reflect the findings in Just; Carpenter: phrases ending a line, at the end of a page, at
the beginning of a new line, or ending a sentence all receive varying
amounts of extra time, reflecting the extra time human subjects tend
to take with these kinds of phrases. Finally, the occurrence of
linguistic constructs in the phrase, such as pronouns and compound
nouns, is used to modify the timespan in additional ways.</p>
            </div1>
         </div0>
         <div0>
            <head>4. User Study</head>
            <div1>
               <head>4.1. Method</head>
               <p>The goal of the user study was to assess the effectiveness of <title level="m">ADH</title> and to compare 
it to <title level="m">RSVP</title> (Sicheritz); the same phrase-breaking
and timing were used for <title level="m">ADH</title> and <title level="m">RSVP</title>.
Eighteen test subjects, mostly researchers and interns, were given three alternative modes of
presenting documents: plain (not modified in any way), <title level="m">ADH</title>, and <title level="m">RSVP</title>.  The 
texts contained simple factual information and were followed by  questions testing the 
recall accuracy. The first stage of the experiment used documents with
automatic phrase
breaking, the second one used manual phrase breaking.</p>
               <p>The subjects were also asked  about their
reactions to the <title level="m">ADH</title> and <title level="m">RSVP</title> technologies.</p>
            </div1>
            <div1>
               <head>4.2. Results</head>
               <p>Although there were too few subjects for significant results, some
interesting trends emerged.  Overall, <title level="m">ADH</title> was found to be faster than 
   either plain or <title level="m">RSVP</title> mode; it was also somewhat less accurate.  In general, there was a tradeoff between speed and accuracy
in <title level="m">ADH</title>: the faster a document was read, the less accurate was the
recall.  However, both the speed and accuracy results were better
with manual phrase-breaking than with automatic phrase-breaking. 
   Users found both <title level="m">ADH</title> and <title level="m">RSVP</title> to be somewhat annoying, but rated <title level="m">RSVP</title> worse than <title level="m">ADH</title>.  However, most said they
would use <title level="m">ADH</title> again for skimming through short articles, especially
with improved phrase-breaking and timing algorithms.  On the other
   hand, most users rejected future uses of <title level="m">RSVP</title>. 
The lower user
ratings and reading speeds may be the result of novelty shock. 
The results are nevertheless encouraging: younger subjects in
particular were very enthusiastic about <title level="m">ADH</title>, and the user study produced many 
suggestions for future improvements and well as possible applications of <title level="m">ADH</title>.</p>
            </div1>
         </div0>
         <div0>
            <head>5. Conclusion</head>
            <p>​<title level="m">ADH</title> is one of the many possibilities inherent in the idea of
actively presented text. Interfaces that attempt to work with the
user in understanding the underlying text would seem to have wide
applicability for reading text of all kinds, from technical papers to
email to biography, particularly in overview reading, such as Adler's
   <term>systematic skimming</term> and <term>superficial reading</term> (van Doren; Adler).
They may offer special advantages to those with reading disabilities,
or for specific tasks, such as proofreading. Our initial investigations
into this technique seem promising, and a number of improvements in
both phrase analysis and presentation timing are already being
investigated.</p>
         </div0>
      </body>
      <back>
         <div type="Bibliography">
            <head>Bibliography</head>
            <listBibl>
               <biblStruct>
                  <monogr>
                     <author>
                        <name reg="Beesley, Kenneth">Kenneth Beesley</name>
                     </author>
                     <author>
                        <name reg="Karttunen, Lauri.">Lauri Karttunen</name>
                     </author>
                     <title level="m">Finite State Morphology</title>
                     <imprint>
                        <publisher>CSLI Publications</publisher>
                        <pubPlace>Stanford</pubPlace>
                        <date value="2003">2003</date>
                     </imprint>
                  </monogr>
               </biblStruct>
               <biblStruct>
                  <analytic>
                     <author>
                        <name reg="Bever, Thomas G.">Thomas G. Bever</name>
                     </author>
                     <author>
                        <name reg="Burwell, Rebecca">Rebecca Burwell</name>
                     </author>
                     <author>
                        <name reg="Jandreau, Steven">Steven Jandreau</name>
                     </author>
                     <author>
                        <name reg="Kaplan, Ronald M.">Ronald M. Kaplan</name>
                     </author>
                     <author>
                        <name reg="Zaenen, Annie">Annie Zaenen</name>
                     </author>
                     <title level="a">Spacing printed text to isolate major phrases improves readability</title>
                  </analytic>
                  <monogr>
                     <title level="j">Visible Language</title>
                     <imprint>
                        <biblScope type="vol">25</biblScope>
                        <biblScope type="pages">75-87</biblScope>
                        <date value="1990">1990</date>
                     </imprint>
                  </monogr>
               </biblStruct>
               <biblStruct>
                  <analytic>
                     <author>
                        <name reg="Castelhano, Monica S.">Monica S. Castelhano</name>
                     </author>
                     <author>
                        <name reg="Muter, Paul">Paul Muter</name>
                     </author>
                     <title level="a"> Optimizing the reading of electronic text using rapid serial visual presentation</title>
                  </analytic>
                  <monogr>
                     <title level="j">Behaviour &amp; Information Technology</title>
                     <imprint>
                        <biblScope type="vol">20.4</biblScope>
                        <biblScope type="pages">237-247</biblScope>
                        <date value="2001">2001</date>
                     </imprint>
                  </monogr>
               </biblStruct>
               <biblStruct>
                  <analytic>
                     <author>
                        <name reg="Chang, Bay-Wei">Bay-Wei Chang</name>
                     </author>
                     <author>
                        <name reg="Mackinlay, Jock">Jock Mackinlay</name>
                     </author>
                     <author>
                        <name reg="Zellweger, Polle T.">Polle T. Zellweger</name>
                     </author>
                     <title level="a">Fluidly revealing information in Fluid Documents</title>
                  </analytic>
                  <monogr>
                     <title level="m">Proceedings of Smart Graphics 2000 AAAI Spring Symposium</title>
                     <imprint>
                        <pubPlace>Stanford University</pubPlace>
                        <date value="2000">2000</date>
                     </imprint>
                  </monogr>
               </biblStruct>
               <biblStruct>
                  <analytic>
                     <author>
                        <name reg="Janssen, William C.">William C. Janssen</name>
                     </author>
                     <author>
                        <name reg="Popat, Kris">Kris Popat</name>
                     </author>
                     <title level="a">UpLib: a universal personal digital library system</title>
                  </analytic>
                  <monogr>
                     <title level="m">Proceedings of the 2003 ACM symposium on Document Engineering</title>
                     <imprint>
                        <pubPlace>Grenoble, France</pubPlace>
                        <date value="2003">2003</date>
                        <biblScope type="pages">234-242</biblScope>
                     </imprint>
                  </monogr>
               </biblStruct>
               <biblStruct>
                  <analytic>
                     <author>
                        <name reg="Just,  Marcel Adam">Marcel Adam Just</name>
                     </author>
                     <author>
                        <name reg="Carpenter,  Patricia A.">Patricia A. Carpenter</name>
                     </author>
                     <title level="a">A theory of reading: From eye fixations to comprehension</title>
                  </analytic>
                  <monogr>
                     <title level="j">Psychological Review</title>
                     <imprint>
                        <biblScope type="vol">87</biblScope>
                        <biblScope type="pages">329-354</biblScope>
                        <date value="1980">1980</date>
                     </imprint>
                  </monogr>
               </biblStruct>
               <biblStruct>
                  <monogr>
                     <author>
                        <name reg="Sicheritz, Karen">Karen Sicheritz</name>
                     </author>
                     <title level="m">Applying the Rapid Serial Presentation Technique to Personal Digital Assistants</title>
                     <imprint>
                        <publisher>Master's
                           Thesis, Department of Linguistics, Uppsala University, Sweden</publisher>
                        <date value="2000">2000</date>
                     </imprint>
                  </monogr>
               </biblStruct>
               <biblStruct>
                  <monogr>
                     <author>
                        <name reg="van Doren, Charles">Charles van Doren</name>
                     </author>
                     <author>
                        <name reg="Adler, Mortimer">Mortimer Adler</name>
                     </author>
                     <title level="m">How to Read a Book</title>
                     <imprint>
                        <publisher>Simon &amp; Schuster</publisher>
                        <pubPlace>New York</pubPlace>
                        <date value="1972">1972</date>
                     </imprint>
                  </monogr>
               </biblStruct>
               <biblStruct>
                  <monogr>
                     <author>
                        <name reg="Walker, Randall C.">Randall C. Walker</name>
                     </author>
                     <author>
                        <name reg="Walker, Stan D.">Stan D. Walker</name>
                     </author>
                     <title level="m">An Introduction to Live Ink Technology</title>
                     <imprint>
                        <publisher>Walker Reading Technologies, Inc.</publisher>
                        <pubPlace>Rochester, MN.</pubPlace>
                        <date value="2001">2001</date>
                     </imprint>
                  </monogr>
               </biblStruct>
            </listBibl>
         </div>
      </back>
   </text>
</TEI.2>