As part of the work to plan a generic teiJournal Web application, I began the process of creating some sample documents we can use to test and refine our schema and XSLT.
I started from a random document selected from the ACH/ALLC 2005 Abstracts project, and adapted some XSLT from the ScanCan P5 conversion. I generated a schema from ROMA, keeping it quite limited and eliminating a number of elements from some of the core modules. The results is (currently) only 300KB; I think it could be refined further, especially by eliminating some elements from the teiHeader which I'm not actually using, but I want to look more closely at them first; it seems that better options than <classCode> exist for categorizing different types of journal contribution (such as <category>).
Next, I worked on the XSLT. It's XSLT 1.0 (for maximum portability), and I found that making the TEI namespace the default ns for both XSLT files, and switching to Xalan for the transformer, eliminated all namespace problems (such as empty xmlns attributes). I got the transformation working for the first sample file, with only one significant exception. Bibliographical references for online resources were encoded using an <xptr> element, and its crdate attribute was used to encode the "last accessed" date, but crdate seems to have been eliminated from P5. Posted a message to TEI-L about this.
Next, I'll work on a larger group of documents from ACH, including images, tables etc. There is some significant value in doing this anyway; the ACH/ALLC Abstracts project will need to be ported from Mustard to Lettuce when Mustard is rebuilt, and in the process it will have to be updated to work with the new Cocoon/eXist stack. In the process, we might as well do P5 conversions for archive purposes, and make P5 the default XML view of the data. So work on this XSLT would have to be done at some point anyway, irrespective of teiJournal.