Workflow for file submitted by LD to ready for transcription and markup. Posted here in case someone else needs to know where these files are.
1 Docx file
1.1) put file(s) into appropriate subfolder of local Franklin/sourceFiles/fromLyle/ (not used any further)
1.2) copy file(s) into Franklin/sourceFiles/toProcess
1.3) rename file(s) as necessary, file name before extension should be identical with value for filenameroot element e.g. FranklinLives
- camelBack
- multiple images get _n appended to end of image filename
1.4) open and edit docx file in word processor
- delete all instances of OPTIONAL and REQUIRED
- replace all instances of [[ with < and of ]] with >
- edit value of filenameroot element as needed
- tidy up any obvious syntax or typo screwups
- copy entire text to clipboard
1.5) save docx file (e.g. FranklinLives.docx)
2 create and populate xml file
2.1) open in oXygen appropriate template from local Franklin/templates folder
2.2) copy text from clipboard after bottom of template file
2.3) save file into local Franklin/svnrepo/data/text/en1Transcribe folder using filenameroot and "_en.xml" e.g. FranklinLives_en.xml
2.3) edit xml to get valid metadata content
- integrate pasted elements into template, delete any elements that are optional and empty; result should be a file whose metadata validates
- if docx file contains body text, move it into body element (won't validate)
- if docx file contains no body text, put instructions in for transcriber where image or pdf file(s) is/are to use for transcription
- if doc has no related documents, delete relatedDocumentList element
- if doc has related documents, insert a relatedDocument element with contents of appropriate filenameroot for each related document
result will be document with invalid body, but valid everything else
2.4) save xml file
2.5) svn add the xml file
3 jpg/pdf files
3.1) if there are any jpg/pdf files associated with this document, rename them with same filenameroot as xml file, followed by "_n" e.g. FranklinLives_1.jpg FranklinLives_2.jpg for 2 jpgs associated with FranklinLives
3.2) upload any jpg/pdf files to the source folder in the mystery1/www-dev/source folder
3.3) move any image/pdf files to the local Franklin/canMysDevSite/source folder
4) archive docx file
4.1) move the docx file to the local Franklin/sourceFiles/sentToTranscribe folder
Result is
- document(s) from Lyle in local Franklin/sourceFiles/fromLyle/
- local folder Franklin/sourceFiles/toProcess contains no files
- renamed normalized docx file in local Franklin/sourceFiles/sentToTranscribe
- xml file in local Franklin/svnrepo/data/text/en1Transcribe folder
- xml file added to svn repo and committed to repository
- jpg/pdf file(s) uploaded to mystery1/www-dev/source on unix.uvic.ca
- jpg/pdf files(s) moved to local Franklin/canMysDevSite/source folder
- any files on my local machine are backed up to my account on the hcmc backup server (rutabaga)
participated in conference call for Franklin team.
site design discussion to be conducted by sub-committee by email and possibly phone.
mention of including line-by-line transcript of audio files
reported briefly on status of document flow, metadata and processing
I need to
- go through files from LD and make sure they're useable
- review notes on setup for SVN then implement and start setting up users
- talk with MF and JL about hiring of transcribers etc.