TASK: phase 2 of normalizing TC data

19/01/07

Permalink 10:34:50 am, by sarneil, 120 words, 406 views   English (CA)
Categories: Tasks; Mins. worked: 0

TASK: phase 2 of normalizing TC data

- publication dates at start of text field to be moved (rather than copied) to the publication_date field
- cemetary plots at end of text field to be moved (rather than copied) to the cemetary field
- records with missing publication dates will take the most recent previous publication date rather than the earliest subsequent publication date
- try to identify intances of page numbers at start of records and prepend those with the string "page "
- try to figure out a way to extract compound subject codes from start of record and put them into the subject field with a space delimiter
- compile list of subject codes found in the three data files
- John to ensure that students are using consistent subject codes

Pingbacks:

No Pingbacks for this post yet...

Depts

This blog is for work done for academic departments which does not fall under other categories.

Reports

Categories

September 2014
Sun Mon Tue Wed Thu Fri Sat
 << <   > >>
  1 2 3 4 5 6
7 8 9 10 11 12 13
14 15 16 17 18 19 20
21 22 23 24 25 26 27
28 29 30        

XML Feeds