Archives for: May 2013

31/05/13

Permalink 04:56:47 pm, by Hannah, 191 words, 11 views   English (CA)
Categories: Activity Log; Mins. worked: 420

May 31

Hello everyone,

I had a good day in UVic Special Collections, working on a list of deserters for the justice theme. When I get this in order, I will go back to Victoria City Archives, and look through their police documents for these names. Before and after special collections closed, I was working on going through the early magazines of University School. After I finish writing all the captions (pointing people to noteworthy pages in each one, since each magazine is 32 pages), I'll upload those all at once. The reason I haven't uploaded anything yet is because I'm waiting for legal clearance to do so. I am also finalizing the list of education-related documents we want from Victoria City Archives, and I will upload that when it is done. I'm still waiting to hear back from the photographer about the pictures of Vic High memorials, but hopefully, that he will respond soon.

Monday morning, I have an appointment with the archivist from Glenlyon Norfolk about the records for Norfolk House School. Much of next week I will spend with the St. Michaels archivist, getting documents from them.

See you on Monday!

Permalink 02:39:28 pm, by mholmes, 2 words, 16 views   English (CA)
Categories: G&T Hours; Mins. worked: 0

MDH: 229 - 1 = 228 hours G&T

Leaving early.

Permalink 02:31:13 pm, by mholmes, 17 words, 9 views   English (CA)
Categories: Activity log; Mins. worked: 120

Ported CodeSharing to Mariage site

It's now running OK. Had to rewrite the XQuery for version 1.0, and tweak various parameters and paths.

Permalink 01:54:36 pm, by Ben, 259 words, 11 views   English (CA)
Categories: Activity Log; Mins. worked: 300

Friday

Hi,

I've spent the day continuing to build my church database and contact them all (or discern which ones should be contacted). I'm now up to 100 contacts for churches and individuals! I'll be continuing to arrange meetings with them and find relevant information through the coming weeks. I've worked most of today at the HCMC and talked to Greg about our dropbox issues. He says he may look into downloading the dropbox app for these computers here so we can use them the same way as our home computer (see below if you missed that). If he does, it will make our work in the lab a bit easier and less likely to mess up our programs. Now if only he could do something about these darn ergonomic keyboards...!

Kirsten and Ashley, in case you for some reason need to cross reference soldiers with church members, I've started a second page in my archive file for honour rolls. As I find more, I will continue to update it with names by church, and include indication of death if it is shown on the original roll. When I meet with the larger archives next week and the week after, I'll look for membership registers that can also be used for cross referencing.

I am cutting out early today, I just found out my house is being sold and we are having an appraiser in on Monday morning, so I've got to (literally) clean house and get everything in order before then. I'll make up the hours in the coming weeks.

Permalink 10:27:52 am, by Ben, 24 words, 14 views   English (CA)
Categories: Activity Log; Mins. worked: 0

Hannah's dropbox

Hannah you'll probably have to download it again with that email or share the folder to the email you use for your other account.
Permalink 09:42:54 am, by Hannah, 48 words, 7 views   English (CA)
Categories: Activity Log; Mins. worked: 0

Dropbox

Because I created a dropbox account for The City Goes to War project under a different email than my personal dropbox account, I can't seem to save the City Goes to War project files on the dropbox on my hard drive. Does anyone know how to fix this?
Permalink 12:10:10 am, by Kirsten, 526 words, 9 views   English (CA)
Categories: Activity Log; Mins. worked: 540

Updates

Hey there, everyone! I can't wait to see all the wonderful things you've found! :D

So I've put a pdf into my Archives folder called Unit List and CEF Guide. The CEF Guide is just a visual I cobbled together from a couple of books and sites, a diagram of how the Canadian Expeditionary Force was organized (ie, what's bigger, a battalion or a brigade? What Division was the 62nd Battery part of?) and lists of all the units active overseas as of November, 1918. This is mostly for my own reference, but I thought I'd share it in case anybody else was interested! The Unit List names and records the activities of militia and CEF units that were mobilized in Victoria, or drew recruits from Victoria, or both, according to Library and Archives Canada. Unfortunately, their records are missing a few details - they do not have places of mobilization or places of recruitment for every unit they describe. The CEF units listed are just those that are recorded in their CEF unit guides as having been mobilized in Victoria and/or formed, at least in part, of Victorian recruits. It goes without saying that not all of the men who joined these units were from Victoria, but there's also a good chance that there were Victorians who signed up with units not listed here. There's got to be a way to find them too, though...

I have to apologize for the sparseness of the information related to the militia units I list - I am honestly not sure how many militia units were in the city during the war years. I couldn't find any lists of wartime militia units from reputable sources, nevermind a list that determined each unit's city of origin. I'm really sorry about this, but I will try to find out more about Victoria's militia once I get back to Victoria and into the collections of the Princess Mary's and the 5th Regiment.

When it comes to tagging, at this stage we could enter every unit as a tag and see how much we find - as I said, some of these units recruited in Victoria but mobilized elsewhere, so it's quite possible that the soldiers left nothing behind but an attestation paper or two before they shipped out. From what I've seen so far, there's a fair bit to be found on the 88th Regiment Victoria Fusiliers, the 30th Battalion, the 2nd Canadian Mounted Rifles Regiment, the Gordon Highlanders/50th Regiment of Foot, and the 5th BC Regiment Canadian Artillery. Tags will have to take into account the fact that some units changed names and designations during the war; we could use slashed tags to indicate this (as in, 48th Battalion/3rd Pioneer Battalion) or just make two separate tags and apply them to every item we have pertaining to both the original unit and the renamed unit. I'll leave the decision up to Ashley and Jim!

If anybody finds any problems in the unit list or CEF guide, please tell me! I'll be tweaking them as I learn more, and would be grateful to know of any mistakes I've made!

30/05/13

Permalink 05:26:06 pm, by Hannah, 220 words, 11 views   English (CA)
Categories: Activity Log; Mins. worked: 450

May 30

I had a great day! The Victoria City Archives doesn't have detailed finding aids, so you have to ask the archivists to retrieve a lot of the collections, and go through them yourself to figure out what is in there, which is extremely enjoyable. I was there today going through personal records which contained photos of schoolchildren, and I finished off searching their city records relating to education on the online catalogue after they closed. I believe I have a complete list of education documents/photos from the city archives, so I will upload that list to dropbox tomorrow. First, though, I am going to UVic special collections to get a list of deserters from the 88th, so I can look them up in the police records for the 'justice' theme. I have also been going through the early yearbooks for University school, and writing captions to point future archive visitors to the most interesting parts. I will try and finish those tomorrow and put them on dropbox as well. I also contacted the photographer who took the really outstanding shots of the war memorials at Vic high and put them on flickr to ask for his permission to put them on the archive, so hopefully he gets back to me soon. Ben, I will email you his contact info.

Permalink 05:05:42 pm, by mholmes, 13 words, 8 views   English (CA)
Categories: G&T Hours; Mins. worked: 0

MDH: 228 + 1 = 229 hours G&T

Trying to figure out why MoEML is running some operations at glacial speed...

Permalink 05:03:39 pm, by mholmes, 6 words, 9 views   English (CA)
Categories: Activity log; Mins. worked: 60

Fixes to presentation and rehearsal

All ready for next week now.

Permalink 05:03:08 pm, by mholmes, 63 words, 9 views   English (CA)
Categories: Activity log; Mins. worked: 240

Firefighting...

Spent most of the day cleaning up the database to remove duplicate copies of files, fix duplicate ids, and similar infelicities which might lead to the slowdown we're seeing in performance. Problems have gone away with regular files and searches, but the Stow 1598 is still proving to be a killer, and we're working with RE to figure out why it's taking so long.

Permalink 03:48:13 pm, by Hannah, 48 words, 8 views   English (CA)
Categories: Activity Log; Mins. worked: 0

Rotary

Is anyone interested in following up with the President-Elect of Rotary? This doesn't fall under my two themes, but I will if Ben and Ashley are too busy.

Hannah: I'll do Rotary, but probably after I finish with the churches. The contact info would be greatly appreciated! -Ben

Permalink 12:37:16 pm, by Ben, 22 words, 9 views   English (CA)
Categories: Activity Log; Mins. worked: 0

Archives Permission?

Hi, I'm adding photos to my archive folder from the Saanich Archives online site. Do we have permission to use them yet?

Permalink 10:55:37 am, by Ben, 531 words, 13 views   English (CA)
Categories: Activity Log; Mins. worked: 930

Dropbox update and May 29

Ashley and I met yesterday to figure out the proper workings of the Dropbox and Excel spreadsheets and as she mentioned it looks like it is working well now. I did some final troubleshooting, reading and experiments with it today and came up with this suggestion in case people haven't done so yet:

Go to the Dropbox website and find the download button in the top right. If you download a dropbox folder, you have access to all your files right on your computer. This is handy not only for easier navigation, but it also works for saving items as if they were on your own hard drive (open a file from the dropbox folder on your computer, simply click save when you're finished, and it returns automatically to the dropbox folder). This is also handy for updating files that only you are working on because (contrary to what I thought yesterday Ashley) it works even when not connected to the internet. How? Well it saves the files on your computer and then automatically updates when next connected. This saves a lot of the issues we were having with multiple copies of one document or having things both on computers and online. Hope this makes sense!

As for me, I was with Hannah at St. Ann's in the morning, met with Ashley about the Excel sheet and to discuss archive requirements in the afternoon, and dropped by the Anglican Diocese archives in preparation for my visit on Monday. I was surprised to find out the Christ Church Cathedral is actually a newer building (1929) and that most of their artifacts are from the Second World War. So the archives will be my primary source for that congregation and the others from the 1913-1919 period. When I was there I ran into a city counsellor who was very interested in our project and wants to be kept up to date. Networking! Today as I wait to hear back from churches again, I'm updating my archive folder and continuing background research on the churches so I have a better picture of what to look for in the archives and what Victoria's faith community looked like in 1913. My minutes are for both days because I have to run to my other job right at five today.

Ashley: Was the person you networked with Julie Cormier? I don't know if you were there early enough to meet her. I met with her in the morning and she is very interested in sharing some of her information (her society is in the midst of creating an early-Victoria church walking tour and we will be meeting in a few weeks once that has calmed down for her).

Kirsten: Your Excel sheet is going to be a bit different from everyone else's because you are using a Mac. Let me know if you're having any problems with it, I've left it as is and will take a look at it on your computer. The best thing for us would be either getting Excel for Mac or a program that can save files in Excel for PC format or Ashley may have problems keeping yours up to date/transferring information.

29/05/13

Permalink 09:40:38 pm, by Ashley, 535 words, 15 views   English (CA)
Categories: Activity Log; Mins. worked: 690

May 29 and 30

I've made us each a separate spreadsheet in the dropbox folders for each of us. We had multiple versions of the combined spreadsheet in dropbox yesterday and I was worried about loosing data becasue more than one of us is making changes at the same time.

I've also updated the tags in each spreadsheet. I think that the tags should be thematic rather than descriptive of the document. For example, I've removed photograph and document. That information should be in the description section. I'm envisioning that the tags will be used to organize the archive. So students will follow from one document to the next via a tag. A student may want to follow the Oak Bay connection from one photo to the next but isn't likely to follow a connection between two photos because they are both photos.

If you are adding new tags to your sheet, think about the scope. For example, I've used religion rather than church to keep the number of tags to a manageable size. I will give everyone time to visit a few archives and add some documents, so that we can see if we need to add more tags, and in a week or two we can finalize the list.

In the meantime, if you add new tags to your sheet, please tell us in your blog post so that we can add it to our own sheets. I want to avoid losing data becasue there are multiple people working on the same sheet. Kirsten and I have been discussing more descriptive military tags and she will have those for us soon.

Don't feel like you have to fill in all 4 tags. The spaces are there in case you need them, but aren't all necessary.

I've also been talking with Special Collections and library administration about the possibility of using their equipment to digitize documents that some of the municipal archives are unable to digitize. I would like to hear back about this before the meeting I am planning with the archivists from the municipal archives in two weeks.

We've set a tentative date and time to meet with the archives as June 12 at 10am. I'll be calling the archives tomorrow to invite them and arranging for the room.

I joined Ben and Hannah at the Catholic Legacies conference in the afternoon for a tour of the St. Ann's Archives. It was great to meet their archivist Carey and I also spoke with the chair of the Friends of the Sisters of St. Ann's Academy, who said that she would be happy to help us in anyway she can. I will send her an email tomorrow morning explaining a bit more about the project.

This evening and tomorrow morning I will be working on defining my 2 themes for the website so that I will have them to discuss at the meeting on Monday.

*The hours are for today and for the few hours I will work tomorrow morning. I am working for the Congress for the next week so I won't be working on the project much. I will keep up with reading the blog and feel free to contact me with an questions or concerns you have.

Permalink 07:17:27 pm, by Hannah, 634 words, 18 views   English (CA)
Categories: Activity Log; Mins. worked: 900

May 28 and May 29.

I will update the write-up I am keeping for myself about what I am doing in dropbox, but here are the highlights for the past two days:

I visited Victoria High, where, as I believe someone mentioned in our meeting, they are doing a website about the school during WWI. They seemed worried that we would steal their thunder if we used some of their archival material, but I told them that this was the last thing we wanted to do. I said that if Jim permitted it, we would put some of the most interesting stuff they have on our archive, credit them as a source, and instead of a 481 student doing a microhistory on Vic High, link to their website. I was given a tour of their archive. It contains, first, detailed information on Vic High alumni who served in WWI, most of it borrowed from the Commonwealth War Graves Commission, but supplemented with their own research. They have one soldiers’ personal photo albums, and records pertaining to alumni Bobby Powell, who was a Canadian tennis star, and who might make an interesting feature on our website. They have class registers, provincial exam results, registration cards from 1916, and The Camosun, the school paper from the war years. They also have some of the principal’s correspondence. They do want to work with us though, and will apply for formal permission from the alumni association, who owns the archival material. In addition, they are extremely enthusiastic about being a pilot school for the educational package, and I promised to put them in touch with Jeremy as soon as he gets here. I also took photos of their war memorials, which I will put on the spreadsheet as soon as Ben and Ashley create the drop-down menu. A few years ago, the archivist had a professional photographer photograph all their memorials. She gave me some information on him, and I am going to try and track him down to give us permission to use the images.

I met with the archivist from St. Michael’s University School, and she has arranged for me to come back next week. She will share with me photos she has done of their school trophies from that time period, and the list of names she has of people who left St. Michaels and served in the war. She has also given me permission to borrow digital copies of the School’s yearbook “the Black and Red,” which is on their website, and put it on our archive. The Old Boys’ column in “Black and Red,” tracks the alumni serving, and she has it from the WWI years. She has also agreed to let me photograph school clothing from that era, and to let me scan the principal’s diary from the time period.

Since our meeting, I have been going through Victoria City Archives, and looking at detailed finding aids, collections, and what not, figuring out exactly what we will want to put on the archive, and then I will give the list to Jim.

I attended the UVic Catholic Legacy in Victoria Symposium, and got some background information about the Sisters of St. Ann, which will be helpful in the education/medical section. Ashley and I have agreed that when I am visiting the Sisters' archives, I will get the information for her about the medical side of things, because she is supervising, and this will save her some time.

I have been in contact with the School District archivist, and also the Old Cemeteries Society. I spoke to them about sharing what information they have on soldiers buried in Ross Bay for our soldiers database, and they are going to talk to their research committee, and call me back.

The minutes are for yesterday and today.

Permalink 03:58:07 pm, by mholmes, 34 words, 11 views   English (CA)
Categories: Activity log; Mins. worked: 120

Changes to schema and fixes for METR1 and TRIU1

Fixed some CSS errors in the METR1 and TRIU1 files, which were generating invalid CSS in the redesign project. Also fixed a bunch of old-style uses of @rend in TRIU1, and some XSLT bugs.

Permalink 03:56:42 pm, by mholmes, 48 words, 7 views   English (CA)
Categories: Activity log; Mins. worked: 30

Fixed some XSLT bugs, and added the MolSortComparator to the project

The MolSortComparator.jar file was missing from the web application on Peach, and suddenly this began to matter, as all pages in /site/ started failing on the XSLT transformation because of it. I've now imported the Java codebase for this into the MoEML SVN and added the library.

Permalink 03:54:55 pm, by mholmes, 11 words, 16 views   English (CA)
Categories: Activity log; Mins. worked: 90

Prepared presentation for Monday

Met with CP and prepared the presentation for next Monday morning.

Permalink 11:44:47 am, by esaint, 40 words, 12 views   English (CA)
Categories: Activity log; Mins. worked: 240

Update from ES on May 29, 2013

1. ES added the xml files for fraf7, cltf5, pscf6.
2. ES added transcripts for fraf7, cltf5, pscf6
3. SA uploaded all the latest changes to the website and all seems to work fine. Thumbnails have to be created for fraf7, cltf5, pscf6.

Permalink 10:48:30 am, by sarneil, 130 words, 14 views   English (CA)
Categories: Activity log; Mins. worked: 30

Changed group settings on Moses EMLS and Scraps blogs

I have a bunch of new users to add to the system and make members of the CanMys blog. When I added the first and logged in as him, I noticed that when I clicked on the Posts button, I was allowed to see posts from the Moses, EMCS and Scraps blogs. Turns out those three blogs had group permissions which made all users of type "Blogger" members and with permission to upload to the media folder, but do nothing else. Those three blogs also have individual users with specific permissions.
I disabled the is-member checkbox for Bloggers in the group permissions for each of those three blogs. That automatically disabled the upload checkbox. There should be no change of behaviour for specific users that are members of those blogs.

Permalink 10:16:44 am, by mholmes, 7 words, 15 views   English (CA)
Categories: Activity log; Mins. worked: 90

Review of white paper for MVP

Reviewed the second paper on topic modelling.

Permalink 10:03:39 am, by jim, 51 words, 161 views   English (CA)
Categories: Announcements; Mins. worked: 0

Update Hours

Please ensure that you have updated activity and hours for the end of the month. I will run a report on the weekend for us to review at our Monday meeting. My general practice will be to produce a monthly activity and hours summary for review by the project steering committee.

28/05/13

Permalink 05:58:25 pm, by mholmes, 12 words, 16 views   English (CA)
Categories: Activity log; Mins. worked: 120

Worked on CodeSharing presentation

Putting some graphics together for the presentation and creating the first slides.

Permalink 05:57:49 pm, by mholmes, 4 words, 12 views   English (CA)
Categories: G&T Hours; Mins. worked: 0

MDH: 226 + 2 = 228 hours G&T

More work than time...

Permalink 05:22:54 pm, by mholmes, 229 words, 13 views   English (CA)
Categories: Activity log; Mins. worked: 270

Meeting and work arising from it

Team meeting resulted in these things:

  • Check out the best way of encoding such things as headings not present in the source, but supplied by the editor, and figure out how best to provide resp functionality for this.
  • Inline citations using <ref> will become common. I see no reason why they shouldn't work out-of-the-box, but it needs testing.
  • The credits pages will include a manually-encoded and -managed page for Team, one ditto for Team Alumni, and a third page which is automatically generated, for Contributors. This should list Contributors by Name and Contributors by Role, and should link each name to a full bio. Roles are defined through the Marc Relators codes.
  • Each contribution role for each person deserves its own <respStmt> in the header. This allows JJ to put the entire <respStmt> list into order of precedence.
  • Team members' photos should be linked into their bios, and the rendering pipeline should provide for rendering modes which include or exclude the photo. We'll need to figure out encoding strategies then write rendering code.
  • FIXED: The missing.xml 404 page wasn't working in the old site design.
  • FIXED: The use of <hi> for rendering instructions in born-digital documents should be avoided in favour of semantic tagging. I implemented handling for <label> to deal with a specific case in prepare_transcription.xml.
Permalink 05:13:25 pm, by mholmes, 35 words, 42 views   English (CA)
Categories: Announcements; Mins. worked: 20

RG7 G8C Vol 10 page images added to the Colonial Despatches collection

750 page images for RG7 G8C Vol 10 (in three different sizes) have been added to the collection. These cover the British Columbia 1862-63: Despatches from London. These will now be linked into the transcription documents.

Permalink 04:50:08 pm, by Ashley, 255 words, 14 views   English (CA)
Categories: Activity Log; Mins. worked: 420

May 28

Hello all,

This morning I met with the volunteer from Oak Bay archives that is compiling a list of WWI veterans. He is interested in working together to compile the list so that we can share the information. Caroline Duncan from Saanich Archives was at Oak Bay today and we tentatively planned a meeting with the archivists and main volunteers from the municipal archives in Greater Victoria for June 12. I'm working on making contacts at the UVic library so that we can offer to digitize some materials for Saanich (and possibly other archives who don't have the resources to digitize larger items). I'll keep you up to date on that planning.

In the afternoon I met with a contact from the View Royal Archives who provided me with photos of her father, who was a POW held in Germany for 3 years. She also has an oral history that I now have the transcript for. The audio file is held in the Canadian War Museum so I am going to be in contact with them to see if they have digitized the file from the cassette form. If not, we can include the transcript or see if we are able to digitize it in the library. I'll upload those to dropbox tomorrow.

I've also been contacting band offices today. I haven't had much luck yet but I'm waiting to hear back from some offices.

I uploaded my contact list into my folder on dropbox. Feel free to use any of the contacts if you need them.

Permalink 04:39:40 pm, by Ben, 184 words, 17 views   English (CA)
Categories: Activity Log; Mins. worked: 500

Field Trips Rock

Today was fun, went out again to see some churches "in the field." First I visited St. Stephen's out in Saanichton (a beautiful church if anyone wants a photographing spot) and the rector gave me a free copy of their church history book. Their cemetery had some good grave stones and they had a rather large and detailed honour roll which will be very handy for cross-referencing. Then I stopped in for a visit with the rector of St. Mary's who had some amazing things to show me. There are plenty of monuments to look at and the colours of the 88th regiment which are almost falling apart. She had photocopies of church histories and the personal story of the Rev. Andrews who served overseas. She also has some info on a woman's wood carving guild which she said was very special in the community back then.

Spent the afternoon building up some photos in the archive, hope they work! Hannah, I'll be joining you tomorrow, hopefully they got my registration email! HCMC is kicking me out now, see some of you tomorrow.

Ben

Permalink 10:02:10 am, by sarneil, 161 words, 12 views   English (CA)
Categories: Activity log; Mins. worked: 180

undeploy and redeploy corrupted webapp

In the kerfluffle last week with the eXist server and contained webapps, the Francotoile webapp was somehow corrupted. After an hour or two, we got the instance going again, but then discovered that the password for the admin client no longer worked, so we wouldn't be able to update the webapp. Solution was to replace the instance of the webapp on the server with a copy of it on my local machine.

Basic procedure to replace a corrupted instance of a webapp e.g. francotoile
- log in as tapor to tomcat manager on server (peach)
- undeploy webapp
- go back in browser to safe URL (one without undeploy instruction in it)
- ftp in as hcmc to server (peach.hcmc.uvic.ca)
- cd up and down to /usr/local/tomcat-instances/devel/webapps/
- delete old folder
- upload new folder (same name as old folder)
- refresh webapp listing in tomcat manager
- app should appear, click deploy

Permalink 08:43:53 am, by Hannah, 50 words, 11 views   English (CA)
Categories: Activity Log; Mins. worked: 0

Catholic Legacies Talk

I'm going to this tomorrow, because they give you a lecture on the history of St. Ann's, and a tour of the school house and archives. I recommend registering if you plan on going, because then you get free lunch. http://web.uvic.ca/~predigit/files/28-29%20May%20program.pdf
Permalink 07:51:05 am, by Hannah, 79 words, 8 views   English (CA)
Categories: Activity Log; Mins. worked: 0

Contact

Also, I found a good contact regarding Social Services. On Saturday, I stumbled across the 100 year anniversary party of rotary. I spoke to the president elect, and he told me they do have archives, and that a gentleman named Stu McGowan has just finished combing through them to write a history of this organization. I have his personal contact info, so if anyone is interested, text me. If you guys feel too busy, I can follow up on this.
Permalink 07:43:13 am, by Hannah, 175 words, 13 views   English (CA)
Categories: Activity Log; Mins. worked: 660

May 27

Hi everyone,

I found a place on campus to borrow a digital camera yesterday, so I could get started. I then headed down to Victoria City Archives. I had a different archivist than before, and they told me that I should not take photos of anything until they have worked out a licensing agreement with Jim. Instead, I was instructed to go through absolutely everything, make a complete list of everything I would want to scan or photograph, and then submit it to Jim with the paperwork so he could arrange for the payment of fees and what not. So, I am just going through finding aids and inventories, looking at artifacts, and deciding everything that I will photograph right now. I may put this on hold for today though, if I can get a hold of the archivist at Victoria High. They only come in on Tuesday, so I might go down there and check their collection out. At 4 pm, I have an appointment at St. Michael's.

The minutes are from Friday and yesterday.

Permalink 07:42:17 am, by Hannah, 168 words, 8 views   English (CA)
Categories: Activity Log; Mins. worked: 0

May 27

Hi everyone,

I found a place on campus to borrow a digital camera yesterday, so I could get started. I then headed down to Victoria City Archives. I had a different archivist than before, and they told me that I should not take photos of anything until they have worked out a licensing agreement with Jim. Instead, I was instructed to go through absolutely everything, make a complete list of everything I would want to scan or photograph, and then submit it to Jim with the paperwork so he could arrange for the payment of fees and what not. So, I am just going through finding aids and inventories, looking at artifacts, and deciding everything that I will photograph right now. I may put this on hold for today though, if I can get a hold of the archivist at Victoria High. They only come in on Tuesday, so I might go down there and check their collection out. At 4 pm, I have an appointment at St. Michael's.

27/05/13

Permalink 05:11:46 pm, by Ashley, 13 words, 10 views   English (CA)
Categories: Activity Log; Mins. worked: 0

Hisory Pin

I just found this. Have a look a Victoria! www.historypin.com/map
Permalink 05:03:11 pm, by Ashley, 216 words, 8 views   English (CA)
Categories: Activity Log; Mins. worked: 420

Monday, May 27

Hello all,

I've updated the spreadsheet for the documents we're collecting and saved it in the "Archive" folder on Dropbox. I've also made us each a folder there to upload documents. Make sure that all documents uploaded to your folder are entered into the spreadsheet and labeled with the reference number you've assigned it. This could become a mess if we don't keep things organized from the beginning. If you want to add a new tag, or change an existing one, make sure that you add it in all the sheets so that the tags are consistent. It will probably be easier for me to make the tags more board in the future, so if you're not sure about a tag, it is better if it's more specific.

I also went to the View Royal Archives today. One volunteer has compiled a book about her father who was held in Germany as a POW for 3 years. I'm in contact with her sister to get digital copies of the oral histories, letters, and photographs she used.

I've also been in contact with the Central Saanich Archives and they are keen to be involved.

In the afternoon, I started to call the local band offices. I have a few good leads and I will follow them tomorrow!

Ashley

Permalink 04:52:24 pm, by Ben, 214 words, 16 views   English (CA)
Categories: Activity Log; Mins. worked: 0

First of the Private Collections

Hi,

I had a field trip day today, going out to Sooke and Colwood which garnered all the names of Soldiers from Jordan River to 17 Mile House (98 soldiers) including which ones died, as well as checking out some grave sites and getting contact info for historians at the Sooke Museum and Colwood Historical Commission who have each written books about the area during our time period. In Colwood I met with Dick Emory whose father served as a signalman in France with the 88th and was wounded there. He had some great artifacts including the original discharge papers, his father's pay book, and the telegrams sent to his family when he was injured. He also had some good photographs. He'll keep me informed of further finds and any developments from the Historical Commission. My plans for the rest of the week include a date with some regimental colours, checking out some local church honour rolls, and possibly visiting Pearkes' grave. I've been in contact with all the Anglican churches now, so I'll start moving into the other smaller-number denominations this week too.

(I was going to post a photo of what I found today, but my computer won't read my camera card at the moment, so I'll get back to you on that one...)

Permalink 04:49:29 pm, by mholmes, 22 words, 9 views   English (CA)
Categories: G&T Hours; Mins. worked: 0

MDH: 225 + 1 = 226 hours G&T

Trying to get a couple of presentations for July prepared before Congress, so we can concentrate on the big MoEML one afterwards.

Permalink 03:57:42 pm, by Kirsten, 423 words, 8 views   English (CA)
Categories: Activity Log; Mins. worked: 1140

Updates

Hope everyone had a great weekend! Thank you for the advice about posting images, Greg! I'm sorry for flooding the blog with huge unwieldly pictures!! Regarding the organization of our themes, I was thinking something like this for mine...
If we want a more general main menu, we could start with Victorians in the Armed Forces... if we'd prefer more specific main menu options, we could just go with The Army, The Navy, etc. I've dotted the line for Victorians in the Air Force until I know what sort of material I can get on the subject... but there's an expert on WWI aviation at the BCAM, and he's going to contact me when he gets back from a vacation in early June! Hopefully he'll have plenty to share with us about Victoria's flyers. The items in smaller text are just possible subthemes - not an exhaustive list of relevant topics, by any means - based on the material I've seen so far. I included the smaller titles on each theme ("On the Sea", etc) as an option - I'll leave that up to Mr. Kempling and Ashley!

The minutes noted below are from Friday, this weekend, and today. I'm still working on a list and lineages of units that were active in Victoria during the war. I was just going to use it for my own reference, but would something like that be of interest to anybody else? I can stick it in the Dropbox if so!

My Dropbox folder has some of the attestation papers and photographs I've found so far - personal favorites include the 30th Battalion parading past the Legislature and swarming over the SS Mary's rigging. Research plans for the week include a visit to the Bay Street Armory, 13 oral histories available online through UVic's special collections, and finishing off a list of items I'll hunt for at the BC Archives.

Unfortunately, I probably won't be able to get to the Maritime Museum or into the hard-copy Special Collections until the end of the week. My grandfather had a heart attack on Friday, and they moved him down here from Nanaimo yesterday. He's heading into open heart surgery tomorrow morning. It won't be his first, and he had surgery for circulation problems in his lower intestine barely a month ago. My dad and aunt are coming down this afternoon, and staying with me tonight. I will try to get some work done here and there... but I'll be spending as much time as I can with my family.

Permalink 03:44:08 pm, by mholmes, 20 words, 16 views   English (CA)
Categories: Activity log; Mins. worked: 120

Working on the Dates presentation for July

Made more progress with the graphics and presentation for the Dates talk at DH. I think I'm about half-way through.

Permalink 12:34:18 pm, by Greg, 14 words, 10 views   English (CA)
Categories: Documentation; Mins. worked: 0

javaws not running jnlp files

To fix this problem, run
sudo update-alternatives --config javaws
and choose the java 7 version

Permalink 10:36:50 am, by mholmes, 12 words, 15 views   English (CA)
Categories: Activity log; Mins. worked: 90

Review of white paper for MVP

Reviewed and commented on the first of two white papers for MVP.

24/05/13

Permalink 01:28:16 pm, by mholmes, 2 words, 14 views   English (CA)
Categories: G&T Hours; Mins. worked: 0

MDH: 227 - 2 = 225 hours G&T

Leaving early.

Permalink 01:16:37 pm, by mholmes, 13 words, 15 views   English (CA)
Categories: Activity log; Mins. worked: 40

Diagram for DH presentation

Created an SVG map for use in the presentation on dating in July.

Permalink 01:15:58 pm, by mholmes, 71 words, 18 views   English (CA)
Categories: Activity log; Mins. worked: 60

TRIU1 cleaned up

Created XSLT to add long s to transcriptions, based on previous work on Stow, and ran it on TRIU1. Note to self: it needs to exclude editorial notes. Also did a lot of semi-manual cleanup of encoding in the document, ready for KMF and ZV to start work on it. Noticed a lot of remaining @rend attributes; I've now added a Schematron warning for those, so people convert them to @style.

Permalink 10:00:08 am, by mholmes, 8 words, 18 views   English (CA)
Categories: Activity log; Mins. worked: 60

TEI ticket work

Another mockup for the Guidelines TOC page rewrite.

23/05/13

Permalink 05:28:14 pm, by Hannah, 63 words, 16 views   English (CA)
Categories: Activity Log; Mins. worked: 510

March 23

Hi everyone, I had a great chat today with the archivist for the Sisters of St. Ann's, and an interesting visit at City Archives. Unfortunately, the archivist for St. Michael's School forgot my appointment and went to Vancouver today, but I will likely meet with her tomorrow or early next week. I have also been in contact with other schools. More details tomorrow!
Permalink 04:21:36 pm, by Ben, 259 words, 25 views   English (CA)
Categories: Activity Log; Mins. worked: 330

Invitations

Hiya, I spent most of my day on the phone, slowly going through my list. It seems most of the Anglican churches have office hours early in the weeks, so I didn't get a hold of too many, but I have been invited to the Colwood Historical Association meeting on Monday as well as to Dick Emory's house to see his private collection of artifacts and newspaper clippings from when his father served and was wounded in France! That is probably the most exciting, although I located many churches from the 1910's and learned a bit about the Anglican mission in the 1870s from a very chatty Rector's Assistant. I also located the church where Pearkes is burried and another whose reverend served with the 88th Regiment, Victoria Fusiliers. All churches seemed interested in circulating a poster in the coming weeks.

Another vital piece of information is that the Anglican archives close for July and August, so I'm hoping to arrange multiple visits but may want someone to join me for some of them as those archives hold a lot of info and are only open two half-days a week. I will keep you posted. I guess that takes away all my show and tell for tomorrow, but I'll see you all then!

Ben

PS: because I'm contacting so many groups, I've made a new email for myself just for CGTW. If you want to make this your primary contact info for me it will help me keep things all in one place. It is ben.cgtw@gmail.com

Permalink 03:34:54 pm, by mholmes, 40 words, 16 views   English (CA)
Categories: Activity log; Mins. worked: 30

XSLT to enable faster searches

Used XSLT to add an @n attribute to all paragraphs holding the @xml:id of the preceding or first-child milestone element, to enable faster searches on p tags instead of ranges between milestones, while still returning the correct target milestone.

Permalink 03:32:51 pm, by mholmes, 9 words, 20 views   English (CA)
Categories: Activity log; Mins. worked: 40

TEI ticket work

Created another mockup of proposed new Guidelines TOC page.

Permalink 03:32:18 pm, by mholmes, 15 words, 18 views   English (CA)
Categories: Activity log; Mins. worked: 30

Lab testing for DHSI

Tested Macs in A103 to make sure no memory problems etc. doing large transformation exercises.

Permalink 03:31:31 pm, by mholmes, 10 words, 19 views   English (CA)
Categories: Activity log; Mins. worked: 60

More work on CodeSharing

Fixed some bugs and cleaned up some XQuery and XSLT.

Permalink 03:31:01 pm, by mholmes, 31 words, 16 views   English (CA)
Categories: Activity log; Mins. worked: 120

New MoEML app deployed

Took several restarts of Tomcat and various apps, then intervention by sysadmin to increase the number of files a process can open; now we have much higher speed on all apps.

Permalink 03:04:27 pm, by Ashley, 72 words, 11 views   English (CA)
Categories: Activity Log; Mins. worked: 360

Thursday, May 23

Hello all, I had a very exciting afternoon at the Oak Bay archives! I'll tell you all about it tomorrow. I made a great contact that would like to work cooperatively with us. She is the archivist for both Oak Bay Archives and Saanich Archives. I've done a bit of thinking on the organization of the digital archive and I'm excited to talk to you all about it tomorrow. See you at 9.
Permalink 12:37:59 pm, by Greg, 210 words, 23 views   English (CA)
Categories: Activity Log; Mins. worked: 0

Inserting images in to blog posts

You may notice that when you add images to a blog post it tries to display them at full size, sometimes cropping an edge. To make them easier to view, here's a trick.
After adding the image to a post, look at what the blog engine dropped in to your post editor. It looks like this:

<div class="image_block"><img src="http://hcmc.uvic.ca/blogs/media/blogs/cgtw/poster4.jpg" alt="" title="" width="987" height="1281" /></div>

The width and height attributes are representative of the pixel size of the image. We can adjust them to make it fit a little better by fiddling the numbers. If we reduce each number by, say, 50% we end up with this:

<div class="image_block"><img src="http://hcmc.uvic.ca/blogs/media/blogs/cgtw/poster4.jpg" alt="" title="" width="494" height="640" /></div>

Notice that these are rounded to the nearest whole number. If you try to keep image no wider than about 400 or 500 pixels they'll look better in the blog. Also, please note that this does NOT change the size of the original image. It ONLY changes the display size in the blog. Right-clicking and saving will store a full-size version.

Permalink 07:34:51 am, by Hannah, 324 words, 24 views   English (CA)
Categories: Activity Log; Mins. worked: 480

First Day in the Archives!

Hi everyone!

Sorry I did not blog at the end of the day. I started off searching BC Archives' collections for material on police/court activity and war resistors, and then I went down there to talk to the archivist in person. Unfortunately, they told me there would be numerous legal challenges involved in accessing some of the material. I will report what they said in more detail on Friday, but for now I have a basic list of what is open to us. I then looked at what BC Archives has for education, and found some useful school records, some oral histories with Victoria high alumni, and other material. While I was there, I also looked at the music scene in Victoria during our time period so that we might have some audio clips for the website. BC Archives has concert programs from various musical societies, and piano sheet music about Victoria, by Victoria composers and published in Victoria. If we can't find any recordings, I could always record myself playing it and put the clip on the website.

Then I checked out the legislative library, and talked to the lovely reference librarian about what records pertaining to education and the provincial government are there. I think I have an almost complete list of what public schools were in operation during the time period, based on a masters thesis on microfilm she showed me. Today, I am going to Victoria City Archives first. I was hoping to find school board minutes yesterday, although these may not give us the interesting stories we are hoping to feature, so I will keep looking for other material. I have an appointment with the archivist at St. Michael's. I spoke to her on the phone yesterday, and it sounds like they have great records on the school's veterans. I am going to call all the old private schools, and Victoria High, and talk to them about records.

22/05/13

Permalink 05:37:41 pm, by Ashley, 124 words, 21 views   English (CA)
Categories: Activity Log; Mins. worked: 480

Wednesday, May 22

I've spent the day looking for municipal and community archives. So far my list is:

Sooke Region Musuem
Metchosin Museum Society
Esquimalt Municipal Archives
View Royal Community Archives
Oak Bay Archives
Saanich Archives
Goldstream Archives
Sidney Museum and Archives

It seems that Langford does not have an archive, as much as I've looked for it. Let me know if there are any I've neglected! I have finding aids for some of these and the rest I'll be calling tomorrow. I've also found some useful material in the BC Archives for medical history. Tomorrow I'll be following up with the the Chinese Presbyterian Church and calling local First Nations bands to see if they would be interested in advertising in their newsletter or mailing list.

Permalink 04:01:13 pm, by Ben, 157 words, 18 views   English (CA)
Categories: Activity Log; Mins. worked: 300

Making a list...

Wow, things are really kicking off! Love the posters and excited to hear what everyone else found.

As for me, I've started my contact list of churches and social organizations (did you know there are 162 places of worship in this town?) and will be cold-calling them tomorrow. I will be starting with the organizations I know existed 100 years ago, and then as the project moves forward I will start reaching out to newer groups to see if their members have other information - which will be greatly aided by those posters. The things I will look for first are if the groups have lists of members who served, members from the time, any archival materials, and monuments, and from there I will build a list of places worth visiting.

Wish I could join you at the air museum, have a great time. I'm signing off early today to enjoy a birthday dinner, see you all Friday!

Ben

Permalink 03:08:06 pm, by Kirsten, 125 words, 27 views   English (CA)
Categories: Activity Log; Mins. worked: 360

Field Trip Tomorrow!

Hi there, everyone! Going to be heading out to the BC Aviation Museum tomorrow - thanks for the tip, Ben! It looks like there's a heck of a lot of wonderful stuff to see up there. I'm really excited for the Scottish Regiment Museum and 5th BC Artillery Regiment Museum, too, but those might have to wait for another day! Unfortunately, the CFB Esquimalt Naval and Military Museum insists on charging for anything copied out of their collection... which, according to the employee I spoke to, doesn't have very much dating from the period we're interested in. Maybe worth a look once the more promising locations have been searched... oh well. Here's a few more poster ideas! Right click, "View Image" to see full size :)
Permalink 03:01:52 pm, by jim, 37 words, 18 views   English (CA)
Categories: Activity Log; Mins. worked: 0

Meeting Location for Friday 24 May

We will meeting in Cle B215 at 9:00 am. 9:00 - 10:30 Review findings from Wed and Thu 11:00 - 12:00 Confirm timeframe and general approach 1300 - 1500 Develop initial workplan (See dropbox under Project Planning) 1500 - retire to Grad House for drinks
Permalink 02:47:43 pm, by mholmes, 68 words, 15 views   English (CA)
Categories: Activity log; Mins. worked: 120

eXist trunk bug reported and fixed

My contention about the change to docUtils.java having caused a regression which broke relative paths for the doc() function was borne out after I changed the file and rebuilt. Reported the bug formally on the bugtracker, and it is now fixed, so I have a fresh trunk build of eXist ready to go for MoEML. I'll deploy this first thing tomorrow before anyone else gets to work.

Permalink 02:04:38 pm, by skell, 667 words, 20 views   English (CA)
Categories: Activity log; Mins. worked: 20

More on inferred glosses

I am posting this exchange about inferred glosses so that I don't have to think it through all over again in the future!



SMK wrote:

Regarding the search engine, I blogged on 12/12/12:

"ECH's goal for the search engine in the web database is that, if a user searches for "fat", s/he will get results including fat, fatten, fattening, fatty. Our current settings, and our policies for adding inferred glosses, seem to be accomplishing this nicely. An entry which has "fatty" in its def is found by a search for "fat", because it also has an inferred gloss "fat". Searching for "fat*" also returns defs including fat, fatten, fattening, fatty ... but also fatal, fathom, father."

However, we also noticed the converse on 16/04/13:

When I searched for the inflected form “fired”, I also I got all the entries with “fire”.

BUT when I search for “fatty” or “fatten”, I don’t get all the entries with “fat”. What is the difference here?



MDH replied:

I think you're just discovering that a stemming analyzer is not an educated human. It doesn't understand semantics; it just knows how to strip off (some) inflectional endings and index the resulting stems, and then how to stem the search input and search the stemmed index with it. You will never find an automated search engine that gives you perfect results.

Right now, the search is paying no attention to whether things are in gloss tags or not; as I understand it, the purpose of the gloss tags is to construct and English-Nxa’amxcin list, not to aid in searching.

The situation with "fatty" is definitely a bit odd; it appears that if you search for that word, you it doesn't get stemmed prior to the search, whereas if you search for "fired" it does. Perhaps the stemmer avoids stemming -tty inputs because there are many which shouldn't be stemmed? ("batty", "natty", "patty", for instance.)



SMK continued:

OK, so when I search for fatten, fattened, or fattening, I get the same 5 hits – 3 for “fattening”, one for “fattened”, and one for “fatten” – i.e. everything with the stem “fatten”. It doesn't go all the way down to the root “fat”, and that's fine.

When I search for “fatty”, all I get is the one entry for “fatty”, as you explained above. That's fine too.

We had been adding inferred glosses for the uninflected English stems and roots of attested glosses, e.g.

<def>
<seg>I am <gloss>fattening</gloss> it up</seg><bibl corresp="psn:W">W10.138</bibl>
<seg><gloss subtype="i">fatten</gloss></seg><bibl corresp="psn:ECH">ECH</bibl>
<seg><gloss subtype="i">fat</gloss></seg><bibl corresp="psn:ECH">ECH</bibl>
</def>

Here, <gloss subtype="i">fatten</gloss> adds nothing to the search capabilities, because the stemmer can find “fatten” within “fattening”.

But does this entry with “fattening” get found when I search for “fat” because of the stemmer, or because of the <gloss subtype="i">fat</gloss>? It must be because of the inferred gloss, because the stemmer only stems as far as “fatten”.

In the case of “fatty”, where we know the stemmer doesn't operate on it, it still gets found when I search for “fat” because of the <gloss subtype="i">fat.

(“fattening” and “fatty” do NOT get found when I search for “fat” just because they contain the string f-a-t, because “fatal” and “father” are NOT found by a search for “fat”. To find anything with the string f-a-t, I would need to search for “fat*”.)

So the inferred glosses do play a role in improving the search. That said, I don't think we should be going out of our way to add inferred glosses for this reason.

Permalink 12:20:04 pm, by skell, 698 words, 20 views   English (CA)
Categories: Activity log; Mins. worked: 50

Changes to gloss-tagging rules

Much discussion over the last few weeks regarding the placing of gloss tags for generating the Eng-Nx wordlist. I attempt to summarize our conclusions here for future reference.

1) Why do we place inferred glosses (<gloss subtype=”i”>)?

At various times, we have placed inferred glosses for augmenting the search engine on the website, and for generating the English word list.

We concluded that from here on, we ONLY need to place gloss tags for generating the English word list. Inferred glosses do sometimes enhance the web search engine, but now that the stemming analyzer is in place, we don't need to do any further markup to help it out.



2) How should we tag inflected English words?

Until last week, we had been inferring the root word (or stem where relevant) when a def is an inflected or derived form of an English word, e.g.

<def>
<seg>he is <gloss>fattening</gloss> it up</seg>
<bibl corresp=“psn:JM”>JM 1.2.3</bibl>
<seg><gloss subtype=“i”>fatten</gloss></seg>
<bibl corresp=“psn:ECH”>ECH</bibl>
<seg><gloss subtype=“i”>fat</gloss></seg>
<bibl corresp=“psn:ECH”>ECH</bibl>
</def>

This encoding means that this entry will show up three times in the English-Nxa’amxcin wordlist: under fat, under fatten, and under fattening. This seems like overkill, especially when these three words will sort one after the other in the English wordlist anyway.

ECH and SMK decided we would like to see the “fat” entries as follows in the print dictionary:

fat: fat

fatten: fatten, fattened, fattening

fatty: fatty

To accomplish this, we need to reduce the number of gloss tags we place in each entry. Inflected English forms (-ed, -ing) should not be gloss tagged; only their root or stem should be gloss tagged.

So “fattening” would now be gloss-tagged as:

<seg>he is <gloss>fatten</gloss>ing it up</seg>

MDH confirmed that the search engine is ignoring gloss tags, so the stemmer will operate on <gloss>fatten</gloss>ing the same as it would on <gloss>fattening</gloss>. (That is, it will continue to return all results with the stem “fatten” when someone searches for fatten, fattened, or fattening.)

MDH has created two sample Eng-Nx word lists based on the 6 files with “complete” status, one using all the gloss tags, and one omitting the inferred gloss tags. They are in moses/trunk/docs/glosses. We concluded that we don't want to programmatically ignore the inferred glosses, because many of them – especially the synonyms – are worth including. But we can refer to these lists to identify the inflected English words whose gloss tags need to be revised.



3) How should we tag English phrasal verbs?

Where appropriate, English phrasal verbs will be enclosed in a single gloss tag - e.g, <gloss>go after</gloss>. This will allow us to organize the headwords in the Eng-Nx word list as follows:

go

go after

go down

go up

, etc.



4) How can we distinguish English homophones in glosses?

English homophones in glosses will be distinguished with a secondary word (or phrase) in an @n attribute on the <gloss> tag, e.g.<gloss n="conflagration">fire</gloss>, <gloss n="back of boat">stern</gloss>. These will then be rendered as follows in the print dictionary:

fire (conflagration):

stern (back of boat):

We decided not to use parts of speech for @n values. We will always use synonyms. We need to select synonyms that will be clear to readers in the community.

I have now disambiguated the English homophones listed here, and updated the Notes on Definitions and Gloss Tagging document accordingly. Where one homophone was far more common in the data than the other, I only added an @n value on the less common one - e.g. watch (wristwatch).

Permalink 11:53:34 am, by esaint, 7 words, 20 views   English (CA)
Categories: Activity log; Mins. worked: 240

Update from ES - May 22, 2013

ES added transcripts for accf3, fraf8, cltf6

Permalink 10:42:20 am, by jim, 53 words, 19 views   English (CA)
Categories: Activity Log; Mins. worked: 420

Initial Project Orientation

A Useful day. Team members identified a wide range of potential sources of archival material. Members selected potential sources for further investigation with the task of reporting back on Friday at 9:00 am with findings. Followed up with email confirming task assignments. Set up drop box account with initial file structure for team use.
Permalink 10:24:47 am, by mholmes, 77 words, 12 views   English (CA)
Categories: Activity log; Mins. worked: 60

Advanced search working

Trying to abstract the combined keyword/text search into a separate library yesterday was very problematic, but I took a simpler approach this morning and simply copied and adapted the code from search.xq into advanced_search.xq. The result seems to be working perfectly -- the keyword/text search is done first to retrieve a set of @xml:ids, then the search is done on those ids, with additional filters provided by the other form controls.

Permalink 09:48:18 am, by mholmes, 41 words, 14 views   English (CA)
Categories: Activity log; Mins. worked: 90

Added @xml:lang attributes to names

Did this through XSL with some cunning language-detection code based on content and context, and it seems to have worked pretty well. The Names page now uses the @xml:lang attribute instead of its own cruder detection code to build output.

21/05/13

Permalink 09:52:11 pm, by Ashley, 70 words, 291 views   English (CA)
Categories: Tasks, Activity Log, Announcements, Documentation; Mins. worked: 285

Initial meeting

It was great meeting you all today and I'm looking forward to working with you all through the summer! I thought I would post one of my favorite newspaper articles from the project I mentioned today. Blayney was the oldest of the Scott brothers and the event that earned him the Distinguished Flying Cross is outlined in the article on the left. It's a pretty unbelievable story!

Happy hunting tomorrow.

Permalink 07:50:08 pm, by Kirsten, 69 words, 19 views   English (CA)
Categories: Activity Log; Mins. worked: 600

Possible posters?

Hope everyone's having a nice evening! Here's a couple of poster designs - the images are HUGE, but if you right click on them and select "View Image" then they should pop up without being cut off! The visuals are taken from Canadian Great War era posters, and the information is pretty much just copied from the project charter. Please share your own ideas regarding the "look" or messages!
Permalink 05:35:47 pm, by mholmes, 10 words, 27 views   English (CA)
Categories: G&T Hours; Mins. worked: 0

MDH: 225 + 2 = 227 hours G&T

Too much to do, not enough time to do it...

Permalink 05:08:57 pm, by mholmes, 130 words, 10 views   English (CA)
Categories: Activity log; Mins. worked: 120

Limitations on advanced search

PAB wants to combine the simple search (which is actually very complicated behind the scenes, since it does keyword lookups and combines them with supplementary text-searching) with the advanced search filters. This is proving virtually impossible, partly because it's just too messy -- you'd need to retrieve a document set from the keyword search in a separate step, and then filter it -- and partly because I just don't have time to implement it properly before the launch. I'll have a couple more shots at it, but things aren't looking good so far.

Made a few other changes and fixes requested with PAB, and hid the text search box, since it's doing what it says on the box (a text search), and not what PAB wants (a complicated keyword search).

Permalink 05:05:27 pm, by mholmes, 41 words, 10 views   English (CA)
Categories: Activity log; Mins. worked: 180

Various tweaks to XSLT and CSS

Following a meeting at which we discussed strategy, and decided to focus for now on the Mayoral Pageants, worked with KMF on a range of minor display and rendering issues for primary source documents, including bylines, marginal labels, and text indents.

Permalink 03:48:59 pm, by Hannah, 0 words, 15 views   English (CA)
Categories: Activity Log; Mins. worked: 300

Test

Permalink 09:41:31 am, by mholmes, 4 words, 16 views   English (CA)
Categories: Activity log; Mins. worked: 10

Tweak to City Talks site...

...on instructions from JS-R.

Permalink 09:41:03 am, by mholmes, 4 words, 12 views   English (CA)
Categories: Activity log; Mins. worked: 60

Collapsed five slides to a single diagram

As planned last week.

20/05/13

Permalink 09:21:21 pm, by jim, 10 words, 17 views   English (CA)
Categories: Activity Log; Mins. worked: 240

Prepare Briefing Material

Initial presentation prep, admin prep, blog and Word Press Brief

17/05/13

Permalink 03:27:02 pm, by mholmes, 38 words, 15 views   English (CA)
Categories: Activity log; Mins. worked: 90

More work to be done on the presentation

Meeting to review the presentation -- my task now is to collapse six slides which begin with the picture of the filecard box into a single stepped diagram illustrating the old encoding process and the horrible binary result.

Permalink 03:25:22 pm, by mholmes, 43 words, 11 views   English (CA)
Categories: Activity log; Mins. worked: 180

Beginning of tutorial for primary source encoding

Started a tutorial based on SNOW1 (for the moment), and in the process of writing the first bit of it, came up against many annoyances in the rendering of egXML blocks; fixed those rendering issues (in three places, site, redesign, and codesharing. Grrr).

Permalink 11:32:42 am, by jnazar, 12 words, 17 views   English (CA)
Categories: Activity log; Mins. worked: 30

Cascade - Hispanic & Italian website

Emailed DR with latest changes/additions required for site.
Site in progress.

Permalink 11:30:19 am, by jnazar, 9 words, 18 views   English (CA)
Categories: Activity log; Mins. worked: 30

Medieval Studies website (current site)

In Progress: updating site with new course listings 2013-14.

Permalink 11:29:50 am, by jnazar, 10 words, 17 views   English (CA)
Categories: Activity log; Mins. worked: 60

Religious Studies (current site)

In progress: updating site with new course listings for 2013-14.

Permalink 09:21:48 am, by mholmes, 26 words, 10 views   English (CA)
Categories: Activity log; Mins. worked: 30

Added handling for dramatic text tags

Added rendering handling for sp, speaker, and p within sp. The stage tag isn't handled yet. Rolled out changes both to site and to redesign codebases.

Permalink 09:20:57 am, by Greg, 60 words, 25 views   English (CA)
Categories: Servers; Mins. worked: 15

Java headless

The ISE was getting the error
java.lang.NoClassDefFoundError: Could not initialize class sun.awt.X11GraphicsEnvironment
when running xwiki.
It turns out that the existence of quotes in JAVA_OPTS directives causes the option to be ignored. So, for future reference, use -Djava.awt.headless=true instead of -Djava.awt.headless="true" when launching tomcat on a headless server.

Permalink 08:34:05 am, by mholmes, 3 words, 20 views   English (CA)
Categories: Activity log; Mins. worked: 15

Update to City Talks site

...on RL's instructions.

16/05/13

Permalink 05:31:49 pm, by mholmes, 135 words, 11 views   English (CA)
Categories: Activity log; Mins. worked: 120

Troubleshooting: encoded title page of SNOW1, found and fixed rendering bug

Since SNOW1 was a bit of a mess at the beginning, because of the encoders following obsolete examples, I've manually encoded the title page as an example.

Also found a problem with METR1 which was not really a bug, nor an encoding invalidity: a body element which goes straight to content (e.g. a head) with no intervening div is not invalid, but it triggered rendering problems because it was completely unexpected. As it happens, the encoding should not have been that way -- other divs appear later in the body -- but it wasn't technically wrong, so it would be good to figure out a way to prevent this through the schema or more likely through Schematron. We could change the content model of body so that it can only have divs, of course.

Permalink 05:27:51 pm, by mholmes, 13 words, 17 views   English (CA)
Categories: Activity log; Mins. worked: 120

Finished reworking and collapsing my part of the presentation

Section 2 is now down to 6 slides, with more detail and more extensive notes.

Permalink 05:27:14 pm, by mholmes, 82 words, 17 views   English (CA)
Categories: Activity log; Mins. worked: 90

Work on names list

Following Sarah's post, I've done the following:

  • Added a language filter so you can view names only in English or Nxaʔamxcín. This is a crude regex, but it works because English names always begin with caps, and Nxaʔamxcín names never do.
  • Turned off the traffic light display in the names page.
  • Added more processing to the path, to handle rendering of e.g. choice elements inside names.
  • Excluded lexical suffix entries.
  • Elaborated the captions and links a bit.
Permalink 12:17:53 pm, by skell, 229 words, 25 views   English (CA)
Categories: Tasks; Mins. worked: 0

changes for Names pages

Here are a few requests for the Names page on the website:

DONE -exclude Lexical Suffix entries

DONE -fix the display of sic/corr, so that only “Wenatchi” displays, not “WenatcheeWenatchi” (See for example the entry for “Sam George”.)

DONE -put flora (plants) and fauna (animals) in the link text at the top of the page

-separate out the sorting into Nx-Eng and Eng-Nx pages. Ideally, users should be able to view the complete list, or any of the six lists by name type, sorted either by Nxa'amxcin name or by English name. The present setup with Nx and Eng names mixed together in the Name column is somewhat confusing. Continue to sort the Nx-Eng lists based on name tags in prons. For the present, exclude name tags in orths when generating these lists. Sort the Eng-Nx lists based on name tags in defs.

PENDING ECH'S FURTHER DISCUSSION WITH CCT:

Please also generate a printable version of the six lists of names by type. These only need to be sorted alphabetically by Nxa'amxcin name - i.e. only include the name tags within prons when generating these lists. Ideally they would be spreadsheets with the following columns:

Name (pron:seg type= “p”)
Source (following bibl ... if the pron:seg type= “p” is NOT subtype=“i”)
Definition (all defs)
Pronunciation (pron:seg type= “n”)
Source (following bibl)
Word Parts (hyph)

15/05/13

Permalink 05:33:24 pm, by mholmes, 8 words, 24 views   English (CA)
Categories: G&T Hours; Mins. worked: 0

MDH: 223 + 2 = 225 hours G&T

Running very fast to stay in same place...

Permalink 05:18:22 pm, by mholmes, 65 words, 12 views   English (CA)
Categories: Activity log; Mins. worked: 120

Fixes and updates

Did some tasks from yesterday and some new ones:

  • Files that used <group> have now been converted to <div>s. (The only exception is stow_1633, which probably does need <group>.)
  • XSLT rendering has been updated to handle this.
  • Extra stray copies of METR1 have been identified in the db and removed. These were causing errors in the redesign pipeline.
Permalink 02:50:14 pm, by mholmes, 36 words, 14 views   English (CA)
Categories: Activity log; Mins. worked: 360

Subdomain and advanced search both working

I've implemented the advanced search as a separate page, and got it basically working, although some missing bits in the encoding mean that it's not finding everything it should (e.g. dates are missing @whens sometimes).

Permalink 01:51:58 pm, by esaint, 17 words, 24 views   English (CA)
Categories: Activity log; Mins. worked: 240

Update from ES on May 15, 2013

1. ES corrected location coordinates for cltf6, aacf3, fraf8
2. ES added transcripts (non annotated) for fraq 7, fraq8, fraq9

Permalink 10:16:21 am, by mholmes, 32 words, 51 views   English (CA)
Categories: Announcements; Mins. worked: 15

CO 60 Vol 13 page images added to the Colonial Despatches collection

1309 page images for CO 60 Vol 13 (in three different sizes) have been added to the collection. These cover the British Columbia 1862: Despatches to London. These will now be linked into the transcription documents.

14/05/13

Permalink 05:03:10 pm, by mholmes, 6 words, 19 views   English (CA)
Categories: Activity log; Mins. worked: 120

A little work on TEI tickets

Work arising from the Providence meeting.

Permalink 03:56:07 pm, by mholmes, 168 words, 20 views   English (CA)
Categories: Activity log; Mins. worked: 120

Meeting and tasks

I have these tasks coming out of the team meeting today:

  • DONE: Fix rendering of org popups.
  • DONE: Add Schematron constraint for malformed Julian dates.
  • DONE: Fix rendering of persNames with genName and roleName in them.
  • DONE (for group elements): Make a list of files containing group elements, and other bad old code.
  • DONE: Transform files with group elements into nested divs.
  • Add an attribute value parameter to the CodeSharing interface (will have to be done after July, probably).
  • Add handling for @style on list, along with documentation for it, change existing usage of list/@type to @style, then remove list/@type from schema.
  • DONE: Look at forme works in SNOW1 and figure out why they're not rendering properly.
  • Collapse the myth and fict personography types to a single type "lit". This will involve both data and rendering and must be done simultaneously.
  • Add rendering for sp, speaker and stage for SNOW1.
  • In redesign (with Pat): make page credits work like page TOC (pop-out rather than long list).

13/05/13

Permalink 05:04:22 pm, by mholmes, 3 words, 23 views   English (CA)
Categories: G&T Hours; Mins. worked: 0

MDH: 222 + 1 = 223 hours G&T

On late duty.

Permalink 05:01:20 pm, by mholmes, 156 words, 16 views   English (CA)
Categories: Activity log; Mins. worked: 420

eXist build script

I've spent the whole day working on getting a more flexible and successful build system for eXist. This is what I've added to Greg's script:

  • It now checks for the presence of Saxon and warns if it's not available.
  • It checks for three XSLT files, and in each case, if the file is there, it transforms a target file in the build tree. These are for conf.xml.tmpl, mime-types.xml.tmpl, and controller-config.xml. This should allow us to set up build environments for each of our specific projects.
  • It excludes XML Calabash and includes FOP. The former was blocking the build because its download location is down.

Found a number of problems with eXist, which I've reported, including a bad one once the webapp is running: you can no longer call transform:transform with a relative path to the XSLT file, otherwise you get an error. A full path from /db seems to work.

10/05/13

Permalink 03:20:27 pm, by sarneil, 104 words, 24 views   English (CA)
Categories: Activity log; Mins. worked: 90

added thumbnails

ES added about ten new videos and XML data files, so I had to create a thumbnail image for each. I ran each file in the player.xql file, stopped the video, captured a bit of the screen to a png file, edited that to 88x66 px (size that all of them seem to be) added them to the SVN repository, uploaded them to the production site and the copy of the site on my Mac.

While doing that, I noticed extraneous thumbnail files in the images (as opposed to the images/thumbnails) folder, so deleted those from the servers and from the repository.

Permalink 02:30:28 pm, by mholmes, 2 words, 24 views   English (CA)
Categories: G&T Hours; Mins. worked: 0

MDH: 223 - 1 = 222 hours G&T

Leaving early.

Permalink 02:25:08 pm, by mholmes, 211 words, 22 views   English (CA)
Categories: Activity log; Mins. worked: 90

Security re-established

We've been running the live db with open access since the last time I rebuilt it, so in the process of doing other updates (such as rolling out the Java sorting collations) I've also added back the protection that we had before. In the process of doing this, I got bitten by the horrible eXist bug which enables you to lock yourself out of the admin account if you edit the admin user and forget to retype the password into the two password boxes (the effect is that you end up with a random admin password that you can never discover). As a result, I had to remove the server version of the app and replace it with a refreshed version of my local copy. This failed the first few times -- Tomcat tries to auto-deploy the app before it's completely uploaded the dbx files, so the uploaded .filepart files can not be renamed to overwrite the ones created by the live startup. It took two or three shots to get this problem solved. The only way seems to be to let it deploy, but stop it immediately in the Tomcat manager; then delete all the dbx, lock and log files; then upload them again; then restart it in the manager.

Permalink 12:22:00 pm, by skell, 96 words, 16 views   English (CA)
Categories: Activity log; Mins. worked: 5

print dictionary layout and web dictionary sort orders

1) For the linguists' dictionary, we would like to see:

first phonemic representation in bold <orthography in angle brackets> [narrow transcription(s) in square brackets], for both forms and cits - e.g.:

ʔáyx̣ʷt <ʔáyx̌ʷt> [ʔáyəx̣ʷt]
√ʔáyx̣ʷ-t
1. be tired
2. tired, worn out

• √ʔáyx̣ʷ-tl kɬʔámnc
<√ʔáyx̌ʷ-tl kɬʔámnč>
[√ʔáyəx̣ʷ-t ləkɬəʔámənč]
he is tired of waiting (for you / me)

2) On the website, we would ultimately like things sorted by orthography.

Permalink 11:57:09 am, by sarneil, 139 words, 23 views   English (CA)
Categories: Activity log; Mins. worked: 120

change pointers from pear and lettuce to tomcat-devel and hcmc

ES noted that recent changes she'd made weren't appearing on the production site at francotoile.uvic.ca.
I had a connection in the exist admin client that used pear.hcmc.uvic.ca as the domain. I thought that would be dead, but when the connection succeeded, I assumed that domain name was forwarding to the current instance. Wrong. Obviously there is another instance somewhere on "pear" that is still running.

Created a new connection in the admin client using tomcat-devel.hcmc.uvic.ca as the domain and that worked. Also, the webapp in the new instance is francotoile and not francotoile21 as it was in the old instance.

In poking through the files, also noticed a connection string using lettuce.uvic.ca, so changed that to hcmc.uvic.ca and it seems to be working.

Updated the lastpass records.

Permalink 11:33:31 am, by mholmes, 91 words, 16 views   English (CA)
Categories: Activity log; Mins. worked: 90

Handling of homographic glosses

This morning we decided that a simple and quick way to distinguish between homographs with different meanings is required to make the English lookup part of the dictionary less confusing. This will be achieved by adding a clarificatory word or phrase in the @n attribute of a gloss. Glosses will then be presented in the E-to-M view with this clarification in parentheses. Processing on the website will need to be changed to take account of this, and the print dictionary rendering will also have to be written with this in mind.

09/05/13

Permalink 04:59:38 pm, by mholmes, 5 words, 25 views   English (CA)
Categories: G&T Hours; Mins. worked: 0

MDH: 222 + 1 = 223 hours G&T

Wrestling with similarity metric algorithm...

Permalink 03:27:18 pm, by mholmes, 275 words, 28 views   English (CA)
Categories: Activity log; Mins. worked: 240

Wrote an eXist module for similarity metric comparisons

I've now figured out how to create an extension module for eXist, following the instructions here. These are some things I've learned:

  • The only practical way to do this is to work with your module code in the context of the eXist tree, in $EXIST_HOME/extensions/modules/src/org/exist/xquery/modules.
  • You can use a non-eXist namespace -- I'm using http://hcmc.uvic.ca/ns/usm -- but it seems safest to use the eXist package structure, so my package is in org.exist.xquery.modules.unisimmetric.
  • All the extension modules are built together into a single jar called exist-modules.jar. You can build this jar alone, using build.sh extension-modules, then drop that jar into an existing eXist instance (although if the new jar was built with a substantially different version from the rest of the code, there could well be problems).
  • To turn on your module, you add a line to the conf.xml file like this:
    <module uri="http://hcmc.uvic.ca/ns/usm"                        class="org.exist.xquery.modules.unisimmetric.UniSimMetricModule" />
    
    along with the other modules.

I'm not yet happy with my module, and I'm still working on it. In particular, I'm not happy with the scores it's generating, and I think this might be something to do with other bits that get included in the GZIP stream, such as a header; if I can figure out how big those are, I can remove them from the calculation. The highest difference I seem to get is around 0.53 with completely dissimilar strings, so it seems as though the results are being compressed into a range much smaller than 0-1.

Permalink 10:14:12 am, by Greg, 153 words, 29 views   English (CA)
Categories: Servers, Activity log, Activity log, Documentation; Mins. worked: 120

Rsync problem on rutabaga

After an update to DSM 4.2 rutabaga no longer allowed rsync backups, failing with:

sh: rsync: not found
rsync: connection unexpectedly closed (0 bytes received so far) [Receiver]
rsync error: remote command not found (code 127) at io.c(605) [Receiver=3.0.9]

After much wailing and gnashing of teeth we discovered that non-interactive users do not have /usr/syno/bin in their path (it *is* in their path if they shell in to the NAS, so they can run rsync *from* the NAS when shell'd in).

So, that's an easy fix, says us: add a symlink to /usr/syno/bin/rsync in a logical spot that *is* in a non-interactive path, like /usr/bin.

Problem: admin user cannot su root (error message = su: must be suid to work properly), so cannot create symlink.

Answer: TURN ON TELNET AND LOG IN AS ROOT USING THE WORST POSSIBLE METHOD!!! Then, you make the symlink and turn off telnet - quick!

08/05/13

Permalink 05:41:02 pm, by mholmes, 20 words, 26 views   English (CA)
Categories: G&T Hours; Mins. worked: 0

MDH: 220 + 2 = 222 hours G&T

Late duty, then fighting with @W(*%&($^ Rutabaga which has forgotten how to do rsynb backups. Still not solved. GRRR.

Permalink 03:13:35 pm, by mholmes, 58 words, 21 views   English (CA)
Categories: Activity log; Mins. worked: 240

Wrote the text of my Oxford talk

I've written out the prose of the Oxford talk. Still remaining to do before July:

  • Presentation slides (with diagrams etc.)
  • Creation of SF project.
  • Move of files out of MoEML repo into SF repo.
  • Replacement of them in MoEML by a script that exports them into place.
  • Addition of licensing info, SVN headers etc.
  • Prettying-up the SF site.
Permalink 02:57:09 pm, by Greg, 125 words, 18 views   English (CA)
Categories: Activity log; Mins. worked: 120

Network troubles - solved

This morning we got nets to set up B047 on the switch. Some time after that 3 machines lost the ability to get a DHCP address - I have no idea if there is a causal relationship between these things.

After much mucking about, it *looks* like it might have been a communication problem between the DHCP server and the machines.
Even forcibly releasing the DHCP lease didn't make any difference.

In the end, I booted the machine with a LiveCD and fiddled with enabling/disabling the network (in the network manger). I got a proper IP and rebooted in the installed OS. That seemed to break it out of the loop.

A bit perplexing and aggravating because I don't actually know what the problem was...

Permalink 12:12:52 pm, by esaint, 45 words, 20 views   English (CA)
Categories: Activity log; Mins. worked: 240

Update from ES on May 8, 2013

1. ES added transcripts for fraq10, fraf6
2. ES has edited Liette's video, and given it to SA. Corresponding xml file has also been added.
3. ES asked SA to upload all new addition to the production site in order to see if edition with Audacity works fine.

Permalink 10:56:47 am, by jnazar, 32 words, 24 views   English (CA)
Categories: Activity log; Mins. worked: 15

FMIS report

Sent in FMIS report May 2nd re furniture removal. (p/up by May 8, 2013)

May 8: followed up on sent FMIS request re furniture removal specific pick up time. Furniture removed May 8th, 11:00am.

Permalink 10:46:18 am, by jnazar, 17 words, 29 views   English (CA)
Categories: Activity log; Mins. worked: 1680

Administration - scheduling

Computer facility:
Received computer usage requests from several projects for May-August 2013
New schedule updated and posted online

Permalink 10:35:22 am, by jnazar, 63 words, 16 views   English (CA)
Categories: Activity log; Mins. worked: 90

Religious Studies Cascade

SA and I met with SA (Rel.St) to discuss migration of current Religious Studies site over to Cascade format.

Discussion:
- RELS current information and design to be replicated basically in Cascade
- discussed various Cascade requirements

Next steps:

- sent info. request to SA (RELS) required for outline
- HCMC:currently preparing RELS structure outline in readiness for submission for approval

07/05/13

Permalink 04:55:49 pm, by mholmes, 3 words, 22 views   English (CA)
Categories: G&T Hours; Mins. worked: 0

MDH: 219 + 1 = 220 hours G&T

On late duty.

Permalink 04:41:34 pm, by mholmes, 93 words, 18 views   English (CA)
Categories: Activity log; Mins. worked: 240

Meeting and work on page-image-linking

Team meeting, at which we discussed the use of ISE's facsimile viewer in MoEML (which will be easy enough to do, although it's based on a traditional db, and we'll have to replace that with proper TEI facsimile encoding).

People also asked me to clarify how the EEBO linking works, so I've done that in the transcriptions documentation file, and I've also implemented the display of little page-images linking to the EEBO pages. Also, during today, <address> and <addrLine> were added to the schema, with some basic display rendering.

Permalink 11:48:47 am, by mholmes, 123 words, 15 views   English (CA)
Categories: Activity log; Mins. worked: 120

Fixes and plans for MyNDIR release

Met with PAB and made a number of fixes:

  • Created a new copy of all the images in a folder called displaysize, and resized some of them to comply with owners' requests; the webapp now draws from that folder.
  • Made changes to menu captions, and added a new menu item and placeholder page for it ("In progress").
  • Removed authentication by commenting the relevant bits of the webapp's web.xml file, and restarting the webapp in the Tomcat manager. The Tomcat config still has the user set up, but it's no longer being used for anything.
  • Fixed a couple of layout bugs.

We also made a plan for an advanced search, which I'll document in more detail here before I try to implement it.

06/05/13

Permalink 10:01:19 am, by sarneil, 172 words, 30 views   English (CA)
Categories: Activity log; Mins. worked: 60

agenda : print not-approved courses in timetable view

When making modifications a couple of weeks ago (see post), I changed only the list view and not the timetable view. I didn't realize that the dropdown for which courses to print affected only the list view (as in the code it is located in the active_area code and not the view-specific code in manage_calendars.

I added code to
- manage_calendars (at about line 12985 - that file is ridiculously big) to add the option to the select in the dropdown
- manage_calendars.php (at about line 118, in the else if(strcmp("display_table",$do_what)==0) branch) to check the setting of the dropdown and take appropriate action

Notice that in the timetable view, the effect of changing the setting take place immediately in the view, then that view is printed; in the list view, changing the setting does not change the display but does correctly filter what gets printed. Not sure if that inconsistency is a bug or a feature which reflects how those two views are used.

03/05/13

Permalink 02:34:51 pm, by mholmes, 2 words, 25 views   English (CA)
Categories: G&T Hours; Mins. worked: 0

MDH: 220 - 1 = 219 hours G&T

Leaving early.

02/05/13

Permalink 04:26:30 pm, by jim, 5 words, 66 views   English (CA)
Categories: Tasks; Mins. worked: 60

Develop workplan

Develop initial workplan and timesheets.
Permalink 04:23:11 pm, by jim, 25 words, 74 views   English (CA)
Categories: Tasks; Mins. worked: 480

Revise CTGW WordPress Site

Revise CGTW WordPress Site to test Footnote and Bibliography functions. Revise menu structure. Add test content. Add Gallery test material and structure albums and galleries
Permalink 03:27:05 pm, by mholmes, 324 words, 27 views   English (CA)
Categories: Activity log; Mins. worked: 120

Duplicate @xml:ids

The problem of duplicate @xml:id attributes on entries has now become a serious issue for the print dictionary building, because I'm unable to properly process the entire collection properly to produce the book; to build the dictionary I have to use XInclude to create a single XML source file, and when I do that there are over 1600 duplicate ids which prevent some of the processing steps from being successful.

I've taken a quick look at where the duplicates tend to be concentrated, by adding the files in alphabetical order and looking to see how many duplicates occur with each addition. These files create no problems (i.e. they have no duplicates among themselves):

affix_glot-ix.xml
affix_k-m.xml
affix_n-t.xml
affix_u-CAPS.xml
c.xml
c-glot.xml
c-rtr.xml
glottal.xml
h.xml
h-phar-part1.xml
h-phar-part2.xml
l-affric.xml
lex-suff.xml
new-data-2013.xml
p-glot.xml
phar-w.xml
qw-glot.xml
s-rtr.xml
t-glot.xml
xw.xml

When I add the remaining files, one by one (and only one at a time), these are the results:

k.xml            100 duplicates.
k-glot.xml:         18
kw.xml:              2
kw-glot.xml:          2
l.xml:              3
l-fric.xml:          6
m.xml:              3
n.xml:             97
p.xml:              7
particles.xml:          4
pron.xml:          2
q.xml:              4
q-glot.xml:          3
qw.xml:              1
rescued.xml:         54
s.xml:              2
t.xml:             20
ww-glot.xml:          4
x.xml:              3
x-uvul.xml:          4
yy-glot.xml:          4

What I'm going to do is develop the dictionary output using only the valid files, and then add the others in as they get fixed. In the meantime, it might be worth having a go at some of the low-hanging fruit (the ones with only two or three duplicates). More will show up as we add those in, of course -- there will be duplicates across the currently-excluded files as well as those that they share with the "good" files. So the dictionary PDFs will shrink in size, but I'll be able to start doing things like generating page-references that depend on xml:ids.

Permalink 01:28:13 pm, by mholmes, 149 words, 20 views   English (CA)
Categories: Activity log; Mins. worked: 180

Implemented a crude similarity metric in XQuery

Lucene-based fuzzy matching seems to be very broken in the build of eXist I'm using, and in any case it's based on Levenshtein distance, so I've implemented a crude version of the USM/NCD algorithm in XQuery. It's a long way from ideal, though, because it's using base64 versions of strings rather than compressing the actual strings (this is all I can do with eXist's exposed gzip access); using zip seems to be punitive because it would require creating a file on the filesystem or in the db and compressing that. I think a simpler approach would be to take my Java class and strip out all the command-line stuff it contains, then call that directly from XQuery (see the xqSearchUtils java project and the way it's called from the Despatches XQuery for an example). A jar file with a simple XQuery module interface might be very handy indeed.

01/05/13

Permalink 04:35:26 pm, by mholmes, 2 words, 31 views   English (CA)
Categories: G&T Hours; Mins. worked: 0

MDH: 219 + 1 = 220 hours G&T

Media queries...

Permalink 11:54:30 am, by esaint, 32 words, 25 views   English (CA)
Categories: Activity log; Mins. worked: 290

Update from ES on May 1, 2013

1. SA found a solution with regards to cutting the soundtrack at the millisecond : Use Audacity! The program was installed on POMME.
2. ES entered & committed the transcripts for cltq3, fraq11, fraq12, fraq13

Permalink 10:52:00 am, by mholmes, 8 words, 23 views   English (CA)
Categories: Activity log; Mins. worked: 60

Workstudy proposals for 2013-2014

The call is out, and mine are done.

All HCMC Blogs

Actions

Reports

Categories

All HCMC Blogs

Transformer blog

Work on this blogging tool

Image Markup Tool blog

HCMC Project Management

Nxaʔamxcín (Moses) Dictionary Blog

Maintenance

FrancoToile

Mariage

Administration

Academic

Depts

Scandinavian-Canadian Studies

EMLS

Scraps

Image Markup and Presentation

Update of Humanities Sites

viHistory

Vacation, Hours and Sickday Log

Times Colonist Transcript Database

Devonshire

CMC Research Collective

Moodle

Humanities Project Showcase

Peter's blog

teiJournal

Projects

Professional Development

Colonial Despatches

Coup De Des - GUI for concrete poem

Capital Trials at the Old Bailey

Agenda Class Timetabling

Lansdowne Lectures

German Medical Exams

Canadian Mysteries

Map Of London

MyNDIR

Canadian Journal of Buddhist Studies

Adaptive Database

Myths on Maps

Properties

Cascade

Vesalius

DHSI

History of the Philosophy of Language

A City Goes to War

May 2013
Sun Mon Tue Wed Thu Fri Sat
 << < Current> >>
      1 2 3 4
5 6 7 8 9 10 11
12 13 14 15 16 17 18
19 20 21 22 23 24 25
26 27 28 29 30 31  

XML Feeds