Meeting on goals for ELS site
Posted by sarneil on 05 Mar 2009 in Activity log
Met with LC to discuss what he wants from the English Literary Series site:
- user sees a simple Google-like search box
- user gets back Google-like results
- user has immediate access to PDF files for reading online or printing
- chapter-level granularity is fine enough: it suffices to know that a keyword occurs in a given chapter
- user should be able to move easily from one chapter to another, or, having found the PDF of a specific chapter, to get the PDF of the entire book
- virtually no money or labour for markup or indexing, but labour available for scanning about 100 issues to PDF
It looks like the best implementation is to put the PDF files into an account on unix.uvic.ca, let Google index them, and then write a PHP page that lets the user specify a query, grabs the results from Google, and presents them appropriately to the user. That is, we rely on Google's indexing and searching algorithms, which do work on PDFs. Google also typically produces HTML and text versions of PDF files.
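
A rough sketch of the two pieces of logic such a page would need, written in Python for illustration rather than in the PHP planned above. It only builds a site-restricted Google query URL and maps a chapter PDF to its book PDF; it does not fetch or parse results. The path `unix.uvic.ca/els`, the file layout (`<book>/chNN.pdf` beside `<book>/book.pdf`), and both function names are assumptions made up for the example, not decisions from the meeting.

```python
from urllib.parse import urlencode

# Assumption: placeholder location of the scanned PDFs; the real account
# path on unix.uvic.ca has not been decided.
SITE_PREFIX = "unix.uvic.ca/els"


def build_search_url(user_query: str, site_prefix: str = SITE_PREFIX) -> str:
    """Return a Google search URL restricted to PDFs under site_prefix."""
    # "site:" and "filetype:" are standard Google search operators.
    q = f"{user_query.strip()} site:{site_prefix} filetype:pdf"
    return "https://www.google.com/search?" + urlencode({"q": q})


def book_pdf_for(chapter_url: str) -> str:
    """Map a chapter PDF URL to the PDF of the whole book.

    Assumes a hypothetical layout in which each book has its own
    directory holding chapter files plus a single book.pdf.
    """
    directory, _, _ = chapter_url.rpartition("/")
    return directory + "/book.pdf"


if __name__ == "__main__":
    print(build_search_url("pastoral elegy"))
    print(book_pdf_for("https://unix.uvic.ca/els/book12/ch03.pdf"))
```

With a predictable naming scheme like this, the results page could link each chapter hit to its neighbouring chapters and to the full book without any extra markup or indexing.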