Addition of digitized newspapers to the Arnprior Archives’ search interface

by Kathy Bryce Friday, June 22, 2018 8:54 AM

Andornot has recently completed work for the Arnprior & McNab/Braeside Archives to add the newly digitized versions of their newspapers up to 1937 to their searchable collections. The majority of issues are from the Arnprior Chronicle starting in 1885.  We also created a Finding Aid allowing researchers to see what issues are available for each of the 16 newspapers with the ability to browse each individually. 

Funding for this project was provided by the Ottawa Branch of the Ontario Genealogical Society, and will be a wonderful new option for genealogical research as well as providing a window into the coverage of historical events. Individual names can be searched, and search words or parts of words are highlighted on the newspaper pages, as in the screenshot below:

image

A search on a general term such as “sawmill” pulls results from several data sources and allows users to easily narrow down their results.

image

As well as providing new search capabilities for this important set of documents, this initiative removes the need to consult the now very fragile originals.

The digitization itself was handled by a local vendor and Andornot scripted the OCR’ing to create a searchable layer in the PDF’s.  When funding permits, the aim is to enhance the search option further by matching up the newspaper issues with an index to births, marriages and deaths created by the Archives. 

If you are considering a similar digitization project, or have databases or other material that you would like to make searchable, contact us for a chat to discuss options!

Library and Archives Canada announces launch of 2018 funding cycle for Documentary Heritage Communities Program

by Jonathan Jacobsen Sunday, December 03, 2017 4:45 PM

Library and Archives Canada has announced the launch of the 2018 funding cycle for the Documentary Heritage Communities Program (DHCP). This is the fourth year of a planned 5 year program, with $1.5 million available this year, as in previous rounds.

The DHCP provides financial assistance to the Canadian documentary heritage community for activities that:

  • increase access to, and awareness of, Canada’s local documentary heritage institutions and their holdings; and
  • increase the capacity of local documentary heritage institutions to better sustain and preserve Canada’s documentary heritage.

The deadline for submitting completed application packages is February 7, 2018. 

This program is a great opportunity for archives, museums, historical societies and other cultural institutions to digitize their collections, develop search engines and virtual exhibits, and other activities that preserve and promote their valuable resources.

The program is aimed at non-governmental organizations specifically, including:

  • Archives; 
  • Privately funded libraries; 
  • Historical societies;              
  • Genealogical organizations/societies;  
  • Professional Associations; and 
  • Museums with an archival component.

Businesses, government and government institution (including municipal governments and Crown Corporations), museums without archives, and universities and colleges are not eligible.

Types of projects which would be considered for funding include:

  • Conversion and digitization for access purposes; 
  • Conservation and preservation treatment; 
  • The development (research, design and production) of virtual and physical exhibitions, including travelling exhibits; 
  • Conversion and digitization for preservation purposes; 
  • Increased digital preservation capacity (excluding digital infrastructure related to day-to-day activities); 
  • Training and workshops that improve competencies and build capacity; and 
  • Development of standards, performance and other measurement activities. 
  • Collection, cataloguing and access based management; and 
  • Commemorative projects.

Further program details, requirements  and application procedures are available at http://www.bac-lac.gc.ca/eng/services/documentary-heritage-communities-program/Pages/dhcp-portal.aspx

How can Andornot help?

Many Andornot clients have obtained DHCP grants in previous rounds, and Andornot has worked on many other projects which would qualify for this grant. Some examples are detailed in these blog posts:

We have extensive experience with digitizing documents, books and audio and video materials, and developing systems to manage those collections and make them searchable or presented in virtual exhibits.

Contact us to discuss collections you have and ideas for proposals. We'll do our best to help you obtain funding from the DHCP program!

Library and Archives Canada announces launch of 2017-2018 funding cycle for Documentary Heritage Communities Program

by Kathy Bryce Monday, October 03, 2016 9:51 AM

Library and Archives Canada has announced the launch of the 2017–2018 funding cycle for the third year of the Documentary Heritage Communities Program (DHCP).

The deadline for applications is January 27th, 2017.  Check out the lists of previous recipientsfor a guide to the type of projects that qualified.  Andornot clients were successful in getting funded in both previous rounds.

This program is a great opportunity for archives, museums, historical societies and other cultural institutions to digitize their collections, develop search engines and virtual exhibits, and other activities that preserve and promote their valuable resources.

The objectives of the program are to:

  • Increase access to, and awareness of Canada's local documentary institutions and their holdings; and
  • Increase the capacity of local documentary heritage institutions to better sustain and preserve Canada's documentary heritage.

The program is aimed at non-governmental organizations specifically, including:

  • Archives;
  • Privately funded libraries;
  • Historical societies;             
  • Genealogical organizations/societies; 
  • Professional Associations; and
  • Museums with an archival component.

As before,businesses, government and government institution (including municipal governments and Crown Corporations), museums without archives, and universities and colleges are not eligible.

Types of projects which would be considered for funding include:

  • Conversion and digitization for access purposes;
  • Conservation and preservation treatment;
  • The development (research, design and production) of virtual and physical exhibitions, including travelling exhibits;
  • Conversion and digitization for preservation purposes;
  • Increased digital preservation capacity (excluding digital infrastructure related to day-to-day activities);
  • Training and workshops that improve competencies and build capacity; and
  • Development of standards, performance and other measurement activities.
  • Collection, cataloguing and access based management; and
  • Commemorative projects.

Two categories of funding available:

  • small projects under $15,000; and
  • large projects up to $100,000.

Further program details, requirements  and application procedures are available at http://www.bac-lac.gc.ca/eng/services/documentary-heritage-communities-program/Pages/dhcp-portal.aspx

How can Andornot help?

Andornot has worked with many clients on the types of projects which would qualify for this grant. Some examples are detailed in these blog posts:

We have extensive experience with digitizing documents, books and audio and video materials, and developing systems to manage those collections and make them searchable or presented in virtual exhibits.

Contact us to discuss collections you have and ideas for proposals. We'll do our best to help you obtain funding from the DHCP program!

New Andornot Add-on: Embedded Document Viewer Surfaces Content Within Digital Documents

by Jonathan Jacobsen Tuesday, August 18, 2015 9:45 AM

So often when searching a database, records in the search results include links to PDFs and other electronic documents. Somewhere in the linked documents are pages with information related to the search, but where? And which pages are the most relevant? A user can use their PDF reader’s Find function to search again for keywords in the document, but that’s repetitive and not especially sophisticated. What if there was a better way of reviewing content within linked documents?

The Andornot Embedded Document Viewer breaks every PDF or similar document down into individual pages, with OCRd, indexed, searchable full text content available to searchers. When a user searches a database, the search results can include individual pages of linked documents, with their search terms highlighted, and with the most relevant pages shown, not just the record that links to the resource.

The screenshot below shows search terms highlighted on page. Additional images and examples are available here.

By viewing individual pages, rather than having to open and review each linked document in its entirety, a user can more quickly assess resources.

Other features include the ability to navigation through the document, zoom in and out of a page, and view thumbnails of all pages.

The Andornot Embedded Document Viewer is often added to the Andornot Discovery Interface search engine. Search results can represent the individual pages of a document that best match the user's search, ranked by relevancy, rather than just the catalogue or parent metadata record for the entire document.

Examples

The Andornot Embedded Document viewer is incorporated into the following projects, which are also based on the Andornot Discovery Interface:

Contact us for more information about enhancing search and discovery of linked, digitized resources.

College of Registered Nurses of BC Offers Digitized Archives

by Jonathan Jacobsen Thursday, November 28, 2013 10:46 AM

The College of Registered Nurses of BC (CRNBC) celebrated its 100th anniversary in 2012. To commemorate this milestone, a project was undertaken to digitize and make available online many decades of CRNBC publications, such as newsletters and annual reports. This collection documents the history of the college and the many nurses who contributed to its first 100 years, and perhaps most importantly, easily enables tracking of important decisions over the decades.

Printed copies of the publications were digitized by a service bureau, with Andornot then developing the online search and presentation system.

The new site is available at https://archives.crnbc.ca

As shown in the flowchart below, the workflow from print to online involved several stages and processes.

  • The service bureau scanned the documents to specifications developed by Andornot, producing thousands of high-resolution TIFF images – one image for each page of each publication – as well as associated XML in ALTO format containing the full text extracted from the scanned images through an OCR process. 
  • Andornot developed scripts to extract metadata from these many separate files, such as the name and date of the publication, and to generate images in different sizes as needed for the interface. We used PowerShell, ImageMagick and djvulibre for this.
  • Andornot developed a search engine using the Andornot Discovery Interface (AnDI) to provide the best possible keyword searching. 

The interface and features were tailored to the specific needs of this project:

  • Brief search results show details of the publication and a snippet of text showing the user’s search words highlighted, as well as a thumbnail image of the page containing the text, and facets to limit by date and publication. 
  • Clicking through to the full record shows the page in greater detail, but still with the search words highlighted. As well, the surrounding pages of the publication are also available allowing quick navigation through the entire publication. This was achieved through the use of the New York Times Document Viewer and custom programming to highlight text in an overlay layer.
  • A PDF of the full document is also available for download. Andornot created these by stitching together the separate images files for each page back together into a single file.
  • Permalinks allow users to easily bookmark and share specific pages and documents.

Often in a digitization project, the result might be a single PDF per publication. With this project, by having each page available as a separate image, we were more easily able to direct the user to the page and text they are most interested in, though they can still access a PDF of the entire document – the best of both worlds.

All of this complexity comes together to provide an elegant and intuitive interface for users.

A CRNBC staffer using the archive says, “This archive is awesome! We were able to search several decades of a policy issue in a short time, so we could draft an historical timeline showing policy changes right up to 2013! Searching this database saved us so much time.”

Contact Andornot for help with your own digitization project.

Month List