Digitization of the missing late c19th volumes

Although there are many digitized collections of statutes available online, and indeed many digitizations of the same publication, I have not found a number of volumes from the last two decades of the nineteenth century.

Happily, I have now been able to digitize these volumes myself, courtesy of the Institute of Historical Research, who very kindly allowed me to photograph their copies.

I copied them using an iphone and a selfie stick designed by Sussex Unversity Humanities Lab. Althugh SHL are developing a whole workflow for DIY scanning and OCRing documents through a modern smartphone, I simply took pictures, and later ran them through Abbyy Finereader, as I have been doing with the digital volumes downloaded from Google Books and Internet Archive.

The whole procedure took a full work day, which I think quite quick given the size and number of the volumes; once I got into the rhythm, the apparatus held firm, I averaged about one volume an hour, photographing two pages at a time.

The text of these volumes can be found on github; some automated correcting has been carried out, but it is still all pretty raw, especially the tables. No doubt there will be pages I have inadvertently photographed twice, photographed poorly, or accidentally omitted, but by and large I think the quality is as good as can be expected. As with all the other volumes I have OCRd, the text is public domain.

Once again, my thanks to the IHR for access to their books and a desk at which to copy them, and to Sussex Humanities Lab for the selfie sticks. Without such help, ‘unofficial’, grassroots, lone scholar projects such as this one would not be able to develop their potential.

Leave a Reply

Your email address will not be published. Required fields are marked *

This site uses Akismet to reduce spam. Learn how your comment data is processed.