Thursday, June 10, 2010

Ghostlier Demarcations: Large-Scale Text Digitization Projects and Their Utility for Contemporary Humanities Scholarship

This report takes a closer look at the prospects for "digital humanities," a catchphrase embracing the potential for digitization to influence a large fraction of academic disciplines. In essence the promise of digitization lies in the electronic reproduction of full-texts that allows rapid access, searching, and combining of data. With language as its object of study, the humanities can benefit enormously from digital technologies that can speed up the analysis of language.

The study shows that the promise of digitization in theory is butting up against a number of barriers. Some are technological. Among the digitized collections in existence, it is easier to find works prior to 1923 than afterwards because of copyright conditions. There are problems with the quality of scanning stemming from the limitations of Optical Character Recognition (OCR) technology. Many documents are available in snippets. Collections do not overlap as much as one might suspect. In addition, there are financial restrictions on libraries. However promising the technology might be in future, there is insufficient funding available now to address the limits of digitization. It further appears that there are deeply ingrained cultural patterns in humanities research based in the use of print resources. For these reasons, the report, for the foreseeable future sees a mixture of print and electronic resources instead of a wholesale conversion to digitization.

Henry, Charles, and Kathlin Smith. Ghostlier Demarcations: Large-Scale Text Digitization Projects and Their Utility for Contemporary Humanities Scholarship. Washington D.C.: Council on Library and Information Resources, 2010.

http://lawprofessors.typepad.com/law_librarian_blog/2010/06/do-libraries-face-an-inevitable-digital-future-and-just-what-is-the-cost-per-volume-of-books-versus-.html?utm_source=feedburner&utm_medium=feed&utm_campaign=Feed%3A+LawLibrarianBlog+%28Law+Librarian+Blog%29