Literaturnachweis - Detailanzeige
Autor/in | Kichuk, Diana |
---|---|
Titel | Loose, Falling Characters and Sentences: The Persistence of the OCR Problem in Digital Repository E-Books |
Quelle | In: portal: Libraries and the Academy, 15 (2015) 1, S.59-91 (33 Seiten)
PDF als Volltext |
Sprache | englisch |
Dokumenttyp | gedruckt; online; Zeitschriftenaufsatz |
ISSN | 1531-2542 |
DOI | 10.1353/pla.2015.0005 |
Schlagwörter | Electronic Publishing; Electronic Libraries; Electronic Equipment; Computer Software; Books; Reliability; Accuracy; Metadata; Collaborative Writing; Proofreading |
Abstract | The electronic conversion of scanned image files to readable text using optical character recognition (OCR) software and the subsequent migration of raw OCR text to e-book text file formats are key remediation or media conversion technologies used in digital repository e-book production. Despite real progress, the OCR problem of reliability and accuracy in OCR-derived e-book text and metadata persists. This paper examines a selection of digitized e-books in several prominent digital repositories and discusses the impact of OCR technology on e-book text file formats, metadata, and the online reading experience. (As Provided). |
Anmerkungen | Johns Hopkins University Press. 2715 North Charles Street, Baltimore, MD 21218. Tel: 800-548-1784; Tel: 410-516-6987; Fax: 410-516-6968; e-mail: jlorder@jhupress.jhu.edu; Web site: http://www.press.jhu.edu/journals/subscribe.html |
Erfasst von | ERIC (Education Resources Information Center), Washington, DC |
Update | 2020/1/01 |