For the current REF see the REF 2021 website REF 2021 logo

Output details

11 - Computer Science and Informatics

University of Salford

Return to search Previous output Next output
Output 0 of 0 in the submission
Output title

Word-Based Adaptive OCR for Historical Books

Type
E - Conference contribution
Name of conference/published proceedings
2009 10th International Conference on Document Analysis and Recognition
Volume number
-
Issue number
-
First page of article
501
ISSN of proceedings
-
Year of publication
2009
URL
-
Number of additional authors
4
Additional information

<17>One of the major results of the IMPACT multi-million research project, actively involving industry and academia, in improving OCR performance for large-scale digitization of historical documents. In the case of books (majority of world-library holdings) the proposed architecture for OCR supports a recognition system that can train itself as it progresses through the pages of a book. This is an important requirement for large-scale digitization, where human input is impractical, very costly and material is printed using a variety archaic conventions and fonts. Experiments with material from major European libraries demonstrate a significant improvement in recognition rate using this approach.

Interdisciplinary
-
Cross-referral requested
-
Research group
None
Citation count
8
Proposed double-weighted
No
Double-weighted statement
-
Reserve for a double-weighted output
No
Non-English
No
English abstract
-