Challenges and Opportunities in Image Processing and Analysis of Historical Documents

No Thumbnail Available
Can’t use the file because of accessibility barriers? Contact us with the title of the item, permanent link, and specifics of your accommodation need.

Date

2013-01-30

Journal Title

Journal ISSN

Volume Title

Publisher

Indiana University Libraries

Abstract

A particular challenge in the text recognition of historical document images is the considerable amount of "image noise" that can arise during the whole life cycle of a document from printing and storage to the usage and scanning of the document. Historical documents suffer from several different kinds of noise such as geometric distortions, bleed-through, textured papers, stamp, stain, and so forth. Noise will affect and complicate the different stages of document image analysis including enhancement, segmentation, layout analysis and recognition. This talk will cover the description of different stages of document image analysis and challenges and opportunities in image processing and analysis of historical documents. I will particularly explain about the software that I developed in the IMPACT project for correction of arbitrary geometric artefacts in historical documents. Such distortions appear as arbitrary warping, fold, and page curl and have detrimental effects on OCR and print-on-demand quality.

Description

Presentation materials are not available for this presentation.

Keywords

Citation

Journal

DOI

Link(s) to data and video for this item

Click on the link below in the "External Files" section to play this video.

Rights

Type

Presentation