Visual signature based identification of Low-resolution document images

Behera, Ardhendu, Lalanne, Denis and Ingold, Rolf (2004) Visual signature based identification of Low-resolution document images. ACM symposium on Document engineering 2004 (DocEng04), 28-30 October 2004, Milwaukee, Wisconsin, pp. 178-187, ISBN 1-58113-938-1, DOI https://doi.org/10.1145/1030397.1030432.

Item not available from this archive.

Abstract

In this paper, we present (a) a method for identifying documents captured from low-resolution devices such as web-cams, digital cameras or mobile phones and (b) a technique for extracting their textual content without performing OCR. The first method associates a hierarchically structured visual signature to the low-resolution document image and further matches it with the visual signatures of the original high-resolution document images, stored in PDF form in a repository. The matching algorithm follows the signature hierarchy, which speeds-up the search by guiding it towards fruitful solution spaces. In a second step, the content of the original PDF document is extracted, structured, and matched with its corresponding high-resolution visual signature. Finally, the matched content is attached to the low-resolution document image's visual signature, which greatly enriches the document's content and indexing. We present in this article both these identification and extraction methods and evaluate them on various documents, resolutions and lighting conditions, using different capture devices.

Item Type: Conference or Workshop Item (Paper)
Subjects: Q Science > QA Mathematics > QA75 Electronic computers. Computer science
Q Science > QA Mathematics > QA76 Computer software
Divisions: Computing and Information Systems
Related URLs:
Date Deposited: 10 Dec 2014 16:00
URI: http://repository.edgehill.ac.uk/id/eprint/6236

Archive staff only

Item control page Item control page