Close

Cookies warning

This web site uses cookies to improve your experience. By viewing our content, you are accepting the use of cookies.

Cookies are small text documents stored on your computer; the cookies set by this website can only be used on this website and pose no security risk.

Please do not proceed if you do not want these cookies being set. [Show details]

University of Salford
PRImA - Pattern Recognition & Image Analysis Group

Projects

Performance Analysis of Document Image Analysis Systems

Introduction

Research in Document Image Analysis is becoming increasingly more widespread. This fact has so far resulted in the development of a number of alternative methods for solving the various problems posed in the analysis of document images. Different approaches have been devised to suit different applications and, in most cases, each has been tested with very specific data.

The objective assessment of the performance of document analysis subsystems is relatively in its infancy (with OCR methods performance being the only exception). Currently, test methods and ground truth data are not widely available to suit the diverse needs of each individual component (subsystem) of a Document Analysis System. Moreover, the majority of existing methods and data are constrained by various assumptions on the nature of the image data, such as certain limitations to the freedom of the layout of a page.

Current State

Currently, research is being carried out to identify suitable methods and corresponding ground-truth data organisation to cater for documents with complex layouts. Particular attention is paid to the evaluation of methods applied before OCR.

Further Information

Parts of the on-going work have been presented at International Conferences. For both an overview of the framework and a more detailed account of the analysis of Page Segmentation approaches see the Publications section.

Members Involved

back


Valid XHTML 1.0! Valid CSS! Total number of visitors since 20 November 2003:
Best viewed in 1024x768 - Maintained by: Christos Papadopoulos (e-mail) - © 2004-05