Close

Cookies warning

This web site uses cookies to improve your experience. By viewing our content, you are accepting the use of cookies.

Cookies are small text documents stored on your computer; the cookies set by this website can only be used on this website and pose no security risk.

Please do not proceed if you do not want these cookies being set. [Show details]

University of Salford
PRImA - Pattern Recognition & Image Analysis Group

Further Details

Two Approaches for Text Segmentation in Web Images

D. Karatzas, A. Antonacopoulos

Proceedings of the 7th International Conference on Document Analysis and Recognition (ICDAR2003), Edinburgh, UK, August 2003, pp. 131-136

Abstract

There is a significant need to recognise the text in images on web pages, both for effective indexing and for presentation by non-visual means (e.g., audio). This paper presents and compares two novel methods for the segmentation of characters for subsequent extraction and recognition. The novelty of both approaches is the combination of (different in each case) topological features of characters with an anthropocentric perspective of colour perception in preference to RGB space analysis. Both approaches enable the extraction of text in complex situations such as in the presence of varying colour and texture (characters and background).

Full Paper

Download Download

back


Valid XHTML 1.0! Valid CSS! Total number of visitors since 20 November 2003:
Best viewed in 1024x768 - Maintained by: Christos Papadopoulos (e-mail) - © 2004-05