Proceedings of the 15th International Conference on Document Analysis and Recognition (ICDAR2019), Sydney, Australia, September 2019, pp. 1521-1526
This paper presents an objective comparative evaluation of page segmentation and region classification methods for documents with complex layouts. It describes the competition (modus operandi, dataset and evaluation methodology) held in the context of ICDAR2019 and presents the results of evaluating twelve methods: nine submitted and three state-of-the-art systems (commercial and open-source). Three scenarios are reported: one evaluates the ability of methods to accurately segment regions, and two evaluate both segmentation and region classification. Text recognition was a bonus challenge and was not taken up by all participants. The results indicate that an innovative approach has a clear advantage, but there is still a considerable need to develop robust methods that deal with layout challenges, especially with non-textual content.
C. Clausner, A. Antonacopoulos, S. Pletschacher, "ICDAR2019 Competition on Recognition of Documents with Complex Layouts – RDCL2019", Proceedings of the 15th International Conference on Document Analysis and Recognition (ICDAR2019), Sydney, Australia, September 2019, pp. 1521-1526