This online repository is the main point of reference for all activities related to evaluation within the scope of the Europeana Newspapers project. Its main goal is to provide a representative collection of all the types of newspapers which are and/or might be subject of ongoing or future digitisation activities. As such, it is hosting scanned images, metadata and ground truth (a representation of the ideal result of a processing step like OCR or layout analysis) on the level of individual newspaper pages.
A survey of OCR evaluation tools and metrics
In The 6th International Workshop on Historical Document Imaging and Processing (HIP '21). Association for Computing Machinery, New York, NY, USA, 13–18.
Ontology and Framework for Semantic Labelling of Document Data and Software Methods
Proceedings of the 13th IAPR International Workshop on Document Analysis Systems (DAS2018), Vienna, Austria, April 24-27, 2018, pp. 73-78
Quality Prediction System for Large-Scale Digitisation Workflows
Proceedings of the 12th IAPR International Workshop on Document Analysis Systems (DAS2016), Santorini, Greece, April 11-14, 2016
The ENP Image and Ground Truth Dataset of Historical Newspapers
Proceedings of the 13th International Conference on Document Analysis and Recognition (ICDAR2015), Nancy, France, August 2015, pp. 931-935