PAGE Metadata Scanner is a command-line tool that scans a single PAGE XML file (document page layout and text content) and outputs its properties and statistics as comma-separated values.
The following properties are supported:
It is also possible to output statistics on all characters that appear in the text content of a PAGE file.
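To illustrate what character statistics over a PAGE file might look like, here is a minimal Python sketch. It is not the tool's implementation and does not reproduce its output format: the function names, the CSV columns, and the choice to read only region-level text are assumptions made for this example. It relies only on the PAGE XML convention that text content is stored in `TextEquiv`/`Unicode` elements.

```python
import csv
import io
import xml.etree.ElementTree as ET
from collections import Counter


def local_name(tag: str) -> str:
    """Strip the XML namespace so that any PAGE schema version is accepted."""
    return tag.rsplit("}", 1)[-1]


def region_texts(root: ET.Element):
    """Yield the region-level text of every TextRegion.

    Assumption for this sketch: only the TextEquiv that is a direct child of
    a TextRegion is read, so text repeated at line, word, or glyph level is
    not counted twice.
    """
    for elem in root.iter():
        if local_name(elem.tag) != "TextRegion":
            continue
        for child in elem:
            if local_name(child.tag) != "TextEquiv":
                continue
            for sub in child:
                if local_name(sub.tag) == "Unicode" and sub.text:
                    yield sub.text


def character_statistics(xml_text: str) -> Counter:
    """Count how often each character occurs in the region-level text."""
    root = ET.fromstring(xml_text)
    counts = Counter()
    for text in region_texts(root):
        counts.update(text)
    return counts


def to_csv(counts: Counter) -> str:
    """Render the counts as CSV (hypothetical column layout)."""
    out = io.StringIO()
    writer = csv.writer(out, lineterminator="\n")
    writer.writerow(["codepoint", "character", "count"])
    for char, n in sorted(counts.items()):
        writer.writerow([f"U+{ord(char):04X}", char, n])
    return out.getvalue()


# Tiny hand-written PAGE document, used only for demonstration.
SAMPLE = """<PcGts xmlns="http://schema.primaresearch.org/PAGE/gts/pagecontent/2019-07-15">
  <Page imageFilename="page.png" imageWidth="100" imageHeight="50">
    <TextRegion id="r1">
      <TextLine id="l1"><TextEquiv><Unicode>abba</Unicode></TextEquiv></TextLine>
      <TextEquiv><Unicode>abba</Unicode></TextEquiv>
    </TextRegion>
  </Page>
</PcGts>"""

if __name__ == "__main__":
    print(to_csv(character_statistics(SAMPLE)), end="")
```

Matching elements by local name keeps the sketch independent of the PAGE schema version declared in the file. Nested text regions, if present, would each contribute their own region-level text.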
A survey of OCR evaluation tools and metrics
In The 6th International Workshop on Historical Document Imaging and Processing (HIP '21). Association for Computing Machinery, New York, NY, USA, 13–18.