z-logo
open-access-imgOpen Access
GROTOAP
Author(s) -
Dominika Tkaczyk,
Artur Czeczko,
К. Rusek,
Łukasz Bolikowski,
Roman Bogacewicz
Publication year - 2012
Publication title -
ceon repository (centre for evaluation in education and science)
Language(s) - English
Resource type - Conference proceedings
DOI - 10.1145/2232817.2232901
Subject(s) - computer science
The field of digital document content analysis includes many important tasks, for example page segmentation or zone classification. It is impossible to build effective solutions for such problems and evaluate their performance without a reliable test set, that contains both input documents and expected results of segmentation and classification. In this paper we present GROTOAP — a test set useful for training and performance evaluation of page segmentation and zone classification tasks. The test set contains input articles in a digital form and corresponding ground truth files. All input documents included in the test set have been selected from DOAJ database, which indexes articles published under CC-BY license. The whole test set is available under the same license.National Centre for Research and Development (NCBiR) Grant No. SP/I/1/77065/10Łukasz Bolikowsk

The content you want is available to Zendy users.

Already have an account? Click here to sign in.
Having issues? You can contact us here
Accelerating Research

Address

John Eccles House
Robert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom