z-logo
open-access-imgOpen Access
TSeg – A Text Segmenter for Corpus Annotation
Author(s) -
Felipe B. Rodrigues,
Richard Semolini,
Norton Trevisan Roman,
Ana María Monteiro
Publication year - 2012
Language(s) - English
Resource type - Conference proceedings
DOI - 10.5753/sbsi.2012.14419
Subject(s) - annotation , computer science , task (project management) , xml , identification (biology) , artificial intelligence , java , process (computing) , software , natural language processing , segmentation , information retrieval , programming language , world wide web , botany , management , economics , biology
This paper describes TSeg – a Java application that allows for both manual and automatic segmentation of a source text into basic units of annotation. TSeg provides a straightforward way to approach this task through a clear point-and-click interface. Once finished the text segmentation, the application outputs an XML file that may be used as input to a more problem specific annotation software. Hence, TSeg moves the identification of basic units of annotation out of the task of annotating these units, making it possible for both problems to be analysed in isolation, thereby reducing the cognitive load on the user and preventing potential damages to the overall outcome of the annotation process.

The content you want is available to Zendy users.

Already have an account? Click here to sign in.
Having issues? You can contact us here