Document boundary determination using structural and lexical analysis
Author(s) - Kazem Taghva, Marc-Allen Cartright
Publication year - 2008
Publication title - Proceedings of SPIE, the International Society for Optical Engineering
Language(s) - English
Resource type - Conference proceedings
SCImago Journal Rank - 0.192
H-Index - 176
eISSN - 1996-756X
pISSN - 0277-786X
DOI - 10.1117/12.805384
Subject(s) - computer science, process (computing), automation, natural language processing, information retrieval, boundary (topology), lexical analysis, artificial intelligence, stack (abstract data type), programming language, engineering, mathematics, mechanical engineering, mathematical analysis
Document boundary determination is the problem of identifying individual documents in a stack of papers. In this paper, we report on a classification system that automates this process. The system employs features based on document structure and lexical content. We also report experimental results that support the effectiveness of the system.
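
The abstract does not describe the features or the classifier in detail, so the following is only an illustrative sketch of the general idea, not the authors' system. It assumes each page is available as plain text, uses bag-of-words cosine similarity as a stand-in lexical feature, and uses a "Page N" marker as a stand-in structural feature. All function names, the marker format, and the 0.2 threshold are hypothetical choices made for this example.

```python
import math
import re
from collections import Counter


def tokens(text):
    # Lowercased alphabetic words; digits and punctuation are dropped.
    return re.findall(r"[a-z]+", text.lower())


def cosine(a, b):
    # Cosine similarity between two bags of words (lexical feature).
    ca, cb = Counter(a), Counter(b)
    dot = sum(ca[t] * cb[t] for t in ca)
    norm = math.sqrt(sum(v * v for v in ca.values())) * math.sqrt(
        sum(v * v for v in cb.values())
    )
    return dot / norm if norm else 0.0


def page_number(text):
    # Hypothetical structural cue: the first "Page N" marker on the page.
    m = re.search(r"page\s+(\d+)", text, re.IGNORECASE)
    return int(m.group(1)) if m else None


def is_boundary(prev_page, next_page, sim_threshold=0.2):
    # Decide whether a new document starts at next_page.
    p, n = page_number(prev_page), page_number(next_page)
    if n == 1:
        return True  # numbering restarts: likely a new document
    if p is not None and n == p + 1:
        return False  # numbering continues: likely the same document
    # No usable structural cue: fall back to lexical similarity.
    return cosine(tokens(prev_page), tokens(next_page)) < sim_threshold


def split_documents(pages):
    # Group an ordered stack of pages into documents.
    docs = []
    for page in pages:
        if docs and not is_boundary(docs[-1][-1], page):
            docs[-1].append(page)
        else:
            docs.append([page])
    return docs


stack = [
    "Invoice for office chairs. Page 1",
    "Invoice total for office chairs due. Page 2",
    "Memo about the holiday schedule. Page 1",
]
documents = split_documents(stack)
print(len(documents))
```

A trained classifier over such features, as the abstract describes, would replace the hand-written rules in `is_boundary`; the rules here only show how structural and lexical signals might be combined.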
