z-logo
open-access-imgOpen Access
Web Document Parsing: A New Approach to Modeling Layout-Language Relations
Author(s) -
M. Yoshida,
H. Nakagawa
Publication year - 2007
Publication title -
ninth international conference on document analysis and recognition (icdar 2007)
Language(s) - English
Resource type - Book series
ISBN - 0-7695-2822-8
DOI - 10.1109/icdar.2007.263
We propose a novel approach for extracting semantic structures from Web documents. Our task is to extract trees that describe the hierarchical relations in documents. We developed an algorithm for this task by using the stochastic context free grammar (SCFG) framework. Experiments showed that our approach effectively worked showing performance improvement through the parameter estimation.

The content you want is available to Zendy users.

Already have an account? Click here to sign in.
Having issues? You can contact us here
Accelerating Research

Address

John Eccles House
Robert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom