Open Access
A Shared Parts Model for Document Image Recognition
Author(s) - M. Das Gupta, P. Sarkar
Publication year - 2007
Publication title - Ninth International Conference on Document Analysis and Recognition (ICDAR 2007)
Language(s) - English
Resource type - Book series
ISBN - 0-7695-2822-8
DOI - 10.1109/icdar.2007.34
We address document image classification by visual appearance. An image is represented by a variable-length list of visually salient features. A hierarchical Bayesian network is used to model the joint density of these features. This model promotes generalization from a few samples by sharing component probability distributions among different categories, and by factoring out a common displacement vector shared by all features within an image. The Bayesian network is implemented as a factor graph, and parameter estimation and inference are both done by loopy belief propagation. We explain and illustrate our model on a simple shape classification task. We obtain close to 90% accuracy in distinguishing journal articles from memos in the UWASH-II dataset, as well as on other classification tasks on a home-grown dataset of technical articles.
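
The two ideas named in the abstract, a pool of component distributions shared across categories and a single displacement common to all features in an image, can be illustrated with a toy sketch. This is not the authors' model: the Gaussian parts, the two categories, the mixing weights, and the small grid of candidate shifts are all hypothetical, and the displacement is marginalized by exact enumeration rather than by loopy belief propagation on a factor graph.

```python
import math

# Hypothetical shared pool of parts: isotropic 2-D Gaussians as (mean, std).
PARTS = [((0.0, 0.0), 0.5), ((2.0, 0.0), 0.5), ((0.0, 2.0), 0.5)]

# Hypothetical categories: each is only a set of mixing weights over PARTS,
# so the component distributions themselves are shared between categories.
WEIGHTS = {
    "horizontal": [0.5, 0.5, 0.0],
    "vertical": [0.5, 0.0, 0.5],
}

# Candidate displacements common to every feature in one image (uniform prior).
SHIFTS = [(dx, dy) for dx in (-1.0, 0.0, 1.0) for dy in (-1.0, 0.0, 1.0)]


def log_gauss(point, mean, std):
    """Log density of an isotropic 2-D Gaussian."""
    sq = (point[0] - mean[0]) ** 2 + (point[1] - mean[1]) ** 2
    return -sq / (2 * std ** 2) - math.log(2 * math.pi * std ** 2)


def logsumexp(values):
    """Numerically stable log of a sum of exponentials."""
    top = max(values)
    return top + math.log(sum(math.exp(v - top) for v in values))


def log_likelihood(features, category):
    """log p(features | category), summing over the shared displacement."""
    per_shift = []
    for sx, sy in SHIFTS:
        total = -math.log(len(SHIFTS))  # uniform prior over shifts
        for x, y in features:
            moved = (x - sx, y - sy)  # undo the common displacement
            terms = [
                math.log(w) + log_gauss(moved, mean, std)
                for w, (mean, std) in zip(WEIGHTS[category], PARTS)
                if w > 0
            ]
            total += logsumexp(terms)  # mixture over shared parts
        per_shift.append(total)
    return logsumexp(per_shift)


def classify(features):
    """Pick the category with the highest marginal log-likelihood."""
    return max(WEIGHTS, key=lambda c: log_likelihood(features, c))
```

In this sketch a variable-length feature list such as `[(1.0, 1.0), (3.0, 1.0), (1.1, 0.9)]`, which is the "horizontal" layout shifted by (1, 1), is assigned to "horizontal" because the shift is explained once for the whole image rather than separately per feature.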
