z-logo
open-access-imgOpen Access
Clustering Document Images Using Graph Summaries
Author(s) -
Eugen Barbu,
Pierre Héroux,
Sébastien Adam,
Éric Trupin
Publication year - 2005
Publication title -
lecture notes in computer science
Language(s) - English
Resource type - Book series
SCImago Journal Rank - 0.249
H-Index - 400
eISSN - 1611-3349
pISSN - 0302-9743
ISBN - 3-540-26923-1
DOI - 10.1007/11510888_20
Subject(s) - document clustering , computer science , information retrieval , cluster analysis , representation (politics) , graph , document layout analysis , document classification , image (mathematics) , domain (mathematical analysis) , pattern recognition (psychology) , artificial intelligence , mathematics , theoretical computer science , mathematical analysis , politics , political science , law
International audienceDocument image classification is an important step in document image analysis. Based on classification results we can tackle other tasks such as indexation, understanding or navigation in document collections. Using a document representation and an unsupervized classification method, we can group documents that from the user point of view constitute valid clusters. The semantic gap between a domain independent document representation and the user implicit representation can lead to unsatisfactory results. In this paper we describe document images based on frequent occurring symbols. This document description is created in an unsupervised manner and can be related to the domain knowledge. Using data mining techniques applied to a graph based document representation we found frequent and maximal subgraphs. For each document image, we construct a bag containing the frequent subgraphs found in it. This bag of "symbols" represents the description of a document. We present results obtained on a corpus of graphical document images

The content you want is available to Zendy users.

Already have an account? Click here to sign in.
Having issues? You can contact us here
Accelerating Research

Address

John Eccles House
Robert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom