z-logo
open-access-imgOpen Access
Clustering of Multi Scripts Isolated Characters Using k-Means Algorithm
Author(s) -
Neeru Garg,
Munish Kumar
Publication year - 2015
Publication title -
international journal of mathematical sciences and computing
Language(s) - English
Resource type - Journals
eISSN - 2310-9033
pISSN - 2310-9025
DOI - 10.5815/ijmsc.2015.02.03
Subject(s) - devanagari , scripting language , cluster analysis , computer science , pattern recognition (psychology) , feature (linguistics) , artificial intelligence , feature extraction , image (mathematics) , character recognition , linguistics , philosophy , operating system
The aim of this paper is script identification problem of handwritten text which facilitates the clustering of data according to their type of script. In this paper, collection of different types of handwritten text document i.e. Devanagari, Gurumukhi and Roman is taken as input and then cluster of all these documents according to script type whether i.e. Devanagari, Gurumukhi, or Roman was prepared. Clustering of handwritten multi-script document scheme proposed in this paper is divided into two phases. First phase used to extract the features of given text images. In the second phase, features extracted in the previous phase were used for clustering with kMeans algorithm. In feature extraction phase, we have extracted four types of features, namely, circular curvature feature, horizontal stroke density feature, pixel density feature value and zoning based feature. In this study, we have considered 4,850 samples of isolated characters of Devanagari, Gurumukhi and Roman script.

The content you want is available to Zendy users.

Already have an account? Click here to sign in.
Having issues? You can contact us here
Accelerating Research

Address

John Eccles House
Robert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom