Clustering of Multi Scripts Isolated Characters Using k-Means Algorithm



Open Access


Author(s) - Neeru Garg, Munish Kumar

Publication year - 2015

Publication title - International Journal of Mathematical Sciences and Computing

Language(s) - English

Resource type - Journals

eISSN - 2310-9033

pISSN - 2310-9025

DOI - 10.5815/ijmsc.2015.02.03

Subject(s) - devanagari, scripting language, cluster analysis, computer science, pattern recognition (psychology), feature (linguistics), artificial intelligence, feature extraction, image (mathematics), character recognition, linguistics, philosophy, operating system

This paper addresses the problem of script identification for handwritten text, which makes it possible to cluster data according to script type. A collection of handwritten text documents in three scripts (Devanagari, Gurumukhi and Roman) is taken as input, and the documents are then clustered by script type. The proposed scheme for clustering handwritten multi-script documents is divided into two phases. The first phase extracts features from the given text images. In the second phase, the features extracted in the first phase are clustered with the k-means algorithm. In the feature extraction phase, we extract four types of features: circular curvature features, horizontal stroke density features, pixel density features and zoning-based features. In this study, we consider 4,850 samples of isolated characters from the Devanagari, Gurumukhi and Roman scripts.
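The two-phase scheme described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the abstract does not define the features precisely, so the functions `pixel_density` and `zoning_features`, the 4 x 4 zone grid, and the plain NumPy `kmeans` routine are all assumptions made for illustration, and the circular curvature and horizontal stroke density features are omitted. Character images are assumed to be binary arrays with non-zero foreground pixels.

```python
import numpy as np

def pixel_density(img):
    """Fraction of foreground (non-zero) pixels in a binary character image."""
    return float(np.count_nonzero(img)) / img.size

def zoning_features(img, grid=4):
    """Split the image into grid x grid zones and return each zone's pixel density.

    The grid size is an illustrative assumption, not a value from the paper.
    """
    rows = np.array_split(np.arange(img.shape[0]), grid)
    cols = np.array_split(np.arange(img.shape[1]), grid)
    return np.array([pixel_density(img[np.ix_(r, c)]) for r in rows for c in cols])

def assign(X, centers):
    """Index of the nearest center (Euclidean distance) for every row of X."""
    dists = np.linalg.norm(X[:, None, :] - centers[None, :, :], axis=2)
    return dists.argmin(axis=1)

def kmeans(X, k, n_iter=100, seed=0):
    """Plain k-means (Lloyd's algorithm) on the rows of X.

    Returns (labels, centers). Initial centers are k distinct rows of X
    chosen at random with a fixed seed.
    """
    X = np.asarray(X, dtype=float)
    rng = np.random.default_rng(seed)
    centers = X[rng.choice(len(X), size=k, replace=False)]
    for _ in range(n_iter):
        labels = assign(X, centers)
        # Recompute each center as the mean of its members; keep the old
        # center if a cluster happens to be empty.
        new_centers = np.array([
            X[labels == j].mean(axis=0) if np.any(labels == j) else centers[j]
            for j in range(k)
        ])
        if np.allclose(new_centers, centers):
            break
        centers = new_centers
    return assign(X, centers), centers
```

Under these assumptions, a list of character images `images` (a hypothetical variable) would be clustered with `X = np.array([zoning_features(im) for im in images])` followed by `labels, centers = kmeans(X, k=3)`, where `k=3` reflects the three scripts named in the abstract. The resulting cluster indices are arbitrary and would still need to be matched to script names.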
