Efficient Technique for word identification and recognition in Telugu Documents | Zendy

K. Mohana Lakshmi | Zendy; Tummala Ranga Babu | Zendy

AI Assistant Blog Pricing

Home ZAIA Blog

Open Access

Efficient Technique for word identification and recognition in Telugu Documents

Author(s) -

K. Mohana Lakshmi,

Tummala Ranga Babu

Publication year - 2019

Publication title -

international journal of recent technology and engineering (ijrte)

Language(s) - English

Resource type - Journals

ISSN - 2277-3878

DOI - 10.35940/ijrte.b3793.078219

Subject(s) - telugu , artificial intelligence , computer science , natural language processing , scale invariant feature transform , codebook , word (group theory) , identification (biology) , bag of words model , feature (linguistics) , devanagari , bag of words model in computer vision , cluster analysis , support vector machine , speech recognition , visual word , pattern recognition (psychology) , feature extraction , mathematics , linguistics , image (mathematics) , image retrieval , character recognition , philosophy , botany , biology , geometry

Telugu language is one of the most spoken Indian languages throughout the world. Since it has an old heritage, so Telugu literature and newspaper publications can be scanned to identify individual words. Identification of Telugu word images poses serious problems owing to its complex structure and larger set of individual characters. This paper aims to develop a novel methodology to achieve the same using SIFT (Scale Invariant Feature Transform) features of telugu words and classifying these features using BoVW (bag of visual words). The features are clustered to create a dictionary using k-means clustering. These words are used to create a visual codebook of the word images and the classification is achieved through SVM (Support Vector Machine).

The content you want is available to Zendy users.

Already have an account? Click here to sign in.

Having issues? You can contact us here

Accelerating Research