z-logo
open-access-imgOpen Access
Named Entity Recognition in Telugu language using Language Dependent Features and Rule based Approach
Author(s) -
B. Sasidhar,
P. M. Yohan,
A. Vinaya Babu,
A. Govardhan
Publication year - 2011
Publication title -
international journal of computer applications
Language(s) - English
Resource type - Journals
ISSN - 0975-8887
DOI - 10.5120/2602-3628
Subject(s) - telugu , computer science , natural language processing , artificial intelligence , rule based system
The objective of Named Entity Recognition (NER) is to categorize all named entities in a document into predefined classes like person, organization, location, brand names and others. Named Entity Recognition is a difficult process in Indian languages like Telugu, Hindi, and Bengali, Urdu etc., where sufficient gazetteers and annotated corpora are not available compared to English language? A rule based systems is very difficult to implement because of lack of grammatical and linguistic analysis to make rules in Indian languages like “Telugu”. In this paper we describe the identification of Named Entities using various features, gazetteer lists using language dependent features and rule based approaches for Telugu language. Here we described two phase representation of Named Entity Recognition. The first phase describes the noun identification using Telugu dictionaries, noun morphological stemmer and noun suffixes. The second phase identifies the Named Entities using transliterated gazetteer lists related to different Named Entity tags, various Named Entity suffix features, context features and morphological features.

The content you want is available to Zendy users.

Already have an account? Click here to sign in.
Having issues? You can contact us here
Accelerating Research

Address

John Eccles House
Robert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom