z-logo
open-access-imgOpen Access
Word Level Language Identification on Code-Mixed English-Bodo Text
Author(s) -
Nayan Jyoti Kalita,
Ankita Goyal Agarwala,
Jayprakash Das
Publication year - 2021
Publication title -
iop conference series. materials science and engineering
Language(s) - English
Resource type - Journals
eISSN - 1757-899X
pISSN - 1757-8981
DOI - 10.1088/1757-899x/1020/1/012027
Subject(s) - computer science , assamese , language identification , word (group theory) , natural language processing , identification (biology) , code (set theory) , artificial intelligence , social media , linguistics , world wide web , natural language , programming language , philosophy , botany , set (abstract data type) , biology
Since social media has become an active part of one’s life, people express their views freely in mixed informal languages on such platforms. So, in a multi-lingual country like India, it becomes really difficult for conventional language detectors to identify such languages. This paper mainly aims to detect the language at word level where the code mixed text can be in English-Bodo-Assamese. The data for the same is collected from some related Facebook pages and various classification algorithms are used to predict and compare the accuracy with which the detection is done.

The content you want is available to Zendy users.

Already have an account? Click here to sign in.
Having issues? You can contact us here