An Exploratory Approach to Find a Novel Metric Based Optimum Language Model for Automatic Bangla Word Prediction | Zendy

Md. Tarek Habib | Zendy; Abdullah Al Mamun | Zendy; Md. Sadekur Rahman | Zendy; Shah Md. Tanvir Siddiquee | Zendy; Farruk Ahmed | Zendy

AI Assistant Blog Pricing

Home ZAIA Blog

Open Access

An Exploratory Approach to Find a Novel Metric Based Optimum Language Model for Automatic Bangla Word Prediction

Author(s) -

Md. Tarek Habib,

Abdullah Al Mamun,

Md. Sadekur Rahman,

Shah Md. Tanvir Siddiquee,

Farruk Ahmed

Publication year - 2018

Publication title -

international journal of intelligent systems and applications

Language(s) - English

Resource type - Journals

eISSN - 2074-9058

pISSN - 2074-904X

DOI - 10.5815/ijisa.2018.02.05

Subject(s) - bengali , computer science , word (group theory) , artificial intelligence , sentence , natural language processing , set (abstract data type) , metric (unit) , population , speech recognition , linguistics , programming language , philosophy , operations management , economics , demography , sociology

Word completion and word prediction are two important phenomena in typing that have intense effect on aiding disable people and students while using keyboard or other similar devices. Such auto completion technique also helps students significantly during learning process through constructing proper keywords during web searching. A lot of works are conducted for English language, but for Bangla, it is still very inadequate as well as the metrics used for performance computation is not rigorous yet. Bangla is one of the mostly spoken languages (3.05% of world population) and ranked as seventh among all the languages in the world. In this paper, word prediction on Bangla sentence by using stochastic, i.e. N-gram based language models are proposed for auto completing a sentence by predicting a set of words rather than a single word, which was done in previous work. A novel approach is proposed in order to find the optimum language model based on performance metric. In addition, for finding out better performance, a large Bangla corpus of different word types is used.

The content you want is available to Zendy users.

Already have an account? Click here to sign in.

Having issues? You can contact us here

Accelerating Research