Supervised Machine Learning Algorithms for Sentiment Analysis of Bangla Newspaper | Zendy

Sabrina Jahan Maisha | Zendy; Nuren Nafisa | Zendy; Abdul Kadar Muhammad Masum | Zendy

AI Assistant Blog Pricing

Home ZAIA Blog

Open Access

Supervised Machine Learning Algorithms for Sentiment Analysis of Bangla Newspaper

Author(s) -

Sabrina Jahan Maisha,

Nuren Nafisa,

Abdul Kadar Muhammad Masum

Publication year - 2021

Publication title -

international journal of innovative computing

Language(s) - English

Resource type - Journals

ISSN - 2180-4370

DOI - 10.11113/ijic.v11n2.321

Subject(s) - bengali , artificial intelligence , computer science , random forest , naive bayes classifier , machine learning , support vector machine , sentiment analysis , decision tree , natural language processing , newspaper , algorithm , advertising , business

We can state undoubtedly that Bangla language is rich enough to work with and implement various Natural Language Processing (NLP) tasks. Though it needs proper attention, hardly NLP field has been explored with it. In this age of digitalization, large amount of Bangla news contents are generated in online platforms. Some of the contents are inappropriate for the children or aged people. With the motivation to filter out news contents easily, the aim of this work is to perform document level sentiment analysis (SA) on Bangla online news. In this respect, the dataset is created by collecting news from online Bangla newspaper archive. Further, the documents are manually annotated into positive and negative classes. Composite process technique of “Pipeline” class including Count Vectorizer, transformer (TF-IDF) and machine learning (ML) classifiers are employed to extract features and to train the dataset. Six supervised ML classifiers (i.e. Multinomial Naive Bayes (MNB), K-Nearest Neighbor (K-NN), Random Forest (RF), (C4.5) Decision Tree (DT), Logistic Regression (LR) and Linear Support Vector Machine (LSVM)) are used to analyze the best classifier for the proposed model. There has been very few works on SA of Bangla news. So, this work is a small attempt to contribute in this field. This model showed remarkable efficiency through better results in both the validation process of percentage split method and 10-fold cross validation. Among all six classifiers, RF has outperformed others by 99% accuracy. Even though LSVM has shown lowest accuracy of 80%, it is also considered as good output. However, this work has also exhibited surpassing outcome for recent and critical Bangla news indicating proper feature extraction to build up the model.

The content you want is available to Zendy users.

Already have an account? Click here to sign in.

Having issues? You can contact us here

Empowering knowledge with every search

About

About Careers Publisher Partners Contact Us

Learn

FAQs Blog Terms of Use Privacy Policy

About

Learn

Discover

Explore