z-logo
open-access-imgOpen Access
Comparison of Bagging Ensemble Combination Rules for Imbalanced Text Sentiment Analysis
Author(s) -
Reiza Adi Cahya,
Fitra Abdurrachman Bachtiar,
Wayan Firdaus Mahmudy
Publication year - 2021
Publication title -
journal of information technology and computer science
Language(s) - English
Resource type - Journals
eISSN - 2540-9824
pISSN - 2540-9433
DOI - 10.25126/jitecs.202161206
Subject(s) - softmax function , computer science , artificial intelligence , machine learning , naive bayes classifier , ensemble learning , sentiment analysis , classifier (uml) , support vector machine , data mining , artificial neural network
The wealth of opinions expressed by users on micro-blogging sites can be beneficial for product manufacturers of service providers, as they can gain insights about certain aspects of their products or services. The most common approach for analyzing text opinion is using machine learning. However. opinion data are often imbalanced, e.g. the number of positive sentiments heavily outnumbered the negative sentiments. Ensemble technique, which combines multiple classification algorithms to make decisions, can be used to tackle imbalanced data to learn from multiple balanced datasets. The decision of ensemble is obtained by combining the decisions of individual classifiers using a certain rule. Therefore, rule selection is an important factor in ensemble design. This research aims to investigate the best decision combination rule for imbalanced text data. Multinomial Naïve Bayes, Complement Naïve Bayes, Support Vector Machine, and Softmax Regression are used for base classifiers, and max, min, product, sum, vote, and meta-classifier rules are considered for decision combination. The experiment is done on several Twitter datasets. From the experimental results, it is found that the Softmax Regression ensemble with meta-classifier combination rule performs the best in all except in one dataset. However, it is also found that the training of the Softmax Regression ensemble requires intensive computational resources. Keyword :ensemble, SUM, SR, classifier, dataset

The content you want is available to Zendy users.

Already have an account? Click here to sign in.
Having issues? You can contact us here
Accelerating Research

Address

John Eccles House
Robert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom