z-logo
open-access-imgOpen Access
Sentiment analysis of hate speech as an information tool to prevent riots and environmental damage
Author(s) -
G. A. Marchellim,
Yova Ruldeviyani
Publication year - 2021
Publication title -
iop conference series. earth and environmental science
Language(s) - English
Resource type - Journals
eISSN - 1755-1307
pISSN - 1755-1315
DOI - 10.1088/1755-1315/700/1/012024
Subject(s) - random forest , support vector machine , sentiment analysis , precision and recall , f1 score , social media , computer science , recall , classifier (uml) , artificial intelligence , voice activity detection , natural language processing , machine learning , psychology , speech processing , world wide web , cognitive psychology
Twitter is currently one of the most popular social media in the world. Indonesia ranked in 8th the most Twitter users from all over the world with the 10.65 million users. President of the Republic of Indonesia, Joko Widodo, was ranked 8th leader with the most followers on Twitter, with 11 million users. Everyone can express or give an opinion on something or what they thought through social media including create or spreading the hate speech. One result of this hate speech can lead to mass demonstrations which can result in environmental destruction. Our research focused on sentiment analysis of hate speech towards Indonesian president by comparing the performance between Random Forest algorithm and Support Vector Machine algorithm to prevent the negative result of hate speech that can be happen in the future. This research obtained 550 tweets and labelled manually whether it is categorized as a hate speech or not and divided into each of 275 data for positive and negative sentiments. Random Forest classifier has the best performance by combining gini criterion with the number of trees 128 with the result of the accuracy score is 76.7%, precision score 82.7%, recall score 64%, and F1 score 73.3%. Support Vector Machine classifier has the best performance by combining the linear kernel type and set the C parameter to 100 with the result of the accuracy score at 74.7%, precision score 100%, recall score 49.3%, and F1 score 66.1%. Random Forest classifier performs its best in scoring F1 score, accuracy, and recall while Support Vector Machine classifier performs its best in scoring precision. Based on the results, Random Forest classifier for hate speech detection can be applied for hate speech detection to prevent damaging the environment as it shows better score on accuracy and F1 score than Support Vector Machine Classifier.

The content you want is available to Zendy users.

Already have an account? Click here to sign in.
Having issues? You can contact us here