Open Access
Text Mining for Cyberbullying Detection: a Brazilian Portuguese Evaluation
Author(s) -
C Eberhart,
Luciano Ignaczak,
Márcio Garcia Martins
Publication year - 2021
Language(s) - English
Resource type - Conference proceedings
DOI - 10.5753/stil.2021.17788
Subject(s) - portuguese , naive bayes classifier , support vector machine , brazilian portuguese , computer science , world wide web , data science , artificial intelligence , linguistics , philosophy
Bullying and cyberbullying are words commonly seen in today's news. Although the scientific community has evaluated text mining techniques for cyberbullying detection, few studies have targeted Brazilian Portuguese datasets. Our study aims to assess the text mining application to detect cyberbullying messages written in Brazilian Portuguese. We gathered posts and comments from Reddit communities and extracted several text features. We then processed these features using Naïve Bayes and SVM classifiers to uncover cyberbullying activity. The outcomes of this experiment may not be used solo for cyberbullying detection; however, they can aid moderators in prioritizing content reviews and acting faster on real cyberbullying cases.