Detecting Epidemic Diseases Using Sentiment Analysis of Arabic Tweets
Author(s) -
Qanita Bani Baker,
Farah Shatnawi,
Saif Rawashdeh,
Mohammad AL-Smadi,
Yaser Jararweh
Publication year - 2020
Publication title -
jucs - journal of universal computer science
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 0.284
H-Index - 53
eISSN - 0948-695X
pISSN - 0948-6968
DOI - 10.3897/jucs.2020.004
Subject(s) - computer science , arabic , naive bayes classifier , sentiment analysis , support vector machine , artificial intelligence , natural language processing , k nearest neighbors algorithm , bayes' theorem , machine learning , linguistics , bayesian probability , philosophy
Opinion mining is an important step towards facilitating information in health data. Several studies have demonstrated the possibility of tracking diseases using public tweets. However, most studies were applied to English language tweets. Influenza is currently one of the world's greatest infectious disease challenges. In this study, a new approach is proposed in order to detect Influenza using machine learning techniques from Arabic tweets in Arab countries. This paper is the first study of epidemic diseases based on Arabic language tweets. In this work, we have collected, labeled, filtered and analyzed the influenza-related tweets written in the Arabic language. Several classifiers were used to measure the quality and the performance of the approach, which are: Naive Bayes, Support Vector Machines, Decision Trees, and K-Nearest Neighbor. The classifiers which achieved the best accuracy results for the three experiments were: Naïve Bayes with 89.06%, and K-Nearest Neighbor with 86.43%, respectively.
Accelerating Research
Robert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom
Address
John Eccles HouseRobert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom