
Lexicon Based Sentiment Analysis in Indonesia Languages : A Systematic Literature Review
Author(s) -
Yuli Fauziah,
Bambang Darmo Yuwono,
Agus Sasmito Aribowo
Publication year - 2021
Publication title -
rsf conference proceeding series. engineering and technology
Language(s) - English
Resource type - Journals
eISSN - 2809-6843
pISSN - 2809-6878
DOI - 10.31098/cset.v1i1.397
Subject(s) - lexicon , sentiment analysis , computer science , natural language processing , indonesian , artificial intelligence , lemmatisation , sentence , lexical analysis , stop words , preprocessor , focus (optics) , punctuation , naive bayes classifier , lexical database , linguistics , wordnet , philosophy , physics , support vector machine , optics
This systematic literature review aims to determine the trend of lexicon based sentiment analysis research in Indonesian Language in the last two years. The focus of the study is on the understanding of preprocessing used in lexicon-based sentiment analysis studies in the last two years, the lexicon used in these studies, and classification accuracy. The main question in this SLR : what techniques of lexicon based sentiment analysis will provide the highest accuracy. The most widely used preprocessing methods in previous research are tokenization, case conversion, stemming, remove punctuation, remove stop word, remove or replace emoji and emoticons, and normalization or slangword conversion. The sentiment labeling process in previous studies calculated based on the comparison of the number of negative sentiment keywords with positive sentiment keywords in one sentence. The maximum accuracy from previous study is 90%. The most widely used lexicon is NRC and Inset which is a lexicon dictionary in Indonesian. Knowledge of this can be used to propose a better model for lexicon based sentiment analysis in Indonesian Languages.