z-logo
open-access-imgOpen Access
MoArLex: An Arabic Sentiment Lexicon Built Through Automatic Lexicon Expansion
Author(s) -
Mohab Youssef,
Samhaa R. El-Beltagy
Publication year - 2018
Publication title -
procedia computer science
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 0.334
H-Index - 76
ISSN - 1877-0509
DOI - 10.1016/j.procs.2018.10.464
Subject(s) - lexicon , computer science , sentiment analysis , arabic , natural language processing , social media , artificial intelligence , quality (philosophy) , word (group theory) , world wide web , linguistics , philosophy , epistemology
Research addressing Sentiment Analysis has witnessed great attention over the last decade especially after the huge increase in social media networks usage. Social networks like Facebook and Twitter generate an incredible amount of data on a daily basis, containing posts that discuss all kinds of different topics ranging from sports and products to politics and current events. Since data generated within these mediums is created by users from all over the world, it is multilingual in nature. Arabic is one of the important languages recently targeted by many sentiment analysis efforts. However, Arabic is considered to be under-resourced in terms of lexicons and datasets when compared to English. This paper presents a novel technique for automatically expanding an Arabic sentiment lexicon using word embeddings. Evaluation of the quality of the automatically added terms was done in multiple ways, all of which have shown that lexicon entries added using the presented way are more accurate than sentiment lexicon entries obtained using machine learning or distant supervision methods.

The content you want is available to Zendy users.

Already have an account? Click here to sign in.
Having issues? You can contact us here
Accelerating Research

Address

John Eccles House
Robert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom