z-logo
open-access-imgOpen Access
Text pre-processing of multilingual for sentiment analysis based on social network data
Author(s) -
Neha Garg,
Kamlesh Sharma
Publication year - 2022
Publication title -
international journal of power electronics and drive systems/international journal of electrical and computer engineering
Language(s) - English
Resource type - Journals
eISSN - 2722-2578
pISSN - 2722-256X
DOI - 10.11591/ijece.v12i1.pp776-784
Subject(s) - lexical analysis , computer science , text processing , artificial intelligence , sentiment analysis , natural language processing , stop words , data processing , punctuation , word processing , word (group theory) , text segmentation , information retrieval , segmentation , preprocessor , database , linguistics , philosophy
Sentiment analysis (SA) is an enduring area for research especially in the field of text analysis. Text pre-processing is an important aspect to perform SA accurately. This paper presents a text processing model for SA, using natural language processing techniques for twitter data. The basic phases for machine learning are text collection, text cleaning, pre-processing, feature extractions in a text and then categorize the data according to the SA techniques. Keeping the focus on twitter data, the data is extracted in domain specific manner. In data cleaning phase, noisy data, missing data, punctuation, tags and emoticons have been considered. For pre-processing, tokenization is performed which is followed by stop word removal (SWR). The proposed article provides an insight of the techniques, that are used for text pre-processing, the impact of their presence on the dataset. The accuracy of classification techniques has been improved after applying text pre-processing and dimensionality has been reduced. The proposed corpus can be utilized in the area of market analysis, customer behaviour, polling analysis, and brand monitoring. The text pre-processing process can serve as the baseline to apply predictive analysis, machine learning and deep learning algorithms which can be extended according to problem definition.

The content you want is available to Zendy users.

Already have an account? Click here to sign in.
Having issues? You can contact us here