A SURVEY ON TEXT PRE-PROCESSING AND FEATURE EXTRACTION TECHNIQUES FOR SENTIMENT ANALYSIS OF TWITTER DATA | Zendy

Anupam Sharma | Zendy; Mahesh Parmar | Zendy

AI Assistant Blog Pricing

Home ZAIA Blog

Open Access

A SURVEY ON TEXT PRE-PROCESSING AND FEATURE EXTRACTION TECHNIQUES FOR SENTIMENT ANALYSIS OF TWITTER DATA

Author(s) -

Anupam Sharma,

Mahesh Parmar

Publication year - 2021

Publication title -

international research journal of computer science

Language(s) - English

Resource type - Journals

ISSN - 2393-9842

DOI - 10.26562/irjcs.2021.v0812.001

Subject(s) - computer science , lexical analysis , artificial intelligence , natural language processing , preprocessor , lemmatisation , information extraction , stop words , information retrieval , data pre processing , sentiment analysis , pipeline (software) , feature (linguistics) , categorization , natural language , feature extraction , programming language , linguistics , philosophy

Natural Language Processing (NLP) is a popular branch of Artificial Intelligence (AI) that is concerned with how machines comprehend and interpret human language. A computer can only comprehend information in the form of numbers, not words. As a result, it's critical to research what preprocessing and feature extraction techniques are required to implement on a human language so that when it's transformed to machine-readable form, the computer can understand it. Text classification plays a vital role in NLP jobs, with applications in web search, document categorization, chatbots, virtual assistants, and other areas. Sentences and writings that lack structure are naturally difficult to convert to a machine-readable format. Pre-processing techniques have been given special attention since they serve as a precursor to the following stages of information retrieval strategies. Any errors in the early preprocessing approaches are passed on to the NLP pipeline's later stages. In addition, the order in which techniques like tokenization, StopWords elimination, and lemmatization are used is carefully considered. Information retrieval systems are particularly concerned with how efficiently textual input is cleaned and filtered to remove noisy material that does not add to efficiency and, moreover, leads to incorrect results. This survey work emphasizes the importance of effective text preprocessing approaches, as well as information retrieval using Natural Language Processing feature extraction techniques.

The content you want is available to Zendy users.

Already have an account? Click here to sign in.

Having issues? You can contact us here

Empowering knowledge with every search

About

About Careers Publisher Partners Contact Us

Learn

FAQs Blog Terms of Use Privacy Policy

About

Learn

Discover

Explore