Open Access
An Analytical Review of Preprocessing Techniques in Bengali Natural Language Processing
Author(s) -
Sovon Chakraborty,
Protiva Das,
Shakib Mahmud Dipto,
Md. Aktaruzzaman Pramanik,
Jannatun Noor
Publication year - 2025
Publication title -
IEEE Access
Language(s) - English
Resource type - Magazines
SCImago Journal Rank - 0.587
H-Index - 127
eISSN - 2169-3536
DOI - 10.1109/access.2025.3574234
Subject(s) - aerospace, bioengineering, communication, networking and broadcast technologies, components, circuits, devices and systems, computing and processing, engineered materials, dielectrics and plasmas, engineering profession, fields, waves and electromagnetics, general topics for engineers, geoscience, nuclear engineering, photonics and electrooptics, power, energy and industry applications, robotics and control systems, signal processing and analysis, transportation
Research in Bengali Natural Language Processing (BNLP) is rapidly expanding. Although Bengali is one of the most widely spoken languages in the world, BNLP research remains insufficient, particularly in Bengali speech recognition. The language's rich morphology, agglutinative structure, and diverse dialects make text and speech processing especially challenging. However, these challenges can be addressed with effective preprocessing techniques. Various organizations in Bangladesh and West Bengal are integrating Natural Language Processing (NLP) into their services, but without a thorough understanding of preprocessing, these implementations remain incomplete. Applying proper preprocessing techniques to Bengali will serve as a foundation for developing robust NLP applications. This paper presents a comprehensive review of preprocessing techniques in BNLP based on state-of-the-art research. It covers key areas such as sentiment analysis, Named Entity Recognition, speech recognition, text categorization, and summarization. First, the paper provides an in-depth discussion of Bengali language characteristics and research areas in BNLP. It then explores the challenges researchers face in processing Bengali text and speech. Additionally, it details various preprocessing techniques, highlighting their advantages and disadvantages. Finally, the paper examines future directions for BNLP, emphasizing the role of effective preprocessing in advancing the field.
