z-logo
open-access-imgOpen Access
Improving Text Preprocessing For Student Complaint Document Classification Using Sastrawi
Author(s) -
Mochamad Alfan Rosid,
Arif Senja Fitrani,
Ika Ratna Indra Astutik,
Nasrudin Iqrok Mulloh,
Haris Ahmad Gozali
Publication year - 2020
Publication title -
iop conference series. materials science and engineering
Language(s) - English
Resource type - Journals
eISSN - 1757-899X
pISSN - 1757-8981
DOI - 10.1088/1757-899x/874/1/012017
Subject(s) - preprocessor , computer science , data pre processing , process (computing) , selection (genetic algorithm) , information retrieval , artificial intelligence , complaint , data mining , natural language processing , law , operating system , political science
In the text mining there are stages that must be passed namely the text preprocessing stage. Text preprocessing is the stage to do the data selection process in each document, including case folding, tokenizing, filtering, and stemming. The results of the preprocessing process can affect the accuracy of document classification. In documents Bahasa Indonesia, there are still often over-stemming and under-stemming, so improvements are needed in the stemming process. In this study, it is proposed to use sastrawi libraries to improve the results of previous studies that are still not optimal in the results of preprocessing, especially in the filtering and stemming process. From the results of the study, the sastrawi library is able to reduce over stemming and under stemming and a faster processing time compared to using a Tala stemmer.

The content you want is available to Zendy users.

Already have an account? Click here to sign in.
Having issues? You can contact us here