
Stemming Analysis Indonesian Language News Text with Porter Algorithm
Author(s) -
Arif Siswandi,
Yudi Permana,
Arvita Emarilis
Publication year - 2021
Publication title -
journal of physics. conference series
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 0.21
H-Index - 85
eISSN - 1742-6596
pISSN - 1742-6588
DOI - 10.1088/1742-6596/1845/1/012019
Subject(s) - indonesian , sentence , computer science , algorithm , value (mathematics) , natural language processing , word (group theory) , artificial intelligence , linguistics , machine learning , philosophy
Stemming is the process of classifying various morphological variations of a word or sentence into one and the same basic form. In Indonesian language stemming, there are two types of stemming methods that already exist, namely the dictionary-based stemming algorithm and the non-dictionary-based stemming algorithm. In this study the algorithm used is the Indonesian Porter algorithm for dictionary-based ones. The test was carried out using 100 predetermined Indonesian text documents. The results of tests conducted show that the highest accuracy value is found in the Porter algorithm, the least Overstemming and Understemming values are also found in the Porter algorithm.