Indonesian Named-entity Recognition for 15 Classes Using Ensemble Supervised Learning
Author(s) -
Aditya Satrya Wibawa,
Ayu Purwarianti
Publication year - 2016
Publication title -
procedia computer science
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 0.334
H-Index - 76
ISSN - 1877-0509
DOI - 10.1016/j.procs.2016.04.053
Subject(s) - computer science , named entity recognition , artificial intelligence , feature (linguistics) , sentence , machine learning , ensemble learning , indonesian , natural language processing , class (philosophy) , pattern recognition (psychology) , linguistics , philosophy , management , economics , task (project management)
Here, we describe our effort in building Indonesian Named Entity Recognition (NER) for newspaper article with 15 classes which is larger number of class type compared to existing Indonesian NER. We employed supervised machine learning in the NER and conducted experiments to find the best attribute combination and the best algorithm with highest accuracy. We compared the attribute of word level, sentence level and document level. In the algorithm, we compared several single machine learning algorithms and also an ensembled one. Using 457 news articles, the best accuracy was achieved by using ensemble technique where the result of several machine learning algorithms were used as the feature for one machine learning algorithm
Accelerating Research
Robert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom
Address
John Eccles HouseRobert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom