
PERBANDINGAN KINERJA METODE PRA-PEMROSESAN DALAM PENGKLASIFIKASIAN OTOMATIS DOKUMEN PATEN
Author(s) -
Budi Nugroho,
Asep Denih
Publication year - 2020
Publication title -
komputasi
Language(s) - English
Resource type - Journals
eISSN - 2654-3990
pISSN - 1693-7554
DOI - 10.33751/komputasi.v17i2.2148
Subject(s) - normalization (sociology) , data pre processing , computer science , preprocessor , support vector machine , pattern recognition (psychology) , artificial intelligence , standardization , graph , data processing , data mining , database , theoretical computer science , sociology , anthropology , operating system
This paper presents a performance analysis and comparison of several pre-processing methods used in automatic patent classification with graph kernels for Support Vector Machine (SVM). The pre-processing methods are based on the data transform techniques, namely data scaling, data centering, data standardization, data normalization, the Box-Cox transform and the Yeo-Johnson transform. The automatic patent classification is designed to classify an input of patent citation graphs into one of 10 possible classes of the International Patent Classification (IPC). The input is taken with various background conditions. The experiments showed that the best result is achieved when the pre-processing method is data normalization, achieving a classification accuracy of up to 85.33.15% for the KEHL and 93.80% for the KVHL. In contrast, for the KEHG, the preprocessing method application decreased the accuracy.