Influence of Data Preprocessing
Author(s) - Changming Zhu, Daqi Gao
Publication year - 2016
Publication title - Journal of Computing Science and Engineering
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 0.172
H-Index - 16
eISSN - 2093-8020
pISSN - 1976-4677
DOI - 10.5626/jcse.2016.10.2.51
Subject(s) - preprocessor , computer science , data pre processing , classifier (uml) , data mining , artificial intelligence , pattern recognition (psychology)
In this paper, we study the influence of data preprocessing on classification. We conclude that different preprocessing methods lead to different classification performance. Moreover, not every preprocessing method is necessary, and we give a criterion for determining which preprocessing methods are necessary and which are effective. Experiments on several real-world data sets confirm that different preprocessing methods produce different effects. Further experiments that pair several algorithms with different preprocessing methods confirm that preprocessing strongly influences the performance of a classifier.
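As a rough illustration of the central claim, that the choice of preprocessing can change a classifier's result, the sketch below compares a 1-nearest-neighbour classifier on raw features and on min-max scaled features. It is not the paper's method or its criterion. The function names (`min_max_scale`, `loo_accuracy_1nn`) and the six-point data set are hypothetical, and the data are deliberately constructed so that an uninformative large-scale feature dominates the raw distances.

```python
import math


def min_max_scale(rows):
    """Rescale every feature column to the [0, 1] range."""
    cols = list(zip(*rows))
    lows = [min(c) for c in cols]
    # Guard against constant columns, whose span would be zero.
    spans = [(max(c) - min(c)) or 1.0 for c in cols]
    return [[(v - lo) / sp for v, lo, sp in zip(row, lows, spans)]
            for row in rows]


def loo_accuracy_1nn(rows, labels):
    """Leave-one-out accuracy of a 1-nearest-neighbour classifier."""
    correct = 0
    for i, query in enumerate(rows):
        # Nearest neighbour among all other points (Euclidean distance).
        nearest = min((j for j in range(len(rows)) if j != i),
                      key=lambda j: math.dist(query, rows[j]))
        correct += labels[nearest] == labels[i]
    return correct / len(rows)


# Toy data invented for illustration only (not from the paper):
# feature 0 separates the classes but has a small range,
# feature 1 is uninformative but has a large range.
X = [[0.1, 900.0], [0.2, 100.0], [0.3, 500.0],
     [0.8, 120.0], [0.9, 880.0], [1.0, 480.0]]
y = [0, 0, 0, 1, 1, 1]

raw_acc = loo_accuracy_1nn(X, y)
scaled_acc = loo_accuracy_1nn(min_max_scale(X), y)
print(f"raw features:    {raw_acc:.2f}")
print(f"min-max scaled:  {scaled_acc:.2f}")
```

On this contrived data the scaled features give a higher leave-one-out accuracy than the raw ones. On real data sets the gap may be smaller, absent, or reversed, which is consistent with the abstract's point that not every preprocessing step is necessary.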