Dimensionality Reduction Methods Used in Machine Learning
Author(s) -
Kristóf Muhi,
Zsolt Csaba Johanyák
Publication year - 2020
Publication title -
műszaki tudományos közlemények
Language(s) - English
Resource type - Journals
ISSN - 2601-5773
DOI - 10.33894/mtk-2020.13.27
Subject(s) - dimensionality reduction , preprocessor , raw data , computer science , data pre processing , artificial intelligence , curse of dimensionality , reduction (mathematics) , pattern recognition (psychology) , machine learning , data mining , feature (linguistics) , data reduction , missing data , mathematics , linguistics , philosophy , geometry , programming language
In most cases, a dataset obtained through observation, measurement, etc. cannot be directly used for the training of a machine learning based system due to the unavoidable existence of missing data, inconsistencies and high dimensional feature space. Additionally, the individual features can contain quite different data types and ranges. For this reason, a data preprocessing step is nearly always necessary before the data can be used. This paper gives a short review of the typical methods applicable in the preprocessing and dimensionality reduction of raw data.
Accelerating Research
Robert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom
Address
John Eccles HouseRobert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom