Clustering for Data-driven Unraveling Artificial Neural Networks
Author(s) -
Felipe Farias,
Teresa B. Ludermir,
Carmelo J. A. Bastos-Filho
Publication year - 2020
Publication title -
Anais do Encontro Nacional de Inteligência Artificial e Computacional (ENIAC)
Language(s) - English
Resource type - Conference proceedings
ISSN - 2763-9061
DOI - 10.5753/eniac.2020.12160
Subject(s) - cluster analysis , computer science , artificial neural network , benchmark (surveying) , artificial intelligence , perceptron , data mining , process (computing) , layer (electronics) , machine learning , multilayer perceptron , feature (linguistics) , pattern recognition (psychology) , linguistics , chemistry , philosophy , geodesy , organic chemistry , geography , operating system
This work investigates how to define Neural Network (NN) architectures with a data-driven approach that uses clustering to create sub-labels, which facilitate the learning process and help determine the number of neurons needed to compose the layers. We also increase the depth of the model, aiming to represent the samples better the deeper they flow into the model. We hypothesize that the clustering process identifies sub-regions of the feature space in which samples belonging to the same cluster have strong similarities. We validated our hypothesis on seven benchmark datasets using 10-fold cross-validation repeated 3 times. Compared with a Multi-Layer Perceptron with a single hidden layer and approximately the same number of parameters as the architectures found by our approach, the proposed model increased performance with statistical significance (p-value $< 0.05$) and never decreased it.
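The sub-label idea can be illustrated with a minimal sketch. It is not the authors' implementation: the abstract does not name the clustering algorithm or the rule that maps clusters to neurons, so the plain k-means routine, the helper names `kmeans` and `make_sublabels`, the `clusters_per_class` parameter, and the "one neuron per sub-label" reading are all assumptions made for illustration.

```python
import random


def kmeans(points, k, iters=20, seed=0):
    """Plain Lloyd's k-means (assumed here; the abstract names no algorithm).

    Returns one cluster index per point.
    """
    rng = random.Random(seed)
    centers = [list(p) for p in rng.sample(points, k)]
    assign = [0] * len(points)
    for _ in range(iters):
        # Assignment step: each point goes to its nearest center.
        assign = [
            min(range(k),
                key=lambda c: sum((a - b) ** 2 for a, b in zip(p, centers[c])))
            for p in points
        ]
        # Update step: each center moves to the mean of its members.
        for c in range(k):
            members = [p for p, a in zip(points, assign) if a == c]
            if members:
                centers[c] = [sum(col) / len(members) for col in zip(*members)]
    return assign


def make_sublabels(X, y, clusters_per_class=2, seed=0):
    """Cluster the samples of each class separately (hypothetical helper).

    Each sample receives a sub-label (class, cluster_index).
    """
    sublabels = [None] * len(X)
    for cls in sorted(set(y)):
        idx = [i for i, label in enumerate(y) if label == cls]
        k = min(clusters_per_class, len(idx))
        assign = kmeans([X[i] for i in idx], k, seed=seed)
        for i, a in zip(idx, assign):
            sublabels[i] = (cls, a)
    return sublabels


# Toy data: two classes, each made of two well-separated groups.
X = [[0, 0], [0, 1], [10, 10], [10, 11],
     [5, 5], [5, 6], [-5, -5], [-5, -4]]
y = [0, 0, 0, 0, 1, 1, 1, 1]

sublabels = make_sublabels(X, y, clusters_per_class=2)

# Assumed reading: one neuron per discovered sub-label.
hidden_width = len(set(sublabels))
print(sublabels)
print("hidden_width =", hidden_width)
```

Under this reading, a layer trained on the sub-labels would need `hidden_width` output neurons, and a later layer would map the sub-labels back to the original classes. How the clusters are actually chosen and sized is left to the paper itself.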