Multicore and GPU Parallelization of Neural Networks for Face Recognition
Author(s) -
Altaf Ahmad Huqqani,
Erich Schikuta,
Sicen Ye,
Peng Chen
Publication year - 2013
Publication title -
procedia computer science
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 0.334
H-Index - 76
ISSN - 1877-0509
DOI - 10.1016/j.procs.2013.05.198
Subject(s) - computer science , cuda , artificial neural network , multi core processor , backpropagation , parallel computing , tree traversal , deep learning , face (sociological concept) , task (project management) , focus (optics) , artificial intelligence , computer architecture , machine learning , algorithm , social science , physics , management , sociology , optics , economics
Training of Artificial Neural Networks for large data sets is a time consuming task. Various approaches have been proposed to reduce the efforts, many of them by applying parallelization techniques. In this paper we develop and analyze two novel parallel training approaches for Backpropagation neural networks for face recognition. We focus on two specific paralleliza- tion environments, using on the one hand OpenMP on a conventional multithreaded CPU and CUDA on a GPU. Based on our findings we give guidelines for the efficient parallelization of Backpropagation neural networks on multicore and GPU architectures.Additionally, we present a traversal method finding the best combination of learning rate and momentum term by varying the number of hidden neurons supporting the parallelization efforts
Accelerating Research
Robert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom
Address
John Eccles HouseRobert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom