Nonlinear All-Optical Diffractive Deep Neural Network with 10.6 μm Wavelength for Image Classification
Author(s) -
Yichen Sun,
Mingli Dong,
Mingxin Yu,
Jiabin Xia,
Xu Zhang,
Yuchen Bai,
Lidan Lu,
Lianqing Zhu
Publication year - 2021
Publication title -
international journal of optics
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 0.263
H-Index - 17
eISSN - 1687-9392
pISSN - 1687-9384
DOI - 10.1155/2021/6667495
Subject(s) - mnist database , artificial neural network , activation function , computer science , artificial intelligence , nonlinear system , deep learning , pattern recognition (psychology) , inference , parametric statistics , physics , mathematics , statistics , quantum mechanics
A photonic artificial intelligence chip is based on an optical neural network (ONN), low power consumption, low delay, and strong antiinterference ability. The all-optical diffractive deep neural network has recently demonstrated its inference capabilities on the image classification task. However, the size of the physical model does not have miniaturization and integration, and the optical nonlinearity is not incorporated into the diffraction neural network. By introducing the nonlinear characteristics of the network, complex tasks can be completed with high accuracy. In this study, a nonlinear all-optical diffraction deep neural network (N-D2NN) model based on 10.6 μm wavelength is constructed by combining the ONN and complex-valued neural networks with the nonlinear activation function introduced into the structure. To be specific, the improved activation function of the rectified linear unit (ReLU), i.e., Leaky-ReLU, parametric ReLU (PReLU), and randomized ReLU (RReLU), is selected as the activation function of the N-D2NN model. Through numerical simulation, it is proved that the N-D2NN model based on 10.6 μm wavelength has excellent representation ability, which enables them to perform classification learning tasks of the MNIST handwritten digital dataset and Fashion-MNIST dataset well, respectively. The results show that the N-D2NN model with the RReLU activation function has the highest classification accuracy of 97.86% and 89.28%, respectively. These results provide a theoretical basis for the preparation of miniaturized and integrated N-D2NN model photonic artificial intelligence chips.
Accelerating Research
Robert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom
Address
John Eccles HouseRobert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom