z-logo
open-access-imgOpen Access
Quantisation and pooling method for low‐inference‐latency spiking neural networks
Author(s) -
Lin Zhitao,
Shen Juncheng,
Ma De,
Meng Jianyi
Publication year - 2017
Publication title -
electronics letters
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 0.375
H-Index - 146
ISSN - 1350-911X
DOI - 10.1049/el.2017.2219
Subject(s) - spiking neural network , mnist database , pooling , computer science , inference , latency (audio) , artificial intelligence , artificial neural network , convolutional neural network , pattern recognition (psychology) , telecommunications
Spiking neural network (SNN) that converted from conventional deep neural network (DNN) has shown great potential as a solution for fast and efficient recognition. A layer‐wise quantisation method based on retraining is proposed to quantise the activation of DNN, which reduces the number of time steps required by converted SNN to achieve minimal accuracy loss. Pooling function is incorporated into convolutional layers to reduce at most 20% of spiking neurons. The converted SNNs achieved 99.15% accuracy on MNIST and 82.9% on CIFAR10 by only seven time steps, and only 10–40% of spikes need to be processed compared with networks using traditional algorithms. The experimental results show that the proposed methods are able to build hardware‐friendly SNNs with ultra‐low‐inference latency.

The content you want is available to Zendy users.

Already have an account? Click here to sign in.
Having issues? You can contact us here