Quantisation and pooling method for low‐inference‐latency spiking neural networks | Zendy

Lin Zhitao | Zendy; Shen Juncheng | Zendy; Ma De | Zendy; Meng Jianyi | Zendy

AI Assistant Blog Pricing

Home ZAIA Blog

Open Access

Quantisation and pooling method for low‐inference‐latency spiking neural networks

Author(s) -

Lin Zhitao,

Shen Juncheng,

Ma De,

Meng Jianyi

Publication year - 2017

Publication title -

electronics letters

Language(s) - English

Resource type - Journals

SCImago Journal Rank - 0.375

H-Index - 146

ISSN - 1350-911X

DOI - 10.1049/el.2017.2219

Subject(s) - spiking neural network , mnist database , pooling , computer science , inference , latency (audio) , artificial intelligence , artificial neural network , convolutional neural network , pattern recognition (psychology) , telecommunications

Spiking neural network (SNN) that converted from conventional deep neural network (DNN) has shown great potential as a solution for fast and efficient recognition. A layer‐wise quantisation method based on retraining is proposed to quantise the activation of DNN, which reduces the number of time steps required by converted SNN to achieve minimal accuracy loss. Pooling function is incorporated into convolutional layers to reduce at most 20% of spiking neurons. The converted SNNs achieved 99.15% accuracy on MNIST and 82.9% on CIFAR10 by only seven time steps, and only 10–40% of spikes need to be processed compared with networks using traditional algorithms. The experimental results show that the proposed methods are able to build hardware‐friendly SNNs with ultra‐low‐inference latency.

The content you want is available to Zendy users.

Already have an account? Click here to sign in.

Having issues? You can contact us here

Empowering knowledge with every search

About

About Careers Publisher Partners Contact Us

Learn

FAQs Blog Terms of Use Privacy Policy

About

Learn

Discover

Explore