The Remarkable Robustness of Surrogate Gradient Learning for Instilling Complex Function in Spiking Neural Networks | Zendy

Friedemann Zenke | Zendy; Tim P. Vogels | Zendy

AI Assistant Blog Pricing

Home ZAIA Blog

Open Access

The Remarkable Robustness of Surrogate Gradient Learning for Instilling Complex Function in Spiking Neural Networks

Author(s) -

Friedemann Zenke,

Tim P. Vogels

Publication year - 2021

Publication title -

neural computation

Language(s) - English

Resource type - Journals

SCImago Journal Rank - 1.235

H-Index - 169

eISSN - 1530-888X

pISSN - 0899-7667

DOI - 10.1162/neco_a_01367

Subject(s) - computer science , robustness (evolution) , artificial intelligence , surrogate model , artificial neural network , machine learning , spiking neural network , gradient descent , activation function , neuromorphic engineering , biochemistry , chemistry , gene

Brains process information in spiking neural networks. Their intricate connections shape the diverse functions these networks perform. Yet how network connectivity relates to function is poorly understood, and the functional capabilities of models of spiking networks are still rudimentary. The lack of both theoretical insight and practical algorithms to find the necessary connectivity poses a major impediment to both studying information processing in the brain and building efficient neuromorphic hardware systems. The training algorithms that solve this problem for artificial neural networks typically rely on gradient descent. But doing so in spiking networks has remained challenging due to the nondifferentiable nonlinearity of spikes. To avoid this issue, one can employ surrogate gradients to discover the required connectivity. However, the choice of a surrogate is not unique, raising the question of how its implementation influences the effectiveness of the method. Here, we use numerical simulations to systematically study how essential design parameters of surrogate gradients affect learning performance on a range of classification problems. We show that surrogate gradient learning is robust to different shapes of underlying surrogate derivatives, but the choice of the derivative's scale can substantially affect learning performance. When we combine surrogate gradients with suitable activity regularization techniques, spiking networks perform robust information processing at the sparse activity limit. Our study provides a systematic account of the remarkable robustness of surrogate gradient learning and serves as a practical guide to model functional spiking neural networks.

The content you want is available to Zendy users.

Already have an account? Click here to sign in.

Having issues? You can contact us here

Accelerating Research