Packing Convolutional Neural Networks in the Frequency Domain | Zendy

Yunhe Wang | Zendy; Chang Xu | Zendy; Chao Xu | Zendy; Dacheng Tao | Zendy

AI Assistant Blog Pricing

Home ZAIA Blog

Open Access

Packing Convolutional Neural Networks in the Frequency Domain

Author(s) -

Yunhe Wang,

Chang Xu,

Chao Xu,

Dacheng Tao

Publication year - 2018

Publication title -

ieee transactions on pattern analysis and machine intelligence

Language(s) - English

Resource type - Journals

SCImago Journal Rank - 3.811

H-Index - 372

eISSN - 1939-3539

pISSN - 0162-8828

DOI - 10.1109/tpami.2018.2857824

Subject(s) - convolution (computer science) , convolutional neural network , discrete cosine transform , frequency domain , computer science , benchmark (surveying) , algorithm , artificial intelligence , pattern recognition (psychology) , data compression , kernel (algebra) , domain (mathematical analysis) , computational complexity theory , transform coding , artificial neural network , image (mathematics) , computer vision , mathematics , mathematical analysis , geodesy , combinatorics , geography

Deep convolutional neural networks (CNNs) are successfully used in a number of applications. However, their storage and computational requirements have largely prevented their widespread use on mobile devices. Here we present a series of approaches for compressing and speeding up CNNs in the frequency domain, which focuses not only on smaller weights but on all the weights and their underlying connections. By treating convolution filters as images, we decompose their representations in the frequency domain as common parts (i.e., cluster centers) shared by other similar filters and their individual private parts (i.e., individual residuals). A large number of low-energy frequency coefficients in both parts can be discarded to produce high compression without significantly compression romising accuracy. Furthermore, we explore a data-driven method for removing redundancies in both spatial and frequency domains, which allows us to discard more useless weights by keeping similar accuracies. After obtaining the optimal sparse CNN in the frequency domain, we relax the computational burden of convolution operations in CNNs by linearly combining the convolution responses of discrete cosine transform (DCT) bases. The compression and speed-up ratios of the proposed algorithm are thoroughly analyzed and evaluated on benchmark image datasets to demonstrate its superiority over state-of-the-art methods.

The content you want is available to Zendy users.

Already have an account? Click here to sign in.

Having issues? You can contact us here

Accelerating Research