Optimal memory-aware backpropagation of deep join networks
Author(s) -
Olivier Beaumont,
Julien Herrmann,
Guillaume Pallez,
Alena Shilova
Publication year - 2020
Publication title -
Philosophical Transactions of the Royal Society A: Mathematical, Physical and Engineering Sciences
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 1.074
H-Index - 169
eISSN - 1471-2962
pISSN - 1364-503X
DOI - 10.1098/rsta.2019.0049
Subject(s) - computer science , backpropagation , homogeneous , join (topology) , artificial intelligence , artificial neural network , theoretical computer science , scheduling (production processes) , bounded function , mathematical optimization , mathematics , mathematical analysis , combinatorics
The memory needs of deep learning training can prevent the user from considering large models and large batch sizes. In this work, we propose to use techniques from memory-aware scheduling and automatic differentiation (AD) to execute a backpropagation graph with a bounded memory requirement, at the cost of extra recomputations. The case of a single homogeneous chain, i.e. a network whose stages are all identical and form a chain, is well understood, and optimal solutions have been proposed in the AD literature. The networks encountered in practice in the context of deep learning are much more diverse, both in shape and in heterogeneity. In this work, we define the class of backpropagation graphs and extend the family of graphs for which a solution minimizing the total number of recomputations can be computed in polynomial time. In particular, we consider join graphs, which correspond to models such as siamese or cross-modal networks. This article is part of a discussion meeting issue 'Numerical algorithms for high-performance computational science'.
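The trade-off described in the abstract — bounding memory by storing only some activations and recomputing the rest during the backward pass — can be illustrated on a toy homogeneous chain. The sketch below is not the authors' algorithm (which computes an optimal schedule); it uses a simple uniform checkpointing strategy, and the stage function `tanh` is an arbitrary choice for illustration.

```python
import math

def stage(x):
    # One homogeneous stage of the chain (toy choice: tanh).
    return math.tanh(x)

def stage_grad(x):
    # Derivative of the stage with respect to its input.
    return 1.0 - math.tanh(x) ** 2

def checkpointed_grad(x0, n_stages, every):
    """Gradient of the chain output w.r.t. x0, storing only every
    `every`-th activation and recomputing the segments in between."""
    # Forward pass: keep only the checkpointed activations.
    checkpoints = {0: x0}
    x = x0
    for i in range(n_stages):
        x = stage(x)
        if (i + 1) % every == 0:
            checkpoints[i + 1] = x
    # Backward pass: recompute each stage input from its nearest checkpoint.
    grad = 1.0
    for i in reversed(range(n_stages)):
        base = (i // every) * every
        x = checkpoints[base]
        for _ in range(base, i):      # extra recomputations, bounded memory
            x = stage(x)
        grad *= stage_grad(x)         # chain rule, from output to input
    return grad

def full_grad(x0, n_stages):
    """Reference: store all activations (memory grows with chain length)."""
    xs = [x0]
    for _ in range(n_stages):
        xs.append(stage(xs[-1]))
    grad = 1.0
    for i in reversed(range(n_stages)):
        grad *= stage_grad(xs[i])
    return grad
```

With a chain of `n` stages and checkpoints every `k` stages, peak storage drops from `n` activations to roughly `n / k`, while each segment is recomputed once; the optimal (non-uniform) placement of checkpoints for chains is the classical AD result the abstract refers to.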
Accelerating Research
John Eccles House, Robert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom