Learning, Memory, and the Role of Neural Network Architecture | Zendy

Ann M. Hermundstad | Zendy; Kevin Brown | Zendy; Danielle S. Bassett | Zendy; Jean M. Carlson | Zendy

AI Assistant Blog Pricing

Home ZAIA Blog

Open Access

Learning, Memory, and the Role of Neural Network Architecture

Author(s) -

Ann M. Hermundstad,

Kevin Brown,

Danielle S. Bassett,

Jean M. Carlson

Publication year - 2011

Publication title -

plos computational biology

Language(s) - English

Resource type - Journals

SCImago Journal Rank - 2.628

H-Index - 182

eISSN - 1553-7358

pISSN - 1553-734X

DOI - 10.1371/journal.pcbi.1002063

Subject(s) - computer science , maxima and minima , variety (cybernetics) , artificial neural network , artificial intelligence , network architecture , function (biology) , task (project management) , architecture , deep learning , machine learning , theoretical computer science , mathematics , art , mathematical analysis , computer security , management , evolutionary biology , economics , visual arts , biology

The performance of information processing systems, from artificial neural networks to natural neuronal ensembles, depends heavily on the underlying system architecture. In this study, we compare the performance of parallel and layered network architectures during sequential tasks that require both acquisition and retention of information, thereby identifying tradeoffs between learning and memory processes. During the task of supervised, sequential function approximation, networks produce and adapt representations of external information. Performance is evaluated by statistically analyzing the error in these representations while varying the initial network state, the structure of the external information, and the time given to learn the information. We link performance to complexity in network architecture by characterizing local error landscape curvature. We find that variations in error landscape structure give rise to tradeoffs in performance; these include the ability of the network to maximize accuracy versus minimize inaccuracy and produce specific versus generalizable representations of information. Parallel networks generate smooth error landscapes with deep, narrow minima, enabling them to find highly specific representations given sufficient time. While accurate, however, these representations are difficult to generalize. In contrast, layered networks generate rough error landscapes with a variety of local minima, allowing them to quickly find coarse representations. Although less accurate, these representations are easily adaptable. The presence of measurable performance tradeoffs in both layered and parallel networks has implications for understanding the behavior of a wide variety of natural and artificial learning systems.

The content you want is available to Zendy users.

Already have an account? Click here to sign in.

Having issues? You can contact us here

Accelerating Research