z-logo
open-access-imgOpen Access
Gradient Descent Optimization in Deep Learning Model Training Based on Multistage and Method Combination Strategy
Author(s) -
Chuanlei Zhang,
Minda Yao,
Wei Chen,
Shanwen Zhang,
Dufeng Chen,
Yuliang Wu
Publication year - 2021
Publication title -
security and communication networks
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 0.446
H-Index - 43
eISSN - 1939-0114
pISSN - 1939-0122
DOI - 10.1155/2021/9956773
Subject(s) - computer science , gradient descent , deep learning , artificial intelligence , artificial neural network , stochastic gradient descent , deep belief network , field (mathematics) , machine learning , adaptive optimization , gradient method , mathematical optimization , algorithm , mathematics , pure mathematics , operating system
Gradient descent is the core and foundation of neural networks, and gradient descent optimization heuristics have greatly accelerated progress in deep learning. Although these methods are simple and effective, how they work remains unknown. Gradient descent optimization in deep learning has become a hot research topic. Some research efforts have tried to combine multiple methods to assist network training, but these methods seem to be more empirical, without theoretical guides. In this paper, a framework is proposed to illustrate the principle of combining different gradient descent optimization methods by analyzing several adaptive methods and other learning rate methods. Furthermore, inspired by the principle of warmup, CLR, and SGDR, the concept of multistage is introduced into the field of gradient descent optimization, and a gradient descent optimization strategy in deep learning model training based on multistage and method combination strategy is presented. The effectiveness of the proposed strategy is verified on the massive deep learning network training experiments.

The content you want is available to Zendy users.

Already have an account? Click here to sign in.
Having issues? You can contact us here
Accelerating Research

Address

John Eccles House
Robert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom