The Inefficiency of Batch Training for Large Training Sets



Open Access


Author(s) - D. Randall Wilson, Tony R. Martinez

Publication year - 2000

Language(s) - English

DOI - 10.1109/ijcnn.2000.10003

Multilayer perceptrons are often trained using error backpropagation (BP). BP training can be done in either a batch or continuous manner. Claims have frequently been made that batch training is faster and/or more "correct" than continuous training because it uses a better approximation of the true gradient for its weight updates. These claims are often supported by empirical evidence on very small data sets. These claims are untrue, however, for large training sets. This paper explains why batch training is much slower than continuous training for large training sets. Various levels of semi-batch training used on a 20,000-instance speech recognition task show a roughly linear increase in training time required with an increase in batch size.
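The difference between batch, semi-batch, and continuous training comes down to how many instances are accumulated before each weight update. The following is a minimal sketch of that idea, not the authors' experiment: it fits a one-variable linear model rather than a multilayer perceptron, uses synthetic noiseless data, and averages the gradient over each batch so that one learning rate can be shared across batch sizes. The names `make_data`, `train`, and `mse` are hypothetical helpers introduced only for illustration.

```python
import random


def make_data(n=200, seed=0):
    """Synthetic, noiseless data for y = 3x + 0.5 (an assumption for illustration)."""
    rng = random.Random(seed)
    xs = [rng.uniform(-1.0, 1.0) for _ in range(n)]
    ys = [3.0 * x + 0.5 for x in xs]
    return xs, ys


def mse(w, b, xs, ys):
    """Mean squared error of the linear model w*x + b on the data."""
    return sum((w * x + b - y) ** 2 for x, y in zip(xs, ys)) / len(xs)


def train(xs, ys, batch_size, lr=0.1, epochs=5):
    """Gradient descent with one weight update per batch.

    batch_size == 1        -> continuous (on-line) training
    batch_size == len(xs)  -> full batch training
    anything in between    -> semi-batch training
    Returns the learned (w, b) and the number of weight updates performed.
    """
    w, b = 0.0, 0.0
    updates = 0
    n = len(xs)
    for _ in range(epochs):
        for start in range(0, n, batch_size):
            bx = xs[start:start + batch_size]
            by = ys[start:start + batch_size]
            # Average the per-instance gradients over the batch
            # (an assumption of this sketch, made to keep lr comparable).
            gw = sum((w * x + b - y) * x for x, y in zip(bx, by)) / len(bx)
            gb = sum((w * x + b - y) for x, y in zip(bx, by)) / len(bx)
            w -= lr * gw
            b -= lr * gb
            updates += 1
    return w, b, updates


if __name__ == "__main__":
    xs, ys = make_data()
    for batch_size in (1, 20, 200):
        w, b, updates = train(xs, ys, batch_size)
        print(batch_size, updates, mse(w, b, xs, ys))
```

With the same number of passes over the data, continuous training performs one update per instance, while full batch training performs one update per epoch. On this toy problem that leaves the batch-trained model much further from the solution after the same number of epochs. How closely this carries over to the speech recognition task in the paper is not something the sketch can show.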
