Automatic hyperparameter tuning in on-line learning: Classic Momentum and ADAM

2020 
We propose a method that adapts hyperparameters, namely step-sizes and momentum decay factors, in on-line learning with classic momentum and ADAM. The approach is based on estimating the short- and long-term influence of these hyperparameters on the loss value. In the experimental study, the approach is applied to on-line learning in small neural networks and deep autoencoders. The automatically tuned coefficients match or surpass, in terms of learning speed, the best coefficients selected manually. As a result, on-line learning becomes a fully automatic process that produces results from the first run, without preliminary experiments aimed at manual hyperparameter tuning.
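The abstract does not spell out the estimator it uses, so the sketch below (NumPy) only illustrates the setting: the two optimizers named above, classic momentum and Adam, together with one well-known way of adapting a step-size on-line from its effect on the loss, hypergradient descent (Baydin et al., 2018). The adaptation rule, the meta step-size meta_lr, and the toy objective are illustrative assumptions, not the paper's method.

    import numpy as np

    def quadratic_loss(w):
        # Toy objective f(w) = 0.5 * ||w||^2; its gradient is simply w.
        return 0.5 * np.dot(w, w), w

    def classic_momentum_step(w, v, grad, lr=0.01, mu=0.9):
        # Classic (heavy-ball) momentum: v <- mu*v - lr*grad; w <- w + v.
        v = mu * v - lr * grad
        return w + v, v

    def adam_step(w, m, s, grad, t, lr=0.001, b1=0.9, b2=0.999, eps=1e-8):
        # Standard Adam update with bias-corrected first and second moments.
        m = b1 * m + (1 - b1) * grad
        s = b2 * s + (1 - b2) * grad ** 2
        m_hat = m / (1 - b1 ** t)
        s_hat = s / (1 - b2 ** t)
        return w - lr * m_hat / (np.sqrt(s_hat) + eps), m, s

    def hypergradient_lr(lr, grad, prev_grad, meta_lr=1e-4):
        # Hypergradient descent (derived for plain SGD, used heuristically here):
        # d(loss)/d(lr) ~ -grad . prev_grad, so the step-size grows while
        # successive gradients stay aligned and shrinks when they oppose.
        return lr + meta_lr * np.dot(grad, prev_grad)

    # Demo: classic momentum with an on-line adapted step-size.
    w, v = np.array([3.0, -2.0]), np.zeros(2)
    lr, prev_g = 0.05, np.zeros(2)
    for t in range(1, 51):
        loss, g = quadratic_loss(w)
        lr = hypergradient_lr(lr, g, prev_g)  # adapt the step-size each step
        w, v = classic_momentum_step(w, v, g, lr=lr)
        prev_g = g
    print("final loss:", quadratic_loss(w)[0])

On this quadratic the adapted step-size grows while successive gradients point the same way and shrinks once the iterates begin to oscillate; the abstract's "short- and long-term influence" suggests a richer estimator than this one-step signal, whose details are not given here.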