Training recurrent networks

M. W. Pedersen

Training recurrent networks

1997

M. W. Pedersen

Training recurrent networks is generally believed to be a difficult task. Excessive training times and lack of convergence to an acceptable solution are frequently reported. In this paper we seek to explain the reason for this from a numerical point of view and show how to avoid problems when training. In particular we investigate ill-conditioning, the need for and effect of regularization and illustrate the superiority of second-order methods for training.

Keywords:

Convergence (routing)
Newton's method
Machine learning
Recurrent neural network
Regularization (mathematics)
Artificial intelligence
Computer science
output feedback
recurrent neural nets
ill conditioning

Correction
Source
Cite
Save
Machine Reading By IdeaReader

References

Citations