Stochastic Cubic Regularization for Fast Nonconvex Optimization

Nilesh Tripuraneni,Mitchell Stern,Chi Jin,Jeffrey Regier,Michael I. Jordan

Stochastic Cubic Regularization for Fast Nonconvex Optimization

2017

Nilesh Tripuraneni
Mitchell Stern
Chi Jin
Jeffrey Regier
Michael I. Jordan

This paper proposes a stochastic variant of a classic algorithm---the cubic-regularized Newton method [Nesterov and Polyak 2006]. The proposed algorithm efficiently escapes saddle points and finds approximate local minima for general smooth, nonconvex functions in only $\mathcal{\tilde{O}}(\epsilon^{-3.5})$ stochastic gradient and stochastic Hessian-vector product evaluations. The latter can be computed as efficiently as stochastic gradients. This improves upon the $\mathcal{\tilde{O}}(\epsilon^{-4})$ rate of stochastic gradient descent. Our rate matches the best-known result for finding local minima without requiring any delicate acceleration or variance-reduction techniques.

Keywords:

Applied mathematics
Acceleration
Saddle point
Stochastic gradient descent
Mathematical optimization
Newton's method
Regularization (mathematics)
Maxima and minima
Mathematics

Correction
Cite
Save
Machine Reading By IdeaReader

References

Citations