A Parallel SGD method with Strong Convergence

2013 
This paper proposes a novel parallel stochastic gradient descent (SGD) method: in each iteration of a batch descent method, the descent direction is found by running parallel sets of SGD iterations, each set operating on one node using only the data residing on that node. The method has strong convergence properties. Experiments on datasets with high-dimensional feature spaces show the value of this method.
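The abstract describes the scheme only at a high level. Below is a minimal single-process sketch of the idea under stated assumptions: nodes are simulated as data shards, the loss is logistic regression, the per-node directions are plainly averaged, and the step size is fixed. None of these choices are taken from the paper itself, which may combine directions and choose steps differently.

```python
import numpy as np

def logistic_loss_grad(w, X, y):
    """Gradient of the averaged logistic loss on (X, y); labels y in {-1, +1}."""
    z = -y * (X @ w)
    s = 1.0 / (1.0 + np.exp(-z))          # sigmoid(-y * Xw)
    return -(X.T @ (y * s)) / len(y)

def local_sgd_direction(w, X, y, lr=0.1, epochs=1, rng=None):
    """Run SGD on one node's local data, starting from the shared iterate w;
    return the resulting displacement as this node's proposed direction."""
    rng = rng or np.random.default_rng(0)
    w_local = w.copy()
    for _ in range(epochs):
        for i in rng.permutation(len(y)):
            g = logistic_loss_grad(w_local, X[i:i+1], y[i:i+1])
            w_local -= lr * g
    return w_local - w

def parallel_sgd_batch_descent(shards, dim, outer_iters=20, step=1.0):
    """Outer batch-descent loop: each simulated node computes an SGD-based
    direction from the current iterate; the directions are averaged and a
    fixed step is taken along the average (a line search could pick `step`)."""
    w = np.zeros(dim)
    for t in range(outer_iters):
        dirs = [local_sgd_direction(w, X, y, rng=np.random.default_rng(t * 31 + k))
                for k, (X, y) in enumerate(shards)]
        w = w + step * np.mean(dirs, axis=0)
    return w

if __name__ == "__main__":
    rng = np.random.default_rng(42)
    w_true = rng.normal(size=50)
    def make_shard(n):
        X = rng.normal(size=(n, 50))
        y = np.sign(X @ w_true + 0.1 * rng.normal(size=n))
        return X, y
    shards = [make_shard(200) for _ in range(4)]   # 4 simulated nodes
    w = parallel_sgd_batch_descent(shards, dim=50)
    acc = np.mean([np.mean(np.sign(X @ w) == y) for X, y in shards])
    print(f"mean training accuracy across shards: {acc:.3f}")
```

The appeal of this structure is that each node performs many cheap local SGD steps between synchronizations, so communication happens only once per outer iteration rather than once per gradient step.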