Non-Uniform Stochastic Average Gradient Method for Training Conditional Random Fields

Mark W. Schmidt,Reza Babanezhad,Mohamed Osama Ahmed,Aaron Defazio,Ann Clifton,Anoop Sarkar

Non-Uniform Stochastic Average Gradient Method for Training Conditional Random Fields

2015

Mark W. Schmidt
Reza Babanezhad
Mohamed Osama Ahmed
Aaron Defazio
Ann Clifton
Anoop Sarkar

We apply stochastic average gradient (SAG) algorithms for training conditional random elds (CRFs). We describe a practical implementation that uses structure in the CRF gradient to reduce the memory requirement of this linearly-convergent stochastic gradient method, propose a non-uniform sampling scheme that substantially improves practical performance, and analyze the rate of convergence of the SAGA variant under nonuniform sampling. Our experimental results reveal that our method signicantly outperforms existing methods in terms of the training objective, and performs as well or better than optimally-tuned stochastic gradient methods in terms of test error.

Keywords:

Rate of convergence
Machine learning
Mathematics
Mathematical optimization
Conditional random field
Gradient method
Artificial intelligence
CRFS
Stochastic optimization
Nonuniform sampling
Statistics
Sampling (statistics)
sampling scheme
Computer science
stochastic gradient method

Correction
Source
Cite
Save
Machine Reading By IdeaReader

References

Citations