Entropy-SGD: Biasing gradient descent into wide valleys. International Conference on Learning Representations (ICLR) 2017

2019 
    • Correction
    • Source
    • Cite
    • Save
    • Machine Reading By IdeaReader
    0
    References
    2
    Citations
    NaN
    KQI
    []