Old Web
English
Sign In
Acemap
>
Paper
>
Reducing Activation Recomputation in Large Transformer Models.
Reducing Activation Recomputation in Large Transformer Models.
2022
Vijay Korthikanti
Jared Casper
Sangkug Lym
Lawrence McAfee
Michael Andersch
Mohammad Shoeybi
Bryan Catanzaro
Correction
Cite
Save
Machine Reading By IdeaReader
0
References
0
Citations
NaN
KQI
[]