Old Web
English
Sign In
Acemap
>
Paper
>
Provably Good Batch Off-Policy Reinforcement Learning Without Great Exploration
Provably Good Batch Off-Policy Reinforcement Learning Without Great Exploration
2020
Yao Liu
Adith Swaminathan
Alekh Agarwal
Emma Brunskill
Keywords:
Reinforcement learning
Artificial intelligence
Computer science
Correction
Source
Cite
Save
Machine Reading By IdeaReader
0
References
11
Citations
NaN
KQI
[]