Old Web
English
Sign In
Acemap
>
authorDetail
>
Iurii Kemaev
Iurii Kemaev
Google
Computer science
Reinforcement learning
Grid
improved performance
Linear combination
4
Papers
3
Citations
0
KQI
Citation Trend
Filter By
Interval:
1900~2024
1900
2024
Author
Papers (4)
Sort By
Default
Most Recent
Most Early
Most Citation
No data
Journal
Conference
Others
Discovering a set of policies for the worst case reward
2021
arXiv: Artificial Intelligence
Tom Zahavy
André Barreto
Daniel J. Mankowitz
Shaobo Hou
Brendan ODonoghue
Iurii Kemaev
Satinder Singh
Show All
Source
Cite
Save
Citations (0)
Discovery of Options via Meta-Learned Subgoals
2021
NeurIPS | Neural Information Processing Systems
Vivek Veeriah
Tom Zahavy
Matteo Hessel
Zhongwen Xu
Junhyuk Oh
Iurii Kemaev
Hado van Hasselt
David Silver
Satinder Singh
Show All
Source
Cite
Save
Citations (0)
Discovering a set of policies for the worst case reward
2021
ICLR | International Conference on Learning Representations
Tom Zahavy
André Barreto
Daniel J. Mankowitz
Shaobo Hou
Brendan ODonoghue
Iurii Kemaev
Satinder Singh
Show All
Source
Cite
Save
Citations (3)
Discovery of Options via Meta-Learned Subgoals
2021
| Annual Conference on Neural Information Processing Systems
Vivek Veeriah
Tom Zahavy
Matteo Hessel
Zhongwen Xu
Junhyuk Oh
Iurii Kemaev
Hado P. van Hasselt
David Silver
Satinder Singh
Show All
Source
Cite
Save
Citations (0)
1