Old Web
English
Sign In
Acemap
>
Paper
>
Analysis and Improvement of Policy Gradient Estimation
Analysis and Improvement of Policy Gradient Estimation
2011
Zhao TingTing
Hachiya Hirotaka
Niu Gang
Sugiyama Masashi
Keywords:
Welfare economics
Management science
Reinforcement learning
Economics
Mathematical optimization
gradient estimation
Correction
Source
Cite
Save
Machine Reading By IdeaReader
0
References
13
Citations
NaN
KQI
[]