Adaptive Variance for Changing Sparse-Reward Environments.

2019 
Robots that are trained to perform a task in a fixed environment often fail when facing unexpected changes to the environment due to a lack of exploration. We propose a principled way to adapt the policy for better exploration in changing sparse-reward environments. Unlike previous works, which explicitly model environmental changes, we analyze the relationship between the value function and the optimal exploration for a Gaussian-parameterized policy. We show that this analysis leads to an effective strategy for adjusting the variance of the policy, enabling fast adaptation to changes in a variety of sparse-reward environments.
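The abstract does not spell out the exact adjustment rule, so the following is only an illustrative sketch of the general idea: a Gaussian-parameterized policy whose exploration variance is raised when the value estimate drops (as it would after an unmodeled environment change) and lowered again as the value recovers. All names here (GaussianPolicy, adapt_std, sigma_min, sigma_max, v_max) are assumptions, not the paper's method.

```python
# Illustrative sketch only: the value-based variance adjustment below and all
# names are assumptions inferred from the abstract, not the paper's algorithm.
import numpy as np


class GaussianPolicy:
    """Linear Gaussian policy a ~ N(W s + b, sigma^2 I) with an adjustable std."""

    def __init__(self, state_dim, action_dim, sigma=0.1):
        self.W = np.zeros((action_dim, state_dim))
        self.b = np.zeros(action_dim)
        self.sigma = sigma

    def act(self, state, rng):
        mean = self.W @ state + self.b
        return mean + self.sigma * rng.standard_normal(mean.shape)


def adapt_std(value_estimate, v_max, sigma_min=0.05, sigma_max=1.0):
    """Map a low value estimate (possible environment change) to a larger
    exploration std, and a high value estimate to a smaller one."""
    ratio = np.clip(value_estimate / max(v_max, 1e-8), 0.0, 1.0)
    return sigma_max - (sigma_max - sigma_min) * ratio


# Usage: after the environment changes, the critic's value estimate drops,
# so the policy's std is pushed back up and exploration resumes.
rng = np.random.default_rng(0)
policy = GaussianPolicy(state_dim=4, action_dim=2)
policy.sigma = adapt_std(value_estimate=0.2, v_max=1.0)  # low value -> larger sigma
action = policy.act(np.ones(4), rng)
```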