Continuous-time Model-based Reinforcement Learning

Çağatay Yıldız,Markus Heinonen,Harri Lähdesmäki

Continuous-time Model-based Reinforcement Learning

2021

Çağatay Yıldız
Markus Heinonen
Harri Lähdesmäki

Model-based reinforcement learning (MBRL) approaches rely on discrete-time state transition models whereas physical systems and the vast majority of control tasks operate in continuous-time. To avoid time-discretization approximation of the underlying process, we propose a continuous-time MBRL framework based on a novel actor-critic method. Our approach also infers the unknown state evolution differentials with Bayesian neural ordinary differential equations (ODE) to account for epistemic uncertainty. We implement and test our method on a new ODE-RL suite that explicitly solves continuous-time control systems. Our experiments illustrate that the model is robust against irregular and noisy data, is sample-efficient, and can solve control problems which pose challenges to discrete-time MBRL methods.

Keywords:

Process (engineering)
Reinforcement learning
Uncertainty quantification
Artificial intelligence
Ode
Ordinary differential equation
Physical system
Computer science
Control system
Bayesian probability

Correction
Source
Cite
Save
Machine Reading By IdeaReader

References

Citations