Reinforcement learning and adaptive optimization of a class of Markov jump systems with completely unknown dynamic information

Shuping He,Maoguang Zhang,Haiyang Fang,Fei Liu,Xiaoli Luan,Zhengtao Ding

Reinforcement learning and adaptive optimization of a class of Markov jump systems with completely unknown dynamic information

2019

Shuping He
Maoguang Zhang
Haiyang Fang
Fei Liu
Xiaoli Luan
Zhengtao Ding

In this paper, an online adaptive optimal control problem of a class of continuous-time Markov jump linear systems (MJLSs) is investigated by using a parallel reinforcement learning (RL) algorithm with completely unknown dynamics. Before collecting and learning the subsystems information of states and inputs, the exploration noise is firstly added to describe the actual control input. Then, a novel parallel RL algorithm is used to parallelly compute the corresponding N coupled algebraic Riccati equations by online learning. By this algorithm, we will not need to know the dynamic information of the MJLSs. The convergence of the proposed algorithm is also proved. Finally, the effectiveness and applicability of this novel algorithm is illustrated by two simulation examples.

Keywords:

Mathematical optimization
Mathematics
Optimal control
Convergence (routing)
Linear system
Adaptive optimization
Markov chain
Jump
Reinforcement learning
Algebraic number
Need to know
online learning
markov jump

Correction
Source
Cite
Save
Machine Reading By IdeaReader

References

Citations