Fitted Value Iteration in Continuous MDPs With State Dependent Action Sets

Hao Li,Shiping Shao,Abhishek Gupta

Fitted Value Iteration in Continuous MDPs With State Dependent Action Sets

2022

Hao Li
Shiping Shao
Abhishek Gupta

In this letter, we establish the convergence of fitted value iteration and fitted Q-value iteration for continuous-state continuous-action Markov decision problems (MDPs) with state-dependent action sets. We further extend the algorithm and the convergence result to the case of monotone MDPs.

Keywords:

Applied mathematics
Decision problem
Probabilistic logic
Markov chain
Approximation algorithm
Convergence (routing)
Monotone polygon
Mathematics
Markov decision process
Kernel (statistics)

Correction
Source
Cite
Save
Machine Reading By IdeaReader

References

Citations