R-max - a general polynomial time algorithm for near-optimal reinforcement learning

Ronen I. Brafman, Moshe Tennenholtz

2003

R-MAX is a very simple model-based reinforcement learning algorithm which can attain near-optimal average reward in polynomial time. In R-MAX, the agent always maintains a complete, but possibly in...
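As an illustration of the model-based idea summarized in the abstract, here is a minimal sketch of an R-MAX-style agent for a small finite MDP. It is a simplification under stated assumptions, not the paper's procedure: it plans with discounted value iteration rather than the average-reward criterion the abstract mentions, and the class name `RMaxAgent`, the visit threshold `m`, the discount `gamma`, and the sweep count `sweeps` are all hypothetical choices made for this example. The core idea it shows is optimism: any state-action pair tried fewer than `m` times is valued as if it paid the maximum reward forever.

```python
from collections import defaultdict


class RMaxAgent:
    """Simplified R-MAX-style agent for a finite MDP (illustrative only)."""

    def __init__(self, n_states, n_actions, r_max, m=5, gamma=0.95, sweeps=200):
        self.nS, self.nA = n_states, n_actions
        self.r_max, self.m, self.gamma, self.sweeps = r_max, m, gamma, sweeps
        self.counts = defaultdict(int)        # visits per (s, a)
        self.reward_sum = defaultdict(float)  # summed reward per (s, a)
        self.next_counts = defaultdict(int)   # visits per (s, a, s')
        # Optimistic start: every pair is worth r_max at every future step.
        self.q = [[r_max / (1 - gamma)] * n_actions for _ in range(n_states)]

    def known(self, s, a):
        """A pair is 'known' once it has been tried at least m times."""
        return self.counts[(s, a)] >= self.m

    def act(self, s):
        """Act greedily with respect to the optimistic model's Q-values."""
        return max(range(self.nA), key=lambda a: self.q[s][a])

    def update(self, s, a, r, s_next):
        """Record one transition; replan when a pair first becomes known."""
        if self.known(s, a):
            return  # statistics for known pairs are frozen in this sketch
        self.counts[(s, a)] += 1
        self.reward_sum[(s, a)] += r
        self.next_counts[(s, a, s_next)] += 1
        if self.known(s, a):
            self._plan()

    def _plan(self):
        """Value iteration on the model (assumption: discounted criterion)."""
        v_opt = self.r_max / (1 - self.gamma)
        for _ in range(self.sweeps):
            for s in range(self.nS):
                for a in range(self.nA):
                    if not self.known(s, a):
                        self.q[s][a] = v_opt  # unknown pairs stay optimistic
                        continue
                    n = self.counts[(s, a)]
                    r_hat = self.reward_sum[(s, a)] / n
                    exp_v = sum(
                        self.next_counts[(s, a, s2)] / n * max(self.q[s2])
                        for s2 in range(self.nS)
                    )
                    self.q[s][a] = r_hat + self.gamma * exp_v
```

A typical loop would call `a = agent.act(s)`, step the environment, then call `agent.update(s, a, r, s_next)`. Because unknown pairs look maximally rewarding, the greedy policy is drawn toward them until they have been sampled `m` times, so exploration needs no separate mechanism.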

Keywords:

reinforcement learning algorithm
Artificial intelligence
Reinforcement learning
Markov decision process
general polynomial
Mathematics
Time complexity
Machine learning
