An Empirical Study of Actor-Critic Methods for Feedback Controllers of Ball-Screw Drivers

2013 
In this paper we study the use of Reinforcement Learning Actor-Critic methods to learn the control of a ball-screw feed drive. We have tested three different actors: a Q-value-based actor, a Policy Gradient actor, and a CACLA actor. We have paid special attention to the sensitivity to suboptimal tuning of the learning gains. As a benchmark, we have used randomly initialized PID controllers. CACLA provides a stable control comparable to the best heuristically tuned PID controller, despite having no access to the actual error value.
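The CACLA (Continuous Actor-Critic Learning Automaton) actor named in the abstract is characterized by updating the actor only when the temporal-difference error is positive, i.e. when an explored action turned out better than expected. The sketch below illustrates this update rule on a toy first-order plant; the plant model, feature map, learning rates, and noise level are all illustrative assumptions, not the paper's actual ball-screw setup.

```python
import numpy as np

rng = np.random.default_rng(0)

def plant_step(x, u):
    # Hypothetical stable first-order plant (illustration only,
    # not the ball-screw drive dynamics used in the paper).
    return 0.9 * x + 0.1 * u

def features(x):
    # Linear features: state plus a bias term.
    return np.array([x, 1.0])

v_w = np.zeros(2)              # critic weights: V(x) ~ v_w . phi(x)
a_w = np.zeros(2)              # actor weights:  u(x) ~ a_w . phi(x)
alpha_v, alpha_a = 0.1, 0.05   # learning gains (arbitrary choices)
gamma, sigma = 0.95, 0.5       # discount factor, exploration noise

x, ref = 0.0, 1.0              # start at 0, track reference 1
errors = []
for step in range(2000):
    phi = features(x)
    u = a_w @ phi + rng.normal(0.0, sigma)   # Gaussian exploration
    x_next = plant_step(x, u)
    r = -(ref - x_next) ** 2                 # negative squared tracking error
    # TD(0) error drives both critic and actor updates.
    delta = r + gamma * (v_w @ features(x_next)) - v_w @ phi
    v_w += alpha_v * delta * phi             # critic: standard TD(0)
    if delta > 0:
        # CACLA rule: pull the actor toward the explored action
        # only when it improved on the critic's prediction.
        a_w += alpha_a * (u - a_w @ phi) * phi
    x = x_next
    errors.append(abs(ref - x))
```

The key design choice, relative to a plain policy-gradient actor, is the sign test on `delta`: the magnitude of the TD error does not scale the actor step, only its sign gates it, which is part of why CACLA is reported to be less sensitive to learning-gain tuning.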