Optimization of Energy Policies Using Direct Value Search

Jérémie Decock,Jean-Joseph Christophe,Olivier Teytaud

Optimization of Energy Policies Using Direct Value Search

2014

Jérémie Decock
Jean-Joseph Christophe
Olivier Teytaud

Direct Policy Search is a widely used tool for reinforcement learning; however, it is usually not suitable for handling high-dimensional constrained action spaces such as those arising in power system control (unit commitmen problems). We propose Direct Value Search, an hybridization of DPS with Bellman decomposition techniques. We prove runtime properties, and apply the results to an energy management problem.

Keywords:

Energy policy
Energy management
Artificial neural network
Reinforcement learning
Electric power system
Mathematical optimization
Engineering

Correction
Source
Cite
Save
Machine Reading By IdeaReader

References

Citations