Scenario co-evolution for reinforcement learning on a grid world smart factory domain

Thomas Gabor,Andreas Sedlmeier,Marie Kiermeier,Thomy Phan,Marcel Henrich,Monika Pichlmair,Bernhard Kempter,Cornel Klein,Horst Sauer,Reiner SchmidSiemens Ag,Jan Wieghardt

Scenario co-evolution for reinforcement learning on a grid world smart factory domain

2019

Thomas Gabor
Andreas Sedlmeier
Marie Kiermeier
Thomy Phan
Marcel Henrich
Monika Pichlmair
Bernhard Kempter
Cornel Klein
Horst Sauer
Reiner SchmidSiemens Ag
Jan Wieghardt

Adversarial learning has been established as a successful paradigm in reinforcement learning. We propose a hybrid adversarial learner where a reinforcement learning agent tries to solve a problem while an evolutionary algorithm tries to find problem instances that are hard to solve for the current expertise of the agent, causing the intelligent agent to co-evolve with a set of test instances or scenarios. We apply this setup, called scenario co-evolution, to a simulated smart factory problem that combines task scheduling with navigation of a grid world. We show that the so trained agent outperforms conventional reinforcement learning. We also show that the scenarios evolved this way can provide useful test cases for the evaluation of any (however trained) agent.

Keywords:

Grid
Machine learning
Industrial engineering
Artificial intelligence
Reinforcement learning
Computer science
Factory
Intelligent agent
Test case
Evolutionary algorithm
smart factory
Scheduling (computing)
Adversarial system

Correction
Source
Cite
Save
Machine Reading By IdeaReader

References

Citations