Imitation with Neural Density Models

Kuno Kim,Akshat Jindal,Yang Song,Jiaming Song,Yanan Sui,Stefano Ermon

Imitation with Neural Density Models

2021

Kuno Kim
Akshat Jindal
Yang Song
Jiaming Song
Yanan Sui
Stefano Ermon

We propose a new framework for Imitation Learning (IL) via density estimation of the expert's occupancy measure followed by Maximum Occupancy Entropy Reinforcement Learning (RL) using the density as a reward. Our approach maximizes a non-adversarial model-free RL objective that provably lower bounds reverse Kullback–Leibler divergence between occupancy measures of the expert and imitator. We present a practical IL algorithm, Neural Density Imitation (NDI), which obtains state-of-the-art demonstration efficiency on benchmark control tasks.

Keywords:

Imitation
Measure (mathematics)
Artificial intelligence
Density estimation
Reinforcement learning
Divergence (statistics)
Entropy (information theory)
Occupancy
Computer science
Benchmark (computing)

Correction
Source
Cite
Save
Machine Reading By IdeaReader

References

Citations