Provably Efficient Third-Person Imitation from Offline Observation.

Aaron Zweig,Joan Bruna

Provably Efficient Third-Person Imitation from Offline Observation.

2020

Aaron Zweig
Joan Bruna

Domain adaptation in imitation learning represents an essential step towards improving generalizability. However, even in the restricted setting of third-person imitation where transfer is between isomorphic Markov Decision Processes, there are no strong guarantees on the performance of transferred policies. We present problem-dependent, statistical learning guarantees for third-person imitation from observation in an offline setting, and a lower bound on performance in the online setting.

Keywords:

imitation learning
Isomorphism
Upper and lower bounds
Markov decision process
Machine learning
Generalizability theory
Domain adaptation
statistical learning
Mathematics
Artificial intelligence
Imitation
online setting
third person

Correction
Source
Cite
Save
Machine Reading By IdeaReader

References

Citations