Provably Efficient Third-Person Imitation from Offline Observation

Aaron Zweig, Joan Bruna

2020

Domain adaptation in imitation learning represents an essential step towards improving generalizability. However, even in the restricted setting of third-person imitation where transfer is between isomorphic Markov Decision Processes, there are no strong guarantees on the performance of transferred policies. We present problem-dependent, statistical learning guarantees for third-person imitation from observation in an offline setting, and a lower bound on performance in the online setting.

Keywords: third person, cognitive psychology, imitation, computer science
