Online Sequence-to-Sequence Active Learning for Open-Domain Dialogue Generation.

Nabiha Asghar,Pascal Poupart,Xin Jiang,Hang Li

Online Sequence-to-Sequence Active Learning for Open-Domain Dialogue Generation.

2016

Nabiha Asghar
Pascal Poupart
Xin Jiang
Hang Li

We propose an online, end-to-end, neural generative conversational model for open-domain dialog. It is trained using a unique combination of offline two-phase supervised learning and online human-in-the-loop active learning. While most existing research proposes offline supervision or hand-crafted reward functions for online reinforcement, we devise a novel interactive learning mechanism based on a diversity-promoting heuristic for response generation and one-character user-feedback at each step. Experiments show that our model inherently promotes the generation of meaningful, relevant and interesting responses, and can be used to train agents with customized personas, moods and conversational styles.

Keywords:

Machine learning
Supervised learning
Generative grammar
Natural language processing
Dialog box
Persona
Active learning
Artificial intelligence
Heuristic
Computer science
Interactive Learning
Reinforcement
open domain

Correction
Source
Cite
Save
Machine Reading By IdeaReader

References

Citations