Learning Semantic Textual Similarity from Conversations

Yinfei Yang,Steve Yuan,Daniel Cer,Sheng-yi Kong,Noah Constant,Petr Pilař,Heming Ge,Yun-hsuan Sung,Brian Strope,Ray Kurzweil

Learning Semantic Textual Similarity from Conversations

2018

Yinfei Yang
Steve Yuan
Daniel Cer
Sheng-yi Kong
Noah Constant
Petr Pilař
Heming Ge
Yun-hsuan Sung
Brian Strope
Ray Kurzweil

We present a novel approach to learn representations for sentence-level semantic similarity using conversational data. Our method trains an unsupervised model to predict conversational responses. The resulting sentence embeddings perform well on the Semantic Textual Similarity (STS) Benchmark and SemEval 2017’s Community Question Answering (CQA) question similarity subtask. Performance is further improved by introducing multitask training, combining conversational response prediction and natural language inference. Extensive experiments show the proposed model achieves the best performance among all neural models on the STS Benchmark and is competitive with the state-of-the-art feature engineered and mixed systems for both tasks.

Keywords:

Natural language processing
Artificial intelligence
Computer science

Correction
Source
Cite
Save
Machine Reading By IdeaReader

References

104

Citations