Building a Large-scale Persona Dialog Dataset

Yinhe Zheng, G Chen, MinLie Huang

2018

We propose a preliminary version of a large-scale multi-turn dialogue dataset in Chinese that contains over 25 million dialogue sessions crawled from Weibo. Diverse personality traits are collected for each dialogue participant to facilitate persona modelling in dialogues. Our dataset fills a gap in the resources available for building personalised dialogue systems for open-domain conversation and can also serve as an important resource for a wide range of studies.

Keywords:

World Wide Web
Persona
Dialog box
Big Five personality traits
Blank
Computer science
