A Semi-automated Evaluation Metric for Dialogue Model Coherence

Sudeep Gandhe, David R. Traum

2016

We propose a new metric, Voted Appropriateness, which can be used to automatically evaluate dialogue policy decisions once some wizard data has been collected. We show that this metric outperforms a previously proposed metric, Weak Agreement. We also present a taxonomy of dialogue model evaluation schemas and situate our new metric within it.

Keywords:

Schema (psychology)
Wizard
Data mining
Mathematics
Coherence (physics)
Virtual actor
Artificial intelligence
context evaluation
policy decision
Information retrieval
