An Evaluation Framework Based on Gold Standard Models for Definition Question Answering

Samir Kanaan,Jordi Turmo

An Evaluation Framework Based on Gold Standard Models for Definition Question Answering

2004

Samir Kanaan
Jordi Turmo

This paper presents a weak supervised evaluation framework for definition question answering (DefQA) called Solon. It automatically evaluates a set of DefQA systems using existing human definitions as gold standard models. This allows the framework to overcome known limitations of the evaluation methods in the state of the art with the advantage that it is less supervised. In addition, Solon adapts its configuration for each specific DefQA task, thus rendering a good evaluation procedure. The results obtained in our experiments show that Solon is able to detect the best systems and to score them accordingly, with state of the art performance.

Keywords:

Question answering
Rendering (computer graphics)
Data mining
Computer science
Gold standard
Information retrieval
evaluation methods

Correction
Cite
Save
Machine Reading By IdeaReader

References

Citations