Leveraging VerbNet to build Corpus-Specific Verb Clusters

Daniel W. Peterson,Jordan L. Boyd-Graber,Martha Palmer,Daisuke Kawahara

Leveraging VerbNet to build Corpus-Specific Verb Clusters

2016

Daniel W. Peterson
Jordan L. Boyd-Graber
Martha Palmer
Daisuke Kawahara

In this paper, we aim to close the gap from extensive, human-built semantic resources and corpus-driven unsupervised models. The particular resource explored here is VerbNet, whose organizing principle is that semantics and syntax are linked. To capture patterns of usage that can augment knowledge resources like VerbNet, we expand a Dirichlet process mixture model to predict a VerbNet class for each sense of each verb, allowing us to incorporate annotated VerbNet data to guide the clustering process. The resulting clusters align more closely to hand-curated syntactic/semantic groupings than any previous models, and can be adapted to new domains since they require only corpus counts.

Keywords:

Mixture model
VerbNet
Natural language processing
Cluster analysis
Syntax
Verb
Semantics
Machine learning
Organizing principle
Dirichlet process
Artificial intelligence
Computer science
Cluster (physics)
dirichlet process mixture model

Correction
Source
Cite
Save
Machine Reading By IdeaReader

References

Citations