Lexicalized Stochastic Modeling of Constraint-Based Grammars using Log-Linear Measures and EM Training

Stefan Riezler,Detlef Prescher,Jonas Kuhn,Mark Johnson

Lexicalized Stochastic Modeling of Constraint-Based Grammars using Log-Linear Measures and EM Training

2000

Stefan Riezler
Detlef Prescher
Jonas Kuhn
Mark Johnson

We present a new approach to stochastic modeling of constraint-based grammars that is based on log-linear models and uses EM for estimation from unannotated data. The techniques are applied to an LFG grammar for German. Evaluation on an exact match task yields 86% precision for an ambiguity rate of 5.4, and 90% precision on a subcat frame match for an ambiguity rate of 25. Experimental comparison to training from a parsebank shows a 10% gain from EM training. Also, a new class-based grammar lexicalization is presented, showing a 10% gain over unlexicalized models.

Keywords:

Natural language processing
Artificial intelligence
Log-linear model
Ambiguity
Grammar
Computer science
German
Lexicalization
Rule-based machine translation
exact match

Correction
Source
Cite
Save
Machine Reading By IdeaReader

References

Citations