Adaptive language modeling using minimum discriminant estimation

S. Della Pietra,V. Della Pietra,Robert L. Mercer,Salim Roukos

Adaptive language modeling using minimum discriminant estimation

1992

S. Della Pietra
V. Della Pietra
Robert L. Mercer
Salim Roukos

We present an algorithm to adapt a n-gram language model to a document as it is dictated. The observed partial document is used to estimate a unigram distribution for the words that already occurred. Then, we find the closest n-gram distribution to the static n-gram distribution (using the discrimination information distance measure) and that satisfies the marginal constraints derived from the document. The resulting minimum discrimination information model results in a perplexity of 208 instead of 290 for the static trigram model on a document of 321 words.

Keywords:

Natural language processing
Trigram
Speech recognition
Satisfiability
Perplexity
Information distance
Computer science
Information model
Machine learning
Language model
Pattern recognition
Artificial intelligence
Discriminant

Correction
Source
Cite
Save
Machine Reading By IdeaReader

References

Citations