Compound words in large-vocabulary German speech recognition systems

1996 
The paper analyzes the impact of German compound words on speech recognition. It is well known that due to an idiosyncrasy of German orthography, compound words make up a major fraction of German vocabulary, and most out-of-vocabulary (OOV) compounds are composed of frequent words already in the lexicon. The paper introduces a new method for handling the components of compounds rather than the compounds themselves. This not only reduces the vocabulary, and therefore the complexity, but also improves word accuracy, and reduced complexity means a more robust language model.
    • Correction
    • Source
    • Cite
    • Save
    • Machine Reading By IdeaReader
    4
    References
    44
    Citations
    NaN
    KQI
    []