Combining standard and throat microphones for robust speech recognition

Martin Graciarena,Horacio Franco,Kemal Sonmez,Harry Bratt

Combining standard and throat microphones for robust speech recognition

2003

Martin Graciarena
Horacio Franco
Kemal Sonmez
Harry Bratt

We present a method to combine the standard and throat microphone signals for robust speech recognition in noisy environments. Our approach is to use the probabilistic optimum filter (POF) mapping algorithm to estimate the standard microphone clean-speech feature vectors, used by standard speech recognizers, from both microphones' noisy-speech feature vectors. A small untranscribed "stereo" database (noisy and clean simultaneous recordings) is required to train the POF mappings. In continuous-speech recognition experiments using SRI International's DECIPHER recognition system, both using artificially added noise and using recorded noisy speech, the combined-microphone approach significantly outperforms the single-microphone approach.

Keywords:

Artificial intelligence
Mathematics
Speech recognition
Pattern recognition
Feature vector
Probabilistic logic
Throat microphone
Feature extraction
Microphone
DECIPHER
mapping algorithm
recognition system

Correction
Source
Cite
Save
Machine Reading By IdeaReader

References

Citations