Efficient Speech Transcription Through Respeaking

Matthias Sperber, Graham Neubig, Christian Fügen, Satoshi Nakamura, Alex Waibel

2013

We propose a method for efficient off-line speech transcription through respeaking. Speech is segmented into smaller utterances using an initial automatic transcript. Respeaking is performed segment by segment, while confidence filtering helps save supervision effort. We conduct detailed experiments comparing speaking vs. typing, sequential vs. confidence-ordered supervision, and examine the effect of the respeaking word error rate on correction efficiency. Our results demonstrate that the proposed method can not only outperform typing in terms of correction efficiency, but is also much less demanding for the respeakers than traditional respeaking methods, consequently helping to keep costs down.

Keywords:

Filter (signal processing)
Speech recognition
Pattern recognition
Artificial intelligence
Typing
Computer science
Word error rate
Speech transcription
Natural language processing
