Phase modelling of speech excitation for low bit-rate sinusoidal transform coding

Xiaoqin Sun, Fabrice Plante, Barry M. G. Cheetham, Kenneth W. T. Wong

1997

Sinusoidal transform coding (STC) techniques model speech as a sum of sine waves whose frequencies, amplitudes and phases are specified at regular intervals. To achieve a low bit-rate representation, only the spectral envelope is encoded, and the phases are regenerated at the decoder according to a minimum-phase assumption. This paper demonstrates the inaccuracy of the minimum-phase model. It shows that the phase spectra of decoded speech segments may be corrected using either the parameters of a Rosenberg pulse model or a second-order all-pass filter. Experiments show that applying this correction increases phase accuracy and improves speech quality.
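The decoder-side idea in the abstract, regenerating phases from the spectral envelope alone and then summing sine waves, can be illustrated with a minimal sketch. This is not the authors' implementation: it assumes the standard cepstral (Hilbert-transform) construction of a minimum-phase response from a magnitude envelope, and the function names `minimum_phase_from_magnitude` and `synthesize_frame`, the FFT size, and the parameter layout are hypothetical choices made for illustration. The Rosenberg-pulse and all-pass phase corrections proposed in the paper are not shown.

```python
import numpy as np


def minimum_phase_from_magnitude(mag):
    """Return the minimum-phase phase (radians) implied by a magnitude envelope.

    `mag` holds magnitude samples on a full, even-length FFT grid
    (conjugate-symmetric layout, as produced by np.abs(np.fft.fft(x))).
    Assumption: the standard real-cepstrum folding method is used.
    """
    mag = np.asarray(mag, dtype=float)
    n = len(mag)
    # Real cepstrum of the log-magnitude envelope (floor avoids log(0)).
    cep = np.fft.ifft(np.log(np.maximum(mag, 1e-12))).real
    # Fold: keep quefrency 0 and Nyquist, double the positive quefrencies,
    # zero the negative ones. This yields the cepstrum of a causal,
    # minimum-phase system with the same magnitude.
    folded = np.zeros(n)
    folded[0] = cep[0]
    folded[1:n // 2] = 2.0 * cep[1:n // 2]
    folded[n // 2] = cep[n // 2]
    # FFT of the folded cepstrum is log|H| + j*phase; keep the phase.
    return np.fft.fft(folded).imag


def synthesize_frame(amps, freqs_hz, phases, num_samples, fs):
    """Sum of sine waves: s[n] = sum_k A_k * cos(2*pi*f_k*n/fs + phi_k)."""
    amps = np.asarray(amps, dtype=float)[:, None]
    freqs_hz = np.asarray(freqs_hz, dtype=float)[:, None]
    phases = np.asarray(phases, dtype=float)[:, None]
    n = np.arange(num_samples)
    return np.sum(amps * np.cos(2.0 * np.pi * freqs_hz * n / fs + phases), axis=0)


if __name__ == "__main__":
    # Illustrative values only: a one-pole envelope and a 100 Hz pitch.
    fs, n_fft, f0 = 8000.0, 512, 100.0
    w = 2.0 * np.pi * np.arange(n_fft) / n_fft
    envelope = np.abs(1.0 / (1.0 - 0.9 * np.exp(-1j * w)))
    phase = minimum_phase_from_magnitude(envelope)
    # Sample the envelope and regenerated phase at the pitch harmonics.
    harmonics = f0 * np.arange(1, int((fs / 2) // f0))
    bins = np.round(harmonics / fs * n_fft).astype(int)
    frame = synthesize_frame(envelope[bins], harmonics, phase[bins], 160, fs)
```

In this sketch only the envelope would need to be transmitted; the phases are derived from it at the decoder. That is the assumption whose inaccuracy the paper demonstrates for the speech excitation.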

Keywords:

Decoding methods
Speech processing
Spectral envelope
Mathematical optimization
Speech recognition
Transform coding
Speech synthesis
Speech coding
Amplitude
Pulse (signal processing)
Control theory
Computer science
Minimum phase
Pattern recognition
Artificial intelligence
Algorithm
electronic mail
