Introducing compact: An oscillator-based approach to toll-quality speech coding at low bit rates

2010 
In this paper, we introduce an improved oscillator model we term the Complete Oscillator Model (COM). A significant advantage of the COM over classical oscillators such as the Self Excited Vocoder is that it is not restricted to modeling only certain larger-scale patterns in the source sequence. Here, we develop a speech coding system based on the proposed COM. In this system, the COM is used in combination with a linear predictor, the Pulsed Autoregressive CompensaTor (PACT), to develop a novel, oscillator-based approach to toll-quality speech coding at low bit rates. Unlike the linear prediction-based models utilized in modern speech coders, oscillators do not depend on an estimate of the residual error to regenerate the signal. As such, the residual is encoded only for select frames, providing a potential improvement in coding efficiency. An implementation of the hybrid COM/PACT system, which we call COMPACT, is described and is shown to provide both perceptual quality and bit rate that are competitive with mature standards such as G.729 and AMR. The given implementation is demonstrated to produce toll-quality speech, as measured by PESQ-MOS, at 9.77 kbps. Future tuning of this implementation is expected to improve performance to where it could exceed the current state of the art.
    • Correction
    • Source
    • Cite
    • Save
    • Machine Reading By IdeaReader
    6
    References
    1
    Citations
    NaN
    KQI
    []