Hybrid MELP/CELP coding at bit rates from 6.4 to 2.4 kb/s

2003 
This paper describes extensions of the 4 kb/s hybrid MELP/CELP coder, up to 6.4 kb/s and down to 2.4 kb/s. The baseline 4 kb/s coder uses three coding modes: MELP in strongly voiced speech frames, CELP with pitch prediction in weakly voiced frames, and CELP with stochastic excitation in unvoiced frames. To minimize switching artifacts between parametric MELP and waveform CELP coding, an alignment phase is encoded in MELP and zero-phase equalization is applied to the CELP target signal. The 6.4 kb/s extension uses the same three modes as the 4 kb/s coder, with improved MELP and CELP coders. The 2.4 kb/s extension uses only two modes: MELP for voiced frames and CELP synthesis with random excitation for unvoiced frames. The alignment phase is encoded in MELP frames for all bit rates so that time synchrony with input speech is always maintained. Alignment phase and zero-phase equalization enable smooth switching between coders at different bit rates. The hybrid MELP/CELP coding structure leads to coders that perform better at a given bit rate than MELP or CELP separately, and better than or equivalent to higher bit-rate ITU standards. Formal subjective tests show that for all-but-one tested conditions, the 6.4 kb/s hybrid coder is better than 8 kb/s G.729 and the 2.4 kb/s coder is equivalent to, or better than, 6.4 kb/s G.729 Annex D.
    • Correction
    • Source
    • Cite
    • Save
    • Machine Reading By IdeaReader
    11
    References
    1
    Citations
    NaN
    KQI
    []