Linear acoustic echo cancellation using deep neural networks and convex reconstruction of incomplete transfer function

2017 
Linear acoustic path estimation for acoustic echo cancellation is difficult during periods where the near-end signal (speech) is active. In this paper, we assume that the impulse response is sparse. There are many algorithms that solve the problem of estimating sparse impulse response in the time domain. In this paper, we propose algorithms working in the time-frequency domain. In our approach, it is assumed that the respective transfer function can be estimated only for those frequencies where the near-end signal is not active. First, a deep neural network trained on mixed signals is used to detect the activity of the near-end signal. In frequencies where no activity is detected, the acoustic transfer function is estimated using conventional frequency domain least squares. This results in an incomplete transfer function (ITF) estimate. The completion is done through finding the sparsest representation of the ITF in the time domain. This can be done adaptively using the soft-threshold function, which is applied in the time domain. To achieve improved accuracy, oversampling can be used.
    • Correction
    • Source
    • Cite
    • Save
    • Machine Reading By IdeaReader
    12
    References
    3
    Citations
    NaN
    KQI
    []