Perceptually-motivated Environment-specific Speech Enhancement

2019 
This paper introduces a deep learning approach to enhance speech recordings made in a specific environment. A single neural network learns to ameliorate several types of recording artifacts, including noise, reverberation, and non-linear equalization. The method relies on a new perceptual loss function that combines adversarial loss with spectrogram features. Both subjective and objective evaluations show that the proposed approach improves on state-of-the-art baseline methods.
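The abstract describes the core technical idea: a perceptual loss that combines an adversarial term with a spectrogram-feature term. Below is a minimal sketch of such a combined loss, not the authors' implementation; the discriminator, STFT parameters, and weighting factor lambda_spec are all illustrative assumptions.

    # Sketch of an adversarial + spectrogram-feature loss (assumed details).
    import torch
    import torch.nn as nn
    import torch.nn.functional as F

    def spectrogram(x, n_fft=512, hop_length=128):
        """Magnitude spectrogram with a Hann window (parameters assumed)."""
        window = torch.hann_window(n_fft, device=x.device)
        spec = torch.stft(x, n_fft=n_fft, hop_length=hop_length,
                          window=window, return_complex=True)
        return spec.abs()

    class ToyDiscriminator(nn.Module):
        """Stand-in waveform discriminator; not the paper's architecture."""
        def __init__(self):
            super().__init__()
            self.net = nn.Sequential(
                nn.Conv1d(1, 16, kernel_size=15, stride=4), nn.LeakyReLU(0.2),
                nn.Conv1d(16, 32, kernel_size=15, stride=4), nn.LeakyReLU(0.2),
                nn.AdaptiveAvgPool1d(1), nn.Flatten(), nn.Linear(32, 1),
            )

        def forward(self, x):
            # x: (batch, samples) -> real/fake logits
            return self.net(x.unsqueeze(1))

    def generator_loss(enhanced, clean, discriminator, lambda_spec=10.0):
        """Adversarial term plus spectrogram-feature term (weight assumed)."""
        # Adversarial term: push the discriminator to score enhanced audio as real.
        logits = discriminator(enhanced)
        adv = F.binary_cross_entropy_with_logits(logits, torch.ones_like(logits))
        # Spectrogram-feature term: L1 distance between magnitude spectrograms.
        spec = F.l1_loss(spectrogram(enhanced), spectrogram(clean))
        return adv + lambda_spec * spec

    if __name__ == "__main__":
        disc = ToyDiscriminator()
        enhanced = torch.randn(2, 16000, requires_grad=True)  # 1 s at 16 kHz
        clean = torch.randn(2, 16000)
        loss = generator_loss(enhanced, clean, disc)
        loss.backward()
        print(float(loss))

In this sketch the enhancement network would be trained by minimizing generator_loss while the discriminator is trained on the opposite objective; the specific network architectures and loss weighting are not specified by the abstract.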