Sound Events Recognition and Retrieval Using Multi-Channel Sparse Coding Convolutional Neural Networks

Chien-Yao Wang,Tzu-Chiang Tai,Jia-Ching Wang,Andri Santoso,Seksan Mathulaprangsan,Chin-Chin Chiang,Chung-Hsien Wu

Sound Events Recognition and Retrieval Using Multi-Channel Sparse Coding Convolutional Neural Networks

2020

Chien-Yao Wang
Tzu-Chiang Tai
Jia-Ching Wang
Andri Santoso
Seksan Mathulaprangsan
Chin-Chin Chiang
Chung-Hsien Wu

This article proposes two novel deep convolutional neural networks (CNN), which are called the sparse coding convolutional neural network (SC-CNN) and the multi-convolutional-channel SC-CNN (MSC-CNN), to address the sound event recognition and retrieval problem. Unlike the general framework of a CNN, in which the feature learning process is performed hierarchically, the proposed framework models the whole memorization process in the human brain, including encoding, storage, and recollection. In particular, the MSC-CNN is designed to recognize multiple sound events that occur simultaneously. The experimental results indicate that the proposed SC-CNN and MSC-CNN outperforms the state-of-the-art systems in sound event recognition and retrieval.

Keywords:

Convolutional neural network
Computer science
Speech recognition
multi channel
Artificial intelligence
Neural coding
Pattern recognition
Communication channel

Correction
Source
Cite
Save
Machine Reading By IdeaReader

References

Citations