MAEC: Multi-Instance Learning with an Adversarial Auto-Encoder-Based Classifier for Speech Emotion Recognition

Changzeng Fu,Chaoran Liu,Carlos Toshinori Ishi,Hiroshi Ishiguro

MAEC: Multi-Instance Learning with an Adversarial Auto-Encoder-Based Classifier for Speech Emotion Recognition

2021

Changzeng Fu
Chaoran Liu
Carlos Toshinori Ishi
Hiroshi Ishiguro

In this paper, we propose an adversarial auto-encoder-based classifier, which can regularize the distribution of latent representation to smooth the boundaries among categories. Moreover, we adopt multi-instance learning by dividing speech into a bag of segments to capture the most salient moments for presenting an emotion. The proposed model was trained on the IEMOCAP dataset and evaluated on the in-corpus validation set (IEMOCAP) and the cross-corpus validation set (MELD). The experiment results show that our model outperforms the baseline on in-corpus validation and increases the scores on cross-corpus validation with regularization.

Keywords:

Speech recognition
Emotion recognition
Classifier (UML)
Adversarial system
Autoencoder
Computer science

Correction
Source
Cite
Save
Machine Reading By IdeaReader

References

Citations