Integrating Facial Images, Speeches and Time for Empathy Prediction.

2019 
We propose a multi-modal method for the One-Minute Empathy Prediction competition. First, we use a bottleneck residual network and a fully-connected network to encode the listener's facial images and speech. Second, we propose using the current time stage as a temporal feature and encode it into the proposed multi-modal network. Third, we select a subset of the training data based on its empathy-prediction performance on the validation data. Experimental results on the test set show that the proposed method significantly outperforms the baseline according to the CCC metric (0.14 vs. 0.06).
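To make the described fusion concrete, below is a minimal PyTorch sketch of a network of this shape: a bottleneck residual encoder for face crops, a fully-connected encoder for speech features, and the time stage embedded as an extra input. All layer sizes, the block design, and the embedding of the time stage are hypothetical illustrations, not the paper's actual architecture.

```python
import torch
import torch.nn as nn

class BottleneckBlock(nn.Module):
    """ResNet-style 1x1 -> 3x3 -> 1x1 residual bottleneck block."""
    def __init__(self, channels, reduced):
        super().__init__()
        self.body = nn.Sequential(
            nn.Conv2d(channels, reduced, 1), nn.ReLU(inplace=True),
            nn.Conv2d(reduced, reduced, 3, padding=1), nn.ReLU(inplace=True),
            nn.Conv2d(reduced, channels, 1),
        )

    def forward(self, x):
        return torch.relu(x + self.body(x))

class EmpathyNet(nn.Module):
    """Hypothetical multi-modal regressor: face + speech + time stage."""
    def __init__(self, speech_dim=64, n_stages=10):
        super().__init__()
        # Bottleneck residual encoder for the listener's face images.
        self.face = nn.Sequential(
            nn.Conv2d(3, 32, 3, stride=2, padding=1), nn.ReLU(inplace=True),
            BottleneckBlock(32, 8),
            nn.AdaptiveAvgPool2d(1), nn.Flatten(),
        )
        # Fully-connected encoder for precomputed speech features.
        self.speech = nn.Sequential(nn.Linear(speech_dim, 32), nn.ReLU(inplace=True))
        # The current time stage, encoded as a learned embedding.
        self.stage = nn.Embedding(n_stages, 8)
        # Fusion head regressing the scalar empathy score.
        self.head = nn.Sequential(
            nn.Linear(32 + 32 + 8, 32), nn.ReLU(inplace=True),
            nn.Linear(32, 1),
        )

    def forward(self, face_img, speech_feat, stage_id):
        z = torch.cat(
            [self.face(face_img), self.speech(speech_feat), self.stage(stage_id)],
            dim=-1,
        )
        return self.head(z).squeeze(-1)
```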
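For reference, the evaluation metric quoted in the abstract is the Concordance Correlation Coefficient (CCC). The following NumPy sketch implements Lin's standard definition of CCC; it is not code released with the paper.

```python
import numpy as np

def ccc(y_true, y_pred):
    """CCC = 2*cov(x, y) / (var(x) + var(y) + (mean(x) - mean(y))**2)."""
    y_true = np.asarray(y_true, dtype=float)
    y_pred = np.asarray(y_pred, dtype=float)
    mu_t, mu_p = y_true.mean(), y_pred.mean()
    var_t, var_p = y_true.var(), y_pred.var()
    cov = ((y_true - mu_t) * (y_pred - mu_p)).mean()
    return 2.0 * cov / (var_t + var_p + (mu_t - mu_p) ** 2)
```

A perfect predictor yields CCC = 1; unlike Pearson correlation, CCC also penalizes differences in mean and scale between predictions and ground truth.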