Guided Spatial Transformers for Facial Expression Recognition

Cristina Luna Jiménez,Jorge Cristobal Martín,Ricardo Kleinlein,Manuel Gil Martín,Jose M. Moya,Fernando Fernández-Martínez

Guided Spatial Transformers for Facial Expression Recognition

2021

Spatial Transformer Networks are considered a powerful algorithm to learn the main areas of an image, but still, they could be more efficient by receiving images with embedded expert knowledge. This paper aims to improve the performance of conventional Spatial Transformers when applied to Facial Expression Recognition. Based on the Spatial Transformers’ capacity of spatial manipulation within networks, we propose different extensions to these models where effective attentional regions are captured employing facial landmarks or facial visual saliency maps. This specific attentional information is then hardcoded to guide the Spatial Transformers to learn the spatial transformations that best fit the proposed regions for better recognition results. For this study, we use two datasets: AffectNet and FER-2013. For AffectNet, we achieve a 0.35% point absolute improvement relative to the traditional Spatial Transformer, whereas for FER-2013, our solution gets an increase of 1.49% when models are fine-tuned with the Affectnet pre-trained weights.

Keywords:

Correction
Source
Cite
Save
Machine Reading By IdeaReader

References

Citations