A First Prototype of an Emotional Smart Speaker.

2021 
Affective computing comprises the techniques devoted to identifying and understanding human emotions. Among its many subtopics, Speech Emotion Recognition (SER) stands out. In the last two decades, we have witnessed the birth and expansion of marketed products such as smart voice assistants and their associated autonomous smart speakers from Amazon, Google, and Apple. This work presents the design and implementation of a new Emotional Smart Speaker prototype based on the hybridisation of an Amazon Echo Dot device and a Raspberry Pi with a built-in low-power SER algorithm. The proposed SER algorithm is based on a Bag of Models method with two base models: an Extra-Trees algorithm and a pre-trained ResNet18 neural network. The proposal has been validated on four well-known SER datasets: EmoDB, TESS, SAVEE and RAVDESS. The resulting model outperforms eleven well-known ML methods available in the literature on the studied public datasets.
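The abstract does not say how the two base models are combined. As an illustration only, the sketch below assumes one common choice: averaging the per-class probabilities of the base models and picking the most probable emotion. The function names, the emotion labels, and the probability values are hypothetical and do not come from the paper; the two dictionaries merely stand in for the outputs of a tree-based model and a neural network.

```python
from typing import Dict, List


def combine_probabilities(model_outputs: List[Dict[str, float]]) -> Dict[str, float]:
    """Average per-class probabilities across base models (assumed combination rule)."""
    if not model_outputs:
        raise ValueError("at least one base model output is required")
    labels = list(model_outputs[0].keys())
    n_models = len(model_outputs)
    return {
        label: sum(output[label] for output in model_outputs) / n_models
        for label in labels
    }


def predict_emotion(model_outputs: List[Dict[str, float]]) -> str:
    """Return the label with the highest averaged probability."""
    combined = combine_probabilities(model_outputs)
    return max(combined, key=combined.get)


# Hypothetical outputs for one utterance; values are made up for illustration.
tree_probs = {"anger": 0.50, "happiness": 0.30, "neutral": 0.10, "sadness": 0.10}
cnn_probs = {"anger": 0.20, "happiness": 0.60, "neutral": 0.10, "sadness": 0.10}

print(predict_emotion([tree_probs, cnn_probs]))
```

Averaging probabilities rather than hard votes lets a confident base model outweigh an uncertain one, which may matter when only two models are combined.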