Speech Emotion Recognition Using Deep Learning Techniques: A Review

Ruhul Amin Khalil*, Edward Jones, Mohammad Inayatullah Babar, Tariqullah Jan, Mohammad Haseeb Zafar, Thamer Alhussain

*Awdur cyfatebol y gwaith hwn

Allbwn ymchwil: Cyfraniad at gyfnodolynErthygladolygiad gan gymheiriaid

429 Dyfyniadau (Scopus)

Crynodeb

Emotion recognition from speech signals is an important but challenging component of Human-Computer Interaction (HCI). In the literature of speech emotion recognition (SER), many techniques have been utilized to extract emotions from signals, including many well-established speech analysis and classification techniques. Deep Learning techniques have been recently proposed as an alternative to traditional techniques in SER. This paper presents an overview of Deep Learning techniques and discusses some recent literature where these methods are utilized for speech-based emotion recognition. The review covers databases used, emotions extracted, contributions made toward speech emotion recognition and limitations related to it.

Iaith wreiddiolSaesneg
Rhif yr erthygl8805181
Tudalennau (o-i)117327-117345
Nifer y tudalennau19
CyfnodolynIEEE Access
Cyfrol7
Dynodwyr Gwrthrych Digidol (DOIs)
StatwsCyhoeddwyd - 19 Awst 2019
Cyhoeddwyd yn allanolIe

Dyfynnu hyn