Show simple item record

AuthorLatif, Siddique
AuthorRana, Rajib
AuthorKhalifa, Sara
AuthorJurdak, Raja
AuthorQadir, Junaid
AuthorSchuller, Bjorn
Available date2025-07-08T03:58:09Z
Publication Date2023
Publication NameIEEE Transactions on Affective Computing
ResourceScopus
Identifierhttp://dx.doi.org/10.1109/TAFFC.2021.3114365
ISSN19493045
URIhttp://hdl.handle.net/10576/66063
AbstractTraditionally, speech emotion recognition (SER) research has relied on manually handcrafted acoustic features using feature engineering. However, the design of handcrafted features for complex SER tasks requires significant manual effort, which impedes generalisability and slows the pace of innovation. This has motivated the adoption of representation learning techniques that can automatically learn an intermediate representation of the input signal without any manual feature engineering. Representation learning has led to improved SER performance and enabled rapid innovation. Its effectiveness has further increased with advances in deep learning (DL), which has facilitated deep representation learning where hierarchical representations are automatically learned in a data-driven manner. This article presents the first comprehensive survey on the important topic of deep representation learning for SER. We highlight various techniques, related challenges and identify important future areas of research. Our survey bridges the gap in the literature since existing surveys either focus on SER with hand-engineered features or representation learning in the general setting without focusing on SER.
Languageen
PublisherIEEE
Subjectdomain adaptation
multi task learning
representation learning
Speech emotion recognition
unsupervised learning
TitleSurvey of Deep Representation Learning for Speech Emotion Recognition
TypeArticle
Pagination1634-1654
Issue Number2
Volume Number14
dc.accessType Full Text


Files in this item

Thumbnail

This item appears in the following Collection(s)

Show simple item record