Show simple item record

AuthorHamza, Kheddar
AuthorHimeur, Yassine
AuthorAl-Maadeed, Somaya
AuthorAmira, Abbes
AuthorBensaali, Faycal
Available date2024-10-13T09:53:18Z
Publication Date2023-10-09
Publication NameKnowledge-Based Systems
Identifierhttp://dx.doi.org/10.1016/j.knosys.2023.110851
CitationKheddar, H., Himeur, Y., Al-Maadeed, S., Amira, A., & Bensaali, F. (2023). Deep transfer learning for automatic speech recognition: Towards better generalization. Knowledge-Based Systems, 277, 110851.‏
ISSN09507051
URIhttps://www.sciencedirect.com/science/article/pii/S0950705123006019
URIhttp://hdl.handle.net/10576/60081
AbstractAutomatic speech recognition (ASR) has recently become an important challenge when using deep learning (DL). It requires large-scale training datasets and high computational and storage resources. Moreover, DL techniques and machine learning (ML) approaches in general, hypothesize that training and testing data come from the same domain, with the same input feature space and data distribution characteristics. This assumption, however, is not applicable in some real-world artificial intelligence (AI) applications. Moreover, there are situations where gathering real data is challenging, expensive, or rarely occurring, which cannot meet the data requirements of DL models. deep transfer learning (DTL) has been introduced to overcome these issues, which helps develop high-performing models using real datasets that are small or slightly different but related to the training data. This paper presents a comprehensive survey of DTL-based ASR frameworks to shed light on the latest developments and helps academics and professionals understand current challenges. Specifically, after presenting the DTL background, a well-designed taxonomy is adopted to inform the state-of-the-art. A critical analysis is then conducted to identify the limitations and advantages of each framework. Moving on, a comparative study is introduced to highlight the current challenges before deriving opportunities for future research.
Languageen
PublisherElsevier B.V.
SubjectAutomatic speech recognition
Deep transfer learning
Fine-tuning
Domain adaptation
Models fusion
Large language model
TitleDeep transfer learning for automatic speech recognition: Towards better generalization
TypeArticle
Volume Number277
dc.accessType Open Access


Files in this item

Thumbnail

This item appears in the following Collection(s)

Show simple item record