Deep transfer learning for automatic speech recognition: Towards better generalization

Hamza, Kheddar; Himeur, Yassine; Al-Maadeed, Somaya; Amira, Abbes; Bensaali, Faycal

المؤلف	Hamza, Kheddar
المؤلف	Himeur, Yassine
المؤلف	Al-Maadeed, Somaya
المؤلف	Amira, Abbes
المؤلف	Bensaali, Faycal
تاريخ الإتاحة	2024-10-13T09:53:18Z
تاريخ النشر	2023-10-09
اسم المنشور	Knowledge-Based Systems
المعرّف	http://dx.doi.org/10.1016/j.knosys.2023.110851
الاقتباس	Kheddar, H., Himeur, Y., Al-Maadeed, S., Amira, A., & Bensaali, F. (2023). Deep transfer learning for automatic speech recognition: Towards better generalization. Knowledge-Based Systems, 277, 110851.‏
الرقم المعياري الدولي للكتاب	09507051
معرّف المصادر الموحد	https://www.sciencedirect.com/science/article/pii/S0950705123006019
معرّف المصادر الموحد	http://hdl.handle.net/10576/60081
الملخص	Automatic speech recognition (ASR) has recently become an important challenge when using deep learning (DL). It requires large-scale training datasets and high computational and storage resources. Moreover, DL techniques and machine learning (ML) approaches in general, hypothesize that training and testing data come from the same domain, with the same input feature space and data distribution characteristics. This assumption, however, is not applicable in some real-world artificial intelligence (AI) applications. Moreover, there are situations where gathering real data is challenging, expensive, or rarely occurring, which cannot meet the data requirements of DL models. deep transfer learning (DTL) has been introduced to overcome these issues, which helps develop high-performing models using real datasets that are small or slightly different but related to the training data. This paper presents a comprehensive survey of DTL-based ASR frameworks to shed light on the latest developments and helps academics and professionals understand current challenges. Specifically, after presenting the DTL background, a well-designed taxonomy is adopted to inform the state-of-the-art. A critical analysis is then conducted to identify the limitations and advantages of each framework. Moving on, a comparative study is introduced to highlight the current challenges before deriving opportunities for future research.
اللغة	en
الناشر	Elsevier B.V.
الموضوع	Automatic speech recognition Deep transfer learning Fine-tuning Domain adaptation Models fusion Large language model
العنوان	Deep transfer learning for automatic speech recognition: Towards better generalization
النوع	Article
رقم المجلد	277
dc.accessType	Open Access

تحقق من خيارات الوصول

الملفات في هذه التسجيلة

الاسم:: 1-s2.0-S0950705123006019-main.pdf
الحجم:: 1.984Mb
الصيغة:: PDF

عرض / فتح

هذه التسجيلة تظهر في المجموعات التالية

علوم وهندسة الحاسب [‎2484‎ items ]

عرض بسيط للتسجيلة

Deep transfer learning for automatic speech recognition: Towards better generalization

الملفات في هذه التسجيلة

هذه التسجيلة تظهر في المجموعات التالية

Video