Comparative Evaluation of Sentiment Analysis Methods Across Arabic Dialects
Author | Baly, Ramy |
Author | El-Khoury, Georges |
Author | Moukalled, Rawan |
Author | Aoun, Rita |
Author | Hajj, Hazem |
Author | Shaban, Khaled Bashir |
Author | El-Hajj, Wassim |
Date available | 2022-12-21T10:01:46Z |
Publication date | 2017 |
Publication name | Procedia Computer Science |
Source | Scopus |
Abstract | Sentiment analysis in Arabic is challenging due to the complex morphology of the language. The task becomes more challenging when considering Twitter data, which contain significant amounts of noise: the use of Arabizi, code-switching, and dialects that vary significantly across the Arab world; the use of non-textual objects to express sentiment; and the frequent occurrence of misspellings and grammatical mistakes. Modeling sentiment in Twitter should become easier once we understand the characteristics of Twitter data and how its usage varies from one Arab region to another. We describe our effort to create the first Multi-Dialect Arabic Sentiment Twitter Dataset (MD-ArSenTD), which is composed of tweets collected from 12 Arab countries and annotated for sentiment and dialect. We use this dataset to analyze tweets collected from Egypt and the United Arab Emirates (UAE), with the aim of discovering distinctive features that may facilitate sentiment analysis. We also perform a comparative evaluation of different sentiment models on Egyptian and UAE tweets. These models are based on feature engineering and deep learning, and have already achieved state-of-the-art accuracies in English sentiment analysis. Results indicate the superior performance of deep learning models, the importance of morphological features in Arabic NLP, and that handling dialectal Arabic leads to different outcomes depending on the country from which the tweets are collected. |
Sponsor | This work was made possible by NPRP 6-716-1-138 grant from the Qatar National Research Fund (a member of Qatar Foundation). The statements made herein are solely the responsibility of the authors. |
Language | en |
Publisher | Elsevier |
Subject | Computational linguistics; Deep learning; Linguistics; Social networking (online); Comparative evaluations; Complex morphology; Dialectal arabics; Feature engineerings; Morphological features; Sentiment analysis; State of the art; United Arab Emirates; Data mining |
Type | Conference Paper |
Pages | 266-273 |
Volume number | 117 |