A study of the effects of preprocessing strategies on sentiment analysis for Arabic text
Author | Duwairi, Rehab |
Author | El-Orfali, Mahmoud |
Available date | 2016-05-16T10:55:24Z |
Publication Date | 2014-08 |
Publication Name | Journal of Information Science |
Resource | Scopus |
Citation | Rehab Duwairi and Mahmoud El-Orfali "A study of the effects of preprocessing strategies on sentiment analysis for Arabic text" Journal of Information Science August 2014 40:501-513 |
ISSN | 0165-5515 |
Abstract | Sentiment analysis has drawn considerable interest among researchers owing to the realization of its fascinating commercial and business benefits. This paper deals with sentiment analysis in Arabic text from three perspectives. First, several alternatives of text representation were investigated. In particular, the effects of stemming, feature correlation and n-gram models for Arabic text on sentiment analysis were investigated. Second, the behaviour of three classifiers, namely, SVM, Naive Bayes, and K-nearest neighbour classifiers, with sentiment analysis was investigated. Third, the effects of the characteristics of the dataset on sentiment analysis were analysed. To this end, we applied the techniques proposed in this paper to two datasets; one was prepared in-house by the authors and the second one is freely available online. All the experimentation was done using Rapidminer. The results show that our selection of preprocessing strategies on the reviews increases the performance of the classifiers. |
Language | en |
Publisher | SAGE Publications Ltd |
Subject | Arabic text opinion mining polarity classification sentiment analysis |
Type | Article |
Pagination | 501-513 |
Issue Number | 4 |
Volume Number | 40 |
Files in this item
Files | Size | Format | View |
---|---|---|---|
There are no files associated with this item. |
This item appears in the following Collection(s)
-
Computer Science & Engineering [2402 items ]