Browse by Publisher
Showing records 1-2 of 2
Arabic corpora for credibility analysis
(European Language Resources Association (ELRA), 2016, Conference Paper) A significant portion of data generated on blogging and microblogging websites is non-credible, as shown in many recent studies. To filter out such non-credible information, machine learning can be deployed to build automatic ...
DART: A large dataset of dialectal Arabic tweets
(European Language Resources Association (ELRA), 2019, Conference Paper) In this paper, we present a new large manually-annotated multi-dialect dataset of Arabic tweets that is publicly available. The Dialectal ARabic Tweets (DART) dataset has about 25K tweets that are annotated via crowdsourcing ...