• Arabic corpora for credibility analysis 

      Al Zaatari, Ayman; El Ballouli, Rim; Elbassuoni, Shady; El-Hajj, Wassim; Hajj, Hazem; ... more authors ( European Language Resources Association (ELRA) , 2016 , Conference Paper)
      A significant portion of data generated on blogging and microblogging websites is non-credible as shown in many recent studies. To filter out such non-credible information, machine learning can be deployed to build automatic ...
    • DART: A large dataset of dialectal Arabic tweets 

      Alsarsour, Israa; Mohamed, Esraa; Suwaileh, Reem; Elsayed, Tamer ( European Language Resources Association (ELRA) , 2019 , Conference Paper)
      In this paper, we present a new large manually-annotated multi-dialect dataset of Arabic tweets that is publicly available. The Dialectal ARabic Tweets (DART) dataset has about 25K tweets that are annotated via crowdsourcing ...