• ArabicWeb16: A new crawl for today's Arabic Web 

      Suwaileh, Reem; Kutlu, Mucahid; Fathima, Nihal; Elsayed, Tamer; Lease, Matthew ( Association for Computing Machinery, Inc , 2016 , Conference Paper)
      Web crawls provide valuable snapshots of the Web which enable a wide variety of research, be it distributional analysis to characterize Web properties or use of language, content analysis in social science, or Information ...
    • ArCov-19: The First Arabic COVID-19 Twitter Database with Propagation Networks 

      Haouari, Fatima; Hasanain, Maram; Suwaileh, Reem; Elsayed, Tamer ( Cornel University , 2020 , Article  &   Video)
      In this paper, we present ArCOV-19, an Arabic COVID-19 Twitter dataset that covers the period from 27th of January till 31st of March 2020. ArCOV-19 is the first publicly-available Arabic Twitter dataset covering COVID-19 ...
    • ArTest: The First Test Collection for Arabic Web Search with Relevance Rationales 

      Hasanain, Maram; Barkallah, Yassmine; Suwaileh, Reem; Kutlu, Mucahid; Elsayed, Tamer ( Association for Computing Machinery, Inc , 2020 , Conference Paper)
      The scarcity of Arabic test collections has long hindered information retrieval (IR) research over the Arabic Web. In this work, we present ArTest, the first large-scale test collection designed for the evaluation of ad-hoc ...
    • DART: A large dataset of dialectal Arabic tweets 

      Alsarsour, Israa; Mohamed, Esraa; Suwaileh, Reem; Elsayed, Tamer ( European Language Resources Association (ELRA) , 2019 , Conference Paper)
      In this paper, we present a new large manually-annotated multi-dialect dataset of Arabic tweets that is publicly available. The Dialectal ARabic Tweets (DART) dataset has about 25K tweets that are annotated via crowdsourcing ...
    • Effective Realtime Tweet Summarization 

      Suwaileh, Reem; Elsayed, Tamer ( Hamad bin Khalifa University Press (HBKU Press) , 2018 , Conference Paper)
      Twitter has been developed as an immense information creation and sharing network through which users post information. Information could vary from the world»s breaking news to other topics such as sports, science, religion, ...
    • Overview of the CLEF-2019 Checkthat! LAB: Automatic identification and verification of claims. Task 2: Evidence and factuality 

      Hasanain, Maram; Suwaileh, Reem; Elsayed, Tamer; Barrón-Cedeño, Alberto; Nakov, Preslav ( CEUR-WS , 2019 , Conference Paper)
      We present an overview of the second edition of the CheckThat! Lab at CLEF 2019. The lab featured two tasks in two different languages: English and Arabic. Task 1 (English) challenged the participating systems to predict ...