    • ArabicWeb16: A new crawl for today's Arabic Web

      Suwaileh, Reem; Kutlu, Mucahid; Fathima, Nihal; Elsayed, Tamer; Lease, Matthew ( Association for Computing Machinery, Inc , 2016 , Conference Paper)
      Web crawls provide valuable snapshots of the Web which enable a wide variety of research, be it distributional analysis to characterize Web properties or use of language, content analysis in social science, or Information ...
    • ArCOV-19: The First Arabic COVID-19 Twitter Database with Propagation Networks

      Haouari, Fatima; Hasanain, Maram; Suwaileh, Reem; Elsayed, Tamer ( Cornell University , 2020 , Article & Video)
      In this paper, we present ArCOV-19, an Arabic COVID-19 Twitter dataset that covers the period from 27th of January till 31st of March 2020. ArCOV-19 is the first publicly-available Arabic Twitter dataset covering COVID-19 ...
    • BIGIR at CLEF 2019: Automatic verification of Arabic claims over the web 

      Haouari, Fatima; Ali, Zien Sheikh; Elsayed, Tamer ( CEUR-WS , 2019 , Conference Paper)
      With the proliferation of fake news and its prevalent impact on democracy, journalism, and public opinions, manual fact-checkers become unscalable to the volume and speed of fake news propagation. Automatic fact-checkers ...
    • Crowd vs. Expert: What can relevance judgment rationales teach us about assessor disagreement? 

      Kutlu, Mucahid; McDonnell, Tyler; Barkallah, Yassmine; Elsayed, Tamer; ... more authors ( ACM , 2018 , Conference Paper)
      While crowdsourcing offers a low-cost, scalable way to collect relevance judgments, lack of transparency with remote crowd work has limited understanding about the quality of collected judgments. In prior work, ...
    • DART: A large dataset of dialectal Arabic tweets 

      Alsarsour, Israa; Mohamed, Esraa; Suwaileh, Reem; Elsayed, Tamer ( European Language Resources Association (ELRA) , 2019 , Conference Paper)
      In this paper, we present a new large manually-annotated multi-dialect dataset of Arabic tweets that is publicly available. The Dialectal ARabic Tweets (DART) dataset has about 25K tweets that are annotated via crowdsourcing ...
    • EveTAR: A new test collection for event detection in Arabic tweets 

      Almerekhi, Hind; Hasanain, Maram; Elsayed, Tamer ( Association for Computing Machinery, Inc , 2016 , Conference Paper)
      Research on event detection in Twitter is often obstructed by the lack of publicly-available evaluation mechanisms such as test collections; this problem is more severe when considering the scarcity of them in languages ...
    • On the evaluation of tweet timeline generation task 

      Magdy, Walid; Elsayed, Tamer; Hasanain, Maram ( Springer Verlag , 2016 , Conference Paper)
      Tweet Timeline Generation (TTG) task aims to generate a timeline of relevant but novel tweets that summarizes the development of a given topic. A typical TTG system first retrieves tweets then detects novel tweets among ...
    • Overview of the CLEF-2019 CheckThat! Lab: Automatic identification and verification of claims. Task 2: Evidence and factuality

      Hasanain, Maram; Suwaileh, Reem; Elsayed, Tamer; Barrón-Cedeño, Alberto; Nakov, Preslav ( CEUR-WS , 2019 , Conference Paper)
      We present an overview of the second edition of the CheckThat! Lab at CLEF 2019. The lab featured two tasks in two different languages: English and Arabic. Task 1 (English) challenged the participating systems to predict ...
    • QU-IR at SemEval 2016 Task 3: Learning to rank on Arabic community question answering forums with word embedding 

      Malhas, Rana; Torki, Marwan; Elsayed, Tamer ( Association for Computational Linguistics (ACL) , 2016 , Conference Paper)
      Resorting to community question answering (CQA) websites for finding answers has gained momentum in the past decade with the explosive rate at which social media has been proliferating. With many questions left unanswered ...
    • Query performance prediction for microblog search 

      Hasanain, Maram; Elsayed, Tamer ( Elsevier Ltd , 2017 , Article)
      Query performance prediction (QPP) is the task of estimating the effectiveness of a retrieval system given a search query in the absence of any feedback from the searcher. The task has been proven to be very challenging, ...
    • Unsupervised adaptive microblog filtering for broad dynamic topics 

      Magdy, Walid; Elsayed, Tamer ( Elsevier , 2016 , Article)
      Information filtering has been a major task of study in the field of information retrieval (IR) for a long time, focusing on filtering well-formed documents such as news articles. Recently, more interest was directed towards ...
    • What questions do journalists ask on Twitter? 

      Hasanain, Maram; Bagdouri, Mossaab; Elsayed, Tamer; Oard, Douglas W. ( AI Access Foundation , 2016 , Conference Paper)
      Social media platforms are a major source of information for both the general public and for journalists. Journalists use Twitter and other social media services to gather story ideas, to find eyewitnesses, and for a wide ...