Search

Now showing items 1-10 of 22

DART: A large dataset of dialectal Arabic tweets

Alsarsour, Israa; Mohamed, Esraa; Suwaileh, Reem; Elsayed, Tamer ( European Language Resources Association (ELRA) , 2019 , Conference Paper)

In this paper, we present a new large manually-annotated multi-dialect dataset of Arabic tweets that is publicly available. The Dialectal ARabic Tweets (DART) dataset has about 25K tweets that are annotated via crowdsourcing ...

QU-IR at SemEval 2016 Task 3: Learning to rank on Arabic community question answering forums with word embedding

Malhas, Rana; Torki, Marwan; Elsayed, Tamer ( Association for Computational Linguistics (ACL) , 2016 , Conference Paper)

Resorting to community question answering (CQA) websites for finding answers has gained momentum in the past decade with the explosive rate at which social media has been proliferating. With many questions left unanswered ...

Crowd vs. Expert: What can relevance judgment rationales teach us about assessor disagreement?

Kutlu, M.; Kutlu, Mucahid; McDonnell, Tyler; Barkallah, Yassmine; Elsayed, Tamer; ... more authors ... less authors ( ACM , 2018 , Conference Paper)

© 2018 ACM. While crowdsourcing offers a low-cost, scalable way to collect relevance judgments, lack of transparency with remote crowd work has limited understanding about the quality of collected judgments. In prior work, ...

BIGIR at CLEF 2019: Automatic verification of Arabic claims over the web

Haouari, Fatima; Ali, Zien Sheikh; Elsayed, Tamer ( CEUR-WS , 2019 , Conference Paper)

With the proliferation of fake news and its prevalent impact on democracy, journalism, and public opinions, manual fact-checkers become unscalable to the volume and speed of fake news propagation. Automatic fact-checkers ...

What questions do journalists ask on Twitter?

Hasanain, Maram; Bagdouri, Mossaab; Elsayed, Tamer; Oard, Douglas W. ( AI Access Foundation , 2016 , Conference Paper)

Social media platforms are a major source of information for both the general public and for journalists. Journalists use Twitter and other social media services to gather story ideas, to find eyewitnesses, and for a wide ...

EveTAR: A new test collection for event detection in Arabic tweets

Almerekhi, Hind; Hasanain, Maram; Elsayed, Tamer ( Association for Computing Machinery, Inc , 2016 , Conference Paper)

Research on event detection in Twitter is often obstructed by the lack of publicly-available evaluation mechanisms such as test collections; this problem is more severe when considering the scarcity of them in languages ...

ArabicWeb16: A new crawl for today's Arabic Web

Suwaileh, Reem; Kutlu, Mucahid; Fathima, Nihal; Elsayed, Tamer; Lease, Matthew ( Association for Computing Machinery, Inc , 2016 , Conference Paper)

Web crawls provide valuable snapshots of the Web which enable a wide variety of research, be it distributional analysis to characterize Web properties or use of language, content analysis in social science, or Information ...

On the evaluation of tweet timeline generation task

Magdy, Walid; Elsayed, Tamer; Hasanain, Maram ( Springer Verlag , 2016 , Conference Paper)

Tweet Timeline Generation (TTG) task aims to generate a timeline of relevant but novel tweets that summarizes the development of a given topic. A typical TTG system first retrieves tweets then detects novel tweets among ...

Effective Realtime Tweet Summarization

Suwaileh, Reem; Elsayed, Tamer ( Hamad bin Khalifa University Press (HBKU Press) , 2018 , Conference Paper)

Twitter has been developed as an immense information creation and sharing network through which users post information. Information could vary from the world»s breaking news to other topics such as sports, science, religion, ...

Overview of the CLEF-2019 Checkthat! LAB: Automatic identification and verification of claims. Task 2: Evidence and factuality

Hasanain, Maram; Suwaileh, Reem; Elsayed, Tamer; Barrón-Cedeño, Alberto; Nakov, Preslav ( CEUR-WS , 2019 , Conference Paper)

We present an overview of the second edition of the CheckThat! Lab at CLEF 2019. The lab featured two tasks in two different languages: English and Arabic. Task 1 (English) challenged the participating systems to predict ...