Search
Now showing items 1-10 of 22
DART: A large dataset of dialectal Arabic tweets
(
European Language Resources Association (ELRA)
, 2019 , Conference Paper)
In this paper, we present a new large manually-annotated multi-dialect dataset of Arabic tweets that is publicly available. The Dialectal ARabic Tweets (DART) dataset has about 25K tweets that are annotated via crowdsourcing ...
ArabicWeb16: A new crawl for today's Arabic Web
(
Association for Computing Machinery, Inc
, 2016 , Conference Paper)
Web crawls provide valuable snapshots of the Web which enable a wide variety of research, be it distributional analysis to characterize Web properties or use of language, content analysis in social science, or Information ...
Effective Realtime Tweet Summarization
(
Hamad bin Khalifa University Press (HBKU Press)
, 2018 , Conference Paper)
Twitter has been developed as an immense information creation and sharing network through which users post information. Information could vary from the world»s breaking news to other topics such as sports, science, religion, ...
Overview of the CLEF-2019 Checkthat! LAB: Automatic identification and verification of claims. Task 2: Evidence and factuality
(
CEUR-WS
, 2019 , Conference Paper)
We present an overview of the second edition of the CheckThat! Lab at CLEF 2019. The lab featured two tasks in two different languages: English and Arabic. Task 1 (English) challenged the participating systems to predict ...
ArTest: The First Test Collection for Arabic Web Search with Relevance Rationales
(
Association for Computing Machinery, Inc
, 2020 , Conference Paper)
The scarcity of Arabic test collections has long hindered information retrieval (IR) research over the Arabic Web. In this work, we present ArTest, the first large-scale test collection designed for the evaluation of ad-hoc ...
Time-critical geolocation for social good
(
Springer
, 2020 , Conference Paper)
Twitter has become an instrumental source of news in emergencies where efficient access, dissemination of information, and immediate reactions are critical. Nevertheless, due to several challenges, the current fully-automated ...
Improving Arabic Microblog Retrieval with Distributed Representations
(
Springer
, 2020 , Conference Paper)
Query expansion (QE) using pseudo relevance feedback (PRF) is one of the approaches that has been shown to be effective for improving microblog retrieval. In this paper, we investigate the performance of three different ...
CheckThat! at CLEF 2019: Automatic identification and verification of claims
(
Springer Verlag
, 2019 , Conference Paper)
We introduce the second edition of the CheckThat! Lab, part of the 2019 Cross-Language Evaluation Forum (CLEF). CheckThat! proposes two complementary tasks. Task 1: predict which claims in a political debate should be ...
Overview of CheckThat! 2020: Automatic Identification and Verification of Claims in Social Media
(
Springer Science and Business Media Deutschland GmbH
, 2020 , Conference Paper)
We present an overview of the third edition of the CheckThat! Lab at CLEF 2020. The lab featured five tasks in two different languages: English and Arabic. The first four tasks compose the full pipeline of claim verification ...
CheckThat! at CLEF 2020: Enabling the automatic identification and verification of claims in social media
(
Springer
, 2020 , Conference Paper)
We describe the third edition of the CheckThat! Lab, which is part of the 2020 Cross-Language Evaluation Forum (CLEF). CheckThat! proposes four complementary tasks and a related task from previous lab editions, offered in ...