EveTAR: building a large-scale multi-task test collection over Arabic tweets

Hasanain, M.; Suwaileh, R.; Elsayed, T.; Kutlu, M.; Almerekhi, H.

المؤلف	Hasanain, M.
المؤلف	Suwaileh, R.
المؤلف	Elsayed, T.
المؤلف	Kutlu, M.
المؤلف	Almerekhi, H.
تاريخ الإتاحة	2020-03-18T08:10:08Z
تاريخ النشر	2018
اسم المنشور	Information Retrieval Journal
المصدر	Scopus
الرقم المعياري الدولي للكتاب	13864564
معرّف المصادر الموحد	http://dx.doi.org/10.1007/s10791-017-9325-7
معرّف المصادر الموحد	http://hdl.handle.net/10576/13322
الملخص	This article introduces a new language-independent approach for creating a large-scale high-quality test collection of tweets that supports multiple information retrieval (IR) tasks without running a shared-task campaign. The adopted approach (demonstrated over Arabic tweets) designs the collection around significant (i.e., popular) events, which enables the development of topics that represent frequent information needs of Twitter users for which rich content exists. That inherently facilitates the support of multiple tasks that generally revolve around events, namely event detection, ad-hoc search, timeline generation, and real-time summarization. The key highlights of the approach include diversifying the judgment pool via interactive search and multiple manually-crafted queries per topic, collecting high-quality annotations via crowd-workers for relevancy and in-house annotators for novelty, filtering out low-agreement topics and inaccessible tweets, and providing multiple subsets of the collection for better availability. Applying our methodology on Arabic tweets resulted in EveTAR, the first freely-available tweet test collection for multiple IR tasks. EveTAR includes a crawl of 355M Arabic tweets and covers 50 significant events for which about 62K tweets were judged with substantial average inter-annotator agreement (Kappa value of 0.71). We demonstrate the usability of EveTAR by evaluating existing algorithms in the respective tasks. Results indicate that the new collection can support reliable ranking of IR systems that is comparable to similar TREC collections, while providing strong baseline results for future studies over Arabic tweets.
راعي المشروع	Acknowledgements This work was made possible by NPRP Grant# NPRP 7-1313-1-245 from the Qatar National Research Fund (a member of Qatar Foundation). The statements made herein are solely the responsibility of the authors. We would like to thank the crowd workers and in-house annotators for their valuable efforts in producing high-quality judgments of tweets. We would also like to thank Doug Oard, Mark Sanderson, and the anonymous reviewers for their insightful comments on earlier versions of this article.
اللغة	en
الناشر	Springer Netherlands
الموضوع	Ad-hoc search Dialects Evaluation Event detection Microblogs Real-time summarization Timeline generation Twitter
العنوان	EveTAR: building a large-scale multi-task test collection over Arabic tweets
النوع	Article
الصفحات	307 - 336
رقم العدد	4
رقم المجلد	21
dc.accessType	Abstract Only

الملفات في هذه التسجيلة

الملفات	الحجم	الصيغة	العرض
لا توجد ملفات لها صلة بهذه التسجيلة.

هذه التسجيلة تظهر في المجموعات التالية

علوم وهندسة الحاسب [‎2485‎ items ]

عرض بسيط للتسجيلة

EveTAR: building a large-scale multi-task test collection over Arabic tweets

الملفات في هذه التسجيلة

هذه التسجيلة تظهر في المجموعات التالية

Video