ArabicWeb16: A new crawl for today's Arabic Web
المؤلف | Suwaileh, Reem |
المؤلف | Kutlu, Mucahid |
المؤلف | Fathima, Nihal |
المؤلف | Elsayed, Tamer |
المؤلف | Lease, Matthew |
تاريخ الإتاحة | 2021-09-01T10:02:44Z |
تاريخ النشر | 2016 |
اسم المنشور | SIGIR 2016 - Proceedings of the 39th International ACM SIGIR Conference on Research and Development in Information Retrieval |
المصدر | Scopus |
الملخص | Web crawls provide valuable snapshots of the Web which enable a wide variety of research, be it distributional analysis to characterize Web properties or use of language, content analysis in social science, or Information Retrieval (IR) research to develop and evaluate effective search algorithms. While many English-centric Web crawls exist, existing public Arabic Web crawls are quite limited, limiting research and development. To remedy this, we present ArabicWeb16, a new public Web crawl of roughly 150M Arabic Web pages with significant coverage of dialectal Arabic as well as Modern Standard Arabic. For IR researchers, we expect ArabicWeb16 to support various research areas: ad-hoc search, question answering, filtering, cross-dialect search, dialect detection, entity search, blog search, and spam detection. 2016 ACM. |
اللغة | en |
الناشر | Association for Computing Machinery, Inc |
الموضوع | Internet Websites Ad-hoc search Arabic retrieval Evaluation Multi-Dialect Web collections Information retrieval |
النوع | Conference |
الصفحات | 673-676 |
الملفات في هذه التسجيلة
الملفات | الحجم | الصيغة | العرض |
---|---|---|---|
لا توجد ملفات لها صلة بهذه التسجيلة. |
هذه التسجيلة تظهر في المجموعات التالية
-
علوم وهندسة الحاسب [2426 items ]