Show simple item record

AuthorSuwaileh, Reem
AuthorKutlu, Mucahid
AuthorFathima, Nihal
AuthorElsayed, Tamer
AuthorLease, Matthew
Available date2021-09-01T10:02:44Z
Publication Date2016
Publication NameSIGIR 2016 - Proceedings of the 39th International ACM SIGIR Conference on Research and Development in Information Retrieval
ResourceScopus
URIhttp://dx.doi.org/10.1145/2911451.2914677
URIhttp://hdl.handle.net/10576/22382
AbstractWeb crawls provide valuable snapshots of the Web which enable a wide variety of research, be it distributional analysis to characterize Web properties or use of language, content analysis in social science, or Information Retrieval (IR) research to develop and evaluate effective search algorithms. While many English-centric Web crawls exist, existing public Arabic Web crawls are quite limited, limiting research and development. To remedy this, we present ArabicWeb16, a new public Web crawl of roughly 150M Arabic Web pages with significant coverage of dialectal Arabic as well as Modern Standard Arabic. For IR researchers, we expect ArabicWeb16 to support various research areas: ad-hoc search, question answering, filtering, cross-dialect search, dialect detection, entity search, blog search, and spam detection. 2016 ACM.
Languageen
PublisherAssociation for Computing Machinery, Inc
SubjectInternet
Websites
Ad-hoc search
Arabic retrieval
Evaluation
Multi-Dialect
Web collections
Information retrieval
TitleArabicWeb16: A new crawl for today's Arabic Web
TypeConference Paper
Pagination673-676


Files in this item

FilesSizeFormatView

There are no files associated with this item.

This item appears in the following Collection(s)

Show simple item record