bigIR at TREC 2020: Simple but Deep Retrieval of Passages and Documents
Abstract
In this paper, we present the participation of the bigIR team at Qatar University in the TREC Deep Learning 2020 track. We participated in both the document and passage retrieval tasks, and in both subtasks of each: full ranking and reranking. As this is our first participation in the track, our primary goal is to experiment with the latest approaches and pre-trained models for both tasks. We used the Anserini IR toolkit for indexing and retrieval, and experimented with different techniques for passage expansion and reranking, which are either BERT-based or sequence-to-sequence based. All of our submitted runs for the passage retrieval task, and most of our submitted runs for the document retrieval task, outperformed the TREC median submission. We observed that the BERT reranker performed slightly better than the T5 reranker when passages were expanded with sequence-to-sequence models. However, T5 achieved better results than BERT when passages were expanded with DeepCT, a BERT-based model. Moreover, the results showed that combining the title and the head segment as the document representation for reranking yielded a significant improvement over using either one alone.
DOI/handle
http://hdl.handle.net/10576/60888
Collections
- Computer Science & Engineering [2402 items]
Related items
Showing items related by title, author, creator and subject.
- Studying effectiveness of Web search for fact checking
  Hasanain, Maram; Elsayed, Tamer (John Wiley and Sons Inc, 2022, Article)
  Web search is commonly used by fact checking systems as a source of evidence for claim verification. In this work, we demonstrate that the task of retrieving pages useful for fact checking, called evidential pages, is ...
- Partial shoeprint retrieval using multiple point-of-interest detectors and SIFT descriptors
  Al-Maadeed, Somaya; Bouridane A.; Crookes D.; Nibouche O. (IOS Press, 2015, Article)
  Shoeprint evidence collected from crime scenes can play an important role in forensic investigations. Usually, the analysis of shoeprints is carried out manually and is based on human expertise and knowledge. As well as ...
- ENABLING EFFECTIVE ARABIC INFORMATION RETRIEVAL ON THE WEB AND SOCIAL MEDIA
  HASANAIN, MARAM GHANEM (06-2 , Dissertation)
  Arabic is one of the most dominant languages on the Web and social media. The huge and ever-growing Arabic user generated content, further motivated by the ongoing political unrest in the region, created an immense need ...