AyaTEC: Building a Reusable Verse-Based Test Collection for Arabic Question Answering on the Holy Qur'an
Author | Malhas, Rana |
Author | Elsayed, Tamer |
Available date | 2024-11-05T06:05:20Z |
Publication Date | 2020 |
Publication Name | ACM Transactions on Asian and Low-Resource Language Information Processing |
Resource | Scopus |
Identifier | http://dx.doi.org/10.1145/3400396 |
ISSN | 2375-4699 |
Abstract | The absence of publicly available reusable test collections for Arabic question answering on the Holy Qur'an has impeded the possibility of fairly comparing the performance of systems in that domain. In this article, we introduce AyaTEC, a reusable test collection for verse-based question answering on the Holy Qur'an, which serves as a common experimental testbed for this task. AyaTEC includes 207 questions (with their corresponding 1,762 answers) covering 11 topic categories of the Holy Qur'an that target the information needs of both curious and skeptical users. To the best of our effort, the answers to the questions (each represented as a sequence of verses) in AyaTEC were exhaustive; that is, all Qur'anic verses that directly answered the questions were exhaustively extracted and annotated. To facilitate the use of AyaTEC in evaluating the systems designed for that task, we propose several evaluation measures to support the different types of questions and the nature of verse-based answers while integrating the concept of partial matching of answers in the evaluation. |
Sponsor | Both authors contributed equally to this research. This work was funded in part by Qatar University through grant QUST-1-CENG-2018-24. Authors' address: R. Malhas and T. Elsayed, Qatar University, P.O. Box 2713, Doha, Qatar; emails: {rana.malhas, telsayed}@qu.edu.qa. 2020 Association for Computing Machinery. 2375-4699/2020/10-ART78 https://doi.org/10.1145/3400396 |
Language | en |
Publisher | Association for Computing Machinery |
Subject | Classical Arabic; evaluation |
Type | Article |
Issue Number | 6 |
Volume Number | 19 |