
Author: Al Marri, Wadha J.
Author: Malluhi, Qutaibah
Author: Ouzzani, Mourad
Author: Tang, Mingjie
Author: Aref, Walid G.
Available date: 2024-07-17T07:14:41Z
Publication Date: 2016
Publication Name: Information Systems
Resource: Scopus
Identifier: http://dx.doi.org/10.1016/j.is.2015.10.008
ISSN: 0306-4379
URI: http://hdl.handle.net/10576/56740
Abstract: Identifying similarities in large datasets is an essential operation in several applications such as bioinformatics, pattern recognition, and data integration. To make a relational database management system similarity-aware, the core relational operators have to be extended. While similarity-awareness has been introduced in database engines for relational operators such as joins and group-by, little has been achieved for relational set operators, namely Intersection, Difference, and Union. In this paper, we propose to extend the semantics of relational set operators to take into account the similarity of values. We develop efficient query processing algorithms for evaluating them, and implement these operators inside an open-source database system, namely PostgreSQL. By extending several queries from the TPC-H benchmark to include predicates that involve similarity-based set operators, we perform extensive experiments that demonstrate up to three orders of magnitude speedup in performance over equivalent queries that only employ regular operators.
Sponsor: This publication was made possible by the support of an NPRP grant 4-1534-1-247 from the Qatar National Research Fund (a member of Qatar Foundation), and the National Science Foundation under Grants III-1117766 and III-0964639. The statements made herein are solely the responsibility of the authors.
Language: en
Publisher: Elsevier
Subject: Relational databases; Set operators; Similarity query processing
Title: The similarity-aware relational database set operators
Type: Article
Pagination: 79-93
Volume Number: 59
dc.accessType: Abstract Only


Files in this item

There are no files associated with this item.
