The similarity-aware relational intersect database operator
| المؤلف | Al Marri, Wadha J. |
| المؤلف | Malluhi, Qutaibah |
| المؤلف | Ouzzani, Mourad |
| المؤلف | Tang, Mingjie |
| المؤلف | Aref, Walid G. |
| محرر | Traina, Agma Juci Machado |
| محرر | Traina Jr., Caetano |
| محرر | Cordeiro, Robson Leonardo Ferreira |
| تاريخ الإتاحة | 2016-05-01T13:33:02Z |
| تاريخ النشر | 2014 |
| اسم المنشور | Similarity Search and Applications: 7th International Conference, SISAP 2014, Los Cabos, Mexico, October 29-31, 2014. Proceedings |
| المصدر | Scopus |
| الاقتباس | Al Marri, W.J., Malluhi, Q., Ouzzani, M., Tang, M., Aref, W.G. "The similarity-aware relational intersect database operator" (2014) Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), 8821, pp. 164-175. |
| الترقيم الدولي الموحد للكتاب | 978-3-319-11987-8 |
| الترقيم الدولي الموحد للكتاب | 978-3-319-11988-5 (Online) |
| الملخص | Identifying similarities in large datasets is an essential operation in many applications such as bioinformatics, pattern recognition, and data integration. To make the underlying database system similarity-aware, the core relational operators have to be extended. Several similarity-aware relational operators have been proposed that introduce similarity processing at the database engine level, e.g., similarity joins and similarity group-by. This paper extends the semantics of the set intersection operator to operate over similar values. The paper describes the semantics of the similarity-based set intersection operator, and develops an efficient query processing algorithm for evaluating it. The proposed operator is implemented inside an open-source database system, namely PostgreSQL. Several queries from the TPC-H benchmark are extended to include similarity-based set intersetion predicates. Performance results demonstrate up to three orders of magnitude speedup in performance over equivalent queries that only employ regular operators. |
| راعي المشروع | NPRP grant 4-1534-1-247 from the Qatar National Research Fund and by the National Science Foundation Grants IIS 0916614, IIS 1117766, and IIS 0964639. |
| اللغة | en |
| الناشر | Springer International Publishing |
| السلسلة | Lecture Notes in Computer Science |
| الموضوع | bioinformatics data integration pattern recognition query processing semantics database operators query processing algorithms regular operators relational operator set intersection similarity group byes three orders of magnitude Tpc-h benchmarks |
| النوع | Conference |
| الصفحات | 164-175 |
| رقم المجلد | 8821 |
الملفات في هذه التسجيلة
| الملفات | الحجم | الصيغة | العرض |
|---|---|---|---|
|
لا توجد ملفات لها صلة بهذه التسجيلة. |
|||
هذه التسجيلة تظهر في المجموعات التالية
-
علوم وهندسة الحاسب [2520 items ]
-
الابحاث المتعددة التخصصات والتصاميم االذكية [45 items ]

