Search
Now showing items 751-760 of 772
The many benefits of annotator rationales for relevance judgments
(
International Joint Conferences on Artificial Intelligence
, 2017 , Conference Paper)
When collecting subjective human ratings of items, it can be difficult to measure and enforce data quality due to task subjectivity and lack of insight into how judges arrive at each rating decision. To address this, we ...
Why Is That Relevant? Collecting Annotator Rationales for Relevance Judgments
(
AAAI Press
, 2016 , Conference Paper)
When collecting subjective human ratings of items, it can be difficult to measure and enforce data quality due to task subjectivity and lack of insight into how judges’ arrive at each rating decision. To address this, we ...
Efficient parallel skyline query processing for high-dimensional data
(
IEEE Computer Society
, 2019 , Conference Paper)
Given a set of multidimensional data points, skyline queries retrieve those points that are not dominated by any other points in the set. Due to the ubiquitous use of skyline queries, such as in preference-based query ...
Maintaining database anonymity in the presence of queries
(
Springer
, 2013 , Conference Paper)
With the advent of cloud computing there is an increased interest in outsourcing an organization's data to a remote provider in order to reduce the costs associated with self-hosting. If that database contains information ...
Efficient alignment of next generation sequencing data using MapReduce on the cloud
(
Institute of Electrical and Electronics Engineers Inc.
, 2012 , Conference Paper)
This paper presents a methodology for running NGS read mapping tools in the cloud environment based on the MapReduce programming paradigm. As a demonstration, the recently developed and robust sequence alignment tool, ...
Similarity Group-By operators for multi-dimensional relational data
(
Institute of Electrical and Electronics Engineers Inc.
, 2016 , Conference Paper)
The SQL group-by operator plays an important role in summarizing and aggregating large datasets in a data analytics stack. The Similarity SQL-based Group-By operator (SGB, for short) extends the semantics of the standard ...
Scalable multi-core implementation for motif finding problem
(
Institute of Electrical and Electronics Engineers Inc.
, 2014 , Conference Paper)
The motif finding problem is a key step for understanding the gene regulation and expression, drug design, disease resistance, etc. Many sequential algorithms have been proposed in the literature to find the exact motifs. ...
LocationSpark: A distributed in-memory data management system for big spatial data
(
VLDB Endowment
, 2015 , Conference Paper)
We present LocationSpark, a spatial data processing system built on top of Apache Spark, a widely used distributed data processing system. LocationSpark offers a rich set of spatial query operators, e.g., range search, ...
Association rule mining on fragmented database
(
Springer
, 2015 , Conference Paper)
Anonymization methods are an important tool to protect privacy. The goal is to release data while preventing individuals from being identified. Most approaches generalize data, reducing the level of detail so that many ...
Secure and private outsourcing of shape-based feature extraction
(
Springer
, 2013 , Conference Paper)
There has been much recent work on secure storage outsourcing, where an organization wants to store its data at untrusted remote cloud servers in an encrypted form, such that its own employees can query the encrypted data ...