scispace - formally typeset
Open AccessJournal ArticleDOI

Query-Based Data Pricing

Reads0
Chats0
TLDR
A framework for pricing data on the Internet that allows the price of any query to be derived automatically, and proves that pricing all other queries is NP-complete, thus establishing a dichotomy on the complexity of the pricing problem when all views are selection queries.
Abstract
Data is increasingly being bought and sold online, and Web-based marketplace services have emerged to facilitate these activities. However, current mechanisms for pricing data are very simple: buyers can choose only from a set of explicit views, each with a specific price. In this article, we propose a framework for pricing data on the Internet that, given the price of a few views, allows the price of any query to be derived automatically. We call this capability query-based pricing. We first identify two important properties that the pricing function must satisfy, the arbitrage-free and discount-free properties. Then, we prove that there exists a unique function that satisfies these properties and extends the seller's explicit prices to all queries. Central to our framework is the notion of query determinacy, and in particular instance-based determinacy: we present several results regarding the complexity and properties of it.When both the views and the query are unions of conjunctive queries or conjunctive queries, we show that the complexity of computing the price is high. To ensure tractability, we restrict the explicit prices to be defined only on selection views (which is the common practice today). We give algorithms with polynomial time data complexity for computing the price of two classes of queries: chain queries (by reducing the problem to network flow), and cyclic queries. Furthermore, we completely characterize the class of conjunctive queries without self-joins that have PTIME data complexity, and prove that pricing all other queries is NP-complete, thus establishing a dichotomy on the complexity of the pricing problem when all views are selection queries.

read more

Citations
More filters
Journal ArticleDOI

Secrets and Lies: Digital Security in a Networked World

Dominic Pinto
- 01 Feb 2003 - 
Journal ArticleDOI

Marketplaces for data: an initial survey

TL;DR: This work identifies several categories and dimensions of data marketplaces and data vendors and provides a snapshot of the situation as of Summer 2012.
Journal ArticleDOI

Efficient task-specific data valuation for nearest neighbor algorithms

TL;DR: This paper defines the "relative value of data" via the Shapley value, as it uniquely possesses properties with appealing real-world interpretations, such as fairness, rationality and decentralizability, and develops an algorithm based on Locality Sensitive Hashing (LSH) with only sublinear complexity.
Proceedings ArticleDOI

A theory of pricing private data

TL;DR: A theoretical framework for assigning prices to noisy query answers, as a function of their accuracy, and for dividing the price amongst data owners who deserve compensation for their loss of privacy is proposed.
Proceedings ArticleDOI

Toward practical query pricing with QueryMarket

TL;DR: This work develops a new pricing system, QueryMarket, for flexible query pricing in a data market based on an earlier theoretical framework and shows how to use an Integer Linear Programming formulation of the pricing problem for a large class of queries, even when pricing is computationally hard.
References
More filters
Book

Introduction to Algorithms

TL;DR: The updated new edition of the classic Introduction to Algorithms is intended primarily for use in undergraduate or graduate courses in algorithms or data structures and presents a rich variety of algorithms and covers them in considerable depth while making their design and analysis accessible to all levels of readers.
Book

Foundations of databases

TL;DR: This book discusses Languages, Computability, and Complexity, and the Relational Model, which aims to clarify the role of Semantic Data Models in the development of Query Language Design.
Book

Introduction to Algorithms, Second Edition

TL;DR: The complexity class P is formally defined as the set of concrete decision problems that are polynomial-time solvable, and encodings are used to map abstract problems to concrete problems.
Book

Secrets and Lies: Digital Security in a Networked World

TL;DR: This book argues that modern systems have so many components and connections-some of them not even known by the systems' designers, implementers, or users-that insecurities always remain, and that the world was full of bad security systems designed by people who read Applied Cryptography.
Trending Questions (1)