Open Access Proceedings Article

The geometry of differential privacy: the sparse and approximate cases

TLDR
The connection between hereditary discrepancy and the privacy mechanism enables the first polylogarithmic approximation to the hereditary discrepancy of a matrix A.
Abstract
We study trade-offs between accuracy and privacy in the context of linear queries over histograms. This is a rich class of queries that includes contingency tables and range queries and has been the focus of a long line of work. For a given set of d linear queries over a database x ∈ ℝ^N, we seek the differentially private mechanism that has the minimum mean squared error. For pure differential privacy, [5, 32] give an O(log² d) approximation to the optimal mechanism. Our first contribution is an efficient O(log² d) approximation guarantee for the case of (ε,δ)-differential privacy. Our mechanism adds carefully chosen correlated Gaussian noise to the answers. We prove its approximation guarantee relative to the hereditary discrepancy lower bound of [44], using tools from convex geometry. We next consider the sparse case, when the number of queries exceeds the number of individuals in the database, i.e., when d > n ≜ ‖x‖₁. The lower bounds used in the previous approximation algorithm no longer apply; in fact, better mechanisms are known in this setting [7, 27, 28, 31, 49]. Our second main contribution is an efficient (ε,δ)-differentially private mechanism that, for any given query set A and an upper bound n on ‖x‖₁, has mean squared error within polylog(d, N) of the optimal for A and n. This approximation is achieved by coupling the Gaussian noise addition approach with linear regression over the ℓ₁ ball. Additionally, we show a similar polylogarithmic approximation guarantee for the optimal ε-differentially private mechanism in this sparse setting. Our work also shows that for arbitrary counting queries, i.e., A with entries in {0,1}, there is an ε-differentially private mechanism with expected error Õ(√n) per query, improving on the Õ(n^{2/3}) bound of [7] and matching the lower bound implied by [15] up to logarithmic factors. The connection between hereditary discrepancy and the privacy mechanism enables us to derive the first polylogarithmic approximation to the hereditary discrepancy of a matrix A.
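For intuition, the two ingredients combine roughly as in the following NumPy sketch. This is an illustration under stated assumptions, not the paper's construction: the covariance Sigma stands in for the carefully chosen correlation structure (which the paper derives from the geometry of A), and the ℓ₁-constrained regression is handed to a generic convex solver.

```python
import numpy as np
from scipy.optimize import minimize

def correlated_gaussian_answers(A, x, Sigma):
    """Answer the linear queries Ax with correlated Gaussian noise N(0, Sigma).

    In the paper the covariance is derived from the geometry of A; here
    Sigma is simply a caller-supplied positive semidefinite matrix.
    """
    d = A.shape[0]
    L = np.linalg.cholesky(Sigma + 1e-12 * np.eye(d))  # jitter for stability
    return A @ x + L @ np.random.randn(d)

def l1_regression_postprocess(A, noisy_answers, n):
    """Fit a surrogate histogram over the scaled l1 ball and re-answer.

    Solves argmin ||A x_hat - noisy_answers||_2^2 subject to x_hat >= 0 and
    sum(x_hat) <= n (histograms are nonnegative, so the l1 norm is the sum).
    """
    N = A.shape[1]
    objective = lambda x_hat: np.sum((A @ x_hat - noisy_answers) ** 2)
    constraint = {"type": "ineq", "fun": lambda x_hat: n - np.sum(x_hat)}
    result = minimize(objective, np.zeros(N), bounds=[(0.0, None)] * N,
                      constraints=[constraint])
    return A @ result.x
```

Because the regression step only touches the already-noised answers, it is post-processing and consumes no additional privacy budget.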


Citations
Book

The Algorithmic Foundations of Differential Privacy

TL;DR: The bulk of this monograph is devoted to fundamental techniques for achieving differential privacy and to the application of these techniques in creative combinations, using the query-release problem as an ongoing example.
Proceedings Article

Private Empirical Risk Minimization: Efficient Algorithms and Tight Error Bounds

TL;DR: In this article, the authors provide new algorithms and matching lower bounds for differentially private convex empirical risk minimization assuming only that each data point's contribution to the loss function is Lipschitz and that the domain of optimization is bounded.
Proceedings Article

Protecting Locations with Differential Privacy under Temporal Correlations

TL;DR: A new definition, "δ-location set" based differential privacy, is proposed to account for temporal correlations in location data, along with a planar isotropic mechanism (PIM) for location perturbation, the first mechanism to achieve the lower bound of differential privacy.
Posted Content

Differentially Private Empirical Risk Minimization: Efficient Algorithms and Tight Error Bounds

TL;DR: This work provides new algorithms and matching lower bounds for differentially private convex empirical risk minimization assuming only that each data point's contribution to the loss function is Lipschitz and that the domain of optimization is bounded.
Journal Article

Privacy Aware Learning

TL;DR: This work establishes sharp upper and lower bounds on the convergence rates of statistical estimation procedures in a local privacy framework and exhibits a precise tradeoff between the amount of privacy the data preserves and the utility of any statistical estimator or learning procedure.
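As a concrete instance of the local privacy framework this summary refers to, randomized response is the canonical locally ε-differentially private mechanism for a single bit. The sketch below is an illustration, not that paper's estimator; it makes the privacy/utility tradeoff tangible, since the estimator's variance grows as ε shrinks.

```python
import numpy as np

def randomized_response(bit, epsilon):
    """Locally epsilon-DP release of one private bit (0 or 1).

    Report the true bit with probability e^eps / (1 + e^eps),
    otherwise report the flipped bit.
    """
    p_truth = np.exp(epsilon) / (1.0 + np.exp(epsilon))
    return bit if np.random.rand() < p_truth else 1 - bit

def estimate_mean(reports, epsilon):
    """De-biased estimate of the population mean from noisy reports.

    E[report] = (1 - p) + (2p - 1) * true_bit, so invert that affine map.
    """
    p = np.exp(epsilon) / (1.0 + np.exp(epsilon))
    return (np.mean(reports) - (1.0 - p)) / (2.0 * p - 1.0)

# Usage: each individual randomizes locally; the analyst sees only reports.
true_bits = np.random.binomial(1, 0.3, size=10_000)
reports = [randomized_response(b, epsilon=1.0) for b in true_bits]
print(estimate_mean(reports, epsilon=1.0))  # approx 0.3
```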
References
Book Chapter

Calibrating noise to sensitivity in private data analysis

TL;DR: In this article, the authors show that for several particular applications substantially less noise is needed than was previously understood to be the case, and give separation results showing the increased value of interactive sanitization mechanisms over non-interactive ones.
Journal Article

Calibrating noise to sensitivity in private data analysis

TL;DR: The study is extended to general functions f, proving that privacy can be preserved by calibrating the standard deviation of the noise according to the sensitivity of f, i.e., the amount by which a single argument to f can change its output.
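This calibration is the Laplace mechanism: with Δf denoting the ℓ₁-sensitivity of f, adding independent Laplace(Δf/ε) noise to each coordinate of f(x) yields ε-differential privacy. A minimal sketch, with the sensitivity assumed to be supplied by the caller:

```python
import numpy as np

def laplace_mechanism(f_value, sensitivity, epsilon):
    """Release f(x) with noise calibrated to f's l1-sensitivity.

    Adding independent Laplace(scale = sensitivity / epsilon) noise to
    each coordinate of f(x) gives epsilon-differential privacy.
    """
    f_value = np.atleast_1d(np.asarray(f_value, dtype=float))
    scale = sensitivity / epsilon
    return f_value + np.random.laplace(0.0, scale, size=f_value.shape)

# A counting query changes by at most 1 when one person is added or
# removed, so its sensitivity is 1.
noisy_count = laplace_mechanism(f_value=1234, sensitivity=1.0, epsilon=0.5)
```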
Proceedings Article

The complexity of satisfiability problems

TL;DR: An infinite class of satisfiability problems is considered, containing two classical satisfiability problems as special cases, and it is shown that every member of this class is either polynomial-time decidable or NP-complete.
Book Chapter

Our data, ourselves: privacy via distributed noise generation

TL;DR: In this paper, a distributed protocol for generating shares of random noise, secure against malicious participants, is proposed; the purpose of the noise generation is to create a distributed implementation of the privacy-preserving statistical databases described in recent papers.