Open Access Proceedings Article

List-decodable Linear Regression

TL;DR
This paper gives the first polynomial-time algorithm for robust regression in the list-decodable setting, where an adversary can corrupt more than a 1/2 fraction of the examples.
Abstract
We give the first polynomial-time algorithm for robust regression in the list-decodable setting, where an adversary can corrupt more than a 1/2 fraction of examples. For any $\alpha < 1$, our algorithm takes as input a sample $\{(x_i, y_i)\}_{i \leq n}$ of $n$ linear equations, of which $\alpha n$ satisfy $y_i = \langle x_i, \ell^* \rangle + \zeta$ for some small noise $\zeta$ and $(1-\alpha)n$ are \emph{arbitrarily} chosen. It outputs a list $L$ of size $O(1/\alpha)$, a fixed constant, that contains an $\ell$ close to $\ell^*$. Our algorithm succeeds whenever the inliers are drawn from a certifiably anti-concentrated distribution $D$. In particular, this gives a $(d/\alpha)^{O(1/\alpha^8)}$-time algorithm to find an $O(1/\alpha)$-size list when the inlier distribution is the standard Gaussian. For discrete product distributions that are anti-concentrated only in regular directions, we give an algorithm that achieves a similar guarantee under the promise that $\ell^*$ has all coordinates of the same magnitude. To complement our result, we prove that the anti-concentration assumption on the inliers is information-theoretically necessary. To solve the problem, we introduce a new framework for list-decodable learning that strengthens the "identifiability to algorithms" paradigm based on the sum-of-squares method.
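
The input model and the success criterion are easy to state concretely. Below is a minimal Python sketch of just those two pieces, under the Gaussian-inlier setting from the abstract; the paper's actual sum-of-squares algorithm is far more involved and is not reproduced here. The helper names (make_instance, list_error) are hypothetical, for illustration only.

```python
# Sketch of the list-decodable regression *input model*: alpha*n inlier
# equations y_i = <x_i, l_star> + noise, and a (1 - alpha)*n fraction of
# arbitrarily corrupted equations. Not the paper's algorithm.
import numpy as np

def make_instance(n=1000, d=20, alpha=0.3, noise=0.01, rng=None):
    rng = np.random.default_rng(rng)
    l_star = rng.standard_normal(d)
    l_star /= np.linalg.norm(l_star)          # ground-truth regressor
    X = rng.standard_normal((n, d))           # Gaussian inliers (certifiably anti-concentrated)
    y = X @ l_star + noise * rng.standard_normal(n)
    m = int((1 - alpha) * n)                  # corrupt a (1 - alpha) > 1/2 fraction
    idx = rng.choice(n, size=m, replace=False)
    y[idx] = 10.0 * rng.standard_normal(m)    # stand-in for *arbitrary* adversarial labels
    return X, y, l_star

def list_error(candidates, l_star):
    # Success criterion: the O(1/alpha)-size list must contain SOME l close to l_star.
    return min(np.linalg.norm(l - l_star) for l in candidates)

X, y, l_star = make_instance(rng=0)
# Ordinary least squares is badly biased by the 70% outliers, so no single
# estimate can work; list-decoding asks only that one of O(1/alpha)
# candidates be accurate. We cheat with the ground truth here purely to
# show how the guarantee is *measured*, not how the list is found.
ols = np.linalg.lstsq(X, y, rcond=None)[0]
print(list_error([ols], l_star), list_error([ols, l_star], l_star))
```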


Citations
Posted Content

Recent Advances in Algorithmic High-Dimensional Robust Statistics

TL;DR: The core ideas and algorithmic techniques in the emerging area of algorithmic high-dimensional robust statistics are introduced, with a focus on robust mean estimation, and an overview is provided of the approaches that have led to computationally efficient robust estimators for a range of broader statistical tasks.
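
As a concrete taste of the survey's central problem, here is a toy demonstration (my own illustration, not taken from that paper) of why robust mean estimation is nontrivial: a corrupted fraction of samples can drag the empirical mean arbitrarily far, while a simple robust estimator such as the coordinate-wise median stays close to the truth.

```python
# Toy robust-mean-estimation demo: eps fraction of samples shifted far away.
import numpy as np

rng = np.random.default_rng(2)
d, n, eps = 50, 10_000, 0.1
X = rng.standard_normal((n, d))               # true mean is 0
X[: int(eps * n)] += 100.0                    # adversarial shift of an eps fraction
print(np.linalg.norm(X.mean(axis=0)))         # ~ eps * 100 * sqrt(d): badly off
print(np.linalg.norm(np.median(X, axis=0)))   # coordinate-wise median stays near 0
```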
Proceedings Article

Reducibility and Statistical-Computational Gaps from Secret Leakage

TL;DR: This work gives the first evidence that an expanded set of hardness assumptions, such as secret-leakage planted clique, may be a key first step towards a more complete theory of reductions among statistical problems.
Proceedings Article

List decodable learning via sum of squares

TL;DR: A framework for list-decodable learning via the Sum-of-Squares SDP hierarchy is developed, along with an algorithm that outputs a list $\mathcal{L}$ of linear functions such that some $\hat{\ell} \in \mathcal{L}$ is close to the true $\ell$.
Posted Content

Robust Linear Regression: Optimal Rates in Polynomial Time

TL;DR: The central technical contribution is to algorithmically exploit the independence of random variables in the "sum-of-squares" framework by formulating it as a polynomial inequality.
Posted Content

Robustly Learning Mixtures of k Arbitrary Gaussians

TL;DR: The main tools are an efficient "partial clustering" algorithm that relies on the sum-of-squares method, and a novel tensor decomposition algorithm that allows errors in both Frobenius norm and low-rank terms.
References
Journal ArticleDOI

The ellipsoid method and its consequences in combinatorial optimization

TL;DR: The method yields polynomial algorithms for vertex packing in perfect graphs, for the matching and matroid intersection problems, for optimum covering of directed cuts of a digraph, and for the minimum value of a submodular set function.
Book ChapterDOI

Hierarchical mixtures of experts and the EM algorithm

TL;DR: An expectation-maximization (EM) algorithm is presented for adjusting the parameters of the tree-structured architecture for supervised learning, and an online learning algorithm is developed in which the parameters are updated incrementally.
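
For readers unfamiliar with EM, here is a compact sketch of the alternating E-step/M-step idea on the simplest possible case, a 1-D two-component Gaussian mixture with unit variances. This is my own minimal illustration, not the chapter's hierarchical mixture-of-experts model or its online variant.

```python
# Batch EM for a two-component 1-D Gaussian mixture (known unit variances).
import numpy as np

rng = np.random.default_rng(1)
data = np.concatenate([rng.normal(-2, 1, 500), rng.normal(3, 1, 500)])

mu = np.array([-1.0, 1.0])       # initial component means
pi = np.array([0.5, 0.5])        # initial mixing weights
for _ in range(50):
    # E-step: posterior responsibility of each component for each point
    # (the shared 1/sqrt(2*pi) density constant cancels in the normalization)
    dens = pi * np.exp(-0.5 * (data[:, None] - mu) ** 2)
    resp = dens / dens.sum(axis=1, keepdims=True)
    # M-step: re-estimate parameters from the soft assignments
    pi = resp.mean(axis=0)
    mu = (resp * data[:, None]).sum(axis=0) / resp.sum(axis=0)

print(mu.round(2), pi.round(2))  # means near (-2, 3), weights near (0.5, 0.5)
```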
Book ChapterDOI

Sums of Squares, Moment Matrices and Optimization Over Polynomials

TL;DR: This work considers the problem of minimizing a polynomial over a semialgebraic set defined by polynomial equations and inequalities, which is NP-hard in general, and reviews the mathematical tools, sums of squares and moment matrices, underlying semidefinite relaxations of this problem.
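
The sum-of-squares/Gram-matrix correspondence the chapter surveys can be seen on a one-polynomial example. The sketch below (my illustration, not the chapter's general machinery) checks that $p(x) = x^4 + 2x^2 + 1$ is a sum of squares by exhibiting a positive semidefinite Gram matrix $Q$ with $p(x) = v(x)^\top Q\, v(x)$ for the monomial vector $v(x) = (1, x, x^2)$.

```python
# Verifying an SOS certificate: p(x) = v(x)^T Q v(x) with Q PSD.
import numpy as np

Q = np.array([[1.0, 0.0, 1.0],
              [0.0, 0.0, 0.0],
              [1.0, 0.0, 1.0]])   # one valid Gram matrix: Q = u u^T with u = (1, 0, 1)

assert np.all(np.linalg.eigvalsh(Q) >= -1e-9)   # Q is positive semidefinite

for x in np.linspace(-3, 3, 13):                # identity check on sample points
    v = np.array([1.0, x, x * x])
    assert abs(v @ Q @ v - (x**4 + 2 * x**2 + 1)) < 1e-6

# Rank-1 PSD Gram matrix <=> a single square: p(x) = (1 + x^2)^2.
print("SOS certificate verified")
```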
Journal ArticleDOI

On a lemma of Littlewood and Offord

TL;DR: In this paper, it is shown that if all the $x_i$ have absolute value at least 1, then any open interval of length 2 contains at most $\binom{n}{\lfloor n/2 \rfloor}$ of the $2^n$ signed sums $\pm x_1 \pm \cdots \pm x_n$, a bound that is tight for even $n$.
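
This anti-concentration bound, which underlies the inlier assumption in the main paper, is easy to check numerically on the tight case. The snippet below (my illustration) takes all $x_i = 1$ and the interval $(-1, 1)$, where the count matches $\binom{n}{n/2}$ exactly.

```python
# Numeric check of the Erdos Littlewood-Offord bound on its tight instance.
from itertools import product
from math import comb

n, xs = 10, [1.0] * 10
hits = sum(1 for signs in product((-1, 1), repeat=n)
           if -1 < sum(s * x for s, x in zip(signs, xs)) < 1)
print(hits, comb(n, n // 2))   # both print 252
```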
Book ChapterDOI

Squared Functional Systems and Optimization Problems

Yu. Nesterov
TL;DR: It is proved that such cones can always be seen as a linear image of the cone of positive semidefinite matrices, and a description is given of the cones of non-negative univariate polynomials and of non-negative trigonometric polynomials.