Open Access Proceedings Article

Neural Networks are Convex Regularizers: Exact Polynomial-time Convex Optimization Formulations for Two-Layer Networks

Mert Pilanci, +1 more
Vol. 1, pp. 7695-7705
TLDR
It is shown that ReLU networks trained with standard weight decay are equivalent to block $\ell_1$ penalized convex models, and that certain standard convolutional linear networks are equivalent to semi-definite programs which can be simplified to $\ell_1$ regularized linear models in a polynomial-sized discrete Fourier feature space.
Abstract
We develop exact representations of training two-layer neural networks with rectified linear units (ReLUs) as a single convex program whose number of variables is polynomial in the number of training samples and the number of hidden neurons. Our theory utilizes semi-infinite duality and minimum-norm regularization. We show that ReLU networks trained with standard weight decay are equivalent to block $\ell_1$ penalized convex models. Moreover, we show that certain standard convolutional linear networks are equivalent to semi-definite programs which can be simplified to $\ell_1$ regularized linear models in a polynomial-sized discrete Fourier feature space.
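For concreteness, the convex program described above can be sketched in the following block $\ell_1$ (group-lasso) form. This is an illustrative sketch rather than a verbatim statement from the paper: $X \in \mathbb{R}^{n \times d}$ denotes the data matrix, $y$ the labels, $\beta$ the weight-decay strength, and $D_1, \dots, D_P$ the diagonal 0/1 matrices enumerating the ReLU activation patterns $\mathrm{diag}(\mathbb{1}[Xu \ge 0])$ realizable on the data:

$$
\min_{\{v_i, w_i\}_{i=1}^{P}} \; \frac{1}{2}\Big\|\sum_{i=1}^{P} D_i X (v_i - w_i) - y\Big\|_2^2 \;+\; \beta \sum_{i=1}^{P} \big(\|v_i\|_2 + \|w_i\|_2\big)
\quad \text{s.t.}\;\; (2D_i - I) X v_i \ge 0,\; (2D_i - I) X w_i \ge 0 .
$$

The number of distinct activation patterns $P$ is polynomial in the number of samples for data of fixed rank, which is what makes the formulation polynomial-sized, and the group $\ell_1$ penalty on the $(v_i, w_i)$ blocks plays the role of weight decay on the original two-layer network.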



Citations
Posted Content

Revealing the Structure of Deep Neural Networks via Convex Duality

TL;DR: It is shown that a set of optimal hidden-layer weights for a norm-regularized DNN training problem can be found explicitly as the extreme points of a convex set, and it is proved that each optimal weight matrix is rank-$K$ and aligns with the previous layers via duality.
Posted Content

Convex Geometry and Duality of Over-parameterized Neural Networks

TL;DR: A convex analytic framework for ReLU neural networks is developed that elucidates the inner workings of hidden neurons and their function-space characteristics, and establishes a connection to $\ell_0$-$\ell_1$ equivalence for neural networks analogous to minimal-cardinality solutions in compressed sensing.
Proceedings Article

Implicit Regularization Towards Rank Minimization in ReLU Networks

TL;DR: It is proved (and demonstrated empirically) that, unlike the linear case, gradient flow on ReLU networks may no longer tend to minimize rank, in a rather strong sense (even approximately, for "most" datasets of size 2).
Posted Content

Implicit Convex Regularizers of CNN Architectures: Convex Optimization of Two- and Three-Layer Networks in Polynomial Time

TL;DR: A convex analytic framework utilizing semi-infinite duality is developed to obtain equivalent convex optimization problems for several two- and three-layer CNN architectures, and it is proved that two-layer CNNs can be globally optimized via an $\ell_2$-norm regularized convex program.