Open Access · Posted Content

Hidden Convexity of Wasserstein GANs: Interpretable Generative Models with Closed-Form Solutions

TLDR
In this paper, the authors analyze the training of Wasserstein GANs with two-layer neural network discriminators through the lens of convex duality and, for a variety of generators, expose the conditions under which Wasserstein GANs can be solved exactly with convex optimization approaches or represented as convex-concave games.
Abstract
Generative Adversarial Networks (GANs) are commonly used for modeling complex distributions of data. Both the generators and discriminators of GANs are often modeled by neural networks, posing a non-transparent optimization problem which is non-convex and non-concave over the generator and discriminator, respectively. Such networks are often heuristically optimized with gradient descent-ascent (GDA), but it is unclear whether the optimization problem contains any saddle points, or whether heuristic methods can find them in practice. In this work, we analyze the training of Wasserstein GANs with two-layer neural network discriminators through the lens of convex duality, and for a variety of generators expose the conditions under which Wasserstein GANs can be solved exactly with convex optimization approaches, or can be represented as convex-concave games. Using this convex duality interpretation, we further demonstrate the impact of different activation functions of the discriminator. Our observations are verified with numerical results demonstrating the power of the convex interpretation, with applications in progressive training of convex architectures corresponding to linear generators and quadratic-activation discriminators for CelebA image generation. The code for our experiments is available at this https URL.
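For intuition about the closed-form flavor of result the abstract mentions, here is a minimal NumPy sketch under illustrative assumptions: a linear generator G(z) = Wz + mu with z ~ N(0, I) and a quadratic-activation discriminator, in which case matching second moments reduces to a PCA-style eigendecomposition of the data covariance. This is a toy reading of the setting, not the authors' exact algorithm or their CelebA pipeline.

```python
import numpy as np

# Toy setup (illustrative sizes, not the paper's CelebA pipeline)
rng = np.random.default_rng(0)
n, d, k = 1000, 8, 3                     # samples, data dim, latent dim
X = rng.standard_normal((n, d)) @ rng.standard_normal((d, d))

mu = X.mean(axis=0)
Sigma = (X - mu).T @ (X - mu) / n        # empirical covariance

# With G(z) = W z + mu and z ~ N(0, I_k), the generated covariance is W W^T.
# The best rank-k second-moment match comes from the top-k eigenpairs:
vals, vecs = np.linalg.eigh(Sigma)
top = np.argsort(vals)[::-1][:k]
W = vecs[:, top] * np.sqrt(np.maximum(vals[top], 0.0))

# Sample from the fitted generator; its covariance matches Sigma on the
# top-k eigen-subspace (the residual is the discarded spectrum).
Z = rng.standard_normal((5000, k))
G = Z @ W.T + mu
print(np.linalg.norm(np.cov(G.T, bias=True) - W @ W.T))   # sampling error only
```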


Citations
Posted Content

Convex Geometry and Duality of Over-parameterized Neural Networks

TL;DR: A convex analytic framework for ReLU neural networks is developed that elucidates the inner workings of hidden neurons and their function-space characteristics, and establishes a connection to $\ell_0$-$\ell_1$ equivalence for neural networks analogous to minimal-cardinality solutions in compressed sensing.
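As a rough illustration of what such a convex reformulation looks like, the sketch below builds a finite-dimensional convex program for a two-layer ReLU network with squared loss and group-norm regularization, using randomly sampled hyperplane-arrangement patterns. The sampling of patterns, problem sizes, and variable names are assumptions for illustration; the exact program enumerates all arrangements.

```python
import numpy as np
import cvxpy as cp

rng = np.random.default_rng(0)
n, d, P = 20, 3, 30                      # samples, features, sampled arrangements
X = rng.standard_normal((n, d))
y = rng.standard_normal(n)
beta = 1e-3                              # regularization strength

# Sample diagonal arrangement matrices D_i = diag(1[X u >= 0]).
U = rng.standard_normal((d, P))
D = (X @ U >= 0).astype(float)           # n x P; column i is the diagonal of D_i

V = cp.Variable((d, P))                  # positive-branch neurons
W = cp.Variable((d, P))                  # negative-branch neurons
residual = cp.sum(cp.multiply(D, X @ (V - W)), axis=1) - y
reg = cp.sum(cp.norm(V, axis=0) + cp.norm(W, axis=0))
constraints = []
for i in range(P):
    s = 2 * D[:, i] - 1                  # sign pattern (2 D_i - I)
    constraints += [cp.multiply(s, X @ V[:, i]) >= 0,
                    cp.multiply(s, X @ W[:, i]) >= 0]
prob = cp.Problem(cp.Minimize(0.5 * cp.sum_squares(residual) + beta * reg),
                  constraints)
prob.solve()                             # a standard SOCP, solved globally
```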
Posted Content

Path Regularization: A Convexity and Sparsity Inducing Regularization for Parallel ReLU Networks

TL;DR: In this article, a path-regularized parallel architecture with multiple ReLU sub-networks is considered, and it is shown that the computational complexity of globally optimizing the equivalent convex problem is polynomial in the number of data samples and the feature dimension.
Posted Content

The Convex Geometry of Backpropagation: Neural Network Gradient Flows Converge to Extreme Points of the Dual Convex Program

TL;DR: In this article, the authors study non-convex subgradient flows for training two-layer ReLU neural networks from a convex geometry and duality perspective, and derive a sufficient condition on the dual variables ensuring that the stationary points of the non-convex objective are the KKT points of the convex objective.
References
Posted Content

Adam: A Method for Stochastic Optimization

TL;DR: In this article, Adam is introduced: a method for first-order gradient-based optimization of stochastic objective functions, based on adaptive estimates of lower-order moments.
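The update rule itself is compact; the following NumPy transcription mirrors the algorithm described in the paper (exponential moving averages of the first two moments with bias correction):

```python
import numpy as np

def adam_step(theta, grad, m, v, t, lr=1e-3, beta1=0.9, beta2=0.999, eps=1e-8):
    """One Adam update: biased moment estimates followed by bias correction."""
    m = beta1 * m + (1 - beta1) * grad           # first-moment estimate
    v = beta2 * v + (1 - beta2) * grad ** 2      # second-moment estimate
    m_hat = m / (1 - beta1 ** t)                 # bias-corrected first moment
    v_hat = v / (1 - beta2 ** t)                 # bias-corrected second moment
    theta = theta - lr * m_hat / (np.sqrt(v_hat) + eps)
    return theta, m, v
```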
Posted Content

PyTorch: An Imperative Style, High-Performance Deep Learning Library

TL;DR: PyTorch as discussed by the authors is a machine learning library that provides an imperative and Pythonic programming style that makes debugging easy and is consistent with other popular scientific computing libraries, while remaining efficient and supporting hardware accelerators such as GPUs.
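A minimal example of the imperative, define-by-run style the summary refers to (standard PyTorch APIs; the toy model and data are illustrative):

```python
import torch

# One regression step: ordinary Python control flow, immediate tensor values,
# and autograd building the graph as the code runs.
model = torch.nn.Linear(4, 1)
opt = torch.optim.SGD(model.parameters(), lr=0.1)

x, y = torch.randn(32, 4), torch.randn(32, 1)
loss = torch.nn.functional.mse_loss(model(x), y)
loss.backward()                       # gradients computed on the fly
opt.step()
opt.zero_grad()
print(loss.item())                    # tensors are inspectable at any point
```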
Proceedings ArticleDOI

Image-to-Image Translation with Conditional Adversarial Networks

TL;DR: Conditional adversarial networks are investigated as a general-purpose solution to image-to-image translation problems and it is demonstrated that this approach is effective at synthesizing photos from label maps, reconstructing objects from edge maps, and colorizing images, among other tasks.
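A sketch of the conditional-GAN generator objective in that formulation: an adversarial term on the discriminator's judgment of (input, output) pairs plus an L1 reconstruction term. `D` and `G` are placeholder modules and the PyTorch phrasing is illustrative; the lambda = 100 weighting follows the paper's default.

```python
import torch
import torch.nn.functional as F

def generator_loss(D, G, x, y, lam=100.0):
    """Generator objective: fool the conditional discriminator D(x, G(x))
    while staying close to the target y in L1."""
    fake = G(x)
    pred = D(torch.cat([x, fake], dim=1))   # discriminator sees input + output
    adv = F.binary_cross_entropy_with_logits(pred, torch.ones_like(pred))
    return adv + lam * F.l1_loss(fake, y)
```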
Posted Content

Unsupervised Representation Learning with Deep Convolutional Generative Adversarial Networks

TL;DR: This work introduces a class of CNNs called deep convolutional generative adversarial networks (DCGANs), that have certain architectural constraints, and demonstrates that they are a strong candidate for unsupervised learning.
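A minimal PyTorch generator respecting the architectural constraints the summary alludes to (fractionally-strided convolutions instead of pooling, batch normalization, ReLU hidden activations, Tanh output, no fully connected hidden layers); the layer sizes are illustrative, not the paper's exact configuration:

```python
import torch.nn as nn

# Maps a 100-dim latent code (shaped 100x1x1) to a 3x32x32 image.
G = nn.Sequential(
    nn.ConvTranspose2d(100, 256, 4, 1, 0, bias=False), nn.BatchNorm2d(256), nn.ReLU(True),
    nn.ConvTranspose2d(256, 128, 4, 2, 1, bias=False), nn.BatchNorm2d(128), nn.ReLU(True),
    nn.ConvTranspose2d(128, 64, 4, 2, 1, bias=False),  nn.BatchNorm2d(64),  nn.ReLU(True),
    nn.ConvTranspose2d(64, 3, 4, 2, 1, bias=False),    nn.Tanh(),
)
```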
Proceedings ArticleDOI

A Style-Based Generator Architecture for Generative Adversarial Networks

TL;DR: This paper proposes an alternative generator architecture for GANs, borrowing from the style transfer literature, which leads to an automatically learned, unsupervised separation of high-level attributes (e.g., pose and identity when trained on human faces) and stochastic variation in the generated images.
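The style injection in that architecture works through adaptive instance normalization (AdaIN); a minimal sketch, with the style-derived scale and bias assumed to come from the mapping network (parameter names are illustrative):

```python
import torch

def adain(x, style_scale, style_bias, eps=1e-5):
    """Normalize each feature map per-sample, then scale and shift with
    style-derived parameters (assumed shape: broadcastable to x)."""
    mu = x.mean(dim=(2, 3), keepdim=True)
    sigma = x.std(dim=(2, 3), keepdim=True)
    return style_scale * (x - mu) / (sigma + eps) + style_bias
```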