Open Access Posted Content

Do Neural Optimal Transport Solvers Work? A Continuous Wasserstein-2 Benchmark

TLDR
In this paper, input-convex neural networks are used to construct pairs of continuous measures with analytically known optimal transport maps, yielding a benchmark for evaluating neural network-based optimal transport (OT) solvers on quadratic-cost (Wasserstein-2) transport.
Abstract
Despite the recent popularity of neural network-based solvers for optimal transport (OT), there is no standard quantitative way to evaluate their performance. In this paper, we address this issue for quadratic-cost transport -- specifically, computation of the Wasserstein-2 distance, a commonly-used formulation of optimal transport in machine learning. To overcome the challenge of computing ground truth transport maps between continuous measures needed to assess these solvers, we use input-convex neural networks (ICNN) to construct pairs of measures whose ground truth OT maps can be obtained analytically. This strategy yields pairs of continuous benchmark measures in high-dimensional spaces such as spaces of images. We thoroughly evaluate existing optimal transport solvers using these benchmark measures. Even though these solvers perform well in downstream tasks, many do not faithfully recover optimal transport maps. To investigate the cause of this discrepancy, we further test the solvers in a setting of image generation. Our study reveals crucial limitations of existing solvers and shows that increased OT accuracy does not necessarily correlate to better results downstream.
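The benchmark construction rests on the fact that, for quadratic cost, an optimal transport map is the gradient of a convex potential (Brenier's theorem). A minimal numpy sketch of this idea, using a hand-picked quadratic potential in place of a trained ICNN (all names below are illustrative, not from the paper's code):

```python
import numpy as np

def make_quadratic_potential(A, b):
    """psi(x) = 0.5 x^T A x + b^T x with A symmetric PSD, so psi is convex."""
    def grad_psi(x):
        return A @ x + b  # the ground-truth OT map T(x) = grad psi(x)
    return grad_psi

rng = np.random.default_rng(0)
A = np.array([[2.0, 0.5], [0.5, 1.0]])  # symmetric positive definite
b = np.array([1.0, -1.0])
T = make_quadratic_potential(A, b)

# Source measure mu = standard Gaussian; target nu = T#mu (the pushforward).
# A solver evaluated on the pair (mu, nu) can be compared against T exactly.
x = rng.standard_normal((10000, 2))
y = x @ A.T + b  # vectorized application of T to samples

# Sanity check: for a Gaussian source, T#mu is Gaussian with mean b and
# covariance A A^T, which the samples should match empirically.
print(np.allclose(y.mean(axis=0), b, atol=0.15))   # True
print(np.allclose(np.cov(y.T), A @ A.T, atol=0.2)) # True
```

The paper's actual construction replaces the quadratic potential with a trained ICNN, which keeps the convexity (and hence the known ground-truth map) while producing much richer measure pairs, including measures over images.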


Citations
Posted Content

On Transportation of Mini-batches: A Hierarchical Approach.

TL;DR: In this article, a batch of mini-batches optimal transport (BoMb-OT) is proposed to find the optimal coupling between mini-batches; it can be seen as an approximation of a well-defined distance on the space of probability measures.
Posted Content

Generative Modeling with Optimal Transport Maps.

TL;DR: In this paper, a min-max optimization algorithm was proposed to efficiently compute optimal transport maps for the quadratic cost (Wasserstein-2 distance) and to use them for generative modeling.
Posted Content

Physics Informed Convex Artificial Neural Networks (PICANNs) for Optimal Transport based Density Estimation.

TL;DR: In this paper, a deep learning approach is proposed to solve the continuous OMT problem, which reduces to a non-linear PDE of Monge-Ampère type whose solution is a convex function.
References
Posted Content

Adam: A Method for Stochastic Optimization

TL;DR: Adam is a method for first-order gradient-based optimization of stochastic objective functions, based on adaptive estimates of lower-order moments.
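The update rule the TL;DR describes can be sketched in a few lines: keep exponential moving averages of the gradient (first moment) and squared gradient (second moment), apply bias correction, and step. This is a minimal illustration, not the paper's reference implementation:

```python
import numpy as np

def adam_minimize(grad, x0, lr=0.1, beta1=0.9, beta2=0.999,
                  eps=1e-8, steps=200):
    x = np.asarray(x0, dtype=float)
    m = np.zeros_like(x)  # first-moment (mean) estimate
    v = np.zeros_like(x)  # second-moment (uncentered variance) estimate
    for t in range(1, steps + 1):
        g = grad(x)
        m = beta1 * m + (1 - beta1) * g
        v = beta2 * v + (1 - beta2) * g * g
        m_hat = m / (1 - beta1 ** t)  # bias correction for zero init
        v_hat = v / (1 - beta2 ** t)
        x = x - lr * m_hat / (np.sqrt(v_hat) + eps)
    return x

# Minimize f(x) = (x - 3)^2, whose gradient is 2(x - 3); x converges toward 3.
x_min = adam_minimize(lambda x: 2 * (x - 3.0), np.array([0.0]))
print(x_min)
```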
Proceedings Article (DOI)

Deep Learning Face Attributes in the Wild

TL;DR: A novel deep learning framework for attribute prediction in the wild that cascades two CNNs, LNet and ANet, which are fine-tuned jointly with attribute tags, but pre-trained differently.
Book

Topics in Optimal Transportation

TL;DR: In this book, the metric side of optimal transportation is considered from a differential point of view, and the Kantorovich duality of the optimal transportation problem is investigated.
Proceedings Article

GANs Trained by a Two Time-Scale Update Rule Converge to a Local Nash Equilibrium

TL;DR: In this paper, a two time-scale update rule (TTUR) was proposed for training GANs with stochastic gradient descent on arbitrary GAN loss functions, using an individual learning rate for the discriminator and the generator.
Proceedings Article

Improved training of wasserstein GANs

TL;DR: The authors propose to penalize the norm of the gradient of the critic with respect to its input, improving the training stability of Wasserstein GANs and enabling stable training of a wide variety of GAN architectures with almost no hyperparameter tuning.
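The penalty drives the norm of the critic's input gradient toward 1 at points interpolated between real and generated samples. A toy numpy sketch (the function names and the linear critic are illustrative assumptions, not the authors' code; in practice the input gradient comes from autodiff):

```python
import numpy as np

def gradient_penalty(grad_critic, real, fake, rng):
    """grad_critic(x) returns dD/dx at a batch of points x, shape (batch, dim)."""
    eps = rng.uniform(size=(real.shape[0], 1))
    x_hat = eps * real + (1.0 - eps) * fake  # random interpolates
    grads = grad_critic(x_hat)               # critic's input gradients
    norms = np.linalg.norm(grads, axis=1)
    return np.mean((norms - 1.0) ** 2)       # two-sided penalty toward norm 1

# For a linear critic D(x) = w . x the input gradient is w everywhere,
# so the penalty is exactly (||w|| - 1)^2 regardless of the samples.
rng = np.random.default_rng(0)
w = np.array([3.0, 4.0])                     # ||w|| = 5
grad_D = lambda x: np.tile(w, (x.shape[0], 1))
real = rng.standard_normal((8, 2))
fake = rng.standard_normal((8, 2))
print(gradient_penalty(grad_D, real, fake, rng))  # (5 - 1)^2 = 16.0
```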