Open Access · Posted Content

Practical Full Resolution Learned Lossless Image Compression

TLDR
The first practical learned lossless image compression system, L3C, is proposed; it outperforms the popular engineered codecs PNG, WebP, and JPEG 2000, and the authors find that learning the auxiliary representation is crucial, significantly outperforming predefined auxiliary representations such as an RGB pyramid.
Abstract
We propose the first practical learned lossless image compression system, L3C, and show that it outperforms the popular engineered codecs, PNG, WebP and JPEG 2000. At the core of our method is a fully parallelizable hierarchical probabilistic model for adaptive entropy coding which is optimized end-to-end for the compression task. In contrast to recent autoregressive discrete probabilistic models such as PixelCNN, our method i) models the image distribution jointly with learned auxiliary representations instead of exclusively modeling the image distribution in RGB space, and ii) only requires three forward-passes to predict all pixel probabilities instead of one for each pixel. As a result, L3C obtains over two orders of magnitude speedups when sampling compared to the fastest PixelCNN variant (Multiscale-PixelCNN). Furthermore, we find that learning the auxiliary representation is crucial and outperforms predefined auxiliary representations such as an RGB pyramid significantly.
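
Below is a minimal sketch, in PyTorch, of the property the abstract highlights: because distribution parameters for every pixel of a scale are predicted jointly from a learned auxiliary representation, three scales need only three forward passes, independent of image size. The layer sizes, channel counts, and the ScalePredictor name are illustrative assumptions, not the authors' architecture.

    import torch
    import torch.nn as nn

    class ScalePredictor(nn.Module):
        """Illustrative: maps a learned auxiliary representation to
        distribution parameters for all pixels of one scale at once."""
        def __init__(self, in_ch, n_params):
            super().__init__()
            self.net = nn.Sequential(
                nn.Conv2d(in_ch, 64, 3, padding=1), nn.ReLU(),
                nn.Conv2d(64, n_params, 3, padding=1),
            )

        def forward(self, z):
            return self.net(z)

    # Three scales -> exactly three forward passes, regardless of resolution.
    predictors = nn.ModuleList([ScalePredictor(8, 30) for _ in range(3)])
    z = [torch.randn(1, 8, 32 >> s, 32 >> s) for s in range(3)]  # toy auxiliary features
    params = [p(zs) for p, zs in zip(predictors, z)]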


Citations
Proceedings Article

Generating Diverse High-Fidelity Images with VQ-VAE-2

TL;DR: In this article, the authors explore the use of vector quantized variational autoencoder (VQ-VAE) models for large-scale image generation and demonstrate that a multi-scale hierarchical organization with powerful priors over the latent codes can generate samples whose quality rivals that of state-of-the-art Generative Adversarial Networks on multifaceted datasets such as ImageNet, while not suffering from GANs' known shortcomings such as mode collapse and lack of diversity.
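
As context for the VQ-VAE family, here is a hedged sketch of the vector-quantization step itself: each encoder output vector is replaced by its nearest codebook entry. The codebook size (512) and dimension (64) are illustrative, not the paper's settings.

    import torch

    def quantize(z_e, codebook):
        # z_e: (N, D) encoder outputs; codebook: (K, D) embedding vectors
        dists = torch.cdist(z_e, codebook)   # (N, K) pairwise distances
        idx = dists.argmin(dim=1)            # index of the nearest code per vector
        return codebook[idx], idx            # quantized vectors and their indices

    codebook = torch.randn(512, 64)
    z_e = torch.randn(1024, 64)
    z_q, idx = quantize(z_e, codebook)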
Posted Content

Learned Image Compression with Discretized Gaussian Mixture Likelihoods and Attention Modules

TL;DR: This paper proposes to use discretized Gaussian mixture likelihoods to parameterize the distributions of latent codes, which yields a more accurate and flexible entropy model and achieves state-of-the-art performance compared with existing learned compression methods.
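
The discretized Gaussian mixture likelihood amounts to integrating a K-component mixture over unit-width bins around each integer symbol. A hedged sketch (shapes and the K=5 mixture are illustrative):

    import torch

    def discretized_gmm_pmf(y, weights, means, scales):
        # y: integer-valued symbols (...); weights/means/scales: (..., K)
        normal = torch.distributions.Normal(means, scales)
        upper = normal.cdf(y.unsqueeze(-1) + 0.5)
        lower = normal.cdf(y.unsqueeze(-1) - 0.5)
        return (weights * (upper - lower)).sum(dim=-1)  # mass of the bin [y-0.5, y+0.5]

    y = torch.tensor([0., 1., -2.])
    w = torch.softmax(torch.randn(3, 5), dim=-1)
    mu, sigma = torch.randn(3, 5), torch.rand(3, 5) + 0.1
    pmf = discretized_gmm_pmf(y, w, mu, sigma)  # per-symbol probability mass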
Journal Article

End-to-End Learnt Image Compression via Non-Local Attention Optimization and Improved Context Modeling

TL;DR: An end-to-end learnt lossy image compression approach is presented, built on top of a deep neural network (DNN)-based variational auto-encoder (VAE) structure with Non-Local Attention optimization and Improved Context modeling (NLAIC).
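
For readers unfamiliar with the non-local attention ingredient, here is a hedged sketch of a generic non-local block (channel sizes are illustrative; this is not the NLAIC architecture): every spatial position attends to every other position, and the result is added back residually.

    import torch
    import torch.nn as nn
    import torch.nn.functional as F

    class NonLocalBlock(nn.Module):
        def __init__(self, ch):
            super().__init__()
            self.theta = nn.Conv2d(ch, ch // 2, 1)
            self.phi = nn.Conv2d(ch, ch // 2, 1)
            self.g = nn.Conv2d(ch, ch // 2, 1)
            self.out = nn.Conv2d(ch // 2, ch, 1)

        def forward(self, x):
            b, c, h, w = x.shape
            q = self.theta(x).flatten(2).transpose(1, 2)  # (b, hw, c/2)
            k = self.phi(x).flatten(2)                    # (b, c/2, hw)
            v = self.g(x).flatten(2).transpose(1, 2)      # (b, hw, c/2)
            attn = F.softmax(q @ k, dim=-1)               # (b, hw, hw) attention map
            y = (attn @ v).transpose(1, 2).reshape(b, c // 2, h, w)
            return x + self.out(y)                        # residual connection

    x = torch.randn(1, 64, 16, 16)
    y = NonLocalBlock(64)(x)  # same shape as the input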
Journal Article

Nonlinear Transform Coding

TL;DR: A novel variant of entropy-constrained vector quantization based on artificial neural networks, together with learned entropy models, is introduced to assess the empirical rate–distortion performance of nonlinear transform coding methods.
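
The rate–distortion objective underlying nonlinear transform coding can be written as a Lagrangian of learned-entropy-model rate and reconstruction distortion. A hedged sketch with placeholder tensors (the transforms and entropy model are omitted, and the lambda value is illustrative):

    import torch

    def rd_loss(x, x_hat, likelihoods, lam=0.01):
        rate = -torch.log2(likelihoods).sum() / x.numel()   # bits per pixel under the entropy model
        distortion = torch.mean((x - x_hat) ** 2)           # mean squared error
        return rate + lam * distortion

    x = torch.rand(1, 3, 8, 8)
    x_hat = x + 0.05 * torch.randn_like(x)                  # stand-in reconstruction
    likelihoods = torch.rand(1, 16, 4, 4).clamp(min=1e-9)   # stand-in entropy-model outputs
    loss = rd_loss(x, x_hat, likelihoods)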
Proceedings Article

Integer Discrete Flows and Lossless Compression

TL;DR: This work introduces Integer Discrete Flow (IDF), a flow-based generative model for ordinal discrete data: a bijective integer map that can learn rich transformations on high-dimensional data, together with a flexible transformation layer called integer discrete coupling.
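
The key property of integer discrete coupling is exact invertibility on integers: one half of the input is shifted by a rounded prediction computed from the other half. A hedged sketch (the tiny prediction network is illustrative):

    import torch
    import torch.nn as nn

    class IntegerCoupling(nn.Module):
        def __init__(self, dim):
            super().__init__()
            self.t = nn.Sequential(nn.Linear(dim // 2, 32), nn.ReLU(),
                                   nn.Linear(32, dim - dim // 2))

        def forward(self, x):
            x1, x2 = x[:, : x.shape[1] // 2], x[:, x.shape[1] // 2 :]
            y2 = x2 + torch.round(self.t(x1.float())).long()   # integer shift
            return torch.cat([x1, y2], dim=1)

        def inverse(self, y):
            y1, y2 = y[:, : y.shape[1] // 2], y[:, y.shape[1] // 2 :]
            x2 = y2 - torch.round(self.t(y1.float())).long()   # undo the shift exactly
            return torch.cat([y1, x2], dim=1)

    layer = IntegerCoupling(8)
    x = torch.randint(-5, 6, (4, 8))
    assert torch.equal(layer.inverse(layer.forward(x)), x)     # bijective on integers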
References
Journal Article

A mathematical theory of communication

TL;DR: This final installment of the paper considers the case where the signals or the messages or both are continuously variable, in contrast with the discrete nature assumed until now.
Book

Elements of information theory

TL;DR: The authors examine the role of entropy, inequalities, and randomness in the design and construction of codes.
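
As background for the book's treatment of entropy and codes (a standard result, not a quotation from the book): the entropy of a discrete source lower-bounds the expected length of any uniquely decodable binary code, and an optimal prefix code comes within one bit of it,

    H(X) = -\sum_{x} p(x) \log_2 p(x), \qquad H(X) \le \mathbb{E}[\ell(C^{*}(X))] < H(X) + 1.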
Proceedings Article

Batch Normalization: Accelerating Deep Network Training by Reducing Internal Covariate Shift

TL;DR: Applied to a state-of-the-art image classification model, Batch Normalization achieves the same accuracy with 14 times fewer training steps, and beats the original model by a significant margin.
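
A hedged sketch of the batch-normalization transform itself (illustrative, not the paper's experimental setup): normalize each feature over the mini-batch, then apply a learned scale and shift.

    import torch

    def batch_norm(x, gamma, beta, eps=1e-5):
        # x: (batch, features); gamma/beta: learned per-feature scale and shift
        mean = x.mean(dim=0)
        var = x.var(dim=0, unbiased=False)
        x_hat = (x - mean) / torch.sqrt(var + eps)
        return gamma * x_hat + beta

    x = torch.randn(16, 4)
    out = batch_norm(x, gamma=torch.ones(4), beta=torch.zeros(4))
    # comparable to torch.nn.BatchNorm1d(4) in training mode (ignoring running statistics)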
Proceedings Article

Auto-Encoding Variational Bayes

TL;DR: A stochastic variational inference and learning algorithm that scales to large datasets and, under some mild differentiability conditions, even works in the intractable case is introduced.
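
The estimator in this paper rests on the reparameterization trick: a latent sample is written as a deterministic, differentiable function of the variational parameters and independent noise. A hedged sketch with an illustrative diagonal-Gaussian posterior (encoder and decoder omitted):

    import torch

    def reparameterize(mu, logvar):
        std = torch.exp(0.5 * logvar)
        eps = torch.randn_like(std)        # eps ~ N(0, I)
        return mu + std * eps              # gradients flow through mu and logvar

    def kl_to_standard_normal(mu, logvar):
        # KL( N(mu, sigma^2) || N(0, I) ), summed over latent dimensions
        return -0.5 * torch.sum(1 + logvar - mu.pow(2) - logvar.exp())

    mu = torch.zeros(8, requires_grad=True)
    logvar = torch.zeros(8, requires_grad=True)
    z = reparameterize(mu, logvar)         # differentiable sample
    kl = kl_to_standard_normal(mu, logvar)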
Posted Content

Rethinking Atrous Convolution for Semantic Image Segmentation

TL;DR: The proposed DeepLabv3 system significantly improves over previous DeepLab versions without DenseCRF post-processing and attains performance comparable to other state-of-the-art models on the PASCAL VOC 2012 semantic image segmentation benchmark.
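
A hedged sketch of the atrous (dilated) convolution that DeepLabv3 builds on: the same 3x3 kernel covers a larger receptive field by sampling the input with gaps, at no extra parameter cost. Sizes below are illustrative.

    import torch
    import torch.nn as nn

    x = torch.randn(1, 3, 65, 65)
    conv_rate_1 = nn.Conv2d(3, 8, kernel_size=3, padding=1, dilation=1)
    conv_rate_6 = nn.Conv2d(3, 8, kernel_size=3, padding=6, dilation=6)
    # Both preserve the spatial size; the dilated kernel sees a 13x13 neighborhood.
    assert conv_rate_1(x).shape == (1, 8, 65, 65)
    assert conv_rate_6(x).shape == (1, 8, 65, 65)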