Showing papers by "Yannick Berthoumieu published in 2022"

PDF

Open Access

Journal Article•DOI•

Patch-based image Super Resolution using generalized Gaussian mixture model

[...]

Dang Phoung Lan Nguyen, Jean-François Aujol, Yannick Berthoumieu

07 Jun 2022-arXiv.org

TL;DR: In this paper , the authors proposed an algorithm to learn a joint generalized Gaussian mixture model (GGMM) from a pair of the low resolution patches and the corresponding high resolution patches from the reference data.

...read moreread less

Abstract: les patchs consid´erable. La technique de minimum de l’erreur qua-dratique moyenne est une m´ethode de restauration d’images qui utilise un mod`ele gaussien de probabilit´es les patchs d’images. article propose un algorithme d’apprentissage d’un mod ` ele conjoint de m ´ elange gaussien g ´ en ´ eralis ´ e (GGMM) ` a partir des paires de patchs ` a basse r´esolution et des patchs correspondants `a haute r´esolution d’une image de r´ef´erence. `A partir de ce mod`ele GGMM, l’image haute r´esolution en utilisant la m´ethode MMSE. Nos ´evaluations num´eriques indiquent que la m´ethode MMSE-GGMM comporte rapport `a Abstract – Single Image Super Resolution (SISR) methods aim to recover the clean images in high resolution from low resolution observations. A family of patch-based approaches have received considerable attention and development. The minimum mean square error (MMSE) method is a powerful image restoration method that uses a probability model on the patches of images. This paper proposes an algorithm to learn a joint generalized Gaussian mixture model (GGMM) from a pair of the low resolution patches and the corresponding high resolution patches from the reference data. We then reconstruct the high resolution image based on the MMSE method. Our numerical evaluations indicate that the MMSE-GGMM method competes with other state of the art methods.

...read moreread less

1 citations

Book Chapter•DOI•

Remote Sensing Scene Classification Based on Covariance Pooling of Multi-layer CNN Features Guided by Saliency Maps

[...]

Sara Akodad, Lionel Bombrun, Christian Germain, Yannick Berthoumieu

01 Jan 2022-Lecture Notes in Computer Science

TL;DR: In this paper , a novel supervised classification algorithm based on covariance pooling of multi-layer convolutional neural network (CNN) features is introduced, inspired by the theory of robust statistics, a weighted covariance matrix estimator is considered.

...read moreread less

Abstract: The new generation of remote sensing imaging sensors enables high spatial, spectral and temporal resolution images with high revisit frequencies. These sensors allow the acquisition of multi-spectral and multi-temporal images. The availability of these data has raised the interest of the remote sensing community to develop novel machine learning strategies for supervised classification. This paper aims at introducing a novel supervised classification algorithm based on covariance pooling of multi-layer convolutional neural network (CNN) features. The basic idea consists in an ensemble learning approach based on covariance matrices estimation from CNN features. Then, after being projected on the log-Euclidean space, an SVM classifier is used to make a decision. In order to give more strength to relatively small objects of interest in the scene, we propose to incorporate the visual saliency map in the process. For that, inspired by the theory of robust statistics, a weighted covariance matrix estimator is considered. Larger weights are given to more salient regions. Finally, some experiments on remote sensing classification are conducted on the UC Merced land use dataset. The obtained results confirm the potential of the proposed approach in terms of classification scene accuracy. It demonstrates, besides the interest of exploiting second order statistics and adopting an ensemble learning approach, the benefit of incorporating visual saliency maps.

...read moreread less

Proceedings Article•DOI•

Deep Ensemble Learning Model Based on Covariance Pooling of Multi-Layer CNN Features

[...]

Sara Akodad, Lionel Bombrun, Maria Puscasu, Junshi Xia, Christian Germain, Yannick Berthoumieu - Show less +2 more

16 Oct 2022

TL;DR: In this paper , a multi-layer ensemble learning architecture is proposed to obtain better predictive performance for remote sensing scene classification, indoor scene recognition, and texture classification, which has been shown to improve representation and generalization abilities.

...read moreread less

Abstract: Compared to standard deep convolutional neural networks (CNN) which include a global average pooling operator, second-order neural networks have a global covariance pooling operator which allows to capture richer statistics of CNN features. They have been shown to improve representation and generalization abilities. However, this covariance pooling is performed only on the deepest CNN feature maps. To benefit from different levels of abstraction, we propose to extend these models by using a multi-layer approach. In addition, to obtain better predictive performance, an end-to-end ensemble learning architecture is proposed. Experiments are conducted on four datasets and have confirmed the potential of the proposed model for various image processing applications such as remote sensing scene classification, indoor scene recognition and texture classification.

...read moreread less

Proceedings Article•DOI•

Automatic selection of latent variables in variational auto-encoders

[...]

Emma Jouffroy, Audrey Giremus, Yannick Berthoumieu, Olivier Bach, A. Hugget - Show less +1 more

29 Aug 2022

TL;DR: This paper proposes a variant of the VAE which is able to automatically determine the informative components of the latent space, which consists in augmenting the vanilla VAE with auxiliary variables and defining a hierarchical model which favors that only a subset of the hidden variables are used for the encoding.

...read moreread less

Abstract: Variational auto-encoders (VAEs) are powerful generative neural networks based on latent variables. They aim to capture the distribution of a dataset, by building an informative space composed of a reduced number of variables. However, the size of this latent space is both sensitive and difficult to adjust. Thus, most state-of-the-art architectures experience either dis-entanglement issues, or, at the opposite, posterior collapse. Both phenomena impair the interpretability of the latent variables. In this paper, we propose a variant of the VAE which is able to automatically determine the informative components of the latent space. It consists in augmenting the vanilla VAE with auxiliary variables and defining a hierarchical model which favors that only a subset of the latent variables are used for the encoding. We refer to it as NGVAE. We compare its performance with other auto-encoder based architectures.

...read moreread less

Journal Article•DOI•

Riemannian information gradient methods for the parameter estimation of ECD

[...]

Jialun Zhou¹, Salem Said¹, Yannick Berthoumieu¹•Institutions (1)

Centre national de la recherche scientifique¹

01 Mar 2022-Signal Processing

TL;DR: In this paper, the information stochastic gradient (ISG) method is proposed, which is an online (recursive) method, which achieves the same performance as MLE, while requiring modest memory and time resources.

...read moreread less