Author

Jérôme Rony

Bio: Jérôme Rony is an academic researcher from École de technologie supérieure. The author has contributed to research in topics: Computer science & Mutual information. The author has an h-index of 9, has co-authored 18 publications, and has received 348 citations.

Papers
Proceedings ArticleDOI
15 Jun 2019
TL;DR: In this article, an efficient approach is proposed to generate gradient-based attacks that induce misclassifications with low L2 norm, by decoupling the direction and the norm of the adversarial perturbation that is added to the image.
Abstract: Research on adversarial examples in computer vision tasks has shown that small, often imperceptible changes to an image can induce misclassification, which has security implications for a wide range of image processing systems. Considering L2 norm distortions, the Carlini and Wagner attack is presently the most effective white-box attack in the literature. However, this method is slow since it performs a line-search for one of the optimization terms, and often requires thousands of iterations. In this paper, an efficient approach is proposed to generate gradient-based attacks that induce misclassifications with low L2 norm, by decoupling the direction and the norm of the adversarial perturbation that is added to the image. Experiments conducted on the MNIST, CIFAR-10 and ImageNet datasets indicate that our attack achieves comparable results to the state-of-the-art (in terms of L2 norm) with considerably fewer iterations (as few as 100 iterations), which opens the possibility of using these attacks for adversarial training. Models trained with our attack achieve state-of-the-art robustness against white-box gradient-based L2 attacks on the MNIST and CIFAR-10 datasets, outperforming the Madry defense when the attacks are limited to a maximum norm.

206 citations
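The core idea of the abstract above — decoupling the perturbation's direction (a normalized gradient step) from its norm (a budget that shrinks when the example is already adversarial and grows otherwise) — can be sketched as follows. This is a hypothetical PyTorch rendition for NCHW image tensors, not the paper's DDN implementation; the function name, hyperparameters, and exact norm-update rule are assumptions.

```python
import torch
import torch.nn.functional as F

def ddn_like_attack(model, x, y, steps=100, alpha=0.05, gamma=0.05):
    # Perturbation and a per-sample L2 norm budget (illustrative defaults).
    delta = torch.zeros_like(x, requires_grad=True)
    eps = torch.full((x.size(0),), 1.0, device=x.device)

    for _ in range(steps):
        logits = model(x + delta)
        loss = F.cross_entropy(logits, y)
        grad, = torch.autograd.grad(loss, delta)

        with torch.no_grad():
            # Direction: unit-norm gradient ascent step on the classification loss.
            g_norm = grad.flatten(1).norm(dim=1).clamp_min(1e-12)
            delta += alpha * grad / g_norm.view(-1, 1, 1, 1)

            # Norm: shrink the budget if the sample is already misclassified,
            # grow it otherwise (the decoupled part of the attack).
            adv = logits.argmax(dim=1) != y
            eps = torch.where(adv, eps * (1 - gamma), eps * (1 + gamma))

            # Rescale the perturbation to the current budget; keep x + delta in [0, 1].
            d_norm = delta.flatten(1).norm(dim=1).clamp_min(1e-12)
            delta *= (eps / d_norm).view(-1, 1, 1, 1)
            delta.clamp_(-x, 1 - x)

    return (x + delta).detach()
```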

Book ChapterDOI
TL;DR: This chapter describes the organisation and structure of the challenge as well as the solutions developed by the top-ranking teams.
Abstract: This competition was meant to facilitate measurable progress towards robust machine vision models and more generally applicable adversarial attacks. It encouraged researchers to develop query-efficient adversarial attacks that can successfully operate against a wide range of defenses while just observing the final model decision to generate adversarial examples. Conversely, the competition encouraged the development of new defenses that can resist a wide range of strong decision-based attacks. In this chapter we describe the organisation and structure of the challenge as well as the solutions developed by the top-ranking teams.

61 citations

Posted Content
TL;DR: It is demonstrated that the cross-entropy is an upper bound on a new pairwise loss, which has a structure similar to various pairwise losses: it minimizes intra-class distances while maximizing inter-class distances, so that minimizing the cross-entropy can be seen as an approximate bound-optimization algorithm for minimizing this pairwise loss.
Abstract: Recently, substantial research efforts in Deep Metric Learning (DML) focused on designing complex pairwise-distance losses, which require convoluted schemes to ease optimization, such as sample mining or pair weighting. The standard cross-entropy loss for classification has been largely overlooked in DML. On the surface, the cross-entropy may seem unrelated and irrelevant to metric learning as it does not explicitly involve pairwise distances. However, we provide a theoretical analysis that links the cross-entropy to several well-known and recent pairwise losses. Our connections are drawn from two different perspectives: one based on an explicit optimization insight; the other on discriminative and generative views of the mutual information between the labels and the learned features. First, we explicitly demonstrate that the cross-entropy is an upper bound on a new pairwise loss, which has a structure similar to various pairwise losses: it minimizes intra-class distances while maximizing inter-class distances. As a result, minimizing the cross-entropy can be seen as an approximate bound-optimization (or Majorize-Minimize) algorithm for minimizing this pairwise loss. Second, we show that, more generally, minimizing the cross-entropy is actually equivalent to maximizing the mutual information, to which we connect several well-known pairwise losses. Furthermore, we show that various standard pairwise losses can be explicitly related to one another via bound relationships. Our findings indicate that the cross-entropy represents a proxy for maximizing the mutual information -- as pairwise losses do -- without the need for convoluted sample-mining heuristics. Our experiments over four standard DML benchmarks strongly support our findings. We obtain state-of-the-art results, outperforming recent and complex DML methods.

60 citations
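As a concrete illustration of the pairwise structure the abstract refers to, the sketch below contrasts a generic contrastive-style pairwise loss (minimize intra-class distances, push inter-class distances beyond a margin) with plain cross-entropy, which the paper argues implicitly bounds such a loss. The names, margin, and specific pairwise form are illustrative assumptions, not the paper's exact derivation.

```python
import torch
import torch.nn.functional as F

def pairwise_contrastive_loss(features, labels, margin=1.0):
    # Generic pairwise loss of the kind the abstract relates to cross-entropy:
    # pull same-class features together, push different-class features apart
    # up to a margin. Assumes each class has at least two samples in the batch.
    dist = torch.cdist(features, features, p=2)        # pairwise L2 distances
    same = labels.unsqueeze(0) == labels.unsqueeze(1)   # same-class mask
    eye = torch.eye(len(labels), dtype=torch.bool, device=labels.device)

    intra = dist[same & ~eye].mean()                    # minimize intra-class distances
    inter = F.relu(margin - dist[~same]).mean()         # maximize inter-class distances
    return intra + inter

# The paper's point: plain cross-entropy on a classifier over the same features
# upper-bounds a loss of this form, so minimizing it implicitly does metric
# learning without pair mining or pair weighting.
def cross_entropy_loss(logits, labels):
    return F.cross_entropy(logits, labels)
```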

Journal ArticleDOI
TL;DR: The authors propose to leverage the representation capacity of deep fully convolutional neural networks (FCNs) for the segmentation of bladder walls and tumor regions in MRI images.
Abstract: Purpose Precise segmentation of bladder walls and tumor regions is an essential step toward noninvasive identification of tumor stage and grade, which is critical for treatment decision and prognosis of patients with bladder cancer (BC). However, the automatic delineation of bladder walls and tumor in magnetic resonance images (MRI) is a challenging task, due to important bladder shape variations, strong intensity inhomogeneity in urine, and very high variability across the population, particularly on tumors' appearance. To tackle these issues, we propose to leverage the representation capacity of deep fully convolutional neural networks. Methods The proposed network includes dilated convolutions to increase the receptive field without incurring extra cost or degrading its performance. Furthermore, we introduce progressive dilations in each convolutional block, thereby enabling extensive receptive fields without the need for large dilation rates. The proposed network is evaluated on 3.0T T2-weighted MRI scans from 60 pathologically confirmed patients with BC. Results Experiments show the proposed model to achieve a higher level of accuracy than state-of-the-art methods, with a mean Dice similarity coefficient of 0.98, 0.84, and 0.69 for inner wall, outer wall, and tumor region segmentation, respectively. These results represent a strong agreement with reference contours and an increase in performance compared to existing methods. In addition, inference times are less than a second for a whole three-dimensional (3D) volume, which is between two and three orders of magnitude faster than related state-of-the-art methods for this application. Conclusion We showed that a CNN can yield precise segmentation of bladder walls and tumors in BC patients on MRI. The whole segmentation process is fully automatic and yields results similar to the reference standard, demonstrating the viability of deep learning models for the automatic multiregion segmentation of bladder cancer MRI images.

56 citations
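A minimal sketch of the progressive-dilation idea described above: stacking 3x3 convolutions whose dilation rate increases across a block enlarges the receptive field without resorting to a single large dilation rate. Channel counts and the rate schedule here are assumptions, not the paper's architecture.

```python
import torch.nn as nn

class ProgressiveDilatedBlock(nn.Module):
    # Illustrative convolutional block with progressively increasing dilations.
    def __init__(self, in_ch, out_ch, dilations=(1, 2, 4)):
        super().__init__()
        layers, ch = [], in_ch
        for d in dilations:
            layers += [
                # padding = dilation keeps the spatial size unchanged for 3x3 kernels
                nn.Conv2d(ch, out_ch, kernel_size=3, padding=d, dilation=d),
                nn.BatchNorm2d(out_ch),
                nn.ReLU(inplace=True),
            ]
            ch = out_ch
        self.block = nn.Sequential(*layers)

    def forward(self, x):
        return self.block(x)
```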

Posted Content
TL;DR: This work introduces Transductive Information Maximization (TIM) for few-shot learning, and proposes a new alternating-direction solver for the mutual-information loss, which substantially speeds up transductive-inference convergence over gradient-based optimization, while yielding similar accuracy.
Abstract: We introduce Transductive Information Maximization (TIM) for few-shot learning. Our method maximizes the mutual information between the query features and their label predictions for a given few-shot task, in conjunction with a supervision loss based on the support set. Furthermore, we propose a new alternating-direction solver for our mutual-information loss, which substantially speeds up transductive-inference convergence over gradient-based optimization, while yielding similar accuracy. TIM inference is modular: it can be used on top of any base-training feature extractor. Following standard transductive few-shot settings, our comprehensive experiments demonstrate that TIM outperforms state-of-the-art methods significantly across various datasets and networks, when used on top of a fixed feature extractor trained with simple cross-entropy on the base classes, without resorting to complex meta-learning schemes. It consistently brings between 2% and 5% improvement in accuracy over the best performing method, not only on all the well-established few-shot benchmarks but also on more challenging scenarios, with domain shifts and larger numbers of classes.

49 citations
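The objective described above can be sketched as a supervised cross-entropy term on the support set plus a mutual-information term on the query predictions. The paper optimizes it with an alternating-direction solver; the sketch below only defines the loss (names and weighting are illustrative), leaving the optimizer to the reader.

```python
import torch
import torch.nn.functional as F

def tim_like_loss(support_logits, support_labels, query_logits, alpha=1.0):
    # Supervision on the labeled support set.
    ce = F.cross_entropy(support_logits, support_labels)

    p = query_logits.softmax(dim=1)                    # query class posteriors
    # Conditional entropy H(Y|X): encourage confident query predictions.
    cond_ent = -(p * p.clamp_min(1e-12).log()).sum(dim=1).mean()
    # Marginal entropy H(Y): encourage balanced class usage across queries.
    p_mean = p.mean(dim=0)
    marg_ent = -(p_mean * p_mean.clamp_min(1e-12).log()).sum()

    # Maximizing mutual information I(X;Y) = H(Y) - H(Y|X)
    # amounts to minimizing cond_ent - marg_ent.
    return ce + alpha * (cond_ent - marg_ent)
```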


Cited by
Posted Content
TL;DR: Two extensions of the PGD-attack overcoming failures due to suboptimal step size and problems of the objective function are proposed and combined with two complementary existing ones to form a parameter-free, computationally affordable and user-independent ensemble of attacks to test adversarial robustness.
Abstract: The field of defense strategies against adversarial attacks has significantly grown over the last years, but progress is hampered as the evaluation of adversarial defenses is often insufficient and thus gives a wrong impression of robustness. Many promising defenses could be broken later on, making it difficult to identify the state-of-the-art. Frequent pitfalls in the evaluation are improper tuning of hyperparameters of the attacks, gradient obfuscation or masking. In this paper we first propose two extensions of the PGD-attack overcoming failures due to suboptimal step size and problems of the objective function. We then combine our novel attacks with two complementary existing ones to form a parameter-free, computationally affordable and user-independent ensemble of attacks to test adversarial robustness. We apply our ensemble to over 50 models from papers published at recent top machine learning and computer vision venues. In all except one of the cases we achieve lower robust test accuracy than reported in these papers, often by more than $10\%$, identifying several broken defenses.

667 citations
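The evaluation protocol described above — an ensemble of attacks where a sample counts as robust only if every attack fails on it — can be sketched generically. The attack callables here are placeholders, not the paper's actual attack implementations.

```python
import torch

def robust_accuracy(model, x, y, attacks):
    # `attacks` is a list of callables (model, x, y) -> adversarial x.
    # A sample is robust only if *every* attack in the ensemble fails on it.
    robust = torch.ones(len(y), dtype=torch.bool, device=y.device)
    for attack in attacks:
        # Only re-attack samples that are still unbroken, to save computation.
        idx = robust.nonzero(as_tuple=True)[0]
        if idx.numel() == 0:
            break
        x_adv = attack(model, x[idx], y[idx])
        with torch.no_grad():
            still_correct = model(x_adv).argmax(dim=1) == y[idx]
        robust[idx] = still_correct
    return robust.float().mean().item()
```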

Proceedings Article
24 May 2019
TL;DR: TRADES as mentioned in this paper decomposes the prediction error for adversarial examples (robust error) as the sum of the natural (classification) error and boundary error, and provides a differentiable upper bound using the theory of classification-calibrated loss.
Abstract: We identify a trade-off between robustness and accuracy that serves as a guiding principle in the design of defenses against adversarial examples. Although this problem has been widely studied empirically, much remains unknown concerning the theory underlying this trade-off. In this work, we decompose the prediction error for adversarial examples (robust error) as the sum of the natural (classification) error and boundary error, and provide a differentiable upper bound using the theory of classification-calibrated loss, which is shown to be the tightest possible upper bound uniform over all probability distributions and measurable predictors. Inspired by our theoretical analysis, we also design a new defense method, TRADES, to trade adversarial robustness off against accuracy. Our proposed algorithm performs well experimentally in real-world datasets. The methodology is the foundation of our entry to the NeurIPS 2018 Adversarial Vision Challenge in which we won the 1st place out of ~2,000 submissions, surpassing the runner-up approach by $11.41\%$ in terms of mean $\ell_2$ perturbation distance.

640 citations
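A hedged sketch of the TRADES-style objective described above: natural cross-entropy plus a beta-weighted KL term between predictions on clean and adversarially perturbed inputs, with the inner maximization approximated by a few PGD steps on the KL divergence. Hyperparameters and the inner solver are illustrative simplifications, not the paper's exact procedure.

```python
import torch
import torch.nn.functional as F

def trades_like_loss(model, x, y, beta=6.0, eps=8/255, alpha=2/255, steps=10):
    model.eval()
    with torch.no_grad():
        p_clean = model(x).softmax(dim=1)

    # Inner maximization: find a perturbation that maximizes the KL term.
    x_adv = x + 0.001 * torch.randn_like(x)
    for _ in range(steps):
        x_adv.requires_grad_(True)
        kl = F.kl_div(model(x_adv).log_softmax(dim=1), p_clean, reduction='batchmean')
        grad, = torch.autograd.grad(kl, x_adv)
        x_adv = (x_adv.detach() + alpha * grad.sign()).clamp(x - eps, x + eps).clamp(0, 1)

    # Outer minimization: accuracy on clean data plus the robustness regularizer.
    model.train()
    logits = model(x)
    kl_adv = F.kl_div(model(x_adv).log_softmax(dim=1), logits.softmax(dim=1),
                      reduction='batchmean')
    return F.cross_entropy(logits, y) + beta * kl_adv
```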

Posted Content
TL;DR: TRADES as mentioned in this paper decomposes the prediction error for adversarial examples (robust error) as the sum of the natural (classification) error and boundary error, and provides a differentiable upper bound using the theory of classification-calibrated loss.
Abstract: We identify a trade-off between robustness and accuracy that serves as a guiding principle in the design of defenses against adversarial examples. Although this problem has been widely studied empirically, much remains unknown concerning the theory underlying this trade-off. In this work, we decompose the prediction error for adversarial examples (robust error) as the sum of the natural (classification) error and boundary error, and provide a differentiable upper bound using the theory of classification-calibrated loss, which is shown to be the tightest possible upper bound uniform over all probability distributions and measurable predictors. Inspired by our theoretical analysis, we also design a new defense method, TRADES, to trade adversarial robustness off against accuracy. Our proposed algorithm performs well experimentally in real-world datasets. The methodology is the foundation of our entry to the NeurIPS 2018 Adversarial Vision Challenge in which we won the 1st place out of ~2,000 submissions, surpassing the runner-up approach by $11.41\%$ in terms of mean $\ell_2$ perturbation distance.

454 citations

Journal ArticleDOI
08 Jul 1992-JAMA
TL;DR: Several years ago, as editor-in-chief of the American Journal of Surgical Pathology, Dr Sternberg inaugurated a series of articles under the same heading as his book's title, Histology for Pathologists, which have now been considerably augmented and incorporated into this large, almost 1000-page, comprehensive text.
Abstract: Several years ago, as editor-in-chief of the American Journal of Surgical Pathology, Dr Sternberg inaugurated a series of articles under the same heading as his book's title, Histology for Pathologists. These articles have now been considerably augmented and incorporated into this large, almost 1000-page, comprehensive text, whose 80 authors comprise an international cast of experts. Inevitably, the chapters vary in their quality, to an extent that reflects either inadequate editorial surveillance or adherence to a very tight deadline. I would assume that the instruction to contributors was to write on pure and applied histology, with the level of detail required by a practicing pathologist. As Dr Sternberg mentions in his preface, there are many histology books in existence, but none contains the detail and variations from normal that form a baseline for estimating the presence of disease. The papers originally published in the American Journal of Surgical Pathology are strictly

427 citations

Journal ArticleDOI
TL;DR: Compared to other state-of-the-art segmentation networks, this model yields better segmentation performance, increasing the accuracy of the predictions while reducing the standard deviation, which demonstrates the efficiency of the approach to generate precise and reliable automatic segmentations of medical images.
Abstract: Even though convolutional neural networks (CNNs) are driving progress in medical image segmentation, standard models still have some drawbacks. First, the use of multi-scale approaches, i.e., encoder-decoder architectures, leads to a redundant use of information, where similar low-level features are extracted multiple times at multiple scales. Second, long-range feature dependencies are not efficiently modeled, resulting in non-optimal discriminative feature representations associated with each semantic class. In this paper we attempt to overcome these limitations with the proposed architecture, by capturing richer contextual dependencies based on the use of guided self-attention mechanisms. This approach is able to integrate local features with their corresponding global dependencies, as well as highlight interdependent channel maps in an adaptive manner. Further, the additional loss between different modules guides the attention mechanisms to neglect irrelevant information and focus on more discriminant regions of the image by emphasizing relevant feature associations. We evaluate the proposed model in the context of semantic segmentation on three different datasets: abdominal organs, cardiovascular structures and brain tumors. A series of ablation experiments support the importance of these attention modules in the proposed architecture. In addition, compared to other state-of-the-art segmentation networks our model yields better segmentation performance, increasing the accuracy of the predictions while reducing the standard deviation. This demonstrates the efficiency of our approach to generate precise and reliable automatic segmentations of medical images. Our code is made publicly available at: https://github.com/sinAshish/Multi-Scale-Attention .

302 citations
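The guided self-attention mechanism described above builds on standard position (spatial) self-attention, sketched minimally below; the paper's multi-scale, guided variant adds guidance losses between modules that are not shown here. Class and parameter names are assumptions.

```python
import torch
import torch.nn as nn

class PositionAttention2d(nn.Module):
    # Each location is re-expressed as an attention-weighted sum over all other
    # locations, so long-range dependencies inform the local feature.
    def __init__(self, channels, reduction=8):
        super().__init__()
        self.query = nn.Conv2d(channels, channels // reduction, kernel_size=1)
        self.key = nn.Conv2d(channels, channels // reduction, kernel_size=1)
        self.value = nn.Conv2d(channels, channels, kernel_size=1)
        self.gamma = nn.Parameter(torch.zeros(1))  # learned residual weight

    def forward(self, x):
        b, c, h, w = x.shape
        q = self.query(x).flatten(2).transpose(1, 2)   # (b, hw, c')
        k = self.key(x).flatten(2)                      # (b, c', hw)
        attn = torch.softmax(q @ k, dim=-1)             # (b, hw, hw)
        v = self.value(x).flatten(2)                    # (b, c, hw)
        out = (v @ attn.transpose(1, 2)).view(b, c, h, w)
        return self.gamma * out + x                     # residual connection
```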