Author

Hartmut Bauermeister

Bio: Hartmut Bauermeister is an academic researcher from the University of Siegen. The author has contributed to research in topics: Deep learning & Transfer of learning. The author has an h-index of 2 and has co-authored 6 publications receiving 166 citations.

Papers
Proceedings Article
01 Jan 2020
TL;DR: In this paper, the authors show that it is possible to reconstruct images at high resolution from the knowledge of their parameter gradients, and demonstrate that such a break of privacy is possible even for trained deep networks.
Abstract: The idea of federated learning is to collaboratively train a neural network on a server. Each user receives the current weights of the network and in turn sends parameter updates (gradients) based on local data. This protocol has been designed not only to train neural networks data-efficiently, but also to provide privacy benefits for users, as their input data remains on device and only parameter gradients are shared. But how secure is sharing parameter gradients? Previous attacks have provided a false sense of security, by succeeding only in contrived settings - even for a single image. However, by exploiting a magnitude-invariant loss along with optimization strategies based on adversarial attacks, we show that it is actually possible to faithfully reconstruct images at high resolution from the knowledge of their parameter gradients, and demonstrate that such a break of privacy is possible even for trained deep networks. We analyze the effects of architecture as well as parameters on the difficulty of reconstructing an input image and prove that any input to a fully connected layer can be reconstructed analytically, independently of the remaining architecture. Finally, we discuss settings encountered in practice and show that even averaging gradients over several iterations or several images does not protect the user's privacy in federated learning applications in computer vision.
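The magnitude-invariant loss at the core of this attack is a cosine distance between the gradients a candidate image induces and the gradients observed from the user. Below is a minimal PyTorch sketch of that optimization loop under this reading; all names are illustrative, and the paper's full procedure differs in details (e.g., signed updates and restarts):

```python
import torch
import torch.nn.functional as F

def total_variation(x):
    # Simple anisotropic TV prior on the image estimate.
    return (x[..., :, 1:] - x[..., :, :-1]).abs().mean() + \
           (x[..., 1:, :] - x[..., :-1, :]).abs().mean()

def invert_gradients(model, observed_grads, label, shape, steps=2000, tv_weight=1e-4):
    # Reconstruct an input by maximizing the cosine similarity between
    # the gradients it induces and the observed parameter gradients.
    x = torch.randn(shape, requires_grad=True)
    opt = torch.optim.Adam([x], lr=0.1)
    params = [p for p in model.parameters() if p.requires_grad]
    for _ in range(steps):
        opt.zero_grad()
        grads = torch.autograd.grad(
            F.cross_entropy(model(x), label), params, create_graph=True)
        dot = sum((g * t).sum() for g, t in zip(grads, observed_grads))
        norm = torch.sqrt(sum(g.pow(2).sum() for g in grads)) * \
               torch.sqrt(sum(t.pow(2).sum() for t in observed_grads))
        # Magnitude-invariant objective: only the gradient direction matters.
        loss = 1 - dot / norm + tv_weight * total_variation(x)
        loss.backward()
        opt.step()
    return x.detach()
```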

423 citations

Posted Content
TL;DR: It is shown that it is actually possible to faithfully reconstruct images at high resolution from the knowledge of their parameter gradients, and it is demonstrated that such a break of privacy is possible even for trained deep networks.
Abstract: The idea of federated learning is to collaboratively train a neural network on a server. Each user receives the current weights of the network and in turn sends parameter updates (gradients) based on local data. This protocol has been designed not only to train neural networks data-efficiently, but also to provide privacy benefits for users, as their input data remains on device and only parameter gradients are shared. But how secure is sharing parameter gradients? Previous attacks have provided a false sense of security, by succeeding only in contrived settings - even for a single image. However, by exploiting a magnitude-invariant loss along with optimization strategies based on adversarial attacks, we show that it is actually possible to faithfully reconstruct images at high resolution from the knowledge of their parameter gradients, and demonstrate that such a break of privacy is possible even for trained deep networks. We analyze the effects of architecture as well as parameters on the difficulty of reconstructing an input image and prove that any input to a fully connected layer can be reconstructed analytically, independently of the remaining architecture. Finally, we discuss settings encountered in practice and show that even averaging gradients over several iterations or several images does not protect the user's privacy in federated learning applications in computer vision.

99 citations

Journal ArticleDOI
TL;DR: In this paper, two approaches are presented: a generic linear regularization that learns how to manipulate the singular values of the linear operator, extending [1], and a tailored approach in the Fourier domain that is specific to CT reconstruction.
Abstract: The reconstruction of images from their corresponding noisy Radon transform is a typical example of an ill-posed linear inverse problem as arising in the application of computerized tomography (CT). As the (naïve) solution does not depend on the measured data continuously, regularization is needed to re-establish a continuous dependence. In this work, we investigate simple yet provably convergent approaches to learning linear regularization methods from data. More specifically, we analyze two approaches: one generic linear regularization that learns how to manipulate the singular values of the linear operator in an extension of [1], and one tailored approach in the Fourier domain that is specific to CT reconstruction. We prove that such approaches become convergent regularization methods, and that the reconstructions they provide are typically much smoother than the training data they were trained on. Finally, we compare the spectral as well as the Fourier-based approaches for CT reconstruction numerically, discuss their advantages and disadvantages, and investigate the effect of discretization errors at different resolutions.
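For the generic spectral approach, learning a linear regularizer amounts to replacing the unstable inverted singular values 1/σ_i of the operator with multipliers fitted to training data. A minimal NumPy sketch under that reading, with illustrative names and a matrix A standing in for the discretized Radon transform:

```python
import numpy as np

def learn_spectral_filter(A, X, Y):
    # Fit one multiplier per singular direction of A from training pairs:
    # columns of X are clean signals, columns of Y the matching noisy data.
    U, _, Vt = np.linalg.svd(A, full_matrices=False)
    B = U.T @ Y                # data coefficients   <u_i, y>
    C = Vt @ X                 # signal coefficients <v_i, x>
    g = (C * B).sum(axis=1) / (B * B).sum(axis=1)   # per-direction least squares
    return U, g, Vt

def reconstruct(U, g, Vt, y):
    # Apply the learned linear regularizer to new measurements y.
    return Vt.T @ (g * (U.T @ y))
```

In directions dominated by noise, the fitted multiplier g_i shrinks toward zero instead of blowing up like 1/σ_i, which is the intuition behind why such learned linear methods can be stable (and smoothing).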

3 citations

Proceedings ArticleDOI
01 Jan 2022
TL;DR: This paper demonstrates that deep priors also allow finding better local optima in the non-convex energy landscape of the nonlinear inverse problem arising from THz imaging.
Abstract: In this paper, we propose a deep optimization prior approach with application to the estimation of material-related model parameters from terahertz (THz) data that is acquired using a Frequency Modulated Continuous Wave (FMCW) THz scanning system. A stable estimation of the THz model parameters for low SNR and shot noise configurations is essential to achieve acquisition times required for applications in, e.g., quality control. Conceptually, our deep optimization prior approach estimates the desired THz model parameters by optimizing for the weights of a neural network. While such a technique was shown to improve the reconstruction quality for convex objectives in the seminal work of Ulyanov et al., our paper demonstrates that deep priors also allow finding better local optima in the non-convex energy landscape of the nonlinear inverse problem arising from THz imaging. We verify this claim numerically on various THz parameter estimation problems for synthetic and real data under low SNR and shot noise conditions. While the low SNR scenario does not even require regularization, the impact of shot noise is significantly reduced by total variation (TV) regularization. We compare our approach with existing optimization techniques that require sophisticated physically motivated initialization, and with a 1D single-pixel reparametrization method.
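Concretely, a deep optimization prior replaces direct optimization over the parameter maps p with optimization over the weights of a network that outputs them. A minimal PyTorch sketch of this reparametrization, assuming a differentiable THz forward model; all names are hypothetical:

```python
import torch

def deep_prior_estimate(forward_model, data, net, z, steps=3000, tv_weight=0.0):
    # Optimize network weights so that p = net(z) explains the measurements,
    # rather than optimizing the parameter maps p themselves.
    opt = torch.optim.Adam(net.parameters(), lr=1e-3)
    for _ in range(steps):
        opt.zero_grad()
        p = net(z)                                    # parameter maps
        loss = (forward_model(p) - data).pow(2).mean()
        if tv_weight > 0:                             # optional TV term (shot noise)
            loss = loss + tv_weight * (p[..., 1:] - p[..., :-1]).abs().mean()
        loss.backward()
        opt.step()
    return net(z).detach()
```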

2 citations

Posted Content
TL;DR: This methodology is discussed in detail, and examples are shown in multi-label segmentation by minimal partitions and stereo estimation, where it is demonstrated that the proposed graph discretization technique can reduce the runtime as well as the memory consumption by up to a factor of 10 in comparison to classical pixelwise discretizations.
Abstract: Matching and partitioning problems are fundamental to computer vision applications, with examples in multi-label segmentation, stereo estimation and optical-flow computation. These tasks can be posed as non-convex energy minimization problems and solved to near-global optimality by recent convex lifting approaches. Yet, applying these techniques comes with a significant computational effort, reducing their feasibility in practical applications. We discuss spatial discretization of continuous partitioning problems into a graph structure, generalizing discretization onto a Cartesian grid. This setup allows us to faithfully work on super-pixel graphs constructed by SLIC or Cut-Pursuit, massively decreasing the computational effort for lifted partitioning problems compared to a Cartesian grid, while optimal energy values remain similar: the global matching is still solved to near-global optimality. We discuss this methodology in detail and show examples in multi-label segmentation by minimal partitions and stereo estimation, where we demonstrate that the proposed graph discretization can reduce runtime as well as memory consumption of convex relaxations of matching problems by up to a factor of 10.
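The key construction is the coarsened graph itself: superpixels become nodes carrying averaged features, and spatially adjacent superpixels are connected by edges. A minimal sketch of that construction using SLIC from scikit-image, assuming an RGB image; the lifted partitioning solver that would run on top of this graph is not shown:

```python
import numpy as np
from skimage.segmentation import slic

def superpixel_graph(image, n_segments=400):
    # Coarsen the Cartesian pixel grid into a superpixel graph.
    labels = slic(image, n_segments=n_segments, compactness=10, start_label=0)
    n = labels.max() + 1
    # Node features: mean color per superpixel.
    feats = np.stack([image[labels == i].mean(axis=0) for i in range(n)])
    # Edges: pairs of distinct labels that touch horizontally or vertically.
    edges = set()
    for a, b in [(labels[:, :-1], labels[:, 1:]), (labels[:-1, :], labels[1:, :])]:
        m = a != b
        pairs = np.sort(np.stack([a[m], b[m]], axis=1), axis=1)
        edges |= set(map(tuple, pairs))
    return feats, np.array(sorted(edges)), labels
```

A few hundred nodes replace hundreds of thousands of pixels, which is where the reported factor-of-10 savings in runtime and memory for the lifted relaxations comes from.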

1 citation


Cited by
Book ChapterDOI
21 Jun 2019
TL;DR: In this paper, the authors show that they can obtain the private training set from the publicly shared gradients, an attack they name deep leakage from gradients, and they practically validate the effectiveness of their algorithm on both computer vision and natural language processing tasks.
Abstract: Passing gradients is a widely used scheme in modern multi-node learning systems (e.g., distributed training, collaborative learning). For a long time, people believed that gradients are safe to share, i.e., that the training set would not be leaked by gradient sharing. However, in this paper, we show that we can obtain the private training set from the publicly shared gradients. The leakage takes only a few gradient steps to process and recovers the original training set instead of look-alike alternatives. We name this leakage deep leakage from gradients and practically validate the effectiveness of our algorithm on both computer vision and natural language processing tasks. We empirically show that our attack is much stronger than previous approaches, and we thereby raise people's awareness to rethink the safety of sharing gradients. We also discuss some possible strategies to defend against this deep leakage.
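The attack described here is compact: a dummy input and a dummy soft label are jointly optimized so that the gradients they induce match the shared gradients in squared distance. A minimal PyTorch sketch along those lines, with illustrative names:

```python
import torch
import torch.nn.functional as F

def deep_leakage_from_gradients(model, shared_grads, x_shape, n_classes, iters=300):
    # Jointly optimize a dummy input and a dummy soft label so that the
    # gradients they produce match the publicly shared gradients.
    dummy_x = torch.randn(x_shape, requires_grad=True)
    dummy_y = torch.randn(1, n_classes, requires_grad=True)   # soft-label logits
    opt = torch.optim.LBFGS([dummy_x, dummy_y])
    params = list(model.parameters())

    def closure():
        opt.zero_grad()
        loss = -(F.softmax(dummy_y, dim=-1)
                 * F.log_softmax(model(dummy_x), dim=-1)).sum()
        grads = torch.autograd.grad(loss, params, create_graph=True)
        diff = sum(((g - t) ** 2).sum() for g, t in zip(grads, shared_grads))
        diff.backward()
        return diff

    for _ in range(iters):
        opt.step(closure)
    return dummy_x.detach(), dummy_y.detach()
```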

450 citations

Posted Content
TL;DR: A comprehensive review of federated learning systems can be found in this paper, where the authors provide a thorough categorization of the existing systems according to six different aspects, including data distribution, machine learning model, privacy mechanism, communication architecture, scale of federation and motivation of federation.
Abstract: Federated learning has been a hot research topic in enabling the collaborative training of machine learning models among different organizations under privacy restrictions. As researchers try to support more machine learning models with different privacy-preserving approaches, there is a need for systems and infrastructures that ease the development of various federated learning algorithms. Similar to deep learning systems such as PyTorch and TensorFlow that boost the development of deep learning, federated learning systems (FLSs) are equally important, and face challenges from various aspects such as effectiveness, efficiency, and privacy. In this survey, we conduct a comprehensive review of federated learning systems. To achieve a smooth flow and guide future research, we introduce the definition of federated learning systems and analyze the system components. Moreover, we provide a thorough categorization of federated learning systems according to six different aspects: data distribution, machine learning model, privacy mechanism, communication architecture, scale of federation, and motivation of federation. The categorization can help the design of federated learning systems, as shown in our case studies. By systematically summarizing the existing federated learning systems, we present the design factors, case studies, and future research opportunities.

305 citations

Proceedings ArticleDOI
Hongxu Yin, Arun Mallya, Arash Vahdat, Jose M. Alvarez, Jan Kautz, Pavlo Molchanov
15 Apr 2021
TL;DR: GradInversion, introduced in this paper, recovers input images from batch-averaged gradients and includes a group consistency regularization framework, where multiple agents starting from different random seeds work together to find an enhanced reconstruction of the original data batch, even for complex datasets, deep networks, and large batch sizes.
Abstract: Training deep neural networks requires gradient estimation from data batches to update parameters. Gradients per parameter are averaged over a set of data, and this has been presumed to be safe for privacy-preserving training in joint, collaborative, and federated learning applications. Prior work only showed the possibility of recovering input data given gradients under very restrictive conditions: a single input point, or a network with no non-linearities, or a small 32 × 32 px input batch. Therefore, averaging gradients over larger batches was thought to be safe. In this work, we introduce GradInversion, with which input images from a larger batch (8 to 48 images) can also be recovered for large networks such as ResNets (50 layers), on complex datasets such as ImageNet (1000 classes, 224 × 224 px). We formulate an optimization task that converts random noise into natural images, matching gradients while regularizing image fidelity. We also propose an algorithm for target class label recovery given gradients. We further propose a group consistency regularization framework, where multiple agents starting from different random seeds work together to find an enhanced reconstruction of the original data batch. We show that gradients encode a surprisingly large amount of information, such that all the individual images can be recovered with high fidelity via GradInversion, even for complex datasets, deep networks, and large batch sizes.
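One self-contained piece of this pipeline is label recovery from the gradient of the final fully connected layer: with cross-entropy, rows belonging to classes present in the batch receive distinctly negative gradient entries. A simplified sketch of that idea, assuming non-repeating labels in the batch; this condenses the paper's restoration rule:

```python
import torch

def recover_labels(fc_weight_grad, batch_size):
    # fc_weight_grad has shape (n_classes, n_features). Cross-entropy pushes
    # the weight rows of classes occurring in the batch downward, so rank
    # classes by the most negative entry in their row and keep the
    # batch_size smallest scores as the recovered labels.
    scores = fc_weight_grad.min(dim=1).values
    return torch.argsort(scores)[:batch_size]
```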

216 citations

Journal ArticleDOI
TL;DR: PriMIA (Privacy-preserving Medical Image Analysis), a free, open-source software framework for differentially private, securely aggregated federated learning and encrypted inference on medical imaging data, is presented.
Abstract: Using large, multi-national datasets for high-performance medical imaging AI systems requires innovation in privacy-preserving machine learning so models can train on sensitive data without requiring data transfer. Here we present PriMIA (Privacy-preserving Medical Image Analysis), a free, open-source software framework for differentially private, securely aggregated federated learning and encrypted inference on medical imaging data. We test PriMIA using a real-life case study in which an expert-level deep convolutional neural network classifies paediatric chest X-rays; the resulting model’s classification performance is on par with locally, non-securely trained models. We theoretically and empirically evaluate our framework’s performance and privacy guarantees, and demonstrate that the protections provided prevent the reconstruction of usable data by a gradient-based model inversion attack. Finally, we successfully employ the trained model in an end-to-end encrypted remote inference scenario using secure multi-party computation to prevent the disclosure of the data and the model.
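One building block named here is easy to illustrate in isolation: before an update leaves a client, differentially private training clips it to a bounded norm and adds calibrated noise. The following generic sketch shows only that clip-and-noise step; it is not PriMIA's actual API, and all names are hypothetical:

```python
import torch

def privatize_update(update, clip_norm=1.0, noise_std=0.1):
    # Clip the whole update (a list of tensors) to an L2 norm bound,
    # then add Gaussian noise. Real DP accounting derives noise_std
    # from a target (epsilon, delta) budget, which is omitted here.
    flat = torch.cat([u.flatten() for u in update])
    scale = min(1.0, clip_norm / (float(flat.norm()) + 1e-12))
    return [u * scale + noise_std * torch.randn_like(u) for u in update]
```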

170 citations

Journal ArticleDOI
26 Jan 2021
TL;DR: This article proposes the first secure aggregation framework, named Turbo-Aggregate, which employs a multi-group circular strategy for efficient model aggregation, and leverages additive secret sharing and novel coding techniques for injecting aggregation redundancy in order to handle user dropouts while guaranteeing user privacy.
Abstract: Federated learning is a distributed framework for training machine learning models over the data residing at mobile devices, while protecting the privacy of individual users. A major bottleneck in scaling federated learning to a large number of users is the overhead of secure model aggregation across many users. In particular, the overhead of the state-of-the-art protocols for secure model aggregation grows quadratically with the number of users. In this article, we propose the first secure aggregation framework, named Turbo-Aggregate, that in a network with $N$ users achieves a secure aggregation overhead of $O(N\log N)$, as opposed to $O(N^2)$, while tolerating a user dropout rate of up to 50%. Turbo-Aggregate employs a multi-group circular strategy for efficient model aggregation, and leverages additive secret sharing and novel coding techniques for injecting aggregation redundancy in order to handle user dropouts while guaranteeing user privacy. We experimentally demonstrate that Turbo-Aggregate achieves a total running time that grows almost linearly in the number of users, and provides up to a $40\times$ speedup over the state-of-the-art protocols with up to $N=200$ users. Our experiments also demonstrate the impact of model size and bandwidth on the performance of Turbo-Aggregate.
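The additive secret sharing this protocol builds on is simple to state: each user splits its update into random-looking shares that sum exactly back to the update, so the aggregator only ever sees sums. A toy real-valued sketch; actual protocols operate over a finite ring with quantized values, which this illustration omits:

```python
import torch

def additive_shares(update, n_shares):
    # Split an update into n_shares pieces: each share alone looks like
    # random noise, but all shares sum exactly to the original update.
    shares = [torch.randn_like(update) for _ in range(n_shares - 1)]
    shares.append(update - sum(shares))
    return shares

# Sanity check: the aggregate is exact even though no single share
# reveals the individual update.
u = torch.tensor([1.0, 2.0, 3.0])
assert torch.allclose(sum(additive_shares(u, 4)), u)
```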

170 citations