Home
/
Authors
/
Lars Ruthotto

Author

Lars Ruthotto

Other affiliations: University of Lübeck, University of Münster, University of British Columbia

Bio: Lars Ruthotto is an academic researcher from Emory University. The author has contributed to research in topics: Artificial neural network & Inverse problem. The author has an hindex of 20, co-authored 88 publications receiving 2195 citations. Previous affiliations of Lars Ruthotto include University of Lübeck & University of Münster.

Papers published on a yearly basis

2021
2020
2019
2018
2017
2016
2015
2014
2013
2012
2011
2010

Papers

PDF

Open Access

More filters

Journal Article•DOI•

Stable Architectures for Deep Neural Networks

[...]

Eldad Haber¹, Lars Ruthotto²•Institutions (2)

University of British Columbia¹, Emory University²

01 Jan 2018-Inverse Problems

TL;DR: New forward propagation techniques inspired by systems of Ordinary Differential Equations (ODE) are proposed that overcome this challenge and lead to well-posed learning problems for arbitrarily deep networks.

...read moreread less

Abstract: Deep neural networks have become invaluable tools for supervised machine learning, e.g. classification of text or images. While often offering superior results over traditional techniques and successfully expressing complicated patterns in data, deep architectures are known to be challenging to design and train such that they generalize well to new data. Critical issues with deep architectures are numerical instabilities in derivative-based learning algorithms commonly called exploding or vanishing gradients. In this paper, we propose new forward propagation techniques inspired by systems of ordinary differential equations (ODE) that overcome this challenge and lead to well-posed learning problems for arbitrarily deep networks. The backbone of our approach is our interpretation of deep learning as a parameter estimation problem of nonlinear dynamical systems. Given this formulation, we analyze stability and well-posedness of deep learning and use this new understanding to develop new network architectures. We relate the exploding and vanishing gradient phenomenon to the stability of the discrete ODE and present several strategies for stabilizing deep learning for very deep networks. While our new architectures restrict the solution space, several numerical experiments show their competitiveness with state-of-the-art networks.

...read moreread less

474 citations

Journal Article•DOI•

Deep Neural Networks Motivated by Partial Differential Equations

[...]

Lars Ruthotto¹, Eldad Haber²•Institutions (2)

Emory University¹, University of British Columbia²

01 Apr 2020-Journal of Mathematical Imaging and Vision

TL;DR: In this article, a new PDE interpretation of a class of deep convolutional neural networks (CNN) was established, which are commonly used to learn from speech, image, and video data.

...read moreread less

Abstract: Partial differential equations (PDEs) are indispensable for modeling many physical phenomena and also commonly used for solving image processing tasks. In the latter area, PDE-based approaches interpret image data as discretizations of multivariate functions and the output of image processing algorithms as solutions to certain PDEs. Posing image processing problems in the infinite-dimensional setting provides powerful tools for their analysis and solution. For the last few decades, the reinterpretation of classical image processing problems through the PDE lens has been creating multiple celebrated approaches that benefit a vast area of tasks including image segmentation, denoising, registration, and reconstruction. In this paper, we establish a new PDE interpretation of a class of deep convolutional neural networks (CNN) that are commonly used to learn from speech, image, and video data. Our interpretation includes convolution residual neural networks (ResNet), which are among the most promising approaches for tasks such as image classification having improved the state-of-the-art performance in prestigious benchmark challenges. Despite their recent successes, deep ResNets still face some critical challenges associated with their design, immense computational costs and memory requirements, and lack of understanding of their reasoning. Guided by well-established PDE theory, we derive three new ResNet architectures that fall into two new classes: parabolic and hyperbolic CNNs. We demonstrate how PDE theory can provide new insights and algorithms for deep learning and demonstrate the competitiveness of three new CNN architectures using numerical experiments.

...read moreread less

329 citations

Posted Content•

Reversible Architectures for Arbitrarily Deep Residual Neural Networks

[...]

Bo Chang¹, Lili Meng¹, Eldad Haber¹, Lars Ruthotto², David Begert, Elliot Holtham - Show less +2 more•Institutions (2)

University of British Columbia¹, Emory University²

12 Sep 2017-arXiv: Computer Vision and Pattern Recognition

TL;DR: In this article, the authors develop a theoretical framework on stability and reversibility of deep residual networks, and derive three reversible neural network architectures that can go arbitrarily deep in theory, which allows a memory-efficient implementation, which does not need to store the activations for most hidden layers.

...read moreread less

Abstract: Recently, deep residual networks have been successfully applied in many computer vision and natural language processing tasks, pushing the state-of-the-art performance with deeper and wider architectures. In this work, we interpret deep residual networks as ordinary differential equations (ODEs), which have long been studied in mathematics and physics with rich theoretical and empirical success. From this interpretation, we develop a theoretical framework on stability and reversibility of deep neural networks, and derive three reversible neural network architectures that can go arbitrarily deep in theory. The reversibility property allows a memory-efficient implementation, which does not need to store the activations for most hidden layers. Together with the stability of our architectures, this enables training deeper networks using only modest computational resources. We provide both theoretical analyses and empirical results. Experimental results demonstrate the efficacy of our architectures against several strong baselines on CIFAR-10, CIFAR-100 and STL-10 with superior or on-par state-of-the-art performance. Furthermore, we show our architectures yield superior results when trained using fewer training data.

...read moreread less

189 citations

Journal Article•DOI•

Stable Architectures for Deep Neural Networks

[...]

Eldad Haber¹, Lars Ruthotto²•Institutions (2)

University of British Columbia¹, Emory University²

09 May 2017-arXiv: Learning

TL;DR: In this article, the authors propose new forward propagation techniques inspired by systems of Ordinary Differential Equations (ODE) that overcome the numerical instabilities in derivative-based learning algorithms commonly called exploding or vanishing gradients.

...read moreread less

Abstract: Deep neural networks have become invaluable tools for supervised machine learning, e.g., classification of text or images. While often offering superior results over traditional techniques and successfully expressing complicated patterns in data, deep architectures are known to be challenging to design and train such that they generalize well to new data. Important issues with deep architectures are numerical instabilities in derivative-based learning algorithms commonly called exploding or vanishing gradients. In this paper we propose new forward propagation techniques inspired by systems of Ordinary Differential Equations (ODE) that overcome this challenge and lead to well-posed learning problems for arbitrarily deep networks. The backbone of our approach is our interpretation of deep learning as a parameter estimation problem of nonlinear dynamical systems. Given this formulation, we analyze stability and well-posedness of deep learning and use this new understanding to develop new network architectures. We relate the exploding and vanishing gradient phenomenon to the stability of the discrete ODE and present several strategies for stabilizing deep learning for very deep networks. While our new architectures restrict the solution space, several numerical experiments show their competitiveness with state-of-the-art networks.

...read moreread less

162 citations

Journal Article•DOI•

hMRI – A toolbox for quantitative MRI in neuroscience and clinical research

[...]

Karsten Tabelow, Evelyne Balteau¹, John Ashburner, Martina F. Callaghan, Bogdan Draganski², Gunther Helms³, Ferath Kherif², Tobias Leutritz⁴, Antoine Lutti², Christophe Phillips¹, Enrico Reimer⁴, Lars Ruthotto⁵, Maryam Seif⁶, Nikolaus Weiskopf⁴, Gabriel Ziegler⁷, Siawoosh Mohammadi - Show less +12 more•Institutions (7)

University of Liège¹, University of Lausanne², Lund University³, Max Planck Society⁴, Emory University⁵, University of Zurich⁶, Otto-von-Guericke University Magdeburg⁷

01 Jul 2019-NeuroImage

TL;DR: The hMRI-toolbox is introduced, an open-source, easy-to-use tool available on GitHub, for qMRI data handling and processing, and can be readily combined with existing SPM toolboxes for estimating diffusion MRI parameter maps.

...read moreread less

151 citations

1
2
3
4
…
5
6
7
8
9
10
11
12
13
14
15
16
17
18

Collapse

Cited by

PDF

Open Access

More filters

Proceedings Article•DOI•

Random graphs

[...]

Alan Frieze¹•Institutions (1)

Carnegie Mellon University¹

22 Jan 2006

TL;DR: Some of the major results in random graphs and some of the more challenging open problems are reviewed, including those related to the WWW.

...read moreread less

Abstract: We will review some of the major results in random graphs and some of the more challenging open problems. We will cover algorithmic and structural questions. We will touch on newer models, including those related to the WWW.

...read moreread less

7,116 citations

On robust estimation of the location parameter

[...]

Frederick R. Forst

01 Jan 1980

3,652 citations

The Ensemble Kalman Filter: Theoretical formulation and practical implementation

[...]

Geir Evensen¹•Institutions (1)

Remote Sensing Center¹

01 Apr 2003

TL;DR: The EnKF has a large user group, and numerous publications have discussed applications and theoretical aspects of it as mentioned in this paper, and also presents new ideas and alternative interpretations which further explain the success of the EnkF.

...read moreread less

Abstract: The purpose of this paper is to provide a comprehensive presentation and interpretation of the Ensemble Kalman Filter (EnKF) and its numerical implementation. The EnKF has a large user group, and numerous publications have discussed applications and theoretical aspects of it. This paper reviews the important results from these studies and also presents new ideas and alternative interpretations which further explain the success of the EnKF. In addition to providing the theoretical framework needed for using the EnKF, there is also a focus on the algorithmic formulation and optimal numerical implementation. A program listing is given for some of the key subroutines. The paper also touches upon specific issues such as the use of nonlinear measurements, in situ profiles of temperature and salinity, and data which are available with high frequency in time. An ensemble based optimal interpolation (EnOI) scheme is presented as a cost-effective approach which may serve as an alternative to the EnKF in some applications. A fairly extensive discussion is devoted to the use of time correlated model errors and the estimation of model bias.

...read moreread less

2,975 citations

Journal Article•DOI•

An integrated approach to correction for off-resonance effects and subject movement in diffusion MR imaging

[...]

Jesper L. R. Andersson, Stamatios N. Sotiropoulos

15 Jan 2016-NeuroImage

TL;DR: The method is based on registering the individual volumes to a model free prediction of what each volume should look like, thereby enabling its use on high b-value data where the contrast is vastly different in different volumes.

...read moreread less

2,431 citations

Reference Entry•DOI•

IEEE Transactions on Pattern Analysis and Machine Intelligence

[...]

King-Sun Fu

15 Oct 2004

2,118 citations

1
2
3
4
…
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198
199
200

Collapse