Author

Konstantinos Spiliopoulos

Bio: Konstantinos Spiliopoulos is an academic researcher from Boston University. The author has contributed to research in topics: Stochastic differential equation & Large deviations theory. The author has an h-index of 23 and has co-authored 139 publications receiving 2,439 citations. Previous affiliations of Konstantinos Spiliopoulos include University of Maryland, College Park & Heriot-Watt University.


Papers
Journal ArticleDOI
TL;DR: A deep learning algorithm similar in spirit to Galerkin methods, but with a deep neural network in place of a linear combination of basis functions, is proposed for solving high-dimensional partial differential equations and is implemented for American options in up to 100 dimensions (a minimal sketch of the idea follows this entry).

1,290 citations
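A minimal sketch of the idea described above: train a network on a least-squares PDE residual evaluated at randomly sampled points rather than on a mesh of basis functions. The example assumes PyTorch, uses an illustrative heat equation u_t = u_xx on [0, 1] x [0, 1] rather than the paper's option-pricing problem, and omits boundary terms for brevity; the network size, sampler, and optimizer are arbitrary choices, not the paper's.

# Sketch of a Galerkin-style deep learning PDE solver (assumes PyTorch).
# Illustrative problem: u_t = u_xx on (t, x) in [0, 1] x [0, 1], u(0, x) = sin(pi x).
import torch

torch.manual_seed(0)
net = torch.nn.Sequential(                      # the network replaces a basis expansion
    torch.nn.Linear(2, 64), torch.nn.Tanh(),
    torch.nn.Linear(64, 64), torch.nn.Tanh(),
    torch.nn.Linear(64, 1),
)
opt = torch.optim.Adam(net.parameters(), lr=1e-3)

for step in range(2000):
    # sample interior collocation points instead of building a mesh
    tx = torch.rand(256, 2, requires_grad=True)
    u = net(tx)
    du = torch.autograd.grad(u.sum(), tx, create_graph=True)[0]
    u_t, u_x = du[:, :1], du[:, 1:]
    u_xx = torch.autograd.grad(u_x.sum(), tx, create_graph=True)[0][:, 1:]
    pde_loss = ((u_t - u_xx) ** 2).mean()       # least-squares PDE residual

    x0 = torch.rand(256, 1)                     # initial-condition points at t = 0
    tx0 = torch.cat([torch.zeros_like(x0), x0], dim=1)
    ic_loss = ((net(tx0) - torch.sin(torch.pi * x0)) ** 2).mean()

    loss = pde_loss + ic_loss
    opt.zero_grad(); loss.backward(); opt.step()

In the same spirit, boundary, terminal, or free-boundary conditions would enter as additional penalty terms of the same form; how the paper itself handles the American-option free boundary is not reproduced here.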

Journal ArticleDOI
TL;DR: Machine learning, and in particular neural network models, have revolutionized fields such as image, text, and speech recognition, and many important real-world applications in these areas are now driven by neural networks.
Abstract: Machine learning, and in particular neural network models, have revolutionized fields such as image, text, and speech recognition. Today, many important real-world applications in these areas are driven by neural networks …

125 citations

Posted Content
TL;DR: In this paper, the central limit theorem for neural networks with a single hidden layer was proved in the asymptotic regime of simultaneously (a) large numbers of hidden units and (b) large numbers of stochastic gradient descent training iterations.
Abstract: We rigorously prove a central limit theorem for neural network models with a single hidden layer. The central limit theorem is proven in the asymptotic regime of simultaneously (A) large numbers of hidden units and (B) large numbers of stochastic gradient descent training iterations. Our result describes the neural network's fluctuations around its mean-field limit. The fluctuations have a Gaussian distribution and satisfy a stochastic partial differential equation. The proof relies upon weak convergence methods from stochastic analysis. In particular, we prove relative compactness for the sequence of processes and uniqueness of the limiting process in a suitable Sobolev space.

106 citations
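In symbols, the scaling described in this abstract can be stated as follows; the notation is illustrative rather than the paper's own, with \mu^N_t the empirical measure of the N hidden-unit parameters, \bar{\mu}_t its mean-field limit, and the usual \sqrt{N} centering and scaling of mean-field central limit theorems (the paper's precise normalization may differ):

% illustrative notation: \mu^N_t = empirical measure of the N hidden-unit parameters,
% \bar{\mu}_t = its mean-field (law-of-large-numbers) limit
\eta^N_t \;=\; \sqrt{N}\,\bigl(\mu^N_t - \bar{\mu}_t\bigr),
\qquad \eta^N \;\Rightarrow\; \eta \quad \text{as } N \to \infty,

where, as the abstract states, the limiting fluctuation process \eta is Gaussian and satisfies a stochastic partial differential equation.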

Posted Content
TL;DR: It is rigorously proved that the empirical distribution of the neural network parameters converges to the solution of a nonlinear partial differential equation; this result can be viewed as a law of large numbers for neural networks.
Abstract: Machine learning, and in particular neural network models, have revolutionized fields such as image, text, and speech recognition. Today, many important real-world applications in these areas are driven by neural networks. There are also growing applications in engineering, robotics, medicine, and finance. Despite their immense success in practice, there is limited mathematical understanding of neural networks. This paper illustrates how neural networks can be studied via stochastic analysis, and develops approaches for addressing some of the technical challenges which arise. We analyze one-layer neural networks in the asymptotic regime of simultaneously (A) large network sizes and (B) large numbers of stochastic gradient descent training iterations. We rigorously prove that the empirical distribution of the neural network parameters converges to the solution of a nonlinear partial differential equation. This result can be considered a law of large numbers for neural networks. In addition, a consequence of our analysis is that the trained parameters of the neural network asymptotically become independent, a property which is commonly called "propagation of chaos".

80 citations
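A toy numerical illustration of the regime described in this abstract, not the paper's construction: a one-layer network with N hidden units and 1/N output scaling, trained by plain SGD on an arbitrary regression target, with the empirical distribution of the trained parameters as the object of interest. The variable names, the target function, and the learning-rate scaling are illustrative choices.

# Toy illustration (NumPy) of the mean-field regime: a one-layer network
# f(x) = (1/N) * sum_i c_i * tanh(w_i * x), trained by SGD; the object of
# interest is the empirical distribution of the parameter pairs (c_i, w_i).
import numpy as np

rng = np.random.default_rng(0)
N, lr, steps = 1000, 0.1, 50000          # hidden units, step size, SGD iterations
c = rng.normal(size=N)
w = rng.normal(size=N)

def predict(x):
    # one-layer network with 1/N output scaling (the mean-field normalization)
    return np.mean(c * np.tanh(w * x))

for _ in range(steps):
    x = rng.uniform(-2.0, 2.0)            # one sample per SGD step
    y = np.sin(2.0 * x)                   # illustrative regression target
    err = predict(x) - y
    act = np.tanh(w * x)
    # SGD step; the O(1) per-parameter step size stands in for the N-scaled
    # learning rate of the mean-field analysis (an illustrative choice)
    dc = lr * err * act
    dw = lr * err * c * (1.0 - act**2) * x
    c -= dc
    w -= dw

# empirical distribution of the trained parameters
params = np.stack([c, w], axis=1)
print(params.mean(axis=0), params.std(axis=0))

The abstract's law of large numbers says that, as N grows, the empirical measure of these (c_i, w_i) pairs converges to a deterministic limit characterized by a nonlinear PDE, and that the pairs become asymptotically independent ("propagation of chaos").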

Journal ArticleDOI
TL;DR: In this article, the central limit theorem for neural networks with a single hidden layer was proved in the asymptotic regime of simultaneously (a) large numbers of hidden units and (b) large numbers of stochastic gradient descent training iterations.

78 citations


Cited by
Journal ArticleDOI


08 Dec 2001-BMJ
TL;DR: A reflection on the ethereal nature of i, the square root of minus one, which at first acquaintance seems an odd beast: an intruder hovering on the edge of reality.
Abstract: There is, I think, something ethereal about i —the square root of minus one. I remember first hearing about it at school. It seemed an odd beast at that time—an intruder hovering on the edge of reality. Usually familiarity dulls this sense of the bizarre, but in the case of i it was the reverse: over the years the sense of its surreal nature intensified. It seemed that it was impossible to write mathematics that described the real world in …

33,785 citations

Journal ArticleDOI
TL;DR: A bibliographic notice of Billingsley's Convergence of Probability Measures (Wiley, 1968), the classic monograph on the weak convergence of probability measures.
Abstract: Convergence of Probability Measures. By P. Billingsley. Chichester, Sussex, Wiley, 1968. xii, 253 p. 9 1/4“. 117s.

5,689 citations

Book ChapterDOI
01 Jan 2011
TL;DR: Weak convergence methods in metric spaces are developed in this book, with applications sufficient to show their power and utility; the results of the first three chapters are used in Chapter 4 to derive a variety of limit theorems for dependent sequences of random variables.
Abstract: The author's preface gives an outline: "This book is about weak convergence methods in metric spaces, with applications sufficient to show their power and utility. The Introduction motivates the definitions and indicates how the theory will yield solutions to problems arising outside it. Chapter 1 sets out the basic general theorems, which are then specialized in Chapter 2 to the space C[0, 1] of continuous functions on the unit interval and in Chapter 3 to the space D[0, 1] of functions with discontinuities of the first kind. The results of the first three chapters are used in Chapter 4 to derive a variety of limit theorems for dependent sequences of random variables." The book develops and expands on Donsker's 1951 and 1952 papers on the invariance principle and empirical distributions. The basic random variables remain real-valued although, of course, measures on C[0, 1] and D[0, 1] are vitally used. Within this framework, there are various possibilities for a different and apparently better treatment of the material. More of the general theory of weak convergence of probabilities on separable metric spaces would be useful. Metrizability of the convergence is not brought up until late in the Appendix. The close relation of the Prokhorov metric and a metric for convergence in probability is (hence) not mentioned (see V. Strassen, Ann. Math. Statist. 36 (1965), 423-439; the reviewer, ibid. 39 (1968), 1563-1572). This relation would illuminate and organize such results as Theorems 4.1, 4.2 and 4.4, which give isolated, ad hoc connections between weak convergence of measures and nearness in probability. In the middle of p. 16, it should be noted that C*(S) consists of signed measures which need only be finitely additive if S is not compact. On p. 239, where the author twice speaks of separable subsets having nonmeasurable cardinal, he means "discrete" rather than "separable." Theorem 1.4 is Ulam's theorem that a Borel probability on a complete separable metric space is tight. Theorem 1 of Appendix 3 weakens completeness to topological completeness. After mentioning that probabilities on the rationals are tight, the author says it is an …

3,554 citations
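For reference, the notion this review keeps returning to, weak convergence of probability measures on a metric space S, has the standard textbook formulation below; this is general background, not a quotation from the book or the review.

% weak convergence of probability measures P_n to P on a metric space S
P_n \Rightarrow P
\quad\Longleftrightarrow\quad
\int_S f \, dP_n \;\longrightarrow\; \int_S f \, dP
\quad \text{for every bounded continuous } f : S \to \mathbb{R}.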