Author

Yann Ollivier

Bio: Yann Ollivier is an academic researcher from University of Paris-Sud. The author has contributed to research in topics including Ricci curvature and random groups. The author has an h-index of 19 and has co-authored 38 publications receiving 1780 citations. Previous affiliations of Yann Ollivier include Facebook & École normale supérieure de Lyon.

Papers
Journal ArticleDOI
TL;DR: In this article, the authors define the Ricci curvature of metric spaces in terms of how much small balls are closer (in Wasserstein transportation distance) than their centers are.

728 citations
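
For orientation, the definition at the heart of this paper fits on one line; as usually stated, with W_1 the L1 Wasserstein (transportation) distance and m_x the measure (one step of the random walk, the "small ball") at x:

```latex
% Coarse Ricci curvature along (x, y):
\kappa(x, y) \;=\; 1 - \frac{W_1(m_x, m_y)}{d(x, y)}
```

Positive \kappa(x, y) means the balls m_x and m_y are closer than their centers x and y, exactly as the TL;DR describes.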

Journal ArticleDOI
TL;DR: The authors define a notion of Ricci curvature in metric spaces equipped with a measure or a random walk; the notion is compatible with Bakry-Émery theory and is robust and very easy to implement in concrete examples such as graphs.

177 citations
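
The claim that the notion is "very easy to implement in concrete examples such as graphs" can be illustrated directly. Below is a minimal sketch (not the author's code) that computes \kappa(x, y) = 1 - W_1(m_x, m_y)/d(x, y) for one edge of an unweighted graph, taking m_v to be the uniform measure on the neighbors of v and solving W_1 as an exact transportation linear program; networkx and scipy are assumed available.

```python
import networkx as nx
import numpy as np
from scipy.optimize import linprog

def ollivier_ricci_edge(G, x, y):
    """Coarse Ricci curvature kappa(x, y) = 1 - W1(m_x, m_y) / d(x, y),
    with m_v the uniform measure on the neighbors of v.
    Illustrative sketch; assumes G is connected and unweighted."""
    mx_support = list(G.neighbors(x))
    my_support = list(G.neighbors(y))
    mx = np.full(len(mx_support), 1.0 / len(mx_support))
    my = np.full(len(my_support), 1.0 / len(my_support))

    # Pairwise shortest-path distances between the two supports.
    dist = np.array([[nx.shortest_path_length(G, u, v) for v in my_support]
                     for u in mx_support], dtype=float)

    # W1 as a transportation LP: minimize <dist, coupling> subject to
    # the coupling having marginals mx and my.
    n, m = dist.shape
    A_eq, b_eq = [], []
    for i in range(n):                       # row marginals = mx
        row = np.zeros(n * m); row[i * m:(i + 1) * m] = 1.0
        A_eq.append(row); b_eq.append(mx[i])
    for j in range(m):                       # column marginals = my
        col = np.zeros(n * m); col[j::m] = 1.0
        A_eq.append(col); b_eq.append(my[j])
    res = linprog(dist.ravel(), A_eq=np.array(A_eq), b_eq=np.array(b_eq),
                  bounds=(0, None))
    return 1.0 - res.fun / nx.shortest_path_length(G, x, y)

# Example: the 5-cycle is vertex-transitive, so every edge gives the same value.
print(ollivier_ricci_edge(nx.cycle_graph(5), 0, 1))
```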

Proceedings ArticleDOI
01 Jan 2010
TL;DR: In this paper, the authors present a notion of Ricci curvature valid on arbitrary metric spaces, such as graphs, and generalize a series of classical theorems in positive Ricci curvature, including spectral gap estimates, concentration of measure, and log-Sobolev inequalities.
Abstract: This text is a presentation of the general context and results of [Oll07] and [Oll09], with comments on related work. The goal is to present a notion of Ricci curvature valid on arbitrary metric spaces, such as graphs, and to generalize a series of classical theorems in positive Ricci curvature, such as spectral gap estimates, concentration of measure or log-Sobolev inequalities. The necessary background (concentration of measure, curvature in Riemannian geometry, convergence of Markov chains) is covered in the first section. Special emphasis is put on open questions of varying difficulty.

130 citations
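
Two representative consequences of a uniform lower bound \kappa(x, y) \ge \kappa > 0, paraphrased from this line of work (the exact hypotheses and constants are in [Oll09]):

```latex
% The averaging operator M f(x) = \int f \, dm_x contracts Lipschitz norms,
\operatorname{Lip}(M f) \;\le\; (1 - \kappa)\, \operatorname{Lip}(f),
% which yields a spectral gap of at least \kappa and exponential W_1-mixing:
W_1(\mu M^n, \nu M^n) \;\le\; (1 - \kappa)^n \, W_1(\mu, \nu).
```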

Journal ArticleDOI
TL;DR: In this article, the authors provide nonasymptotic estimates for the rate of convergence of empirical means of Markov chains, together with a Gaussian or exponential control on the deviations of empirical means.
Abstract: We provide explicit nonasymptotic estimates for the rate of convergence of empirical means of Markov chains, together with a Gaussian or exponential control on the deviations of empirical means. These estimates hold under a "positive curvature" assumption expressing a kind of metric ergodicity, which generalizes the Ricci curvature from differential geometry and, on finite graphs, amounts to contraction under path coupling.

118 citations
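
Concretely, the "positive curvature" assumption in this abstract is a uniform Wasserstein contraction of the Markov kernel P, schematically:

```latex
W_1\bigl(P(x, \cdot),\, P(y, \cdot)\bigr) \;\le\; (1 - \kappa)\, d(x, y)
\qquad \text{for all } x, y, \text{ with } \kappa > 0;
% on finite graphs this amounts to contraction under path coupling.
```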


Cited by
Christopher M. Bishop
01 Jan 2006
TL;DR: Probability distributions and linear models for regression and classification are covered in this book, along with a discussion of combining models in the context of machine learning and classification.
Abstract: Contents: Probability Distributions; Linear Models for Regression; Linear Models for Classification; Neural Networks; Kernel Methods; Sparse Kernel Machines; Graphical Models; Mixture Models and EM; Approximate Inference; Sampling Methods; Continuous Latent Variables; Sequential Data; Combining Models.

10,141 citations

01 Jan 2015
TL;DR: This compact, informal introduction for graduate students and advanced undergraduates presents the current state-of-the-art filtering and smoothing methods in a unified Bayesian framework; readers learn what non-linear Kalman filters and particle filters are, how they are related, and their relative advantages and disadvantages.
Abstract: Filtering and smoothing methods are used to produce an accurate estimate of the state of a time-varying system based on multiple observational inputs (data). Interest in these methods has exploded in recent years, with numerous applications emerging in fields such as navigation, aerospace engineering, telecommunications, and medicine. This compact, informal introduction for graduate students and advanced undergraduates presents the current state-of-the-art filtering and smoothing methods in a unified Bayesian framework. Readers learn what non-linear Kalman filters and particle filters are, how they are related, and their relative advantages and disadvantages. They also discover how state-of-the-art Bayesian parameter estimation methods can be combined with state-of-the-art filtering and smoothing algorithms. The book’s practical and algorithmic approach assumes only modest mathematical prerequisites. Examples include MATLAB computations, and the numerous end-of-chapter exercises include computational assignments. MATLAB/GNU Octave source code is available for download at www.cambridge.org/sarkka, promoting hands-on work with the methods.

1,102 citations
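
Since the abstract describes filtering only in general terms, a concrete toy may help; below is a minimal linear Kalman filter predict/update step in Python (the book's own examples use MATLAB/GNU Octave; this sketch is ours, with all names hypothetical).

```python
import numpy as np

def kalman_step(m, P, y, A, Q, H, R):
    """One predict/update cycle of the linear Kalman filter.
    m, P : prior state mean and covariance
    y    : new measurement
    A, Q : state transition matrix and process noise covariance
    H, R : measurement matrix and measurement noise covariance
    Illustrative sketch; notation loosely follows standard texts."""
    # Predict: propagate the state estimate through the dynamics.
    m_pred = A @ m
    P_pred = A @ P @ A.T + Q
    # Update: correct the prediction with the measurement.
    S = H @ P_pred @ H.T + R               # innovation covariance
    K = P_pred @ H.T @ np.linalg.inv(S)    # Kalman gain
    m_new = m_pred + K @ (y - H @ m_pred)
    P_new = (np.eye(len(m)) - K @ H) @ P_pred
    return m_new, P_new

# Toy 1-D random walk observed in noise.
m, P = np.zeros(1), np.eye(1)
A = Q = H = R = np.eye(1)
for y in [0.9, 1.1, 1.0]:
    m, P = kalman_step(m, P, np.array([y]), A, Q, H, R)
print(m)  # estimate pulled toward the observations
```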

Book ChapterDOI
01 Jan 1985
TL;DR: The first group of results centres around Banach's fixed point theorem, a nice result since it contains only one simple condition on the map F, is easy to prove, and nevertheless allows a variety of applications.
Abstract: Formally we have arrived at the middle of the book. So you may need a pause for recovering, a pause which we want to fill up by some fixed point theorems supplementing those which you already met or which you will meet in later chapters. The first group of results centres around Banach’s fixed point theorem. The latter is certainly a nice result since it contains only one simple condition on the map F, since it is so easy to prove and since it nevertheless allows a variety of applications. Therefore it is not astonishing that many mathematicians have been attracted by the question to which extent the conditions on F and the space Ω can be changed so that one still gets the existence of a unique or of at least one fixed point. The number of results produced this way is still finite, but of a statistical magnitude, suggesting at a first glance that only a random sample can be covered by a chapter or even a book of the present size. Fortunately (or unfortunately?) most of the modifications have not found applications up to now, so that there is no reason to write a cookery book about conditions but to write at least a short outline of some ideas indicating that this field can be as interesting as other chapters. A systematic account of more recent ideas and examples in fixed point theory should however be written by one of the true experts. Strange as it is, such a book does not seem to exist though so many people are puzzling out so many results.

994 citations
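
For reference, the standard statement of Banach's fixed point theorem around which the chapter's first group of results centres (textbook form, not a quote from the chapter):

```latex
% If (\Omega, d) is a complete metric space and F : \Omega \to \Omega satisfies
d(F(x), F(y)) \;\le\; q \, d(x, y) \quad \text{for all } x, y \in \Omega,\ 0 \le q < 1,
% then F has a unique fixed point x^*, and the iterates x_{n+1} = F(x_n)
% converge geometrically: d(x_n, x^*) \le \frac{q^n}{1 - q}\, d(x_1, x_0).
```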

Posted Content
TL;DR: This paper proposes the weight-dropped LSTM, which uses DropConnect on hidden-to-hidden weights as a form of recurrent regularization, and introduces NT-ASGD, a variant of the averaged stochastic gradient method, wherein the averaging trigger is determined using a non-monotonic condition as opposed to being tuned by the user.
Abstract: Recurrent neural networks (RNNs), such as long short-term memory networks (LSTMs), serve as a fundamental building block for many sequence learning tasks, including machine translation, language modeling, and question answering. In this paper, we consider the specific problem of word-level language modeling and investigate strategies for regularizing and optimizing LSTM-based models. We propose the weight-dropped LSTM which uses DropConnect on hidden-to-hidden weights as a form of recurrent regularization. Further, we introduce NT-ASGD, a variant of the averaged stochastic gradient method, wherein the averaging trigger is determined using a non-monotonic condition as opposed to being tuned by the user. Using these and other regularization strategies, we achieve state-of-the-art word level perplexities on two data sets: 57.3 on Penn Treebank and 65.8 on WikiText-2. In exploring the effectiveness of a neural cache in conjunction with our proposed model, we achieve an even lower state-of-the-art perplexity of 52.8 on Penn Treebank and 52.0 on WikiText-2.

899 citations
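
The core trick of the weight-dropped LSTM, DropConnect applied to the hidden-to-hidden weight matrix, fits in a few lines. The sketch below (PyTorch assumed; not the authors' released code) re-masks the recurrent weights on every forward pass while training, dropping weights rather than activations.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class WeightDropLSTMCell(nn.Module):
    """A single LSTM cell with DropConnect on the hidden-to-hidden
    weights: a fresh random mask zeroes entries of W_hh on each forward
    pass during training. Minimal sketch of the weight-drop idea."""
    def __init__(self, input_size, hidden_size, weight_drop=0.5):
        super().__init__()
        self.weight_drop = weight_drop
        self.w_ih = nn.Parameter(torch.randn(4 * hidden_size, input_size) * 0.1)
        self.w_hh = nn.Parameter(torch.randn(4 * hidden_size, hidden_size) * 0.1)
        self.bias = nn.Parameter(torch.zeros(4 * hidden_size))

    def forward(self, x, state):
        h, c = state
        # DropConnect: drop recurrent weights, not activations.
        w_hh = F.dropout(self.w_hh, p=self.weight_drop, training=self.training)
        gates = x @ self.w_ih.T + h @ w_hh.T + self.bias
        i, f, g, o = gates.chunk(4, dim=-1)
        c = torch.sigmoid(f) * c + torch.sigmoid(i) * torch.tanh(g)
        h = torch.sigmoid(o) * torch.tanh(c)
        return h, c

# One step on toy data.
cell = WeightDropLSTMCell(input_size=8, hidden_size=16)
x = torch.randn(4, 8)                      # batch of 4
h = c = torch.zeros(4, 16)
h, c = cell(x, (h, c))
print(h.shape)  # torch.Size([4, 16])
```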
