Home
/
Authors
/
Sivaram Ambikasaran

Author

Sivaram Ambikasaran

Other affiliations: Courant Institute of Mathematical Sciences, Mercer University, New York University ...read more

Bio: Sivaram Ambikasaran is an academic researcher from Indian Institute of Science. The author has contributed to research in topics: Matrix (mathematics) & Solver. The author has an hindex of 17, co-authored 27 publications receiving 1742 citations. Previous affiliations of Sivaram Ambikasaran include Courant Institute of Mathematical Sciences & Mercer University.

Topics: Matrix (mathematics), Solver, Covariance, Hierarchical matrix, Computer science ...read more

Papers

PDF

Open Access

More filters

Journal Article•DOI•

Fast and scalable Gaussian process modeling with applications to astronomical time series

[...]

Daniel Foreman-Mackey¹, Eric Agol¹, Sivaram Ambikasaran², Ruth Angus³•Institutions (3)

University of Washington¹, Indian Institute of Science², Columbia University³

09 Nov 2017-The Astronomical Journal

TL;DR: In this paper, the covariance function is expressed as a mixture of complex exponentials, without requiring evenly spaced observations or uniform noise, which can be used for probabilistic inference of stellar rotation periods, asteroseismic oscillation spectra and transiting planet parameters.

...read moreread less

Abstract: The growing field of large-scale time domain astronomy requires methods for probabilistic data analysis that are computationally tractable, even with large data sets. Gaussian processes (GPs) are a popular class of models used for this purpose, but since the computational cost scales, in general, as the cube of the number of data points, their application has been limited to small data sets. In this paper, we present a novel method for GPs modeling in one dimension where the computational requirements scale linearly with the size of the data set. We demonstrate the method by applying it to simulated and real astronomical time series data sets. These demonstrations are examples of probabilistic inference of stellar rotation periods, asteroseismic oscillation spectra, and transiting planet parameters. The method exploits structure in the problem when the covariance function is expressed as a mixture of complex exponentials, without requiring evenly spaced observations or uniform noise. This form of covariance arises naturally when the process is a mixture of stochastically driven damped harmonic oscillators-providing a physical motivation for and interpretation of this choice-but we also demonstrate that it can be a useful effective model in some other cases. We present a mathematical description of the method and compare it to existing scalable GP methods. The method is fast and interpretable, with a range of potential applications within astronomical data analysis and beyond. We provide well-tested and documented open-source implementations of this method in C++, Python, and Julia.

...read moreread less

611 citations

Journal Article•DOI•

Fast Direct Methods for Gaussian Processes

[...]

Sivaram Ambikasaran¹, Daniel Foreman-Mackey¹, Leslie Greengard¹, David W. Hogg¹, Michael O'Neil¹ - Show less +1 more•Institutions (1)

New York University¹

01 Feb 2016-IEEE Transactions on Pattern Analysis and Machine Intelligence

TL;DR: In this paper, the authors show that for the most commonly used covariance functions, the matrix $C$ can be hierarchically factored into a product of block low-rank updates of the identity matrix, yielding an $\mathcal {O} (n\,\log^2, n)$ algorithm for inversion.

...read moreread less

Abstract: A number of problems in probability and statistics can be addressed using the multivariate normal (Gaussian) distribution. In the one-dimensional case, computing the probability for a given mean and variance simply requires the evaluation of the corresponding Gaussian density. In the $n$ -dimensional setting, however, it requires the inversion of an $n \times n$ covariance matrix, $C$ , as well as the evaluation of its determinant, $\det (C)$ . In many cases, such as regression using Gaussian processes, the covariance matrix is of the form $C = \sigma ^2 I + K$ , where $K$ is computed using a specified covariance kernel which depends on the data and additional parameters (hyperparameters). The matrix $C$ is typically dense, causing standard direct methods for inversion and determinant evaluation to require $\mathcal {O}(n^3)$ work. This cost is prohibitive for large-scale modeling. Here, we show that for the most commonly used covariance functions, the matrix $C$ can be hierarchically factored into a product of block low-rank updates of the identity matrix, yielding an $\mathcal {O} (n\,\log^2\, n)$ algorithm for inversion. More importantly, we show that this factorization enables the evaluation of the determinant $\det (C)$ , permitting the direct calculation of probabilities in high dimensions under fairly broad assumptions on the kernel defining $K$ . Our fast algorithm brings many problems in marginalization and the adaptation of hyperparameters within practical reach using a single CPU core. The combination of nearly optimal scaling in terms of problem size with high-performance computing resources will permit the modeling of previously intractable problems. We illustrate the performance of the scheme on standard covariance kernels.

...read moreread less

545 citations

Journal Article•DOI•

Fast and scalable Gaussian process modeling with applications to astronomical time series

[...]

Daniel Foreman-Mackey¹, Eric Agol¹, Sivaram Ambikasaran², Ruth Angus³•Institutions (3)

University of Washington¹, Indian Institute of Science², Columbia University³

28 Mar 2017-arXiv: Instrumentation and Methods for Astrophysics

TL;DR: A novel method for Gaussian processes modeling in one dimension where the computational requirements scale linearly with the size of the data set, and is fast and interpretable, with a range of potential applications within astronomical data analysis and beyond.

...read moreread less

Abstract: The growing field of large-scale time domain astronomy requires methods for probabilistic data analysis that are computationally tractable, even with large datasets. Gaussian Processes are a popular class of models used for this purpose but, since the computational cost scales, in general, as the cube of the number of data points, their application has been limited to small datasets. In this paper, we present a novel method for Gaussian Process modeling in one-dimension where the computational requirements scale linearly with the size of the dataset. We demonstrate the method by applying it to simulated and real astronomical time series datasets. These demonstrations are examples of probabilistic inference of stellar rotation periods, asteroseismic oscillation spectra, and transiting planet parameters. The method exploits structure in the problem when the covariance function is expressed as a mixture of complex exponentials, without requiring evenly spaced observations or uniform noise. This form of covariance arises naturally when the process is a mixture of stochastically-driven damped harmonic oscillators -- providing a physical motivation for and interpretation of this choice -- but we also demonstrate that it can be a useful effective model in some other cases. We present a mathematical description of the method and compare it to existing scalable Gaussian Process methods. The method is fast and interpretable, with a range of potential applications within astronomical data analysis and beyond. We provide well-tested and documented open-source implementations of this method in C++, Python, and Julia.

...read moreread less

282 citations

Journal Article•DOI•

An $$\mathcal O (N \log N)$$O(NlogN) Fast Direct Solver for Partial Hierarchically Semi-Separable Matrices

[...]

Sivaram Ambikasaran¹, Eric Darve¹•Institutions (1)

Stanford University¹

01 Dec 2013-Journal of Scientific Computing

TL;DR: The key ingredients behind this fast solver are recursion, efficient low rank factorization using Chebyshev interpolation, and the Sherman–Morrison–Woodbury formula.

...read moreread less

Abstract: This article describes a fast direct solver (i.e., not iterative) for partial hierarchically semi-separable systems. This solver requires a storage of $$\mathcal O (N \log N)$$ O ( N log N ) and has a computational complexity of $$\mathcal O (N \log N)$$ O ( N log N ) arithmetic operations. The numerical benchmarks presented illustrate the method in the context of interpolation using radial basis functions. The key ingredients behind this fast solver are recursion, efficient low rank factorization using Chebyshev interpolation, and the Sherman---Morrison---Woodbury formula. The algorithm and the analysis are worked out in detail. The performance of the algorithm is illustrated for a variety of radial basis functions and target accuracies.

...read moreread less

146 citations

Journal Article•DOI•

A fast block low-rank dense solver with applications to finite-element matrices

[...]

AmirHossein Aminfar¹, Sivaram Ambikasaran², Eric Darve¹•Institutions (2)

Stanford University¹, Mercer University²

01 Jan 2016-Journal of Computational Physics

TL;DR: A fast solver for the dense "frontal" matrices that arise from the multifrontal sparse elimination process of 3D elliptic PDEs, using the HODLR direct solver as a preconditioner to the GMRES iterative scheme to reach machine accuracy much faster than a conventional LU solver.

...read moreread less

112 citations

1
2
3
4
…
5
6
7

Collapse

Cited by

PDF

Open Access

More filters

The Ensemble Kalman Filter: Theoretical formulation and practical implementation

[...]

Geir Evensen¹•Institutions (1)

Remote Sensing Center¹

01 Apr 2003

TL;DR: The EnKF has a large user group, and numerous publications have discussed applications and theoretical aspects of it as mentioned in this paper, and also presents new ideas and alternative interpretations which further explain the success of the EnkF.

...read moreread less

Abstract: The purpose of this paper is to provide a comprehensive presentation and interpretation of the Ensemble Kalman Filter (EnKF) and its numerical implementation. The EnKF has a large user group, and numerous publications have discussed applications and theoretical aspects of it. This paper reviews the important results from these studies and also presents new ideas and alternative interpretations which further explain the success of the EnKF. In addition to providing the theoretical framework needed for using the EnKF, there is also a focus on the algorithmic formulation and optimal numerical implementation. A program listing is given for some of the key subroutines. The paper also touches upon specific issues such as the use of nonlinear measurements, in situ profiles of temperature and salinity, and data which are available with high frequency in time. An ensemble based optimal interpolation (EnOI) scheme is presented as a cost-effective approach which may serve as an alternative to the EnKF in some applications. A fairly extensive discussion is devoted to the use of time correlated model errors and the estimation of model bias.

...read moreread less

2,975 citations

Journal Article•DOI•

Fast Direct Methods for Gaussian Processes

[...]

Sivaram Ambikasaran¹, Daniel Foreman-Mackey¹, Leslie Greengard¹, David W. Hogg¹, Michael O'Neil¹ - Show less +1 more•Institutions (1)

New York University¹

01 Feb 2016-IEEE Transactions on Pattern Analysis and Machine Intelligence

...read moreread less

545 citations

Journal Article•DOI•

Single-pixel three-dimensional imaging with time-based depth resolution

[...]

Ming-Jie Sun¹, Matthew P. Edgar², Graham M. Gibson², Baoqing Sun², Neal Radwell², Robert A. Lamb³, Miles J. Padgett² - Show less +3 more•Institutions (3)

Beihang University¹, University of Glasgow², Selex ES³

05 Jul 2016-Nature Communications

TL;DR: A modified time-of-flight three-dimensional imaging system, which can use compressed sensing techniques to reduce acquisition times, whilst distributing the optical illumination over the full field of view, is shown.

...read moreread less

Abstract: A three-dimensional imaging system which distributes the optical illumination over the full field-of-view is sought after. Here, the authors demonstrate the capability of reconstructing 128 × 128 pixel resolution three-dimensional scenes to an accuracy of 3 mm as well as real-time video with a frame-rate up to 12 Hz.

...read moreread less

409 citations

Journal Article•DOI•

When Gaussian Process Meets Big Data: A Review of Scalable GPs

[...]

Haitao Liu¹, Yew-Soon Ong¹, Xiaobo Shen¹, Jianfei Cai¹•Institutions (1)

Nanyang Technological University¹

07 Jan 2020-IEEE Transactions on Neural Networks

TL;DR: In this article, a review of state-of-the-art scalable Gaussian process regression (GPR) models is presented, focusing on global and local approximations for subspace learning.

...read moreread less

Abstract: The vast quantity of information brought by big data as well as the evolving computer hardware encourages success stories in the machine learning community. In the meanwhile, it poses challenges for the Gaussian process regression (GPR), a well-known nonparametric, and interpretable Bayesian model, which suffers from cubic complexity to data size. To improve the scalability while retaining desirable prediction quality, a variety of scalable GPs have been presented. However, they have not yet been comprehensively reviewed and analyzed to be well understood by both academia and industry. The review of scalable GPs in the GP community is timely and important due to the explosion of data size. To this end, this article is devoted to reviewing state-of-the-art scalable GPs involving two main categories: global approximations that distillate the entire data and local approximations that divide the data for subspace learning. Particularly, for global approximations, we mainly focus on sparse approximations comprising prior approximations that modify the prior but perform exact inference, posterior approximations that retain exact prior but perform approximate inference, and structured sparse approximations that exploit specific structures in kernel matrix; for local approximations, we highlight the mixture/product of experts that conducts model averaging from multiple local experts to boost predictions. To present a complete review, recent advances for improving the scalability and capability of scalable GPs are reviewed. Finally, the extensions and open issues of scalable GPs in various scenarios are reviewed and discussed to inspire novel ideas for future research avenues.

...read moreread less

381 citations

Journal Article•

GPflow: a Gaussian process library using tensorflow

[...]

Alexander G. de G. Matthews¹, Mark van der Wilk¹, Thomas Nickson², Keisuke Fujii³, Alexis Boukouvalas⁴, Pablo León-Villagrá⁵, Zoubin Ghahramani¹, James Hensman⁶ - Show less +4 more•Institutions (6)

University of Cambridge¹, University of Oxford², Kyoto University³, University of Manchester⁴, University of Edinburgh⁵, Lancaster University⁶

01 Jan 2017-Journal of Machine Learning Research

TL;DR: GPflow as discussed by the authors is a Gaussian process library that uses TensorFlow for its core computations and Python for its front end The distinguishing features of GPflow are that it uses variational inference as the primary approximation method.

...read moreread less

Abstract: GPflow is a Gaussian process library that uses TensorFlow for its core computations and Python for its front end The distinguishing features of GPflow are that it uses variational inference as the primary approximation method, provides concise code through the use of automatic differentiation, has been engineered with a particular emphasis on software testing and is able to exploit GPU hardware

...read moreread less

381 citations

1
2
3
4
…
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198
199
200

Collapse