Optimal Detection of Changepoints With a Linear Computational Cost

doi:10.1080/01621459.2012.737745

Open AccessJournal ArticleDOI

Optimal Detection of Changepoints With a Linear Computational Cost

Rebecca Killick, +2 more

- 17 Oct 2012 -

Journal of the American Statistical Asso...

- Vol. 107, Iss: 500, pp 1590-1598

Chats0

TLDR

This work considers the problem of detecting multiple changepoints in large data sets and introduces a new method for finding the minimum of such cost functions and hence the optimal number and location of changepoints that has a computational cost which is linear in the number of observations.

Abstract:

In this article, we consider the problem of detecting multiple changepoints in large datasets. Our focus is on applications where the number of changepoints will increase as we collect more data: for example, in genetics as we analyze larger regions of the genome, or in finance as we observe time series over longer periods. We consider the common approach of detecting changepoints through minimizing a cost function over possible numbers and locations of changepoints. This includes several established procedures for detecting changing points, such as penalized likelihood and minimum description length. We introduce a new method for finding the minimum of such cost functions and hence the optimal number and location of changepoints that has a computational cost, which, under mild conditions, is linear in the number of observations. This compares favorably with existing methods for the same problem whose computational cost can be quadratic or even cubic. In simulation studies, we show that our new method can...

Citations

PDF

Open Access

More filters

Book ChapterDOI

Computational outlier detection methods in sliced inverse regression

Hadrien Lorenzo, +1 more

TL;DR: In this article, three outlier detection methods are proposed and their numerical behaviors are illustrated on a simulated sample. But they use IB (in-bags) or OOB (out-of-bag) prediction errors from subsampling or resampling approaches.

...read moreread less

Posted Content

Measures of Model Risk in Continuous-time Finance Models

Emese Lazar, +2 more

- 16 Oct 2020 -

arXiv: Econometrics

TL;DR: This work investigates the impact of parameter estimation risk and model specification risk on the models'ability to capture the joint dynamics of stock and option prices, and proposes expected shortfall type model risk measures applied to Levy jump models and affine jump-diffusion models.

...read moreread less

Journal ArticleDOI

Seeded binary segmentation: a general methodology for fast and optimal changepoint detection

- 03 Oct 2022 -

Biometrika

TL;DR: In this paper , a deterministic construction of background intervals, called seeded intervals, in which single change points are searched, is proposed, and the final selection of change points based on the candidates from seeded intervals can be done in various ways, adapted to the problem at hand.

...read moreread less

Posted ContentDOI

Pooled CRISPR Inverse PCR sequencing (PCIP-seq): simultaneous sequencing of retroviral insertion points and the integrated provirus with long reads

Maria Artesi, +25 more

- 06 Dec 2019 -

bioRxiv

TL;DR: Pooled CRISPR Inverse PCR sequencing (PCIP-seq) is developed, a method that leverages long reads on the Oxford Nanopore MinION platform to sequence the insertion site and its associated provirus and uncovered evidence of viral hypermutation, recombination and recurrent selection.

...read moreread less

Journal ArticleDOI

An Examination of the Recent Stability of Ozonesonde Global Network Data

- 01 Oct 2022 -

Earth and Space Science

TL;DR: In this article , the authors provide a comprehensive examination of global ozonesonde network data stability and accuracy since 2004 in light of the sudden post-2013 TCO "dropoff" of ∼3-4% that was reported previously at select stations.

...read moreread less

Collapse

References

PDF

Open Access

More filters

Journal ArticleDOI

A new look at the statistical model identification

Hirotugu Akaike

- 01 Dec 1974 -

IEEE Transactions on Automatic Control

TL;DR: In this article, a new estimate minimum information theoretical criterion estimate (MAICE) is introduced for the purpose of statistical identification, which is free from the ambiguities inherent in the application of conventional hypothesis testing procedure.

...read moreread less

Journal ArticleDOI

Estimating the Dimension of a Model

Gideon Schwarz

- 01 Mar 1978 -

Annals of Statistics

TL;DR: In this paper, the problem of selecting one of a number of models of different dimensions is treated by finding its Bayes solution, and evaluating the leading terms of its asymptotic expansion.

...read moreread less

Estimating the dimension of a model

Gideon Schwarz

TL;DR: In this paper, the problem of selecting one of a number of models of different dimensions is treated by finding its Bayes solution, and evaluating the leading terms of its asymptotic expansion.

...read moreread less

Journal ArticleDOI

A Cluster Analysis Method for Grouping Means in the Analysis of Variance

A. J. Scott, +1 more

- 01 Sep 1974 -

Biometrics

TL;DR: In this paper, the authors used the techniques of cluster analysis to split the treatments into reasonably homogeneous groups and developed a likelihood ratio test for judging the significance of differences among the resulting groups.

...read moreread less

Book

Applied Dynamic Programming

Richard Bellman, +1 more

Collapse

Optimal Detection of Changepoints With a Linear Computational Cost

Citations

Computational outlier detection methods in sliced inverse regression

Measures of Model Risk in Continuous-time Finance Models

Seeded binary segmentation: a general methodology for fast and optimal changepoint detection

Pooled CRISPR Inverse PCR sequencing (PCIP-seq): simultaneous sequencing of retroviral insertion points and the integrated provirus with long reads

An Examination of the Recent Stability of Ozonesonde Global Network Data

References

A new look at the statistical model identification

Estimating the Dimension of a Model

Estimating the dimension of a model

A Cluster Analysis Method for Grouping Means in the Analysis of Variance

Applied Dynamic Programming

Related Papers (5)

changepoint: An R Package for Changepoint Analysis

A Cluster Analysis Method for Grouping Means in the Analysis of Variance

Using penalized contrasts for the change-point problem

Circular binary segmentation for the analysis of array-based DNA copy number data.

Continuous inspection schemes