Optimal Detection of Changepoints With a Linear Computational Cost
Reads0
Chats0
TLDR
This work considers the problem of detecting multiple changepoints in large data sets and introduces a new method for finding the minimum of such cost functions and hence the optimal number and location of changepoints that has a computational cost which is linear in the number of observations.Abstract:
In this article, we consider the problem of detecting multiple changepoints in large datasets. Our focus is on applications where the number of changepoints will increase as we collect more data: for example, in genetics as we analyze larger regions of the genome, or in finance as we observe time series over longer periods. We consider the common approach of detecting changepoints through minimizing a cost function over possible numbers and locations of changepoints. This includes several established procedures for detecting changing points, such as penalized likelihood and minimum description length. We introduce a new method for finding the minimum of such cost functions and hence the optimal number and location of changepoints that has a computational cost, which, under mild conditions, is linear in the number of observations. This compares favorably with existing methods for the same problem whose computational cost can be quadratic or even cubic. In simulation studies, we show that our new method can...read more
Citations
More filters
Book ChapterDOI
Computational outlier detection methods in sliced inverse regression
Hadrien Lorenzo,Jérôme Saracco +1 more
TL;DR: In this article, three outlier detection methods are proposed and their numerical behaviors are illustrated on a simulated sample. But they use IB (in-bags) or OOB (out-of-bag) prediction errors from subsampling or resampling approaches.
Posted Content
Measures of Model Risk in Continuous-time Finance Models
TL;DR: This work investigates the impact of parameter estimation risk and model specification risk on the models'ability to capture the joint dynamics of stock and option prices, and proposes expected shortfall type model risk measures applied to Levy jump models and affine jump-diffusion models.
Journal ArticleDOI
Seeded binary segmentation: a general methodology for fast and optimal changepoint detection
TL;DR: In this paper , a deterministic construction of background intervals, called seeded intervals, in which single change points are searched, is proposed, and the final selection of change points based on the candidates from seeded intervals can be done in various ways, adapted to the problem at hand.
Posted ContentDOI
Pooled CRISPR Inverse PCR sequencing (PCIP-seq): simultaneous sequencing of retroviral insertion points and the integrated provirus with long reads
Maria Artesi,Maria Artesi,Vincent Hahaut,Vincent Hahaut,Basiel Cole,Laurens Lambrechts,Laurens Lambrechts,Fereshteh Ashrafi,Fereshteh Ashrafi,Ambroise Marçais,Olivier Hermine,Philip J. Griebel,Natasa Arsic,Frank van der Meer,Arsène Burny,Dominique Bron,Elettra Bianchi,Philippe Delvenne,Vincent Bours,Carole Charlier,Michel Georges,Linos Vandekerkhove,Anne Van den Broeke,Anne Van den Broeke,Keith Durkin,Keith Durkin +25 more
TL;DR: Pooled CRISPR Inverse PCR sequencing (PCIP-seq) is developed, a method that leverages long reads on the Oxford Nanopore MinION platform to sequence the insertion site and its associated provirus and uncovered evidence of viral hypermutation, recombination and recurrent selection.
Journal ArticleDOI
An Examination of the Recent Stability of Ozonesonde Global Network Data
TL;DR: In this article , the authors provide a comprehensive examination of global ozonesonde network data stability and accuracy since 2004 in light of the sudden post-2013 TCO "dropoff" of ∼3-4% that was reported previously at select stations.
References
More filters
Journal ArticleDOI
A new look at the statistical model identification
TL;DR: In this article, a new estimate minimum information theoretical criterion estimate (MAICE) is introduced for the purpose of statistical identification, which is free from the ambiguities inherent in the application of conventional hypothesis testing procedure.
Journal ArticleDOI
Estimating the Dimension of a Model
TL;DR: In this paper, the problem of selecting one of a number of models of different dimensions is treated by finding its Bayes solution, and evaluating the leading terms of its asymptotic expansion.
Estimating the dimension of a model
TL;DR: In this paper, the problem of selecting one of a number of models of different dimensions is treated by finding its Bayes solution, and evaluating the leading terms of its asymptotic expansion.
Journal ArticleDOI
A Cluster Analysis Method for Grouping Means in the Analysis of Variance
A. J. Scott,M. Knott +1 more
TL;DR: In this paper, the authors used the techniques of cluster analysis to split the treatments into reasonably homogeneous groups and developed a likelihood ratio test for judging the significance of differences among the resulting groups.
Related Papers (5)
A Cluster Analysis Method for Grouping Means in the Analysis of Variance
A. J. Scott,M. Knott +1 more