scispace - formally typeset
Open AccessJournal ArticleDOI

Optimal Detection of Changepoints With a Linear Computational Cost

Reads0
Chats0
TLDR
This work considers the problem of detecting multiple changepoints in large data sets and introduces a new method for finding the minimum of such cost functions and hence the optimal number and location of changepoints that has a computational cost which is linear in the number of observations.
Abstract
In this article, we consider the problem of detecting multiple changepoints in large datasets. Our focus is on applications where the number of changepoints will increase as we collect more data: for example, in genetics as we analyze larger regions of the genome, or in finance as we observe time series over longer periods. We consider the common approach of detecting changepoints through minimizing a cost function over possible numbers and locations of changepoints. This includes several established procedures for detecting changing points, such as penalized likelihood and minimum description length. We introduce a new method for finding the minimum of such cost functions and hence the optimal number and location of changepoints that has a computational cost, which, under mild conditions, is linear in the number of observations. This compares favorably with existing methods for the same problem whose computational cost can be quadratic or even cubic. In simulation studies, we show that our new method can...

read more

Citations
More filters
Journal ArticleDOI

Distributional change of monthly precipitation due to climate change:comprehensive examination of dataset in southeastern United States

TL;DR: In this paper, a number of watersheds are selected from the Hydro-Climate Data Network over southeastern United States to examine possible changes in hydrological time series, e.g. precipitation, introduced by changing climate.
Journal Article

Constrained Dynamic Programming and Supervised Penalty Learning Algorithms for Peak Detection in Genomic Data

TL;DR: This work proposes the first dynamic programming algorithm that is guaranteed to compute the optimal solution to changepoint detection problems with constraints between adjacent segment mean parameters, and achieves state-of-the-art accuracy in all data sets.
DissertationDOI

Enhancing the information content of geophysical data for nuclear site characterisation

Michael Tso
TL;DR: This thesis examines various aspects of uncertainty in ERT and develops new methods to better use geophysical data quantitatively and proposes that the various steps in the general workflow of an ERT study can be viewed as a pipeline for information and uncertainty propagation and suggested some areas have been understudied.
Journal ArticleDOI

Applications of statistical process control in the management of unaccounted for gas

TL;DR: It is demonstrated that significant practical advantages can be gained by implementing statistical monitoring in the balancing process and the importance of using multiple such measures is highlighted, in addition to a thorough analysis of the UAG time series focusing on the presence of seasonality.
Journal ArticleDOI

The use of scaling properties to detect relevant changes in financial time series: A new visual warning tool

TL;DR: In this article, the dynamical evolution of multiscaling in financial time series is investigated using time-dependent Generalized Hurst Exponents (GHE), H q, for various values of the parameter q.
References
More filters
Journal ArticleDOI

A new look at the statistical model identification

TL;DR: In this article, a new estimate minimum information theoretical criterion estimate (MAICE) is introduced for the purpose of statistical identification, which is free from the ambiguities inherent in the application of conventional hypothesis testing procedure.
Journal ArticleDOI

Estimating the Dimension of a Model

TL;DR: In this paper, the problem of selecting one of a number of models of different dimensions is treated by finding its Bayes solution, and evaluating the leading terms of its asymptotic expansion.

Estimating the dimension of a model

TL;DR: In this paper, the problem of selecting one of a number of models of different dimensions is treated by finding its Bayes solution, and evaluating the leading terms of its asymptotic expansion.
Journal ArticleDOI

A Cluster Analysis Method for Grouping Means in the Analysis of Variance

A. J. Scott, +1 more
- 01 Sep 1974 - 
TL;DR: In this paper, the authors used the techniques of cluster analysis to split the treatments into reasonably homogeneous groups and developed a likelihood ratio test for judging the significance of differences among the resulting groups.
Related Papers (5)