scispace - formally typeset
Search or ask a question
Journal Article•DOI•

Nonparametric Tests for Shift at an Unknown Time Point

01 Oct 1968-Annals of Mathematical Statistics (Institute of Mathematical Statistics)-Vol. 39, Iss: 5, pp 1731-1743
TL;DR: In this article, a nonparametric approach to the problem of testing for a shift in the level of a process occurring at an unknown time point when a fixed number of observations are drawn consecutively in time is presented.
Abstract: This work is an investigation of a nonparametric approach to the problem of testing for a shift in the level of a process occurring at an unknown time point when a fixed number of observations are drawn consecutively in time. We observe successively the independent random variables $X_1, X_2, \cdots, X_N$ which are distributed according to the continuous cdf $F_i, i = 1, 2, \cdots, N$. An upward shift in the level shall be interpreted to mean that the random variables after the change are stochastically larger than those before. Two versions of the testing problem are studied. The first deals with the case when the initial process level is known and the second when it is unknown. In the first case, we make the simplifying assumption that the distributions $F_i$ are symmetric before the shift and introduce the known initial level by saying that the point of symmetry $\gamma_0$ is known. Without loss of generality, we set $\gamma_0 = 0$. Defining a class of cdf's $\mathscr{F}_0 = \{F:F$ continuous, $F$ symmetric about origin$\}$, the problem of detecting an upward shift becomes that of testing the null hypothesis $H_0:F_0 = F_1 = \cdots = F_N,\quad\text{some}\quad F_0 \varepsilon\mathscr{F}_0,$ against the alternative $H_1:F_0 = F_1 = \cdots = F_m > F_{m + 1} = \cdots = F_N,\quad\text{some}\quad F_0 \varepsilon\mathscr{F}_0$ where $m(0 \leqq m \leqq N - 1)$ is unknown and the notation $F_m > F_{m + 1}$ indicates that $X_{m + 1}$ is stochastically larger than $X_m$. For the second situation with unknown initial level, the problem becomes that of testing the null hypothesis $H_0^\ast:F_1 = \cdots = F_N$, against the alternatives $H_1^\ast: F_1 = \cdots = F_m > F_{m + 1} = \cdots = F_N$, where $m(1 \leqq m \leqq N - 1)$ is unknown. Here the distributions are not assumed to be symmetric. The testing problem in the case of known initial level has been considered by Page [11], Chernoff and Zacks [2] and Kander and Zacks [7]. Assuming that the observations are initially from a symmetric distribution with known mean $\gamma_0$, Page proposes a test based on the variables $\operatorname{sgn} (X_i - \gamma_0)$. Chernoff and Zacks assume that the $F_i$ are normal cdf's with constant known variance and they derive a test for shift in the mean through a Bayesian argument. Their approach is extended to the one parameter exponential family of distributions by Kander and Zacks. Except for the test based on signs, all the previous work lies within the framework of a parametric statistics. The second formulation of the testing problem, the case of unknown initial level, has not been treated in detail. The only test proposed thus far is the one derived by Chenoff and Zacks for normal distributions with constant known variance. In both problems, our approach generally is to find optimal invariant tests for certain local shift alternatives and then to examine their properties. Our optimality criterion is the maximization of local average power where the average is over the space of the nuisance parameter $m$ with respect to an arbitrary weighting $\{q_i, i = 1, 2, \cdots, N: q_i \geqq 0, \sum^N_{i = 1} q_i = 1\}$. From the Bayesian viewpoint, $q_i$ may be interpreted as the prior probability that $X_i$ is the first shifted variate. Invariant tests with maximum local average power are derived for the case of known initial level in Section 2 and for the case of unknown initial level in Section 3. In both cases, the tests are distribution-free and they are unbiased for general classes of shift alternatives. They all depend upon the weight function $\{q_i\}$. With uniform weights, certain tests in Section 3 reduce to the standard tests for trend while a degenerate weight function leads to the usual two sample tests. In Section 4, we obtain the asymptotic distributions of the test statistics under local translation alternatives and investigate their Pitman efficiencies. Some small sample powers for normal alternatives are given in Section 5.

Content maybe subject to copyright    Report

Citations
More filters
Journal Article•DOI•
TL;DR: In this article, a class of strongly consistent estimators for the change-point (0, 1) was proposed, which require no knowledge of the functional forms or parametric families of the variables.
Abstract: Consider a sequence of independent random variables $\{X_i: 1 \leq i \leq n\}$ having cdf $F$ for $i \leq \theta n$ and cdf $G$ otherwise. A class of strongly consistent estimators for the change-point $\theta \in (0, 1)$ is proposed. The estimators require no knowledge of the functional forms or parametric families of $F$ and $G$. Furthermore, $F$ and $G$ need not differ in their means (or other measure of location). The only requirement is that $F$ and $G$ differ on a set of positive probability. The proof of consistency provides rates of convergence and bounds on the error probability for the estimators. The estimators are applied to two well-known data sets, in both cases yielding results in close agreement with previous parametric analyses. A simulation study is conducted, showing that the estimators perform well even when $F$ and $G$ share the same mean, variance and skewness.

235 citations

Book Chapter•DOI•
S. Zacks1•
01 Jan 1983
TL;DR: In this article, the authors discuss the sequential procedures of testing and estimation and describe classical and Bayesian approaches to the change-point problem and present Bayesian and maximum likelihood estimation of the location of the shift points.
Abstract: Publisher Summary This chapter discusses fixed sample and the sequential procedures of testing and estimation and describes classical and Bayesian approaches to the change-point problem. It presents Bayesian and maximum likelihood estimation of the location of the shift points. The Bayesian approach is based on modeling the prior distribution of the unknown parameters, adopting a loss function and deriving the estimator, which minimizes the posterior risk. The chapter discusses this approach with an example of a shift in the mean of a normal sequence. The estimators obtained are generally nonlinear complicated functions of the random variables. From the Bayesian point of view, these estimators are optimal. The maximum likelihood estimation of the location parameter of the change point is an attractive alternative to the Bayes estimators. Regression relationship can change at unknown epochs, resulting in different regression regimes that should be detected and identified.

163 citations

Journal Article•DOI•
F. Lombard1•
TL;DR: In this article, procedures based on quadratic form rank statistics to test for one or more changepoints in a series of independent observations are considered, and models incorporating both smooth and abrupt changes are introduced.
Abstract: SUMMARY We consider procedures based on quadratic form rank statistics to test for one or more changepoints in a series of independent observations. Models incorporating both smooth and abrupt changes are introduced. Various test statistics are suggested, their asymptotic null distributions are derived and tables of significance points are given. A Monte Carlo study shows that the asymptotic significance points are applicable to moderately sized samples.

148 citations

References
More filters
Book•
01 Jan 1959
TL;DR: The general decision problem, the Probability Background, Uniformly Most Powerful Tests, Unbiasedness, Theory and First Applications, and UNbiasedness: Applications to Normal Distributions, Invariance, Linear Hypotheses as discussed by the authors.
Abstract: The General Decision Problem.- The Probability Background.- Uniformly Most Powerful Tests.- Unbiasedness: Theory and First Applications.- Unbiasedness: Applications to Normal Distributions.- Invariance.- Linear Hypotheses.- The Minimax Principle.- Multiple Testing and Simultaneous Inference.- Conditional Inference.- Basic Large Sample Theory.- Quadratic Mean Differentiable Families.- Large Sample Optimality.- Testing Goodness of Fit.- General Large Sample Methods.

6,480 citations

Journal Article•DOI•
TL;DR: In this paper, a Bayesian approach is used to estimate the current mean of an object in a given trajectory from a series of observations, and a sequence of tests are designed to locate the last time point of change.
Abstract: : A tracking problem is considered. Observations are taken on the successive positions of an object traveling on a path, and it is desired to estimate its current position. The objective is to arrive at a simple formula which implicitly accounts for possible changes in direction and discounts observations taken before the latest change. To develop a reasonable procedure, a simpler problem is studied. Successive observations are taken on n independently and normally distributed random variables X sub 1, X sub 2, ..., X sub n with means mu sub 1, mu sub 2, ..., mu sub n and variance 1. Each mean mu sub i is equal to the preceding mean mu sub (i-1) except when an occasional change takes place. The object is to estimate the current mean mu sub n. This problem is studied from a Bayesian point of view. An 'ad hoc' estimator is described, which applies a combination of the A.M.O.C. Bayes estimator and a sequence of tests designed to locate the last time point of change. The various estimators are then compared by a Monte Carlo study of samples of size 9. This Bayesian approach seems to be more appropriate for the related problem of testing whether a change in mean has occurred. This test procedure is simpler than that used by Page. The power functions of the two procedures are compared. (Author)

554 citations

Journal Article•DOI•
TL;DR: In this paper, the authors studied the properties of the test statistic T_n, which was proposed by H. Chernoff and S. Zacks to detect shifts in a parameter of a distribution function, occurring at unknown time points between consecutively taken observations.
Abstract: The present study is concerned with the properties of a test statistic proposed by H. Chernoff and S. Zacks [1] to detect shifts in a parameter of a distribution function, occurring at unknown time points between consecutively taken observations. The testing problem we study is confined to a fixed sample size situation, and can be described as follows: Given observations on independent random variables $X_1, \cdots, X_n$, (taken at consecutive time points) which are distributed according to $F(X; \theta_i); \theta_i \varepsilon \Omega$ for all $i = 1, \cdots, n$, one has to test the simple hypothesis: $H_0 : \theta_1 = \cdots = \theta_n = \theta_0$ ($\theta_0$ is known) against the composite alternative: $H_1 : \theta_1 = \cdots = \theta_m = \theta_0 \\ \theta_{m + 1} = \cdots = \theta_n = \theta_0 + \delta;\quad\delta > 0,$ where both the point of change, $m$, and the size of the change, $\delta$, are unknown $(m = 1, \cdots, n - 1), 0 < \delta < \infty$. A Bayesian approach led Chernoff and Zacks in [1] to propose the test statistic $T_n = \sum^{n - 1}_{i = 1} iX_{i + 1}$, for the case of normally distributed random variables. A generalization for random variables, whose distributions belong to the one parameter exponential family, i.e., their density can be represented as $f(x; \theta) = h(x) \exp \lbrack\psi_1(\theta)U(x) + \psi_2(\theta)\rbrack, \theta \varepsilon \Omega$ where $\psi_1(\theta)$ is monotone, yields the test statistic $T_n = \sum^{n - 1}_{i = 1} iU(x_{i + 1})$. In the present paper we study the operating characteristics of the test statistic $T_n$. General conditions are given for the convergence of the distribution of $T_n$, as the sample size grows, to a normal distribution. The rate of convergence is also studied. It was found that the closeness of the distribution function of $T_n$ to the corresponding normal distribution is not satisfactory for the purposes of determining test criteria and values of power functions, in cases of small samples from non-normal distributions. The normal approximation can be improved by considering the first four terms in Edgeworth's asymptotic expansion of the distribution function of $T_n$ (see H. Cramer [2] p. 227). Such an approximation involves the normal distribution, its derivatives and the semi-invariants of $T_n$. The goodness of the approximations based on such an expansion, and that of the simple normal approximation, for small sample situations, were studied for cases where the observed random variables are binomially or exponentially distributed. In order to compare the exact distribution functions of $T_n$ to the approximations, the exact forms of the distributions of $T_n$ in the binomial and exponential cases were derived. As seen in Section 4, these distribution functions are quite involved, especially under the alternative hypothesis. Tables of coefficients are given for assisting the determination of these distributions, under the null hypothesis assumption, in situations of samples whose size is $2 \leqq n \leqq 10$. For samples of size $n \geqq 10$ one can use the normal approximation to the test criterion and obtain good results. The power functions of the test statistic $T_n$, for the binomial and exponential cases, are given in Section 5. The comparison with the values of the power function obtained by the normal approximation is also given. As was shown by Chernoff and Zacks in [1], when $X$ is binomially distributed the power function values of $T_n$ are higher than those of a test statistic proposed by E. S. Page [5], for most of the $m$ values (points of shift) and $\delta$ values (size of shift). A comparative study in which the effectiveness of test procedures based on $T_n$ relative to those based on Page's and other procedures will be given elsewhere for the exponential case, and other distributions of practical interest.

149 citations

Journal Article•DOI•
TL;DR: In this article, a class of rank score tests for the hypothesis $H : \alpha = \beta = 0", is proposed in Section 2. In Section 3 and 4 the limiting distribution of the test statistics is shown to be central under $H, and non-central under a sequence of alternatives tending to the hypothesis at a suitable rate.
Abstract: For testing hypotheses about $\alpha$ and $\beta$ in the linear regression model $Y_j = \alpha + \beta x_j + Z_j$, Brown and Mood [18] have proposed distribution-free tests, based on their median estimates. Daniels [6] has also given a distribution-free test for the hypothesis that the regression parameters have specified values. This latter test is an improvement on the Brown and Mood median procedure, although both are based on the signs of the observations. Recently Hajek [10] constructed rank tests, which are asymptotically most powerful, for testing the hypothesis that $\beta = 0$, while $\alpha$ is regarded as a nuisance parameter. In this paper, a class of rank score tests for the hypothesis $H : \alpha = \beta = 0$, is proposed in Section 2. This class includes as special cases, the Wilcoxon and the normal scores type of tests. In Sections 3 and 4 the limiting distribution of the test statistics is shown to be central $\chi^2$, under $H$, and non-central $\chi^2$, under a sequence of alternatives tending to the hypothesis at a suitable rate. In Section 5, the Pitman efficiency of the proposed tests relative to the classical $F$-test, is proved to be the same as the efficiency of the corresponding rank score tests relative to the $t$-test in the two sample problem.

27 citations