scispace - formally typeset
Search or ask a question
Journal ArticleDOI

A Measure of Asymptotic Efficiency for Tests of a Hypothesis Based on the sum of Observations

01 Dec 1952-Annals of Mathematical Statistics (Institute of Mathematical Statistics)-Vol. 23, Iss: 4, pp 493-507
TL;DR: In this paper, it was shown that the likelihood ratio test for fixed sample size can be reduced to this form, and that for large samples, a sample of size $n$ with the first test will give about the same probabilities of error as a sample with the second test.
Abstract: In many cases an optimum or computationally convenient test of a simple hypothesis $H_0$ against a simple alternative $H_1$ may be given in the following form. Reject $H_0$ if $S_n = \sum^n_{j=1} X_j \leqq k,$ where $X_1, X_2, \cdots, X_n$ are $n$ independent observations of a chance variable $X$ whose distribution depends on the true hypothesis and where $k$ is some appropriate number. In particular the likelihood ratio test for fixed sample size can be reduced to this form. It is shown that with each test of the above form there is associated an index $\rho$. If $\rho_1$ and $\rho_2$ are the indices corresponding to two alternative tests $e = \log \rho_1/\log \rho_2$ measures the relative efficiency of these tests in the following sense. For large samples, a sample of size $n$ with the first test will give about the same probabilities of error as a sample of size $en$ with the second test. To obtain the above result, use is made of the fact that $P(S_n \leqq na)$ behaves roughly like $m^n$ where $m$ is the minimum value assumed by the moment generating function of $X - a$. It is shown that if $H_0$ and $H_1$ specify probability distributions of $X$ which are very close to each other, one may approximate $\rho$ by assuming that $X$ is normally distributed.
Citations
More filters
ReportDOI
01 Jan 1991
TL;DR: An integrated network must attempt to meet the needs of all the applications, which include digital television, digital audio, and facsimile transmission, and in general multimedia applications.
Abstract: As computer communication evolves into the twenty-first century high bandwidth low delay communication is becoming the norm rather than the exception. This is leading to data networks being used by applications which previously had restricted themselves to specialized networks. These applications include digital television, digital audio, and facsimile transmission, and in general multimedia (including continuous-media) applications. The quality of service (QOS) [Leiner 88] expected from the network by these applications varies over a wide range: some are sensitive to delays experienced in the communication network, others are sensitive to loss rates, while yet others are sensitive to delay variations. An integrated network, which aims to support all these services, must attempt to meet the needs of all the applications.

33 citations

Journal ArticleDOI
TL;DR: In this article, interactive Markov chains are studied in both discrete and continuous time and it is proved that the Markovian processes converge to a deterministic process almost surely as the population size becomes infinite and the normalized process is asymptotically normal with specified mean vector and covariance matrix.
Abstract: A class of models called interactive Markov chains is studied in both discrete and continuous time These models were introduced by Conlisk and serve as a rich class for sociological modeling, because they allow for interactions among individuals In discrete time, it is proved that the Markovian processes converge to a deterministic process almost surely as the population size becomes infinite More importantly, the normalized process is shown to be asymptotically normal with specified mean vector and covariance matrix In continuous time, the chain is shown to converge weakly to a diffusion process with specified drift and scale terms The distributional results will allow for the construction of a likelihood function from interactive Markov chain data, so these results will be important for questions of statistical inference An example from manpower planning is given which indicates the use of this theory in constructing and evaluating control policies for certain social systems

33 citations

Journal ArticleDOI
TL;DR: DENDIS is a new algorithm that combines the best of the available techniques in such a way that tractability is actually improved with a user friendly parameter setting.
Abstract: To deal with large datasets, sampling can be used as a preprocessing step for clustering. In this paper, an hybrid sampling algorithm is proposed. It is density-based while managing distance concepts to ensure space coverage and fit cluster shapes. At each step a new item is added to the sample: it is chosen as the furthest from the representative in the most important group. A constraint on the hyper volume induced by the samples avoids over sampling in high density areas. The inner structure allows for internal optimization: only a few distances have to be computed. The algorithm behavior is investigated using synthetic and real-world data sets and compared to alternative approaches, at conceptual and empirical levels. The numerical experiments proved it is more parsimonious, faster and more accurate, according to the Rand Index, with both k-means and hierarchical clustering algorithms.

33 citations


Cites background from "A Measure of Asymptotic Efficiency ..."

  • ...The results are interesting from a theoretical point of view (Chernoff, 1952), but they tend to overestimate the sample size in non worst-case situations....

    [...]

Proceedings ArticleDOI
01 Oct 2016
TL;DR: In this paper, it was shown that the frequency gap is not necessary to estimate the signal as a whole, and an algorithm was proposed to estimate a Fourier-sparse signal with a constant factor growth of the noise and sample complexity polynomial in k and logarithmic in the bandwidth and signal to noise ratio.
Abstract: We consider the problem of estimating a Fourier-sparse signal from noisy samples, where the sampling is done over some interval [0, T] and the frequencies can be "off-grid". Previous methods for this problem required the gap between frequencies to be above 1/T, the threshold required to robustly identify individual frequencies. We show the frequency gap is not necessary to estimate the signal as a whole: for arbitrary k-Fourier-sparse signals under l2 bounded noise, we show how to estimate the signal with a constant factor growth of the noise and sample complexity polynomial in k and logarithmic in the bandwidth and signal-to-noise ratio. As a special case, we get an algorithm to interpolate degree d polynomials from noisy measurements, using O(d) samples and increasing the noise by a constant factor in l2.

33 citations

Journal ArticleDOI
TL;DR: A confidence sequence is a sequence of confidence intervals that is uniformly valid over an unbounded time horizon as discussed by the authors, and confidence sequences whose widths go to zero, with nonasymptotic coverage guarantees under nonparametric conditions.
Abstract: A confidence sequence is a sequence of confidence intervals that is uniformly valid over an unbounded time horizon. Our work develops confidence sequences whose widths go to zero, with nonasymptotic coverage guarantees under nonparametric conditions. We draw connections between the Cramer-Chernoff method for exponential concentration, the law of the iterated logarithm (LIL), and the sequential probability ratio test---our confidence sequences are time-uniform extensions of the first; provide tight, nonasymptotic characterizations of the second; and generalize the third to nonparametric settings, including sub-Gaussian and Bernstein conditions, self-normalized processes, and matrix martingales. We illustrate the generality of our proof techniques by deriving an empirical-Bernstein bound growing at a LIL rate, as well as a novel upper LIL for the maximum eigenvalue of a sum of random matrices. Finally, we apply our methods to covariance matrix estimation and to estimation of sample average treatment effect under the Neyman-Rubin potential outcomes model.

33 citations

References
More filters