scispace - formally typeset
Open AccessJournal ArticleDOI

A Measure of Asymptotic Efficiency for Tests of a Hypothesis Based on the sum of Observations

Herman Chernoff
- 01 Dec 1952 - 
- Vol. 23, Iss: 4, pp 493-507
Reads0
Chats0
TLDR
In this paper, it was shown that the likelihood ratio test for fixed sample size can be reduced to this form, and that for large samples, a sample of size $n$ with the first test will give about the same probabilities of error as a sample with the second test.
Abstract
In many cases an optimum or computationally convenient test of a simple hypothesis $H_0$ against a simple alternative $H_1$ may be given in the following form. Reject $H_0$ if $S_n = \sum^n_{j=1} X_j \leqq k,$ where $X_1, X_2, \cdots, X_n$ are $n$ independent observations of a chance variable $X$ whose distribution depends on the true hypothesis and where $k$ is some appropriate number. In particular the likelihood ratio test for fixed sample size can be reduced to this form. It is shown that with each test of the above form there is associated an index $\rho$. If $\rho_1$ and $\rho_2$ are the indices corresponding to two alternative tests $e = \log \rho_1/\log \rho_2$ measures the relative efficiency of these tests in the following sense. For large samples, a sample of size $n$ with the first test will give about the same probabilities of error as a sample of size $en$ with the second test. To obtain the above result, use is made of the fact that $P(S_n \leqq na)$ behaves roughly like $m^n$ where $m$ is the minimum value assumed by the moment generating function of $X - a$. It is shown that if $H_0$ and $H_1$ specify probability distributions of $X$ which are very close to each other, one may approximate $\rho$ by assuming that $X$ is normally distributed.

read more

Citations
More filters
Journal ArticleDOI

High-performance exact algorithms for motif search.

TL;DR: This paper presents algorithms for two different versions of the motif search problem – a deterministic algorithm (called DMS) and a randomized algorithm (Monte Carlo algorithm) that have the potential of performing well in practice.
Book ChapterDOI

False Positive or False Negative

TL;DR: Algorithms that can effectively mine frequent item(set)s from high speed transactional data streams with a bound of memory consumption are developed based on Chernoff bound and significantly outperform the existing false-positive algorithms.
Posted Content

On Approximating the Number of $k$-cliques in Sublinear Time.

TL;DR: In this article, the problem of approximating the number of k-cliques in a graph when given query access to the graph was studied and an algorithm that outputs a $(1+\varepsilon)$-approximation (with high probability) was proposed.
Journal ArticleDOI

Large deviations for M-estimators

TL;DR: In this article, the authors study the large deviation principle for M-estimators (and maximum likelihood estimators in particular) and obtain the rate function of the LDA for M estimators.
Journal ArticleDOI

Preserving privacy in association rule mining with bloom filters

TL;DR: This paper proposes a new approach for preserving privacy in association rule mining to use keyed Bloom filters to represent transactions as well as data items and proposes δ-folding technique to further reduce the storage requirement without sacrificing mining precision and running time.
References
More filters