scispace - formally typeset
Search or ask a question

Showing papers in "Annals of Mathematical Statistics in 1935"


Journal ArticleDOI
TL;DR: In this article, the authors developed a criterion free from defects of this nature, depending only on the assumption of random sampling from a normal universe, and developed the distribution of T defined by
Abstract: Criteria for the rejection of outlying observations may be designed to reject a given fraction of all observations, or a proportion varying with the size of the sample. Irwin' has discussed several criteria based on sampling from a normal population which had been used previously, as well as one which he proposed. This is based on the principal of fixing the expectation of rejecting an observation from a sample independently of the aggregate number, N, of the sample. The criterion, X, is b/a times the interval between successive observations in ascending order of magnitude, where a is the standard deviation of the sampled population. In the same paper he gave, for different values of N, a table of P1(X) and P2(X), respectively probabffities of exceeding given values of X for the first or second such interval from either end. In actual use, however, a is estimated from the sample standard deviation, and we are left to decide whether observations in question are to be included or not in estimating the standard deviation as also whether or not to modify this by addition or subtraction of an estimate of its probable error. The object of the present communication is to develop a criterion free from defects of this nature, depending only on the assumption of random sampling from a normal universe. For this purpose we develop the distribution of T defined by

166 citations





Journal ArticleDOI
TL;DR: In this article, it was found that the usual method of treating the probable error gives an overly optimistic idea of the smallness of the deviations that may be expected in future samples.
Abstract: The following statement of the significance of a probable error is often made: "The probable error of the mean is a value above and below the mean such that if the test were repeated under the same conditions there would be, on the average, equal chances that the mean would fall within or without this range." The probable error is attached to the mean of the sample and it is assumed that the standard deviation of the sample is that of the sampled normal population. This was formerly a very usual explanation of the meaning of probable error by research workers, but it is inaccurate and misleading, especially for samples of 20 or less such as are dealt with in agricultural experiments. The inaccuracy of this explanation of the meaning of probable error has been realized for many years by competent statisticians, but no satisfactory treatment has heretofore been devised.' The attempted explanation of the probable error in terms of the expected frequency of the occurrence of different size deviations of the means of future samples from the sample mean does raise a very interesting, important, and legitimate question, namely, what is the probability of a second mean lying within a certain multiple of the standard deviation of a first sample of the mean of a first sample? This question is of fundamental concern to those engaged in experimental work. Its answer will indicate to investigators reasonable deviations from the results oftheir first experiments, will form a valid basis for the rejection of doubtful observations or groups of such observations, and will form a basis for a test of the significance of the divergence of results in different experiments. It is found that the usual method of treating the probable error gives an overly optimistic idea of the smallness of the deviations that may be expected in future samples. The distribution function of the variable

25 citations