scispace - formally typeset
Search or ask a question
Journal ArticleDOI

On a Test of Whether one of Two Random Variables is Stochastically Larger than the Other

01 Mar 1947-Annals of Mathematical Statistics (Institute of Mathematical Statistics)-Vol. 18, Iss: 1, pp 50-60
TL;DR: In this paper, the authors show that the limit distribution is normal if n, n$ go to infinity in any arbitrary manner, where n = m = 8 and n = n = 8.
Abstract: Let $x$ and $y$ be two random variables with continuous cumulative distribution functions $f$ and $g$. A statistic $U$ depending on the relative ranks of the $x$'s and $y$'s is proposed for testing the hypothesis $f = g$. Wilcoxon proposed an equivalent test in the Biometrics Bulletin, December, 1945, but gave only a few points of the distribution of his statistic. Under the hypothesis $f = g$ the probability of obtaining a given $U$ in a sample of $n x's$ and $m y's$ is the solution of a certain recurrence relation involving $n$ and $m$. Using this recurrence relation tables have been computed giving the probability of $U$ for samples up to $n = m = 8$. At this point the distribution is almost normal. From the recurrence relation explicit expressions for the mean, variance, and fourth moment are obtained. The 2rth moment is shown to have a certain form which enabled us to prove that the limit distribution is normal if $m, n$ go to infinity in any arbitrary manner. The test is shown to be consistent with respect to the class of alternatives $f(x) > g(x)$ for every $x$.

Content maybe subject to copyright    Report

Citations
More filters
Journal ArticleDOI
TL;DR: In this article, a test of the hypothesis that the samples are from the same population may be made by ranking the observations from from 1 to Σn i (giving each observation in a group of ties the mean of the ranks tied for), finding the C sums of ranks, and computing a statistic H. Under the stated hypothesis, H is distributed approximately as χ2(C − 1), unless the samples were too small, in which case special approximations or exact tables are provided.
Abstract: Given C samples, with n i observations in the ith sample, a test of the hypothesis that the samples are from the same population may be made by ranking the observations from from 1 to Σn i (giving each observation in a group of ties the mean of the ranks tied for), finding the C sums of ranks, and computing a statistic H. Under the stated hypothesis, H is distributed approximately as χ2(C – 1), unless the samples are too small, in which case special approximations or exact tables are provided. One of the most important applications of the test is in detecting differences among the population means.* * Based in part on research supported by the Office of Naval Research at the Statistical Research Center, University of Chicago.

9,365 citations


Cites background or methods from "On a Test of Whether one of Two Ran..."

  • ...Festinger 171, Haldane and Smith [14], Mann and Whitney [ 28 ], White [59], and Wilcoxon [61, 621 have published tables for the two-sample case....

    [...]

  • ...a An extensive comparison of exaot probabilities for the twc-aample test [ 28 ] with those based on the normal approximation indicates that the normal approximation is usually better with the continuity sdjustment when the probability is above 0.02, and better without it when the probability is 0.02 or below....

    [...]

Journal ArticleDOI
TL;DR: A new method for metagenomic biomarker discovery is described and validates by way of class comparison, tests of biological consistency and effect size estimation to address the challenge of finding organisms, genes, or pathways that consistently explain the differences between two or more microbial communities.
Abstract: This study describes and validates a new method for metagenomic biomarker discovery by way of class comparison, tests of biological consistency and effect size estimation. This addresses the challenge of finding organisms, genes, or pathways that consistently explain the differences between two or more microbial communities, which is a central problem to the study of metagenomics. We extensively validate our method on several microbiomes and a convenient online interface for the method is provided at http://huttenhower.sph.harvard.edu/lefse/.

9,057 citations


Cites methods from "On a Test of Whether one of Two Ran..."

  • ...The LEfSe algorithm is introduced in overview in the Results section, and Figure 6 illustrates in detail the format of the input (a matrix with n rows and m columns) and the three steps performed by the computational tool: the KW rank sum test [49] on classes, the pairwise Wilcoxon test [50,51] between subclasses of different classes, and the LDA [52] on the relevant features....

    [...]

  • ...for example, given some known population structure within a set of input samples, is a feature more abundant in all population subclasses or in just one? Specifically, we first use the non-parametric factorial KruskalWallis (KW) sum-rank test [49] to detect features with significant differential abundance with respect to the class of interest; biological consistency is subsequently investigated using a set of pairwise tests among subclasses using the (unpaired) Wilcoxon rank-sum test [50,51]....

    [...]

Book ChapterDOI
TL;DR: In this article, upper bounds for the probability that the sum S of n independent random variables exceeds its mean ES by a positive number nt are derived for certain sums of dependent random variables such as U statistics.
Abstract: Upper bounds are derived for the probability that the sum S of n independent random variables exceeds its mean ES by a positive number nt. It is assumed that the range of each summand of S is bounded or bounded above. The bounds for Pr {S – ES ≥ nt} depend only on the endpoints of the ranges of the summands and the mean, or the mean and the variance of S. These results are then used to obtain analogous inequalities for certain sums of dependent random variables such as U statistics and the sum of a random sample without replacement from a finite population.

8,655 citations

Posted Content
TL;DR: An evaluation of double-blind reviewed journals through important academic publishing databases revealed that more than 30 academic articles in the domain of international marketing (in a broad sense) used PLS path modeling as means of statistical analysis.
Abstract: Purpose: This paper discusses partial least squares path modeling (PLS), a powerful structural equation modeling technique for research on international marketing. While a significant body of research provides guidance for the use of covariance-based structural equation modeling (CBSEM) in international marketing, there are no subject-specific guidelines for the use of PLS so far.Methodology/approach: A literature review of the use of PLS in international marketing reveals the increasing application of this methodology.Findings: This paper reveals the strengths and weaknesses of PLS in the context of research on international marketing, and provides guidance for multi-group analysis.Originality/value of paper: The paper assists researchers in making well-grounded decisions regarding the application of PLS in certain research situations and provides specific implications for an appropriate application of the methodology.

7,536 citations


Cites background from "On a Test of Whether one of Two Ran..."

  • ...potential missing data, the psychometric properties of the variables examined, and the magnitude of the relationships considered before deciding on an appropriate sample size to use or to ensure that a sufficient sample size is actually available to study the phenomena of interest’’ (Marcoulides & Saunders, 2006, p. VI). Marcoulides and Saunders offer other warnings that clearly echo those presented in the CBSEM literature. For instance, SchneeweiX (2001) addresses the magnitude of standard errors in PLS path modeling estimators resulting from not using enough observations (consistency) and indicators for each latent variable (consistency at large)....

    [...]

Book ChapterDOI
06 Mar 2009
TL;DR: In this article, the authors conducted an exhaustive literature review to determine the status quo of PLS path modeling in international marketing research and found that more than 30 academic articles in the domain of international marketing used PLS as a means of statistical analysis.
Abstract: In order to determine the status quo of PLS path modeling in international marketing research, we conducted an exhaustive literature review. An evaluation of double-blind reviewed journals through important academic publishing databases (e.g., ABI/Inform, Elsevier ScienceDirect, Emerald Insight, Google Scholar, PsycINFO, Swetswise) revealed that more than 30 academic articles in the domain of international marketing (in a broad sense) used PLS path modeling as means of statistical analysis. We assessed what the main motivation for the use of PLS was in respect of each article. Moreover, we checked for applications of PLS in combination with one or more additional methods, and whether the main reason for conducting any additional method(s) was mentioned.

5,282 citations

References
More filters
Book ChapterDOI
Frank Wilcoxon1
TL;DR: The comparison of two treatments generally falls into one of the following two categories: (a) a number of replications for each of the two treatments, which are unpaired, or (b) we may have a series of paired comparisons, some of which may be positive and some negative as mentioned in this paper.
Abstract: The comparison of two treatments generally falls into one of the following two categories: (a) we may have a number of replications for each of the two treatments, which are unpaired, or (b) we may have a number of paired comparisons leading to a series of differences, some of which may be positive and some negative. The appropriate methods for testing the significance of the differences of the means in these two cases are described in most of the textbooks on statistical methods.

12,871 citations


"On a Test of Whether one of Two Ran..." refers methods in this paper

  • ...+ - (2n m + 2nm2 -n2 _ m2 -nm) 16 which is obtained from (1) by multiplication by (U nm/2)4 and expansion....

    [...]

  • ...Using the recurrence relation (1) the probabilities pnm(U) have been tabulated for m < n < 8 (see Table I)....

    [...]

  • ...After multiplying (1) by (U nm/2)2, using...

    [...]

Journal ArticleDOI

827 citations


"On a Test of Whether one of Two Ran..." refers background in this paper

  • ...Since the probability of the ith 1 preceding the jth 0 is 4, we have (2) En,m(U) = nm/2....

    [...]