What are the contributions mentioned in the paper "Simultaneous inference in general parametric models" ?

In this paper the authors describe simultaneous inference procedures in general parametric models, where the experimental questions are specified through a linear combination of elemental model parameters. Several examples using a variety of different statistical models illustrate the breadth ∗This is a preprint of an article published in Biometrical Journal, Volume 50, Number 3, 346–363.

What is the default re-parametrization used as elemental parameters in the R?

The so-called ”treatment contrast” vector θ = (µ, γ2− γ1, γ3− γ1, . . . , γq−γ1) is, for example, the default re-parametrization used as elemental parameters in the R-system for statistical computing (R Development Core Team, 2008).

What is the advantage of single-step procedures?

Single-step procedures have the advantage that corresponding simultaneous confidence intervals are easily available, as previously noted.

What is the p-value for a given family of null hypotheses?

That is, for a given family of null hypotheses H10 , . . . , H k 0 , an individual hypothesis H j 0 is rejected only if all intersection hypotheses HJ = ⋂ i∈J H i 0 with j ∈ J ⊆ {1, . . . , k} are rejected (Marcus et˜al., 1976).

What is the scalar test statistic for testing the global null hypothesis?

By construction, the authors can reject an individual null hypothesis Hj0 , j = 1, . . . , k, whenever the associated adjusted p-value is less than or equal to the pre-specified significance level α, i.e., pj ≤ α.

What is the simplest way to model the response?

The response is modelled by a linear combination of the covariates with normal error εi and constant variance σ 2,Yi = β0 +q ∑j=1βjXij +

What is the p-value for the jth individual two-sided hypothesis?

In the present context of single-step tests, the (at least asymptotic) adjusted p-value for the jth individual two-sided hypothesis Hj0 : ϑj = mj, j = 1, . . . , k, is given bypj = 1− gν(Rn, |tj|),where t1, . . . , tk denote the observed test statistics.

What is the p-value for the global null hypothesis?

The resulting global p-value (exact or approximate, depending on context) for H0 is 1 − gν(Rn,max |t|) when T = t has been observed.

Why is mcp() not available in multcomp?

Because it is impossible to determine the parameters of interest automatically in this case, mcp() in multcomp will by default generate comparisons for the main effects γj only, ignoring covariates and interactions.

(Open Access) Simultaneous inference in general parametric models. (2008) | Torsten Hothorn

Q: What is the purpose of this paper?

In this paper the authors aim at a unified description of simultaneous inference procedures in parametric models with generally correlated parameter estimates.

Simultaneous Inference

in General Parametric Models

∗

Torsten Hothorn

Institut f

ur Statistik

Ludwig-Maximilians-Universit

at M

unchen

Ludwigstraße 33, D–80539 M

unchen, Germany

Frank Bretz

Statistical Methodology, Clinical Information Sciences

Novartis Pharma AG

CH-4002 Basel, Switzerland

Peter Westfall

Texas Tech University

Lubbock, TX 79409, U.S.A

March 15, 2013

Abstract

Simu ltaneous inference is a common problem in many areas of application. If

multiple null hypotheses are tested simultaneously, the probability of rejecting er-

roneously at least one of them increases beyond the pre-speciﬁed signiﬁcance level.

Simu ltaneous inference procedures have to be used which adjust for multiplicity and

thu s control the overall type I error rate. In this paper we describe simultaneous infer-

ence procedures in general parametric models, where the experimental questions are

speciﬁed through a linear combination of elemental model parameters. The frame-

work described here is quite general and extends the canonical theory of multiple

comparison procedures in ANOVA models to linear regression problems, generalized

linear models, linear mixed eﬀects models, the Cox model, robust linear models, etc.

Seve ral examples using a variety of diﬀerent statistical models illustrate the breadth

∗

This is a preprint of an article published in Biometrical Journal, Volume 50, Number 3, 346–363.

➞

2008 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim; available online

http://www.

biometrical-journal.com

of the results. For the analyses we use the R add-on package multcomp, which pro-

vides a convenient interface to the general approach adopted here.

Key words: multiple tests, multiple comparisons, simultaneous conﬁdence intervals,

adjusted p-values, multivariate normal distribution, robust statistics.

1 Introduction

Multiplicity is an intrinsic problem of any simultaneous inference. If each of k, say, null

hypotheses is tested at nominal level α, the overall type I error rate can be substantially

larger than α. That is, the probability of at least one erroneous rejection is larger tha n

α for k ≥ 2. Common multiple comparison procedures adjust for multiplicity and thus

ensure that the overall type I error remains below the pre-sp eciﬁe d signiﬁcance level α.

Examples of such multiple comparison procedures include Dunnett’s many-to-one compar-

isons, Tukey’s all-pairwise comparisons, sequential pairwise contrasts, comparisons with

the average, changep oint analyses, dose-response contrasts, etc. These procedures are all

well established for classical regression and ANOVA models allowing for cova riates and/or

factorial treatment structures with i.i.d.˜normal errors and constant variance, see Bretz

et˜al.

(2008) and the references therein. For a general reading on multiple co mparison

procedures we refer to

Hochberg and Tamhane (1987) and Hsu (1996).

In this paper we aim at a uniﬁed description of simultaneous inference procedures in para-

metric models with generally correlated parameter estimates. Each individual null hypothe-

sis is speciﬁed through a linear combination of elemental model parameters and we allow for

k of such null hypotheses to be tested simultaneously, regardless of the number of elemental

model parameters p. The general framework described here extends the current canoni-

cal theory with respect to the following aspects: (i) model assumptions such as normality

and homoscedasticity are relaxed, thus allowing for simultaneous inference in generalized

linear models, mixed eﬀects models, survival models, etc.; (ii) arbitrary linear functions of

the elemental parameters are allowed, not just contrasts of means in AN(C)OVA models;

(iii) computing the r eference distribution is feasible for arbitrary designs, especially for

unbalanced designs; and (iv) a uniﬁed implementation is provided which allows for a fast

transition of the theoretical results to the desks of data analysts interested in simultaneous

inferences for multiple hypotheses.

Accordingly, the paper is organized as follows. Section˜

2 deﬁnes the general model and ob-

tains the asymptotic or exact distribution of linear functions of elemental model parameters

under rather weak conditions. In Section˜

3 we describe the framework for simultaneous

inference procedures in general parametric models. An overview about important applica-

tions of the methodology is given in Section˜4 followed by a s hort discussion of the software

implementation in Section˜

5. Most interesting from a practical point of view is Section˜6

where we analyze four rather challenging problems with the tools developed in this paper.

2 Model and Parameters

In this section we introduce the underlying model assumptions and derive some asymptotic

results necessary in the subsequent sections. The results from this section form the basis

for the simultaneous inference procedures described in Section˜3.

Let M((Z

, . . . , Z

), θ, η ) denote a semi-parametric statistical model. The set of n obser-

vations is described by (Z

, . . . , Z

). The model contains ﬁxed but unknown elemental

parameters θ ∈ R

and other (random or nuisance) parameters η. We are primarily in-

terested in the linear functions ϑ := Kθ of the parameter vector θ as speciﬁed through

the consta nt matrix K ∈ R

k,p

. In what follows we describe the underlying model assump-

tions, the limiting distribution of estimates of our parameters of interest ϑ, as well as the

corresponding test statistics for hypotheses about ϑ and their limiting joint distribution.

Suppose

∈ R

is an estimate of θ and S

∈ R

p,p

is an estimate of cov(

) with

−→ Σ ∈ R

p,p

(1)

for some positive, nondecreasing sequence a

. Furthermore, we assume that a multivariate

central limit theorem holds, i.e.,

1/2

(

− θ)

−→ N

(0, Σ). (2)

If both (

1) and (2) are fulﬁlled we write

∼ N

(θ, S

). Then, by Theorem 3.3.A in Serﬂing

(1980), the linear function

= K

, i.e., an estimate of our parameters of interest, also

follows an approximate multivariate normal distribution

= K

∼ N

(ϑ, S

⋆

)

with covariance matrix S

⋆

:= KS

⊤

for any ﬁxed matrix K ∈ R

k,p

. Thus we need not

to distinguish between elemental parameters θ or derived parameters ϑ = Kθ that are of

interest to the r esearcher. Instead we simply assume for the moment that we have (in

analogy to (

1) and (2))

∼ N

(ϑ, S

⋆

) with a

⋆

−→ Σ

⋆

:= KΣK

⊤

∈ R

k,k

(3)

and that the k parameters in ϑ are themselves the parameters of interest to the researcher.

It is assumed that the diagonal elements of the covariance matrix are positive, i.e., Σ

⋆

> 0

for j = 1, . . . , k.

Then, the standardized estimator

is again asymptotically normally distributed

:= D

−1/2

(

− ϑ)

∼ N

(0, R

) (4)

where D

= diag(S

⋆

) is the diagonal matrix given by the diagonal elements of S

⋆

and

= D

−1/2

⋆

−1/2

∈ R

k,k

is the correlation matrix of the k-dimensional statistic T

. To demonstrate (4), note that

with (3) we have a

⋆

−→ Σ

⋆

and a

−→ diag(Σ

⋆

). Deﬁne the sequence ˜a

needed to

establish ˜a-convergence in (

4) by ˜a

≡ 1. Then we have

˜a

= D

−1/2

⋆

−1/2

= (a

)

−1/2

⋆

)(a

)

−1/2

−→ diag(Σ

⋆

)

−1/2

⋆

diag(Σ

⋆

)

−1/2

=: R ∈ R

k,k

where the convergence in probability to a constant follows from Slutzky’s Theorem (The-

orem 1.5.4,

Serﬂing, 1980) and therefore (4) holds. To ﬁnish note that

= D

−1/2

(

− ϑ) = (a

)

−1/2

1/2

(

− ϑ)

−→ N

(0, R).

For the purposes of multiple comparisons, we need convergence of multivariate probabilities

calculated for the vector T

when T

is assumed normally distributed with R

treated

as if it were the true correlation matrix. However, such probabilities P(max(|T

| ≤ t)

are continuous functions of R

(and a critical value t) which converge by R

−→ R as

a consequence of Theorem 1.7 in

Serﬂing (1980). In cases where T

is assumed multi-

variate t distributed with R

treated as the estimated correlation matrix, we have similar

convergence as the degrees of freedom approach inﬁnity.

Since we only assume that the parameter estimates are asymptotically normally distributed

with a consistent estimate of the associated covariance matrix being available, our frame-

work covers a large class of statistical models, including linear regression and ANOVA

models, generalized linear models, linear mixed eﬀects models, the Cox model, robust lin-

ear models, etc. Standard software packages can be used to ﬁt such models and obtain

the estimates

and S

which are essentially the only two quantities that are needed for

what follow s in Section˜

3. It should be noted that the elemental parameters θ are not

necessarily means or diﬀerences of means in AN(C)OVA models. Also, we do not restrict

our attention to contrasts of such means, but allow for any set of constants leading to the

linear functions ϑ = Kθ of interest. Speciﬁc examples for K and θ will be given later in

Sections˜

4 and 6.

3 Global and Simultaneous Inference

Based on the results from Section˜

2, we now focus on the derivation of suitable inference

procedures. We start considering the general linear hypothesis (

Searle, 1971) formulated

in terms of our parameters of interest ϑ

: ϑ := Kθ = m.

Under the conditions of H

it follows from Section˜2 that

= D

−1/2

(

− m)

∼ N

(0, R

This approximating distribution will now be used as the reference distribution when con-

structing the inference procedures. The global hypothesis H

can be tested using standard

global tests, such as the F - or the χ

-test. An alternative approach is to use maximum

tests, as explained in Subsection˜

3.1. Note that a small global p-value (obtained from one

of these procedures) leading to a rejection of H

does not give further indication about

the nature of the signiﬁcant result. Therefore, one is often interested in the individual null

hypotheses

: ϑ

= m

Testing the hypotheses set {H

, . . . , H

} simultaneously thus requires the individual as-

sessments while maintaining the familywise error rate, as discussed in Subsection˜3.2

At this point it is worth considering two special cases. A stronger a ssumption than asymp-

totic normality of

in (

2) is exact normality, i.e.,

∼ N

(θ, Σ). If the covariance matrix

Σ is known, it follows by standard arguments that T

∼ N

(0, R), when T

is normalized

using ﬁxed, known variances. Otherwise, in the typical situation of linear models with

normal i.i.d. errors, Σ = σ

A, where σ

is unknown but A is ﬁxed and known, the exact

distribution of T

is a k-dimensional multivariate t

(ν, R) distribution with ν degrees of

freedom (ν = n − p − 1 for linear models), see

Tong (1990 ).

3.1 Global Inference

The F - and the χ

-test are classical approaches to assess the global null hypothesis H

Standard results (such as Theorem 3.5,

Serﬂing, 1980) ensure that

= T

⊤

−→ χ

(Rank(R)) when

∼ N

(θ, S

)

F =

⊤

Rank(R)

∼ F(Rank(R), ν) when

∼ N

(θ, σ

A),

where Rank(R) and ν are the corresponding degrees of freedom of the χ

and F distri-

bution, respectively. Furthermore, Rank(R

)

denotes the Moore-Penrose inverse of the

correlation matrix Rank(R).

Another suitable scalar test statistic for testing the global hypothesis H

is to consider

the maximum of the individual test statistics T

1,n

, . . . , T

k,n

of the multivariate statistic

= (T

1,n

, . . . , T

k,n

), leading to a max-t type test statistic max(|T

|). The distribution

of this statistic under the conditions of H

can be handled through the k-dimensional

distribution

P(max(|T

|) ≤ t)

∼

−t

· · ·

−t

, . . . , x

; R, ν) dx

· · · dx

=: g

(R, t) (5)

Simultaneous inference in general parametric models.

Figures

Citations

Species boundaries in the human pathogen Paracoccidioides

Soil Carbon Stocks Decrease following Conversion of Secondary Forests to Rubber (Hevea brasiliensis) Plantations

Cerebellar Contribution to Social Cognition

The analysis of zero-inflated count data: beyond zero-inflated poisson regression

Root exudate cocktails: the link between plant diversity and soil microorganisms?

References

Robust Regression and Outlier Detection

Approximation Theorems of Mathematical Statistics

Linear Models

Multiple Comparison Procedures

Multiple Comparison Procedures.

Related Papers (5)

R: A language and environment for statistical computing.

Fitting Linear Mixed-Effects Models Using lme4

ggplot2: Elegant Graphics for Data Analysis

Modern Applied Statistics with S

vegan: Community Ecology Package

Frequently Asked Questions (14)

Q1. What are the contributions mentioned in the paper "Simultaneous inference in general parametric models" ?

Q2. What is the default re-parametrization used as elemental parameters in the R?

Q3. What is the purpose of this paper?

Q4. What is the advantage of single-step procedures?

Q5. What is the p-value for a given family of null hypotheses?

Q6. What is the scalar test statistic for testing the global null hypothesis?

Q7. What is the simplest way to model the response?

Q8. What is the general framework for simultaneous inference?

Q9. What is the p-value for the jth individual two-sided hypothesis?

Q10. What is the p-value for the global null hypothesis?

Q11. What is the way to test the global null hypothesis?

Q12. What are examples of multiple comparison procedures?

Q13. Why is mcp() not available in multcomp?

Q14. What is the sequence of n needed to establish -convergence in (4)?