What are the contributions mentioned in the paper "Interaction effects in econometrics"?

The authors provide practical advice for applied economists regarding specification and interpretation of linear regression models with interaction terms.

What is the reason for the large change in the coefficient to the main term?

The large change in the coefficient to the main term is not due to misspecification but it reflects that the coefficient to X1 is to be interpreted as the marginal effect of X1 when X2 is zero.

What do the authors find to be the strongest result of negative interactions?

Including quadratic terms in the property rights measures seems to strengthen the authors’ main result of negative interactions (although the inclusion of a quadratic term in GDP weakens it).

What does the study show about the effects of the interaction terms?

The authors find that using Frisch-Waugh residuals strengthens the size and significance of the interactions; in fact, the interaction of external dependence and equity market capitalization and credit turns from insignificant to clearly significant at the 5-percent level with the expected sign.

What is the coefficient of the interaction term when estimating equation (1)?

If $X_1^2$ is part of the correctly specified regression with coefficient $\delta$, the estimated coefficient to the interaction term when estimating equation (1) will be $\alpha\delta$.

What do the authors find to be the strongest evidence of the effect of interaction terms?

Castro, Clementi, and MacDonald (2004) hypothesize that strengthening of property rights, as measured by laws mandating “one share-one vote,” “anti-director rights” (which limit the power of directors to extract surplus), “creditor rights,” and “rule of law,” is beneficial for growth, and more so when restrictions on capital transactions (capital flows) are weaker, where the latter effect is captured by interaction terms.

What is the way to estimate the coefficient of a quadratic term?

If quadratic terms are not otherwise ruled out, the authors recommend also estimating the specification (4) in order to verify that a purported interaction term is not spuriously capturing left-out squared terms.

What is the partial derivative of Y with respect to X1?

In this regression, $\lambda_1 = \partial Y/\partial X_1$ is the partial derivative of $Y$ with respect to $X_1$, implicitly evaluated at $X_2 = \bar{X}_2$ (the mean value of $X_2$).

What is the main message of the Castro, Clementi, and MacDonald (2004) study?

The point estimates in the Castro, Clementi, and MacDonald (2004) study are not all robust, as one might conjecture from the size of the t-statistics, but the overall message of their regressions appears very robust to the kind of robustness checks that the authors recommend.

What is the way to determine if a regression with interactions really captures only interactions?

Case 2: If one wants to ascertain that the interaction of $X_1$ and $X_2$ captures no other regressors, the safest strategy is to run the following regression model:

$$Y = \beta_0 + \beta_1 X_1 + \beta_2 X_2 + \beta_3 X_1^{\psi} X_2^{\psi} + \epsilon, \tag{9}$$

where $X_1^{\psi} = M_2 X_1$ and $X_2^{\psi} = M_1 X_2$, with $M_1 = [I - P_{\beta_0, X_1}]$ and $M_2 = [I - P_{\beta_0, X_2}]$ ($M_1$ is the residual maker from regressing $X_2$ on a constant and $X_1$, and $M_2$ is the residual maker from regressing $X_1$ on a constant and $X_2$).

What is the way to explain the difference in the slope of the interaction term?

In the second column, the authors illustrate how the simple suggestion of subtracting the country-specific means from each variable prevents the interaction term from becoming spuriously significant due to country-varying slopes.

(Open Access) Interaction Effects in Econometrics (2013) | Hatice Ozer Balli

Interaction Effects in Econometrics

Hatice Ozer-Balli∗
Massey University

Bent E. Sørensen†
University of Houston and CEPR

25 June 2010

Abstract

We provide practical advice for applied economists regarding specification and interpretation of linear regression models with interaction terms.

JEL classification: C12, C13

Keywords: Non-Linear Regression, Interaction Terms.

∗ School of Economics and Finance, Massey University, New Zealand, e-mail: h.ozer-balli@massey.ac.nz, tel: +64 63505799 ext. 2666.

† Department of Economics, University of Houston, TX, e-mail: bent.sorensen@mail.uh.edu, tel: 7137433841, fax: 7137433798

1 Introduction

A country may consider a reform that would strengthen the financial sector. Would this help economic growth and development? This simple question is frustratingly hard to answer using empirical data because economic development itself spawns financial development, so while economic and financial development are positively correlated, this does not answer the question asked. In a highly influential paper, Rajan and Zingales (1998) provide convincing evidence that financial development is important for economic development by asking the simple question: do industrial sectors that are more dependent on external finance grow faster in countries with a high level of financial development? This question involves interactions between financial development and dependency on external finance. Since the publication of Rajan and Zingales’ study, the estimation of models with interaction effects has become very common in applied economics.

In Section 2, we discuss some practical issues related to the specification of regressions with interaction effects and make recommendations for practitioners. In Section 3, we illustrate our recommendations with Monte Carlo simulations and, in Section 4, we revisit some prominent applied papers where interaction effects figure prominently, including Rajan and Zingales (1998), and examine if the published results are robust. Section 5 concludes.

2 Linear Regression with Interaction Effects

Many econometric issues related to models with interaction effects are very simple and we illustrate our discussion using simple Ordinary Least Squares (OLS) and instrumental variable (IV) estimation. Often applied papers use more complicated methods involving, say, Generalized Method of Moments, clustered standard errors, etc., but the points we are making typically carry over to such settings with little modification.

Let $Y$ be a dependent variable, such as growth of an industrial sector, and $X_1$ and $X_2$ independent variables that may impact on growth, such as the dependency on external finance and financial development. Applied econometricians have typically allowed for interaction effects between two independent variables, $X_1$ and $X_2$, by estimating a simple multiple regression model of the form:

$$Y = \beta_0 + \beta_1 X_1 + \beta_2 X_2 + \beta_3 X_1 X_2 + \epsilon, \tag{1}$$

where $X_1 X_2$ refers to a variable calculated as the simple observation-by-observation product of $X_1$ and $X_2$. In the example of Rajan and Zingales (1998), the interest centers around the coefficient $\beta_3$: a significant positive coefficient implies that sectors that are more dependent on external finance grow faster following financial development.

We refer to the independent terms $X_1$ and $X_2$ as “main terms” and the product of the main terms, $X_1 X_2$, as the “interaction term.” This brings us to our first simple observations:

1. In a regression with interaction terms, the main terms should always be included. Otherwise, the interaction effect may be significant due to left-out variable bias. ($X_1 X_2$ is by construction likely to be correlated with the main terms.)

2. The partial derivative of $Y$ with respect to $X_1$ is $\beta_1 + \beta_3 X_2$. The interpretation of $\beta_1$ is the partial derivative of $Y$ with respect to $X_1$ when $X_2 = 0$. A t-test for $\beta_1 = 0$ is, therefore, a test of the null of no effect of $X_1$ when $X_2 = 0$. To test for no effect of $X_1$ one needs to test if $(\beta_1, \beta_3) = (0, 0)$ using, for example, an F-test.

Footnote: Some authors have referred to this as a multicollinearity problem. Althauser (1971) shows that the main terms and the interaction term in equation (1) are correlated. These correlations are affected in part by the size of, and the difference between, the sample means of $X_1$ and $X_2$. Smith and Sasaki (1979) also argue that the inclusion of the interaction term might cause a multicollinearity problem. In our view, collinearity in regressions with interaction effects is not a problem of a different nature than elsewhere in empirical economics: if one asks too much from a small sample, correlations between regressors make for fragile inference.
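Observations 1 and 2 can be illustrated with a small simulation. The following is a minimal sketch, assuming NumPy and a made-up data-generating process in which $\beta_1 = 0$ but $\beta_3 \neq 0$; the sample size, coefficient values, and helper names (`ols`, `marginal_effect_x1`) are illustrative assumptions, not taken from the paper. It estimates equation (1), evaluates the marginal effect $\beta_1 + \beta_3 X_2$ at the sample mean of $X_2$, and computes the F statistic for the joint restriction $(\beta_1, \beta_3) = (0, 0)$ from restricted and unrestricted residual sums of squares.

```python
import numpy as np


def ols(X, y):
    """Return OLS coefficients and the residual sum of squares."""
    beta, *_ = np.linalg.lstsq(X, y, rcond=None)
    resid = y - X @ beta
    return beta, float(resid @ resid)


rng = np.random.default_rng(0)
n = 5000
x1 = rng.normal(2.0, 1.0, n)
x2 = rng.normal(3.0, 1.0, n)
# Hypothetical data: X1 has no effect at X2 = 0 (beta1 = 0), but it does
# have an effect elsewhere through the interaction (beta3 = 0.5).
y = 1.0 + 0.0 * x1 + 0.3 * x2 + 0.5 * x1 * x2 + rng.normal(0.0, 1.0, n)

ones = np.ones(n)
X_full = np.column_stack([ones, x1, x2, x1 * x2])  # equation (1)
beta, rss_full = ols(X_full, y)


def marginal_effect_x1(x2_value):
    """Estimated dY/dX1 = beta1 + beta3 * X2 at a chosen value of X2."""
    return beta[1] + beta[3] * x2_value


# Joint test of (beta1, beta3) = (0, 0): the restricted model drops both
# X1 and the interaction term X1 * X2.
X_restricted = np.column_stack([ones, x2])
_, rss_restricted = ols(X_restricted, y)
q = 2                # number of restrictions
k = X_full.shape[1]  # number of parameters in the unrestricted model
f_stat = ((rss_restricted - rss_full) / q) / (rss_full / (n - k))

print("beta1 (effect of X1 at X2 = 0):", round(float(beta[1]), 3))
print("effect of X1 at the mean of X2:", round(float(marginal_effect_x1(x2.mean())), 3))
print("F statistic for (beta1, beta3) = (0, 0):", round(f_stat, 1))
```

In this setup a t-test on $\beta_1$ alone would suggest that $X_1$ is irrelevant, while the joint F statistic and the marginal effect at the mean of $X_2$ both indicate that $X_1$ matters over the relevant range of the data.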

In applied papers, the non-interacted regression

$$Y = \lambda_0 + \lambda_1 X_1 + \lambda_2 X_2 + \upsilon, \tag{2}$$

is often estimated before the interacted regression. In this regression, $\lambda_1 = \partial Y/\partial X_1$ is the partial derivative of $Y$ with respect to $X_1$, implicitly evaluated at $X_2 = \bar{X}_2$ (the mean value of $X_2$). The estimated $\beta_1$-coefficient in (1) is typically very close to $\hat{\lambda}_1 - \hat{\beta}_3 \bar{X}_2$.

3. Estimating the interacted regression in the form

$$Y = \beta_0 + \beta_1 X_1 + \beta_2 X_2 + \beta_3 (X_1 - \bar{X}_1)(X_2 - \bar{X}_2) + \epsilon, \tag{3}$$

results in the exact same fit as equation (1) and the exact same coefficient $\hat{\beta}_3$; $\hat{\beta}_1$ will typically be close to $\hat{\lambda}_1$ estimated from equation (2) because $\beta_1 = \partial Y/\partial X_1$ is the partial derivative of $Y$ with respect to $X_1$, evaluated at $X_2 = \bar{X}_2$. If a researcher reports results from (2), and wants to keep the interpretation of the coefficient to main terms similar, it is usually preferable to report results of the regression (3) with demeaned interaction terms.

4. In the case where, say, $X_1$ is endogenous, $X_2$ is exogenous, and $Z$ is a valid instrument for $X_1$, $X_2 Z$ will be a valid instrument for $X_1 X_2$. Alternatively, one can regress $X_1$ on $Z$ and obtain $\hat{X}_1$ and use $\hat{X}_1 X_2$ for the interaction term and obtain a consistent estimate of $\beta_3$.
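The two IV strategies in observation 4 can be sketched as follows. This is an illustrative simulation under assumed conditions, not the authors' code: the data-generating process, the coefficient values in `beta_true`, and the helper names are hypothetical, and $X_2$ is drawn independently of both the instrument and the error. Approach A runs two-stage least squares with the instrument set $(1, Z, X_2, X_2 Z)$; approach B plugs the first-stage fitted value $\hat{X}_1$ into both the main term and the interaction term.

```python
import numpy as np


def ols(X, y):
    """OLS coefficients; y may be a vector or a matrix of several outcomes."""
    beta, *_ = np.linalg.lstsq(X, y, rcond=None)
    return beta


def two_stage_least_squares(X, W, y):
    """2SLS: project the regressors X on the instruments W, then run OLS."""
    X_hat = W @ ols(W, X)  # first-stage fitted values, column by column
    return ols(X_hat, y)


rng = np.random.default_rng(1)
n = 20000
z = rng.normal(size=n)   # instrument for X1
x2 = rng.normal(size=n)  # exogenous regressor
u = rng.normal(size=n)   # unobserved confounder
x1 = z + u               # endogenous: shares u with the error term
eps = 0.8 * u + 0.6 * rng.normal(size=n)

beta_true = np.array([1.0, 2.0, 0.5, 1.5])  # hypothetical (b0, b1, b2, b3)
ones = np.ones(n)
X = np.column_stack([ones, x1, x2, x1 * x2])
y = X @ beta_true + eps

# Plain OLS on equation (1): the coefficient on X1 is biased.
b_ols = ols(X, y)

# Approach A: instrument X1 with Z and the interaction X1*X2 with X2*Z.
W = np.column_stack([ones, z, x2, x2 * z])
b_iv = two_stage_least_squares(X, W, y)

# Approach B: regress X1 on Z, then use X1_hat and X1_hat*X2 as regressors.
first_stage = np.column_stack([ones, z])
x1_hat = first_stage @ ols(first_stage, x1)
b_plug = ols(np.column_stack([ones, x1_hat, x2, x1_hat * x2]), y)

print("true  :", beta_true)
print("OLS   :", np.round(b_ols, 3))
print("2SLS  :", np.round(b_iv, 3))
print("plugin:", np.round(b_plug, 3))
```

Only point estimates are computed here; the naive OLS standard errors from the second stage of approach B would not be valid without correction.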

Footnote: Some social scientists suggest that the interaction term undermines the interpretation of the regression coefficients associated with $X_1$ and $X_2$ (e.g., Allison (1977), Althauser (1971), Smith and Sasaki (1979), and Braumoeller (2004)). The point is simply that researchers sometimes do not notice the change in the interpretation of the coefficient estimate for the main terms when the interaction term is added.

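Footnote: Because $\beta_0 + \beta_1 X_1 + \beta_2 X_2 + \beta_3 (X_1 - \bar{X}_1)(X_2 - \bar{X}_2) = (\beta_0 + \beta_3 \bar{X}_1 \bar{X}_2) + (\beta_1 - \beta_3 \bar{X}_2) X_1 + (\beta_2 - \beta_3 \bar{X}_1) X_2 + \beta_3 X_1 X_2$, we get the exact same fit, with the changes in the estimated parameters given from the correspondence between the left- and right-hand side of this equality. E.g., $\hat{\beta}_1$ in (3) will be equal to $\hat{\beta}_1 + \hat{\beta}_3 \bar{X}_2$ from (1).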
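The relationship between regressions (1), (2), and (3) can be checked numerically. The sketch below assumes NumPy and simulated data with arbitrary, made-up means and coefficients; it verifies that (1) and (3) give the same fitted values and the same interaction coefficient, that the coefficients on $X_1$ differ by exactly $\hat{\beta}_3 \bar{X}_2$, and that the coefficient on $X_1$ in the demeaned form (3) is close to $\hat{\lambda}_1$ from (2).

```python
import numpy as np


def ols(X, y):
    """Return OLS coefficients."""
    beta, *_ = np.linalg.lstsq(X, y, rcond=None)
    return beta


rng = np.random.default_rng(2)
n = 5000
x1 = rng.normal(2.0, 1.0, n)
x2 = rng.normal(3.0, 1.0, n)
# Hypothetical data-generating process with a genuine interaction.
y = 1.0 + 0.5 * x1 + 0.3 * x2 + 0.8 * x1 * x2 + rng.normal(size=n)

ones = np.ones(n)
demeaned_product = (x1 - x1.mean()) * (x2 - x2.mean())
X_raw = np.column_stack([ones, x1, x2, x1 * x2])          # equation (1)
X_main = np.column_stack([ones, x1, x2])                  # equation (2)
X_dm = np.column_stack([ones, x1, x2, demeaned_product])  # equation (3)

b_raw = ols(X_raw, y)
lam = ols(X_main, y)
b_dm = ols(X_dm, y)

print("beta1 in (1):  ", round(float(b_raw[1]), 3))
print("beta1 in (3):  ", round(float(b_dm[1]), 3))
print("lambda1 in (2):", round(float(lam[1]), 3))
print("beta3 in (1) and (3):", round(float(b_raw[3]), 3), round(float(b_dm[3]), 3))
print("beta1 in (3) minus beta3 * mean(X2):",
      round(float(b_dm[1] - b_dm[3] * x2.mean()), 3))
```

The equalities between (1) and (3) hold exactly (up to floating-point error) because the two regressor sets span the same column space; the closeness of the demeaned $\hat{\beta}_1$ to $\hat{\lambda}_1$ is only approximate and depends on the sample.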

2.1 Robustness to misspecification

Often a researcher wants to test whether $Y = f(X_1, X_2)$ and chooses a linear specification such as (2) for convenience. A more adequate specification may be a second order expansion

$$Y = \beta_0 + \beta_1 X_1 + \beta_2 X_2 + \beta_3 (X_1 - \bar{X}_1)(X_2 - \bar{X}_2) + \beta_4 X_1^2 + \beta_5 X_2^2 + \epsilon. \tag{4}$$

(We will refer to $X_i^2$; $i = 1, 2$ as “second-order terms.” In applications one may wish to enter the second-order terms in demeaned form for the same reasons as discussed for the interaction term, but for notational brevity we use the simpler non-demeaned form here.) The relevance of this observation is as follows.

5. If $Y = f(X_1, X_2)$ can be approximated by the second order expansion (4) with a non-zero coefficient to either $X_1^2$ or $X_2^2$ and $\mathrm{corr}(X_1, X_2) \neq 0$, the coefficient $\hat{\beta}_3$ in the interacted regression (1) may be spuriously significant. For example, if $\mathrm{corr}(X_1, X_2) > 0$ the estimated coefficient $\hat{\beta}_3$ will usually be positive even if $\beta_3 = 0$. If quadratic terms are not otherwise ruled out, we recommend also estimating the specification (4) in order to verify that a purported interaction term is not spuriously capturing left-out squared terms.

The potential bias from leaving out second order terms is easily understood. If $X_1$ and $X_2$ are (positively) correlated, we can write $X_2 = \alpha X_1 + w$ (where $\alpha$ is positive) so the interaction term (we suppress the mean for simplicity) becomes $\alpha X_1^2 + X_1 w$ where the latter term has mean zero and will be part of the error in the regression. If $X_1^2$ is part of the correctly specified regression with coefficient $\delta$, the estimated coefficient to the interaction term when estimating equation (1) will be $\alpha\delta$.
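A small simulation makes the point in observation 5 concrete. This is a sketch under stated assumptions, not a replication of the paper's Monte Carlo design: the values of `alpha` and `delta`, the sample size, and the normal distributions are hypothetical choices. The true model contains $X_1^2$ and no interaction ($\beta_3 = 0$), with $X_2 = \alpha X_1 + w$. Estimating equation (1) then yields a clearly positive interaction coefficient, while estimating specification (4) brings it back towards zero. The sketch does not attempt to verify the exact magnitude of the spurious coefficient.

```python
import numpy as np


def ols(X, y):
    """Return OLS coefficients."""
    beta, *_ = np.linalg.lstsq(X, y, rcond=None)
    return beta


rng = np.random.default_rng(3)
n = 20000
alpha, delta = 0.8, 1.0  # hypothetical values
x1 = rng.normal(size=n)
x2 = alpha * x1 + rng.normal(size=n)  # X2 = alpha * X1 + w, so corr > 0
# True model: a squared term in X1 and NO interaction (beta3 = 0).
y = 1.0 + 1.0 * x1 + 1.0 * x2 + delta * x1**2 + rng.normal(size=n)

ones = np.ones(n)

# Equation (1): main terms plus the interaction, squared terms left out.
b_eq1 = ols(np.column_stack([ones, x1, x2, x1 * x2]), y)

# Specification (4): demeaned interaction plus both squared terms.
demeaned_product = (x1 - x1.mean()) * (x2 - x2.mean())
X_eq4 = np.column_stack([ones, x1, x2, demeaned_product, x1**2, x2**2])
b_eq4 = ols(X_eq4, y)

print("interaction coefficient in (1):", round(float(b_eq1[3]), 3))
print("interaction coefficient in (4):", round(float(b_eq4[3]), 3))
print("coefficient on X1^2 in (4):   ", round(float(b_eq4[4]), 3))
```

Comparing the interaction coefficient across the two specifications is the robustness check recommended above: a coefficient that survives the inclusion of the squared terms is less likely to be an artifact of omitted curvature.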


References

Law and Finance

Africa's Growth Tragedy: Policies and Ethnic Divisions

Financial Dependence and Growth

Institutions and economic performance: cross‐country tests using alternative institutional measures

Investor Protection and Corporate Valuation

