scispace - formally typeset
Search or ask a question
Journal ArticleDOI

Sensitivity Analyses for Robust Causal Inference from Mendelian Randomization Analyses with Multiple Genetic Variants

01 Jan 2017-Epidemiology (Lippincott Williams and Wilkins)-Vol. 28, Iss: 1, pp 30-42
TL;DR: A range of sensitivity analyses are discussed that will either support or question the validity of causal inference from a Mendelian randomization analysis with multiple genetic variants, and those that can be undertaken using summarized data are focused on.
Abstract: Mendelian randomization investigations are becoming more powerful and simpler to perform, due to the increasing size and coverage of genome-wide association studies and the increasing availability of summarized data on genetic associations with risk factors and disease outcomes. However, when using multiple genetic variants from different gene regions in a Mendelian randomization analysis, it is highly implausible that all the genetic variants satisfy the instrumental variable assumptions. This means that a simple instrumental variable analysis alone should not be relied on to give a causal conclusion. In this article, we discuss a range of sensitivity analyses that will either support or question the validity of causal inference from a Mendelian randomization analysis with multiple genetic variants. We focus on sensitivity analyses of greatest practical relevance for ensuring robust causal inferences, and those that can be undertaken using summarized data. Aside from cases in which the justification of the instrumental variable assumptions is supported by strong biological understanding, a Mendelian randomization analysis in which no assessment of the robustness of the findings to violations of the instrumental variable assumptions has been made should be viewed as speculative and incomplete. In particular, Mendelian randomization investigations with large numbers of genetic variants without such sensitivity analyses should be treated with skepticism.

Summary (5 min read)

FIGURE 1.

  • Diagram of instrumental variable assumptions for Mendelian randomization.
  • The three assumptions (i, ii, iii) are illustrated by the presence of an arrow, indicating the effect of one variable on the other (assumption i), or by a dashed line with a cross, indicating that there is no direct effect of one variable on the other (assumptions ii and iii).

ASSESSING THE INSTRUMENTAL VARIABLE ASSUMPTIONS

  • The first set of approaches the authors consider are those to assess whether the instrumental variable assumptions are likely to be satisfied or not for a set of genetic variants.
  • The authors consider in turn the assessment of the association with measured confounders, the exploitation of a natural experiment in the form of a gene-environment interaction, examination of a scatter plot combined with a heterogeneity test, and of a funnel plot combined with a test for directional pleiotropy.

Use of Measured Covariates

  • The assumption that an instrumental variable is not associated with confounders of the risk factor-outcome association is not fully testable, as not all confounders will be known or measured.
  • Associations are no stronger than would be expected by chance alone.
  • Inhibition of interleukin-1 by the drug anakinra has been observed to lead to decreased levels of c-reactive protein and interleukin-6 in clinical trials.
  • In some cases, valid causal inference may still be possible even if a genetic variant has a pleiotropic association with a measured covariate; for instance, by adjusting for the covariate in the analysis model.
  • An alternative approach with summarized data is a multivariable Mendelian randomization analysis, in which genetic associations with the outcome are regressed on the genetic associations with the risk factor and covariates in a multivariable weighted regression model.

Gene-Environment Interaction

  • For some applications of Mendelian randomization, a further natural experiment may be available if the postulated causal effect is present in one stratum of the population, but absent in another.
  • 32 For example, the association of alcoholrelated genetic variants with esophageal cancer risk is present in those who drink alcohol, but absent in abstainers.
  • One potential complication of such an analysis is the possibility of collider bias; 34 by stratifying on the risk factor, associations between the genetic variants and the outcome may be distorted in the strata (in the examples above, in alcohol consumers/abstainers).
  • 35, 36 Associations (estimates in standard deviation units and 95% confidence intervals) of four genetic variants in the CRP gene region with a range of covariates per C-reactive protein increasing allele.
  • 16 Copyright © 2016 Wolters Kluwer Health, Inc. Unauthorized reproduction of this article is prohibited.

Scatter Plot and Test for Heterogeneity

  • Even if the instrumental variable assumptions are in doubt for some or all of the variants, if several independent genetic variants in different gene regions are concordantly associated with the outcome, then a causal conclusion would seem reasonable.
  • Any point that substantially deviates from this line should be investigated for potential pleiotropy.
  • A statistical test for heterogeneity can be performed using cochran's Q test on the causal estimates from each 2 are the inverse-variance weights.
  • This statistic can be calculated using only summarized data.

FIGURE 3.

  • Diagram to illustrate the difference between pleiotropy (left, the association of the genetic variant with the covariate is independent of the risk factor) and mediation (right, the association of the genetic variant with the covariate is mediated entirely via the risk factor).
  • Egger regression is a method for detecting small study bias (often interpreted as publication bias) in a meta-analysis of separate studies.
  • 45 the method can also be used for detecting directional pleiotropy from separate genetic variants.
  • 47 the genetic associations should be orientated so that the associations with the risk factor all have the same sign.
  • If there is no intercept term in this regression, the slope parameter is the inverse-variance weighted causal estimate.

ROBUST ANALYSIS METHODS

  • The second category of sensitivity analyses is that of robust analysis methods.
  • Robust analysis methods allow different (and when the main purpose is to test the causal null hypothesis, weaker) assumptions than standard instrumental variable methods.
  • In turn, the authors consider penalization methods, median-based methods, and egger regression.

Penalization Methods

  • The authors first consider methods in which the contribution of some genetic variants (e.g., heterogeneous or outlying variants) to the analysis is downweighted (or penalized).
  • The simplest way of performing a penalization method is to omit some of the variants from the analysis.
  • With a small number of genetic variants, the causal estimates omitting one variant at a time could be considered.
  • This sensitivity analysis has been undertaken for the effect of lDl-c on aortic stenosis.
  • They require individual-level data and a one-sample setting (genetic variants, risk factor, and outcome measurements are available for the same individuals).

Median-based Methods

  • An alternative family of methods that gives consistent estimates when up to half the genetic variants are not valid instrumental variables, but that can be performed using summarized data rather than individual-level data, are medianbased methods.
  • The weighted median estimate is consistent under the assumption that genetic variants representing over 50% of the weight in the analysis are valid instruments.
  • This is for Mendelian randomization analysis of C-reactive protein on coronary artery disease risk using genetic variants throughout the genome that have been demonstrated as associated with C-reactive protein at a genome-wide level of significance.
  • Horizontal lines represent 95% confidence intervals for the instrumental variable estimates.
  • Confidence intervals for the median and weighted estimates can be estimated using bootstrapping.

Egger Regression

  • The egger regression method was introduced above as a test for directional pleiotropy; this test does not make any assumption about the genetic variants.
  • Under an assumption that is weaker than standard instrumental variable assumptions, the slope coefficient from the egger regression method provides an estimate of the causal effect that is consistent asymptotically even if all the genetic variants have pleiotropic effects on the outcome.
  • There is some evidence for the general plausibility of the inSiDe assumption, as associations of genetic variants with different phenotypic variables have been shown to be largely uncorrelated in an empirical study.
  • The penalization and median-based methods allow more general departures from the instrumental variable assumptions for the invalid instruments.
  • Using genetic variants chosen solely on the basis of their association with the risk factor, a broad range of methods affirmed that lDl-c was a causal risk factor for cAD risk.

Example: C-reactive Protein and Coronary Artery Disease Risk

  • The inverse-variance weighted method was originally proposed as a fixed-effect meta-analysis of the causal estimates from each of the genetic variants.
  • The authors consider fixed-effect and multiplicative random-effects models for both the inversevariance weighted and egger regression methods.
  • 56 Also, the authors consider simple (i.e., unweighted) median and weighted median estimates.
  • The corresponding randomeffects analyses imply that there is no convincing evidence for a causal effect.

DISCUSSION

  • When multiple genetic variants from different gene regions are used in a Mendelian randomization analysis, it is highly implausible that all the genetic variants satisfy the instrumental variable assumptions.
  • This does not preclude a causal conclusion; however, it means that a simple instrumental variable analysis alone should not be relied on to give a causal conclusion.
  • Inappropriate and naive application of standard Mendelian randomization methods may lead to exactly the same problems of unmeasured confounding that the technique was designed to avoid.
  • The authors have discussed a range of sensitivity analyses that can be used to question the plausibility of a Mendelian randomization analysis using multiple variants, focusing on those analyses that are judged to be most useful to an applied analyst and those that can be performed using summarized data.
  • Not every sensitivity analysis may be appropriate for each case, but some effort should be made to investigate whether a causal finding is robust to violations of the instrumental variable assumptions.

Comparison with Previous Literature

  • From its initial popularization, proponents of Mendelian randomization have been candid about the stringent and untestable assumptions required in Mendelian randomization.
  • 3, 14 However, applied investigations have not always reflected this need for caution.
  • In comparison with previous attempts to offer robust approaches for causal inference in Mendelian randomization, the authors have here repeated some of the guidance of Glymour et al., 32 specifically relating to the search for gene-environment interactions and to testing for heterogeneity between the estimates from different variants.
  • Substantial attenuation of the association on adjustment for the risk factor is expected if the genetic variant is a valid instrumental variable; however, such attenuation may not occur in practice, for example, due to measurement error in the exposure 58 -conversely, some attenuation may occur for an invalid instrumental variable.
  • Violations of the assumptions of homogeneity and/or linearity of the causal effect would also lead to difficulties in interpreting the causal estimate, although they are unlikely to lead to inappropriate causal inferences or inflated type 1 error rates under the null.

Summarized Data and Two-sample Mendelian Randomization

  • All of the sensitivity analyses discussed in this article can equally be performed b.
  • Odds ratio for coronary artery disease per 1-SD (1.05 unit) increase in log-transformed c-reactive protein concentration (equivalent to a 2.86-fold increase in c-reactive protein concentration).
  • A further concern with summarized data is the use of two-sample analyses, in which data on the gene-risk factor and gene-outcome associations are taken from nonoverlapping datasets.
  • This is not to discourage the use of summarized data or two-sample Mendelian randomization analyses, but to acknowledge that the bar for evidential quality is even higher in this case.

Genetic Variants with Different Functional Effects

  • The authors have assumed that there is a single causal effect of the risk factor on the outcome, and interpreted deviation from this (i.e., heterogeneity of causal effect estimates) as evidence that the instrumental variable assumptions are violated for some of the genetic variants.
  • In reality, if genetic variants have different functional effects on the risk factor, then different magnitudes of causal effect may be expected.
  • Genetic variants associated with body mass index may have different biological mechanisms giving rise to the association, and may affect the outcome to different extents.
  • Heterogeneity between causal estimates based on sets of genetic variants grouped according to their biological function may help reveal which mechanisms are causal.
  • The causal estimates presented in this article still provide a valid test of the causal null hypothesis, but do not have an interpretation as estimates of a causal parameter.

Pleiotropy and Other Violations of the Instrumental Variable Assumptions

  • The authors have discussed violations of the instrumental variable assumptions primarily using the language of pleiotropy.
  • In particular, violations of the exclusion restriction assumption (i.e., no effect of the genetic variant on the outcome except for that via the risk factor) can be expressed as pleiotropic effects.
  • 63 while this adjustment has proved successful in some cases, it is not guaranteed to eliminate population stratification.
  • 32 classical (nondifferential, zero mean) measurement error in the risk factor does not lead to bias in instrumental variable estimates.
  • If there are multiple versions of the risk factor, then this would lead to difficulties in interpreting the causal findings.

CONCLUSIONS

  • The increasing size and coverage of genome-wide association studies and the increasing availability of summarized data on genetic associations are making the application of Mendelian randomization simpler.
  • The methods for sensitivity analysis described in this article will help to judge whether a causal conclusion from a Mendelian randomization analysis is reasonable or not.
  • Aside from cases in which the selection of the genetic variants and their justification as instrumental variables is motivated by strong biological understanding, a Mendelian randomization analysis in which no assessment of the robustness of the findings has been made should be viewed as speculative.

Did you find this useful? Give us your feedback

Figures (8)

Content maybe subject to copyright    Report

Burgess, S., Bowden, J., Fall, T., Ingelsson, E., & Thompson, S. G.
(2017). Sensitivity analyses for robust causal inference from
Mendelian randomization analyses with multiple genetic variants.
Epidemiology
,
28
(1), 30-42.
https://doi.org/10.1097/EDE.0000000000000559
Publisher's PDF, also known as Version of record
License (if available):
CC BY
Link to published version (if available):
10.1097/EDE.0000000000000559
Link to publication record in Explore Bristol Research
PDF-document
This is the final published version of the article (version of record). It first appeared online via Wolters Kluwer at
http://journals.lww.com/epidem/fulltext/2017/01000/Sensitivity_Analyses_for_Robust_Causal_Inference.6.aspx.
Please refer to any applicable terms of use of the publisher.
University of Bristol - Explore Bristol Research
General rights
This document is made available in accordance with publisher policies. Please cite only the
published version using the reference above. Full terms of use are available:
http://www.bristol.ac.uk/red/research-policy/pure/user-guides/ebr-terms/

Copyright © 2016 Wolters Kluwer Health, Inc. Unauthorized reproduction of this article is prohibited.
30 | www.epidem.com Epidemiology •  Volume 28, Number 1, January 2017
REVIEW ARTICLE
Abstract: Mendelian randomization investigations are becoming
more powerful and simpler to perform, due to the increasing size
and coverage of genome-wide association studies and the increas-
ing availability of summarized data on genetic associations with risk
factors and disease outcomes. However, when using multiple genetic
variants from different gene regions in a Mendelian randomization
analysis, it is highly implausible that all the genetic variants satisfy
the instrumental variable assumptions. This means that a simple
instrumental variable analysis alone should not be relied on to give
a causal conclusion. In this article, we discuss a range of sensitivity
analyses that will either support or question the validity of causal
inference from a Mendelian randomization analysis with multiple
genetic variants. We focus on sensitivity analyses of greatest practi-
cal relevance for ensuring robust causal inferences, and those that can
be undertaken using summarized data. Aside from cases in which the
justification of the instrumental variable assumptions is supported by
strong biological understanding, a Mendelian randomization analysis
in which no assessment of the robustness of the findings to viola-
tions of the instrumental variable assumptions has been made should
be viewed as speculative and incomplete. In particular, Mendelian
randomization investigations with large numbers of genetic variants
without such sensitivity analyses should be treated with skepticism.
(Epidemiology 2017;28: 30–42)
A
n instrumental variable in an observational study behaves
similarly to random treatment assignment in an experi-
mental setting.
1
It provides a natural experiment, whereby
individuals with different levels of the instrumental variable
differ on average with respect to the putative risk factor, but
not with respect to any confounders of the risk factor–out-
come association.
2
Mendelian randomization is the use of a
genetic variant as a proxy for a modifiable risk factor.
3,4
If
a genetic variant satisfies the assumptions of an instrumental
variable for the risk factor, then whether there is an associa-
tion between the genetic variant and the outcome is a test of
whether the risk factor is a cause of the outcome.
5
The instrumental variable assumptions are satisfied for
a genetic variant if
(i) the genetic variant is associated with the risk factor;
(ii) the genetic variant is not associated with confound-
ers of the risk factor–outcome relationship; and
(iii) the genetic variant is not associated with the out-
come conditional on the risk factor and confound-
ers of the risk factor–outcome relationship.
6
These assumptions imply that the only causal pathway
from the genetic variant to the outcome is via the risk factor,
and there is no other causal pathway either directly to the out-
come or via a confounder.
7
A diagram corresponding to these
assumptions is presented in Figure 1.
We further assume that all valid instrumental vari-
ables identify the same causal parameter; we return to this
assumption in the discussion. For this interpretation to hold,
it is necessary for certain parametric assumptions to hold. In
this article, we assume that the effects of (i) the instrumen-
tal variables on the risk factor, (ii) the instrumental variables
on the outcome, (iii) the risk factor on the outcome are lin-
ear without effect modification; and (iv) the association of
the genetic variant with the risk factor is homogeneous in
the population.
5
These assumptions are not necessary for the
identification of a causal effect, but they ensure that the esti-
mate from each instrumental variable targets the same average
causal effect.
8
Weaker assumptions can identify a local aver-
age causal effect;
9
however, the local average causal effect is
likely to differ for each instrumental variable. Although these
Copyright © 2016 Wolters Kluwer Health, Inc. All rights reserved.This is
an open access article distributed under the Creative Commons Attribution
License 4.0 (CCBY), which permits unrestricted use, distribution, and repro-
duction in any medium, provided the original work is properly cited.
ISSN: 1044-3983/16/2801-0030
DOI: 10.1097/EDE.0000000000000559
Submitted 9 October 2015; accepted 13 September 2016.
From the
a
Cardiovascular Epidemiology Unit, Department of Public Health
and Primary Care, University of Cambridge, Cambridge, United King-
dom;
b
Medical Research Council Integrative Epidemiology Unit, School
of Social and Community Medicine, University of Bristol, Bristol, United
Kingdom; and
c
Department of Medical Sciences, Molecular Epidemiol-
ogy, Uppsala University, Uppsala, Sweden.
Stephen Burgess is funded by a fellowship from the Wellcome Trust (100114).
Jack Bowden is supported by a Methodology Research Fellowship from
the UK Medical Research Council (Grant Number MR/N501906/1).
Simon G. Thompson is supported by the British Heart Foundation (Grant
Number CH/12/2/29428).
The authors report no conflicts of interest.
Supplemental digital content is available through direct URL citations
in the HTML and PDF versions of this article (www.epidem.com).
Editor’s Note: A Commentary on this article appears on p. 43.
Correspondence: Stephen Burgess, Department of Public Health & Primary
Care, Strangeways Research Laboratory, 2 Worts Causeway, Cambridge,
CB1 8RN, United Kingdom. E-mail: sb452@medschl.cam.ac.uk.
Sensitivity Analyses for Robust Causal Inference from 
Mendelian Randomization Analyses with Multiple 
Genetic Variants
Stephen Burgess,
a
Jack Bowden,
b
Tove Fall,
c
Erik Ingelsson,
c
and Simon G. Thompson
a

Copyright © 2016 Wolters Kluwer Health, Inc. Unauthorized reproduction of this article is prohibited.
Epidemiology •  Volume 28, Number 1, January 2017 Sensitivity Analyses for Mendelian Randomization
© 2016 Wolters Kluwer Health, Inc. All rights reserved. www.epidem.com | 31
assumptions are strict, the causal estimate from an instru-
mental variable analysis is a valid test statistic for the causal
null hypothesis without requiring the assumptions of linearity,
homogeneity, or monotonicity.
10
In any case, the causal effect
of intervention on a risk factor is likely to depend on several
aspects of the intervention (e.g., its magnitude, duration, and
pathway), and therefore will not precisely correspond to the
estimate from a Mendelian randomization analysis.
11
Hence,
we would urge practitioners to view the assessment of causal-
ity as the primary result of a Mendelian randomization, and
not to interpret any causal estimate too literally.
12
We also assume that the genetic variants are mutu-
ally independent in their distributions, although extensions
are available for most of the analysis methods in the case of
correlated variants, provided that the correlation structure is
known.
13
Genetic variants are particularly suitable candidate
instrumental variables, as they are fixed at conception, and
hence cannot be affected by environmental factors that could
otherwise lead to confounding or reverse causation.
14
How-
ever, there are many well-documented ways in which the
instrumental variable assumptions may be violated for any
particular genetic variant, such as pleiotropy, linkage disequi-
librium, and population stratification.
3,15
For risk factors that are soluble protein biomarkers,
there is often a gene region that encodes the protein (for exam-
ple, the CRP gene region for C-reactive protein
16
), or a regula-
tor or inhibitor of the protein (e.g., the IL6R gene region for
interleukin-6
17
). Using one or more variants from such a gene
region as instrumental variables would be ideal for a Mende-
lian randomization analysis, as these genetic variants would
be the most likely to satisfy the instrumental variable assump-
tions, and the most informative proxies for intervention on the
risk factor.
18
However, such genetic variants do not exist for
many risk factors.
The approach of using multiple genetic variants in dif-
ferent gene regions is particularly suitable for complex risk
factors that are multifactorial and polygenic, such as body
mass index,
19
height,
20
or blood pressure.
21
Summarized data
(in particular, beta-coefficients and standard errors) on genetic
associations with the risk factor can be combined with sum-
marized data on genetic associations with the outcome (that
are often publicly available for download) to provide causal
effect estimates, under the assumption that the genetic vari-
ants are all instrumental variables.
22,23
Using multiple genetic
variants increases the power of a Mendelian randomization
investigation compared with an analysis based on a single
variant.
24
However, even if only one of the genetic variants is
not a valid instrumental variable, the causal estimate based on
all the variants from a conventional Mendelian randomization
analysis will be biased and type 1 (false positive) error rates
will be inflated.
25,26
In this article, we describe a range of sensitivity analy-
ses that either support or question the validity of causal infer-
ence from a Mendelian randomization analysis with multiple
genetic variants. These sensitivity analyses will be useful for
judging whether a causal conclusion from such an analysis is
plausible or not. We focus on those sensitivity analyses that
can be implemented using summarized data only. We consider
approaches under two broad categories: methods for assess-
ing the instrumental variable assumptions, and robust analysis
methods that rely on a less stringent set of assumptions than a
conventional Mendelian randomization analysis.
We illustrate the approaches using the example of esti-
mating the causal effect of C-reactive protein (CRP) on coro-
nary artery disease (CAD) risk using four genetic variants in
the CRP gene region,
16
and using 17 genetic variants (eTable
A1; http://links.lww.com/EDE/B114) that have been shown to
be associated with CRP at a genome-wide level of significance
in a large meta-analysis—see eFigure in Ref. 27—beta-coef-
ficients represent per allele associations with log-transformed
CRP concentrations. Genetic associations with CAD risk
were taken from the CARDIoGRAM consortium;
28
beta-
coefficients represent per allele log odds ratios for CAD risk.
Ethical approval for the analyses using four genetic variants in
the CRP gene region was granted by the Cambridgeshire eth-
ics review committee; for the analyses using 17 genetic vari-
ants associated with CRP concentrations and with CAD risk,
ethical approval was granted to the constituent studies by local
institutional review boards.
For reference, the causal estimate based on the genetic
variants in the CRP gene region is null (odds ratio: 1.00, 95%
confidence interval: 0.90, 1.13 per 1-SD increase in CRP con-
centrations [equal to a 1.05-unit increase in log-transformed
CRP or a 2.86-fold increase]), whereas the “causal” estimate
using an inverse-variance weighted method based on the
genome-wide significant variants (a less reliable approach)
22
is negative (odds ratio: 0.87, 95% confidence interval: 0.79,
0.96 per 1-SD increase). Software code for performing the
proposed sensitivity analyses is provided in eAppendix A.1
and A.2 (http://links.lww.com/EDE/B114).
Genetic
variant
Risk factor
Confounders
Outcome
i.
iii.
ii.
FIGURE 1. Diagram of instrumental variable assumptions for 
Mendelian randomization. The three assumptions (i, ii, iii) are 
illustrated by the presence of an arrow, indicating the effect of 
one variable on the other (assumption i), or by a dashed line 
with a cross, indicating that there is no direct effect of one vari-
able on the other (assumptions ii and iii).

Copyright © 2016 Wolters Kluwer Health, Inc. Unauthorized reproduction of this article is prohibited.
Burgess et al. Epidemiology •  Volume 28, Number 1, January 2017
32
| www.epidem.com © 2016 Wolters Kluwer Health, Inc. All rights reserved.
ASSESSING THE INSTRUMENTAL VARIABLE
ASSUMPTIONS
The first set of approaches we consider are those to
assess whether the instrumental variable assumptions are
likely to be satisfied or not for a set of genetic variants. We
consider in turn the assessment of the association with mea-
sured confounders, the exploitation of a natural experiment in
the form of a gene–environment interaction, examination of a
scatter plot combined with a heterogeneity test, and of a fun-
nel plot combined with a test for directional pleiotropy.
Use of Measured Covariates
The assumption that an instrumental variable is not
associated with confounders of the risk factor–outcome
association is not fully testable, as not all confounders will
be known or measured. However, the associations of genetic
variants with measured covariates can be assessed. Lack
of association of the instrumental variable with measured
covariates does not imply lack of association with all con-
founders; however, an association with a measured covariate
should be investigated carefully for a potential pleiotropic
effect of the genetic variant. Figure 2, adapted from Wens-
ley et al.,
16
shows the associations of the four variants in
the CRP gene region with a range of potential confound-
ers. Associations are no stronger than would be expected by
chance alone.
If there are covariates that by biological considerations
should be downstream consequences of the risk factor, then
the associations of genetic variants with these covariates can
be assessed as positive controls to give confidence that the
function of the genetic variants matches the known conse-
quences of the risk factor. For instance, inhibition of inter-
leukin-1 by the drug anakinra has been observed to lead to
decreased levels of C-reactive protein and interleukin-6 in
clinical trials. If genetic variants associated with interleukin-1
are also associated with both these covariates, this makes it
more plausible that the variants are good proxies of interven-
tion on interleukin-1 levels.
29
A benefit of the use of multiple genetic variants is the
possibility to differentiate between pleiotropy and mediation,
two mechanisms by which a genetic variant may be associated
with a measured covariate (Figure 3). If a genetic variant is
associated with a covariate independently of the risk factor
(pleiotropy, or “horizontal pleiotropy”), then the instrumental
variable assumptions are likely to be violated and the genetic
variant should be excluded from an instrumental variable
analysis, as the association with the covariate is likely to open
a causal pathway from the variant to the outcome not via the
risk factor. However, if the genetic variant is associated with a
covariate due to its association with the risk factor of interest
(mediation or “vertical pleiotropy”), and there is no alterna-
tive causal pathway from the variant to the outcome except
for that via the risk factor, then the genetic variant is a valid
instrumental variable.
23
For instance, if increasing body mass index leads to
increased blood pressure, then genetic variants that are instru-
mental variables for body mass index should also be associ-
ated with blood pressure. If multiple genetic variants that are
candidate instrumental variables for body mass index are all
concordantly associated with blood pressure, then it is plau-
sible that the associations are due to mediation, not pleiot-
ropy. In contrast, if only one or two variants are associated
with blood pressure, then this is likely to be a manifestation of
pleiotropy. Pleiotropy and mediation are not mutually exclu-
sive (both could occur for the same covariate); however, this
approach may give an insight into whether the association
relates to a single genetic variant or to variants associated with
the risk factor more widely.
In some cases, valid causal inference may still be pos-
sible even if a genetic variant has a pleiotropic association
with a measured covariate; for instance, by adjusting for the
covariate in the analysis model. However, if the Mendelian
randomization investigation is performed using summarized
data, then the investigator is unlikely to be able to adjust for
covariates. An alternative approach with summarized data is
a multivariable Mendelian randomization analysis, in which
genetic associations with the outcome are regressed on the
genetic associations with the risk factor and covariates in a
multivariable weighted regression model.
30
A practical difficulty of determining which variants to
include in a Mendelian randomization analysis using mea-
sured covariates, aside from that of distinguishing between
pleiotropy and mediation, is that of multiple testing. If there
are large numbers of genetic variants and several measured
covariates, then it is difficult to set a statistical significance
threshold for rejecting a genetic variant as pleiotropic to
balance between the desire to exclude invalid instrumental
variables and the need to acknowledge the multiple tests. A
sensible compromise is to consider multiple thresholds, for
example, a conservative threshold to maximize robustness (a
fixed threshold such as P < 0.01), and a liberal threshold to
maximize power (such as a Bonferroni-corrected threshold
taking into account the number of comparisons made).
23
A
similar approach was previously taken to assess the causal role
of lipid fractions on CAD risk.
31
If no causal effect is detected
even in a liberal analysis, then the plausibility of a null causal
finding increases.
Gene–Environment Interaction
For some applications of Mendelian randomization, a
further natural experiment may be available if the postulated
causal effect is present in one stratum of the population, but
absent in another.
32
For example, the association of alcohol-
related genetic variants with esophageal cancer risk is present
in those who drink alcohol, but absent in abstainers.
33
A gene–
environment interaction provides evidence that a genetic asso-
ciation with the outcome in the population is a result of the
risk factor; if it were a result of pleiotropy, then it would be

Copyright © 2016 Wolters Kluwer Health, Inc. Unauthorized reproduction of this article is prohibited.
Epidemiology •  Volume 28, Number 1, January 2017 Sensitivity Analyses for Mendelian Randomization
© 2016 Wolters Kluwer Health, Inc. All rights reserved. www.epidem.com | 33
likely to be present in both strata of the population. Gene–
environment interactions may be difficult to find, but can pro-
vide convincing evidence of a causal effect.
One potential complication of such an analysis is the
possibility of collider bias;
34
by stratifying on the risk fac-
tor, associations between the genetic variants and the out-
come may be distorted in the strata (in the examples above,
in alcohol consumers/abstainers). To our knowledge, no sys-
tematic investigation has been conducted as to the degree
that collider bias may lead to inappropriate causal infer-
ences in a Mendelian randomization setting, although sen-
sitivity analyses to assess the potential bias in the context of
instrumental variable analysis with a single instrument are
available.
35,36
−0.10.0 0.10.2
0.13 ( 0.11 , 0.14 )
0.00 ( 0.00 , 0.01 )
0.01 ( 0.00 , 0.02 )
0.00 ( −0.01 , 0.01 )
0.00 ( 0.00 , 0.01 )
−0.01 ( −0.02 , 0.00 )
−0.01 ( −0.02 , 0.00 )
0.00 ( −0.01 , 0.01 )
0.00 ( −0.01 , 0.01 )
−0.01 ( −0.02 , 0.00 )
−0.01 ( −0.02 , 0.00 )
−0.01 ( −0.02 , 0.00 )
0.00 ( −0.02 , 0.02 )
0.00 ( −0.02 , 0.02 )
−0.01 ( −0.03 , 0.02 )
0.00 ( −0.01 , 0.01 )
−0.01 ( −0.03 , 0.02 )
0.01 ( 0.00 , 0.02 )
−0.02 ( −0.06 , 0.01 )
0.01 ( 0.00 , 0.02 )
0.00 ( −0.01 , 0.02 )
0.02 ( −0.01 , 0.04 )
Per allele effect
rs1130864
−0.2 0.00.1 0.20.3
0.21 ( 0.17 , 0.24 )
0.00 ( −0.02 , 0.02 )
0.01 ( −0.01 , 0.03 )
0.02 ( 0.00 , 0.05 )
0.01 ( −0.02 , 0.03 )
0.00 ( −0.03 , 0.02 )
0.00 ( −0.02 , 0.03 )
−0.01 ( −0.04 , 0.02 )
0.01 ( −0.01 , 0.03 )
0.00 ( −0.02 , 0.03 )
0.01 ( −0.03 , 0.05 )
0.01 ( −0.02 , 0.05 )
−0.10 ( −0.44 , 0.24 )
−0.02 ( −0.08 , 0.03 )
0.01 ( −0.04 , 0.06 )
0.01 ( −0.01 , 0.04 )
−0.08 ( −0.25 , 0.09 )
−0.01 ( −0.05 , 0.02 )
−0.15 ( −0.35 , 0.05 )
0.02 ( −0.01 , 0.04 )
0.01 ( −0.02 , 0.04 )
0.00 ( −0.02 , 0.02 )
Per allele effect
rs3093077
−0.1 0.0 0.1 0.2
0.17 ( 0.15 , 0.19 )
0.00 ( −0.01 , 0.00 )
0.00 ( −0.01 , 0.01 )
0.00 ( −0.01 , 0.01 )
0.01 ( 0.00 , 0.02 )
0.00 ( −0.01 , 0.01 )
0.00 ( −0.01 , 0.00 )
0.00 ( 0.00 , 0.01 )
0.00 ( −0.01 , 0.01 )
0.00 ( −0.01 , 0.00 )
0.01 ( 0.00 , 0.02 )
0.00 ( −0.01 , 0.01 )
0.01 ( −0.02 , 0.03 )
0.00 ( −0.02 , 0.02 )
0.00 ( −0.02 , 0.02 )
−0.01 ( −0.02 , 0.00 )
−0.01 ( −0.03 , 0.01 )
0.01 ( 0.00 , 0.02 )
−0.02 ( −0.06 , 0.01 )
0.01 ( −0.01 , 0.02 )
0.01 ( 0.00 , 0.02 )
0.01 ( 0.00 , 0.02 )
Variable
log C−reactive protein (mg/l)
Age at survey (yrs)
Body mass index (kg/m²)
Systolic BP (mmHg)
Diastolic BP (mmHg)
To tal cholesterol (mmol/l)
Non−HDL−C (mmol/l)
HDL−C (mmol/l)
log Tr iglycerides (mmol/l)
LDL−C (mmol/l)
Apo A1 (g/l)
Apo B (g/l)
Albumin (g/l)
Lipoprotein(a) (mg/dl)
log Interleukin−6 (mg/l)
Fibrinogen (µmol/l)
log Leukocyte count (× 10^9/l)
Glucose (mmol/l)
Smoking amount (pack yrs)
Weight (kg)
Height (cm)
Waist/Hip ratio
Per allele effect
rs1205
−0.2 0.0 0.1 0.2 0.3
0.26 ( 0.23 , 0.30 )
−0.02 ( −0.04 , 0.01 )
−0.02 ( −0.04 , 0.01 )
0.00 ( −0.03 , 0.03 )
0.00 ( −0.02 , 0.03 )
0.01 ( −0.02 , 0.05 )
0.00 ( −0.04 , 0.04 )
0.02 ( 0.00 , 0.05 )
−0.01 ( −0.06 , 0.03 )
0.01 ( −0.03 , 0.05 )
0.00 ( −0.04 , 0.04 )
0.01 ( −0.03 , 0.05 )
0.00 ( −0.04 , 0.05 )
−0.05 ( −0.11 , 0.01 )
0.00 ( −0.05 , 0.05 )
0.00 ( −0.04 , 0.04 )
0.00 ( −0.05 , 0.06 )
0.00 ( −0.03 , 0.04 )
−0.04 ( −0.11 , 0.04 )
−0.02 ( −0.05 , 0.02 )
0.00 ( −0.03 , 0.03 )
0.00 ( −0.03 , 0.04 )
Variable
log C−reactive protein (mg/l)
Age at survey (yrs)
Body mass index (kg/m²)
Systolic BP (mmHg)
Diastolic BP (mmHg)
To tal cholesterol (mmol/l)
Non−HDL−C (mmol/l)
HDL−C (mmol/l)
log Tr iglycerides (mmol/l)
LDL−C (mmol/l)
Apo A1 (g/l)
Apo B (g/l)
Albumin (g/l)
Lipoprotein(a) (mg/dl)
log Interleukin−6 (mg/l)
Fibrinogen (µmol/l)
log Leukocyte count (× 10^9/l)
Glucose (mmol/l)
Smoking amount (pack yrs)
Weight (kg)
Height (cm)
Waist/Hip ratio
Per allele effect
rs1800947
FIGURE 2. Associations (estimates in standard deviation units and 95% condence intervals) of four genetic variants in the CRP
gene region with a range of covariates per C-reactive protein increasing allele. Adapted from CRP CHD Genetics Collaboration.
16

Citations
More filters
Posted Content
TL;DR: In this paper, the authors investigated conditions sufficient for identification of average treatment effects using instrumental variables and showed that the existence of valid instruments is not sufficient to identify any meaningful average treatment effect.
Abstract: We investigate conditions sufficient for identification of average treatment effects using instrumental variables. First we show that the existence of valid instruments is not sufficient to identify any meaningful average treatment effect. We then establish that the combination of an instrument and a condition on the relation between the instrument and the participation status is sufficient for identification of a local average treatment effect for those who can be induced to change their participation status by changing the value of the instrument. Finally we derive the probability limit of the standard IV estimator under these conditions. It is seen to be a weighted average of local average treatment effects.

3,154 citations

Journal ArticleDOI
TL;DR: The MR-PRESSO test detects and corrects horizontal pleiotropy in multi-instrument Mendelian randomization (MR) analyses and introduces distortions in the causal estimates in MR that ranged on average from –131% to 201%; it is shown using simulations that the MR-pressO test is best suited when horizontal Pleiotropy occurs in <50% of instruments.
Abstract: Horizontal pleiotropy occurs when the variant has an effect on disease outside of its effect on the exposure in Mendelian randomization (MR). Violation of the ‘no horizontal pleiotropy’ assumption can cause severe bias in MR. We developed the Mendelian randomization pleiotropy residual sum and outlier (MR-PRESSO) test to identify horizontal pleiotropic outliers in multi-instrument summary-level MR testing. We showed using simulations that the MR-PRESSO test is best suited when horizontal pleiotropy occurs in 48% of causal relationships.

2,362 citations

Journal ArticleDOI
TL;DR: There are several reasons why causal estimates from the MR-Egger method may be biased and have inflated Type 1 error rates in practice, including violations of the InSIDE assumption and the influence of outlying variants.
Abstract: Stephen Burgess is supported by Sir Henry Dale Fellowship jointly funded by the Wellcome Trust and the Royal Society (Grant Number 204623/Z/16/Z). Simon G. Thompson is supported by the British Heart Foundation (Grant Number CH/12/2/ 29428).

1,066 citations

Journal ArticleDOI
TL;DR: The mode-based estimate (MBE) is proposed to obtain a single causal effect estimate from multiple genetic instruments and is used by investigating the causal effect of plasma lipid fractions and urate levels on coronary heart disease risk.
Abstract: Background Mendelian randomization (MR) is being increasingly used to strengthen causal inference in observational studies. Availability of summary data of genetic associations for a variety of phenotypes from large genome-wide association studies (GWAS) allows straightforward application of MR using summary data methods, typically in a two-sample design. In addition to the conventional inverse variance weighting (IVW) method, recently developed summary data MR methods, such as the MR-Egger and weighted median approaches, allow a relaxation of the instrumental variable assumptions. Methods Here, a new method - the mode-based estimate (MBE) - is proposed to obtain a single causal effect estimate from multiple genetic instruments. The MBE is consistent when the largest number of similar (identical in infinite samples) individual-instrument causal effect estimates comes from valid instruments, even if the majority of instruments are invalid. We evaluate the performance of the method in simulations designed to mimic the two-sample summary data setting, and demonstrate its use by investigating the causal effect of plasma lipid fractions and urate levels on coronary heart disease risk. Results The MBE presented less bias and lower type-I error rates than other methods under the null in many situations. Its power to detect a causal effect was smaller compared with the IVW and weighted median methods, but was larger than that of MR-Egger regression, with sample size requirements typically smaller than those available from GWAS consortia. Conclusions The MBE relaxes the instrumental variable assumptions, and should be used in combination with other approaches in sensitivity analyses.

987 citations

Journal ArticleDOI
TL;DR: MendelianRandomization is a software package for the R open-source software environment that performs Mendelian randomization analyses using summarized data to implement the inverse-variance weighted, MR-Egger and weighted median methods for multiple genetic variants.
Abstract: MendelianRandomization is a software package for the R open-source software environment that performs Mendelian randomization analyses using summarized data. The core functionality is to implement the inverse-variance weighted, MR-Egger and weighted median methods for multiple genetic variants. Several options are available to the user, such as the use of robust regression, fixed- or random-effects models and the penalization of weights for genetic variants with heterogeneous causal estimates. Extensions to these methods, such as allowing for variants to be correlated, can be chosen if appropriate. Graphical commands allow summarized data to be displayed in an interactive graph, or the plotting of causal estimates from multiple methods, for comparison. Although the main method of data entry is directly by the user, there is also an option for allowing summarized data to be incorporated from the PhenoScanner database of genotype-phenotype associations. We hope to develop this feature in future versions of the package. The R software environment is available for download from [https://www.r-project.org/]. The MendelianRandomization package can be downloaded from the Comprehensive R Archive Network (CRAN) within R, or directly from [https://cran.r-project.org/web/packages/MendelianRandomization/]. Both R and the MendelianRandomization package are released under GNU General Public Licenses (GPL-2|GPL-3).

911 citations


Cites methods from "Sensitivity Analyses for Robust Cau..."

  • ...Several publications have used some or all of these methods, and the use of all three methods is recommended when there are multiple genetic variants to assess robustness of any causal finding to different sets of assumptions.(13) Additionally, several variations on these methods have been proposed, such as the use of robust regression instead of standard linear regression in the IVW or MR-Egger methods, or the penalization of weights from genetic variants with heterogeneous causal estimates....

    [...]

References
More filters
Journal ArticleDOI
04 Sep 2003-BMJ
TL;DR: A new quantity is developed, I 2, which the authors believe gives a better measure of the consistency between trials in a meta-analysis, which is susceptible to the number of trials included in the meta- analysis.
Abstract: Cochrane Reviews have recently started including the quantity I 2 to help readers assess the consistency of the results of studies in meta-analyses. What does this new quantity mean, and why is assessment of heterogeneity so important to clinical practice? Systematic reviews and meta-analyses can provide convincing and reliable evidence relevant to many aspects of medicine and health care.1 Their value is especially clear when the results of the studies they include show clinically important effects of similar magnitude. However, the conclusions are less clear when the included studies have differing results. In an attempt to establish whether studies are consistent, reports of meta-analyses commonly present a statistical test of heterogeneity. The test seeks to determine whether there are genuine differences underlying the results of the studies (heterogeneity), or whether the variation in findings is compatible with chance alone (homogeneity). However, the test is susceptible to the number of trials included in the meta-analysis. We have developed a new quantity, I 2, which we believe gives a better measure of the consistency between trials in a meta-analysis. Assessment of the consistency of effects across studies is an essential part of meta-analysis. Unless we know how consistent the results of studies are, we cannot determine the generalisability of the findings of the meta-analysis. Indeed, several hierarchical systems for grading evidence state that the results of studies must be consistent or homogeneous to obtain the highest grading.2–4 Tests for heterogeneity are commonly used to decide on methods for combining studies and for concluding consistency or inconsistency of findings.5 6 But what does the test achieve in practice, and how should the resulting P values be interpreted? A test for heterogeneity examines the null hypothesis that all studies are evaluating the same effect. The usual test statistic …

45,105 citations

Journal ArticleDOI
13 Sep 1997-BMJ
TL;DR: Funnel plots, plots of the trials' effect estimates against sample size, are skewed and asymmetrical in the presence of publication bias and other biases Funnel plot asymmetry, measured by regression analysis, predicts discordance of results when meta-analyses are compared with single large trials.
Abstract: Objective: Funnel plots (plots of effect estimates against sample size) may be useful to detect bias in meta-analyses that were later contradicted by large trials. We examined whether a simple test of asymmetry of funnel plots predicts discordance of results when meta-analyses are compared to large trials, and we assessed the prevalence of bias in published meta-analyses. Design: Medline search to identify pairs consisting of a meta-analysis and a single large trial (concordance of results was assumed if effects were in the same direction and the meta-analytic estimate was within 30% of the trial); analysis of funnel plots from 37 meta-analyses identified from a hand search of four leading general medicine journals 1993-6 and 38 meta-analyses from the second 1996 issue of the Cochrane Database of Systematic Reviews . Main outcome measure: Degree of funnel plot asymmetry as measured by the intercept from regression of standard normal deviates against precision. Results: In the eight pairs of meta-analysis and large trial that were identified (five from cardiovascular medicine, one from diabetic medicine, one from geriatric medicine, one from perinatal medicine) there were four concordant and four discordant pairs. In all cases discordance was due to meta-analyses showing larger effects. Funnel plot asymmetry was present in three out of four discordant pairs but in none of concordant pairs. In 14 (38%) journal meta-analyses and 5 (13%) Cochrane reviews, funnel plot asymmetry indicated that there was bias. Conclusions: A simple analysis of funnel plots provides a useful test for the likely presence of bias in meta-analyses, but as the capacity to detect bias will be limited when meta-analyses are based on a limited number of small trials the results from such analyses should be treated with considerable caution. Key messages Systematic reviews of randomised trials are the best strategy for appraising evidence; however, the findings of some meta-analyses were later contradicted by large trials Funnel plots, plots of the trials9 effect estimates against sample size, are skewed and asymmetrical in the presence of publication bias and other biases Funnel plot asymmetry, measured by regression analysis, predicts discordance of results when meta-analyses are compared with single large trials Funnel plot asymmetry was found in 38% of meta-analyses published in leading general medicine journals and in 13% of reviews from the Cochrane Database of Systematic Reviews Critical examination of systematic reviews for publication and related biases should be considered a routine procedure

37,989 citations

Journal ArticleDOI
TL;DR: This work describes a method that enables explicit detection and correction of population stratification on a genome-wide scale and uses principal components analysis to explicitly model ancestry differences between cases and controls.
Abstract: Population stratification—allele frequency differences between cases and controls due to systematic ancestry differences—can cause spurious associations in disease studies. We describe a method that enables explicit detection and correction of population stratification on a genome-wide scale. Our method uses principal components analysis to explicitly model ancestry differences between cases and controls. The resulting correction is specific to a candidate marker’s variation in frequency across ancestral populations, minimizing spurious associations while maximizing power to detect true associations. Our simple, efficient approach can easily be applied to disease studies with hundreds of thousands of markers. Population stratification—allele frequency differences between cases and controls due to systematic ancestry differences—can cause spurious associations in disease studies 1‐8 . Because the effects of stratification vary in proportion to the number of samples 9 , stratification will be an increasing problem in the large-scale association studies of the future, which will analyze thousands of samples in an effort to detect common genetic variants of weak effect. The two prevailing methods for dealing with stratification are genomic control and structured association 9‐14 . Although genomic control and structured association have proven useful in a variety of contexts, they have limitations. Genomic control corrects for stratification by adjusting association statistics at each marker by a uniform overall inflation factor. However, some markers differ in their allele frequencies across ancestral populations more than others. Thus, the uniform adjustment applied by genomic control may be insufficient at markers having unusually strong differentiation across ancestral populations and may be superfluous at markers devoid of such differentiation, leading to a loss in power. Structured association uses a program such as STRUCTURE 15 to assign the samples to discrete subpopulation clusters and then aggregates evidence of association within each cluster. If fractional membership in more than one cluster is allowed, the method cannot currently be applied to genome-wide association studies because of its intensive computational cost on large data sets. Furthermore, assignments of individuals to clusters are highly sensitive to the number of clusters, which is not well defined 14,16 .

9,387 citations

Journal ArticleDOI
22 Jul 2011-BMJ
TL;DR: How to interpret funnel plot asymmetry, recommends appropriate tests, and explains the implications for choice of meta-analysis model are described.
Abstract: Funnel plots, and tests for funnel plot asymmetry, have been widely used to examine bias in the results of meta-analyses. Funnel plot asymmetry should not be equated with publication bias, because it has a number of other possible causes. This article describes how to interpret funnel plot asymmetry, recommends appropriate tests, and explains the implications for choice of meta-analysis model

4,518 citations

Journal ArticleDOI
TL;DR: Mendelian randomization provides new opportunities to test causality and demonstrates how investment in the human genome project may contribute to understanding and preventing the adverse effects on human health of modifiable exposures.
Abstract: Associations between modifiable exposures and disease seen in observational epidemiology are sometimes confounded and thus misleading, despite our best efforts to improve the design and analysis of studies. Mendelian randomization-the random assortment of genes from parents to offspring that occurs during gamete formation and conception-provides one method for assessing the causal nature of some environmental exposures. The association between a disease and a polymorphism that mimics the biological link between a proposed exposure and disease is not generally susceptible to the reverse causation or confounding that may distort interpretations of conventional observational studies. Several examples where the phenotypic effects of polymorphisms are well documented provide encouraging evidence of the explanatory power of Mendelian randomization and are described. The limitations of the approach include confounding by polymorphisms in linkage disequilibrium with the polymorphism under study, that polymorphisms may have several phenotypic effects associated with disease, the lack of suitable polymorphisms for studying modifiable exposures of interest, and canalization-the buffering of the effects of genetic variation during development. Nevertheless, Mendelian randomization provides new opportunities to test causality and demonstrates how investment in the human genome project may contribute to understanding and preventing the adverse effects on human health of modifiable exposures.

3,646 citations

Frequently Asked Questions (8)
Q1. What are the contributions in this paper?

In this article, the authors discuss a range of sensitivity analyses that will either support or question the validity of causal inference from a Mendelian randomization analysis with multiple genetic variants. 7 A diagram corresponding to these assumptions is presented in Figure 1. the authors further assume that all valid instrumental variables identify the same causal parameter ; they return to this assumption in the discussion. In this article, the authors assume that the effects of ( i ) the instrumental variables on the risk factor, ( ii ) the instrumental variables on the outcome, ( iii ) the risk factor on the outcome are linear without effect modification ; and ( iv ) the association of the genetic variant with the risk factor is homogeneous in the population. This is an open access article distributed under the creative commons Attribution license 4. 0 ( ccBY ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. The authors report no conflicts of interest. Supplemental digital content is available through direct URl citations in the HtMl and PDF versions of this article ( www. epidem. com ). Editor ’ s Note: A commentary on this article appears on p. 43. The authors focus on sensitivity analyses of greatest practical relevance for ensuring robust causal inferences, and those that can be undertaken using summarized data. 

A practical difficulty of determining which variants to include in a Mendelian randomization analysis using measured covariates, aside from that of distinguishing between pleiotropy and mediation, is that of multiple testing. 

For instance, inhibition of interleukin-1 by the drug anakinra has been observed to lead to decreased levels of c-reactive protein and interleukin-6 in clinical trials. 

if there are covariates that by biological considerations should be downstream consequences of the risk factor, then the associations of genetic variants with these covariates can be assessed as positive controls to give confidence that the function of the genetic variants matches the known consequences of the risk factor. 

the penalization and median-based methods allow more general departures from the instrumental variable assumptions for the invalid instruments. 

23For instance, if increasing body mass index leads to increased blood pressure, then genetic variants that are instrumental variables for body mass index should also be associated with blood pressure. 

under an assumption that is weaker than standard instrumental variable assumptions, the slope coefficient from the egger regression method provides an estimate of the causal effect that is consistent asymptotically even if all the genetic variants have pleiotropic effects on the outcome. 

this approach has been applied for investigating the causal effect of lipid fractions on cAD risk.50 More formal penalizationmethods have been proposed using l1-penalization to downweight the contribution of outlying variants to the analysis in a continuous way.