scispace - formally typeset
Search or ask a question
JournalISSN: 0741-0395

Genetic Epidemiology 

Wiley
About: Genetic Epidemiology is an academic journal published by Wiley. The journal publishes majorly in the area(s): Population & Linkage (software). It has an ISSN identifier of 0741-0395. Over the lifetime, 2949 publications have been published receiving 99843 citations.


Papers
More filters
Journal ArticleDOI
TL;DR: A novel weighted median estimator for combining data on multiple genetic variants into a single causal estimate is presented, which is consistent even when up to 50% of the information comes from invalid instrumental variables.
Abstract: Developments in genome-wide association studies and the increasing availability of summary genetic association data have made application of Mendelian randomization relatively straightforward. However, obtaining reliable results from a Mendelian randomization investigation remains problematic, as the conventional inverse-variance weighted method only gives consistent estimates if all of the genetic variants in the analysis are valid instrumental variables. We present a novel weighted median estimator for combining data on multiple genetic variants into a single causal estimate. This estimator is consistent even when up to 50% of the information comes from invalid instrumental variables. In a simulation analysis, it is shown to have better finite-sample Type 1 error rates than the inverse-variance weighted method, and is complementary to the recently proposed MR-Egger (Mendelian randomization-Egger) regression method. In analyses of the causal effects of low-density lipoprotein cholesterol and high-density lipoprotein cholesterol on coronary artery disease risk, the inverse-variance weighted method suggests a causal effect of both lipid fractions, whereas the weighted median and MR-Egger regression methods suggest a null effect of high-density lipoprotein cholesterol that corresponds with the experimental evidence. Both median-based and MR-Egger regression methods should be considered as sensitivity analyses for Mendelian randomization investigations with multiple genetic variants.

2,959 citations

Journal ArticleDOI
TL;DR: It is shown that genotype imputation of common variants using HapMap haplotypes as a reference is very accurate using either genome‐wide SNP data or smaller amounts of data typical in fine‐mapping studies, and it is illustrated how association analyses of unobserved variants will benefit from ongoing advances such as larger Hap map reference panels and whole genome shotgun sequencing technologies.
Abstract: Genome-wide association studies (GWAS) can identify common alleles that contribute to complex disease susceptibility. Despite the large number of SNPs assessed in each study, the effects of most common SNPs must be evaluated indirectly using either genotyped markers or haplotypes thereof as proxies. We have previously implemented a computationally efficient Markov Chain framework for genotype imputation and haplotyping in the freely available MaCH software package. The approach describes sampled chromosomes as mosaics of each other and uses available genotype and shotgun sequence data to estimate unobserved genotypes and haplotypes, together with useful measures of the quality of these estimates. Our approach is already widely used to facilitate comparison of results across studies as well as meta-analyses of GWAS. Here, we use simulations and experimental genotypes to evaluate its accuracy and utility, considering choices of genotyping panels, reference panel configurations, and designs where genotyping is replaced with shotgun sequencing. Importantly, we show that genotype imputation not only facilitates cross study analyses but also increases power of genetic association studies. We show that genotype imputation of common variants using HapMap haplotypes as a reference is very accurate using either genome-wide SNP data or smaller amounts of data typical in fine-mapping studies. Furthermore, we show the approach is applicable in a variety of populations. Finally, we illustrate how association analyses of unobserved variants will benefit from ongoing advances such as larger HapMap reference panels and whole genome shotgun sequencing technologies.

2,015 citations

Journal ArticleDOI
TL;DR: It is concluded that Mendelian randomization investigations using summarized data from uncorrelated variants are similarly efficient to those using individual‐level data, although the necessary assumptions cannot be so fully assessed.
Abstract: Genome-wide association studies, which typically report regression coefficients summarizing the associations of many genetic variants with various traits, are potentially a powerful source of data for Mendelian randomization investigations. We demonstrate how such coefficients from multiple variants can be combined in a Mendelian randomization analysis to estimate the causal effect of a risk factor on an outcome. The bias and efficiency of estimates based on summarized data are compared to those based on individual-level data in simulation studies. We investigate the impact of gene–gene interactions, linkage disequilibrium, and ‘weak instruments’ on these estimates. Both an inverse-variance weighted average of variant-specific associations and a likelihood-based approach for summarized data give similar estimates and precision to the two-stage least squares method for individual-level data, even when there are gene–gene interactions. However, these summarized data methods overstate precision when variants are in linkage disequilibrium. If the P-value in a linear regression of the risk factor for each variant is less than , then weak instrument bias will be small. We use these methods to estimate the causal association of low-density lipoprotein cholesterol (LDL-C) on coronary artery disease using published data on five genetic variants. A 30% reduction in LDL-C is estimated to reduce coronary artery disease risk by 67% (95% CI: 54% to 76%). We conclude that Mendelian randomization investigations using summarized data from uncorrelated variants are similarly efficient to those using individual-level data, although the necessary assumptions cannot be so fully assessed.

2,003 citations

Journal ArticleDOI
TL;DR: A permutation procedure for the null hypothesis of interest is described, which controls for violation of the assumptions, and a likelihood‐ratio test is proposed, using the expectation‐maximization (E‐M) algorithm to account for haplotype ambiguities.
Abstract: Association tests of multilocus haplotypes are of interest both in linkage disequilibrium mapping and in candidate gene studies. For case-parent trios, I discuss the extension of existing multilocus methods to include ambiguous haplotypes in tests of models which distinguish between the cis and trans phase. A likelihood-ratio test is proposed, using the expectation-maximization (E-M) algorithm to account for haplotype ambiguities. Assumptions about the population structure are required, but realistic situations, including population stratification, which violate the assumptions lead to conservative tests. I describe a permutation procedure for the null hypothesis of interest, which controls for violation of the assumptions. For general pedigrees, I describe extensions of the pedigree disequilibrium test to include uncertain haplotypes. The summary statistics are replaced by their expected values over prior distributions of haplotype frequencies. If prior distributions are not available, a valid test is possible by using the E-M algorithm to estimate the null distribution of haplotype frequencies. Similar methods are available for quantitative traits. Exact permutation tests are difficult to construct in small samples, but an approximate procedure is appropriate in large samples, and can be used to account for dependencies between tests of multiple haplotypes and loci.

1,182 citations

Journal ArticleDOI
TL;DR: A broad class of family‐based association tests that are adjusted for admixture; use either dichotomous or measured phenotypes; accommodate phenotype‐unknown subjects; use nuclear families, sibships or a combination of the two, permit multiple nuclear families from a single pedigree; incorporate di‐ or multi‐allelic marker data; and permit adjustment for covariates and gene‐by‐environment interactions are described.
Abstract: We describe a broad class of family-based association tests that are adjusted for admixture; use either dichotomous or measured phenotypes; accommodate phenotype-unknown subjects; use nuclear families, sibships or a combination of the two, permit multiple nuclear families from a single pedigree; incorporate di- or multi-allelic marker data; allow additive, dominant or recessive models; and permit adjustment for covariates and gene-by-environment interactions. The test statistic is the covariance between a user-specified function of the genotype and a user-specified function of the trait. The distribution of the statistic is computed using the appropriate conditional distribution of offspring genotypes that adjusts for admixture. Genet. Epidemiol. 19(Suppl 1):S36–S42, 2000. © 2000 Wiley-Liss, Inc.

896 citations

Performance
Metrics
No. of papers from the Journal in previous years
YearPapers
202326
202249
202164
202067
201971
201865