Author
Wouter van Rheenen
Other affiliations: University Medical Center Utrecht
Bio: Wouter van Rheenen is an academic researcher from Utrecht University. The author has contributed to research in topics: Amyotrophic lateral sclerosis & Genome-wide association study. The author has an hindex of 28, co-authored 53 publications receiving 5929 citations. Previous affiliations of Wouter van Rheenen include University Medical Center Utrecht.
Papers
More filters
••
Wellcome Trust Sanger Institute1, University of Michigan2, University of Oxford3, University of Geneva4, University of Exeter5, Greifswald University Hospital6, National Research Council7, University of Bristol8, University of Colorado Boulder9, Fred Hutchinson Cancer Research Center10, University of Washington11, SUNY Downstate Medical Center12, Erasmus University Rotterdam13, University of Trieste14, VU University Amsterdam15, South London and Maudsley NHS Foundation Trust16, King's College London17, University of Edinburgh18, Harvard University19, National Institutes of Health20, Harokopio University21, Innsbruck Medical University22, Broad Institute23, Lund University24, University of Helsinki25, Norwegian University of Science and Technology26, University of Cambridge27, University of Minnesota28, Technische Universität München29, University of North Carolina at Chapel Hill30, University of Toronto31, McGill University32, Leiden University33, University of Pennsylvania34, University of Groningen35, Utrecht University36, Churchill Hospital37
TL;DR: A reference panel of 64,976 human haplotypes at 39,235,157 SNPs constructed using whole-genome sequence data from 20 studies of predominantly European ancestry leads to accurate genotype imputation at minor allele frequencies as low as 0.1% and a large increase in the number of SNPs tested in association studies.
Abstract: We describe a reference panel of 64,976 human haplotypes at 39,235,157 SNPs constructed using whole-genome sequence data from 20 studies of predominantly European ancestry. Using this resource leads to accurate genotype imputation at minor allele frequencies as low as 0.1% and a large increase in the number of SNPs tested in association studies, and it can help to discover and refine causal loci. We describe remote server resources that allow researchers to carry out imputation and phasing consistently and efficiently.
2,149 citations
01 Jan 2016
TL;DR: In this article, a reference panel of 64,976 human haplotypes at 39,235,157 SNPs constructed using whole-genome sequence data from 20 studies of predominantly European ancestry is presented.
Abstract: We describe a reference panel of 64,976 human haplotypes at 39,235,157 SNPs constructed using whole-genome sequence data from 20 studies of predominantly European ancestry. Using this resource leads to accurate genotype imputation at minor allele frequencies as low as 0.1% and a large increase in the number of SNPs tested in association studies, and it can help to discover and refine causal loci. We describe remote server resources that allow researchers to carry out imputation and phasing consistently and efficiently.
1,261 citations
••
Wouter van Rheenen1, Aleksey Shatunov2, Annelot M. Dekker1, Russell L. McLaughlin3 +184 more•Institutions (54)
TL;DR: Evidence of ALS being a complex genetic trait with a polygenic architecture is established and the SNP-based heritability is estimated at 8.5%, with a distinct and important role for low-frequency variants (frequency 1–10%).
Abstract: To elucidate the genetic architecture of amyotrophic lateral sclerosis (ALS) and find associated loci, we assembled a custom imputation reference panel from whole-genome-sequenced patients with ALS and matched controls (n = 1,861). Through imputation and mixed-model association analysis in 12,577 cases and 23,475 controls, combined with 2,579 cases and 2,767 controls in an independent replication cohort, we fine-mapped a new risk locus on chromosome 21 and identified C21orf2 as a gene associated with ALS risk. In addition, we identified MOBP and SCFD1 as new associated risk loci. We established evidence of ALS being a complex genetic trait with a polygenic architecture. Furthermore, we estimated the SNP-based heritability at 8.5%, with a distinct and important role for low-frequency variants (frequency 1-10%). This study motivates the interrogation of larger samples with full genome coverage to identify rare causal variants that underpin ALS risk.
466 citations
••
TL;DR: Interestingly, mutations predominantly in the N-terminal motor domain of KIF5A are causative for two neurodegenerative diseases: hereditary spastic paraplegia and Charcot-Marie-Tooth type 2.
444 citations
••
University Medical Center Utrecht1, Trinity College, Dublin2, University of Turin3, King's College London4, University of Sheffield5, University of Oxford6, Instituto de Medicina Molecular7, Hannover Medical School8, University of Ulm9, Kantonsspital St. Gallen10, Katholieke Universiteit Leuven11, Beaumont Hospital12
TL;DR: An externally validated model is developed to predict survival without tracheostomy and non-invasive ventilation for more than 23 h per day in European patients with ALS and could be applied to individualised patient management, counselling, and future trial design.
Abstract: Summary Background Amyotrophic lateral sclerosis (ALS) is a relentlessly progressive, fatal motor neuron disease with a variable natural history. There are no accurate models that predict the disease course and outcomes, which complicates risk assessment and counselling for individual patients, stratification of patients for trials, and timing of interventions. We therefore aimed to develop and validate a model for predicting a composite survival endpoint for individual patients with ALS. Methods We obtained data for patients from 14 specialised ALS centres (each one designated as a cohort) in Belgium, France, the Netherlands, Germany, Ireland, Italy, Portugal, Switzerland, and the UK. All patients were diagnosed in the centres after excluding other diagnoses and classified according to revised El Escorial criteria. We assessed 16 patient characteristics as potential predictors of a composite survival outcome (time between onset of symptoms and non-invasive ventilation for more than 23 h per day, tracheostomy, or death) and applied backward elimination with bootstrapping in the largest population-based dataset for predictor selection. Data were gathered on the day of diagnosis or as soon as possible thereafter. Predictors that were selected in more than 70% of the bootstrap resamples were used to develop a multivariable Royston-Parmar model for predicting the composite survival outcome in individual patients. We assessed the generalisability of the model by estimating heterogeneity of predictive accuracy across external populations (ie, populations not used to develop the model) using internal–external cross-validation, and quantified the discrimination using the concordance ( c ) statistic (area under the receiver operator characteristic curve) and calibration using a calibration slope. Findings Data were collected between Jan 1, 1992, and Sept 22, 2016 (the largest data-set included data from 1936 patients). The median follow-up time was 97·5 months (IQR 52·9–168·5). Eight candidate predictors entered the prediction model: bulbar versus non-bulbar onset (univariable hazard ratio [HR] 1·71, 95% CI 1·63–1·79), age at onset (1·03, 1·03–1·03), definite versus probable or possible ALS (1·47, 1·39–1·55), diagnostic delay (0·52, 0·51–0·53), forced vital capacity (HR 0·99, 0·99–0·99), progression rate (6·33, 5·92–6·76), frontotemporal dementia (1·34, 1·20–1·50), and presence of a C9orf72 repeat expansion (1·45, 1·31–1·61), all p c statistic for external predictive accuracy of the model was 0·78 (95% CI 0·77–0·80; 95% prediction interval [PI] 0·74–0·82) and the calibration slope was 1·01 (95% CI 0·95–1·07; 95% PI 0·83–1·18). The model was used to define five groups with distinct median predicted (SE) and observed (SE) times in months from symptom onset to the composite survival outcome: very short 17·7 (0·20), 16·5 (0·23); short 25·3 (0·06), 25·2 (0·35); intermediate 32·2 (0·09), 32·8 (0·46); long 43·7 (0·21), 44·6 (0·74); and very long 91·0 (1·84), 85·6 (1·96). Interpretation We have developed an externally validated model to predict survival without tracheostomy and non-invasive ventilation for more than 23 h per day in European patients with ALS. This model could be applied to individualised patient management, counselling, and future trial design, but to maximise the benefit and prevent harm it is intended to be used by medical doctors only. Funding Netherlands ALS Foundation.
304 citations
Cited by
More filters
••
TL;DR: Deep phenotype and genome-wide genetic data from 500,000 individuals from the UK Biobank is described, describing population structure and relatedness in the cohort, and imputation to increase the number of testable variants to 96 million.
Abstract: The UK Biobank project is a prospective cohort study with deep genetic and phenotypic data collected on approximately 500,000 individuals from across the United Kingdom, aged between 40 and 69 at recruitment. The open resource is unique in its size and scope. A rich variety of phenotypic and health-related information is available on each participant, including biological measurements, lifestyle indicators, biomarkers in blood and urine, and imaging of the body and brain. Follow-up information is provided by linking health and medical records. Genome-wide genotype data have been collected on all participants, providing many opportunities for the discovery of new genetic associations and the genetic bases of complex traits. Here we describe the centralized analysis of the genetic data, including genotype quality, properties of population structure and relatedness of the genetic data, and efficient phasing and genotype imputation that increases the number of testable variants to around 96 million. Classical allelic variation at 11 human leukocyte antigen genes was imputed, resulting in the recovery of signals with known associations between human leukocyte antigen alleles and many diseases.
4,489 citations
••
Institute for Systems Biology1, BC Cancer Agency2, University of California, San Francisco3, University of North Carolina at Chapel Hill4, Columbia University5, Discovery Institute6, Massachusetts Institute of Technology7, Arizona State University8, Sage Bionetworks9, Harvard University10, Johns Hopkins University11, Stanford University12, University of Calgary13, Université libre de Bruxelles14, University of Texas MD Anderson Cancer Center15, Medical College of Wisconsin16, Qatar Airways17, Cold Spring Harbor Laboratory18, University of São Paulo19, Henry Ford Hospital20, University of Alabama at Birmingham21, Van Andel Institute22, Stony Brook University23
TL;DR: An extensive immunogenomic analysis of more than 10,000 tumors comprising 33 diverse cancer types by utilizing data compiled by TCGA identifies six immune subtypes that encompass multiple cancer types and are hypothesized to define immune response patterns impacting prognosis.
3,246 citations
••
TL;DR: The remarkable range of discoveriesGWASs has facilitated in population and complex-trait genetics, the biology of diseases, and translation toward new therapeutics are reviewed.
Abstract: Application of the experimental design of genome-wide association studies (GWASs) is now 10 years old (young), and here we review the remarkable range of discoveries it has facilitated in population and complex-trait genetics, the biology of diseases, and translation toward new therapeutics. We predict the likely discoveries in the next 10 years, when GWASs will be based on millions of samples with array data imputed to a large fully sequenced reference panel and on hundreds of thousands of samples with whole-genome sequencing data.
2,669 citations
••
TL;DR: Improvements to imputation machinery are described that reduce computational requirements by more than an order of magnitude with no loss of accuracy in comparison to standard imputation tools.
Abstract: Christian Fuchsberger, Goncalo Abecasis and colleagues describe a new web-based imputation service that enables rapid imputation of large numbers of samples and allows convenient access to large reference panels of sequenced individuals. Their state space reduction provides a computationally efficient solution for genotype imputation with no loss in imputation accuracy.
2,556 citations
••
TL;DR: It is proposed that gene regulatory networks are sufficiently interconnected such that all genes expressed in disease-relevant cells are liable to affect the functions of core disease-related genes and that most heritability can be explained by effects on genes outside core pathways.
2,257 citations