scispace - formally typeset
Search or ask a question
Author

J. Raphael Gibbs

Bio: J. Raphael Gibbs is an academic researcher from National Institutes of Health. The author has contributed to research in topics: Genome-wide association study & Single-nucleotide polymorphism. The author has an hindex of 54, co-authored 116 publications receiving 17508 citations. Previous affiliations of J. Raphael Gibbs include University College London & Johns Hopkins University.


Papers
More filters
Journal ArticleDOI
Alan E. Renton1, Elisa Majounie1, Adrian James Waite2, Javier Simón-Sánchez3, Javier Simón-Sánchez4, Sara Rollinson5, J. Raphael Gibbs6, J. Raphael Gibbs1, Jennifer C. Schymick1, Hannu Laaksovirta7, John C. van Swieten4, John C. van Swieten3, Liisa Myllykangas7, Hannu Kalimo7, Anders Paetau7, Yevgeniya Abramzon1, Anne M. Remes8, Alice Kaganovich1, Sonja W. Scholz1, Sonja W. Scholz9, Sonja W. Scholz10, Jamie Duckworth1, Jinhui Ding1, Daniel W. Harmer11, Dena G. Hernandez1, Dena G. Hernandez6, Janel O. Johnson1, Janel O. Johnson6, Kin Y. Mok6, Mina Ryten6, Danyah Trabzuni6, Rita Guerreiro6, Richard W. Orrell6, James Neal2, Alexandra Murray12, J. P. Pearson2, Iris E. Jansen4, David Sondervan4, Harro Seelaar3, Derek J. Blake2, Kate Young5, Nicola Halliwell5, Janis Bennion Callister5, Greg Toulson5, Anna Richardson5, Alexander Gerhard5, Julie S. Snowden5, David M. A. Mann5, David Neary5, Mike A. Nalls1, Terhi Peuralinna7, Lilja Jansson7, Veli-Matti Isoviita7, Anna-Lotta Kaivorinne8, Maarit Hölttä-Vuori7, Elina Ikonen7, Raimo Sulkava13, Michael Benatar14, Joanne Wuu14, Adriano Chiò15, Gabriella Restagno, Giuseppe Borghero16, Mario Sabatelli17, David Heckerman18, Ekaterina Rogaeva19, Lorne Zinman19, Jeffrey D. Rothstein10, Michael Sendtner20, Carsten Drepper20, Evan E. Eichler21, Can Alkan21, Ziedulla Abdullaev1, Svetlana Pack1, Amalia Dutra1, Evgenia Pak1, John Hardy6, Andrew B. Singleton1, Nigel Williams2, Peter Heutink4, Stuart Pickering-Brown5, Huw R. Morris22, Huw R. Morris2, Huw R. Morris12, Pentti J. Tienari7, Bryan J. Traynor10, Bryan J. Traynor1 
20 Oct 2011-Neuron
TL;DR: The chromosome 9p21 amyotrophic lateral sclerosis-frontotemporal dementia (ALS-FTD) locus contains one of the last major unidentified autosomal-dominant genes underlying these common neurodegenerative diseases, and a large hexanucleotide repeat expansion in the first intron of C9ORF72 is shown.

3,784 citations

Journal ArticleDOI
TL;DR: It is demonstrated that an unequivocal role for common genetic variants in the etiology of typical PD and population-specific genetic heterogeneity in this disease is suggested, and supporting evidence that common variation around LRRK2 modulates risk for PD is provided.
Abstract: We performed a genome-wide association study (GWAS) in 1,713 individuals of European ancestry with Parkinson's disease (PD) and 3,978 controls. After replication in 3,361 cases and 4,573 controls, we observed two strong association signals, one in the gene encoding a-synuclein (SNCA; rs2736990, OR = 1.23, P = 2.24 x 10(-16)) and another at the MAPT locus (rs393152, OR = 0.77, P = 1.95 x 10(-16)). We exchanged data with colleagues performing a GWAS in Japanese PD cases. Association to PD at SNCA was replicated in the Japanese GWAS1, confirming this as a major risk locus across populations. We replicated the effect of a new locus detected in the Japanese cohort (PARK16, rs823128, OR = 0.66, P = 7.29 x 10(-8)) and provide supporting evidence that common variation around LRRK2 modulates risk for PD (rs1491923, OR = 1.14, P = 1.55 x 10(-5)). These data demonstrate an unequivocal role for common genetic variants in the etiology of typical PD and suggest population-specific genetic heterogeneity in this disease.

1,793 citations

Journal ArticleDOI
TL;DR: These data provide the most comprehensive survey of genetic risk within Parkinson's disease to date, providing a biological context for these risk factors, and showing that a considerable genetic component of this disease remains unidentified.
Abstract: Summary Background Genome-wide association studies (GWAS) in Parkinson's disease have increased the scope of biological knowledge about the disease over the past decade. We aimed to use the largest aggregate of GWAS data to identify novel risk loci and gain further insight into the causes of Parkinson's disease. Methods We did a meta-analysis of 17 datasets from Parkinson's disease GWAS available from European ancestry samples to nominate novel loci for disease risk. These datasets incorporated all available data. We then used these data to estimate heritable risk and develop predictive models of this heritability. We also used large gene expression and methylation resources to examine possible functional consequences as well as tissue, cell type, and biological pathway enrichments for the identified risk factors. Additionally, we examined shared genetic risk between Parkinson's disease and other phenotypes of interest via genetic correlations followed by Mendelian randomisation. Findings Between Oct 1, 2017, and Aug 9, 2018, we analysed 7·8 million single nucleotide polymorphisms in 37 688 cases, 18 618 UK Biobank proxy-cases (ie, individuals who do not have Parkinson's disease but have a first degree relative that does), and 1·4 million controls. We identified 90 independent genome-wide significant risk signals across 78 genomic regions, including 38 novel independent risk signals in 37 loci. These 90 variants explained 16–36% of the heritable risk of Parkinson's disease depending on prevalence. Integrating methylation and expression data within a Mendelian randomisation framework identified putatively associated genes at 70 risk signals underlying GWAS loci for follow-up functional studies. Tissue-specific expression enrichment analyses suggested Parkinson's disease loci were heavily brain-enriched, with specific neuronal cell types being implicated from single cell data. We found significant genetic correlations with brain volumes (false discovery rate-adjusted p=0·0035 for intracranial volume, p=0·024 for putamen volume), smoking status (p=0·024), and educational attainment (p=0·038). Mendelian randomisation between cognitive performance and Parkinson's disease risk showed a robust association (p=8·00 × 10−7). Interpretation These data provide the most comprehensive survey of genetic risk within Parkinson's disease to date, to the best of our knowledge, by revealing many additional Parkinson's disease risk loci, providing a biological context for these risk factors, and showing that a considerable genetic component of this disease remains unidentified. These associations derived from European ancestry datasets will need to be followed-up with more diverse data. Funding The National Institute on Aging at the National Institutes of Health (USA), The Michael J Fox Foundation, and The Parkinson's Foundation (see appendix for full list of funding sources).

1,152 citations

Journal ArticleDOI
09 Dec 2010-Neuron
TL;DR: Exome sequencing data broaden the phenotype of IBMPFD to include motor neuron degeneration, suggest that VCP mutations may account for ∼1%-2% of familial ALS, and provide evidence directly implicating defects in the ubiquitination/protein degradation pathway in motor neurons degeneration.

1,040 citations

Journal ArticleDOI
21 Feb 2008-Nature
TL;DR: The analysis of high-quality genotypes at 525,910 single-nucleotide polymorphisms (SNPs) and 396 copy-number-variable loci in a worldwide sample of 29 populations produces new inferences about inter-population variation, support the utility of CNVs in human population-genetic research, and serve as a genomic resource for human- genetic studies in diverse worldwide populations.
Abstract: Genome-wide patterns of variation across individuals provide a powerful source of data for uncovering the history of migration, range expansion, and adaptation of the human species. However, high-resolution surveys of variation in genotype, haplotype and copy number have generally focused on a small number of population groups. Here we report the analysis of high-quality genotypes at 525,910 single-nucleotide polymorphisms (SNPs) and 396 copy-number-variable loci in a worldwide sample of 29 populations. Analysis of SNP genotypes yields strongly supported fine-scale inferences about population structure. Increasing linkage disequilibrium is observed with increasing geographic distance from Africa, as expected under a serial founder effect for the out-of-Africa spread of human populations. New approaches for haplotype analysis produce inferences about population structure that complement results based on unphased SNPs. Despite a difference from SNPs in the frequency spectrum of the copy-number variants (CNVs) detected--including a comparatively large number of CNVs in previously unexamined populations from Oceania and the Americas--the global distribution of CNVs largely accords with population structure analyses for SNP data sets of similar size. Our results produce new inferences about inter-population variation, support the utility of CNVs in human population-genetic research, and serve as a genomic resource for human-genetic studies in diverse worldwide populations.

872 citations


Cited by
More filters
Journal ArticleDOI
TL;DR: In addition to maintaining the GenBank(R) nucleic acid sequence database, the National Center for Biotechnology Information (NCBI) provides data analysis and retrieval resources for the data in GenBank and other biological data made available through NCBI’s website.
Abstract: In addition to maintaining the GenBank(R) nucleic acid sequence database, the National Center for Biotechnology Information (NCBI) provides data analysis and retrieval resources for the data in GenBank and other biological data made available through NCBI's website. NCBI resources include Entrez, PubMed, PubMed Central, LocusLink, the NCBI Taxonomy Browser, BLAST, BLAST Link (BLink), Electronic PCR, OrfFinder, Spidey, RefSeq, UniGene, HomoloGene, ProtEST, dbMHC, dbSNP, Cancer Chromosome Aberration Project (CCAP), Entrez Genomes and related tools, the Map Viewer, Model Maker, Evidence Viewer, Clusters of Orthologous Groups (COGs) database, Retroviral Genotyping Tools, SARS Coronavirus Resource, SAGEmap, Gene Expression Omnibus (GEO), Online Mendelian Inheritance in Man (OMIM), the Molecular Modeling Database (MMDB), the Conserved Domain Database (CDD) and the Conserved Domain Architecture Retrieval Tool (CDART). Augmenting many of the web applications are custom implementations of the BLAST program optimized to search specialized data sets. All of the resources can be accessed through the NCBI home page at: http://www.ncbi.nlm.nih.gov.

9,604 citations

Journal ArticleDOI
TL;DR: The results show that ADMIXTURE's computational speed opens up the possibility of using a much larger set of markers in model-based ancestry estimation and that its estimates are suitable for use in correcting for population stratification in association studies.
Abstract: Population stratification has long been recognized as a confounding factor in genetic association studies. Estimated ancestries, derived from multi-locus genotype data, can be used to perform a statistical correction for population stratification. One popular technique for estimation of ancestry is the model-based approach embodied by the widely applied program structure. Another approach, implemented in the program EIGENSTRAT, relies on Principal Component Analysis rather than model-based estimation and does not directly deliver admixture fractions. EIGENSTRAT has gained in popularity in part owing to its remarkable speed in comparison to structure. We present a new algorithm and a program, ADMIXTURE, for model-based estimation of ancestry in unrelated individuals. ADMIXTURE adopts the likelihood model embedded in structure. However, ADMIXTURE runs considerably faster, solving problems in minutes that take structure hours. In many of our experiments, we have found that ADMIXTURE is almost as fast as EIGENSTRAT. The runtime improvements of ADMIXTURE rely on a fast block relaxation scheme using sequential quadratic programming for block updates, coupled with a novel quasi-Newton acceleration of convergence. Our algorithm also runs faster and with greater accuracy than the implementation of an Expectation-Maximization (EM) algorithm incorporated in the program FRAPPE. Our simulations show that ADMIXTURE's maximum likelihood estimates of the underlying admixture coefficients and ancestral allele frequencies are as accurate as structure's Bayesian estimates. On real-world data sets, ADMIXTURE's estimates are directly comparable to those from structure and EIGENSTRAT. Taken together, our results show that ADMIXTURE's computational speed opens up the possibility of using a much larger set of markers in model-based ancestry estimation and that its estimates are suitable for use in correcting for population stratification in association studies.

5,846 citations

01 Feb 2015
TL;DR: In this article, the authors describe the integrative analysis of 111 reference human epigenomes generated as part of the NIH Roadmap Epigenomics Consortium, profiled for histone modification patterns, DNA accessibility, DNA methylation and RNA expression.
Abstract: The reference human genome sequence set the stage for studies of genetic variation and its association with human disease, but epigenomic studies lack a similar reference. To address this need, the NIH Roadmap Epigenomics Consortium generated the largest collection so far of human epigenomes for primary cells and tissues. Here we describe the integrative analysis of 111 reference human epigenomes generated as part of the programme, profiled for histone modification patterns, DNA accessibility, DNA methylation and RNA expression. We establish global maps of regulatory elements, define regulatory modules of coordinated activity, and their likely activators and repressors. We show that disease- and trait-associated genetic variants are enriched in tissue-specific epigenomic marks, revealing biologically relevant cell types for diverse human traits, and providing a resource for interpreting the molecular basis of human disease. Our results demonstrate the central role of epigenomic information for understanding gene regulation, cellular differentiation and human disease.

4,409 citations

Journal ArticleDOI
TL;DR: It is proposed that DNA methylation age measures the cumulative effect of an epigenetic maintenance system, and can be used to address a host of questions in developmental biology, cancer and aging research.
Abstract: It is not yet known whether DNA methylation levels can be used to accurately predict age across a broad spectrum of human tissues and cell types, nor whether the resulting age prediction is a biologically meaningful measure. I developed a multi-tissue predictor of age that allows one to estimate the DNA methylation age of most tissues and cell types. The predictor, which is freely available, was developed using 8,000 samples from 82 Illumina DNA methylation array datasets, encompassing 51 healthy tissues and cell types. I found that DNA methylation age has the following properties: first, it is close to zero for embryonic and induced pluripotent stem cells; second, it correlates with cell passage number; third, it gives rise to a highly heritable measure of age acceleration; and, fourth, it is applicable to chimpanzee tissues. Analysis of 6,000 cancer samples from 32 datasets showed that all of the considered 20 cancer types exhibit significant age acceleration, with an average of 36 years. Low age-acceleration of cancer tissue is associated with a high number of somatic mutations and TP53 mutations, while mutations in steroid receptors greatly accelerate DNA methylation age in breast cancer. Finally, I characterize the 353 CpG sites that together form an aging clock in terms of chromatin states and tissue variance. I propose that DNA methylation age measures the cumulative effect of an epigenetic maintenance system. This novel epigenetic clock can be used to address a host of questions in developmental biology, cancer and aging research.

4,233 citations