scispace - formally typeset
Search or ask a question
Author

Javier Simón-Sánchez

Bio: Javier Simón-Sánchez is an academic researcher from University of Tübingen. The author has contributed to research in topics: Genome-wide association study & Population. The author has an hindex of 41, co-authored 64 publications receiving 12461 citations. Previous affiliations of Javier Simón-Sánchez include National Institutes of Health & VU University Amsterdam.


Papers
More filters
Journal ArticleDOI
Alan E. Renton1, Elisa Majounie1, Adrian James Waite2, Javier Simón-Sánchez3, Javier Simón-Sánchez4, Sara Rollinson5, J. Raphael Gibbs6, J. Raphael Gibbs1, Jennifer C. Schymick1, Hannu Laaksovirta7, John C. van Swieten4, John C. van Swieten3, Liisa Myllykangas7, Hannu Kalimo7, Anders Paetau7, Yevgeniya Abramzon1, Anne M. Remes8, Alice Kaganovich1, Sonja W. Scholz1, Sonja W. Scholz9, Sonja W. Scholz10, Jamie Duckworth1, Jinhui Ding1, Daniel W. Harmer11, Dena G. Hernandez1, Dena G. Hernandez6, Janel O. Johnson6, Janel O. Johnson1, Kin Y. Mok6, Mina Ryten6, Danyah Trabzuni6, Rita Guerreiro6, Richard W. Orrell6, James Neal2, Alexandra Murray12, J. P. Pearson2, Iris E. Jansen3, David Sondervan3, Harro Seelaar4, Derek J. Blake2, Kate Young5, Nicola Halliwell5, Janis Bennion Callister5, Greg Toulson5, Anna Richardson5, Alexander Gerhard5, Julie S. Snowden5, David M. A. Mann5, David Neary5, Mike A. Nalls1, Terhi Peuralinna7, Lilja Jansson7, Veli-Matti Isoviita7, Anna-Lotta Kaivorinne8, Maarit Hölttä-Vuori7, Elina Ikonen7, Raimo Sulkava13, Michael Benatar14, Joanne Wuu14, Adriano Chiò15, Gabriella Restagno, Giuseppe Borghero16, Mario Sabatelli17, David Heckerman18, Ekaterina Rogaeva19, Lorne Zinman19, Jeffrey D. Rothstein10, Michael Sendtner20, Carsten Drepper20, Evan E. Eichler21, Can Alkan21, Ziedulla Abdullaev1, Svetlana Pack1, Amalia Dutra1, Evgenia Pak1, John Hardy6, Andrew B. Singleton1, Nigel Williams2, Peter Heutink3, Stuart Pickering-Brown5, Huw R. Morris2, Huw R. Morris22, Huw R. Morris12, Pentti J. Tienari7, Bryan J. Traynor10, Bryan J. Traynor1 
20 Oct 2011-Neuron
TL;DR: The chromosome 9p21 amyotrophic lateral sclerosis-frontotemporal dementia (ALS-FTD) locus contains one of the last major unidentified autosomal-dominant genes underlying these common neurodegenerative diseases, and a large hexanucleotide repeat expansion in the first intron of C9ORF72 is shown.

3,784 citations

Journal ArticleDOI
TL;DR: It is demonstrated that an unequivocal role for common genetic variants in the etiology of typical PD and population-specific genetic heterogeneity in this disease is suggested, and supporting evidence that common variation around LRRK2 modulates risk for PD is provided.
Abstract: We performed a genome-wide association study (GWAS) in 1,713 individuals of European ancestry with Parkinson's disease (PD) and 3,978 controls. After replication in 3,361 cases and 4,573 controls, we observed two strong association signals, one in the gene encoding a-synuclein (SNCA; rs2736990, OR = 1.23, P = 2.24 x 10(-16)) and another at the MAPT locus (rs393152, OR = 0.77, P = 1.95 x 10(-16)). We exchanged data with colleagues performing a GWAS in Japanese PD cases. Association to PD at SNCA was replicated in the Japanese GWAS1, confirming this as a major risk locus across populations. We replicated the effect of a new locus detected in the Japanese cohort (PARK16, rs823128, OR = 0.66, P = 7.29 x 10(-8)) and provide supporting evidence that common variation around LRRK2 modulates risk for PD (rs1491923, OR = 1.14, P = 1.55 x 10(-5)). These data demonstrate an unequivocal role for common genetic variants in the etiology of typical PD and suggest population-specific genetic heterogeneity in this disease.

1,793 citations

Journal ArticleDOI
TL;DR: These data provide the most comprehensive survey of genetic risk within Parkinson's disease to date, providing a biological context for these risk factors, and showing that a considerable genetic component of this disease remains unidentified.
Abstract: Summary Background Genome-wide association studies (GWAS) in Parkinson's disease have increased the scope of biological knowledge about the disease over the past decade. We aimed to use the largest aggregate of GWAS data to identify novel risk loci and gain further insight into the causes of Parkinson's disease. Methods We did a meta-analysis of 17 datasets from Parkinson's disease GWAS available from European ancestry samples to nominate novel loci for disease risk. These datasets incorporated all available data. We then used these data to estimate heritable risk and develop predictive models of this heritability. We also used large gene expression and methylation resources to examine possible functional consequences as well as tissue, cell type, and biological pathway enrichments for the identified risk factors. Additionally, we examined shared genetic risk between Parkinson's disease and other phenotypes of interest via genetic correlations followed by Mendelian randomisation. Findings Between Oct 1, 2017, and Aug 9, 2018, we analysed 7·8 million single nucleotide polymorphisms in 37 688 cases, 18 618 UK Biobank proxy-cases (ie, individuals who do not have Parkinson's disease but have a first degree relative that does), and 1·4 million controls. We identified 90 independent genome-wide significant risk signals across 78 genomic regions, including 38 novel independent risk signals in 37 loci. These 90 variants explained 16–36% of the heritable risk of Parkinson's disease depending on prevalence. Integrating methylation and expression data within a Mendelian randomisation framework identified putatively associated genes at 70 risk signals underlying GWAS loci for follow-up functional studies. Tissue-specific expression enrichment analyses suggested Parkinson's disease loci were heavily brain-enriched, with specific neuronal cell types being implicated from single cell data. We found significant genetic correlations with brain volumes (false discovery rate-adjusted p=0·0035 for intracranial volume, p=0·024 for putamen volume), smoking status (p=0·024), and educational attainment (p=0·038). Mendelian randomisation between cognitive performance and Parkinson's disease risk showed a robust association (p=8·00 × 10−7). Interpretation These data provide the most comprehensive survey of genetic risk within Parkinson's disease to date, to the best of our knowledge, by revealing many additional Parkinson's disease risk loci, providing a biological context for these risk factors, and showing that a considerable genetic component of this disease remains unidentified. These associations derived from European ancestry datasets will need to be followed-up with more diverse data. Funding The National Institute on Aging at the National Institutes of Health (USA), The Michael J Fox Foundation, and The Parkinson's Foundation (see appendix for full list of funding sources).

1,152 citations

Journal ArticleDOI
TL;DR: A common Mendelian genetic lesion in C9orf72 is implicated in many cases of sporadic and familial ALS and FTD, suggesting a one-off expansion occurring about 1500 years ago.
Abstract: Background We aimed to accurately estimate the frequency of a hexanucleotide repeat expansion in C9orf72 that has been associated with a large proportion of cases of amyotrophic lateral sclerosis (ALS) and frontotemporal dementia (FTD). Methods We screened 4448 patients diagnosed with ALS (El Escorial criteria) and 1425 patients with FTD (Lund-Manchester criteria) from 17 regions worldwide for the GGGGCC hexanucleotide expansion using a repeat-primed PCR assay. We assessed familial disease status on the basis of self-reported family history of similar neurodegenerative diseases at the time of sample collection. We compared haplotype data for 262 patients carrying the expansion with the known Finnish founder risk haplotype across the chromosomal locus. We calculated age-related penetrance using the Kaplan-Meier method with data for 603 individuals with the expansion. Findings In patients with sporadic ALS, we identified the repeat expansion in 236 (7·0%) of 3377 white individuals from the USA, Europe, and Australia, two (4·1%) of 49 black individuals from the USA, and six (8·3%) of 72 Hispanic individuals from the USA. The mutation was present in 217 (39·3%) of 552 white individuals with familial ALS from Europe and the USA. 59 (6·0%) of 981 white Europeans with sporadic FTD had the mutation, as did 99 (24·8%) of 400 white Europeans with familial FTD. Data for other ethnic groups were sparse, but we identified one Asian patient with familial ALS (from 20 assessed) and two with familial FTD (from three assessed) who carried the mutation. The mutation was not carried by the three Native Americans or 360 patients from Asia or the Pacific Islands with sporadic ALS who were tested, or by 41 Asian patients with sporadic FTD. All patients with the repeat expansion had (partly or fully) the founder haplotype, suggesting a one-off expansion occurring about 1500 years ago. The pathogenic expansion was non-penetrant in individuals younger than 35 years, 50% penetrant by 58 years, and almost fully penetrant by 80 years. Interpretation A common Mendelian genetic lesion in C9orf72 is implicated in many cases of sporadic and familial ALS and FTD. Testing for this pathogenic expansion should be considered in the management and genetic counselling of patients with these fatal neurodegenerative diseases. Funding Full funding sources listed at end of paper (see Acknowledgments).

951 citations

Journal ArticleDOI
21 Feb 2008-Nature
TL;DR: The analysis of high-quality genotypes at 525,910 single-nucleotide polymorphisms (SNPs) and 396 copy-number-variable loci in a worldwide sample of 29 populations produces new inferences about inter-population variation, support the utility of CNVs in human population-genetic research, and serve as a genomic resource for human- genetic studies in diverse worldwide populations.
Abstract: Genome-wide patterns of variation across individuals provide a powerful source of data for uncovering the history of migration, range expansion, and adaptation of the human species. However, high-resolution surveys of variation in genotype, haplotype and copy number have generally focused on a small number of population groups. Here we report the analysis of high-quality genotypes at 525,910 single-nucleotide polymorphisms (SNPs) and 396 copy-number-variable loci in a worldwide sample of 29 populations. Analysis of SNP genotypes yields strongly supported fine-scale inferences about population structure. Increasing linkage disequilibrium is observed with increasing geographic distance from Africa, as expected under a serial founder effect for the out-of-Africa spread of human populations. New approaches for haplotype analysis produce inferences about population structure that complement results based on unphased SNPs. Despite a difference from SNPs in the frequency spectrum of the copy-number variants (CNVs) detected--including a comparatively large number of CNVs in previously unexamined populations from Oceania and the Americas--the global distribution of CNVs largely accords with population structure analyses for SNP data sets of similar size. Our results produce new inferences about inter-population variation, support the utility of CNVs in human population-genetic research, and serve as a genomic resource for human-genetic studies in diverse worldwide populations.

872 citations


Cited by
More filters
Journal ArticleDOI
TL;DR: In addition to maintaining the GenBank(R) nucleic acid sequence database, the National Center for Biotechnology Information (NCBI) provides data analysis and retrieval resources for the data in GenBank and other biological data made available through NCBI’s website.
Abstract: In addition to maintaining the GenBank(R) nucleic acid sequence database, the National Center for Biotechnology Information (NCBI) provides data analysis and retrieval resources for the data in GenBank and other biological data made available through NCBI's website. NCBI resources include Entrez, PubMed, PubMed Central, LocusLink, the NCBI Taxonomy Browser, BLAST, BLAST Link (BLink), Electronic PCR, OrfFinder, Spidey, RefSeq, UniGene, HomoloGene, ProtEST, dbMHC, dbSNP, Cancer Chromosome Aberration Project (CCAP), Entrez Genomes and related tools, the Map Viewer, Model Maker, Evidence Viewer, Clusters of Orthologous Groups (COGs) database, Retroviral Genotyping Tools, SARS Coronavirus Resource, SAGEmap, Gene Expression Omnibus (GEO), Online Mendelian Inheritance in Man (OMIM), the Molecular Modeling Database (MMDB), the Conserved Domain Database (CDD) and the Conserved Domain Architecture Retrieval Tool (CDART). Augmenting many of the web applications are custom implementations of the BLAST program optimized to search specialized data sets. All of the resources can be accessed through the NCBI home page at: http://www.ncbi.nlm.nih.gov.

9,604 citations

Journal ArticleDOI
TL;DR: The results show that ADMIXTURE's computational speed opens up the possibility of using a much larger set of markers in model-based ancestry estimation and that its estimates are suitable for use in correcting for population stratification in association studies.
Abstract: Population stratification has long been recognized as a confounding factor in genetic association studies. Estimated ancestries, derived from multi-locus genotype data, can be used to perform a statistical correction for population stratification. One popular technique for estimation of ancestry is the model-based approach embodied by the widely applied program structure. Another approach, implemented in the program EIGENSTRAT, relies on Principal Component Analysis rather than model-based estimation and does not directly deliver admixture fractions. EIGENSTRAT has gained in popularity in part owing to its remarkable speed in comparison to structure. We present a new algorithm and a program, ADMIXTURE, for model-based estimation of ancestry in unrelated individuals. ADMIXTURE adopts the likelihood model embedded in structure. However, ADMIXTURE runs considerably faster, solving problems in minutes that take structure hours. In many of our experiments, we have found that ADMIXTURE is almost as fast as EIGENSTRAT. The runtime improvements of ADMIXTURE rely on a fast block relaxation scheme using sequential quadratic programming for block updates, coupled with a novel quasi-Newton acceleration of convergence. Our algorithm also runs faster and with greater accuracy than the implementation of an Expectation-Maximization (EM) algorithm incorporated in the program FRAPPE. Our simulations show that ADMIXTURE's maximum likelihood estimates of the underlying admixture coefficients and ancestral allele frequencies are as accurate as structure's Bayesian estimates. On real-world data sets, ADMIXTURE's estimates are directly comparable to those from structure and EIGENSTRAT. Taken together, our results show that ADMIXTURE's computational speed opens up the possibility of using a much larger set of markers in model-based ancestry estimation and that its estimates are suitable for use in correcting for population stratification in association studies.

5,846 citations

Journal ArticleDOI
Alan E. Renton1, Elisa Majounie1, Adrian James Waite2, Javier Simón-Sánchez3, Javier Simón-Sánchez4, Sara Rollinson5, J. Raphael Gibbs1, J. Raphael Gibbs6, Jennifer C. Schymick1, Hannu Laaksovirta7, John C. van Swieten4, John C. van Swieten3, Liisa Myllykangas7, Hannu Kalimo7, Anders Paetau7, Yevgeniya Abramzon1, Anne M. Remes8, Alice Kaganovich1, Sonja W. Scholz9, Sonja W. Scholz1, Sonja W. Scholz10, Jamie Duckworth1, Jinhui Ding1, Daniel W. Harmer11, Dena G. Hernandez1, Dena G. Hernandez6, Janel O. Johnson6, Janel O. Johnson1, Kin Y. Mok6, Mina Ryten6, Danyah Trabzuni6, Rita Guerreiro6, Richard W. Orrell6, James Neal2, Alexandra Murray12, J. P. Pearson2, Iris E. Jansen4, David Sondervan4, Harro Seelaar3, Derek J. Blake2, Kate Young5, Nicola Halliwell5, Janis Bennion Callister5, Greg Toulson5, Anna Richardson5, Alexander Gerhard5, Julie S. Snowden5, David M. A. Mann5, David Neary5, Mike A. Nalls1, Terhi Peuralinna7, Lilja Jansson7, Veli-Matti Isoviita7, Anna-Lotta Kaivorinne8, Maarit Hölttä-Vuori7, Elina Ikonen7, Raimo Sulkava13, Michael Benatar14, Joanne Wuu14, Adriano Chiò15, Gabriella Restagno, Giuseppe Borghero16, Mario Sabatelli17, David Heckerman18, Ekaterina Rogaeva19, Lorne Zinman19, Jeffrey D. Rothstein10, Michael Sendtner20, Carsten Drepper20, Evan E. Eichler21, Can Alkan21, Ziedulla Abdullaev1, Svetlana Pack1, Amalia Dutra1, Evgenia Pak1, John Hardy6, Andrew B. Singleton1, Nigel Williams2, Peter Heutink4, Stuart Pickering-Brown5, Huw R. Morris2, Huw R. Morris22, Huw R. Morris12, Pentti J. Tienari7, Bryan J. Traynor1, Bryan J. Traynor10 
20 Oct 2011-Neuron
TL;DR: The chromosome 9p21 amyotrophic lateral sclerosis-frontotemporal dementia (ALS-FTD) locus contains one of the last major unidentified autosomal-dominant genes underlying these common neurodegenerative diseases, and a large hexanucleotide repeat expansion in the first intron of C9ORF72 is shown.

3,784 citations

Journal ArticleDOI
TL;DR: Clumpak, available at http://clumpak.tau.ac.il, simplifies the use of model-based analyses of population structure in population genetics and molecular ecology by automating the postprocessing of results of model‐based population structure analyses.
Abstract: The identification of the genetic structure of populations from multilocus genotype data has become a central component of modern population-genetic data analysis. Application of model-based clustering programs often entails a number of steps, in which the user considers different modelling assumptions, compares results across different predetermined values of the number of assumed clusters (a parameter typically denoted K), examines multiple independent runs for each fixed value of K, and distinguishes among runs belonging to substantially distinct clustering solutions. Here, we present CLUMPAK (Cluster Markov Packager Across K), a method that automates the postprocessing of results of model-based population structure analyses. For analysing multiple independent runs at a single K value, CLUMPAK identifies sets of highly similar runs, separating distinct groups of runs that represent distinct modes in the space of possible solutions. This procedure, which generates a consensus solution for each distinct mode, is performed by the use of a Markov clustering algorithm that relies on a similarity matrix between replicate runs, as computed by the software CLUMPP. Next, CLUMPAK identifies an optimal alignment of inferred clusters across different values of K, extending a similar approach implemented for a fixed K in CLUMPP and simplifying the comparison of clustering results across different K values. CLUMPAK incorporates additional features, such as implementations of methods for choosing K and comparing solutions obtained by different programs, models, or data subsets. CLUMPAK, available at http://clumpak.tau.ac.il, simplifies the use of model-based analyses of population structure in population genetics and molecular ecology.

2,252 citations