Author

Shamil R. Sunyaev

Other affiliations: Leiden University, Massachusetts Institute of Technology, Max Delbrück Center for Molecular Medicine, ...

Bio: Shamil R. Sunyaev is an academic researcher at Harvard University. The author has contributed to research on the topics Population and Genome-wide association study. The author has an h-index of 77 and has co-authored 207 publications that have received 57,138 citations. Previous affiliations of Shamil R. Sunyaev include Leiden University and the Massachusetts Institute of Technology.

Papers published on a yearly basis (chart; years shown: 1997 to 2023)

Papers

Journal Article•DOI•

A method and server for predicting damaging missense mutations.

Ivan Adzhubei¹, Steffen Schmidt², Leonid Peshkin³, Vasily Ramensky⁴, Anna Gerasimova⁵, Peer Bork, Alexey S. Kondrashov⁵, Shamil R. Sunyaev¹

Brigham and Women's Hospital¹, Max Planck Society², Harvard University³, Engelhardt Institute of Molecular Biology⁴, University of Michigan⁵

01 Apr 2010-Nature Methods

TL;DR: Presents PolyPhen-2, a new method and software tool that differs from the earlier PolyPhen in its set of predictive features, alignment pipeline, and method of classification. Its performance, as measured by receiver operating characteristic curves, was consistently superior to that of PolyPhen.

Abstract: To the Editor: Applications of rapidly advancing sequencing technologies exacerbate the need to interpret individual sequence variants. Sequencing of phenotyped clinical subjects will soon become a method of choice in studies of the genetic causes of Mendelian and complex diseases. New exon capture techniques will direct sequencing efforts towards the most informative and easily interpretable protein-coding fraction of the genome. Thus, the demand for computational predictions of the impact of protein sequence variants will continue to grow. Here we present a new method and the corresponding software tool, PolyPhen-2 (http://genetics.bwh.harvard.edu/pph2/), which is different from the early tool PolyPhen¹ in the set of predictive features, alignment pipeline, and the method of classification (Fig. 1a). PolyPhen-2 uses eight sequence-based and three structure-based predictive features (Supplementary Table 1), which were selected automatically by an iterative greedy algorithm (Supplementary Methods). The majority of these features involve comparison of a property of the wild-type (ancestral, normal) allele and the corresponding property of the mutant (derived, disease-causing) allele, which together define an amino acid replacement. The most informative features characterize how well the two human alleles fit into the pattern of amino acid replacements within the multiple sequence alignment of homologous proteins, how distant the protein harboring the first deviation from the human wild-type allele is from the human protein, and whether the mutant allele originated at a hypermutable site². The alignment pipeline selects the set of homologous sequences for the analysis using a clustering algorithm and then constructs and refines their multiple alignment (Supplementary Fig. 1). The functional significance of an allele replacement is predicted from its individual features (Supplementary Figs. 2–4) by a Naive Bayes classifier (Supplementary Methods).
Figure 1: PolyPhen-2 pipeline and prediction accuracy. (a) Overview of the algorithm. (b) Receiver operating characteristic (ROC) curves for predictions made by PolyPhen-2 using five-fold cross-validation on HumDiv (red) and HumVar³ (light green). UniRef100 (solid ...
We used two pairs of datasets to train and test PolyPhen-2. We compiled the first pair, HumDiv, from all 3,155 damaging alleles with known effects on the molecular function causing human Mendelian diseases, present in the UniProt database, together with 6,321 differences between human proteins and their closely related mammalian homologs, assumed to be non-damaging (Supplementary Methods). The second pair, HumVar³, consists of all the 13,032 human disease-causing mutations from UniProt, together with 8,946 human nsSNPs without annotated involvement in disease, which were treated as non-damaging. We found that PolyPhen-2 performance, as presented by its receiver operating characteristic curves, was consistently superior compared to PolyPhen (Fig. 1b) and it also compared favorably with the three other popular prediction tools⁴⁻⁶ (Fig. 1c). For a false positive rate of 20%, PolyPhen-2 achieves true positive rates of 92% and 73% on HumDiv and HumVar, respectively (Supplementary Table 2). One reason for the lower accuracy of predictions on HumVar is that nsSNPs assumed to be non-damaging in HumVar contain a sizable fraction of mildly deleterious alleles. In contrast, most of the amino acid replacements assumed non-damaging in HumDiv must be close to selective neutrality. Because alleles that are even mildly but unconditionally deleterious cannot be fixed in the evolving lineage, no method based on comparative sequence analysis is ideal for discriminating between drastically and mildly deleterious mutations, which are assigned to the opposite categories in HumVar. Another reason is that HumDiv uses an extra criterion to avoid possible erroneous annotations of damaging mutations.
For a mutation, PolyPhen-2 calculates the Naive Bayes posterior probability that this mutation is damaging and reports estimates of the false positive rate (the chance that the mutation is classified as damaging when it is in fact non-damaging) and the true positive rate (the chance that the mutation is classified as damaging when it is indeed damaging). A mutation is also appraised qualitatively, as benign, possibly damaging, or probably damaging (Supplementary Methods). The user can choose between HumDiv- and HumVar-trained PolyPhen-2. Diagnostics of Mendelian diseases requires distinguishing mutations with drastic effects from all the remaining human variation, including abundant mildly deleterious alleles. Thus, HumVar-trained PolyPhen-2 should be used for this task. In contrast, HumDiv-trained PolyPhen-2 should be used for evaluating rare alleles at loci potentially involved in complex phenotypes, dense mapping of regions identified by genome-wide association studies, and analysis of natural selection from sequence data, where even mildly deleterious alleles must be treated as damaging.
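
The quantities named in the abstract above (a Naive Bayes posterior, a three-level qualitative appraisal, and a true positive rate at a fixed false positive rate) can be illustrated with a minimal Python sketch. This is not PolyPhen-2's implementation: the function names, the likelihood-ratio inputs, and the two cutoffs are hypothetical, and the abstract does not state the cutoffs the tool actually uses.

```python
import math
from typing import Sequence


def naive_bayes_posterior(prior_damaging: float,
                          likelihood_ratios: Sequence[float]) -> float:
    """Posterior P(damaging | features) under the Naive Bayes assumption.

    Each ratio is P(feature value | damaging) / P(feature value | non-damaging).
    Features are treated as conditionally independent, so log-ratios add.
    The inputs are illustrative, not PolyPhen-2's real features.
    """
    log_odds = math.log(prior_damaging / (1.0 - prior_damaging))
    log_odds += sum(math.log(r) for r in likelihood_ratios)
    return 1.0 / (1.0 + math.exp(-log_odds))


def appraise(posterior: float, possibly_cutoff: float,
             probably_cutoff: float) -> str:
    """Map a posterior to a qualitative label.

    Both cutoffs are caller-supplied assumptions (hypothetical values).
    """
    if posterior >= probably_cutoff:
        return "probably damaging"
    if posterior >= possibly_cutoff:
        return "possibly damaging"
    return "benign"


def tpr_at_fpr(damaging_scores: Sequence[float],
               benign_scores: Sequence[float],
               max_fpr: float) -> float:
    """Best true positive rate among thresholds whose FPR is <= max_fpr.

    A variant is called damaging when its score is >= the threshold.
    """
    best_tpr = 0.0
    for threshold in sorted(set(damaging_scores) | set(benign_scores)):
        fpr = sum(s >= threshold for s in benign_scores) / len(benign_scores)
        if fpr <= max_fpr:
            tpr = sum(s >= threshold for s in damaging_scores) / len(damaging_scores)
            best_tpr = max(best_tpr, tpr)
    return best_tpr
```

Calling `tpr_at_fpr(scores_damaging, scores_benign, 0.20)` on held-out scores corresponds to reading one point off an ROC curve, which is how a figure such as "92% true positives at a 20% false positive rate" would be obtained from a scored test set.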

11,571 citations

Journal Article•DOI•

Identification and analysis of functional elements in 1% of the human genome by the ENCODE pilot project

Ewan Birney, John A. Stamatoyannopoulos¹, Anindya Dutta², Roderic Guigó³ +317 more•Institutions (44)

14 Jun 2007-Nature

TL;DR: Functional data from multiple, diverse experiments performed on a targeted 1% of the human genome as part of the pilot phase of the ENCODE Project are reported, providing convincing evidence that the genome is pervasively transcribed, such that the majority of its bases can be found in primary transcripts.

Abstract: We report the generation and analysis of functional data from multiple, diverse experiments performed on a targeted 1% of the human genome as part of the pilot phase of the ENCODE Project. These data have been further integrated and augmented by a number of evolutionary and computational analyses. Together, our results advance the collective knowledge about human genome function in several major areas. First, our studies provide convincing evidence that the genome is pervasively transcribed, such that the majority of its bases can be found in primary transcripts, including non-protein-coding transcripts, and those that extensively overlap one another. Second, systematic examination of transcriptional regulation has yielded new understanding about transcription start sites, including their relationship to specific regulatory sequences and features of chromatin accessibility and histone modification. Third, a more sophisticated view of chromatin structure has emerged, including its inter-relationship with DNA replication and transcriptional regulation. Finally, integration of these new sources of information, in particular with respect to mammalian evolution based on inter- and intra-species sequence comparisons, has yielded new mechanistic and evolutionary insights concerning the functional landscape of the human genome. Together, these studies are defining a path for pursuit of a more comprehensive characterization of human genome function.

5,091 citations

Journal Article•DOI•

Integrative analysis of 111 reference human epigenomes

Anshul Kundaje¹, Wouter Meuleman¹,², Jason Ernst³, Misha Bilenky⁴, Angela Yen¹,², Alireza Heravi-Moussavi⁴, Pouya Kheradpour¹,², Zhizhuo Zhang¹,², Jianrong Wang¹,², Michael J. Ziller², Viren Amin⁵, John W. Whitaker, Matthew D. Schultz⁶, Lucas D. Ward¹,², Abhishek Sarkar¹,², Gerald Quon¹,², Richard Sandstrom⁷, Matthew L. Eaton¹,², Yi-Chieh Wu¹,², Andreas R. Pfenning¹,², Xinchen Wang¹,², Melina Claussnitzer¹,², Yaping Liu¹,², Cristian Coarfa⁵, R. Alan Harris⁵, Noam Shoresh², Charles B. Epstein², Elizabeta Gjoneska¹,², Danny Leung⁸, Wei Xie⁸, R. David Hawkins⁸, Ryan Lister⁶, Chibo Hong⁹, Philippe Gascard⁹, Andrew J. Mungall⁴, Richard A. Moore⁴, Eric Chuah⁴, Angela Tam⁴, Theresa K. Canfield⁷, R. Scott Hansen⁷, Rajinder Kaul⁷, Peter J. Sabo⁷, Mukul S. Bansal¹,²,¹⁰, Annaick Carles⁴, Jesse R. Dixon⁸, Kai How Farh², Soheil Feizi¹,², Rosa Karlic¹¹, Ah Ram Kim¹,², Ashwinikumar Kulkarni¹², Daofeng Li¹³, Rebecca F. Lowdon¹³, Ginell Elliott¹³, Tim R. Mercer¹⁴, Shane Neph⁷, Vitor Onuchic⁵, Paz Polak²,¹⁵, Nisha Rajagopal⁸, Pradipta R. Ray¹², Richard C Sallari¹,², Kyle Siebenthall⁷, Nicholas A Sinnott-Armstrong¹,², Michael Stevens¹³, Robert E. Thurman⁷, Jie Wu¹⁶, Bo Zhang¹³, Xin Zhou¹³, Arthur E. Beaudet⁵, Laurie A. Boyer¹, Philip L. De Jager²,¹⁵, Peggy J. Farnham¹⁷, Susan J. Fisher⁹, David Haussler¹⁸, Steven J.M. Jones⁴,¹⁹, Wei Li⁵, Marco A. Marra⁴, Michael T. McManus⁹, Shamil R. Sunyaev²,¹⁵, James A. Thomson²⁰, Thea D. Tlsty⁹, Li-Huei Tsai¹,², Wei Wang, Robert A. Waterland⁵, Michael Q. Zhang²¹, Lisa Helbling Chadwick²², Bradley E. Bernstein²,⁶,¹⁵, Joseph F. Costello⁹, Joseph R. Ecker¹¹, Martin Hirst⁴, Alexander Meissner², Aleksandar Milosavljevic⁵, Bing Ren⁸, John A. Stamatoyannopoulos⁷, Ting Wang¹³, Manolis Kellis¹,²

Massachusetts Institute of Technology¹, Broad Institute², University of California, Los Angeles³, University of British Columbia⁴, Baylor College of Medicine⁵, Howard Hughes Medical Institute⁶, University of Washington⁷, Ludwig Institute for Cancer Research⁸, University of California, San Francisco⁹, University of Connecticut¹⁰, University of Zagreb¹¹, University of Texas at Austin¹², Washington University in St. Louis¹³, University of Queensland¹⁴, Harvard University¹⁵, Cold Spring Harbor Laboratory¹⁶, University of Southern California¹⁷, University of California, Santa Cruz¹⁸, Simon Fraser University¹⁹, Morgridge Institute for Research²⁰, University of Texas at Dallas²¹, National Institutes of Health²²

19 Feb 2015-Nature

TL;DR: It is shown that disease- and trait-associated genetic variants are enriched in tissue-specific epigenomic marks, revealing biologically relevant cell types for diverse human traits, and providing a resource for interpreting the molecular basis of human disease.

Abstract: The reference human genome sequence set the stage for studies of genetic variation and its association with human disease, but epigenomic studies lack a similar reference. To address this need, the NIH Roadmap Epigenomics Consortium generated the largest collection so far of human epigenomes for primary cells and tissues. Here we describe the integrative analysis of 111 reference human epigenomes generated as part of the programme, profiled for histone modification patterns, DNA accessibility, DNA methylation and RNA expression. We establish global maps of regulatory elements, define regulatory modules of coordinated activity, and their likely activators and repressors. We show that disease- and trait-associated genetic variants are enriched in tissue-specific epigenomic marks, revealing biologically relevant cell types for diverse human traits, and providing a resource for interpreting the molecular basis of human disease. Our results demonstrate the central role of epigenomic information for understanding gene regulation, cellular differentiation and human disease.

5,037 citations

Journal Article•DOI•

Mutational heterogeneity in cancer and the search for new cancer-associated genes

Michael S. Lawrence¹, Petar Stojanov¹,², Paz Polak¹,²,³, Gregory V. Kryukov¹,²,³, Kristian Cibulskis¹, Andrey Sivachenko¹, Scott L. Carter¹, Chip Stewart¹, Craig H. Mermel¹,², Steven A. Roberts⁴, Adam Kiezun¹, Peter S. Hammerman¹,², Aaron McKenna¹,⁵, Yotam Drier, Lihua Zou¹, Alex H. Ramos¹, Trevor J. Pugh¹,², Nicolas Stransky¹, Elena Helman¹,⁶, Jaegil Kim¹, Carrie Sougnez¹, Lauren Ambrogio¹, Elizabeth Nickerson¹, Erica Shefler¹, Maria L. Cortes¹, Daniel Auclair¹, Gordon Saksena¹, Douglas Voet¹, Michael S. Noble¹, Daniel DiCara¹, Pei Lin¹, Lee Lichtenstein¹, David I. Heiman¹, Timothy Fennell¹, Marcin Imielinski¹,², Bryan Hernandez¹, Eran Hodis¹,², Sylvan C. Baca¹,², Austin M. Dulak¹,², Jens G. Lohr¹,², Dan A. Landau¹,²,⁷, Catherine J. Wu², Jorge Melendez-Zajgla, Alfredo Hidalgo-Miranda, Amnon Koren¹,², Steven A. McCarroll¹,², Jaume Mora⁸, Ryan S. Lee²,⁹, Brian D. Crompton²,⁹, Robert C. Onofrio¹, Melissa Parkin¹, Wendy Winckler¹, Kristin G. Ardlie¹, Stacey Gabriel¹, Charles W. M. Roberts²,⁹, Jaclyn A. Biegel¹⁰, Kimberly Stegmaier¹,²,⁹, Adam J. Bass¹,², Levi A. Garraway¹,², Matthew Meyerson¹,², Todd R. Golub, Dmitry A. Gordenin⁴, Shamil R. Sunyaev¹,²,³, Eric S. Lander¹,²,⁶, Gad Getz¹,²

Broad Institute¹, Harvard University², Brigham and Women's Hospital³, National Institutes of Health⁴, University of Washington⁵, Massachusetts Institute of Technology⁶, Yale Cancer Center⁷, Hospital Sant Joan de Déu Barcelona⁸, Boston Children's Hospital⁹, Children's Hospital of Philadelphia¹⁰

11 Jul 2013-Nature

TL;DR: A fundamental problem with cancer genome studies is described: as the sample size increases, the list of putatively significant genes produced by current analytical methods burgeons into the hundreds and the list includes many implausible genes, suggesting extensive false-positive findings that overshadow true driver events.

Abstract: Major international projects are underway that are aimed at creating a comprehensive catalogue of all the genes responsible for the initiation and progression of cancer. These studies involve the sequencing of matched tumour-normal samples followed by mathematical analysis to identify those genes in which mutations occur more frequently than expected by random chance. Here we describe a fundamental problem with cancer genome studies: as the sample size increases, the list of putatively significant genes produced by current analytical methods burgeons into the hundreds. The list includes many implausible genes (such as those encoding olfactory receptors and the muscle protein titin), suggesting extensive false-positive findings that overshadow true driver events. We show that this problem stems largely from mutational heterogeneity and provide a novel analytical methodology, MutSigCV, for resolving the problem. We apply MutSigCV to exome sequences from 3,083 tumour-normal pairs and discover extraordinary variation in mutation frequency and spectrum within cancer types, which sheds light on mutational processes and disease aetiology, and in mutation frequency across the genome, which is strongly correlated with DNA replication timing and also with transcriptional activity. By incorporating mutational heterogeneity into the analyses, MutSigCV is able to eliminate most of the apparent artefactual findings and enable the identification of genes truly associated with cancer.

4,411 citations

Journal Article•DOI•

Systematic localization of common disease-associated variation in regulatory DNA.

Matthew T. Maurano¹, Richard Humbert¹, Eric Rynes¹, Robert E. Thurman¹, Eric Haugen¹, Hao Wang¹, Alex Reynolds¹, Richard Sandstrom¹, Hongzhu Qu¹,², Jennifer A. Brody¹, Anthony Shafer¹, Fidencio Neri¹, Kristen Lee¹, Tanya Kutyavin¹, Sandra Stehling-Sun¹, Audra K. Johnson¹, Theresa K. Canfield¹, Erika Giste¹, Morgan Diegel¹, Daniel Bates¹, R. Scott Hansen¹, Shane Neph¹, Peter J. Sabo¹, Shelly Heimfeld³, Antony Raubitschek⁴, Steven F. Ziegler⁴, Chris Cotsapas⁵, Nona Sotoodehnia¹, Ian A. Glass¹, Shamil R. Sunyaev⁶, Rajinder Kaul¹, John A. Stamatoyannopoulos¹

University of Washington¹, Beijing Institute of Genomics², Fred Hutchinson Cancer Research Center³, Benaroya Research Institute⁴, Yale University⁵, Brigham and Women's Hospital⁶

07 Sep 2012-Science

TL;DR: The results suggest pervasive involvement of regulatory DNA variation in common human disease and provide pathogenic insights into diverse disorders.

Abstract: Genome-wide association studies have identified many noncoding variants associated with common diseases and traits. We show that these variants are concentrated in regulatory DNA marked by deoxyribonuclease I (DNase I) hypersensitive sites (DHSs). Eighty-eight percent of such DHSs are active during fetal development and are enriched in variants associated with gestational exposure–related phenotypes. We identified distant gene targets for hundreds of variant-containing DHSs that may explain phenotype associations. Disease-associated variants systematically perturb transcription factor recognition sequences, frequently alter allelic chromatin states, and form regulatory networks. We also demonstrated tissue-selective enrichment of more weakly disease-associated variants within DHSs and the de novo identification of pathogenic cell types for Crohn’s disease, multiple sclerosis, and an electrocardiogram trait, without prior knowledge of physiological mechanisms. Our results suggest pervasive involvement of regulatory DNA variation in common human disease and provide pathogenic insights into diverse disorders.

3,177 citations

Cited by

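Variation in the switching rates of Plasmodium var genes leads to antigenic variation [in English] / Paul H, Robert P, Christodoulou Z, et al. // Proc Natl Acad Sci U S A

宁北芳, 朱淮民

28 Jul 2005

TL;DR: PfEMP1 interacts with one or more receptors on infected erythrocytes, dendritic cells, and the placenta, and plays a key role in adhesion and immune evasion.

Abstract: Antigenic variation allows many pathogenic microorganisms to evade the host immune response. Plasmodium falciparum erythrocyte membrane protein 1 (PfEMP1), expressed on the surface of infected erythrocytes, interacts with one or more receptors on infected erythrocytes, endothelial cells, dendritic cells, and the placenta, and plays a key role in adhesion and immune evasion. The var gene family in each haploid genome encodes about 60 members, and initiating transcription of different var gene variants provides the molecular basis for antigenic variation.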

18,940 citations

Journal Article•DOI•

Standards and guidelines for the interpretation of sequence variants: a joint consensus recommendation of the American College of Medical Genetics and Genomics and the Association for Molecular Pathology.

Sue Richards¹, Nazneen Aziz²,³, Sherri J. Bale⁴, David P. Bick⁵, Soma Das⁶, Julie M. Gastier-Foster, Wayne W. Grody⁷, Madhuri Hegde⁸, Elaine Lyon⁹, Elaine B. Spector¹⁰, Karl V. Voelkerding⁹, Heidi L. Rehm¹¹

Oregon Health & Science University¹, College of American Pathologists², Boston Children's Hospital³, GeneDx⁴, Medical College of Wisconsin⁵, University of Chicago⁶, University of California, Los Angeles⁷, Emory University⁸, University of Utah⁹, University of Colorado Denver¹⁰, Harvard University¹¹

05 Mar 2015-Genetics in Medicine

TL;DR: Because of the increased complexity of analysis and interpretation of clinical genetic testing described in this report, the ACMG strongly recommends that clinical molecular genetic testing be performed in a Clinical Laboratory Improvement Amendments–approved laboratory, with results interpreted by a board-certified clinical molecular geneticist or molecular genetic pathologist or the equivalent.

17,834 citations

Journal Article•DOI•

An integrated encyclopedia of DNA elements in the human genome

Principal investigators¹, NHGRI groups², Data production leads³, Lead analysts³

Wellcome Trust¹, University of Washington², Pennsylvania State University³

06 Sep 2012-Nature

TL;DR: The Encyclopedia of DNA Elements project provides new insights into the organization and regulation of our genes and genome, and is an expansive resource of functional annotations for biomedical research.

Abstract: The human genome encodes the blueprint of life, but the function of the vast majority of its nearly three billion bases is unknown. The Encyclopedia of DNA Elements (ENCODE) project has systematically mapped regions of transcription, transcription factor association, chromatin structure and histone modification. These data enabled us to assign biochemical functions for 80% of the genome, in particular outside of the well-studied protein-coding regions. Many discovered candidate regulatory elements are physically associated with one another and with expressed genes, providing new insights into the mechanisms of gene regulation. The newly identified elements also show a statistical correspondence to sequence variants linked to human disease, and can thereby guide interpretation of this variation. Overall, the project provides new insights into the organization and regulation of our genes and genome, and is an expansive resource of functional annotations for biomedical research.

13,548 citations

Journal Article•DOI•

A global reference for human genetic variation.

Adam Auton¹, Gonçalo R. Abecasis², David Altshuler³, Richard Durbin⁴ +514 more•Institutions (90)

01 Oct 2015-Nature

TL;DR: The 1000 Genomes Project set out to provide a comprehensive description of common human genetic variation by applying whole-genome sequencing to a diverse set of individuals from multiple populations, and has reconstructed the genomes of 2,504 individuals from 26 populations using a combination of low-coverage whole-genome sequencing, deep exome sequencing, and dense microarray genotyping.

Abstract: The 1000 Genomes Project set out to provide a comprehensive description of common human genetic variation by applying whole-genome sequencing to a diverse set of individuals from multiple populations. Here we report completion of the project, having reconstructed the genomes of 2,504 individuals from 26 populations using a combination of low-coverage whole-genome sequencing, deep exome sequencing, and dense microarray genotyping. We characterized a broad spectrum of genetic variation, in total over 88 million variants (84.7 million single nucleotide polymorphisms (SNPs), 3.6 million short insertions/deletions (indels), and 60,000 structural variants), all phased onto high-quality haplotypes. This resource includes >99% of SNP variants with a frequency of >1% for a variety of ancestries. We describe the distribution of genetic variation across the global sample, and discuss the implications for common disease studies.

12,661 citations