Author
Jeremiah M. Scharf
Other affiliations: Cardiff University, Semel Institute for Neuroscience and Human Behavior, VA Boston Healthcare System ...read more
Bio: Jeremiah M. Scharf is an academic researcher from Harvard University. The author has contributed to research in topics: Tourette syndrome & Genome-wide association study. The author has an hindex of 41, co-authored 115 publications receiving 15450 citations. Previous affiliations of Jeremiah M. Scharf include Cardiff University & Semel Institute for Neuroscience and Human Behavior.
Papers published on a yearly basis
Papers
More filters
••
Broad Institute1, Harvard University2, Boston Children's Hospital3, University of Washington4, University of Arizona5, Cardiff University6, Google7, Icahn School of Medicine at Mount Sinai8, Samsung Medical Center9, Vertex Pharmaceuticals10, University of Michigan11, University of Cambridge12, State University of New York Upstate Medical University13, Karolinska Institutet14, University of Eastern Finland15, University of Oxford16, Wellcome Trust Centre for Human Genetics17, Cedars-Sinai Medical Center18, University of Ottawa19, University of Pennsylvania20, University of North Carolina at Chapel Hill21, University of Helsinki22, University of California, San Diego23, University of Mississippi Medical Center24
TL;DR: The aggregation and analysis of high-quality exome (protein-coding region) DNA sequence data for 60,706 individuals of diverse ancestries generated as part of the Exome Aggregation Consortium (ExAC) provides direct evidence for the presence of widespread mutational recurrence.
Abstract: Large-scale reference data sets of human genetic variation are critical for the medical and functional interpretation of DNA sequence changes. Here we describe the aggregation and analysis of high-quality exome (protein-coding region) DNA sequence data for 60,706 individuals of diverse ancestries generated as part of the Exome Aggregation Consortium (ExAC). This catalogue of human genetic diversity contains an average of one variant every eight bases of the exome, and provides direct evidence for the presence of widespread mutational recurrence. We have used this catalogue to calculate objective metrics of pathogenicity for sequence variants, and to identify genes subject to strong selection against various classes of mutation; identifying 3,230 genes with near-complete depletion of predicted protein-truncating variants, with 72% of these genes having no currently established human disease phenotype. Finally, we demonstrate that these data can be used for the efficient filtering of candidate disease-causing variants, and for the discovery of human 'knockout' variants in protein-coding genes.
8,758 citations
••
Harvard University1, Broad Institute2, Cardiff University3, Icahn School of Medicine at Mount Sinai4, University of Michigan5, University of Cambridge6, Karolinska Institutet7, University of Eastern Finland8, University of Oxford9, Cedars-Sinai Medical Center10, University of Ottawa11, University of Helsinki12, University of Pennsylvania13, University of North Carolina at Chapel Hill14, University of Mississippi Medical Center15
TL;DR: The aggregation and analysis of high-quality exome (protein-coding region) sequence data for 60,706 individuals of diverse ethnicities generated as part of the Exome Aggregation Consortium (ExAC) provides direct evidence for the presence of widespread mutational recurrence.
Abstract: Large-scale reference data sets of human genetic variation are critical for the medical and functional interpretation of DNA sequence changes. Here we describe the aggregation and analysis of high-quality exome (protein-coding region) sequence data for 60,706 individuals of diverse ethnicities. The resulting catalogue of human genetic diversity has unprecedented resolution, with an average of one variant every eight bases of coding sequence and the presence of widespread mutational recurrence. The deep catalogue of variation provided by the Exome Aggregation Consortium (ExAC) can be used to calculate objective metrics of pathogenicity for sequence variants, and to identify genes subject to strong selection against various classes of mutation; we identify 3,230 genes with near-complete depletion of truncating variants, 79% of which have no currently established human disease phenotype. Finally, we show that these data can be used for the efficient filtering of candidate disease-causing variants, and for the discovery of human knockout variants in protein-coding genes.
1,552 citations
••
Verneri Anttila1, Verneri Anttila2, Brendan Bulik-Sullivan1, Brendan Bulik-Sullivan2 +717 more•Institutions (270)
TL;DR: It is demonstrated that, in the general population, the personality trait neuroticism is significantly correlated with almost every psychiatric disorder and migraine, and it is shown that both psychiatric and neurological disorders have robust correlations with cognitive and personality measures.
Abstract: Disorders of the brain can exhibit considerable epidemiological comorbidity and often share symptoms, provoking debate about their etiologic overlap. We quantified the genetic sharing of 25 brain disorders from genome-wide association studies of 265,218 patients and 784,643 control participants and assessed their relationship to 17 phenotypes from 1,191,588 individuals. Psychiatric disorders share common variant risk, whereas neurological disorders appear more distinct from one another and from the psychiatric disorders. We also identified significant sharing between disorders and a number of brain phenotypes, including cognitive measures. Further, we conducted simulations to explore how statistical power, diagnostic misclassification, and phenotypic heterogeneity affect genetic correlations. These results highlight the importance of common genetic variation as a risk factor for brain disorders and the value of heritability-based methods in understanding their etiology.
1,357 citations
••
TL;DR: Genetic influences on psychiatric disorders transcend diagnostic boundaries, suggesting substantial pleiotropy of contributing loci within genes that show heightened expression in the brain throughout the lifespan, beginning prenatally in the second trimester, and play prominent roles in neurodevelopmental processes.
781 citations
••
University of California, San Francisco1, Harvard University2, Université de Montréal3, Johns Hopkins University School of Medicine4, Yale University5, University of Toronto6, University of Utah7, Cold Spring Harbor Laboratory8, Utrecht University9, University of Cape Town10, Massachusetts Institute of Technology11
TL;DR: It is confirmed that psychiatric comorbidities are common among individuals with TS, demonstrates that most comor bidities begin early in life, and indicates that certain comorbiities may be mediated by the presence of comorbrid OCD or ADHD.
Abstract: Importance: Tourette syndrome (TS) is characterized by high rates of psychiatric comorbidity; however, fewstudies have fully characterized these comorbidities. Furthermore, most studies have included relatively fewparticipants (< 200), and none has examined the ages of highest risk for each TS-associated comorbidity or their etiologic relationship to TS.Objective: To characterize the lifetime prevalence, clinical associations, ages of highest risk, and etiology of psychiatric comorbidity among individuals with TS.Design, Setting, And Participants: Cross-sectional structured diagnostic interviews conducted between April 1, 1992, and December 31, 2008, of participants with TS (n = 1374) and TS-unaffected family members (n = 1142).Main Outcomes And Measures: Lifetime prevalence of comorbid DSM-IV-TR disorders, their heritabilities, ages of maximal risk, and associations with symptom severity, age at onset, and parental psychiatric history.Results: The lifetime prevalence of any psychiatric comorbidity among individuals with TS was 85.7%; 57.7%of the population had 2 or more psychiatric disorders. The mean (SD) number of lifetime comorbid diagnoses was 2.1 (1.6); the mean number was 0.9 (1.3) when obsessive-compulsive disorder (OCD) and attention-deficit/hyperactivity disorder (ADHD) were excluded, and 72.1% of the individuals met the criteria for OCD or ADHD. Other disorders, including mood, anxiety, and disruptive behavior, each occurred in approximately 30% of the participants. The age of greatest risk for the onset of most comorbid psychiatric disorders was between 4 and 10 years, with the exception of eating and substance use disorders, which began in adolescence (interquartile range, 15-19 years for both). Tourette syndrome was associated with increased risk of anxiety (odds ratio [OR], 1.4; 95%CI, 1.0-1.9; P = .04) and decreased risk of substance use disorders (OR, 0.6; 95%CI, 0.3-0.9; P = .02) independent from comorbid OCD and ADHD; however, high rates of mood disorders among participants with TS (29.8%)may be accounted for by comorbid OCD (OR, 3.7; 95%CI, 2.9-4.8; P < .001). Parental history of ADHD was associated with a higher burden of non-OCD, non-ADHD comorbid psychiatric disorders (OR, 1.86; 95%CI, 1.32-2.61; P < .001). Genetic correlations between TS and mood (RhoG, 0.47), anxiety (RhoG, 0.35), and disruptive behavior disorders (RhoG, 0.48), may be accounted for by ADHD and, for mood disorders, by OCD.Conclusions And Relevance: This study is, to our knowledge, the most comprehensive of its kind. It confirms the belief that psychiatric comorbidities are common among individuals with TS, demonstrates that most comorbidities begin early in life, and indicates that certain comorbiditiesmay be mediated by the presence of comorbid OCD or ADHD. In addition, genetic analyses suggest that some comorbiditiesmay be more biologically related to OCD and/or ADHD rather than to TS.
471 citations
Cited by
More filters
••
TL;DR: A catalogue of predicted loss-of-function variants in 125,748 whole-exome and 15,708 whole-genome sequencing datasets from the Genome Aggregation Database (gnomAD) reveals the spectrum of mutational constraints that affect these human protein-coding genes.
Abstract: Genetic variants that inactivate protein-coding genes are a powerful source of information about the phenotypic consequences of gene disruption: genes that are crucial for the function of an organism will be depleted of such variants in natural populations, whereas non-essential genes will tolerate their accumulation. However, predicted loss-of-function variants are enriched for annotation errors, and tend to be found at extremely low frequencies, so their analysis requires careful variant annotation and very large sample sizes1. Here we describe the aggregation of 125,748 exomes and 15,708 genomes from human sequencing studies into the Genome Aggregation Database (gnomAD). We identify 443,769 high-confidence predicted loss-of-function variants in this cohort after filtering for artefacts caused by sequencing and annotation errors. Using an improved model of human mutation rates, we classify human protein-coding genes along a spectrum that represents tolerance to inactivation, validate this classification using data from model organisms and engineered human cells, and show that it can be used to improve the power of gene discovery for both common and rare diseases. A catalogue of predicted loss-of-function variants in 125,748 whole-exome and 15,708 whole-genome sequencing datasets from the Genome Aggregation Database (gnomAD) reveals the spectrum of mutational constraints that affect these human protein-coding genes.
4,913 citations
••
TL;DR: Deep phenotype and genome-wide genetic data from 500,000 individuals from the UK Biobank is described, describing population structure and relatedness in the cohort, and imputation to increase the number of testable variants to 96 million.
Abstract: The UK Biobank project is a prospective cohort study with deep genetic and phenotypic data collected on approximately 500,000 individuals from across the United Kingdom, aged between 40 and 69 at recruitment. The open resource is unique in its size and scope. A rich variety of phenotypic and health-related information is available on each participant, including biological measurements, lifestyle indicators, biomarkers in blood and urine, and imaging of the body and brain. Follow-up information is provided by linking health and medical records. Genome-wide genotype data have been collected on all participants, providing many opportunities for the discovery of new genetic associations and the genetic bases of complex traits. Here we describe the centralized analysis of the genetic data, including genotype quality, properties of population structure and relatedness of the genetic data, and efficient phasing and genotype imputation that increases the number of testable variants to around 96 million. Classical allelic variation at 11 human leukocyte antigen genes was imputed, resulting in the recovery of signals with known associations between human leukocyte antigen alleles and many diseases.
4,489 citations
••
TL;DR: It is found that local genetic variation affects gene expression levels for the majority of genes, and inter-chromosomal genetic effects for 93 genes and 112 loci are identified, enabling a mechanistic interpretation of gene regulation and the genetic basis of disease.
Abstract: Characterization of the molecular function of the human genome and its variation across individuals is essential for identifying the cellular mechanisms that underlie human genetic traits and diseases. The Genotype-Tissue Expression (GTEx) project aims to characterize variation in gene expression levels across individuals and diverse tissues of the human body, many of which are not easily accessible. Here we describe genetic effects on gene expression levels across 44 human tissues. We find that local genetic variation affects gene expression levels for the majority of genes, and we further identify inter-chromosomal genetic effects for 93 genes and 112 loci. On the basis of the identified genetic effects, we characterize patterns of tissue specificity, compare local and distal effects, and evaluate the functional properties of the genetic effects. We also demonstrate that multi-tissue, multi-individual data can be used to identify genes and pathways affected by human disease-associated variation, enabling a mechanistic interpretation of gene regulation and the genetic basis of disease.
3,289 citations
••
TL;DR: Examination of the oral and gut microbiome of melanoma patients undergoing anti-programmed cell death 1 protein (PD-1) immunotherapy suggested enhanced systemic and antitumor immunity in responding patients with a favorable gut microbiome as well as in germ-free mice receiving fecal transplants from responding patients.
Abstract: Preclinical mouse models suggest that the gut microbiome modulates tumor response to checkpoint blockade immunotherapy; however, this has not been well-characterized in human cancer patients. Here we examined the oral and gut microbiome of melanoma patients undergoing anti-programmed cell death 1 protein (PD-1) immunotherapy (n = 112). Significant differences were observed in the diversity and composition of the patient gut microbiome of responders versus nonresponders. Analysis of patient fecal microbiome samples (n = 43, 30 responders, 13 nonresponders) showed significantly higher alpha diversity (P < 0.01) and relative abundance of bacteria of the Ruminococcaceae family (P < 0.01) in responding patients. Metagenomic studies revealed functional differences in gut bacteria in responders, including enrichment of anabolic pathways. Immune profiling suggested enhanced systemic and antitumor immunity in responding patients with a favorable gut microbiome as well as in germ-free mice receiving fecal transplants from responding patients. Together, these data have important implications for the treatment of melanoma patients with immune checkpoint inhibitors.
2,791 citations
••
TL;DR: ClinVar continues to make improvements to its search and retrieval functions.
Abstract: ClinVar (https://www.ncbi.nlm.nih.gov/clinvar/) is a freely available, public archive of human genetic variants and interpretations of their significance to disease, maintained at the National Institutes of Health. Interpretations of the clinical significance of variants are submitted by clinical testing laboratories, research laboratories, expert panels and other groups. ClinVar aggregates data by variant-disease pairs, and by variant (or set of variants). Data aggregated by variant are accessible on the website, in an improved set of variant call format files and as a new comprehensive XML report. ClinVar recently started accepting submissions that are focused primarily on providing phenotypic information for individuals who have had genetic testing. Submissions may come from clinical providers providing their own interpretation of the variant ('provider interpretation') or from groups such as patient registries that primarily provide phenotypic information from patients ('phenotyping only'). ClinVar continues to make improvements to its search and retrieval functions. Several new fields are now indexed for more precise searching, and filters allow the user to narrow down a large set of search results.
2,345 citations