Author
Richard Smith
Other affiliations: Laboratory of Molecular Biology
Bio: Richard Smith is an academic researcher from Huntsman Cancer Institute. The author has contributed to research in topics: Environmental science & Genomics. The author has an h-index of 7 and has co-authored 7 publications receiving 11,795 citations. Previous affiliations of Richard Smith include Laboratory of Molecular Biology.
Papers
••
TL;DR: The 1000 Genomes Project set out to provide a comprehensive description of common human genetic variation by applying whole-genome sequencing to a diverse set of individuals from multiple populations, and has reconstructed the genomes of 2,504 individuals from 26 populations using a combination of low-coverage whole-genome sequencing, deep exome sequencing, and dense microarray genotyping.
Abstract: The 1000 Genomes Project set out to provide a comprehensive description of common human genetic variation by applying whole-genome sequencing to a diverse set of individuals from multiple populations. Here we report completion of the project, having reconstructed the genomes of 2,504 individuals from 26 populations using a combination of low-coverage whole-genome sequencing, deep exome sequencing, and dense microarray genotyping. We characterized a broad spectrum of genetic variation, in total over 88 million variants (84.7 million single nucleotide polymorphisms (SNPs), 3.6 million short insertions/deletions (indels), and 60,000 structural variants), all phased onto high-quality haplotypes. This resource includes >99% of SNP variants with a frequency of >1% for a variety of ancestries. We describe the distribution of genetic variation across the global sample, and discuss the implications for common disease studies.
12,661 citations
01 Oct 2015
TL;DR: The 1000 Genomes Project set out to provide a comprehensive description of common human genetic variation by applying whole-genome sequencing to a diverse set of individuals from multiple populations; this paper reports completion of the project, having reconstructed the genomes of 2,504 individuals from 26 populations using a combination of low-coverage whole-genome sequencing, deep exome sequencing, and dense microarray genotyping.
3,247 citations
01 Oct 2010
TL;DR: The pilot phase of the 1000 Genomes Project is presented, designed to develop and compare different strategies for genome-wide sequencing with high-throughput platforms, and the location, allele frequency and local haplotype structure of approximately 15 million single nucleotide polymorphisms, 1 million short insertions and deletions, and 20,000 structural variants are described.
599 citations
••
TL;DR: By combining retroviral-mediated RNA interference with reexpression studies, it is shown that ongoing EWS/FLI expression is required for the tumorigenic phenotype of Ewing's sarcoma, and a highly validated transcriptional profile for the EWS/FLI fusion protein is developed.
339 citations
••
TL;DR: It is shown that Argos does indeed repress the DER pathway in vivo and that, in vitro, Argos protein can inhibit the activation of DER by Spitz; this is the first in vivo example of an extracellular inhibitor of a receptor tyrosine kinase.
Abstract: The Drosophila homologue of the mammalian epidermal growth factor (EGF) receptor (DER) [1,2] is a receptor tyrosine kinase involved in many stages of fly development, including photoreceptor determination and wing-vein formation [3–9]. Its primary activating ligand is the Spitz protein [10,11], which is similar to mammalian TGF-α (ref. 12). Argos is a secreted protein that, like Spitz, contains a single EGF motif [13–15]. It is a repressor of cell determination in the eye, and acts in other tissues, including the wing [16–18]. Because Argos has the opposite effects to DER in the eye (the former blocks photoreceptor determination, the latter promotes it), we have tested whether it acts by blocking the DER pathway. We show that Argos does indeed repress this pathway in vivo and find that, in vitro, Argos protein can inhibit the activation of DER by Spitz. Thus the determination of cells by the DER pathway is regulated by a balance between extracellular activating and inhibiting signals. This is the first in vivo example of an extracellular inhibitor of a receptor tyrosine kinase.
265 citations
Cited by
••
Broad Institute; Harvard University; Boston Children's Hospital; University of Washington; University of Arizona; Cardiff University; Google; Icahn School of Medicine at Mount Sinai; Samsung Medical Center; Vertex Pharmaceuticals; University of Michigan; University of Cambridge; State University of New York Upstate Medical University; Karolinska Institutet; University of Eastern Finland; Wellcome Trust Centre for Human Genetics; University of Oxford; Cedars-Sinai Medical Center; University of Ottawa; University of Pennsylvania; University of North Carolina at Chapel Hill; University of Helsinki; University of California, San Diego; University of Mississippi Medical Center
TL;DR: The aggregation and analysis of high-quality exome (protein-coding region) DNA sequence data for 60,706 individuals of diverse ancestries generated as part of the Exome Aggregation Consortium (ExAC) provides direct evidence for the presence of widespread mutational recurrence.
Abstract: Large-scale reference data sets of human genetic variation are critical for the medical and functional interpretation of DNA sequence changes. Here we describe the aggregation and analysis of high-quality exome (protein-coding region) DNA sequence data for 60,706 individuals of diverse ancestries generated as part of the Exome Aggregation Consortium (ExAC). This catalogue of human genetic diversity contains an average of one variant every eight bases of the exome, and provides direct evidence for the presence of widespread mutational recurrence. We have used this catalogue to calculate objective metrics of pathogenicity for sequence variants, and to identify genes subject to strong selection against various classes of mutation; identifying 3,230 genes with near-complete depletion of predicted protein-truncating variants, with 72% of these genes having no currently established human disease phenotype. Finally, we demonstrate that these data can be used for the efficient filtering of candidate disease-causing variants, and for the discovery of human 'knockout' variants in protein-coding genes.
8,758 citations
••
TL;DR: When epidermal growth factor and its relatives bind the ErbB family of receptors, they trigger a rich network of signalling pathways, culminating in responses ranging from cell division to death, motility to adhesion.
Abstract: When epidermal growth factor and its relatives bind the ErbB family of receptors, they trigger a rich network of signalling pathways, culminating in responses ranging from cell division to death, motility to adhesion. The network is often dysregulated in cancer and lends credence to the mantra that molecular understanding yields clinical benefit: over 25,000 women with breast cancer have now been treated with trastuzumab (Herceptin), a recombinant antibody designed to block the receptor ErbB2. Likewise, small-molecule enzyme inhibitors and monoclonal antibodies to ErbB1 are in advanced phases of clinical testing. What can this pathway teach us about translating basic science into clinical use?
6,462 citations
••
TL;DR: Deep phenotype and genome-wide genetic data from approximately 500,000 individuals in the UK Biobank are described, including population structure and relatedness in the cohort, and imputation that increases the number of testable variants to around 96 million.
Abstract: The UK Biobank project is a prospective cohort study with deep genetic and phenotypic data collected on approximately 500,000 individuals from across the United Kingdom, aged between 40 and 69 at recruitment. The open resource is unique in its size and scope. A rich variety of phenotypic and health-related information is available on each participant, including biological measurements, lifestyle indicators, biomarkers in blood and urine, and imaging of the body and brain. Follow-up information is provided by linking health and medical records. Genome-wide genotype data have been collected on all participants, providing many opportunities for the discovery of new genetic associations and the genetic bases of complex traits. Here we describe the centralized analysis of the genetic data, including genotype quality, properties of population structure and relatedness of the genetic data, and efficient phasing and genotype imputation that increases the number of testable variants to around 96 million. Classical allelic variation at 11 human leukocyte antigen genes was imputed, resulting in the recovery of signals with known associations between human leukocyte antigen alleles and many diseases.
4,489 citations
••
TL;DR: The discovery of a negative regulator acting on Tie2 emphasizes the need for exquisite regulation of this angiogenic receptor system.
Abstract: Angiogenesis is thought to depend on a precise balance of positive and negative regulation. Angiopoietin-1 (Ang1) is an angiogenic factor that signals through the endothelial cell-specific Tie2 receptor tyrosine kinase. Like vascular endothelial growth factor, Ang1 is essential for normal vascular development in the mouse. An Ang1 relative, termed angiopoietin-2 (Ang2), was identified by homology screening and shown to be a naturally occurring antagonist for Ang1 and Tie2. Transgenic overexpression of Ang2 disrupts blood vessel formation in the mouse embryo. In adult mice and humans, Ang2 is expressed only at sites of vascular remodeling. Natural antagonists for vertebrate receptor tyrosine kinases are atypical; thus, the discovery of a negative regulator acting on Tie2 emphasizes the need for exquisite regulation of this angiogenic receptor system.
3,552 citations
••
TL;DR: It is found that local genetic variation affects gene expression levels for the majority of genes, and inter-chromosomal genetic effects for 93 genes and 112 loci are identified, enabling a mechanistic interpretation of gene regulation and the genetic basis of disease.
Abstract: Characterization of the molecular function of the human genome and its variation across individuals is essential for identifying the cellular mechanisms that underlie human genetic traits and diseases. The Genotype-Tissue Expression (GTEx) project aims to characterize variation in gene expression levels across individuals and diverse tissues of the human body, many of which are not easily accessible. Here we describe genetic effects on gene expression levels across 44 human tissues. We find that local genetic variation affects gene expression levels for the majority of genes, and we further identify inter-chromosomal genetic effects for 93 genes and 112 loci. On the basis of the identified genetic effects, we characterize patterns of tissue specificity, compare local and distal effects, and evaluate the functional properties of the genetic effects. We also demonstrate that multi-tissue, multi-individual data can be used to identify genes and pathways affected by human disease-associated variation, enabling a mechanistic interpretation of gene regulation and the genetic basis of disease.
3,289 citations