Institution
Wellcome Trust Sanger Institute
Nonprofit•Cambridge, United Kingdom•
About: Wellcome Trust Sanger Institute is a nonprofit organization based out in Cambridge, United Kingdom. It is known for research contribution in the topics: Population & Genome. The organization has 4009 authors who have published 9671 publications receiving 1224479 citations.
Topics: Population, Genome, Gene, Genome-wide association study, Genomics
Papers published on a yearly basis
Papers
More filters
••
TL;DR: An amplification-free method of library preparation is presented, in which the cluster amplification step, rather than the PCR, enriches for fully ligated template strands, reducing the incidence of duplicate sequences, improving read mapping and single nucleotide polymorphism calling and aiding de novo assembly.
Abstract: Amplification artifacts introduced during library preparation for the Illumina Genome Analyzer increase the likelihood that an appreciable proportion of these sequences will be duplicates and cause an uneven distribution of read coverage across the targeted sequencing regions. As a consequence, these unfavorable features result in difficulties in genome assembly and variation analysis from the short reads, particularly when the sequences are from genomes with base compositions at the extremes of high or low G+C content. Here we present an amplification-free method of library preparation, in which the cluster amplification step, rather than the PCR, enriches for fully ligated template strands, reducing the incidence of duplicate sequences, improving read mapping and single nucleotide polymorphism calling and aiding de novo assembly. We illustrate this by generating and analyzing DNA sequences from extremely (G+C)-poor (Plasmodium falciparum), (G+C)-neutral (Escherichia coli) and (G+C)-rich (Bordetella pertussis) genomes.
640 citations
••
Charité1, Oregon Health & Science University2, French Institute of Health and Medical Research3, University of Western Australia4, King Edward Memorial Hospital5, Sanford Health6, Children's Hospital of Eastern Ontario7, University of Toronto8, University of Cambridge9, University College London10, Cold Spring Harbor Laboratory11, Queen Mary University of London12, Radboud University Nijmegen13, Wellcome Trust Sanger Institute14, Katholieke Universiteit Leuven15, Johns Hopkins University School of Medicine16, University of Pennsylvania17, University of Kiel18, University of Luxembourg19, Medical College of Wisconsin20, Newcastle University21, Tohoku University22, Manchester Royal Eye Hospital23, John Radcliffe Hospital24, University of Sydney25, University of Miami26, Garvan Institute of Medical Research27, Lawrence Berkeley National Laboratory28, University of Connecticut29
TL;DR: The progress of the HPO project is reviewed, including specific areas of expansion such as common (complex) disease, new algorithms for phenotype driven genomic discovery and diagnostics, integration of cross-species mapping efforts with the Mammalian Phenotype Ontology, an improved quality control pipeline, and the addition of patient-friendly terminology.
Abstract: Deep phenotyping has been defined as the precise and comprehensive analysis of phenotypic abnormalities in which the individual components of the phenotype are observed and described. The three components of the Human Phenotype Ontology (HPO; www.human-phenotype-ontology.org) project are the phenotype vocabulary, disease-phenotype annotations and the algorithms that operate on these. These components are being used for computational deep phenotyping and precision medicine as well as integration of clinical data into translational research. The HPO is being increasingly adopted as a standard for phenotypic abnormalities by diverse groups such as international rare disease organizations, registries, clinical labs, biomedical resources, and clinical software tools and will thereby contribute toward nascent efforts at global data exchange for identifying disease etiologies. This update article reviews the progress of the HPO project since the debut Nucleic Acids Research database article in 2014, including specific areas of expansion such as common (complex) disease, new algorithms for phenotype driven genomic discovery and diagnostics, integration of cross-species mapping efforts with the Mammalian Phenotype Ontology, an improved quality control pipeline, and the addition of patient-friendly terminology.
638 citations
••
Massachusetts Institute of Technology1, University of Copenhagen2, University of California, Santa Cruz3, University of Tartu4, Wellcome Trust Sanger Institute5, Lawrence Berkeley National Laboratory6, Harvard University7, University of New Mexico8, Watson School of Biological Sciences9, University of California, San Francisco10, European Bioinformatics Institute11, Rutgers University12, Indiana University13, Connecticut College14, Katholieke Universiteit Leuven15, Université libre de Bruxelles16, Wesleyan University17, University of Wisconsin-Madison18, University of California, Berkeley19, Memorial Sloan Kettering Cancer Center20, Cornell University21
TL;DR: This work uses the genomes of 12 Drosophila species for the de novo discovery of functional elements in the fly, and identifies several classes of pre- and post-transcriptional regulatory motifs, and predicts individual motif instances with high confidence.
Abstract: Sequencing of multiple related species followed by comparative genomics analysis constitutes a powerful approach for the systematic understanding of any genome. Here, we use the genomes of 12 Drosophila species for the de novo discovery of functional elements in the fly. Each type of functional element shows characteristic patterns of change, or 'evolutionary signatures', dictated by its precise selective constraints. Such signatures enable recognition of new protein-coding genes and exons, spurious and incorrect gene annotations, and numerous unusual gene structures, including abundant stop-codon readthrough. Similarly, we predict non-protein-coding RNA genes and structures, and new microRNA (miRNA) genes. We provide evidence of miRNA processing and functionality from both hairpin arms and both DNA strands. We identify several classes of pre- and post-transcriptional regulatory motifs, and predict individual motif instances with high confidence. We also study how discovery power scales with the divergence and number of species compared, and we provide general guidelines for comparative studies.
636 citations
••
Marine Biological Laboratory1, Stanford University2, Yale University3, University of Wisconsin-Madison4, University of Birmingham5, National Institutes of Natural Sciences, Japan6, SRI International7, Nara Institute of Science and Technology8, Keio University9, University of Miami10, University of York11, Wellcome Trust Sanger Institute12, University of Alberta13, Purdue University14
TL;DR: A snapshot analysis based on the most recent genome sequences of two E.coli K-12 strains allows comparison of their genotypes and mutant status of alleles.
Abstract: The goal of this group project has been to coordinate and bring up-to-date information on all genes of Escherichia coli K-12. Annotation of the genome of an organism entails identification of genes, the boundaries of genes in terms of precise start and end sites, and description of the gene products. Known and predicted functions were assigned to each gene product on the basis of experimental evidence or sequence analysis. Since both kinds of evidence are constantly expanding, no annotation is complete at any moment in time. This is a snapshot analysis based on the most recent genome sequences of two E.coli K-12 bacteria. An accurate and up-to-date description of E.coli K-12 genes is of particular importance to the scientific community because experimentally determined properties of its gene products provide fundamental information for annotation of innumerable genes of other organisms. Availability of the complete genome sequence of two K-12 strains allows comparison of their genotypes and mutant status of alleles.
636 citations
••
TL;DR: The prevalence of pathogenic LEPR mutations in a cohort of subjects with severe, early-onset obesity was 3%.
Abstract: Of the 300 subjects, 8 (3%) had nonsense or missense LEPR mutations — 7 were homozygotes, and 1 was a compound heterozygote. All missense mutations resulted in impaired receptor signaling. Affected subjects were characterized by hyperphagia, severe obesity, alterations in immune function, and delayed puberty due to hypogonadotropic hypogonadism. Serum leptin levels were within the range predicted by the elevated fat mass in these subjects. Their clinical features were less severe than those of subjects with congenital leptin deficiency. CONCLUSIONS The prevalence of pathogenic LEPR mutations in a cohort of subjects with severe, earlyonset obesity was 3%. Circulating levels of leptin were not disproportionately elevated, suggesting that serum leptin cannot be used as a marker for leptin-receptor deficiency. Congenital leptin-receptor deficiency should be considered in the differential diagnosis in any child with hyperphagia and severe obesity in the absence of developmental delay or dysmorphism.
635 citations
Authors
Showing all 4058 results
Name | H-index | Papers | Citations |
---|---|---|---|
Nicholas J. Wareham | 212 | 1657 | 204896 |
Gonçalo R. Abecasis | 179 | 595 | 230323 |
Panos Deloukas | 162 | 410 | 154018 |
Michael R. Stratton | 161 | 443 | 142586 |
David W. Johnson | 160 | 2714 | 140778 |
Michael John Owen | 160 | 1110 | 135795 |
Naveed Sattar | 155 | 1326 | 116368 |
Robert E. W. Hancock | 152 | 775 | 88481 |
Julian Parkhill | 149 | 759 | 104736 |
Nilesh J. Samani | 149 | 779 | 113545 |
Michael Conlon O'Donovan | 142 | 736 | 118857 |
Jian Yang | 142 | 1818 | 111166 |
Christof Koch | 141 | 712 | 105221 |
Andrew G. Clark | 140 | 823 | 123333 |
Stylianos E. Antonarakis | 138 | 746 | 93605 |