scispace - formally typeset
Search or ask a question
Institution

Wellcome Trust Sanger Institute

NonprofitCambridge, United Kingdom
About: Wellcome Trust Sanger Institute is a nonprofit organization based out in Cambridge, United Kingdom. It is known for research contribution in the topics: Population & Genome. The organization has 4009 authors who have published 9671 publications receiving 1224479 citations.


Papers
More filters
Journal ArticleDOI
TL;DR: Genetic exchange with related species sharing the same ecological niche is the main mechanism of evolution of S. pneumoniae, and the open pan-genome guarantees the species a quick and economical response to diverse environments.
Abstract: Background Streptococcus pneumoniae is one of the most important causes of microbial diseases in humans. The genomes of 44 diverse strains of S. pneumoniae were analyzed and compared with strains of non-pathogenic streptococci of the Mitis group.

351 citations

Journal ArticleDOI
Panos Deloukas1, M Earthrowl1, Darren Grafham1, Marc Rubenfield, Lisa French1, Charles A. Steward1, Sarah Sims1, Matthew Jones1, S. Searle1, Carol Scott1, Kerstin Howe1, Sarah E. Hunt1, T D Andrews1, James G. R. Gilbert1, David Swarbreck1, Jennifer L. Ashurst1, A Taylor1, J Battles, Christine P. Bird1, R Ainscough1, J P Almeida1, R I S Ashwell1, K D Ambrose1, A K Babbage1, C L Bagguley1, J Bailey1, Ruby Banerjee1, K Bates1, Helen Beasley1, S Bray-Allen1, A J Brown1, J Y Brown1, D C Burford1, W Burrill1, John Burton1, Patrick Cahill, D Camire, Nigel P. Carter1, J C Chapman1, S Y Clark1, G Clarke1, C M Clee1, S. M. Clegg1, N Corby1, Alan Coulson1, Pawandeep Dhami1, I Dutta1, Matthew Dunn1, L M Faulkner1, Adam Frankish1, J Frankland1, P Garner1, J Garnett1, Susan M. Gribble1, C Griffiths1, Russell J. Grocock1, Erik Gustafson, S Hammond1, Joanna Harley1, E. Hart1, Paul Heath1, T P Ho, B Hopkins1, J Horne, Philip Howden1, Elizabeth J. Huckle1, C Hynds, Chris Johnson1, David W. Johnson1, A Kana, M. Kay1, A M Kimberley1, J K Kershaw1, M Kokkinaki2, Gavin K. Laird1, S Lawlor1, H M Lee, Daniel Leongamornlert1, G Laird1, Christine Lloyd1, D. M. Lloyd1, Jane E. Loveland1, J Lovell1, Stuart McLaren1, Kirsten McLay1, Amanda McMurray1, M Mashreghi-Mohammadi1, Lucy Matthews1, Sarah Milne1, T Nickerson1, M Nguyen, E K Overton-Larty1, Sophie Palmer1, A. V. Pearce1, A I Peck1, Sarah Pelan1, Benjamin Phillimore1, K M Porter1, Catherine M. Rice1, A Rogosin, Mark T. Ross1, Theologia Sarafidou2, Harminder Sehra1, Ratna Shownkeen1, C. D. Skuce1, Michelle Smith1, L Standring, N Sycamore1, J Tester1, A Thorpe1, W Torcasso, Alan Tracey1, A Tromans1, J Tsolas, Melanie M. Wall1, J Walsh, H Wang, Keith Weinstock, Anthony P. West1, David Willey1, S. Whitehead1, Laurens G. Wilming1, Paul Wray1, L Young1, Yuan Chen3, Ruth C. Lovering4, Nicholas K. Moschonas2, Reiner Siebert5, Kim Fechtel, David Bentley1, Richard Durbin1, Tim Hubbard1, Lynn Doucette-Stamm, Stephan Beck1, Douglas Smith, Jane Rogers1 
27 May 2004-Nature
TL;DR: Comparative analysis of the sequence of chromosome 20 to whole-genome shotgun-sequence data of two other vertebrates provides an independent measure of the efficiency of gene annotation, and indicates that this analysis may account for more than 95% of all coding exons and almost all genes.
Abstract: Chromosome 5 is one of the largest human chromosomes and contains numerous intrachromosomal duplications, yet it has one of the lowest gene densities. This is partially explained by numerous gene-poor regions that display a remarkable degree of noncoding conservation with non-mammalian vertebrates, suggesting that they are functionally constrained. In total, we compiled 177.7 million base pairs of highly accurate finished sequence containing 923 manually curated protein-coding genes including the protocadherin and interleukin gene families. We also completely sequenced versions of the large chromosome-5-specific internal duplications. These duplications are very recent evolutionary events and probably have a mechanistic role in human physiological variation, as deletions in these regions are the cause of debilitating disorders including spinal muscular atrophy.

351 citations

Journal ArticleDOI
TL;DR: This is the largest collection of JIA cases investigated so far and provides new insight into the genetic basis of this childhood autoimmune disease, highlighting crucial pathways, including the interleukin (IL)-2 pathway, in JIA disease pathogenesis.
Abstract: Anne Hinks and colleagues identify 14 new susceptibility loci for juvenile idiopathic arthritis through targeted analyses of genomic regions implicated in immune function. Their study implicates several pathways, including IL-2 signaling, in the pathogenesis of this common childhood autoimmune disease.

349 citations

Journal ArticleDOI
TL;DR: The results presented here contribute to the value of ongoing large-scale annotation projects and should guide further experimental methods when being scaled up to the entire human genome sequence.
Abstract: Background: We present the results of EGASP, a community experiment to assess the state-ofthe-art in genome annotation within the ENCODE regions, which span 1% of the human genome sequence. The experiment had two major goals: the assessment of the accuracy of computational methods to predict protein coding genes; and the overall assessment of the completeness of the current human genome annotations as represented in the ENCODE regions. For the computational prediction assessment, eighteen groups contributed gene predictions. We evaluated these submissions against each other based on a ‘reference set’ of annotations generated as part of the GENCODE project. These annotations were not available to the prediction groups prior to the submission deadline, so that their predictions were blind and an external advisory committee could perform a fair assessment. Results: The best methods had at least one gene transcript correctly predicted for close to 70% of the annotated genes. Nevertheless, the multiple transcript accuracy, taking into account alternative splicing, reached only approximately 40% to 50% accuracy. At the coding nucleotide level, the best programs reached an accuracy of 90% in both sensitivity and specificity. Programs relying on mRNA and protein sequences were the most accurate in reproducing the manually curated annotations. Experimental validation shows that only a very small percentage (3.2%) of

348 citations

Journal ArticleDOI
TL;DR: The Open Targets Validation Platform is designed to support identification and prioritization of biological targets for follow-up and provides evidence about the association of known and potential drug targets with diseases.
Abstract: We have designed and developed a data integration and visualization platform that provides evidence about the association of known and potential drug targets with diseases. The platform is designed to support identification and prioritization of biological targets for follow-up. Each drug target is linked to a disease using integrated genome-wide data from a broad range of data sources. The platform provides either a target-centric workflow to identify diseases that may be associated with a specific target, or a disease-centric workflow to identify targets that may be associated with a specific disease. Users can easily transition between these target- and disease-centric workflows. The Open Targets Validation Platform is accessible at https://www.targetvalidation.org.

348 citations


Authors

Showing all 4058 results

NameH-indexPapersCitations
Nicholas J. Wareham2121657204896
Gonçalo R. Abecasis179595230323
Panos Deloukas162410154018
Michael R. Stratton161443142586
David W. Johnson1602714140778
Michael John Owen1601110135795
Naveed Sattar1551326116368
Robert E. W. Hancock15277588481
Julian Parkhill149759104736
Nilesh J. Samani149779113545
Michael Conlon O'Donovan142736118857
Jian Yang1421818111166
Christof Koch141712105221
Andrew G. Clark140823123333
Stylianos E. Antonarakis13874693605
Network Information
Related Institutions (5)
Broad Institute
11.6K papers, 1.5M citations

96% related

Howard Hughes Medical Institute
34.6K papers, 5.2M citations

95% related

Laboratory of Molecular Biology
24.2K papers, 2.1M citations

94% related

Salk Institute for Biological Studies
13.1K papers, 1.6M citations

93% related

National Institutes of Health
297.8K papers, 21.3M citations

93% related

Performance
Metrics
No. of papers from the Institution in previous years
YearPapers
202317
202270
2021836
2020810
2019854
2018764