Institution

Wellcome Trust Sanger Institute

Nonprofit • Cambridge, United Kingdom

About: The Wellcome Trust Sanger Institute is a nonprofit organization based in Cambridge, United Kingdom. It is known for research contributions on the topics Population and Genome. The organization has 4009 authors who have published 9671 publications receiving 1224479 citations.

Topics: Population, Genome, Gene, Genome-wide association study, Genomics

Papers

Journal Article

Patterns of Cis Regulatory Variation in Diverse Human Populations

Barbara E. Stranger¹, Stephen B. Montgomery¹,², Antigone S. Dimas, Leopold Parts¹, Oliver Stegle³, Catherine E. Ingle¹, Magda Sekowska¹, George Davey Smith³, David M. Evans³, Maria Gutierrez-Arcelus², Alkes L. Price⁴,⁵, Towfique Raj⁴,⁶, James Nisbett¹, Alexandra C. Nica¹,², Claude Beazley¹, Richard Durbin¹, Panos Deloukas¹, Emmanouil T. Dermitzakis¹,²

Institutions (6): Wellcome Trust Sanger Institute¹, University of Geneva², University of Bristol³, Massachusetts Institute of Technology⁴, Harvard University⁵, Brigham and Women's Hospital⁶

19 Apr 2012 • PLOS Genetics

TL;DR: This work analyzed genome-wide gene expression in lymphoblastoid cell lines from a total of 726 individuals from 8 global populations from the HapMap3 project and correlated gene expression levels with HapMap3 SNPs located in cis to the genes, offering a unique picture and resource of the degree of differentiation among human populations in functional regulatory variation.

Abstract: The genetic basis of gene expression variation has long been studied with the aim to understand the landscape of regulatory variants, but also more recently to assist in the interpretation and elucidation of disease signals. To date, many studies have looked in specific tissues and population-based samples, but there has been limited assessment of the degree of inter-population variability in regulatory variation. We analyzed genome-wide gene expression in lymphoblastoid cell lines from a total of 726 individuals from 8 global populations from the HapMap3 project and correlated gene expression levels with HapMap3 SNPs located in cis to the genes. We describe the influence of ancestry on gene expression levels within and between these diverse human populations and uncover a non-negligible impact on global patterns of gene expression. We further dissect the specific functional pathways differentiated between populations. We also identify 5,691 expression quantitative trait loci (eQTLs) after controlling for both non-genetic factors and population admixture and observe that half of the cis-eQTLs are replicated in one or more of the populations. We highlight patterns of eQTL-sharing between populations, which are partially determined by population genetic relatedness, and discover significant sharing of eQTL effects between Asians, European-admixed, and African subpopulations. Specifically, we observe that both the effect size and the direction of effect for eQTLs are highly conserved across populations. We observe an increasing proximity of eQTLs toward the transcription start site as sharing of eQTLs among populations increases, highlighting that variants close to TSS have stronger effects and therefore are more likely to be detected across a wider panel of populations. Together these results offer a unique picture and resource of the degree of differentiation among human populations in functional regulatory variation and provide an estimate for the transferability of complex trait variants across populations.
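Illustration: the abstract above describes correlating gene expression levels with SNPs located in cis. The sketch below shows what one such test could look like for a single SNP and gene pair. The abstract does not name the statistic, so the use of Spearman rank correlation is an assumption, as are the 0/1/2 genotype coding, the function names, and the toy numbers, none of which come from the paper.

```python
from math import sqrt


def ranks(values):
    """Return 1-based ranks, giving tied values the mean of their ranks."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    result = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        # Extend j over the run of values tied with position i.
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        mean_rank = (i + j) / 2 + 1
        for k in range(i, j + 1):
            result[order[k]] = mean_rank
        i = j + 1
    return result


def pearson(x, y):
    """Pearson correlation; assumes neither input is constant."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    var_x = sum((a - mean_x) ** 2 for a in x)
    var_y = sum((b - mean_y) ** 2 for b in y)
    return cov / sqrt(var_x * var_y)


def spearman(x, y):
    """Spearman correlation: Pearson correlation of the ranks."""
    return pearson(ranks(x), ranks(y))


# Hypothetical data: minor-allele counts (0/1/2) for six individuals
# and the expression level of one nearby gene in the same individuals.
genotypes = [0, 0, 1, 1, 2, 2]
expression = [1.0, 1.2, 1.9, 2.1, 3.0, 3.3]
rho = spearman(genotypes, expression)
print(f"Spearman rho = {rho:.3f}")
```

In a genome-wide setting a test like this would be repeated for every SNP within a chosen cis window of every gene, with significance assessed under multiple-testing control; those steps are omitted here.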

501 citations

Journal Article

Global dissemination of a multidrug resistant Escherichia coli clone

Nicola K. Petty¹, Nouri L. Ben Zakour¹, Mitchell Stanton-Cook¹, Elizabeth Skippington¹, Makrina Totsika¹, Brian M. Forde¹, Minh-Duy Phan¹, Danilo Gomes Moriel¹, Kate M. Peters¹, Mark R. Davies¹,², Benjamin A. Rogers³, Gordon Dougan², Jesús Rodríguez-Baño⁴, Álvaro Pascual⁴, Johann D. D. Pitout⁵, Mathew Upton⁶, David L. Paterson³, Timothy R. Walsh⁷, Mark A. Schembri¹, Scott A. Beatson¹

Institutions (7): University of Queensland¹, Wellcome Trust Sanger Institute², Royal Brisbane and Women's Hospital³, University of Seville⁴, University of Calgary⁵, University of Plymouth⁶, Cardiff University⁷

15 Apr 2014 • Proceedings of the National Academy of Sciences of the United States of America

TL;DR: This study confirms the global dispersal of a single E. coli ST131 clone and demonstrates the role of MGEs and recombination in the evolution of this important MDR pathogen.

Abstract: Escherichia coli sequence type 131 (ST131) is a globally disseminated, multidrug resistant (MDR) clone responsible for a high proportion of urinary tract and bloodstream infections. The rapid emergence and successful spread of E. coli ST131 is strongly associated with several factors, including resistance to fluoroquinolones, high virulence gene content, the possession of the type 1 fimbriae FimH30 allele, and the production of the CTX-M-15 extended spectrum β-lactamase (ESBL). Here, we used genome sequencing to examine the molecular epidemiology of a collection of E. coli ST131 strains isolated from six distinct geographical locations across the world spanning 2000-2011. The global phylogeny of E. coli ST131, determined from whole-genome sequence data, revealed a single lineage of E. coli ST131 distinct from other extraintestinal E. coli strains within the B2 phylogroup. Three closely related E. coli ST131 sublineages were identified, with little association to geographic origin. The majority of single-nucleotide variants associated with each of the sublineages were due to recombination in regions adjacent to mobile genetic elements (MGEs). The most prevalent sublineage of ST131 strains was characterized by fluoroquinolone resistance, and a distinct virulence factor and MGE profile. Four different variants of the CTX-M ESBL-resistance gene were identified in our ST131 strains, with acquisition of CTX-M-15 representing a defining feature of a discrete but geographically dispersed ST131 sublineage. This study confirms the global dispersal of a single E. coli ST131 clone and demonstrates the role of MGEs and recombination in the evolution of this important MDR pathogen.

499 citations

Journal Article

Single Cell RNA-Sequencing of Pluripotent States Unlocks Modular Transcriptional Variation

Aleksandra A. Kolodziejczyk¹,², Jong Kyoung Kim², Jason C.H. Tsang¹, Tomislav Ilicic¹,², Johan Henriksson², Kedar Nath Natarajan¹,², Alex Tuck²,³, Xuefei Gao¹, Marc Bühler³, Pentao Liu¹, John C. Marioni¹,²,⁴, Sarah A. Teichmann¹,²

Institutions (4): Wellcome Trust Sanger Institute¹, European Bioinformatics Institute², Friedrich Miescher Institute for Biomedical Research³, University of Cambridge⁴

01 Oct 2015 • Cell Stem Cell

TL;DR: Embryonic stem cell culture conditions are important for maintaining long-term self-renewal, and they influence cellular pluripotency state, with 2i being the most similar to blastocyst cells and including a subpopulation resembling the two-cell embryo state.

499 citations

Journal Article

Multiple evidence strands suggest that there may be as few as 19 000 human protein-coding genes

Iakes Ezkurdia, David Juan, Jose Manuel Rodriguez, Adam Frankish¹, Mark Diekhans², Jennifer Harrow¹, Jesús Vázquez³, Alfonso Valencia, Michael L. Tress

Institutions (3): Wellcome Trust Sanger Institute¹, University of California, Santa Cruz², Centro Nacional de Investigaciones Cardiovasculares³

16 Jun 2014 • Human Molecular Genetics

TL;DR: A set of 2001 potential non-coding genes is described based on features such as weak conservation, a lack of protein features, or ambiguous annotations from major databases, all of which correlated with low peptide detection across the seven experiments.

Abstract: Determining the full complement of protein-coding genes is a key goal of genome annotation. The most powerful approach for confirming protein-coding potential is the detection of cellular protein expression through peptide mass spectrometry (MS) experiments. Here, we mapped peptides detected in seven large-scale proteomics studies to almost 60% of the protein-coding genes in the GENCODE annotation of the human genome. We found a strong relationship between detection in proteomics experiments and both gene family age and cross-species conservation. Most of the genes for which we detected peptides were highly conserved. We found peptides for >96% of genes that evolved before bilateria. At the opposite end of the scale, we identified almost no peptides for genes that have appeared since primates, for genes that did not have any protein-like features or for genes with poor cross-species conservation. These results motivated us to describe a set of 2001 potential non-coding genes based on features such as weak conservation, a lack of protein features, or ambiguous annotations from major databases, all of which correlated with low peptide detection across the seven experiments. We identified peptides for just 3% of these genes. We show that many of these genes behave more like non-coding genes than protein-coding genes and suggest that most are unlikely to code for proteins under normal circumstances. We believe that their inclusion in the human protein-coding gene catalogue should be revised as part of the ongoing human genome annotation effort.

499 citations

Journal Article

EmptyDrops: distinguishing cells from empty droplets in droplet-based single-cell RNA sequencing data

Aaron T. L. Lun¹, Samantha J. Riesenfeld², Tallulah S. Andrews³, Tomás Gomes³, John C. Marioni¹,³,⁴

Institutions (4): University of Cambridge¹, Broad Institute², Wellcome Trust Sanger Institute³, European Bioinformatics Institute⁴

22 Mar 2019 • Genome Biology

TL;DR: This work describes a new statistical method, EmptyDrops, which detects significant deviations from the expression profile of the ambient solution and retains distinct cell types that would have been discarded by existing methods in several real data sets.

Abstract: Droplet-based single-cell RNA sequencing protocols have dramatically increased the throughput of single-cell transcriptomics studies. A key computational challenge when processing these data is to distinguish libraries for real cells from empty droplets. Here, we describe a new statistical method for calling cells from droplet-based data, based on detecting significant deviations from the expression profile of the ambient solution. Using simulations, we demonstrate that EmptyDrops has greater power than existing approaches while controlling the false discovery rate among detected cells. Our method also retains distinct cell types that would have been discarded by existing methods in several real data sets.
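Illustration: the abstract above describes calling cells by testing each barcode for a significant deviation from the ambient expression profile while controlling the false discovery rate. The sketch below is a simplified stand-in for that idea, not the published procedure. It assumes a plain multinomial model, an ambient profile pooled from a few low-count barcodes with a pseudocount, Monte Carlo p-values, and Benjamini-Hochberg adjustment. All function names, parameters, and toy counts are hypothetical.

```python
import math
import random


def ambient_profile(low_count_barcodes, pseudocount=1.0):
    """Pool counts from presumed-empty barcodes into gene proportions."""
    pooled = [pseudocount] * len(low_count_barcodes[0])
    for counts in low_count_barcodes:
        for gene, count in enumerate(counts):
            pooled[gene] += count
    total = sum(pooled)
    return [value / total for value in pooled]


def multinomial_loglik(counts, probs):
    """Log-probability of a count vector under a multinomial model."""
    loglik = math.lgamma(sum(counts) + 1)
    for count, prob in zip(counts, probs):
        loglik += count * math.log(prob) - math.lgamma(count + 1)
    return loglik


def monte_carlo_pvalue(counts, probs, n_sim=1000, rng=None):
    """Share of simulated ambient droplets at least as unlikely as observed."""
    rng = rng or random.Random(0)
    total = sum(counts)
    observed = multinomial_loglik(counts, probs)
    genes = range(len(probs))
    as_extreme = 0
    for _ in range(n_sim):
        simulated = [0] * len(probs)
        for gene in rng.choices(genes, weights=probs, k=total):
            simulated[gene] += 1
        if multinomial_loglik(simulated, probs) <= observed:
            as_extreme += 1
    # Add one to numerator and denominator so the p-value is never zero.
    return (as_extreme + 1) / (n_sim + 1)


def benjamini_hochberg(pvalues):
    """Benjamini-Hochberg adjusted p-values, in the input order."""
    n = len(pvalues)
    order = sorted(range(n), key=lambda i: pvalues[i])
    adjusted = [0.0] * n
    running_min = 1.0
    for rank in range(n, 0, -1):
        index = order[rank - 1]
        running_min = min(running_min, pvalues[index] * n / rank)
        adjusted[index] = running_min
    return adjusted


# Hypothetical counts over four genes.
ambient = ambient_profile([[5, 3, 2, 0], [4, 4, 1, 1], [6, 2, 2, 0]])
rng = random.Random(42)
empty_like = [10, 6, 3, 1]  # resembles the ambient pool
cell_like = [1, 0, 2, 17]   # dominated by a gene that is rare in the pool
pvals = [monte_carlo_pvalue(c, ambient, rng=rng) for c in (empty_like, cell_like)]
fdr = benjamini_hochberg(pvals)
print(pvals, fdr)
```

With these toy inputs the barcode that resembles the ambient pool gets a large p-value, while the one dominated by a gene that is rare in the pool gets a small one and would be called a cell at a conventional FDR threshold.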

499 citations

Authors

Showing all 4058 results

Name	H-index	Papers	Citations
Nicholas J. Wareham	212	1657	204896
Gonçalo R. Abecasis	179	595	230323
Panos Deloukas	162	410	154018
Michael R. Stratton	161	443	142586
David W. Johnson	160	2714	140778
Michael John Owen	160	1110	135795
Naveed Sattar	155	1326	116368
Robert E. W. Hancock	152	775	88481
Julian Parkhill	149	759	104736
Nilesh J. Samani	149	779	113545
Michael Conlon O'Donovan	142	736	118857
Jian Yang	142	1818	111166
Christof Koch	141	712	105221
Andrew G. Clark	140	823	123333
Stylianos E. Antonarakis	138	746	93605