Institution
Wellcome Trust Sanger Institute
Nonprofit•Cambridge, United Kingdom•
About: Wellcome Trust Sanger Institute is a nonprofit organization based out in Cambridge, United Kingdom. It is known for research contribution in the topics: Population & Genome. The organization has 4009 authors who have published 9671 publications receiving 1224479 citations.
Topics: Population, Genome, Gene, Genome-wide association study, Genomics
Papers published on a yearly basis
Papers
More filters
••
TL;DR: The first genome-wide association study (GWAS) whose sample size (1,053 Swedish subjects) is sufficiently powered to detect genome- wide significance (p<1.5×10−7) for polymorphisms that modestly alter therapeutic warfarin dose is reported.
Abstract: We report the first genome-wide association study (GWAS) whose sample size (1,053 Swedish subjects) is sufficiently powered to detect genome-wide significance (p<1.5 x 10(-7)) for polymorphisms that modestly alter therapeutic warfarin dose. The anticoagulant drug warfarin is widely prescribed for reducing the risk of stroke, thrombosis, pulmonary embolism, and coronary malfunction. However, Caucasians vary widely (20-fold) in the dose needed for therapeutic anticoagulation, and hence prescribed doses may be too low (risking serious illness) or too high (risking severe bleeding). Prior work established that approximately 30% of the dose variance is explained by single nucleotide polymorphisms (SNPs) in the warfarin drug target VKORC1 and another approximately 12% by two non-synonymous SNPs (*2, *3) in the cytochrome P450 warfarin-metabolizing gene CYP2C9. We initially tested each of 325,997 GWAS SNPs for association with warfarin dose by univariate regression and found the strongest statistical signals (p<10(-78)) at SNPs clustering near VKORC1 and the second lowest p-values (p<10(-31)) emanating from CYP2C9. No other SNPs approached genome-wide significance. To enhance detection of weaker effects, we conducted multiple regression adjusting for known influences on warfarin dose (VKORC1, CYP2C9, age, gender) and identified a single SNP (rs2108622) with genome-wide significance (p = 8.3 x 10(-10)) that alters protein coding of the CYP4F2 gene. We confirmed this result in 588 additional Swedish patients (p<0.0029) and, during our investigation, a second group provided independent confirmation from a scan of warfarin-metabolizing genes. We also thoroughly investigated copy number variations, haplotypes, and imputed SNPs, but found no additional highly significant warfarin associations. We present power analysis of our GWAS that is generalizable to other studies, and conclude we had 80% power to detect genome-wide significance for common causative variants or markers explaining at least 1.5% of dose variance. These GWAS results provide further impetus for conducting large-scale trials assessing patient benefit from genotype-based forecasting of warfarin dose.
607 citations
••
TL;DR: A unique hyperactive piggyBac transposase is generated with 17-fold and ninefold increases in excision and integration, respectively, and its applicability is shown by demonstrating an increased efficiency of generation of transgene-free mouse induced pluripotent stem cells.
Abstract: DNA transposons have been widely used for transgenesis and insertional mutagenesis in various organisms. Among the transposons active in mammalian cells, the moth-derived transposon piggyBac is most promising with its highly efficient transposition, large cargo capacity, and precise repair of the donor site. Here we report the generation of a hyperactive piggyBac transposase. The active transposition of piggyBac in multiple organisms allowed us to screen a transposase mutant library in yeast for hyperactive mutants and then to test candidates in mouse ES cells. We isolated 18 hyperactive mutants in yeast, among which five were also hyperactive in mammalian cells. By combining all mutations, a total of 7 aa substitutions, into a single reading frame, we generated a unique hyperactive piggyBac transposase with 17-fold and ninefold increases in excision and integration, respectively. We showed its applicability by demonstrating an increased efficiency of generation of transgene-free mouse induced pluripotent stem cells. We also analyzed whether this hyperactive piggyBac transposase affects the genomic integrity of the host cells. The frequency of footprints left by the hyperactive piggyBac transposase was as low as WT transposase (~1%) and we found no evidence that the expression of the transposase affects genomic integrity. This hyperactive piggyBac transposase expands the utility of the piggyBac transposon for applications in mammalian genetics and gene therapy.
607 citations
••
TL;DR: The results indicate that mucosal microbial diversity is reduced in IBD, particularly in CD, and that the species composition is disturbed, and may support the hypothesis that the overall dysbiosis observed in inflammatory bowel disease patients relative to non-IBD controls might be a result of the disturbed gut environment rather than the direct cause of disease.
Abstract: The gut microbiota is thought to play a key role in the development of the inflammatory bowel diseases Crohn's disease (CD) and ulcerative colitis (UC). Shifts in the composition of resident bacteria have been postulated to drive the chronic inflammation seen in both diseases (the "dysbiosis" hypothesis). We therefore specifically sought to compare the mucosa-associated microbiota from both inflamed and non-inflamed sites of the colon in CD and UC patients to that from non-IBD controls and to detect disease-specific profiles. Paired mucosal biopsies of inflamed and non-inflamed intestinal tissue from 6 CD (n = 12) and 6 UC (n = 12) patients were compared to biopsies from 5 healthy controls (n = 5) by in-depth sequencing of over 10,000 near full-length bacterial 16S rRNA genes. The results indicate that mucosal microbial diversity is reduced in IBD, particularly in CD, and that the species composition is disturbed. Firmicutes were reduced in IBD samples and there were concurrent increases in Bacteroidetes, and in CD only, Enterobacteriaceae. There were also significant differences in microbial community structure between inflamed and non-inflamed mucosal sites. However, these differences varied greatly between individuals, meaning there was no obvious bacterial signature that was positively associated with the inflamed gut. These results may support the hypothesis that the overall dysbiosis observed in inflammatory bowel disease patients relative to non-IBD controls might to some extent be a result of the disturbed gut environment rather than the direct cause of disease. Nonetheless, the observed shifts in microbiota composition may be important factors in disease maintenance and severity.
606 citations
••
TL;DR: The third-generation sequencing technology, led by Pacific Biosciences (PacBio), is progressing rapidly, moving from a technology once only capable of providing data for small genome analysis, or for performing targeted screening, to one that promises high quality de novo assembly and structural variation detection for human-sized genomes.
604 citations
••
Yale University1, BC Cancer Agency2, Laval University3, Joint Genome Institute4, École normale supérieure de Cachan5, Centre national de la recherche scientifique6, Wayne State University7, University of Georgia8, Wellcome Trust Sanger Institute9, University of California, Santa Cruz10, University of Notre Dame11, European Bioinformatics Institute12, Duke University13, Baylor College of Medicine14, Broad Institute15, University of Washington16, University of Maryland, College Park17, University of Hong Kong18, French Institute for Research in Computer Science and Automation19, California Institute for Quantitative Biosciences20, University of Lisbon21, University of California, Berkeley22, University of California, San Francisco23, Howard Hughes Medical Institute24, Cold Spring Harbor Laboratory25, Royal Institute of Technology26, University of Udine27
TL;DR: The Assemblathon 2 as discussed by the authors presented a variety of sequence data to be assembled for three vertebrate species (a bird, a fish, and a snake) from 21 participating teams.
Abstract: Background: The process of generating raw genome sequence data continues to become cheaper, faster, and more accurate. However, assembly of such data into high-quality, finished genome sequences remains challenging. Many genome assembly tools are available, but they differ greatly in terms of their performance (speed, scalability, hardware requirements, acceptance of newer read technologies) and in their final output (composition of assembled sequence). More importantly, it remains largely unclear how to best assess the quality of assembled genome sequences. The Assemblathon competitions are intended to assess current state-of-the-art methods in genome assembly. Results: In Assemblathon 2, we provided a variety of sequence data to be assembled for three vertebrate species (a bird, a fish, and snake). This resulted in a total of 43 submitted assemblies from 21 participating teams. We evaluated these assemblies using a combination of optical map data, Fosmid sequences, and several statistical methods. From over 100 different metrics, we chose ten key measures by which to assess the overall quality of the assemblies. (Continued on next page)
602 citations
Authors
Showing all 4058 results
Name | H-index | Papers | Citations |
---|---|---|---|
Nicholas J. Wareham | 212 | 1657 | 204896 |
Gonçalo R. Abecasis | 179 | 595 | 230323 |
Panos Deloukas | 162 | 410 | 154018 |
Michael R. Stratton | 161 | 443 | 142586 |
David W. Johnson | 160 | 2714 | 140778 |
Michael John Owen | 160 | 1110 | 135795 |
Naveed Sattar | 155 | 1326 | 116368 |
Robert E. W. Hancock | 152 | 775 | 88481 |
Julian Parkhill | 149 | 759 | 104736 |
Nilesh J. Samani | 149 | 779 | 113545 |
Michael Conlon O'Donovan | 142 | 736 | 118857 |
Jian Yang | 142 | 1818 | 111166 |
Christof Koch | 141 | 712 | 105221 |
Andrew G. Clark | 140 | 823 | 123333 |
Stylianos E. Antonarakis | 138 | 746 | 93605 |