scispace - formally typeset
Search or ask a question
Institution

Wellcome Trust Sanger Institute

NonprofitCambridge, United Kingdom
About: Wellcome Trust Sanger Institute is a nonprofit organization based out in Cambridge, United Kingdom. It is known for research contribution in the topics: Population & Genome. The organization has 4009 authors who have published 9671 publications receiving 1224479 citations.


Papers
More filters
Journal ArticleDOI
TL;DR: The first genome-wide association study (GWAS) whose sample size (1,053 Swedish subjects) is sufficiently powered to detect genome- wide significance (p<1.5×10−7) for polymorphisms that modestly alter therapeutic warfarin dose is reported.
Abstract: We report the first genome-wide association study (GWAS) whose sample size (1,053 Swedish subjects) is sufficiently powered to detect genome-wide significance (p<1.5 x 10(-7)) for polymorphisms that modestly alter therapeutic warfarin dose. The anticoagulant drug warfarin is widely prescribed for reducing the risk of stroke, thrombosis, pulmonary embolism, and coronary malfunction. However, Caucasians vary widely (20-fold) in the dose needed for therapeutic anticoagulation, and hence prescribed doses may be too low (risking serious illness) or too high (risking severe bleeding). Prior work established that approximately 30% of the dose variance is explained by single nucleotide polymorphisms (SNPs) in the warfarin drug target VKORC1 and another approximately 12% by two non-synonymous SNPs (*2, *3) in the cytochrome P450 warfarin-metabolizing gene CYP2C9. We initially tested each of 325,997 GWAS SNPs for association with warfarin dose by univariate regression and found the strongest statistical signals (p<10(-78)) at SNPs clustering near VKORC1 and the second lowest p-values (p<10(-31)) emanating from CYP2C9. No other SNPs approached genome-wide significance. To enhance detection of weaker effects, we conducted multiple regression adjusting for known influences on warfarin dose (VKORC1, CYP2C9, age, gender) and identified a single SNP (rs2108622) with genome-wide significance (p = 8.3 x 10(-10)) that alters protein coding of the CYP4F2 gene. We confirmed this result in 588 additional Swedish patients (p<0.0029) and, during our investigation, a second group provided independent confirmation from a scan of warfarin-metabolizing genes. We also thoroughly investigated copy number variations, haplotypes, and imputed SNPs, but found no additional highly significant warfarin associations. We present power analysis of our GWAS that is generalizable to other studies, and conclude we had 80% power to detect genome-wide significance for common causative variants or markers explaining at least 1.5% of dose variance. These GWAS results provide further impetus for conducting large-scale trials assessing patient benefit from genotype-based forecasting of warfarin dose.

607 citations

Journal ArticleDOI
TL;DR: A unique hyperactive piggyBac transposase is generated with 17-fold and ninefold increases in excision and integration, respectively, and its applicability is shown by demonstrating an increased efficiency of generation of transgene-free mouse induced pluripotent stem cells.
Abstract: DNA transposons have been widely used for transgenesis and insertional mutagenesis in various organisms. Among the transposons active in mammalian cells, the moth-derived transposon piggyBac is most promising with its highly efficient transposition, large cargo capacity, and precise repair of the donor site. Here we report the generation of a hyperactive piggyBac transposase. The active transposition of piggyBac in multiple organisms allowed us to screen a transposase mutant library in yeast for hyperactive mutants and then to test candidates in mouse ES cells. We isolated 18 hyperactive mutants in yeast, among which five were also hyperactive in mammalian cells. By combining all mutations, a total of 7 aa substitutions, into a single reading frame, we generated a unique hyperactive piggyBac transposase with 17-fold and ninefold increases in excision and integration, respectively. We showed its applicability by demonstrating an increased efficiency of generation of transgene-free mouse induced pluripotent stem cells. We also analyzed whether this hyperactive piggyBac transposase affects the genomic integrity of the host cells. The frequency of footprints left by the hyperactive piggyBac transposase was as low as WT transposase (~1%) and we found no evidence that the expression of the transposase affects genomic integrity. This hyperactive piggyBac transposase expands the utility of the piggyBac transposon for applications in mammalian genetics and gene therapy.

607 citations

Journal ArticleDOI
TL;DR: The results indicate that mucosal microbial diversity is reduced in IBD, particularly in CD, and that the species composition is disturbed, and may support the hypothesis that the overall dysbiosis observed in inflammatory bowel disease patients relative to non-IBD controls might be a result of the disturbed gut environment rather than the direct cause of disease.
Abstract: The gut microbiota is thought to play a key role in the development of the inflammatory bowel diseases Crohn's disease (CD) and ulcerative colitis (UC). Shifts in the composition of resident bacteria have been postulated to drive the chronic inflammation seen in both diseases (the "dysbiosis" hypothesis). We therefore specifically sought to compare the mucosa-associated microbiota from both inflamed and non-inflamed sites of the colon in CD and UC patients to that from non-IBD controls and to detect disease-specific profiles. Paired mucosal biopsies of inflamed and non-inflamed intestinal tissue from 6 CD (n = 12) and 6 UC (n = 12) patients were compared to biopsies from 5 healthy controls (n = 5) by in-depth sequencing of over 10,000 near full-length bacterial 16S rRNA genes. The results indicate that mucosal microbial diversity is reduced in IBD, particularly in CD, and that the species composition is disturbed. Firmicutes were reduced in IBD samples and there were concurrent increases in Bacteroidetes, and in CD only, Enterobacteriaceae. There were also significant differences in microbial community structure between inflamed and non-inflamed mucosal sites. However, these differences varied greatly between individuals, meaning there was no obvious bacterial signature that was positively associated with the inflamed gut. These results may support the hypothesis that the overall dysbiosis observed in inflammatory bowel disease patients relative to non-IBD controls might to some extent be a result of the disturbed gut environment rather than the direct cause of disease. Nonetheless, the observed shifts in microbiota composition may be important factors in disease maintenance and severity.

606 citations

Journal ArticleDOI
TL;DR: The third-generation sequencing technology, led by Pacific Biosciences (PacBio), is progressing rapidly, moving from a technology once only capable of providing data for small genome analysis, or for performing targeted screening, to one that promises high quality de novo assembly and structural variation detection for human-sized genomes.

604 citations

Journal ArticleDOI
Keith Bradnam, Joseph Fass, Anton Alexandrov, Paul Baranay1, Michael Bechner, Inanc Birol2, Sébastien Boisvert3, Jarrod Chapman4, Guillaume Chapuis5, Guillaume Chapuis6, Rayan Chikhi5, Rayan Chikhi6, Hamidreza Chitsaz7, Wen-Chi Chou8, Jacques Corbeil3, Cristian Del Fabbro, Roderick R. Docking2, Richard Durbin9, Dent Earl10, Scott J. Emrich11, Pavel Fedotov, Nuno A. Fonseca12, Ganeshkumar Ganapathy13, Richard A. Gibbs14, Sante Gnerre15, Elenie Godzaridis3, Steve Goldstein, Matthias Haimel12, Giles Hall15, David Haussler10, Joseph B. Hiatt16, Isaac Ho4, Jason T. Howard13, Martin Hunt9, Shaun D. Jackman2, David B. Jaffe15, Erich D. Jarvis13, Huaiyang Jiang14, Sergey Kazakov, Paul J. Kersey12, Jacob O. Kitzman16, James R. Knight, Sergey Koren17, Tak-Wah Lam18, Dominique Lavenier19, Dominique Lavenier6, Dominique Lavenier5, François Laviolette3, Yingrui Li18, Zhenyu Li, Binghang Liu, Yue Liu14, Ruibang Luo18, Iain MacCallum15, Matthew D. MacManes20, Nicolas Maillet6, Nicolas Maillet19, Sergey Melnikov, Delphine Naquin6, Delphine Naquin19, Zemin Ning9, Thomas D. Otto9, Benedict Paten10, Octávio S. Paulo21, Adam M. Phillippy17, Francisco Pina-Martins21, Michael Place, Dariusz Przybylski15, Xiang Qin14, Carson Qu14, Filipe J. Ribeiro, Stephen Richards14, Daniel S. Rokhsar22, Daniel S. Rokhsar4, J. Graham Ruby23, J. Graham Ruby24, Simone Scalabrin, Michael C. Schatz25, David C. Schwartz, Alexey Sergushichev, Ted Sharpe15, Timothy I. Shaw8, Jay Shendure16, Yujian Shi, Jared T. Simpson9, Henry Song14, Fedor Tsarev, Francesco Vezzi26, Riccardo Vicedomini27, Bruno Vieira21, Jun Wang, Kim C. Worley14, Shuangye Yin15, Siu-Ming Yiu18, Jianying Yuan, Guojie Zhang, Hao Zhang, Shiguo Zhou, Ian F Korf 
TL;DR: The Assemblathon 2 as discussed by the authors presented a variety of sequence data to be assembled for three vertebrate species (a bird, a fish, and a snake) from 21 participating teams.
Abstract: Background: The process of generating raw genome sequence data continues to become cheaper, faster, and more accurate. However, assembly of such data into high-quality, finished genome sequences remains challenging. Many genome assembly tools are available, but they differ greatly in terms of their performance (speed, scalability, hardware requirements, acceptance of newer read technologies) and in their final output (composition of assembled sequence). More importantly, it remains largely unclear how to best assess the quality of assembled genome sequences. The Assemblathon competitions are intended to assess current state-of-the-art methods in genome assembly. Results: In Assemblathon 2, we provided a variety of sequence data to be assembled for three vertebrate species (a bird, a fish, and snake). This resulted in a total of 43 submitted assemblies from 21 participating teams. We evaluated these assemblies using a combination of optical map data, Fosmid sequences, and several statistical methods. From over 100 different metrics, we chose ten key measures by which to assess the overall quality of the assemblies. (Continued on next page)

602 citations


Authors

Showing all 4058 results

NameH-indexPapersCitations
Nicholas J. Wareham2121657204896
Gonçalo R. Abecasis179595230323
Panos Deloukas162410154018
Michael R. Stratton161443142586
David W. Johnson1602714140778
Michael John Owen1601110135795
Naveed Sattar1551326116368
Robert E. W. Hancock15277588481
Julian Parkhill149759104736
Nilesh J. Samani149779113545
Michael Conlon O'Donovan142736118857
Jian Yang1421818111166
Christof Koch141712105221
Andrew G. Clark140823123333
Stylianos E. Antonarakis13874693605
Network Information
Related Institutions (5)
Broad Institute
11.6K papers, 1.5M citations

96% related

Howard Hughes Medical Institute
34.6K papers, 5.2M citations

95% related

Laboratory of Molecular Biology
24.2K papers, 2.1M citations

94% related

Salk Institute for Biological Studies
13.1K papers, 1.6M citations

93% related

National Institutes of Health
297.8K papers, 21.3M citations

93% related

Performance
Metrics
No. of papers from the Institution in previous years
YearPapers
202317
202270
2021836
2020810
2019854
2018764