Showing papers by "Wellcome Trust Sanger Institute" published in 2008

Open Access

Journal Article

Accurate whole human genome sequencing using reversible terminator chemistry

David R. Bentley¹, Shankar Balasubramanian², Harold Swerdlow¹, Harold Swerdlow³ +198 more•Institutions (4)

06 Nov 2008-Nature

TL;DR: An approach that generates several billion bases of accurate nucleotide sequence per experiment at low cost is reported, effective for accurate, rapid and economical whole-genome re-sequencing and many other biomedical applications.

Abstract: DNA sequence information underpins genetic research, enabling discoveries of important biological or medical benefit. Sequencing projects have traditionally used long (400-800 base pair) reads, but the existence of reference sequences for the human and many other genomes makes it possible to develop new, fast approaches to re-sequencing, whereby shorter reads are compared to a reference to identify intraspecies genetic variation. Here we report an approach that generates several billion bases of accurate nucleotide sequence per experiment at low cost. Single molecules of DNA are attached to a flat surface, amplified in situ and used as templates for synthetic sequencing with fluorescent reversible terminator deoxyribonucleotides. Images of the surface are analysed to generate high-quality sequence. We demonstrate application of this approach to human genome sequencing on flow-sorted X chromosomes and then scale the approach to determine the genome sequence of a male Yoruba from Ibadan, Nigeria. We build an accurate consensus sequence from >30x average depth of paired 35-base reads. We characterize four million single-nucleotide polymorphisms and four hundred thousand structural variants, many of which were previously unknown. Our approach is effective for accurate, rapid and economical whole-genome re-sequencing and many other biomedical applications.
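The ">30x average depth of paired 35-base reads" figure follows from simple arithmetic: average depth is the total number of sequenced bases divided by the genome length. The sketch below is illustrative only; the function names are hypothetical, and the genome size used in the example is an assumed round figure, not a value taken from the paper.

```python
import math


def average_depth(n_reads: int, read_length: int, genome_size: int) -> float:
    """Average depth = total sequenced bases / genome length."""
    return n_reads * read_length / genome_size


def reads_needed(target_depth: float, read_length: int, genome_size: int) -> int:
    """Smallest number of reads that reaches the target average depth."""
    return math.ceil(target_depth * genome_size / read_length)


if __name__ == "__main__":
    # Assumption: a human genome of roughly 3.1 billion bases.
    assumed_genome_size = 3_100_000_000
    n = reads_needed(30, 35, assumed_genome_size)
    print(f"35-base reads needed for 30x depth: {n:,}")
```

This ignores unmappable reads and uneven coverage, so a real experiment would need more reads than the calculation suggests.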

3,802 citations

Journal Article

Mapping short DNA sequencing reads and calling variants using mapping quality scores

Heng Li¹, Jue Ruan, Richard Durbin•Institutions (1)

Wellcome Trust Sanger Institute¹

01 Nov 2008-Genome Research

TL;DR: This work describes MAQ, software that builds assemblies by mapping shotgun short reads to a reference genome and uses quality scores to derive genotype calls of the consensus sequence of a diploid genome, e.g., from a human sample.

Abstract: New sequencing technologies promise a new era in the use of DNA sequence. However, some of these technologies produce very short reads, typically of a few tens of base pairs, and to use these reads effectively requires new algorithms and software. In particular, there is a major issue in efficiently aligning short reads to a reference genome and handling ambiguity or lack of accuracy in this alignment. Here we introduce the concept of mapping quality, a measure of the confidence that a read actually comes from the position it is aligned to by the mapping algorithm. We describe the software MAQ that can build assemblies by mapping shotgun short reads to a reference genome, using quality scores to derive genotype calls of the consensus sequence of a diploid genome, e.g., from a human sample. MAQ makes full use of mate-pair information and estimates the error probability of each read alignment. Error probabilities are also derived for the final genotype calls, using a Bayesian statistical model that incorporates the mapping qualities, error probabilities from the raw sequence quality scores, sampling of the two haplotypes, and an empirical model for correlated errors at a site. Both read mapping and genotype calling are evaluated on simulated data and real data. MAQ is accurate, efficient, versatile, and user-friendly. It is freely available at http://maq.sourceforge.net.
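To make the idea of mapping quality concrete, the sketch below expresses the probability that a read is placed at the wrong position on a Phred-like scale, Q = -10 log10(P_wrong). It is a simplified illustration, not MAQ's actual model: the function names, the cap of 99, and the use of normalised per-position alignment likelihoods are all assumptions made for the example.

```python
import math
from typing import Sequence


def phred_from_error_prob(p_wrong: float, cap: int = 99) -> int:
    """Convert an error probability to a Phred-scaled quality, capped at `cap`."""
    if not 0.0 <= p_wrong <= 1.0:
        raise ValueError("p_wrong must lie in [0, 1]")
    if p_wrong == 0.0:
        return cap  # log10(0) is undefined, so report the cap instead
    return min(cap, int(round(-10.0 * math.log10(p_wrong))))


def error_prob_from_phred(quality: float) -> float:
    """Inverse conversion: Phred-scaled quality back to an error probability."""
    return 10.0 ** (-quality / 10.0)


def mapping_quality(position_likelihoods: Sequence[float]) -> int:
    """Toy mapping quality for a read with several candidate positions.

    Assumption: each value is the likelihood of the read given one candidate
    position, and all positions are equally likely a priori. The best position
    is wrong with probability 1 - best / total.
    """
    total = sum(position_likelihoods)
    if total <= 0.0:
        raise ValueError("at least one likelihood must be positive")
    p_wrong = 1.0 - max(position_likelihoods) / total
    return phred_from_error_prob(p_wrong)
```

A read with one dominant candidate position gets a high quality, while a read that fits two positions equally well gets a quality near 3, which is why repeats produce low-confidence alignments.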

2,927 citations

Journal Article

Genome-wide association defines more than 30 distinct susceptibility loci for Crohn's disease

Jeffrey C. Barrett¹, Sarah Hansoul², Dan L. Nicolae³, Judy H. Cho⁴, Richard H. Duerr⁵, John D. Rioux⁶, John D. Rioux⁷, Steven R. Brant⁸, Mark S. Silverberg⁹, Kent D. Taylor¹⁰, M. Michael Barmada⁵, Alain Bitton¹¹, Themistocles Dassopoulos⁸, Lisa W. Datta⁸, Todd Green⁶, Anne M. Griffiths⁹, Emily O. Kistner³, Michael T. Murtha⁴, Miguel Regueiro⁵, Jerome I. Rotter¹⁰, L. Philip Schumm³, A. Hillary Steinhart⁹, Stephan R. Targan¹⁰, Ramnik J. Xavier¹², Cécile Libioulle², Cynthia Sandor², Mark Lathrop, Jacques Belaiche², Olivier Dewit, Ivo Gut, Simon Heath, Debby Laukens¹³, Myriam Mni², Paul Rutgeerts¹⁴, André Van Gossum¹⁵, Diana Zelenika, Denis Franchimont¹⁵, Jean-Pierre Hugot¹⁶, Martine De Vos¹³, Severine Vermeire¹⁴, Edouard Louis², Lon R. Cardon¹, Carl A. Anderson¹, Hazel E. Drummond¹⁷, Elaine R. Nimmo¹⁷, Tariq Ahmad, Natalie J. Prescott¹⁸, Clive M. Onnie¹⁸, Sheila A. Fisher¹⁸, Jonathan Marchini¹⁹, Jilur Ghori²⁰, Suzannah Bumpstead²⁰, Rhian Gwilliam²⁰, Mark Tremelling²¹, Panos Deloukas²⁰, John C. Mansfield²², Derek P. Jewell¹⁹, Jack Satsangi¹⁷, Christopher G. Mathew¹⁸, Miles Parkes²¹, Michel Georges², Mark J. Daly¹², Mark J. Daly⁶•Institutions (22)

01 Aug 2008-Nature Genetics

TL;DR: The results strongly confirm 11 previously reported loci and provide genome-wide significant evidence for 21 additional loci, including the regions containing STAT3, JAK2, ICOSLG, CDKAL1 and ITLN1, which offer promise for informed therapeutic development.

Abstract: Several risk factors for Crohn's disease have been identified in recent genome-wide association studies. To advance gene discovery further, we combined data from three studies on Crohn's disease (a total of 3,230 cases and 4,829 controls) and carried out replication in 3,664 independent cases with a mixture of population-based and family-based controls. The results strongly confirm 11 previously reported loci and provide genome-wide significant evidence for 21 additional loci, including the regions containing STAT3, JAK2, ICOSLG, CDKAL1 and ITLN1. The expanded molecular understanding of the basis of this disease offers promise for informed therapeutic development.

2,584 citations

Journal Article

Meta-analysis of genome-wide association data and large-scale replication identifies additional susceptibility loci for type 2 diabetes

Eleftheria Zeggini¹, Laura J. Scott², Richa Saxena, Benjamin F. Voight, Jonathan Marchini³, T Hu², P. I. W. de Bakker⁴, P. I. W. de Bakker⁵, P. I. W. de Bakker⁶, Gonçalo R. Abecasis², Peter Almgren⁷, Gregers S. Andersen⁸, Kristin Ardlie⁶, Kristina Bengtsson Boström, Richard N. Bergman⁹, Lori L. Bonnycastle¹⁰, Knut Borch-Johnsen⁸, Knut Borch-Johnsen¹¹, Noël P. Burtt⁶, H Chen¹², Peter S. Chines¹⁰, Mark J. Daly, P Deodhar¹⁰, C.-J. Ding², A. S. F. Doney¹³, William L. Duren², Katherine S. Elliott¹, Mike Erdos¹⁰, Timothy M. Frayling¹⁴, Rachel M. Freathy¹⁴, Lauren Gianniny⁶, Harald Grallert, Niels Grarup⁸, Christopher J. Groves³, Candace Guiducci⁶, Torben Hansen⁸, Christian Herder¹⁵, Graham A. Hitman¹⁶, Thomas Edward Hughes¹², Bo Isomaa, Anne U. Jackson², Torben Jørgensen¹⁷, Augustine Kong¹⁸, Kari Kubalanza¹⁰, Finny G Kuruvilla⁵, Finny G Kuruvilla⁶, Johanna Kuusisto¹⁹, Claudia Langenberg²⁰, Hana Lango¹⁴, Torsten Lauritzen²¹, Yun Li², Cecilia M. Lindgren³, Cecilia M. Lindgren¹, Valeriya Lyssenko⁷, Amanda F. Marvelle²², Christine Meisinger, Kristian Midthjell²³, Karen L. Mohlke²², Mario A. Morken¹⁰, Andrew D. Morris¹³, Narisu Narisu¹⁰, Peter M. Nilsson⁷, Katharine R. Owen³, C. N. A. Palmer¹³, Felicity Payne²⁴, J. R. B. Perry¹⁴, E Pettersen²³, Carl Platou²³, Inga Prokopenko³, Inga Prokopenko¹, Lu Qi⁵, Lu Qi⁴, L Qin²², Nigel W. Rayner¹, Nigel W. Rayner³, Matthew G. Rees¹⁰, J J Roix¹², A Sandbaek¹¹, Beverley M. Shields, Marketa Sjögren⁷, Valgerdur Steinthorsdottir¹⁸, Heather M. Stringham², Amy J. Swift¹⁰, Gudmar Thorleifsson¹⁸, Unnur Thorsteinsdottir¹⁸, Nicholas J. Timpson²⁵, Nicholas J. Timpson¹, Tiinamaija Tuomi²⁶, Jaakko Tuomilehto²⁶, Mark Walker²⁷, Richard M. Watanabe⁹, Michael N. Weedon¹⁴, Cristen J. Willer², Thomas Illig, Kristian Hveem²³, Frank B. Hu⁵, Frank B. Hu⁴, Markku Laakso¹⁹, Kari Stefansson¹⁸, Oluf Pedersen¹¹, Oluf Pedersen⁸, Nicholas J. Wareham²⁰, Inês Barroso²⁴, Andrew T. Hattersley¹⁴, Francis S. Collins¹⁰, Leif Groop²⁶, Leif Groop⁷, Mark I. McCarthy³, Mark I. McCarthy¹, Michael Boehnke², David Altshuler•Institutions (27)

30 Mar 2008-Nature Genetics

TL;DR: The results illustrate the value of large discovery and follow-up samples for gaining further insights into the inherited basis of T2D, and detect at least six previously unknown loci with robust evidence for association.

Abstract: Genome-wide association (GWA) studies have identified multiple loci at which common variants modestly but reproducibly influence risk of type 2 diabetes (T2D). Established associations to common and rare variants explain only a small proportion of the heritability of T2D. As previously published analyses had limited power to identify variants with modest effects, we carried out meta-analysis of three T2D GWA scans comprising 10,128 individuals of European descent and approximately 2.2 million SNPs (directly genotyped and imputed), followed by replication testing in an independent sample with an effective sample size of up to 53,975. We detected at least six previously unknown loci with robust evidence for association, including the JAZF1 (P = 5.0 × 10⁻¹⁴), CDC123-CAMK1D (P = 1.2 × 10⁻¹⁰), TSPAN8-LGR5 (P = 1.1 × 10⁻⁹), THADA (P = 1.1 × 10⁻⁹), ADAMTS9 (P = 1.2 × 10⁻⁸) and NOTCH2 (P = 4.1 × 10⁻⁸) gene regions. Our results illustrate the value of large discovery and follow-up samples for gaining further insights into the inherited basis of T2D.

1,872 citations

Journal Article

Large recurrent microdeletions associated with schizophrenia

Hreinn Stefansson¹, Dan Rujescu², Sven Cichon³, Olli Pietiläinen, Andres Ingason¹, Stacy Steinberg¹, Ragnheidur Fossdal¹, Engilbert Sigurdsson, Thordur Sigmundsson, Jacobine E. Buizer-Voskamp⁴, Thomas Hansen⁵, Thomas Hansen⁶, Klaus D. Jakobsen⁶, Klaus D. Jakobsen⁵, Pierandrea Muglia⁷, Clyde Francks⁷, Paul M. Matthews⁸, Arnaldur Gylfason¹, Bjarni V. Halldorsson¹, Daniel F. Gudbjartsson¹, Thorgeir E. Thorgeirsson¹, Asgeir Sigurdsson¹, Adalbjorg Jonasdottir¹, Aslaug Jonasdottir¹, Asgeir Björnsson¹, Sigurborg Mattiasdottir¹, Thorarinn Blondal¹, Magnús Haraldsson, Brynja B. Magnusdottir, Ina Giegling², Hans-Jürgen Möller², Annette M. Hartmann², Kevin V. Shianna⁹, Dongliang Ge⁹, Anna C. Need⁹, Caroline Crombie¹⁰, Gillian Fraser¹⁰, Nicholas Walker, Jouko Lönnqvist, Jaana Suvisaari, Annamarie Tuulio-Henriksson, Tiina Paunio, T. Toulopoulou¹¹, Elvira Bramon¹¹, Marta Di Forti¹¹, Robin M. Murray¹¹, Mirella Ruggeri¹², Evangelos Vassos¹¹, Sarah Tosato¹², Muriel Walshe¹¹, Tao Li¹¹, Tao Li¹³, Catalina Vasilescu³, Thomas W. Mühleisen³, August G. Wang⁵, Henrik Ullum⁵, Srdjan Djurovic¹⁴, Ingrid Melle, Jes Olesen¹⁵, Lambertus A. Kiemeney¹⁶, Barbara Franke¹⁶, Chiara Sabatti¹⁷, Nelson B. Freimer¹⁷, Jeffrey R. Gulcher¹, Unnur Thorsteinsdottir¹, Augustine Kong¹, Ole A. Andreassen¹⁴, Roel A. Ophoff⁴, Roel A. Ophoff¹⁷, Alexander Georgi¹⁸, Marcella Rietschel¹⁸, Thomas Werge⁵, Hannes Petursson, David Goldstein⁹, Markus M. Nöthen³, Leena Peltonen¹⁹, Leena Peltonen²⁰, David A. Collier¹¹, David A. Collier¹³, David St Clair¹⁰, Kari Stefansson¹, Kari Stefansson²¹•Institutions (21)

deCODE genetics¹, Ludwig Maximilian University of Munich², University of Bonn³, Utrecht University⁴, Copenhagen University Hospital⁵, University of Copenhagen⁶, GlaxoSmithKline⁷, Hammersmith Hospital⁸, Duke University⁹, Royal Cornhill Hospital¹⁰, King's College London¹¹, University of Verona¹², Sichuan University¹³, University of Oslo¹⁴, Glostrup Hospital¹⁵, Radboud University Nijmegen Medical Centre¹⁶, University of California, Los Angeles¹⁷, Heidelberg University¹⁸, Wellcome Trust Sanger Institute¹⁹, Broad Institute²⁰, University of Iceland²¹

11 Sep 2008-Nature

TL;DR: In a genome-wide search for CNVs associating with schizophrenia, a population-based sample was used to identify de novo CNVs by analysing 9,878 transmissions from parents to offspring; three deletions significantly associate with schizophrenia and related psychoses in the combined sample.

Abstract: Reduced fecundity, associated with severe mental disorders, places negative selection pressure on risk alleles and may explain, in part, why common variants have not been found that confer risk of disorders such as autism, schizophrenia and mental retardation. Thus, rare variants may account for a larger fraction of the overall genetic risk than previously assumed. In contrast to rare single nucleotide mutations, rare copy number variations (CNVs) can be detected using genome-wide single nucleotide polymorphism arrays. This has led to the identification of CNVs associated with mental retardation and autism. In a genome-wide search for CNVs associating with schizophrenia, we used a population-based sample to identify de novo CNVs by analysing 9,878 transmissions from parents to offspring. The 66 de novo CNVs identified were tested for association in a sample of 1,433 schizophrenia cases and 33,250 controls. Three deletions at 1q21.1, 15q11.2 and 15q13.3 showing nominal association with schizophrenia in the first sample (phase I) were followed up in a second sample of 3,285 cases and 7,951 controls (phase II). All three deletions significantly associate with schizophrenia and related psychoses in the combined sample. The identification of these rare, recurrent risk variants, having occurred independently in multiple founders and being subject to negative selection, is important in itself. CNV analysis may also point the way to the identification of additional and more prevalent risk variants in genes and pathways involved in schizophrenia.

1,767 citations

Journal Article

Systemic spread is an early step in breast cancer.

Yves Hüsemann¹, Jochen B. Geigl¹, Falk Schubert², Piero Musiani³, Manfred Meyer¹, Elke Burghart¹, Guido Forni⁴, Roland Eils⁵, Tanja Fehm⁶, Gert Riethmüller⁷, Christoph Klein¹•Institutions (7)

University of Regensburg¹, Wellcome Trust Sanger Institute², University of Chieti-Pescara³, University of Turin⁴, Heidelberg University⁵, University of Tübingen⁶, Ludwig Maximilian University of Munich⁷

08 Jan 2008-Cancer Cell

TL;DR: It is shown that tumor cells can disseminate systemically from earliest epithelial alterations in HER-2 and PyMT transgenic mice and from ductal carcinoma in situ in women, and release from dormancy of early-disseminated cancer cells may frequently account for metachronous metastasis.

1,126 citations

Journal Article

The minimum information about a genome sequence (MIGS) specification.

Dawn Field, George M. Garrity¹, Tanya Gray, Norman Morrison, Jeremy D. Selengut², Peter Sterk, Tatiana Tatusova³, Nicholas R. Thomson⁴, Michael J. Allen⁵, Samuel V. Angiuoli⁶, Michael Ashburner⁷, Nelson Axelrod², Sandra L. Baldauf⁸, S. Ballard⁷, Jeffrey L. Boore⁹, Guy Cochrane, James R. Cole¹, Peter Dawyndt¹⁰, Paul De Vos¹⁰, Claude W. dePamphilis¹¹, Robert Edwards¹², Nadeem Faruque, Robert G. Feldman, Jack A. Gilbert⁵, Paul Gilna¹³, Frank Oliver Glöckner¹⁴, Philip Goldstein¹⁵, Robert P. Guralnick¹⁵, Daniel H. Haft², David Hancock, Henning Hermjakob, Christiane Hertz-Fowler⁴, Phil Hugenholtz⁹, Ian Joint⁵, Leonid Kagan², Matthew D. Kane¹⁶, Jessie Kennedy¹⁷, George A. Kowalchuk, Renzo Kottmann¹⁴, Eugene Kolker¹⁸, Saul A. Kravitz², Nikos C. Kyrpides⁹, Jim Leebens-Mack¹⁹, Suzanna E. Lewis²⁰, Kelvin Li², Allyson L. Lister²¹, Phillip Lord²¹, Natalia Maltsev¹², Victor Markowitz²², Jennifer B. H. Martiny²³, Barbara A. Methé², Ilene Mizrachi³, Richard Moxon²⁴, Karen E. Nelson²⁵, Julian Parkhill⁴, Lita M. Proctor¹⁶, Owen White⁶, Susanna-Assunta Sansone, Andrew J. Spiers²⁶, Robert Stevens²⁷, Paul Swift, Chris F. Taylor, Yoshio Tateno, Adrian Tett, Sarah L. Turner, David W. Ussery²⁸, Bob Vaughan, Naomi L. Ward²⁹, Trish Whetzel³⁰, Inigo San Gil³¹, Gareth A. Wilson, Anil Wipat²¹•Institutions (31)

Michigan State University¹, J. Craig Venter Institute², National Institutes of Health³, Wellcome Trust Sanger Institute⁴, Plymouth Marine Laboratory⁵, University of Maryland, Baltimore⁶, University of Cambridge⁷, University of York⁸, United States Department of Energy⁹, Ghent University¹⁰, Pennsylvania State University¹¹, Argonne National Laboratory¹², University of California, San Diego¹³, Jacobs University Bremen¹⁴, University of Colorado Boulder¹⁵, National Science Foundation¹⁶, Edinburgh Napier University¹⁷, Boston Children's Hospital¹⁸, University of Georgia¹⁹, University of California, Berkeley²⁰, Newcastle University²¹, Lawrence Berkeley National Laboratory²², University of California, Irvine²³, University of Oxford²⁴, Howard University²⁵, Abertay University²⁶, University of Manchester²⁷, Technical University of Denmark²⁸, University of Wyoming²⁹, University of Pennsylvania³⁰, University of New Mexico³¹

01 May 2008-Nature Biotechnology

TL;DR: Here, the minimum information about a genome sequence (MIGS) specification is introduced with the intent of promoting participation in its development and discussing the resources that will be required to develop improved mechanisms of metadata capture and exchange.

Abstract: With the quantity of genomic data increasing at an exponential rate, it is imperative that these data be captured electronically, in a standard format. Standardization activities must proceed within the auspices of open-access and international working bodies. To tackle the issues surrounding the development of better descriptions of genomic investigations, we have formed the Genomic Standards Consortium (GSC). Here, we introduce the minimum information about a genome sequence (MIGS) specification with the intent of promoting participation in its development and discussing the resources that will be required to develop improved mechanisms of metadata capture and exchange. As part of its wider goals, the GSC also supports improving the 'transparency' of the information contained in existing genomic databases.

1,097 citations

Journal Article

Dynamic repertoire of a eukaryotic transcriptome surveyed at single-nucleotide resolution

Brian T. Wilhelm¹, Brian T. Wilhelm², Samuel Marguerat², Samuel Marguerat¹, Stephen Watt², Stephen Watt¹, Falk Schubert¹, Falk Schubert², Valerie Wood¹, Ian Goodhead¹, Ian Goodhead², Christopher J. Penkett², Christopher J. Penkett¹, Jane Rogers¹, Jürg Bähler², Jürg Bähler¹•Institutions (2)

Wellcome Trust Sanger Institute¹, European Bioinformatics Institute²

26 Jun 2008-Nature

TL;DR: High-throughput sequencing of complementary DNAs (RNA-Seq) and strand-specific array data provide rich condition-specific information on novel, mostly non-coding transcripts, untranslated regions and gene structures, thus improving the existing genome annotation.

Abstract: Until recently, it was thought that much of a genome sequence is silent for much of the time. Now a study in the fission yeast Schizosaccharomyces pombe, using recently developed DNA sequencing technologies, shows that almost all of the yeast genome is genetically active. More than 90% of the genome is transcribed into RNA, including more than 450 newly discovered transcripts, many of them non-coding, with regulatory or other unknown roles. Using recently developed DNA sequencing technologies, nucleic acid transcripts are characterized in unprecedented detail from the yeast Schizosaccharomyces pombe. The sequences definitively demonstrate that 90% or more of the genome is transcribed into RNA, and show a previously unseen link between transcription and splicing efficiency at different points in the cell's growth. Recent data from several organisms indicate that the transcribed portions of genomes are larger and more complex than expected, and that many functional properties of transcripts are based not on coding sequences but on regulatory sequences in untranslated regions or non-coding RNAs [1-9]. Alternative start and polyadenylation sites and regulation of intron splicing add additional dimensions to the rich transcriptional output [10,11]. This transcriptional complexity has been sampled mainly using hybridization-based methods under one or few experimental conditions. Here we applied direct high-throughput sequencing of complementary DNAs (RNA-Seq), supplemented with data from high-density tiling arrays, to globally sample transcripts of the fission yeast Schizosaccharomyces pombe, independently from available gene annotations. We interrogated transcriptomes under multiple conditions, including rapid proliferation, meiotic differentiation and environmental stress, as well as in RNA processing mutants to reveal the dynamic plasticity of the transcriptional landscape as a function of environmental, developmental and genetic factors.
High-throughput sequencing proved to be a powerful and quantitative method to sample transcriptomes deeply at maximal resolution. In contrast to hybridization, sequencing showed little, if any, background noise and was sensitive enough to detect widespread transcription in >90% of the genome, including traces of RNAs that were not robustly transcribed or rapidly degraded. The combined sequencing and strand-specific array data provide rich condition-specific information on novel, mostly non-coding transcripts, untranslated regions and gene structures, thus improving the existing genome annotation. Sequence reads spanning exon–exon or exon–intron junctions give unique insight into a surprising variability in splicing efficiency across introns, genes and conditions. Splicing efficiency was largely coordinated with transcript levels, and increased transcription led to increased splicing in test genes. Hundreds of introns showed such regulated splicing during cellular proliferation or differentiation.

991 citations

Journal Article

The diploid genome sequence of an Asian individual.

Jun Wang, Wei Wang¹, Ruiqiang Li², Ruiqiang Li¹, Yingrui Li³, Yingrui Li¹, Yingrui Li⁴, Geng Tian¹, Geng Tian⁵, Laurie Goodman¹, Wei Fan¹, Junqing Zhang¹, Jun Li¹, Juanbin Zhang¹, Yiran Guo¹, Yiran Guo⁵, Binxiao Feng¹, Heng Li¹, Heng Li⁶, Yao Lu¹, Xiaodong Fang¹, Huiqing Liang¹, Zhenglin Du¹, Dong Li¹, Yiqing Zhao⁵, Yiqing Zhao¹, Yujie Hu⁵, Yujie Hu¹, Zhenzhen Yang¹, Hancheng Zheng¹, Ines Hellmann⁷, Michael Inouye⁶, John E. Pool⁷, Xin Yi¹, Xin Yi⁵, Jing Zhao¹, Jinjie Duan¹, Yan Zhou¹, Junjie Qin¹, Junjie Qin⁵, Lijia Ma¹, Lijia Ma⁵, Guoqing Li¹, Zhentao Yang¹, Guojie Zhang⁵, Guojie Zhang¹, Bin Yang¹, Chang Yu¹, Fang Liang¹, Fang Liang⁵, Wenjie Li¹, Shaochuan Li¹, Dawei Li¹, Peixiang Ni¹, Jue Ruan¹, Jue Ruan⁵, Qibin Li¹, Qibin Li⁵, Hongmei Zhu¹, Dongyuan Liu¹, Zhike Lu¹, Ning Li¹, Ning Li⁵, Guangwu Guo¹, Guangwu Guo⁵, Jianguo Zhang¹, Jia Ye¹, Lin Fang¹, Qin Hao⁵, Qin Hao¹, Quan Chen³, Quan Chen¹, Yu Liang¹, Yu Liang⁵, Yeyang Su¹, Yeyang Su⁵, A. san⁵, A. san¹, Cuo Ping⁵, Cuo Ping¹, Shuang Yang¹, Fang Chen¹, Fang Chen⁵, Li Li¹, Ke Zhou¹, Hongkun Zheng², Hongkun Zheng¹, Yuanyuan Ren¹, Ling Yang¹, Yang Gao⁴, Yang Gao¹, Guohua Yang¹, Guohua Yang⁸, Zhuo Li¹, Xiaoli Feng¹, Karsten Kristiansen², Gane Ka-Shu Wong⁹, Gane Ka-Shu Wong¹, Rasmus Nielsen⁷, Richard Durbin⁶, Lars Bolund¹⁰, Lars Bolund¹, Xiuqing Zhang¹, Xiuqing Zhang⁴, Songgang Li³, Songgang Li¹, Songgang Li⁸, Huanming Yang¹, Huanming Yang⁸, Jian Wang¹, Jian Wang⁸•Institutions (10)

Beijing Genomics Institute¹, University of Southern Denmark², Peking University³, Beijing Institute of Genomics⁴, Chinese Academy of Sciences⁵, Wellcome Trust Sanger Institute⁶, University of California, Berkeley⁷, Shenzhen University⁸, University of Alberta⁹, Aarhus University¹⁰

06 Nov 2008-Nature

TL;DR: Genotyping analysis showed that SNP identification had high accuracy and consistency, indicating the high sequence quality of this assembly, and the potential usefulness of next-generation sequencing technologies for personal genomics.

Abstract: Here we present the first diploid genome sequence of an Asian individual. The genome was sequenced to 36-fold average coverage using massively parallel sequencing technology. We aligned the short reads onto the NCBI human reference genome to 99.97% coverage, and guided by the reference genome, we used uniquely mapped reads to assemble a high-quality consensus sequence for 92% of the Asian individual's genome. We identified approximately 3 million single-nucleotide polymorphisms (SNPs) inside this region, of which 13.6% were not in the dbSNP database. Genotyping analysis showed that SNP identification had high accuracy and consistency, indicating the high sequence quality of this assembly. We also carried out heterozygote phasing and haplotype prediction against HapMap CHB and JPT haplotypes (Chinese and Japanese, respectively), sequence comparison with the two available individual genomes (J. D. Watson and J. C. Venter), and structural variation identification. These variations were considered for their potential biological impact. Our sequence data and analyses demonstrate the potential usefulness of next-generation sequencing technologies for personal genomics.

963 citations

Journal Article

Identification of somatically acquired rearrangements in cancer using genome-wide massively parallel paired-end sequencing

Peter J. Campbell¹, Philip J. Stephens¹, Erin Pleasance¹, Sarah O’Meara¹, Heng Li¹, Thomas Santarius¹, Thomas Santarius², Lucy Stebbings¹, Catherine Leroy¹, Sarah Edkins¹, Claire Hardy¹, Jon W. Teague¹, Andrew Menzies¹, Ian Goodhead¹, Daniel J. Turner¹, C M Clee¹, Michael A. Quail¹, Antony V. Cox¹, Clive Gavin Brown¹, Richard Durbin¹, Matthew E. Hurles¹, Paul A.W. Edwards², Graham R. Bignell¹, Michael R. Stratton¹, P. Andrew Futreal¹•Institutions (2)

Wellcome Trust Sanger Institute¹, University of Cambridge²

01 Jun 2008-Nature Genetics

TL;DR: The results demonstrate the feasibility of systematic, genome-wide characterization of rearrangements in complex human cancer genomes, raising the prospect of a new harvest of genes associated with cancer using this strategy.

Abstract: Human cancers often carry many somatically acquired genomic rearrangements, some of which may be implicated in cancer development. However, conventional strategies for characterizing rearrangements are laborious and low-throughput and have low sensitivity or poor resolution. We used massively parallel sequencing to generate sequence reads from both ends of short DNA fragments derived from the genomes of two individuals with lung cancer. By investigating read pairs that did not align correctly with respect to each other on the reference human genome, we characterized 306 germline structural variants and 103 somatic rearrangements to the base-pair level of resolution. The patterns of germline and somatic rearrangement were markedly different. Many somatic rearrangements were from amplicons, although rearrangements outside these regions, notably including tandem duplications, were also observed. Some somatic rearrangements led to abnormal transcripts, including two from internal tandem duplications and two fusion transcripts created by interchromosomal rearrangements. Germline variants were predominantly mediated by retrotransposition, often involving AluY and LINE elements. The results demonstrate the feasibility of systematic, genome-wide characterization of rearrangements in complex human cancer genomes, raising the prospect of a new harvest of genes associated with cancer using this strategy.
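The idea of flagging read pairs that do not align correctly with respect to each other can be sketched as a simple classifier. This is an illustration only, not the authors' pipeline: the `ReadPair` record, the category labels, the forward/reverse orientation rule and the insert-size bounds are all hypothetical choices for the example.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class ReadPair:
    """Hypothetical record: mapped chromosome, position and strand of each end."""
    chrom1: str
    pos1: int
    strand1: str  # "+" or "-"
    chrom2: str
    pos2: int
    strand2: str  # "+" or "-"


def classify_pair(pair: ReadPair, min_insert: int = 100, max_insert: int = 600) -> str:
    """Label a read pair as concordant or by the way in which it is discordant.

    Assumption: a concordant pair maps to one chromosome, with the leftmost
    read on the forward strand and the rightmost on the reverse strand, and
    with a span inside [min_insert, max_insert]. The bounds are illustrative.
    """
    if pair.chrom1 != pair.chrom2:
        return "interchromosomal"
    # Order the two ends by position so orientation is judged left to right.
    (left_pos, left_strand), (right_pos, right_strand) = sorted(
        [(pair.pos1, pair.strand1), (pair.pos2, pair.strand2)]
    )
    if (left_strand, right_strand) != ("+", "-"):
        return "unexpected_orientation"
    span = right_pos - left_pos
    if span < min_insert or span > max_insert:
        return "unexpected_insert_size"
    return "concordant"
```

In practice, a single discordant pair is weak evidence; candidate rearrangements are usually supported by several independent pairs pointing at the same breakpoint.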

899 citations

Journal Article

Genome-wide association analysis identifies 20 loci that influence adult height

Michael N. Weedon, Hana Lango¹, Cecilia M. Lindgren², Cecilia M. Lindgren³, Chris Wallace⁴, David M. Evans⁵, Massimo Mangino⁶, Rachel M. Freathy¹, J. R. B. Perry¹, Suzanne Stevens⁶, Alistair S. Hall⁷, Nilesh J. Samani⁶, Beverley M. Shields, Inga Prokopenko², Inga Prokopenko³, Martin Farrall², Anna F. Dominiczak⁸, Toby Johnson⁹, Toby Johnson¹⁰, Toby Johnson¹¹, Sven Bergmann⁹, Sven Bergmann¹⁰, Jacques S. Beckmann¹¹, Jacques S. Beckmann¹⁰, Peter Vollenweider¹¹, Dawn M. Waterworth¹², Vincent Mooser¹², C. N. A. Palmer¹³, C. N. A. Palmer¹⁴, Andrew D. Morris¹⁵, Willem H. Ouwehand¹³, Willem H. Ouwehand¹⁴, Jing Hua Zhao, Shengxu Li, R. J. F. Loos¹⁶, R. J. F. Loos¹⁴, Inês Barroso¹⁷, Panagiotis Deloukas¹⁷, Manjinder S. Sandhu¹⁴, Manjinder S. Sandhu¹⁶, Eleanor Wheeler¹⁷, Nicole Soranzo¹⁷, Michael Inouye¹⁷, Nicholas J. Wareham, Mark J. Caulfield⁴, Patricia B. Munroe⁴, Andrew T. Hattersley¹, Mark I. McCarthy³, Mark I. McCarthy², Timothy M. Frayling¹•Institutions (17)

University of Exeter¹, Wellcome Trust Centre for Human Genetics², University of Oxford³, Queen Mary University of London⁴, University of Bristol⁵, University of Leicester⁶, University of Leeds⁷, British Heart Foundation⁸, Swiss Institute of Bioinformatics⁹, University of Lausanne¹⁰, University Hospital of Lausanne¹¹, GlaxoSmithKline¹², National Health Service¹³, University of Cambridge¹⁴, University of Dundee¹⁵, Imperial College London¹⁶, Wellcome Trust Sanger Institute¹⁷

01 May 2008-Nature Genetics

TL;DR: The loci the authors identified implicate genes in Hedgehog signaling, extracellular matrix, and cancer pathways, and provide new insights into human growth and developmental processes and insights into the genetic architecture of a classic quantitative trait.

Abstract: Adult height is a model polygenic trait, but there has been limited success in identifying the genes underlying its normal variation. To identify genetic variants influencing adult human height, we used genome-wide association data from 13,665 individuals and genotyped 39 variants in an additional 16,482 samples. We identified 20 variants associated with adult height (P < 5 × 10⁻⁷, with 10 reaching P < 1 × 10⁻¹⁰). Combined, the 20 SNPs explain approximately 3% of height variation, with an approximately 5 cm difference between the 6.2% of people with 17 or fewer 'tall' alleles compared to the 5.5% with 27 or more 'tall' alleles. The loci we identified implicate genes in Hedgehog signaling (IHH, HHIP, PTCH1), extracellular matrix (EFEMP1, ADAMTSL3, ACAN) and cancer (CDK6, HMGA2, DLEU7) pathways, and provide new insights into human growth and developmental processes. Finally, our results provide insights into the genetic architecture of a classic quantitative trait.

Journal Article

The Pangenome Structure of Escherichia coli: Comparative Genomic Analysis of E. coli Commensal and Pathogenic Isolates

David A. Rasko¹, M. J. Rosovitz², Garry S. A. Myers¹, Emmanuel F. Mongodin¹, W. Florian Fricke¹, Pawel Gajer¹, Jonathan Crabtree², Mohammed Sebaihia³, Nicholas R. Thomson³, Roy R. Chaudhuri⁴, Ian R. Henderson⁵, Vanessa Sperandio⁶, Jacques Ravel¹•Institutions (6)

University of Maryland, Baltimore¹, J. Craig Venter Institute², Wellcome Trust Sanger Institute³, University of Cambridge⁴, University of Birmingham⁵, University of Texas Southwestern Medical Center⁶

15 Oct 2008-Journal of Bacteriology

TL;DR: Pangenomic calculations indicate that E. coli genomic diversity represents an open pangenome model containing a reservoir of more than 13,000 genes, many of which may be uncharacterized but important virulence factors, which should provide the basis for future functional work on this important group of pathogens.

Abstract: Whole-genome sequencing has been skewed toward bacterial pathogens as a consequence of the prioritization of medical and veterinary diseases. However, it is becoming clear that in order to accurately measure genetic variation within and between pathogenic groups, multiple isolates, as well as commensal species, must be sequenced. This study examined the pangenomic content of Escherichia coli. Six distinct E. coli pathovars can be distinguished using molecular or phenotypic markers, but only two of the six pathovars have been subjected to any genome sequencing previously. Thus, this report provides a seminal description of the genomic contents and unique features of three unsequenced pathovars, enterotoxigenic E. coli, enteropathogenic E. coli, and enteroaggregative E. coli. We also determined the first genome sequence of a human commensal E. coli isolate, E. coli HS, which will undoubtedly provide a new baseline from which workers can examine the evolution of pathogenic E. coli. Comparison of 17 E. coli genomes, 8 of which are new, resulted in identification of ∼2,200 genes conserved in all isolates. We were also able to identify genes that were isolate and pathovar specific. Fewer pathovar-specific genes were identified than anticipated, suggesting that each isolate may have independently developed virulence capabilities. Pangenome calculations indicate that E. coli genomic diversity represents an open pangenome model containing a reservoir of more than 13,000 genes, many of which may be uncharacterized but important virulence factors. This comparative study of the species E. coli, while descriptive, should provide the basis for future functional work on this important group of pathogens.
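
The core-versus-pangenome distinction in the abstract above can be illustrated with plain set operations. This is a minimal sketch, not the authors' pipeline: the isolate names other than HS, the gene labels, and the function names `core_and_pan` and `isolate_specific` are all hypothetical, and real analyses first have to cluster genes into ortholog groups.

```python
from functools import reduce


def core_and_pan(genomes):
    """Return (core, pan) gene sets for a mapping of isolate name -> genes.

    core: genes present in every isolate (set intersection)
    pan:  genes present in at least one isolate (set union)
    """
    gene_sets = [set(genes) for genes in genomes.values()]
    core = reduce(set.intersection, gene_sets)
    pan = reduce(set.union, gene_sets)
    return core, pan


def isolate_specific(genomes):
    """Return, per isolate, the genes found in that isolate and no other."""
    result = {}
    for name, genes in genomes.items():
        others = set().union(
            *(set(g) for other, g in genomes.items() if other != name)
        )
        result[name] = set(genes) - others
    return result


# Hypothetical toy data: three isolates with made-up gene labels.
toy_genomes = {
    "HS": {"geneA", "geneB", "geneC"},
    "isolate2": {"geneA", "geneB", "geneD"},
    "isolate3": {"geneA", "geneB", "geneD", "geneE"},
}

core, pan = core_and_pan(toy_genomes)
print(sorted(core))  # genes shared by all three toy isolates
print(len(pan))      # size of the toy pangenome
print(isolate_specific(toy_genomes))
```

In this picture, a pangenome is "open" when the union keeps growing as further genomes are added while the intersection levels off.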

Journal Article•DOI•

A large genome center's improvements to the Illumina sequencing system.

Michael A. Quail¹, Iwanka Kozarewa¹, Frances Smith¹, Aylwyn Scally¹, Philip J. Stephens¹, Richard Durbin¹, Harold Swerdlow¹, Daniel J. Turner¹•Institutions (1)

Wellcome Trust Sanger Institute¹

01 Dec 2008-Nature Methods

TL;DR: A set of improvements to the standard Illumina protocols is described that make library preparation more reliable in a high-throughput environment, reduce bias, tighten insert size distribution and reliably yield high volumes of data.

Abstract: The Wellcome Trust Sanger Institute is one of the world's largest genome centers, and a substantial amount of our sequencing is performed with 'next-generation' massively parallel sequencing technologies: in June 2008 the quantity of purity-filtered sequence data generated by our Genome Analyzer (Illumina) platforms reached 1 terabase, and our average weekly Illumina production output is currently 64 gigabases. Here we describe a set of improvements we have made to the standard Illumina protocols to make the library preparation more reliable in a high-throughput environment, to reduce bias, tighten insert size distribution and reliably obtain high yields of data.

Journal Article•DOI•

Recurrent Rearrangements of Chromosome 1q21.1 and Variable Pediatric Phenotypes

Heather C Mefford¹, Andrew J. Sharp², Carl Baker¹, Andy Itsara¹, Zhaoshi Jiang¹, Karen Buysse³, Shuwen Huang⁴, Viv K. Maloney⁴, John A. Crolla⁴, Diana Baralle⁵, Amanda L. Collins⁵, Catherine Mercer⁵, Koenraad Norga⁶, Thomy de Ravel⁶, Koenraad Devriendt⁶, Ernie M.H.F. Bongers⁷, Nicole de Leeuw⁷, William Reardon, Stefania Gimelli², Frédérique Béna², Raoul C.M. Hennekam⁸, Raoul C.M. Hennekam⁹, Alison Male⁹, Lorraine Gaunt¹⁰, Jill Clayton-Smith¹⁰, Ingrid Simonic, Soo Mi Park, Sarju G. Mehta, Serena Nik-Zainal, C. Geoffrey Woods, Helen V. Firth, Georgina Parkin, Marco Fichera, Santina Reitano, Mariangela Lo Giudice, Kelly Li, Iris Casuga, Adam Broomer, Bernard Conrad¹¹, Markus Schwerzmann¹¹, Lorenz Räber¹¹, Sabina Gallati¹¹, Pasquale Striano¹², Antonietta Coppola¹², John Tolmie¹³, Edward S. Tobias¹³, Chris Lilley¹³, Lluís Armengol¹⁴, Yves Spysschaert³, Patrick Verloo³, Anja De Coene³, Linde Goossens³, Geert Mortier³, Frank Speleman³, Ellen van Binsbergen¹⁵, Marcel R. Nelen¹⁵, Ron Hochstenbach¹⁵, Martin Poot¹⁵, Louise Gallagher, Michael Gill, Jon McClellan¹, Mary Claire King¹, Regina Regan¹⁶, Cindy Skinner, Roger E. Stevenson, Stylianos E. Antonarakis², Caifu Chen, Xavier Estivill¹⁴, Björn Menten³, Giorgio Gimelli, Susan M. Gribble¹⁷, Stuart Schwartz¹⁸, James S. Sutcliffe¹⁹, Tom Walsh¹, Samantha J. L. Knight¹⁶, Jonathan Sebat²⁰, Corrado Romano, Charles E. Schwartz, Joris A. Veltman⁷, Bert B.A. de Vries⁷, Joris Vermeesch⁶, John C. K. Barber⁴, Lionel Willatt, May Tassabehji¹⁰, Evan E. Eichler²¹, Evan E. Eichler¹•Institutions (21)

16 Oct 2008-The New England Journal of Medicine

TL;DR: Recurrent molecular lesions that elude syndromic classification and whose disease manifestations must be considered in a broader context of development as opposed to being assigned to a specific disease are identified.

Abstract: BACKGROUND: Duplications and deletions in the human genome can cause disease or predispose persons to disease. Advances in technologies to detect these changes allow for the routine identification of submicroscopic imbalances in large numbers of patients. METHODS: We tested for the presence of microdeletions and microduplications at a specific region of chromosome 1q21.1 in two groups of patients with unexplained mental retardation, autism, or congenital anomalies and in unaffected persons. RESULTS: We identified 25 persons with a recurrent 1.35-Mb deletion within 1q21.1 from screening 5218 patients. The microdeletions had arisen de novo in eight patients, were inherited from a mildly affected parent in three patients, were inherited from an apparently unaffected parent in six patients, and were of unknown inheritance in eight patients. The deletion was absent in a series of 4737 control persons (P = 1.1 × 10⁻⁷). We found considerable variability in the level of phenotypic expression of the microdeletion; phenotypes included mild-to-moderate mental retardation, microcephaly, cardiac abnormalities, and cataracts. The reciprocal duplication was enriched in nine children with mental retardation or autism spectrum disorder and other variable features (P = 0.02). We identified three deletions and three duplications of the 1q21.1 region in an independent sample of 788 patients with mental retardation and congenital anomalies. CONCLUSIONS: We have identified recurrent molecular lesions that elude syndromic classification and whose disease manifestations must be considered in a broader context of development as opposed to being assigned to a specific disease. Clinical diagnosis in patients with these lesions may be most readily achieved on the basis of genotype rather than phenotype.
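
The case-control comparison reported above (25 deletions among 5218 patients, none among 4737 controls) can be sanity-checked with a small exact calculation. The abstract does not say which test produced the published P value, so the following is only a sketch under the assumption of a one-sided, Fisher-type exact test: with both group sizes and the total number of carriers fixed, it computes the probability that all 25 carriers fall in the patient group by chance. The function name is hypothetical.

```python
from fractions import Fraction
from math import comb


def prob_all_carriers_in_cases(carriers, n_cases, n_controls):
    """Hypergeometric probability that every carrier is a case.

    Assumes fixed margins: `carriers` deletion carriers are distributed at
    random among n_cases + n_controls people. Because zero carriers among
    controls is the most extreme possible table, this single term is the
    whole one-sided exact P value.
    """
    total = n_cases + n_controls
    return Fraction(comb(n_cases, carriers), comb(total, carriers))


# Counts taken from the abstract.
p = prob_all_carriers_in_cases(25, 5218, 4737)
print(f"one-sided exact P = {float(p):.2e}")
```

Under these assumptions the result is on the order of 10⁻⁷, the same order of magnitude as the published figure. An exact match should not be expected, because the authors' test and sidedness are not stated.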

Journal Article•DOI•

Newly identified genetic risk variants for celiac disease related to the immune response

Karen A. Hunt¹, Alexandra Zhernakova², Graham Turner³, Graham A. Heap¹, Lude Franke², Marcel Bruinenberg⁴, Jihane Romanos⁴, Lotte C. Dinesen⁵, Anthony W. Ryan³, Davinder Panesar¹, Rhian Gwilliam⁶, Fumihiko Takeuchi⁶, William M. McLaren⁶, Geoffrey Holmes, Peter D. Howdle⁷, Julian R.F. Walters⁸, David S Sanders⁹, Raymond J. Playford¹, Gosia Trynka⁴, Chris J. J. Mulder¹⁰, M. Luisa Mearin¹⁰, M. Luisa Mearin¹¹, Wieke H. M. Verbeek¹⁰, Valerie Trimble³, Fiona M. Stevens¹², Colm O'Morain³, Nicholas P. Kennedy³, Dermot Kelleher³, Daniel J. Pennington¹, David P. Strachan¹³, Wendy L. McArdle¹⁴, Charles A. Mein¹, Martin C. Wapenaar⁴, Panos Deloukas⁶, Ralph McGinnis⁶, Ross McManus³, Cisca Wijmenga², Cisca Wijmenga⁴, David A. van Heel¹•Institutions (14)

Queen Mary University of London¹, Utrecht University², Trinity College, Dublin³, University of Groningen⁴, University of Oxford⁵, Wellcome Trust Sanger Institute⁶, University of Leeds⁷, Imperial College London⁸, University of Sheffield⁹, VU University Amsterdam¹⁰, Leiden University¹¹, National University of Ireland¹², University of London¹³, University of Bristol¹⁴

01 Apr 2008-Nature Genetics

TL;DR: This extensive genome-wide association follow-up study has identified additional celiac disease risk variants in relevant biological pathways and identified seven previously unknown risk regions.

Abstract: Our genome-wide association study of celiac disease previously identified risk variants in the IL2-IL21 region. To identify additional risk variants, we genotyped 1,020 of the most strongly associated non-HLA markers in an additional 1,643 cases and 3,406 controls. Through joint analysis including the genome-wide association study data (767 cases, 1,422 controls), we identified seven previously unknown risk regions (P < 5 × 10⁻⁷). Six regions harbor genes controlling immune responses, including CCR3, IL12A, IL18RAP, RGS1, SH2B3 (nsSNP rs3184504) and TAGAP. Whole-blood IL18RAP mRNA expression correlated with IL18RAP genotype. Type 1 diabetes and celiac disease share HLA-DQ, IL2-IL21, CCR3 and SH2B3 risk regions. Thus, this extensive genome-wide association follow-up study has identified additional celiac disease risk variants in relevant biological pathways.

Journal Article•DOI•

Genome analysis of the platypus reveals unique signatures of evolution

Wesley C. Warren¹, LaDeana W. Hillier¹, Jennifer A. Marshall Graves², Ewan Birney, Chris P. Ponting³, Frank Grützner⁴, Katherine Belov⁵, Webb Miller⁶, Laura Clarke⁷, Asif T. Chinwalla¹, Shiaw Pyng Yang¹, Andreas Heger³, Devin P. Locke¹, Pat Miethke², Paul D. Waters², Frédéric Veyrunes⁸, Frédéric Veyrunes², Lucinda Fulton¹, Bob Fulton¹, Tina Graves¹, John W. Wallis¹, Xose S. Puente⁹, Carlos López-Otín⁹, Gonzalo R. Ordóñez⁹, Evan E. Eichler¹⁰, Lin Chen¹⁰, Ze Cheng¹⁰, Janine E. Deakin², Amber E. Alsop², Katherine Thompson², Patrick J. Kirby², Anthony T. Papenfuss¹¹, Matthew Wakefield¹¹, Tsviya Olender¹², Doron Lancet¹², Gavin A. Huttley², Arian F.A. Smit¹³, Andrew J Pask¹⁴, Peter Temple-Smith¹⁴, Peter Temple-Smith¹⁵, Mark A. Batzer¹⁶, Jerilyn A. Walker¹⁶, Miriam K. Konkel¹⁶, Robert S. Harris⁶, Camilla M. Whittington⁵, Emily S. W. Wong⁵, Neil J. Gemmell¹⁷, Emmanuel Buschiazzo¹⁷, Iris M. Vargas Jentzsch¹⁷, Angelika Merkel¹⁷, Juergen Schmitz¹⁸, Anja Zemann¹⁸, Gennady Churakov¹⁸, Jan Ole Kriegs¹⁸, Juergen Brosius¹⁸, Elizabeth P. Murchison¹⁹, Ravi Sachidanandam¹⁹, Carly Smith¹⁹, Gregory J. Hannon¹⁹, Enkhjargal Tsend-Ayush⁴, Daniel McMillan², Rosalind Attenborough², Willem Rens⁸, Malcolm A. Ferguson-Smith⁸, Christophe Lefevre²⁰, Christophe Lefevre¹⁴, Julie A. Sharp¹⁴, Kevin R. Nicholas¹⁴, David A. Ray²¹, Michael Kube, Richard Reinhardt, Thomas H. Pringle, James Taylor²², Russell C. Jones, Brett Nixon, Jean Louis Dacheux²³, Hitoshi Niwa, Yoko Sekita, Xiaoqiu Huang²⁴, Alexander Stark²⁵, Pouya Kheradpour²⁵, Manolis Kellis²⁵, Paul Flicek, Yuan Chen, Caleb Webber³, Ross C. Hardison, Joanne O. Nelson¹, Kym Hallsworth-Pepin¹, Kim D. Delehaunty¹, Chris Markovic¹, Patrick Minx¹, Yucheng Feng¹, Colin Kremitzki¹, Makedonka Mitreva¹, Jarret Glasscock¹, Todd Wylie¹, Patricia Wohldmann¹, Prathapan Thiru¹, Michael N. Nhan¹, Craig Pohl¹, Scott M. Smith¹, Shunfeng Hou¹, Marilyn B. Renfree¹⁴, Elaine R. Mardis¹, Richard K. Wilson¹•Institutions (25)

08 May 2008-Nature

TL;DR: It is found that reptile and platypus venom proteins have been co-opted independently from the same gene families; milk protein genes are conserved despite platypuses laying eggs; and immune gene family expansions are directly related to platypus biology.

Abstract: We present a draft genome sequence of the platypus, Ornithorhynchus anatinus. This monotreme exhibits a fascinating combination of reptilian and mammalian characters. For example, platypuses have a coat of fur adapted to an aquatic lifestyle; platypus females lactate, yet lay eggs; and males are equipped with venom similar to that of reptiles. Analysis of the first monotreme genome aligned these features with genetic innovations. We find that reptile and platypus venom proteins have been co-opted independently from the same gene families; milk protein genes are conserved despite platypuses laying eggs; and immune gene family expansions are directly related to platypus biology. Expansions of protein, non-protein-coding RNA and microRNA families, as well as repeat elements, are identified. Sequencing of this genome now provides a valuable resource for deep mammalian comparative analyses, as well as for monotreme biology and conservation.

Journal Article•DOI•

A Bayesian deconvolution strategy for immunoprecipitation-based DNA methylome analysis

Thomas A. Down¹, Vardhman K. Rakyan², Daniel J. Turner³, Paul Flicek⁴, Heng Li³, Eugene Kulesha⁴, Stefan Gräf⁴, Nathan Johnson⁴, Javier Herrero⁴, Eleni M. Tomazou³, Natalie P. Thorne⁵, Liselotte Bäckdahl⁶, Marlis Herberth⁵, Kevin L. Howe⁵, David K. Jackson³, Marcos Mateo Miretti³, John C. Marioni⁵, Ewan Birney⁴, Tim Hubbard³, Richard Durbin³, Simon Tavaré⁵, Stephan Beck⁶•Institutions (6)

Wellcome Trust/Cancer Research UK Gurdon Institute¹, Queen Mary University of London², Wellcome Trust Sanger Institute³, European Bioinformatics Institute⁴, University of Cambridge⁵, University College London⁶

01 Jul 2008-Nature Biotechnology

TL;DR: This work has developed a cross-platform algorithm—Bayesian tool for methylation analysis (Batman)—for analyzing methylated DNA immunoprecipitation profiles generated using oligonucleotide arrays or next-generation sequencing, developed to provide a high-resolution whole-genome DNA methylation profile (DNA methylome) of a mammalian genome.

Abstract: DNA methylation is an indispensable epigenetic modification required for regulating the expression of mammalian genomes. Immunoprecipitation-based methods for DNA methylome analysis are rapidly shifting the bottleneck in this field from data generation to data analysis, necessitating the development of better analytical tools. In particular, an inability to estimate absolute methylation levels remains a major analytical difficulty associated with immunoprecipitation-based DNA methylation profiling. To address this issue, we developed a cross-platform algorithm, the Bayesian tool for methylation analysis (Batman), for analyzing methylated DNA immunoprecipitation (MeDIP) profiles generated using oligonucleotide arrays (MeDIP-chip) or next-generation sequencing (MeDIP-seq). We developed the latter approach to provide a high-resolution whole-genome DNA methylation profile (DNA methylome) of a mammalian genome. Strong correlation of our data, obtained using mature human spermatozoa, with those obtained using bisulfite sequencing suggests that combining MeDIP-seq or MeDIP-chip with Batman provides a robust, quantitative and cost-effective functional genomic strategy for elucidating the function of DNA methylation.

Journal Article•DOI•

Bone mineral density, osteoporosis, and osteoporotic fractures: a genome-wide association study

J. B. Richards¹, Fernando Rivadeneira², Michael Inouye³, Tomi Pastinen⁴, Nicole Soranzo³, Scott Wilson⁵, Scott Wilson⁶, Toby Andrew¹, Mario Falchi¹, Rhian Gwilliam³, Kourosh R. Ahmadi¹, Ana M. Valdes¹, P. Arp², Pamela Whittaker³, Dominique J. Verlaan⁷, Dominique J. Verlaan⁴, Mila Jhamai², Vasudev Kumanduri³, M. Moorhouse², J.B. van Meurs², Albert Hofman², Huib A.P. Pols², D Hart¹, Guangju Zhai¹, Bernet S. Kato¹, Benjamin H. Mullin⁶, Feng Zhang¹, Panos Deloukas³, André G. Uitterlinden², Tim D. Spector¹•Institutions (7)

King's College London¹, Erasmus University Rotterdam², Wellcome Trust Sanger Institute³, McGill University⁴, University of Western Australia⁵, Sir Charles Gairdner Hospital⁶, Université de Montréal⁷

03 May 2008-The Lancet

TL;DR: The combined effect of these risk alleles on fractures is similar to that of most well-replicated environmental risk factors, and they are present in more than one in five white people, suggesting a potential role in screening.

Journal Article•DOI•

BAC TransgeneOmics: a high-throughput method for exploration of protein function in mammals.

Ina Poser¹, Mihail Sarov¹, Mihail Sarov², James R. A. Hutchins³, Jean-Karim Hériché⁴, Yusuke Toyoda¹, Andrei Pozniakovsky¹, Daniela Weigl⁵, Anja Nitzsche¹, Björn Hegemann³, Alexander W. Bird¹, Laurence Pelletier⁶, Laurence Pelletier¹, Ralf Kittler¹, Ralf Kittler⁷, Sujun Hua, Ronald Naumann¹, Martina Augsburg¹, Martina M. Sykora³, Helmut Hofemeister², Youming Zhang, Kim Nasmyth⁸, Kevin P. White⁷, Steffen Dietzel⁵, Karl Mechtler³, Richard Durbin⁴, A. Francis Stewart², Jan-Michael Peters³, Frank Buchholz¹, Anthony A. Hyman¹•Institutions (8)

Max Planck Society¹, Dresden University of Technology², Research Institute of Molecular Pathology³, Wellcome Trust Sanger Institute⁴, Ludwig Maximilian University of Munich⁵, University of Toronto⁶, University of Chicago⁷, University of Oxford⁸

01 May 2008-Nature Methods

TL;DR: A fast and reliable pipeline to study protein function in mammalian cells based on protein tagging in bacterial artificial chromosomes (BACs) is described and it is shown that BAC transgenes can be rapidly and reliably generated using 96-well-format recombineering.

Abstract: The interpretation of genome sequences requires reliable and standardized methods to assess protein function at high throughput. Here we describe a fast and reliable pipeline to study protein function in mammalian cells based on protein tagging in bacterial artificial chromosomes (BACs). The large size of the BAC transgenes ensures the presence of most, if not all, regulatory elements and results in expression that closely matches that of the endogenous gene. We show that BAC transgenes can be rapidly and reliably generated using 96-well-format recombineering. After stable transfection of these transgenes into human tissue culture cells or mouse embryonic stem cells, the localization, protein-protein and/or protein-DNA interactions of the tagged protein are studied using generic, tag-based assays. The same high-throughput approach will be generally applicable to other model systems. NOTE: In the version of this article initially published online, the name of one individual was misspelled in the Acknowledgments. The second sentence of the Acknowledgments paragraph should read, “We thank I. Cheesman for helpful discussions.” The error has been corrected for all versions of the article.

Journal Article•DOI•

Identification of ten loci associated with height highlights new biological pathways in human growth.

Guillaume Lettre¹, Guillaume Lettre², Anne U. Jackson³, Christian Gieger⁴, Fredrick R. Schumacher⁵, Fredrick R. Schumacher⁶, Sonja I. Berndt⁷, Serena Sanna³, Susana Eyheramendy⁴, Benjamin F. Voight², Benjamin F. Voight⁵, Johannah L. Butler¹, Candace Guiducci², Thomas Illig, Rachel Hackett², Iris M. Heid⁴, Kevin B. Jacobs, Valeriya Lyssenko⁸, Manuela Uda, Michael Boehnke³, Stephen J. Chanock⁹, Leif Groop¹⁰, Leif Groop⁸, Frank B. Hu⁶, Frank B. Hu⁵, Bo Isomaa, Peter Kraft⁵, Leena Peltonen², Leena Peltonen¹¹, Leena Peltonen¹², Veikko Salomaa, David Schlessinger⁷, David J. Hunter, Richard B. Hayes⁷, Gonçalo R. Abecasis³, H-Erich Wichmann⁴, Karen L. Mohlke¹³, Joel N. Hirschhorn², Joel N. Hirschhorn⁵, Joel N. Hirschhorn¹•Institutions (13)

Boston Children's Hospital¹, Broad Institute², University of Michigan³, Ludwig Maximilian University of Munich⁴, Harvard University⁵, Brigham and Women's Hospital⁶, National Institutes of Health⁷, Lund University⁸, United States Department of Health and Human Services⁹, Helsinki University Central Hospital¹⁰, University of Helsinki¹¹, Wellcome Trust Sanger Institute¹², University of North Carolina at Chapel Hill¹³

01 May 2008-Nature Genetics

TL;DR: A meta-analysis of genome-wide association study data on height from 15,821 individuals at 2.2 million SNPs found that 10 newly identified and two previously reported loci were strongly associated with variation in height; the results highlight several pathways as important regulators of human stature.

Journal Article•DOI•

A study of typhoid fever in five Asian countries: disease burden and implications for controls

R. Leon Ochiai¹, Camilo J. Acosta¹, M. Carolina Danovaro-Holliday¹, Dong Baiqing², Sujit K. Bhattacharya, Magdarina D. Agtini, Zulfiqar A Bhutta³, Do Gia Canh, Mohammad Ali¹, Seonghye Shin¹, John Wain⁴, Anne-Laure Page¹, M. John Albert⁵, Jeremy Farrar⁶, Remon Abu-Elyazeed, Tikki Pang⁷, Claudia M. Galindo¹, Lorenz von Seidlein¹, John D. Clemens¹•Institutions (7)

International Vaccine Institute¹, Centers for Disease Control and Prevention², Aga Khan University³, Wellcome Trust Sanger Institute⁴, Kuwait University⁵, University of Oxford⁶, World Health Organization⁷

01 Apr 2008-Bulletin of The World Health Organization

TL;DR: The incidence of typhoid varied substantially between sites, being high in India and Pakistan, intermediate in Indonesia, and low in China and Viet Nam; the findings underscore the importance of evidence on disease burden in making policy decisions about interventions to control this disease.

Abstract: Objective To inform policy-makers about introduction of preventive interventions against typhoid, including vaccination. Methods A population-based prospective surveillance design was used. Study sites where typhoid was considered a problem by local authorities were established in China, India, Indonesia, Pakistan and Viet Nam. Standardized clinical, laboratory, and surveillance methods were used to investigate cases of fever of ≥ 3 days' duration for a one-year period. A total of 441 435 persons were under surveillance, 159 856 of whom were aged 5–15 years. Findings A total of 21 874 episodes of fever were detected. Salmonella typhi was isolated from 475 (2%) blood cultures, 57% (273/475) of which were from 5–15 year-olds. The annual typhoid incidence (per 100 000 person years) among this age group varied from 24.2 and 29.3 in sites in Viet Nam and China, respectively, to 180.3 in the site in Indonesia; and to 412.9 and 493.5 in sites in Pakistan and India, respectively. Altogether, 23% (96/413) of isolates were multidrug resistant (chloramphenicol, ampicillin and trimethoprim-sulfamethoxazole). Conclusion The incidence of typhoid varied substantially between sites, being high in India and Pakistan, intermediate in Indonesia, and low in China and Viet Nam. These findings highlight the considerable, but geographically heterogeneous, burden of typhoid fever in endemic areas of Asia, and underscore the importance of evidence on disease burden in making policy decisions about interventions to control this disease.
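
The percentages quoted in the abstract above can be reproduced from the counts it reports. The snippet below is a small arithmetic check only; the helper names are hypothetical. It takes the 475 positive cultures over the 21 874 fever episodes as the basis for the 2% figure, which is an assumption, since the abstract does not state the number of cultures taken. The incidence helper shows the standard events-per-person-time formula with made-up inputs, because the per-site case counts and person-years are not given.

```python
def percent(numerator, denominator):
    """Percentage rounded to the nearest whole number."""
    return round(100 * numerator / denominator)


def incidence_per_100k(cases, person_years):
    """Incidence rate expressed per 100 000 person-years."""
    return 100_000 * cases / person_years


# Counts taken from the abstract.
print(percent(475, 21_874))  # share of fever episodes yielding S. typhi
print(percent(273, 475))     # share of isolates from 5-15 year-olds
print(percent(96, 413))      # share of tested isolates that were multidrug resistant

# Hypothetical inputs, for illustration only (not from the study).
print(incidence_per_100k(50, 25_000))
```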

Journal Article•DOI•

High-resolution mapping of expression-QTLs yields insight into human gene regulation.

Jean-Baptiste Veyrieras¹, Sridhar Kudaravalli¹, Su Yeon Kim¹, Emmanouil T. Dermitzakis², Yoav Gilad¹, Matthew Stephens¹, Jonathan K. Pritchard³, Jonathan K. Pritchard¹•Institutions (3)

University of Chicago¹, Wellcome Trust Sanger Institute², Howard Hughes Medical Institute³

10 Oct 2008-PLOS Genetics

TL;DR: An important role for mRNA stability in determining steady-state mRNA levels is suggested, and the potential of eQTL mapping as a high-resolution tool for studying the determinants of gene regulation is highlighted.

Abstract: Recent studies of the HapMap lymphoblastoid cell lines have identified large numbers of quantitative trait loci for gene expression (eQTLs). Reanalyzing these data using a novel Bayesian hierarchical model, we were able to create a surprisingly high-resolution map of the typical locations of sites that affect mRNA levels in cis. Strikingly, we found a strong enrichment of eQTLs in the 250 bp just upstream of the transcription end site (TES), in addition to an enrichment around the transcription start site (TSS). Most eQTLs lie either within genes or close to genes; for example, we estimate that only 5% of eQTLs lie more than 20 kb upstream of the TSS. After controlling for position effects, SNPs in exons are approximately 2-fold more likely than SNPs in introns to be eQTLs. Our results suggest an important role for mRNA stability in determining steady-state mRNA levels, and highlight the potential of eQTL mapping as a high-resolution tool for studying the determinants of gene regulation.

Journal Article•DOI•

Artemis and ACT

Tim Carver¹, Matthew Berriman¹, Adrian Tivey¹, Chinmay Patel¹, Ulrike Böhme¹, Barclay G. Barrell¹, Julian Parkhill¹, Marie-Adèle Rajandream¹•Institutions (1)

Wellcome Trust Sanger Institute¹

01 Dec 2008-Bioinformatics

TL;DR: A new version of Artemis has been developed, which reads from and writes to a relational database schema, and allows users to annotate more complex, often large and fragmented, genome sequences.

Abstract: Motivation: Artemis and Artemis Comparison Tool (ACT) have become mainstream tools for viewing and annotating sequence data, particularly for microbial genomes. Since its first release, Artemis has been continuously developed and supported with additional functionality for editing and analysing sequences based on feedback from an active user community of laboratory biologists and professional annotators. Nevertheless, its utility has been somewhat restricted by its limitation to reading and writing from flat files. Therefore, a new version of Artemis has been developed, which reads from and writes to a relational database schema, and allows users to annotate more complex, often large and fragmented, genome sequences. Results: Artemis and ACT have now been extended to read and write directly to the Generic Model Organism Database (GMOD, http://www.gmod.org) Chado relational database schema. In addition, a Gene Builder tool has been developed to provide structured forms and tables to edit coordinates of gene models and edit functional annotation, based on standard ontologies, controlled vocabularies and free text. Availability: Artemis and ACT are freely available (under a GPL licence) for download (for MacOSX, UNIX and Windows) at the Wellcome Trust Sanger Institute web sites: http://www.sanger.ac.uk/Software/Artemis/ and http://www.sanger.ac.uk/Software/ACT/ Contact: artemis@sanger.ac.uk Supplementary information: Supplementary data are available at Bioinformatics online.

Journal Article•DOI•

Promoting coherent minimum reporting guidelines for biological and biomedical investigations: the MIBBI project

Chris F. Taylor¹, Chris F. Taylor², Dawn Field, Susanna-Assunta Sansone², Susanna-Assunta Sansone¹, Jan Aerts³, Rolf Apweiler, Michael Ashburner⁴, Catherine A. Ball⁵, Pierre-Alain Binz⁶, Molly Bogue, Timothy F. Booth², Alvis Brazma, Ryan R. Brinkman⁷, Adam Clark⁸, Eric W. Deutsch⁹, Oliver Fiehn¹⁰, Jennifer Fostel¹¹, Peter Ghazal¹², Frank Gibson¹³, Tanya Gray, Graeme R. Grimes¹², John M. Hancock¹⁴, Nigel Hardy¹⁵, Henning Hermjakob, Randall K. Julian, Matthew D. Kane¹⁶, Carsten Kettner¹⁷, Christopher R. Kinsinger¹⁸, Eugene Kolker¹⁹, Martin Kuiper²⁰, Nicolas Le Novère, Jim Leebens-Mack²¹, Suzanna E. Lewis²², Phillip Lord¹³, Ann-Marie Mallon¹⁴, Nishanth Marthandan²³, Hiroshi Masuya, Ruth McNally²⁴, Alexander Mehrle²⁵, Norman Morrison²⁶, Sandra Orchard, John Quackenbush²⁷, James M. Reecy²⁸, Donald G. Robertson²⁹, Philippe Rocca-Serra, Henry Rodriguez¹⁸, Heiko Rosenfelder²⁵, Javier Santoyo-Lopez¹², Richard H. Scheuermann²³, Daniel Schober, Barry Smith³⁰, Jason Snape³¹, Christian J. Stoeckert, Keith F. Tipton³², Peter Sterk, Andreas Untergasser³³, Jo Vandesompele³⁴, Stefan Wiemann²⁵•Institutions (34)

European Bioinformatics Institute¹, Natural Environment Research Council², Wellcome Trust Sanger Institute³, University of Cambridge⁴, Stanford University⁵, Swiss Institute of Bioinformatics⁶, University of British Columbia⁷, Livestrong Foundation⁸, Institute for Systems Biology⁹, University of California, Davis¹⁰, Lockheed Martin Corporation¹¹, University of Edinburgh¹², Newcastle University¹³, Medical Research Council¹⁴, Aberystwyth University¹⁵, National Science Foundation¹⁶, Beilstein-Institut¹⁷, National Institutes of Health¹⁸, Boston Children's Hospital¹⁹, Norwegian University of Science and Technology²⁰, University of Georgia²¹, University of California, Berkeley²², University of Texas Southwestern Medical Center²³, Lancaster University²⁴, German Cancer Research Center²⁵, University of Manchester²⁶, Harvard University²⁷, Iowa State University²⁸, Bristol-Myers Squibb²⁹, University at Buffalo³⁰, AstraZeneca³¹, Trinity College, Dublin³², Wageningen University and Research Centre³³, Ghent University³⁴

01 Aug 2008-Nature Biotechnology

TL;DR: The Minimum Information for Biological and Biomedical Investigations (MIBBI) project aims to foster the coordinated development of minimum-information checklists and provide a resource for those exploring the range of extant checklists.

Journal Article•DOI•

Genome Sequence of Staphylococcus aureus Strain Newman and Comparative Analysis of Staphylococcal Genomes: Polymorphism and Evolution of Two Major Pathogenicity Islands

Tadashi Baba¹, Taeok Bae², Olaf Schneewind³, Fumihiko Takeuchi⁴, Keiichi Hiramatsu¹•Institutions (4)

Juntendo University¹, Indiana University², University of Chicago³, Wellcome Trust Sanger Institute⁴

01 Jan 2008-Journal of Bacteriology

TL;DR: The complete genome sequence of S. aureus Newman is reported, which carries four integrated prophages, as well as two large pathogenicity islands, and the absence of drug resistance genes reflects the general antibiotic-susceptible phenotype of S. aureus Newman.

Abstract: Strains of Staphylococcus aureus, an important human pathogen, display up to 20% variability in their genome sequence, and most sequence information is available for human clinical isolates that have not been subjected to genetic analysis of virulence attributes. S. aureus strain Newman, which was also isolated from a human infection, displays robust virulence properties in animal models of disease and has already been extensively analyzed for its molecular traits of staphylococcal pathogenesis. We report here the complete genome sequence of S. aureus Newman, which carries four integrated prophages, as well as two large pathogenicity islands. In agreement with the view that S. aureus Newman prophages contribute important properties to pathogenesis, fewer virulence factors are found outside of the prophages than for the highly virulent strain MW2. The absence of drug resistance genes reflects the general antibiotic-susceptible phenotype of S. aureus Newman. Phylogenetic analyses reveal clonal relationships between the staphylococcal strains Newman, COL, NCTC8325, and USA300 and a greater evolutionary distance to strains MRSA252, MW2, MSSA476, N315, Mu50, JH1, JH9, and RF122. However, polymorphism analysis of two large pathogenicity islands distributed among these strains shows that the two islands were acquired independently from the evolutionary pathway of the chromosomal backbones of staphylococcal genomes. Prophages and pathogenicity islands play central roles in S. aureus virulence and evolution.

Journal Article•DOI•

Insights from the complete genome sequence of Mycobacterium marinum on the evolution of Mycobacterium tuberculosis

Timothy P. Stinear¹, Torsten Seemann², Paul Harrison², Grant A. Jenkin², John K. Davies², Paul D R Johnson³, Zahra Abdellah⁴, Claire Arrowsmith⁴, Tracey Chillingworth⁴, Carol Churcher⁴, Kay Clarke⁴, Ann Cronin⁴, Paul Davis⁴, Ian Goodhead⁴, Nancy Holroyd⁴, Kay Jagels⁴, Angela Lord⁴, Sharon Moule⁴, Karen Mungall⁴, Halina Norbertczak⁴, Michael A. Quail⁴, Ester Rabbinowitsch⁴, Danielle Walker⁴, Brian White⁴, Sally Whitehead⁴, Pamela L. C. Small⁵, Roland Brosch⁶, Lalita Ramakrishnan⁷, Michael A. Fischbach⁸, Julian Parkhill⁴, Stewart T. Cole⁹•Institutions (9)

Monash University, Clayton campus¹, Monash University², University of Melbourne³, Wellcome Trust Sanger Institute⁴, University of Tennessee⁵, Pasteur Institute⁶, University of Washington⁷, Broad Institute⁸, École Polytechnique Fédérale de Lausanne⁹

01 May 2008-Genome Research

TL;DR: The genome of the M strain of M. marinum comprises a 6,636,827-bp circular chromosome with 5424 CDS, 10 prophages, and a 23-kb mercury-resistance plasmid.

Abstract: Mycobacterium marinum, a ubiquitous pathogen of fish and amphibia, is a near relative of Mycobacterium tuberculosis, the etiologic agent of tuberculosis in humans. The genome of the M strain of M. marinum comprises a 6,636,827-bp circular chromosome with 5424 CDS, 10 prophages, and a 23-kb mercury-resistance plasmid. Prominent features are the very large number of genes (57) encoding polyketide synthases (PKSs) and nonribosomal peptide synthases (NRPSs) and the most extensive repertoire yet reported of the mycobacteria-restricted PE and PPE proteins, and related-ESX secretion systems. Some of the NRPS genes comprise a novel family and seem to have been acquired horizontally. M. marinum is used widely as a model organism to study M. tuberculosis pathogenesis, and genome comparisons confirmed the close genetic relationship between these two species, as they share 3000 orthologs with an average amino acid identity of 85%. Comparisons with the more distantly related Mycobacterium avium subspecies paratuberculosis and Mycobacterium smegmatis reveal how an ancestral generalist mycobacterium evolved into M. tuberculosis and M. marinum. M. tuberculosis has undergone genome downsizing and extensive lateral gene transfer to become a specialized pathogen of humans and other primates without retaining an environmental niche. M. marinum has maintained a large genome so as to retain the capacity for environmental survival while becoming a broad host range pathogen that produces disease strikingly similar to M. tuberculosis. The work described herein provides a foundation for using M. marinum to better understand the determinants of pathogenesis of tuberculosis.

Journal Article•DOI•

Use and misuse of the gene ontology annotations.

Seung Y. Rhee¹, Valerie Wood², Kara Dolinski³, Sorin Draghici⁴•Institutions (4)

Carnegie Institution for Science¹, Wellcome Trust Sanger Institute², Princeton University³, Wayne State University⁴

13 May 2008-Nature Reviews Genetics

TL;DR: Key aspects of GO are described, which, when overlooked, can cause erroneous results, and how these pitfalls can be avoided.

Abstract: The Gene Ontology (GO) project is a collaboration among model organism databases to describe gene products from all organisms using a consistent and computable language. GO produces sets of explicitly defined, structured vocabularies that describe biological processes, molecular functions and cellular components of gene products in both a computer- and human-readable manner. Here we describe key aspects of GO, which, when overlooked, can cause erroneous results, and address how these pitfalls can be avoided.

Journal Article•DOI•

The complete genome, comparative and functional analysis of Stenotrophomonas maltophilia reveals an organism heavily shielded by drug resistance determinants.

Lisa Crossman¹, Virginia C. Gould², J. Maxwell Dow³, Georgios S. Vernikos¹, Aki Okazaki², Mohammed Sebaihia¹, David L. Saunders¹, Claire Arrowsmith¹, Tim Carver¹, Nicholas S. Peters¹, Ellen Adlem¹, Arnaud Kerhornou¹, Angela Lord¹, Lee Murphy¹, Katharine Seeger¹, R. Squares¹, Simon Rutter¹, Michael A. Quail¹, Mari Adele Rajandream¹, David Harris¹, Carol Churcher¹, Stephen D. Bentley¹, Julian Parkhill¹, Nicholas R. Thomson¹, Matthew B. Avison²•Institutions (3)

Wellcome Trust Sanger Institute¹, University of Bristol², National University of Ireland³

17 Apr 2008-Genome Biology

TL;DR: The panoply of antimicrobial drug resistance genes and mobile genetic elements found suggests that the organism can act as a reservoir of antimicrobial drug resistance determinants in a clinical environment, which is an issue of considerable concern.

Abstract: Background: Stenotrophomonas maltophilia is a nosocomial opportunistic pathogen of the Xanthomonadaceae. The organism has been isolated from both clinical and soil environments in addition to the sputum of cystic fibrosis patients and the immunocompromised. Whilst relatively distant phylogenetically, the closest sequenced relatives of S. maltophilia are the plant pathogenic xanthomonads.

Journal Article•DOI•

High-throughput sequencing provides insights into genome variation and evolution in Salmonella Typhi

Kathryn E. Holt¹, Julian Parkhill¹, Camila J. Mazzoni², Philippe Roumagnac², François-Xavier Weill³, Ian Goodhead⁴, Ian Goodhead¹, Richard Rance¹, Stephen Baker¹, Stephen Baker⁵, Duncan J. Maskell⁶, John Wain¹, Christiane Dolecek⁵, Mark Achtman², Gordon Dougan¹•Institutions (6)

Wellcome Trust Sanger Institute¹, Max Planck Society², Pasteur Institute³, University of Liverpool⁴, University of Oxford⁵, University of Cambridge⁶

01 Aug 2008-Nature Genetics

TL;DR: The observed patterns of genetic isolation and drift are consistent with the proposed key role of asymptomatic carriers of Typhi as the main reservoir of this pathogen, highlighting the need for identification and treatment of carriers.

Abstract: Isolates of Salmonella enterica serovar Typhi (Typhi), a human-restricted bacterial pathogen that causes typhoid, show limited genetic variation. We generated whole-genome sequences for 19 Typhi isolates using 454 (Roche) and Solexa (Illumina) technologies. Isolates, including the previously sequenced CT18 and Ty2 isolates, were selected to represent major nodes in the phylogenetic tree. Comparative analysis showed little evidence of purifying selection, antigenic variation or recombination between isolates. Rather, evolution in the Typhi population seems to be characterized by ongoing loss of gene function, consistent with a small effective population size. The lack of evidence for antigenic variation driven by immune selection is in contrast to strong adaptive selection for mutations conferring antibiotic resistance in Typhi. The observed patterns of genetic isolation and drift are consistent with the proposed key role of asymptomatic carriers of Typhi as the main reservoir of this pathogen, highlighting the need for identification and treatment of carriers.

Journal Article•DOI•

Genetic determinants of ulcerative colitis include the ECM1 locus and five loci implicated in Crohn's disease

Sheila A. Fisher¹, Mark Tremelling², Carl A. Anderson³, Rhian Gwilliam⁴, Suzannah Bumpstead⁴, Natalie J. Prescott¹, Elaine R. Nimmo⁵, Dunecan Massey², Carlo Berzuini², Chris Johnson², Jeffrey C. Barrett³, Fraser Cummings⁶, Hazel E. Drummond⁵, Charlie W. Lees⁵, Clive M. Onnie¹, Catherine Hanson⁷, Katarzyna Blaszczyk¹, Michael Inouye⁴, Philip Ewels⁴, Radhi Ravindrarajah⁴, Andrew Keniry⁴, Sarah E. Hunt⁴, M J Carter⁸, Nicholas A. Watkins², Willem H. Ouwehand², Cathryn M. Lewis¹, Lon R. Cardon³, Alan J Lobo⁸, Alastair Forbes⁹, Jeremy D. Sanderson¹⁰, Derek P. Jewell⁶, John C. Mansfield⁷, Panos Deloukas⁴, Christopher G. Mathew¹, Miles Parkes², Jack Satsangi⁵•Institutions (10)

King's College London¹, University of Cambridge², Wellcome Trust Centre for Human Genetics³, Wellcome Trust Sanger Institute⁴, Western General Hospital⁵, University of Oxford⁶, Newcastle University⁷, Royal Hallamshire Hospital⁸, University College London⁹, Guy's and St Thomas' NHS Foundation Trust¹⁰

01 Jun 2008-Nature Genetics

TL;DR: Results of a nonsynonymous SNP scan for ulcerative colitis and a previously unknown susceptibility locus at ECM1 are reported, providing the first detailed illustration of the genetic relationship between these common inflammatory bowel diseases.

Abstract: We report results of a nonsynonymous SNP scan for ulcerative colitis and identify a previously unknown susceptibility locus at ECM1. We also show that several risk loci are common to ulcerative colitis and Crohn's disease (IL23R, IL12B, HLA, NKX2-3 and MST1), whereas autophagy genes ATG16L1 and IRGM, along with NOD2 (also known as CARD15), are specific for Crohn's disease. These data provide the first detailed illustration of the genetic relationship between these common inflammatory bowel diseases.

