Showing papers by "Wellcome Trust Sanger Institute published in 2004"

PDF

Open Access

Journal Article•DOI•

[...]

Bino John¹, Anton J. Enright², Anton J. Enright¹, Alexei A. Aravin³, Thomas Tuschl³, Chris Sander¹, Debora S. Marks⁴ - Show less +3 more•Institutions (4)

Memorial Sloan Kettering Cancer Center¹, Wellcome Trust Sanger Institute², Rockefeller University³, Harvard University⁴

05 Oct 2004-PLOS Biology

TL;DR: This work has predicted target sites on the 3′ untranslated regions of human gene transcripts for all currently known 218 mammalian miRNAs to facilitate focused experiments and suggests that miRNA genes, which are about 1% of all human genes, regulate protein production for 10% or more of allhuman genes.

...read moreread less

Abstract: MicroRNAs (miRNAs) interact with target mRNAs at specific sites to induce cleavage of the message or inhibit translation. The specific function of most mammalian miRNAs is unknown. We have predicted target sites on the 3′ untranslated regions of human gene transcripts for all currently known 218 mammalian miRNAs to facilitate focused experiments. We report about 2,000 human genes with miRNA target sites conserved in mammals and about 250 human genes conserved as targets between mammals and fish. The prediction algorithm optimizes sequence complementarity using position-specific rules and relies on strict requirements of interspecies conservation. Experimental support for the validity of the method comes from known targets and from strong enrichment of predicted targets in mRNAs associated with the fragile X mental retardation protein in mammals. This is consistent with the hypothesis that miRNAs act as sequence-specific adaptors in the interaction of ribonuclear particles with translationally regulated messages. Overrepresented groups of targets include mRNAs coding for transcription factors, components of the miRNA machinery, and other proteins involved in translational regulation, as well as components of the ubiquitin machinery, representing novel feedback loops in gene regulation. Detailed information about target genes, target processes, and open-source software for target prediction (miRanda) is available at http://www.microrna.org. Our analysis suggests that miRNA genes, which are about 1% of all human genes, regulate protein production for 10% or more of all human genes.

...read moreread less

3,654 citations

Journal Article•DOI•

The Gene Ontology (GO) database and informatics resource.

[...]

Midori A. Harris, Jennifer I. Clark¹, Ireland A¹, Jane Lomax¹, Michael Ashburner¹, Michael Ashburner², R. Foulger¹, R. Foulger², Karen Eilbeck¹, Karen Eilbeck³, Suzanna E. Lewis³, Suzanna E. Lewis¹, B. Marshall¹, B. Marshall³, Christopher J. Mungall³, Christopher J. Mungall¹, J. Richter³, J. Richter¹, Gerald M. Rubin¹, Gerald M. Rubin³, Judith A. Blake¹, Carol J. Bult¹, Dolan M¹, Drabkin H¹, Janan T. Eppig¹, Hill Dp¹, L. Ni¹, Ringwald M¹, Rama Balakrishnan⁴, Rama Balakrishnan¹, J. M. Cherry¹, J. M. Cherry⁴, Karen R. Christie¹, Karen R. Christie⁴, Maria C. Costanzo⁴, Maria C. Costanzo¹, Selina S. Dwight⁴, Selina S. Dwight¹, Stacia R. Engel¹, Stacia R. Engel⁴, Dianna G. Fisk¹, Dianna G. Fisk⁴, Jodi E. Hirschman¹, Jodi E. Hirschman⁴, Eurie L. Hong⁴, Eurie L. Hong¹, Robert S. Nash¹, Robert S. Nash⁴, Anand Sethuraman¹, Anand Sethuraman⁴, Chandra L. Theesfeld¹, Chandra L. Theesfeld⁴, David Botstein⁵, David Botstein¹, Kara Dolinski¹, Kara Dolinski⁵, Becket Feierbach⁵, Becket Feierbach¹, Tanya Z. Berardini⁶, Tanya Z. Berardini¹, S. Mundodi⁶, S. Mundodi¹, Seung Y. Rhee⁶, Seung Y. Rhee¹, Rolf Apweiler¹, Daniel Barrell¹, Camon E¹, E. Dimmer¹, Lee¹, Rex L. Chisholm, Pascale Gaudet⁷, Pascale Gaudet¹, Warren A. Kibbe⁷, Warren A. Kibbe¹, Ranjana Kishore¹, Ranjana Kishore⁸, Erich M. Schwarz¹, Erich M. Schwarz⁸, Paul W. Sternberg¹, Paul W. Sternberg⁸, M. Gwinn¹, Hannick L¹, Wortman J¹, Matthew Berriman⁹, Matthew Berriman¹, Wood¹, Wood⁹, de la Cruz N¹⁰, de la Cruz N¹, Peter J. Tonellato¹⁰, Peter J. Tonellato¹, Pankaj Jaiswal¹, Pankaj Jaiswal¹¹, Seigfried T¹², Seigfried T¹, White R¹³, White R¹ - Show less +93 more•Institutions (13)

Wellcome Trust¹, University of Cambridge², University of California, Berkeley³, Stanford University⁴, Princeton University⁵, Carnegie Institution for Science⁶, Northwestern University⁷, California Institute of Technology⁸, Wellcome Trust Sanger Institute⁹, Medical College of Wisconsin¹⁰, Cornell University¹¹, Iowa State University¹², Incyte¹³

01 Jan 2004-Nucleic Acids Research

TL;DR: The Gene Ontology (GO) project as discussed by the authors provides structured, controlled vocabularies and classifications that cover several domains of molecular and cellular biology and are freely available for community use in the annotation of genes, gene products and sequences.

...read moreread less

Abstract: The Gene Ontology (GO) project (http://www.geneontology.org/) provides structured, controlled vocabularies and classifications that cover several domains of molecular and cellular biology and are freely available for community use in the annotation of genes, gene products and sequences. Many model organism databases and genome annotation groups use the GO and contribute their annotation sets to the GO resource. The GO database integrates the vocabularies and contributed annotations and provides full access to this information in several formats. Members of the GO Consortium continually work collectively, involving outside experts as needed, to expand and update the GO vocabularies. The GO Web resource also provides access to extensive documentation about the GO project and links to applications that use GO data for functional analyses.

...read moreread less

3,565 citations

Journal Article•DOI•

A census of human cancer genes

[...]

P. Andrew Futreal¹, Lachlan J. M. Coin¹, Mhairi Marshall¹, Thomas A. Down¹, Tim Hubbard¹, Richard Wooster¹, Nazneen Rahman, Michael R. Stratton¹ - Show less +4 more•Institutions (1)

Wellcome Trust Sanger Institute¹

01 Mar 2004-Nature Reviews Cancer

TL;DR: A 'census' of cancer genes is conducted that indicates that mutations in more than 1% of genes contribute to human cancer.

...read moreread less

Abstract: A central aim of cancer research has been to identify the mutated genes that are causally implicated in oncogenesis ('cancer genes'). After two decades of searching, how many have been identified and how do they compare to the complete gene set that has been revealed by the human genome sequence? We have conducted a 'census' of cancer genes that indicates that mutations in more than 1% of genes contribute to human cancer. The census illustrates striking features in the types of sequence alteration, cancer classes in which oncogenic mutations have been identified and protein domains that are encoded by cancer genes.

...read moreread less

3,136 citations

Journal Article•DOI•

Mechanism of Activation of the Raf-Erk Signaling Pathway by Oncogenic Mutations of B-Raf

[...]

Paul T C Wan, Mathew J. Garnett, S. Mark Roe, Sharlene Lee¹, Dan Niculescu-Duvaz¹, Valerie M. Good, Cancer Genome², C. Michael Jones¹, Christopher J. Marshall, Caroline J. Springer¹, David Barford, Richard Marais - Show less +8 more•Institutions (2)

Institute of Cancer Research¹, Wellcome Trust Sanger Institute²

19 Mar 2004-Cell

TL;DR: The high activity mutants signal to ERK by directly phosphorylating MEK, whereas the impaired activity mutants stimulate MEK by activating endogenous C-RAF, possibly via an allosteric or transphosphorylation mechanism.

...read moreread less

2,588 citations

Journal Article•DOI•

Sequence and comparative analysis of the chicken genome provide unique perspectives on vertebrate evolution

[...]

LaDeana W. Hillier¹, Webb Miller², Ewan Birney, Wesley C. Warren¹ +171 more•Institutions (39)

09 Dec 2004-Nature

TL;DR: A draft genome sequence of the red jungle fowl, Gallus gallus, provides a new perspective on vertebrate genome evolution, while also improving the annotation of mammalian genomes.

...read moreread less

Abstract: We present here a draft genome sequence of the red jungle fowl, Gallus gallus. Because the chicken is a modern descendant of the dinosaurs and the first non-mammalian amniote to have its genome sequenced, the draft sequence of its genome--composed of approximately one billion base pairs of sequence and an estimated 20,000-23,000 genes--provides a new perspective on vertebrate genome evolution, while also improving the annotation of mammalian genomes. For example, the evolutionary distance between chicken and human provides high specificity in detecting functional elements, both non-coding and coding. Notably, many conserved non-coding sequences are far from genes and cannot be assigned to defined functional classes. In coding regions the evolutionary dynamics of protein domains and orthologous groups illustrate processes that distinguish the lineages leading to birds and mammals. The distinctive properties of avian microchromosomes, together with the inferred patterns of conserved synteny, provide additional insights into vertebrate chromosome architecture.

...read moreread less

2,579 citations

Journal Article•DOI•

The microRNA Registry

[...]

Sam Griffiths-Jones¹•Institutions (1)

Wellcome Trust Sanger Institute¹

01 Jan 2004-Nucleic Acids Research

TL;DR: The miRNA Registry provides a service for the assignment of miRNA gene names prior to publication and a comprehensive and searchable database of published miRNA sequences is accessible via a web interface.

...read moreread less

Abstract: The miRNA Registry provides a service for the assignment of miRNA gene names prior to publication. A comprehensive and searchable database of published miRNA sequences is accessible via a web interface (http://www.sanger.ac.uk/Software/Rfam/mirna/), and all sequence and annotation data are freely available for download. Release 2.0 of the database contains 506 miRNA entries from six organisms.

...read moreread less

2,405 citations

Journal Article•DOI•

Gene finding in novel genomes

[...]

Ian F Korf¹•Institutions (1)

Wellcome Trust Sanger Institute¹

14 May 2004-BMC Bioinformatics

TL;DR: The SNAP gene finder is introduced which has been designed to be easily adaptable to a variety of genomes and finds that foreign gene finders are more usefully employed to bootstrap parameter estimation and that the resulting parameters can be highly accurate.

...read moreread less

Abstract: Background Computational gene prediction continues to be an important problem, especially for genomes with little experimental data.

...read moreread less

2,315 citations

Journal Article•DOI•

The ENCODE (ENCyclopedia of DNA elements) Project

[...]

Elise A. Feingold¹, Peter J. Good¹, Mark S. Guyer¹, S. Kamholz¹ +193 more•Institutions (19)

22 Oct 2004-Science

TL;DR: The ENCyclopedia Of DNA Elements (ENCODE) Project is organized as an international consortium of computational and laboratory-based scientists working to develop and apply high-throughput approaches for detecting all sequence elements that confer biological function.

...read moreread less

Abstract: The ENCyclopedia Of DNA Elements (ENCODE) Project aims to identify all functional elements in the human genome sequence. The pilot phase of the Project is focused on a specified 30 megabases (∼1%) of the human genome sequence and is organized as an international consortium of computational and laboratory-based scientists working to develop and apply high-throughput approaches for detecting all sequence elements that confer biological function. The results of this pilot phase will guide future efforts to analyze the entire human genome.

...read moreread less

2,248 citations

Journal Article•DOI•

Identification of Mammalian microRNA Host Genes and Transcription Units

[...]

Antony Rodriguez¹, Sam Griffiths-Jones, Jennifer L. Ashurst, Allan Bradley•Institutions (1)

Wellcome Trust Sanger Institute¹

01 Oct 2004-Genome Research

TL;DR: It is strongly suggested that miRNAs are transcribed in parallel with their host transcripts, and that the two different transcription classes of miRNAAs ('exonic' and 'intronic') identified here may require slightly different mechanisms of biogenesis.

...read moreread less

Abstract: To derive a global perspective on the transcription of microRNAs (miRNAs) in mammals, we annotated the genomic position and context of this class of noncoding RNAs (ncRNAs) in the human and mouse genomes. Of the 232 known mammalian miRNAs, we found that 161 overlap with 123 defined transcription units (TUs). We identified miRNAs within introns of 90 protein-coding genes with a broad spectrum of molecular functions, and in both introns and exons of 66 mRNA-like noncoding RNAs (mlncRNAs). In addition, novel families of miRNAs based on host gene identity were identified. The transcription patterns of all miRNA host genes were curated from a variety of sources illustrating spatial, temporal, and physiological regulation of miRNA expression. These findings strongly suggest that miRNAs are transcribed in parallel with their host transcripts, and that the two different transcription classes of miRNAs (`exonic' and `intronic') identified here may require slightly different mechanisms of biogenesis.

...read moreread less

2,043 citations

Journal Article•DOI•

Genome sequence of the Brown Norway rat yields insights into mammalian evolution

[...]

Richard A. Gibbs¹, George M. Weinstock¹, Michael L. Metzker¹, Donna M. Muzny¹ +239 more•Institutions (35)

01 Apr 2004-Nature

TL;DR: This first comprehensive analysis of the genome sequence of the Brown Norway (BN) rat strain is reported, which is the third complete mammalian genome to be deciphered, and three-way comparisons with the human and mouse genomes resolve details of mammalian evolution.

...read moreread less

Abstract: The laboratory rat (Rattus norvegicus) is an indispensable tool in experimental medicine and drug development, having made inestimable contributions to human health. We report here the genome sequence of the Brown Norway (BN) rat strain. The sequence represents a high-quality 'draft' covering over 90% of the genome. The BN rat sequence is the third complete mammalian genome to be deciphered, and three-way comparisons with the human and mouse genomes resolve details of mammalian evolution. This first comprehensive analysis includes genes and proteins and their relation to human disease, repeated sequences, comparative genome-wide studies of mammalian orthologous chromosomal regions and rearrangement breakpoints, reconstruction of ancestral karyotypes and the events leading to existing species, rates of variation, and lineage-specific and lineage-independent evolutionary events such as expansion of gene families, orthology relations and protein evolution.

...read moreread less

1,964 citations

Journal Article•DOI•

A Map of the Interactome Network of the Metazoan C. elegans

[...]

Siming Li¹, Christopher M. Armstrong¹, Nicolas Bertin¹, Hui Ge¹, Stuart Milstein¹, Mike Boxem¹, Pierre-Olivier Vidalain¹, Jing-Dong J. Han¹, Alban Chesneau, Tong Hao¹, Debra S. Goldberg¹, Ning Li¹, Monica Martinez¹, Jean François Rual¹, Philippe Lamesch¹, Lai Xu², Lai Xu¹, Muneesh Tewari¹, Sharyl L. Wong¹, Lan V. Zhang¹, Gabriel F. Berriz¹, Laurent Jacotot¹, Philippe Vaglio¹, Jérôme Reboul¹, Tomoko Hirozane-Kishikawa¹, Qian-Ru Li¹, Harrison W. Gabel¹, Ahmed Elewa³, Ahmed Elewa¹, Bridget L. Baumgartner², Debra J. Rose⁴, Haiyuan Yu⁵, Stephanie Bosak, Reynaldo Sequerra, Andrew G. Fraser⁶, Susan E. Mango⁷, William M. Saxton⁴, Susan Strome⁴, Sander van den Heuvel¹, Fabio Piano⁸, Jean Vandenhaute, Claude Sardet, Mark Gerstein⁵, Lynn Doucette-Stamm, Kristin C. Gunsalus⁸, J. Wade Harper², J. Wade Harper¹, Michael E. Cusick¹, Frederick P. Roth¹, David E. Hill¹, Marc Vidal¹ - Show less +47 more•Institutions (8)

Harvard University¹, Baylor College of Medicine², University of Massachusetts Medical School³, Indiana University⁴, Yale University⁵, Wellcome Trust Sanger Institute⁶, University of Utah⁷, New York University⁸

23 Jan 2004-Science

TL;DR: A large fraction of the Caenorhabditis elegans interactome network is mapped, starting with a subset of metazoan-specific proteins, and more than 4000 interactions were identified from high-throughput, yeast two-hybrid screens.

...read moreread less

Abstract: To initiate studies on how protein-protein interaction (or "interactome") networks relate to multicellular functions, we have mapped a large fraction of the Caenorhabditis elegans interactome network. Starting with a subset of metazoan-specific proteins, more than 4000 interactions were identified from high-throughput, yeast two-hybrid (HT=Y2H) screens. Independent coaffinity purification assays experimentally validated the overall quality of this Y2H data set. Together with already described Y2H interactions and interologs predicted in silico, the current version of the Worm Interactome (WI5) map contains approximately 5500 interactions. Topological and biological features of this interactome network, as well as its integration with phenome and transcriptome data sets, lead to numerous biological hypotheses.

...read moreread less

Journal Article•DOI•

The Jalview Java alignment editor

[...]

Michele Clamp¹, James Cuff¹, Stephen M. J. Searle¹, Geoffrey J. Barton²•Institutions (2)

Wellcome Trust Sanger Institute¹, European Bioinformatics Institute²

12 Feb 2004-Bioinformatics

TL;DR: The Jalview Java alignment editor is presented here, which enables fast viewing and editing of large multiple sequence alignments.

...read moreread less

Abstract: Summary: Multiple sequence alignment remains a crucial method for understanding the function of groups of related nucleic acid and protein sequences. However, it is known that automatic multiple sequence alignments can often be improved by manual editing. Therefore, tools are needed to view and edit multiple sequence alignments. Due to growth in the sequence databases, multiple sequence alignments can often be large and difficult to view efficiently. The Jalview Java alignment editor is presented here, which enables fast viewing and editing of large multiple sequence alignments. Availability: The Jar file and source code for Jalview is freely available via the World Wide Web at http://www.jalview.org. A Jalview mailing list is also available by e-mailing majordomo@sanger.ac.uk with subscribe Jalview in the body of the mail.

...read moreread less

Journal Article•DOI•

Rfam: annotating non-coding RNAs in complete genomes

[...]

Sam Griffiths-Jones¹, Simon Moxon, Mhairi Marshall, Ajay Khanna, Sean R. Eddy, Alex Bateman - Show less +2 more•Institutions (1)

Wellcome Trust Sanger Institute¹

17 Dec 2004-Nucleic Acids Research

TL;DR: The Rfam database aims to facilitate the identification and classification of new members of known sequence families, and distributes annotation of ncRNAs in over 200 complete genome sequences.

...read moreread less

Abstract: Rfam is a comprehensive collection of non-coding RNA (ncRNA) families, represented by multiple sequence alignments and profile stochastic context-free grammars. Rfam aims to facilitate the identification and classification of new members of known sequence families, and distributes annotation of ncRNAs in over 200 complete genome sequences. The data provide the first glimpses of conservation of multiple ncRNA families across a wide taxonomic range. A small number of large families are essential in all three kingdoms of life, with large numbers of smaller families specific to certain taxa. Recent improvements in the database are discussed, together with challenges for the future. Rfam is available on the Web at http://www.sanger.ac.uk/Software/Rfam/ and http://rfam.wustl.edu/.

...read moreread less

Journal Article•DOI•

The COSMIC (Catalogue of Somatic Mutations in Cancer) database and website

[...]

Sally Bamford¹, Elisabeth Dawson², Simon A. Forbes², Jody Clements², Roger Pettett², Ahmet Dogan³, Adrienne M. Flanagan³, Jon W. Teague², P A Futreal, Michael R. Stratton², Richard Wooster² - Show less +7 more•Institutions (3)

Wellcome Trust Sanger Institute¹, Wellcome Trust², University College London³

13 Jul 2004-British Journal of Cancer

TL;DR: The COSMIC (Catalogue of Somatic Mutations in Cancer) database and website have been developed to store somatic mutation data in a single location and display the data and other information related to human cancer.

...read moreread less

Abstract: The discovery of mutations in cancer genes has advanced our understanding of cancer. These results are dispersed across the scientific literature and with the availability of the human genome sequence will continue to accrue. The COSMIC (Catalogue of Somatic Mutations in Cancer) database and website have been developed to store somatic mutation data in a single location and display the data and other information related to human cancer. To populate this resource, data has currently been extracted from reports in the scientific literature for somatic mutations in four genes, BRAF, HRAS, KRAS2 and NRAS. At present, the database holds information on 66 634 samples and reports a total of 10 647 mutations. Through the web pages, these data can be queried, displayed as figures or tables and exported in a number of formats. COSMIC is an ongoing project that will continue to curate somatic mutation data and release it through the website.

...read moreread less

Journal Article•DOI•

Gene map of the extended human MHC

[...]

Roger Horton¹, Laurens G. Wilming¹, Vikki Rand¹, Ruth C. Lovering², Elspeth A. Bruford², Varsha K. Khodiyar², Michael J. Lush², Sue Povey², C. Conover Talbot³, Mathew W. Wright², H Wain², John Trowsdale⁴, Andreas Ziegler⁵, Stephan Beck¹ - Show less +10 more•Institutions (5)

Wellcome Trust Sanger Institute¹, University College London², Johns Hopkins University³, University of Cambridge⁴, Humboldt University of Berlin⁵

01 Dec 2004-Nature Reviews Genetics

TL;DR: A gene map of the xMHC is presented and its content in relation to paralogy, polymorphism, immune function and disease is reviewed.

...read moreread less

Abstract: The major histocompatibility complex (MHC) is the most important region in the vertebrate genome with respect to infection and autoimmunity, and is crucial in adaptive and innate immunity. Decades of biomedical research have revealed many MHC genes that are duplicated, polymorphic and associated with more diseases than any other region of the human genome. The recent completion of several large-scale studies offers the opportunity to assimilate the latest data into an integrated gene map of the extended human MHC. Here, we present this map and review its content in relation to paralogy, polymorphism, immune function and disease.

...read moreread less

Journal Article•DOI•

The fine-scale structure of recombination rate variation in the human genome.

[...]

Gilean McVean¹, Simon Myers¹, Sarah E. Hunt², Panos Deloukas², David R. Bentley², Peter Donnelly¹ - Show less +2 more•Institutions (2)

University of Oxford¹, Wellcome Trust Sanger Institute²

23 Apr 2004-Science

TL;DR: It is demonstrated that recombination hotspots are a ubiquitous feature of the human genome, occurring on average every 200 kilobases or less, but recombination occurs preferentially outside genes.

...read moreread less

Abstract: The nature and scale of recombination rate variation are largely unknown for most species. In humans, pedigree analysis has documented variation at the chromosomal level, and sperm studies have identified specific hotspots in which crossing-over events cluster. To address whether this picture is representative of the genome as a whole, we have developed and validated a method for estimating recombination rates from patterns of genetic variation. From extensive single-nucleotide polymorphism surveys in European and African populations, we find evidence for extreme local rate variation spanning four orders in magnitude, in which 50% of all recombination events take place in less than 10% of the sequence. We demonstrate that recombination hotspots are a ubiquitous feature of the human genome, occurring on average every 200 kilobases or less, but recombination occurs preferentially outside genes.

...read moreread less

Journal Article•DOI•

Complete genomes of two clinical Staphylococcus aureus strains: Evidence for the rapid evolution of virulence and drug resistance

[...]

Matthew T. G. Holden¹, Edward J. Feil, Jodi A. Lindsay², Sharon J. Peacock³, Nicholas P. J. Day³, Mark C. Enright⁴, Timothy J. Foster⁵, Catrin E. Moore, Laurence D. Hurst, Rebecca Atkin¹, Andrew Barron¹, Nathalie Bason¹, Stephen D. Bentley¹, Carol Chillingworth¹, Tracey Chillingworth¹, Carol Churcher¹, Louise Clark¹, Craig Corton¹, Ann Cronin¹, Jon Doggett¹, Linda Dowd¹, Theresa Feltwell¹, Zahra Hance¹, Barbara Harris¹, Heidi Hauser¹, S. Holroyd¹, Kay Jagels¹, Keith D. James¹, Nicola Lennard¹, Alexandra Line¹, Rebecca Mayes¹, Sharon Moule¹, Karen Mungall¹, Douglas Ormond¹, Michael A. Quail¹, Ester Rabbinowitsch¹, Kim Rutherford¹, Mandy Sanders¹, Sarah Sharp¹, Mark Simmonds¹, K. Stevens¹, Sally Whitehead¹, Bart Barrell¹, Brian G. Spratt⁶, Julian Parkhill¹ - Show less +41 more•Institutions (6)

Wellcome Trust Sanger Institute¹, St George's, University of London², Mahidol University³, University of Bath⁴, Trinity College, Dublin⁵, St Mary's Hospital⁶

29 Jun 2004-Proceedings of the National Academy of Sciences of the United States of America

TL;DR: The crucial role that accessory elements play in the rapid evolution of S. aureus is clearly illustrated by comparing the MSSA476 genome with that of an extremely closely related MRSA community-acquired strain; the differential distribution of large mobile elements carrying virulence and drug-resistance determinants may be responsible for the clinically important phenotypic differences in these strains.

...read moreread less

Abstract: Staphylococcus aureus is an important nosocomial and community-acquired pathogen. Its genetic plasticity has facilitated the evolution of many virulent and drug-resistant strains, presenting a major and constantly changing clinical challenge. We sequenced the ≈2.8-Mbp genomes of two disease-causing S. aureus strains isolated from distinct clinical settings: a recent hospital-acquired representative of the epidemic methicillin-resistant S. aureus EMRSA-16 clone (MRSA252), a clinically important and globally prevalent lineage; and a representative of an invasive community-acquired methicillin-susceptible S. aureus clone (MSSA476). A comparative-genomics approach was used to explore the mechanisms of evolution of clinically important S. aureus genomes and to identify regions affecting virulence and drug resistance. The genome sequences of MRSA252 and MSSA476 have a well conserved core region but differ markedly in their accessory genetic elements. MRSA252 is the most genetically diverse S. aureus strain sequenced to date: ≈6% of the genome is novel compared with other published genomes, and it contains several unique genetic elements. MSSA476 is methicillin-susceptible, but it contains a novel Staphylococcal chromosomal cassette (SCC) mec-like element (designated SCC476), which is integrated at the same site on the chromosome as SCCmec elements in MRSA strains but encodes a putative fusidic acid resistance protein. The crucial role that accessory elements play in the rapid evolution of S. aureus is clearly illustrated by comparing the MSSA476 genome with that of an extremely closely related MRSA community-acquired strain; the differential distribution of large mobile elements carrying virulence and drug-resistance determinants may be responsible for the clinically important phenotypic differences in these strains.

...read moreread less

Journal Article•DOI•

Lung cancer: intragenic ERBB2 kinase mutations in tumours.

[...]

Philip J. Stephens¹, Christopher I. Hunter¹, Graham R. Bignell¹, Sarah Edkins¹, Helen Davies¹, Jon W. Teague¹, Claire Stevens¹, Sarah O’Meara¹, Raffaella Smith¹, Adrian Parker¹, Andy Barthorpe¹, Matthew J. Blow¹, Lisa Brackenbury¹, Adam Butler¹, Oliver Clarke¹, Jennifer Cole¹, Ed Dicks¹, Angus Dike¹, Anja Drozd¹, Ken Edwards¹, Simon A. Forbes¹, Rebecca Foster¹, Kristian Gray¹, Christopher Greenman¹, Kelly Halliday¹, Katy Hills¹, Vivienne Kosmidou¹, Richard Lugg¹, Andy Menzies¹, Janet Perry¹, Robert Petty¹, Keiran Raine¹, Lewis Ratford¹, Rebecca Shepherd¹, Alexandra Small¹, Yvonne Stephens¹, Calli Tofts¹, Jennifer Varian¹, Sofie West¹, Sara Widaa¹, Andrew D. Yates¹, Francis Brasseur², Colin Cooper³, Adrienne M. Flanagan⁴, Margaret A. Knowles⁵, Suet Yi Leung⁶, David N. Louis⁷, Leendert H. J. Looijenga⁸, Bruce Malkowicz⁹, Marco A. Pierotti, Bin Teh¹⁰, Georgia Chenevix-Trench¹¹, Barbara L. Weber⁹, Siu Tsan Yuen⁶, Grace Harris, Peter Goldstraw, Andrew G. Nicholson, P. Andrew Futreal¹, Richard Wooster¹, Michael R. Stratton¹, Michael R. Stratton³ - Show less +57 more•Institutions (11)

Wellcome Trust Sanger Institute¹, Ludwig Institute for Cancer Research², Institute of Cancer Research³, University College London⁴, St James's University Hospital⁵, University of Hong Kong⁶, Harvard University⁷, Erasmus University Rotterdam⁸, University of Pennsylvania⁹, Van Andel Institute¹⁰, QIMR Berghofer Medical Research Institute¹¹

30 Sep 2004-Nature

TL;DR: The protein-kinase family is the most frequently mutated gene family found in human cancer and faulty kinase enzymes are being investigated as promising targets for the design of antitumour therapies as mentioned in this paper.

...read moreread less

Abstract: The protein-kinase family is the most frequently mutated gene family found in human cancer and faulty kinase enzymes are being investigated as promising targets for the design of antitumour therapies. We have sequenced the gene encoding the transmembrane protein tyrosine kinase ERBB2 (also known as HER2 or Neu) from 120 primary lung tumours and identified 4% that have mutations within the kinase domain; in the adenocarcinoma subtype of lung cancer, 10% of cases had mutations. ERBB2 inhibitors, which have so far proved to be ineffective in treating lung cancer, should now be clinically re-evaluated in the specific subset of patients with lung cancer whose tumours carry ERBB2 mutations.

...read moreread less

Journal Article•DOI•

Genomic plasticity of the causative agent of melioidosis, Burkholderia pseudomallei

[...]

Matthew T. G. Holden¹, Richard W. Titball², Richard W. Titball³, Sharon J. Peacock⁴, Sharon J. Peacock⁵, Ana Cerdeño-Tárraga¹, Timothy P. Atkins³, Lisa Crossman¹, Tyrone Pitt, Carol Churcher¹, Karen Mungall¹, Stephen D. Bentley¹, Mohammed Sebaihia¹, Nicholas R. Thomson¹, Nathalie Bason¹, Ifor R. Beacham⁶, Karen Brooks¹, Katherine A. Brown⁷, Nat F. Brown⁶, Greg L. Challis⁸, Inna Cherevach¹, Tracy Chillingworth¹, Ann Cronin¹, Ben Crossett⁷, Paul Davis¹, David DeShazer⁹, Theresa Feltwell¹, Audrey Fraser¹, Zahra Hance¹, Heidi Hauser¹, S. Holroyd¹, Kay Jagels¹, Karen E. Keith⁷, Mark Maddison¹, Sharon Moule¹, Claire Price¹, Michael A. Quail¹, Ester Rabbinowitsch¹, Kim Rutherford¹, Mandy Sanders¹, Mark Simmonds¹, Sirirurg Songsivilai⁵, K. Stevens¹, Sarinna Tumapa⁵, Monkgol Vesaratchavest⁵, Sally Whitehead¹, Corin Yeats¹, Bart Barrell¹, Petra C. F. Oyston³, Julian Parkhill¹ - Show less +46 more•Institutions (9)

Wellcome Trust Sanger Institute¹, University of London², Defence Science and Technology Laboratory³, University of Oxford⁴, Mahidol University⁵, Griffith University⁶, Imperial College London⁷, University of Warwick⁸, United States Army Medical Research Institute of Infectious Diseases⁹

28 Sep 2004-Proceedings of the National Academy of Sciences of the United States of America

TL;DR: It is proposed that variable horizontal gene acquisition by B. pseudomallei is an important feature of recent genetic evolution and that this has resulted in a genetically diverse pathogenic species.

...read moreread less

Abstract: Burkholderia pseudomallei is a recognized biothreat agent and the causative agent of melioidosis. This Gram-negative bacterium exists as a soil saprophyte in melioidosis-endemic areas of the world and accounts for 20% of community-acquired septicaemias in northeastern Thailand where half of those affected die. Here we report the complete genome of B. pseudomallei, which is composed of two chromosomes of 4.07 megabase pairs and 3.17 megabase pairs, showing significant functional partitioning of genes between them. The large chromosome encodes many of the core functions associated with central metabolism and cell growth, whereas the small chromosome carries more accessory functions associated with adaptation and survival in different niches. Genomic comparisons with closely and more distantly related bacteria revealed a greater level of gene order conservation and a greater number of orthologous genes on the large chromosome, suggesting that the two replicons have distinct evolutionary origins. A striking feature of the genome was the presence of 16 genomic islands (GIs) that together made up 6.1% of the genome. Further analysis revealed these islands to be variably present in a collection of invasive and soil isolates but entirely absent from the clonally related organism B. mallei. We propose that variable horizontal gene acquisition by B. pseudomallei is an important feature of recent genetic evolution and that this has resulted in a genetically diverse pathogenic species.

...read moreread less

Journal Article•DOI•

The knockout mouse project

[...]

Christopher P. Austin¹, James F. Battey¹, Allan Bradley², Maja Bucan³, Mario R. Capecchi⁴, Francis S. Collins¹, William F. Dove⁵, Geoffrey M. Duyk, Susan M. Dymecki⁶, J.T. Eppig, Franziska B. Grieder¹, Nathaniel Heintz⁷, Geoff Hicks⁸, Thomas R. Insel¹, Alexandra L. Joyner⁹, Beverly H. Koller¹⁰, Kevin C K Lloyd¹¹, Terry Magnuson¹⁰, Mark W. Moore, Andras Nagy¹², Jonathan D. Pollock¹, Allen D. Roses¹³, Arthur T. Sands, Brian Seed⁶, William C. Skarnes², Jay Snoddy¹⁴, Philippe Soriano¹⁵, D. Stewart¹⁶, Francis Stewart¹⁷, Bruce Stillman¹⁶, Harold E. Varmus¹⁸, Lyuba Varticovski¹, Inder M. Verma¹⁹, Thomas F. Vogt²⁰, Harald von Melchner²¹, Jan A. Witkowski¹⁶, Richard P. Woychik, Wolfgang Wurst²², George D. Yancopoulos²³, Stephen G. Young²⁴, Brian Zambrowicz - Show less +37 more•Institutions (24)

01 Sep 2004-Nature Genetics

TL;DR: It is time to harness new technologies and efficiencies of production to mount a high-throughput international effort to produce and phenotype knockouts for all mouse genes, and place these resources into the public domain.

...read moreread less

Abstract: Mouse knockout technology provides a powerful means of elucidating gene function in vivo, and a publicly available genome-wide collection of mouse knockouts would be significantly enabling for biomedical discovery. To date, published knockouts exist for only about 10% of mouse genes. Furthermore, many of these are limited in utility because they have not been made or phenotyped in standardized ways, and many are not freely available to researchers. It is time to harness new technologies and efficiencies of production to mount a high-throughput international effort to produce and phenotype knockouts for all mouse genes, and place these resources into the public domain.

...read moreread less

Journal Article•DOI•

Chromosome 21 and down syndrome: from genomics to pathophysiology.

[...]

Stylianos E. Antonarakis¹, Robert Lyle¹, Emmanouil T. Dermitzakis², Alexandre Reymond¹, Samuel Deutsch¹ - Show less +1 more•Institutions (2)

University of Geneva¹, Wellcome Trust Sanger Institute²

01 Oct 2004-Nature Reviews Genetics

TL;DR: Comparative genomics is beginning to identify the functional components of the chromosome and that in turn will set the stage for the functional characterization of the sequences.

...read moreread less

Abstract: The sequence of chromosome 21 was a turning point for the understanding of Down syndrome. Comparative genomics is beginning to identify the functional components of the chromosome and that in turn will set the stage for the functional characterization of the sequences. Animal models combined with genome-wide analytical methods have proved indispensable for unravelling the mysteries of gene dosage imbalance.

...read moreread less

Journal Article•DOI•

Evolutionary families of peptidase inhibitors.

[...]

Neil D. Rawlings¹, Dominic P. Tolle¹, Alan J. Barrett¹•Institutions (1)

Wellcome Trust Sanger Institute¹

15 Mar 2004-Biochemical Journal

TL;DR: A system wherein the inhibitor units of the peptidase inhibitors are assigned to 48 families on the basis of similarities detectable at the level of amino acid sequence, and a simple system of nomenclature is introduced for reference to each clan, family and inhibitor.

...read moreread less

Abstract: The proteins that inhibit peptidases are of great importance in medicine and biotechnology, but there has never been a comprehensive system of classification for them. Some of the terminology currently in use is potentially confusing. In the hope of facilitating the exchange, storage and retrieval of information about this important group of proteins, we now describe a system wherein the inhibitor units of the peptidase inhibitors are assigned to 48 families on the basis of similarities detectable at the level of amino acid sequence. Then, on the basis of three-dimensional structures, 31 of the families are assigned to 26 clans. A simple system of nomenclature is introduced for reference to each clan, family and inhibitor. We briefly discuss the specificities and mechanisms of the interactions of the inhibitors in the various families with their target enzymes. The system of families and clans of inhibitors described has been implemented in the MEROPS peptidase database (http://merops.sanger.ac.uk/), and this will provide a mechanism for updating it as new information becomes available.

...read moreread less

Journal Article•DOI•

Methylation of histone H4 lysine 20 controls recruitment of Crb2 to sites of DNA damage.

[...]

Steven L. Sanders¹, Manuela Portoso², Juan Mata³, Jürg Bähler³, Robin C. Allshire², Tony Kouzarides¹ - Show less +2 more•Institutions (3)

Wellcome Trust/Cancer Research UK Gurdon Institute¹, University of Edinburgh², Wellcome Trust Sanger Institute³

24 Nov 2004-Cell

TL;DR: It is argued that H4-K20 methylation functions as a "histone mark" required for the recruitment of the checkpoint protein Crb2, a homolog of the mammalian checkpoint protein 53BP1.

...read moreread less

Journal Article•DOI•

Interaction between differentially methylated regions partitions the imprinted genes Igf2 and H19 into parent-specific chromatin loops

[...]

Adele Murrell¹, Adele Murrell², Sarah Heeson¹, Sarah Heeson³, Wolf Reik¹ - Show less +1 more•Institutions (3)

Babraham Institute¹, Wellcome Trust Sanger Institute², Hoffmann-La Roche³

01 Aug 2004-Nature Genetics

TL;DR: A GAL4 knock-in approach as well as the chromosome conformation capture technique are used to show that the differentially methylated regions in the imprinted genes Igf2 and H19 interact in mice and partition maternal and paternal chromatin into distinct loops.

...read moreread less

Abstract: Imprinted genes are expressed from only one of the parental alleles and are marked epigenetically by DNA methylation and histone modifications. The paternally expressed gene insulin-like growth-factor 2 (Igf2) is separated by approximately 100 kb from the maternally expressed noncoding gene H19 on mouse distal chromosome 7. Differentially methylated regions in Igf2 and H19 contain chromatin boundaries, silencers and activators and regulate the reciprocal expression of the two genes in a methylation-sensitive manner by allowing them exclusive access to a shared set of enhancers. Various chromatin models have been proposed that separate Igf2 and H19 into active and silent domains. Here we used a GAL4 knock-in approach as well as the chromosome conformation capture technique to show that the differentially methylated regions in the imprinted genes Igf2 and H19 interact in mice. These interactions are epigenetically regulated and partition maternal and paternal chromatin into distinct loops. This generates a simple epigenetic switch for Igf2 through which it moves between an active and a silent chromatin domain.

...read moreread less

Journal Article•DOI•

A family with severe insulin resistance and diabetes due to a mutation in AKT2.

[...]

Stella George¹, Justin J. Rochford¹, Christian Wolfrum², Sarah L. Gray¹, S Schinner¹, Jenny C Wilson¹, Maria A. Soos¹, Peter R. Murgatroyd¹, Rachel M. Williams¹, Carlo L. Acerini¹, David B. Dunger¹, David Barford³, A. Margot Umpleby, Nicholas J. Wareham⁴, Huw Alban Davies⁵, Alan J. Schafer⁶, Markus Stoffel², Stephen O'Rahilly¹, Inês Barroso⁶, Inês Barroso⁷ - Show less +16 more•Institutions (7)

University of Cambridge¹, Rockefeller University², Institute of Cancer Research³, Medical Research Council⁴, Valley Hospital⁵, Incyte⁶, Wellcome Trust Sanger Institute⁷

28 May 2004-Science

TL;DR: A mutation in the gene encoding the protein kinase AKT2/PKBβ in a family that shows autosomal dominant inheritance of severe insulin resistance and diabetes mellitus is described, demonstrating the central importance of AKT signaling to insulin sensitivity in humans.

...read moreread less

Abstract: Inherited defects in signaling pathways downstream of the insulin receptor have long been suggested to contribute to human type 2 diabetes mellitus. Here we describe a mutation in the gene encoding the protein kinase AKT2/PKBbeta in a family that shows autosomal dominant inheritance of severe insulin resistance and diabetes mellitus. Expression of the mutant kinase in cultured cells disrupted insulin signaling to metabolic end points and inhibited the function of coexpressed, wild-type AKT. These findings demonstrate the central importance of AKT signaling to insulin sensitivity in humans.

...read moreread less

Journal Article•DOI•

Chromatin Architecture of the Human Genome: Gene-Rich Domains Are Enriched in Open Chromatin Fibers

[...]

Nick Gilbert, Shelagh Boyle, Heike Fiegler¹, Kathryn Woodfine¹, Nigel P. Carter¹, Wendy A. Bickmore - Show less +2 more•Institutions (1)

Wellcome Trust Sanger Institute¹

03 Sep 2004-Cell

TL;DR: It is suggested that domains of open chromatin may create an environment that facilitates transcriptional activation and could provide an evolutionary constraint to maintain clusters of genes together along chromosomes.

...read moreread less

Journal Article•DOI•

Periodic gene expression program of the fission yeast cell cycle

[...]

Gabriella Rustici¹, Juan Mata¹, Katja Kivinen², Pietro Liò², Christopher J. Penkett¹, Gavin Burns¹, Jacqueline Hayles³, Alvis Brazma², Paul Nurse⁴, Paul Nurse³, Jürg Bähler¹ - Show less +7 more•Institutions (4)

Wellcome Trust Sanger Institute¹, European Bioinformatics Institute², London Research Institute³, Rockefeller University⁴

13 Jun 2004-Nature Genetics

TL;DR: The genome-wide transcriptional program of the Schizosaccharomyces pombe cell cycle was studied in this paper, identifying 407 periodically expressed genes of which 136 show high-amplitude changes.

...read moreread less

Abstract: Cell-cycle control of transcription seems to be universal, but little is known about its global conservation and biological significance. We report on the genome-wide transcriptional program of the Schizosaccharomyces pombe cell cycle, identifying 407 periodically expressed genes of which 136 show high-amplitude changes. These genes cluster in four major waves of expression. The forkhead protein Sep1p regulates mitotic genes in the first cluster, including Ace2p, which activates transcription in the second cluster during the M-G1 transition and cytokinesis. Other genes in the second cluster, which are required for G1-S progression, are regulated by the MBF complex independently of Sep1p and Ace2p. The third cluster coincides with S phase and a fourth cluster contains genes weakly regulated during G2 phase. Despite conserved cell-cycle transcription factors, differences in regulatory circuits between fission and budding yeasts are evident, revealing evolutionary plasticity of transcriptional control. Periodic transcription of most genes is not conserved between the two yeasts, except for a core set of approximately 40 genes that seem to be universally regulated during the eukaryotic cell cycle and may have key roles in cell-cycle progression.

...read moreread less

Journal Article•DOI•

Microevolution and history of the plague bacillus, Yersinia pestis

[...]

Mark Achtman¹, Giovanna Morelli¹, Peixuan Zhu¹, Thierry Wirth¹, Ines Diehl¹, Barica Kusecek¹, Amy J. Vogler², David M. Wagner, Christopher J. Allender², W. Ryan Easterday², Viviane Chenal-Francisque³, Patricia L. Worsham⁴, Nicholas R. Thomson⁵, Julian Parkhill⁵, Luther E. Lindler⁶, Elisabeth Carniel³, Paul Keim - Show less +13 more•Institutions (6)

Max Planck Society¹, Northern Arizona University², Pasteur Institute³, United States Army Medical Research Institute of Infectious Diseases⁴, Wellcome Trust Sanger Institute⁵, Walter Reed Army Institute of Research⁶

21 Dec 2004-Proceedings of the National Academy of Sciences of the United States of America

TL;DR: An evolutionary tree is proposed for these populations, rooted on Yersinia pseudotuberculosis, which invokes microevolution over millennia, during which enzootic pestoides isolates evolved and led to populations that are more frequently associated with human disease.

...read moreread less

Abstract: The association of historical plague pandemics with Yersinia pestis remains controversial, partly because the evolutionary history of this largely monomorphic bacterium was unknown. The microevolution of Y. pestis was therefore investigated by three different multilocus molecular methods, targeting genomewide synonymous SNPs, variation in number of tandem repeats, and insertion of IS100 insertion elements. Eight populations were recognized by the three methods, and we propose an evolutionary tree for these populations, rooted on Yersinia pseudotuberculosis. The tree invokes microevolution over millennia, during which enzootic pestoides isolates evolved. This initial phase was followed by a binary split 6,500 years ago, which led to populations that are more frequently associated with human disease. These populations do not correspond directly to classical biovars that are based on phenotypic properties. Thus, we recommend that henceforth groupings should be based on molecular signatures. The age of Y. pestis inferred here is compatible with the dates of historical pandemic plague. However, it is premature to infer an association between any modern molecular grouping and a particular pandemic wave that occurred before the 20th century.

...read moreread less

Journal Article•DOI•

Staphylococcus aureus: superbug, super genome?

[...]

Jodi A. Lindsay¹, Matthew T. G. Holden²•Institutions (2)

St George's Hospital¹, Wellcome Trust Sanger Institute²

01 Aug 2004-Trends in Microbiology

TL;DR: The recent sequencing of seven strains of S. aureus provides unprecedented information about its genome diversity, and dramatic differences in the carriage and spread of accessory genes, including those involved in virulence and resistance, contribute to the emergence of new strains with healthcare implications.

...read moreread less

Journal Article•DOI•

High-resolution analysis of DNA copy number using oligonucleotide microarrays

[...]

Graham R. Bignell¹, Jing Huang², Joel Greshock³, Stephen Watt¹, Adam Butler¹, Sofie West¹, Mira Grigorova⁴, Keith W. Jones², Wen Wei², Michael R. Stratton¹, P. Andrew Futreal, Barbara L. Weber³, Michael H. Shapero², Richard Wooster¹ - Show less +10 more•Institutions (4)

Wellcome Trust Sanger Institute¹, Thermo Fisher Scientific², University of Pennsylvania³, University of Cambridge⁴

01 Feb 2004-Genome Research

TL;DR: The studies demonstrate that combining the genotype and copy number analyses gives greater insight into the underlying genetic alterations in cancer cells with identification of complex events including loss and reduplication of loci.

...read moreread less

Abstract: Genomic copy number alterations are a feature of many human diseases including cancer. We have evaluated the effectiveness of an oligonucleotide array, originally designed to detect single-nucleotide polymorphisms, to assess DNA copy number. We first showed that fluorescent signal from the oligonucleotide array varies in proportion to both decreases and increases in copy number. Subsequently we applied the system to a series of 20 cancer cell lines. All of the putative homozygous deletions (10) and high-level amplifications (12; putative copy number >4) tested were confirmed by PCR (either qPCR or normal PCR) analysis. Low-level copy number changes for two of the lines under analysis were compared with BAC array CGH; 77% (n = 44) of the autosomal chromosomes used in the comparison showed consistent patterns of LOH (loss of heterozygosity) and low-level amplification. Of the remaining 10 comparisons that were discordant, eight were caused by low SNP densities and failed in both lines. The studies demonstrate that combining the genotype and copy number analyses gives greater insight into the underlying genetic alterations in cancer cells with identification of complex events including loss and reduplication of loci.

...read moreread less

Collapse