scispace - formally typeset
Search or ask a question
Author

Kathy Seeger

Bio: Kathy Seeger is an academic researcher from Wellcome Trust Sanger Institute. The author has contributed to research in topics: Genome & Gene. The author has an hindex of 19, co-authored 23 publications receiving 10352 citations.

Papers
More filters
Journal ArticleDOI
09 May 2002-Nature
TL;DR: The 8,667,507 base pair linear chromosome of Streptomyces coelicolor is reported, containing the largest number of genes so far discovered in a bacterium.
Abstract: Streptomyces coelicolor is a representative of the group of soil-dwelling, filamentous bacteria responsible for producing most natural antibiotics used in human and veterinary medicine. Here we report the 8,667,507 base pair linear chromosome of this organism, containing the largest number of genes so far discovered in a bacterium. The 7,825 predicted genes include more than 20 clusters coding for known or predicted secondary metabolites. The genome contains an unprecedented proportion of regulatory genes, predominantly those likely to be involved in responses to external stimuli and stresses, and many duplicated gene sets that may represent 'tissue-specific' isoforms operating in different phases of colonial development, a unique situation for a bacterium. An ancient synteny was revealed between the central 'core' of the chromosome and the whole chromosome of pathogens Mycobacterium tuberculosis and Corynebacterium diphtheriae. The genome sequence will greatly increase our understanding of microbial life in the soil as well as aiding the generation of new drug candidates by genetic engineering.

3,077 citations

Journal ArticleDOI
Valerie Wood1, R. Gwilliam1, Marie-Adèle Rajandream1, M. Lyne1, Rachel Lyne1, A. Stewart2, J. Sgouros2, N. Peat2, Jacqueline Hayles2, Stephen Baker1, D. Basham1, Sharen Bowman1, Karen Brooks1, D. Brown1, Steve D.M. Brown1, Tracey Chillingworth1, Carol Churcher1, Mark O. Collins1, R. Connor1, Ann Cronin1, P. Davis1, Theresa Feltwell1, Andrew G. Fraser1, S. Gentles1, Arlette Goble1, N. Hamlin1, David Harris1, J. Hidalgo1, Geoffrey M. Hodgson1, S. Holroyd1, T. Hornsby1, S. Howarth1, Elizabeth J. Huckle1, Sarah E. Hunt1, Kay Jagels1, Kylie R. James1, L. Jones1, Matthew Jones1, S. Leather1, S. McDonald1, J. McLean1, P. Mooney1, Sharon Moule1, Karen Mungall1, Lee Murphy1, D. Niblett1, C. Odell1, Karen Oliver1, Susan O'Neil1, D. Pearson1, Michael A. Quail1, Ester Rabbinowitsch1, Kim Rutherford1, Simon Rutter1, David L. Saunders1, Kathy Seeger1, Sarah Sharp1, Jason Skelton1, Mark Simmonds1, R. Squares1, S. Squares1, K. Stevens1, K. Taylor1, Ruth Taylor1, Adrian Tivey1, S. Walsh1, T. Warren1, S. Whitehead1, John Woodward1, Guido Volckaert3, Rita Aert3, Johan Robben3, B. Grymonprez3, I. Weltjens3, E. Vanstreels3, Michael A. Rieger, M. Schafer, S. Muller-Auer, C. Gabel, M. Fuchs, C. Fritzc, E. Holzer, D. Moestl, H. Hilbert, K. Borzym4, I. Langer4, Alfred Beck4, Hans Lehrach4, Richard Reinhardt4, Thomas M. Pohl5, P. Eger5, Wolfgang Zimmermann, H. Wedler, R. Wambutt, Bénédicte Purnelle6, André Goffeau6, Edouard Cadieu7, Stéphane Dréano7, Stéphanie Gloux7, Valerie Lelaure7, Stéphanie Mottier7, Francis Galibert7, Stephen J. Aves8, Z. Xiang8, Cherryl Hunt8, Karen Moore8, S. M. Hurst8, M. Lucas9, M. Rochet9, Claude Gaillardin9, Victor A. Tallada10, Victor A. Tallada11, Andrés Garzón10, Andrés Garzón11, G. Thode11, Rafael R. Daga10, Rafael R. Daga11, L. Cruzado11, Juan Jimenez10, Juan Jimenez11, Miguel del Nogal Sánchez12, F. del Rey12, J. Benito12, Angel Domínguez12, José L. Revuelta12, Sergio Moreno12, John Armstrong13, Susan L. Forsburg14, L. Cerrutti1, Todd M. Lowe15, W. R. McCombie16, Ian T. Paulsen17, Judith A. Potashkin18, G. V. Shpakovski19, David W. Ussery20, Bart Barrell1, Paul Nurse2 
21 Feb 2002-Nature
TL;DR: The genome of fission yeast (Schizosaccharomyces pombe), which contains the smallest number of protein-coding genes yet recorded for a eukaryote, is sequenced and highly conserved genes important for eukARYotic cell organization including those required for the cytoskeleton, compartmentation, cell-cycle control, proteolysis, protein phosphorylation and RNA splicing are identified.
Abstract: We have sequenced and annotated the genome of fission yeast (Schizosaccharomyces pombe), which contains the smallest number of protein-coding genes yet recorded for a eukaryote: 4,824. The centromeres are between 35 and 110 kilobases (kb) and contain related repeats including a highly conserved 1.8-kb element. Regions upstream of genes are longer than in budding yeast (Saccharomyces cerevisiae), possibly reflecting more-extended control regions. Some 43% of the genes contain introns, of which there are 4,730. Fifty genes have significant similarity with human disease genes; half of these are cancer related. We identify highly conserved genes important for eukaryotic cell organization including those required for the cytoskeleton, compartmentation, cell-cycle control, proteolysis, protein phosphorylation and RNA splicing. These genes may have originated with the appearance of eukaryotic life. Few similarly conserved genes that are important for multicellular organization were identified, suggesting that the transition from prokaryotes to eukaryotes required more new genes than did the transition from unicellular to multicellular organization.

1,686 citations

Journal ArticleDOI
Alasdair Ivens1, Christopher S. Peacock1, Elizabeth A. Worthey2, Lee Murphy1, Gautam Aggarwal2, Matthew Berriman1, Ellen Sisk2, Marie-Adèle Rajandream1, Ellen Adlem1, Rita Aert3, Atashi Anupama2, Zina Apostolou, Philip Attipoe2, Nathalie Bason1, Christopher Bauser4, Alfred Beck5, Stephen M. Beverley6, Gabriella Bianchettin7, K. Borzym5, G. Bothe4, Carlo V. Bruschi8, Carlo V. Bruschi7, Matt Collins1, Eithon Cadag2, Laura Ciarloni7, Christine Clayton, Richard M.R. Coulson9, Ann Cronin1, Angela K. Cruz10, Robert L. Davies1, Javier G. De Gaudenzi11, Deborah E. Dobson6, Andreas Duesterhoeft, Gholam Fazelina2, Nigel Fosker1, Alberto C.C. Frasch11, Audrey Fraser1, Monika Fuchs, Claudia Gabel, Arlette Goble1, André Goffeau12, David Harris1, Christiane Hertz-Fowler1, Helmut Hilbert, David Horn13, Yiting Huang2, Sven Klages5, Andrew J Knights1, Michael Kube5, Natasha Larke1, Lyudmila Litvin2, Angela Lord1, Tin Louie2, Marco A. Marra, David Masuy12, Keith R. Matthews14, Shulamit Michaeli, Jeremy C. Mottram15, Silke Müller-Auer, Heather Munden2, Siri Nelson2, Halina Norbertczak1, Karen Oliver1, Susan O'Neil1, Martin Pentony2, Thomas M. Pohl4, Claire Price1, Bénédicte Purnelle12, Michael A. Quail1, Ester Rabbinowitsch1, Richard Reinhardt5, Michael A. Rieger, Joel Rinta2, Johan Robben3, Laura Robertson2, Jeronimo C. Ruiz10, Simon Rutter1, David L. Saunders1, Melanie Schäfer, Jacquie Schein, David C. Schwartz16, Kathy Seeger1, Amber Seyler2, Sarah Sharp1, Heesun Shin, Dhileep Sivam2, Rob Squares1, Steve Squares1, Valentina Tosato7, Christy Vogt2, Guido Volckaert3, Rolf Wambutt, T. Warren1, Holger Wedler, John Woodward1, Shiguo Zhou16, Wolfgang Zimmermann, Deborah F. Smith17, Jenefer M. Blackwell18, Kenneth Stuart19, Kenneth Stuart2, Bart Barrell1, Peter J. Myler2, Peter J. Myler19 
15 Jul 2005-Science
TL;DR: The organization of protein-coding genes into long, strand-specific, polycistronic clusters and lack of general transcription factors in the L. major, Trypanosoma brucei, and Tritryp genomes suggest that the mechanisms regulating RNA polymerase II–directed transcription are distinct from those operating in other eukaryotes, although the trypanosomatids appear capable of chromatin remodeling.
Abstract: Leishmania species cause a spectrum of human diseases in tropical and subtropical regions of the world. We have sequenced the 36 chromosomes of the 32.8-megabase haploid genome of Leishmania major (Friedlin strain) and predict 911 RNA genes, 39 pseudogenes, and 8272 protein-coding genes, of which 36% can be ascribed a putative function. These include genes involved in host-pathogen interactions, such as proteolytic enzymes, and extensive machinery for synthesis of complex surface glycoconjugates. The organization of protein-coding genes into long, strand-specific, polycistronic clusters and lack of general transcription factors in the L. major, Trypanosoma brucei, and Trypanosoma cruzi (Tritryp) genomes suggest that the mechanisms regulating RNA polymerase II-directed transcription are distinct from those operating in other eukaryotes, although the trypanosomatids appear capable of chromatin remodeling. Abundant RNA-binding proteins are encoded in the Tritryp genomes, consistent with active posttranscriptional regulation of gene expression.

1,357 citations

Journal ArticleDOI
William C. Nierman1, William C. Nierman2, Arnab Pain3, Michael J. Anderson4, Jennifer R. Wortman1, Jennifer R. Wortman2, H. Stanley Kim2, H. Stanley Kim1, Javier Arroyo5, Matthew Berriman3, Keietsu Abe6, David B. Archer7, Clara Bermejo5, Joan W. Bennett8, Paul Bowyer4, Dan Chen2, Dan Chen1, Matthew Collins3, Richard Coulsen, Robert L. Davies3, Paul S. Dyer7, Mark L. Farman9, Nadia Fedorova1, Nadia Fedorova2, Natalie D. Fedorova1, Natalie D. Fedorova2, T. Feldblyum1, T. Feldblyum2, Reinhard Fischer10, Nigel Fosker3, Audrey Fraser3, José Luis García11, María Josefa Marcos García12, Ariette Goble3, Gustavo H. Goldman13, Katsuya Gomi6, Sam Griffith-Jones3, R. Gwilliam3, Brian J. Haas1, Brian J. Haas2, Hubertus Haas14, David Harris3, H. Horiuchi15, Jiaqi Huang1, Jiaqi Huang2, Sean Humphray3, Javier Jiménez12, Nancy P. Keller15, H. Khouri1, H. Khouri2, Katsuhiko Kitamoto16, Tetsuo Kobayashi17, Sven Konzack10, Resham Kulkarni1, Resham Kulkarni2, Toshitaka Kumagai18, Anne Lafton19, Jean-Paul Latgé19, Weixi Li9, Angela Lord3, Charles Lu2, Charles Lu1, William H. Majoros2, William H. Majoros1, Gregory S. May20, Bruce L. Miller21, Yasmin Ali Mohamoud2, Yasmin Ali Mohamoud1, María Molina5, Michel Monod22, Isabelle Mouyna19, Stephanie Mulligan1, Stephanie Mulligan2, Lee Murphy3, Susan O'Neil3, Ian T. Paulsen1, Ian T. Paulsen2, Miguel A. Peñalva11, Mihaela Pertea1, Mihaela Pertea2, Claire Price3, Bethan L. Pritchard4, Michael A. Quail3, Ester Rabbinowitsch3, Neil Rawlins3, Marie Adele Rajandream3, Utz Reichard23, Hubert Renauld3, Geoffrey D. Robson4, Santiago Rodríguez de Córdoba11, José Manuel Rodríguez-Peña5, Catherine M. Ronning2, Catherine M. Ronning1, Simon Rutter3, Steven L. Salzberg1, Steven L. Salzberg2, Miguel del Nogal Sánchez12, Juan C. Sánchez-Ferrero11, David L. Saunders3, Kathy Seeger3, Rob Squares3, S. Squares3, Michio Takeuchi24, Fredj Tekaia19, Geoffrey Turner25, Carlos R. Vázquez de Aldana12, J. Weidman1, J. Weidman2, Owen White2, Owen White1, John Woodward3, Jae-Hyuk Yu15, Claire M. Fraser2, Claire M. Fraser1, James E. Galagan26, Kiyoshi Asai18, Masayuki Machida18, Neil Hall3, Neil Hall2, Bart Barrell3, David W. Denning4 
22 Dec 2005-Nature
TL;DR: The Af293 genome sequence provides an unparalleled resource for the future understanding of this remarkable fungus and revealed temperature-dependent expression of distinct sets of genes, as well as 700 A. fumigatus genes not present or significantly diverged in the closely related sexual species Neosartorya fischeri, many of which may have roles in the pathogenicity phenotype.
Abstract: Aspergillus fumigatus is exceptional among microorganisms in being both a primary and opportunistic pathogen as well as a major allergen. Its conidia production is prolific, and so human respiratory tract exposure is almost constant. A. fumigatus is isolated from human habitats and vegetable compost heaps. In immunocompromised individuals, the incidence of invasive infection can be as high as 50% and the mortality rate is often about 50% (ref. 2). The interaction of A. fumigatus and other airborne fungi with the immune system is increasingly linked to severe asthma and sinusitis. Although the burden of invasive disease caused by A. fumigatus is substantial, the basic biology of the organism is mostly obscure. Here we show the complete 29.4-megabase genome sequence of the clinical isolate Af293, which consists of eight chromosomes containing 9,926 predicted genes. Microarray analysis revealed temperature-dependent expression of distinct sets of genes, as well as 700 A. fumigatus genes not present or significantly diverged in the closely related sexual species Neosartorya fischeri, many of which may have roles in the pathogenicity phenotype. The Af293 genome sequence provides an unparalleled resource for the future understanding of this remarkable fungus.

1,356 citations

Journal ArticleDOI
TL;DR: It is shown that pseudogene formation and gene loss are the principal forces shaping the different genomes of Leishmania, and genes that are differentially distributed between the species encode proteins implicated in host-pathogen interactions and parasite survival in the macrophage.
Abstract: Leishmania parasites cause a broad spectrum of clinical disease. Here we report the sequencing of the genomes of two species of Leishmania: Leishmania infantum and Leishmania braziliensis. The comparison of these sequences with the published genome of Leishmania major reveals marked conservation of synteny and identifies only 200 genes with a differential distribution between the three species. L. braziliensis, contrary to Leishmania species examined so far, possesses components of a putative RNA-mediated interference pathway, telomere-associated transposable elements and spliced leader–associated SLACS retrotransposons. We show that pseudogene formation and gene loss are the principal forces shaping the different genomes. Genes that are differentially distributed between the species encode proteins implicated in host-pathogen interactions and parasite survival in the macrophage.

721 citations


Cited by
More filters
Journal ArticleDOI
03 Oct 2002-Nature
TL;DR: The genome sequence of P. falciparum clone 3D7 is reported, which is the most (A + T)-rich genome sequenced to date and is being exploited in the search for new drugs and vaccines to fight malaria.
Abstract: The parasite Plasmodium falciparum is responsible for hundreds of millions of cases of malaria, and kills more than one million African children annually. Here we report an analysis of the genome sequence of P. falciparum clone 3D7. The 23-megabase nuclear genome consists of 14 chromosomes, encodes about 5,300 genes, and is the most (A + T)-rich genome sequenced to date. Genes involved in antigenic variation are concentrated in the subtelomeric regions of the chromosomes. Compared to the genomes of free-living eukaryotic microbes, the genome of this intracellular parasite encodes fewer enzymes and transporters, but a large proportion of genes are devoted to immune evasion and host-parasite interactions. Many nuclear-encoded proteins are targeted to the apicoplast, an organelle involved in fatty-acid and isoprenoid metabolism. The genome sequence provides the foundation for future studies of this organism, and is being exploited in the search for new drugs and vaccines to fight malaria.

4,312 citations

Journal ArticleDOI
TL;DR: A major update of the previously developed system for delineation of Clusters of Orthologous Groups of proteins (COGs) from the sequenced genomes of prokaryotes and unicellular eukaryotes is described and is expected to be a useful platform for functional annotation of newlysequenced genomes, including those of complex eukARYotes, and genome-wide evolutionary studies.
Abstract: The availability of multiple, essentially complete genome sequences of prokaryotes and eukaryotes spurred both the demand and the opportunity for the construction of an evolutionary classification of genes from these genomes. Such a classification system based on orthologous relationships between genes appears to be a natural framework for comparative genomics and should facilitate both functional annotation of genomes and large-scale evolutionary studies. We describe here a major update of the previously developed system for delineation of Clusters of Orthologous Groups of proteins (COGs) from the sequenced genomes of prokaryotes and unicellular eukaryotes and the construction of clusters of predicted orthologs for 7 eukaryotic genomes, which we named KOGs after euk aryotic o rthologous g roups. The COG collection currently consists of 138,458 proteins, which form 4873 COGs and comprise 75% of the 185,505 (predicted) proteins encoded in 66 genomes of unicellular organisms. The euk aryotic o rthologous g roups (KOGs) include proteins from 7 eukaryotic genomes: three animals (the nematode Caenorhabditis elegans, the fruit fly Drosophila melanogaster and Homo sapiens), one plant, Arabidopsis thaliana, two fungi (Saccharomyces cerevisiae and Schizosaccharomyces pombe), and the intracellular microsporidian parasite Encephalitozoon cuniculi. The current KOG set consists of 4852 clusters of orthologs, which include 59,838 proteins, or ~54% of the analyzed eukaryotic 110,655 gene products. Compared to the coverage of the prokaryotic genomes with COGs, a considerably smaller fraction of eukaryotic genes could be included into the KOGs; addition of new eukaryotic genomes is expected to result in substantial increase in the coverage of eukaryotic genomes with KOGs. Examination of the phyletic patterns of KOGs reveals a conserved core represented in all analyzed species and consisting of ~20% of the KOG set. This conserved portion of the KOG set is much greater than the ubiquitous portion of the COG set (~1% of the COGs). In part, this difference is probably due to the small number of included eukaryotic genomes, but it could also reflect the relative compactness of eukaryotes as a clade and the greater evolutionary stability of eukaryotic genomes. The updated collection of orthologous protein sets for prokaryotes and eukaryotes is expected to be a useful platform for functional annotation of newly sequenced genomes, including those of complex eukaryotes, and genome-wide evolutionary studies.

4,167 citations

Journal ArticleDOI
16 Jan 2003-Nature
TL;DR: It is found that genes of similar functions are clustered in distinct, multi-megabase regions of individual chromosomes; genes in these regions tend to share transcriptional profiles.
Abstract: A principal challenge currently facing biologists is how to connect the complete DNA sequence of an organism to its development and behaviour. Large-scale targeted-deletions have been successful in defining gene functions in the single-celled yeast Saccharomyces cerevisiae, but comparable analyses have yet to be performed in an animal. Here we describe the use of RNA interference to inhibit the function of ∼86% of the 19,427 predicted genes of C. elegans. We identified mutant phenotypes for 1,722 genes, about two-thirds of which were not previously associated with a phenotype. We find that genes of similar functions are clustered in distinct, multi-megabase regions of individual chromosomes; genes in these regions tend to share transcriptional profiles. Our resulting data set and reusable RNAi library of 16,757 bacterial clones will facilitate systematic analyses of the connections among gene sequence, chromosomal location and gene function in C. elegans.

3,529 citations

Journal ArticleDOI
22 Oct 2015-Cell
TL;DR: In this paper, the authors characterized Cpf1, a putative class 2 CRISPR effector, which is a single RNA-guided endonuclease lacking tracrRNA and utilizes a T-rich protospacer-adjacent motif.

3,436 citations

Journal ArticleDOI
TL;DR: The short history, specific features and future prospects of research of microbial metabolites, including antibiotics and other bioactive metabolites, are summarized.
Abstract: The short history, specific features and future prospects of research of microbial metabolites, including antibiotics and other bioactive metabolites, are summarized. The microbial origin, diversity of producing species, functions and various bioactivities of metabolites, unique features of their chemical structures are discussed, mainly on the basis of statistical data. The possible numbers of metabolites may be discovered in the future, the problems of dereplication of newly isolated compounds as well as the new trends and prospects of the research are also discussed.

2,706 citations