Home
/
Authors
/
Kathy Seeger

Author

Kathy Seeger

Bio: Kathy Seeger is an academic researcher from Wellcome Trust Sanger Institute. The author has contributed to research in topics: Genome & Gene. The author has an hindex of 19, co-authored 23 publications receiving 10352 citations.

Topics: Genome, Gene, Schizosaccharomyces pombe, Synteny, Conserved sequence ...read more

Papers

PDF

Open Access

More filters

Journal Article•DOI•

Complete genome sequence of the model actinomycete Streptomyces coelicolor A3(2)

[...]

Stephen D. Bentley¹, Keith F. Chater², Ana Cerdeño-Tárraga¹, Gregory L. Challis³, Gregory L. Challis², Nicholas R. Thomson¹, Keith D. James¹, David Harris¹, Michael A. Quail¹, H. M. Kieser², D. Harper¹, Alex Bateman¹, Steve D.M. Brown¹, Govind Chandra², Carton W. Chen⁴, Mark O. Collins¹, Ann Cronin¹, Andrew G. Fraser¹, Arlette Goble¹, J. Hidalgo¹, T. Hornsby¹, S. Howarth¹, Chih-Hung Huang⁴, Tobias Kieser², L. Larke¹, Lee Murphy¹, Karen Oliver¹, Susan O'Neil¹, Ester Rabbinowitsch¹, Marie-Adèle Rajandream¹, Kim Rutherford¹, Simon Rutter¹, Kathy Seeger¹, David L. Saunders¹, Sarah Sharp¹, R. Squares¹, S. Squares¹, K. Taylor¹, T. Warren¹, Andreas Wietzorrek², John Woodward¹, Bart Barrell¹, Julian Parkhill¹, David A. Hopwood² - Show less +40 more•Institutions (4)

Wellcome Trust Sanger Institute¹, Norwich Research Park², University of Warwick³, National Yang-Ming University⁴

09 May 2002-Nature

TL;DR: The 8,667,507 base pair linear chromosome of Streptomyces coelicolor is reported, containing the largest number of genes so far discovered in a bacterium.

...read moreread less

Abstract: Streptomyces coelicolor is a representative of the group of soil-dwelling, filamentous bacteria responsible for producing most natural antibiotics used in human and veterinary medicine. Here we report the 8,667,507 base pair linear chromosome of this organism, containing the largest number of genes so far discovered in a bacterium. The 7,825 predicted genes include more than 20 clusters coding for known or predicted secondary metabolites. The genome contains an unprecedented proportion of regulatory genes, predominantly those likely to be involved in responses to external stimuli and stresses, and many duplicated gene sets that may represent 'tissue-specific' isoforms operating in different phases of colonial development, a unique situation for a bacterium. An ancient synteny was revealed between the central 'core' of the chromosome and the whole chromosome of pathogens Mycobacterium tuberculosis and Corynebacterium diphtheriae. The genome sequence will greatly increase our understanding of microbial life in the soil as well as aiding the generation of new drug candidates by genetic engineering.

...read moreread less

3,077 citations

Journal Article•DOI•

The genome sequence of Schizosaccharomyces pombe

[...]

Valerie Wood¹, R. Gwilliam¹, Marie-Adèle Rajandream¹, M. Lyne¹, Rachel Lyne¹, A. Stewart², J. Sgouros², N. Peat², Jacqueline Hayles², Stephen Baker¹, D. Basham¹, Sharen Bowman¹, Karen Brooks¹, D. Brown¹, Steve D.M. Brown¹, Tracey Chillingworth¹, Carol Churcher¹, Mark O. Collins¹, R. Connor¹, Ann Cronin¹, P. Davis¹, Theresa Feltwell¹, Andrew G. Fraser¹, S. Gentles¹, Arlette Goble¹, N. Hamlin¹, David Harris¹, J. Hidalgo¹, Geoffrey M. Hodgson¹, S. Holroyd¹, T. Hornsby¹, S. Howarth¹, Elizabeth J. Huckle¹, Sarah E. Hunt¹, Kay Jagels¹, Kylie R. James¹, L. Jones¹, Matthew Jones¹, S. Leather¹, S. McDonald¹, J. McLean¹, P. Mooney¹, Sharon Moule¹, Karen Mungall¹, Lee Murphy¹, D. Niblett¹, C. Odell¹, Karen Oliver¹, Susan O'Neil¹, D. Pearson¹, Michael A. Quail¹, Ester Rabbinowitsch¹, Kim Rutherford¹, Simon Rutter¹, David L. Saunders¹, Kathy Seeger¹, Sarah Sharp¹, Jason Skelton¹, Mark Simmonds¹, R. Squares¹, S. Squares¹, K. Stevens¹, K. Taylor¹, Ruth Taylor¹, Adrian Tivey¹, S. Walsh¹, T. Warren¹, S. Whitehead¹, John Woodward¹, Guido Volckaert³, Rita Aert³, Johan Robben³, B. Grymonprez³, I. Weltjens³, E. Vanstreels³, Michael A. Rieger, M. Schafer, S. Muller-Auer, C. Gabel, M. Fuchs, C. Fritzc, E. Holzer, D. Moestl, H. Hilbert, K. Borzym⁴, I. Langer⁴, Alfred Beck⁴, Hans Lehrach⁴, Richard Reinhardt⁴, Thomas M. Pohl⁵, P. Eger⁵, Wolfgang Zimmermann, H. Wedler, R. Wambutt, Bénédicte Purnelle⁶, André Goffeau⁶, Edouard Cadieu⁷, Stéphane Dréano⁷, Stéphanie Gloux⁷, Valerie Lelaure⁷, Stéphanie Mottier⁷, Francis Galibert⁷, Stephen J. Aves⁸, Z. Xiang⁸, Cherryl Hunt⁸, Karen Moore⁸, S. M. Hurst⁸, M. Lucas⁹, M. Rochet⁹, Claude Gaillardin⁹, Victor A. Tallada¹⁰, Victor A. Tallada¹¹, Andrés Garzón¹⁰, Andrés Garzón¹¹, G. Thode¹¹, Rafael R. Daga¹⁰, Rafael R. Daga¹¹, L. Cruzado¹¹, Juan Jimenez¹⁰, Juan Jimenez¹¹, Miguel del Nogal Sánchez¹², F. del Rey¹², J. Benito¹², Angel Domínguez¹², José L. Revuelta¹², Sergio Moreno¹², John Armstrong¹³, Susan L. Forsburg¹⁴, L. Cerrutti¹, Todd M. Lowe¹⁵, W. R. McCombie¹⁶, Ian T. Paulsen¹⁷, Judith A. Potashkin¹⁸, G. V. Shpakovski¹⁹, David W. Ussery²⁰, Bart Barrell¹, Paul Nurse² - Show less +133 more•Institutions (20)

Wellcome Trust Sanger Institute¹, London Research Institute², Katholieke Universiteit Leuven³, Max Planck Society⁴, GATC Biotech⁵, Université catholique de Louvain⁶, Centre national de la recherche scientifique⁷, University of Exeter⁸, Institut national agronomique Paris Grignon⁹, Pablo de Olavide University¹⁰, University of Málaga¹¹, University of Salamanca¹², University of Sussex¹³, Salk Institute for Biological Studies¹⁴, Stanford University¹⁵, Cold Spring Harbor Laboratory¹⁶, TigerLogic¹⁷, Rosalind Franklin University of Medicine and Science¹⁸, Russian Academy of Sciences¹⁹, Technical University of Denmark²⁰

21 Feb 2002-Nature

TL;DR: The genome of fission yeast (Schizosaccharomyces pombe), which contains the smallest number of protein-coding genes yet recorded for a eukaryote, is sequenced and highly conserved genes important for eukARYotic cell organization including those required for the cytoskeleton, compartmentation, cell-cycle control, proteolysis, protein phosphorylation and RNA splicing are identified.

...read moreread less

Abstract: We have sequenced and annotated the genome of fission yeast (Schizosaccharomyces pombe), which contains the smallest number of protein-coding genes yet recorded for a eukaryote: 4,824. The centromeres are between 35 and 110 kilobases (kb) and contain related repeats including a highly conserved 1.8-kb element. Regions upstream of genes are longer than in budding yeast (Saccharomyces cerevisiae), possibly reflecting more-extended control regions. Some 43% of the genes contain introns, of which there are 4,730. Fifty genes have significant similarity with human disease genes; half of these are cancer related. We identify highly conserved genes important for eukaryotic cell organization including those required for the cytoskeleton, compartmentation, cell-cycle control, proteolysis, protein phosphorylation and RNA splicing. These genes may have originated with the appearance of eukaryotic life. Few similarly conserved genes that are important for multicellular organization were identified, suggesting that the transition from prokaryotes to eukaryotes required more new genes than did the transition from unicellular to multicellular organization.

...read moreread less

1,686 citations

Journal Article•DOI•

The genome of the kinetoplastid parasite, Leishmania major.

[...]

Alasdair Ivens¹, Christopher S. Peacock¹, Elizabeth A. Worthey², Lee Murphy¹, Gautam Aggarwal², Matthew Berriman¹, Ellen Sisk², Marie-Adèle Rajandream¹, Ellen Adlem¹, Rita Aert³, Atashi Anupama², Zina Apostolou, Philip Attipoe², Nathalie Bason¹, Christopher Bauser⁴, Alfred Beck⁵, Stephen M. Beverley⁶, Gabriella Bianchettin⁷, K. Borzym⁵, G. Bothe⁴, Carlo V. Bruschi⁸, Carlo V. Bruschi⁷, Matt Collins¹, Eithon Cadag², Laura Ciarloni⁷, Christine Clayton, Richard M.R. Coulson⁹, Ann Cronin¹, Angela K. Cruz¹⁰, Robert L. Davies¹, Javier G. De Gaudenzi¹¹, Deborah E. Dobson⁶, Andreas Duesterhoeft, Gholam Fazelina², Nigel Fosker¹, Alberto C.C. Frasch¹¹, Audrey Fraser¹, Monika Fuchs, Claudia Gabel, Arlette Goble¹, André Goffeau¹², David Harris¹, Christiane Hertz-Fowler¹, Helmut Hilbert, David Horn¹³, Yiting Huang², Sven Klages⁵, Andrew J Knights¹, Michael Kube⁵, Natasha Larke¹, Lyudmila Litvin², Angela Lord¹, Tin Louie², Marco A. Marra, David Masuy¹², Keith R. Matthews¹⁴, Shulamit Michaeli, Jeremy C. Mottram¹⁵, Silke Müller-Auer, Heather Munden², Siri Nelson², Halina Norbertczak¹, Karen Oliver¹, Susan O'Neil¹, Martin Pentony², Thomas M. Pohl⁴, Claire Price¹, Bénédicte Purnelle¹², Michael A. Quail¹, Ester Rabbinowitsch¹, Richard Reinhardt⁵, Michael A. Rieger, Joel Rinta², Johan Robben³, Laura Robertson², Jeronimo C. Ruiz¹⁰, Simon Rutter¹, David L. Saunders¹, Melanie Schäfer, Jacquie Schein, David C. Schwartz¹⁶, Kathy Seeger¹, Amber Seyler², Sarah Sharp¹, Heesun Shin, Dhileep Sivam², Rob Squares¹, Steve Squares¹, Valentina Tosato⁷, Christy Vogt², Guido Volckaert³, Rolf Wambutt, T. Warren¹, Holger Wedler, John Woodward¹, Shiguo Zhou¹⁶, Wolfgang Zimmermann, Deborah F. Smith¹⁷, Jenefer M. Blackwell¹⁸, Kenneth Stuart¹⁹, Kenneth Stuart², Bart Barrell¹, Peter J. Myler², Peter J. Myler¹⁹ - Show less +100 more•Institutions (19)

Wellcome Trust Sanger Institute¹, Seattle Biomed², Katholieke Universiteit Leuven³, GATC Biotech⁴, Max Planck Society⁵, Washington University in St. Louis⁶, University of Trieste⁷, International Centre for Genetic Engineering and Biotechnology⁸, European Bioinformatics Institute⁹, University of São Paulo¹⁰, National Scientific and Technical Research Council¹¹, Université catholique de Louvain¹², University of London¹³, University of Edinburgh¹⁴, University of Glasgow¹⁵, University of Wisconsin-Madison¹⁶, University of York¹⁷, University of Cambridge¹⁸, University of Washington¹⁹

15 Jul 2005-Science

TL;DR: The organization of protein-coding genes into long, strand-specific, polycistronic clusters and lack of general transcription factors in the L. major, Trypanosoma brucei, and Tritryp genomes suggest that the mechanisms regulating RNA polymerase II–directed transcription are distinct from those operating in other eukaryotes, although the trypanosomatids appear capable of chromatin remodeling.

...read moreread less

Abstract: Leishmania species cause a spectrum of human diseases in tropical and subtropical regions of the world. We have sequenced the 36 chromosomes of the 32.8-megabase haploid genome of Leishmania major (Friedlin strain) and predict 911 RNA genes, 39 pseudogenes, and 8272 protein-coding genes, of which 36% can be ascribed a putative function. These include genes involved in host-pathogen interactions, such as proteolytic enzymes, and extensive machinery for synthesis of complex surface glycoconjugates. The organization of protein-coding genes into long, strand-specific, polycistronic clusters and lack of general transcription factors in the L. major, Trypanosoma brucei, and Trypanosoma cruzi (Tritryp) genomes suggest that the mechanisms regulating RNA polymerase II-directed transcription are distinct from those operating in other eukaryotes, although the trypanosomatids appear capable of chromatin remodeling. Abundant RNA-binding proteins are encoded in the Tritryp genomes, consistent with active posttranscriptional regulation of gene expression.

...read moreread less

1,357 citations

Journal Article•DOI•

Genomic sequence of the pathogenic and allergenic filamentous fungus Aspergillus fumigatus

[...]

William C. Nierman¹, William C. Nierman², Arnab Pain³, Michael J. Anderson⁴, Jennifer R. Wortman¹, Jennifer R. Wortman², H. Stanley Kim², H. Stanley Kim¹, Javier Arroyo⁵, Matthew Berriman³, Keietsu Abe⁶, David B. Archer⁷, Clara Bermejo⁵, Joan W. Bennett⁸, Paul Bowyer⁴, Dan Chen², Dan Chen¹, Matthew Collins³, Richard Coulsen, Robert L. Davies³, Paul S. Dyer⁷, Mark L. Farman⁹, Nadia Fedorova¹, Nadia Fedorova², Natalie D. Fedorova¹, Natalie D. Fedorova², T. Feldblyum¹, T. Feldblyum², Reinhard Fischer¹⁰, Nigel Fosker³, Audrey Fraser³, José Luis García¹¹, María Josefa Marcos García¹², Ariette Goble³, Gustavo H. Goldman¹³, Katsuya Gomi⁶, Sam Griffith-Jones³, R. Gwilliam³, Brian J. Haas¹, Brian J. Haas², Hubertus Haas¹⁴, David Harris³, H. Horiuchi¹⁵, Jiaqi Huang¹, Jiaqi Huang², Sean Humphray³, Javier Jiménez¹², Nancy P. Keller¹⁵, H. Khouri¹, H. Khouri², Katsuhiko Kitamoto¹⁶, Tetsuo Kobayashi¹⁷, Sven Konzack¹⁰, Resham Kulkarni¹, Resham Kulkarni², Toshitaka Kumagai¹⁸, Anne Lafton¹⁹, Jean-Paul Latgé¹⁹, Weixi Li⁹, Angela Lord³, Charles Lu², Charles Lu¹, William H. Majoros², William H. Majoros¹, Gregory S. May²⁰, Bruce L. Miller²¹, Yasmin Ali Mohamoud², Yasmin Ali Mohamoud¹, María Molina⁵, Michel Monod²², Isabelle Mouyna¹⁹, Stephanie Mulligan¹, Stephanie Mulligan², Lee Murphy³, Susan O'Neil³, Ian T. Paulsen¹, Ian T. Paulsen², Miguel A. Peñalva¹¹, Mihaela Pertea¹, Mihaela Pertea², Claire Price³, Bethan L. Pritchard⁴, Michael A. Quail³, Ester Rabbinowitsch³, Neil Rawlins³, Marie Adele Rajandream³, Utz Reichard²³, Hubert Renauld³, Geoffrey D. Robson⁴, Santiago Rodríguez de Córdoba¹¹, José Manuel Rodríguez-Peña⁵, Catherine M. Ronning², Catherine M. Ronning¹, Simon Rutter³, Steven L. Salzberg¹, Steven L. Salzberg², Miguel del Nogal Sánchez¹², Juan C. Sánchez-Ferrero¹¹, David L. Saunders³, Kathy Seeger³, Rob Squares³, S. Squares³, Michio Takeuchi²⁴, Fredj Tekaia¹⁹, Geoffrey Turner²⁵, Carlos R. Vázquez de Aldana¹², J. Weidman¹, J. Weidman², Owen White², Owen White¹, John Woodward³, Jae-Hyuk Yu¹⁵, Claire M. Fraser², Claire M. Fraser¹, James E. Galagan²⁶, Kiyoshi Asai¹⁸, Masayuki Machida¹⁸, Neil Hall³, Neil Hall², Bart Barrell³, David W. Denning⁴ - Show less +117 more•Institutions (26)

Washington University in St. Louis¹, J. Craig Venter Institute², Wellcome Trust Sanger Institute³, University of Manchester⁴, Complutense University of Madrid⁵, Tohoku University⁶, University of Nottingham⁷, Tulane University⁸, University of Kentucky⁹, Max Planck Society¹⁰, Spanish National Research Council¹¹, University of Salamanca¹², University of São Paulo¹³, Innsbruck Medical University¹⁴, University of Wisconsin-Madison¹⁵, University of Tokyo¹⁶, Nagoya University¹⁷, National Institute of Advanced Industrial Science and Technology¹⁸, Pasteur Institute¹⁹, University of Texas MD Anderson Cancer Center²⁰, University of Idaho²¹, University of Lausanne²², University of Göttingen²³, Tokyo University of Agriculture and Technology²⁴, University of Sheffield²⁵, Broad Institute²⁶

22 Dec 2005-Nature

TL;DR: The Af293 genome sequence provides an unparalleled resource for the future understanding of this remarkable fungus and revealed temperature-dependent expression of distinct sets of genes, as well as 700 A. fumigatus genes not present or significantly diverged in the closely related sexual species Neosartorya fischeri, many of which may have roles in the pathogenicity phenotype.

...read moreread less

Abstract: Aspergillus fumigatus is exceptional among microorganisms in being both a primary and opportunistic pathogen as well as a major allergen. Its conidia production is prolific, and so human respiratory tract exposure is almost constant. A. fumigatus is isolated from human habitats and vegetable compost heaps. In immunocompromised individuals, the incidence of invasive infection can be as high as 50% and the mortality rate is often about 50% (ref. 2). The interaction of A. fumigatus and other airborne fungi with the immune system is increasingly linked to severe asthma and sinusitis. Although the burden of invasive disease caused by A. fumigatus is substantial, the basic biology of the organism is mostly obscure. Here we show the complete 29.4-megabase genome sequence of the clinical isolate Af293, which consists of eight chromosomes containing 9,926 predicted genes. Microarray analysis revealed temperature-dependent expression of distinct sets of genes, as well as 700 A. fumigatus genes not present or significantly diverged in the closely related sexual species Neosartorya fischeri, many of which may have roles in the pathogenicity phenotype. The Af293 genome sequence provides an unparalleled resource for the future understanding of this remarkable fungus.

...read moreread less

1,356 citations

Journal Article•DOI•

Comparative genomic analysis of three Leishmania species that cause diverse human disease

[...]

Christopher S. Peacock¹, Kathy Seeger¹, David Harris¹, Lee Murphy¹, Jeronimo C. Ruiz², Michael A. Quail¹, Nicholas S. Peters¹, Ellen Adlem¹, Adrian Tivey¹, Martin Aslett¹, Arnaud Kerhornou¹, Alasdair Ivens¹, Audrey Fraser¹, Marie-Adèle Rajandream¹, Tim Carver¹, Halina Norbertczak¹, Tracey Chillingworth¹, Zahra Hance¹, Kay Jagels¹, Sharon Moule¹, Doug Ormond¹, Simon Rutter¹, Rob Squares¹, Sally Whitehead¹, Ester Rabbinowitsch¹, Claire Arrowsmith¹, Brian White¹, Scott Thurston¹, Frédéric Bringaud³, Sandra L. Baldauf⁴, Adam Faulconbridge⁴, Daniel C. Jeffares¹, Daniel P. Depledge⁴, Samuel O. Oyola⁴, James D. Hilley⁵, Loislene O. Brito², Luiz R. O. Tosi², Barclay G. Barrell¹, Angela K. Cruz², Jeremy C. Mottram⁵, Deborah F. Smith⁴, Matthew Berriman¹ - Show less +38 more•Institutions (5)

Wellcome Trust Sanger Institute¹, University of São Paulo², Centre national de la recherche scientifique³, University of York⁴, University of Glasgow⁵

01 Jul 2007-Nature Genetics

TL;DR: It is shown that pseudogene formation and gene loss are the principal forces shaping the different genomes of Leishmania, and genes that are differentially distributed between the species encode proteins implicated in host-pathogen interactions and parasite survival in the macrophage.

...read moreread less

Abstract: Leishmania parasites cause a broad spectrum of clinical disease. Here we report the sequencing of the genomes of two species of Leishmania: Leishmania infantum and Leishmania braziliensis. The comparison of these sequences with the published genome of Leishmania major reveals marked conservation of synteny and identifies only 200 genes with a differential distribution between the three species. L. braziliensis, contrary to Leishmania species examined so far, possesses components of a putative RNA-mediated interference pathway, telomere-associated transposable elements and spliced leader–associated SLACS retrotransposons. We show that pseudogene formation and gene loss are the principal forces shaping the different genomes. Genes that are differentially distributed between the species encode proteins implicated in host-pathogen interactions and parasite survival in the macrophage.

...read moreread less

721 citations

1
2
3
4
…
5

Cited by

PDF

Open Access

More filters

Journal Article•DOI•

Genome sequence of the human malaria parasite Plasmodium falciparum

[...]

Malcolm J. Gardner¹, Neil Hall¹, Eula Fung¹, Owen White¹, Matthew Berriman¹, Richard W. Hyman¹, Jane M. Carlton¹, Arnab Pain¹, Karen E. Nelson¹, Sharen Bowman¹, Ian T. Paulsen¹, Keith D. James¹, Jonathan A. Eisen¹, Kim Rutherford¹, Steven L. Salzberg¹, Alister Craig¹, Sue Kyes¹, Man Suen Chan¹, Vishvanath Nene¹, Shamira J. Shallom¹, Bernard B. Suh¹, Jeremy Peterson¹, Samuel V. Angiuoli¹, Mihaela Pertea¹, Jonathan E. Allen¹, Jeremy D. Selengut¹, Daniel H. Haft¹, Michael W. Mather¹, Akhil B. Vaidya¹, David M. A. Martin¹, Alan H. Fairlamb¹, Martin Fraunholz¹, David S. Roos¹, Stuart A. Ralph¹, Geoffrey I. McFadden¹, Leda M. Cummings¹, G. Mani Subramanian¹, Christopher J. Mungall¹, J. Craig Venter¹, Daniel J. Carucci¹, Stephen L. Hoffman¹, Chris I. Newbold¹, Ronald W. Davis¹, Claire M. Fraser¹, Bart Barrell¹ - Show less +41 more•Institutions (1)

J. Craig Venter Institute¹

03 Oct 2002-Nature

TL;DR: The genome sequence of P. falciparum clone 3D7 is reported, which is the most (A + T)-rich genome sequenced to date and is being exploited in the search for new drugs and vaccines to fight malaria.

...read moreread less

Abstract: The parasite Plasmodium falciparum is responsible for hundreds of millions of cases of malaria, and kills more than one million African children annually. Here we report an analysis of the genome sequence of P. falciparum clone 3D7. The 23-megabase nuclear genome consists of 14 chromosomes, encodes about 5,300 genes, and is the most (A + T)-rich genome sequenced to date. Genes involved in antigenic variation are concentrated in the subtelomeric regions of the chromosomes. Compared to the genomes of free-living eukaryotic microbes, the genome of this intracellular parasite encodes fewer enzymes and transporters, but a large proportion of genes are devoted to immune evasion and host-parasite interactions. Many nuclear-encoded proteins are targeted to the apicoplast, an organelle involved in fatty-acid and isoprenoid metabolism. The genome sequence provides the foundation for future studies of this organism, and is being exploited in the search for new drugs and vaccines to fight malaria.

...read moreread less

4,312 citations

Journal Article•DOI•

The COG database: an updated version includes eukaryotes

[...]

Roman L. Tatusov¹, Natalie D. Fedorova¹, John D. Jackson¹, Aviva R. Jacobs¹, Boris Kiryutin¹, Eugene V. Koonin¹, Dmitri M. Krylov¹, Raja Mazumder², Sergei L. Mekhedov¹, Anastasia N. Nikolskaya², B Sridhar Rao¹, Sergei Smirnov¹, Alexander V. Sverdlov¹, Sona Vasudevan¹, Yuri I. Wolf¹, Jodie J. Yin¹, Darren A. Natale² - Show less +13 more•Institutions (2)

National Institutes of Health¹, Georgetown University Medical Center²

11 Sep 2003-BMC Bioinformatics

TL;DR: A major update of the previously developed system for delineation of Clusters of Orthologous Groups of proteins (COGs) from the sequenced genomes of prokaryotes and unicellular eukaryotes is described and is expected to be a useful platform for functional annotation of newlysequenced genomes, including those of complex eukARYotes, and genome-wide evolutionary studies.

...read moreread less

Abstract: The availability of multiple, essentially complete genome sequences of prokaryotes and eukaryotes spurred both the demand and the opportunity for the construction of an evolutionary classification of genes from these genomes. Such a classification system based on orthologous relationships between genes appears to be a natural framework for comparative genomics and should facilitate both functional annotation of genomes and large-scale evolutionary studies. We describe here a major update of the previously developed system for delineation of Clusters of Orthologous Groups of proteins (COGs) from the sequenced genomes of prokaryotes and unicellular eukaryotes and the construction of clusters of predicted orthologs for 7 eukaryotic genomes, which we named KOGs after euk aryotic o rthologous g roups. The COG collection currently consists of 138,458 proteins, which form 4873 COGs and comprise 75% of the 185,505 (predicted) proteins encoded in 66 genomes of unicellular organisms. The euk aryotic o rthologous g roups (KOGs) include proteins from 7 eukaryotic genomes: three animals (the nematode Caenorhabditis elegans, the fruit fly Drosophila melanogaster and Homo sapiens), one plant, Arabidopsis thaliana, two fungi (Saccharomyces cerevisiae and Schizosaccharomyces pombe), and the intracellular microsporidian parasite Encephalitozoon cuniculi. The current KOG set consists of 4852 clusters of orthologs, which include 59,838 proteins, or ~54% of the analyzed eukaryotic 110,655 gene products. Compared to the coverage of the prokaryotic genomes with COGs, a considerably smaller fraction of eukaryotic genes could be included into the KOGs; addition of new eukaryotic genomes is expected to result in substantial increase in the coverage of eukaryotic genomes with KOGs. Examination of the phyletic patterns of KOGs reveals a conserved core represented in all analyzed species and consisting of ~20% of the KOG set. This conserved portion of the KOG set is much greater than the ubiquitous portion of the COG set (~1% of the COGs). In part, this difference is probably due to the small number of included eukaryotic genomes, but it could also reflect the relative compactness of eukaryotes as a clade and the greater evolutionary stability of eukaryotic genomes. The updated collection of orthologous protein sets for prokaryotes and eukaryotes is expected to be a useful platform for functional annotation of newly sequenced genomes, including those of complex eukaryotes, and genome-wide evolutionary studies.

...read moreread less

4,167 citations

Journal Article•DOI•

Systematic functional analysis of the Caenorhabditis elegans genome using RNAi

[...]

Ravi S. Kamath¹, Andrew G. Fraser¹, Andrew G. Fraser², Yan Dong¹, Gino B. Poulin¹, Richard Durbin², Monica Gotta¹, Alexander Kanapin³, Nathalie Le Bot¹, Sergio Moreno¹, Sergio Moreno⁴, Marc Sohrmann², David P. Welchman¹, Peder Zipperlen¹, Julie Ahringer¹ - Show less +11 more•Institutions (4)

University of Cambridge¹, Wellcome Trust Sanger Institute², European Bioinformatics Institute³, Spanish National Research Council⁴

16 Jan 2003-Nature

TL;DR: It is found that genes of similar functions are clustered in distinct, multi-megabase regions of individual chromosomes; genes in these regions tend to share transcriptional profiles.

...read moreread less

Abstract: A principal challenge currently facing biologists is how to connect the complete DNA sequence of an organism to its development and behaviour. Large-scale targeted-deletions have been successful in defining gene functions in the single-celled yeast Saccharomyces cerevisiae, but comparable analyses have yet to be performed in an animal. Here we describe the use of RNA interference to inhibit the function of ∼86% of the 19,427 predicted genes of C. elegans. We identified mutant phenotypes for 1,722 genes, about two-thirds of which were not previously associated with a phenotype. We find that genes of similar functions are clustered in distinct, multi-megabase regions of individual chromosomes; genes in these regions tend to share transcriptional profiles. Our resulting data set and reusable RNAi library of 16,757 bacterial clones will facilitate systematic analyses of the connections among gene sequence, chromosomal location and gene function in C. elegans.

...read moreread less

3,529 citations

Journal Article•DOI•

Cpf1 is a single RNA-guided endonuclease of a class 2 CRISPR-Cas system.

[...]

Bernd Zetsche, Jonathan S. Gootenberg, Omar O. Abudayyeh, Ian Slaymaker, Kira S. Makarova¹, Patrick Essletzbichler, Sara E. Volz, Julia Joung, John van der Oost², Aviv Regev³, Aviv Regev⁴, Eugene V. Koonin¹, Feng Zhang - Show less +9 more•Institutions (4)

National Institutes of Health¹, Wageningen University and Research Centre², Broad Institute³, Massachusetts Institute of Technology⁴

22 Oct 2015-Cell

TL;DR: In this paper, the authors characterized Cpf1, a putative class 2 CRISPR effector, which is a single RNA-guided endonuclease lacking tracrRNA and utilizes a T-rich protospacer-adjacent motif.

...read moreread less

3,436 citations

Journal Article•DOI•

Bioactive microbial metabolites.

[...]

János Bérdy

01 Jan 2005-The Journal of Antibiotics

TL;DR: The short history, specific features and future prospects of research of microbial metabolites, including antibiotics and other bioactive metabolites, are summarized.

...read moreread less

Abstract: The short history, specific features and future prospects of research of microbial metabolites, including antibiotics and other bioactive metabolites, are summarized. The microbial origin, diversity of producing species, functions and various bioactivities of metabolites, unique features of their chemical structures are discussed, mainly on the basis of statistical data. The possible numbers of metabolites may be discovered in the future, the problems of dereplication of newly isolated compounds as well as the new trends and prospects of the research are also discussed.

...read moreread less

2,706 citations

1
2
3
4
…
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198
199
200

Collapse