Showing papers by "J. Craig Venter Institute published in 2005"

PDF

Open Access

Journal Article•DOI•

The map-based sequence of the rice genome

[...]

Takashi Matsumoto¹, Jianzhong Wu¹, Hiroyuki Kanamori¹, Yuichi Katayose¹ +262 more•Institutions (25)

11 Aug 2005-Nature

TL;DR: A map-based, finished quality sequence that covers 95% of the 389 Mb rice genome, including virtually all of the euchromatin and two complete centromeres, and finds evidence for widespread and recurrent gene transfer from the organelles to the nuclear chromosomes.

...read moreread less

Abstract: Rice, one of the world's most important food plants, has important syntenic relationships with the other cereal species and is a model plant for the grasses. Here we present a map-based, finished quality sequence that covers 95% of the 389 Mb genome, including virtually all of the euchromatin and two complete centromeres. A total of 37,544 non-transposable-element-related protein-coding genes were identified, of which 71% had a putative homologue in Arabidopsis. In a reciprocal analysis, 90% of the Arabidopsis proteins had a putative homologue in the predicted rice proteome. Twenty-nine per cent of the 37,544 predicted genes appear in clustered gene families. The number and classes of transposable elements found in the rice genome are consistent with the expansion of syntenic regions in the maize and sorghum genomes. We find evidence for widespread and recurrent gene transfer from the organelles to the nuclear chromosomes. The map-based sequence has proven useful for the identification of genes underlying agronomic traits. The additional single-nucleotide polymorphisms and simple sequence repeats identified in our study should accelerate improvements in rice production.

...read moreread less

3,423 citations

Journal Article•DOI•

The Transcriptional Landscape of the Mammalian Genome

[...]

Piero Carninci, Takeya Kasukawa¹, Shintaro Katayama, Julian Gough +194 more•Institutions (36)

02 Sep 2005-Science

TL;DR: Detailed polling of transcription start and termination sites and analysis of previously unidentified full-length complementary DNAs derived from the mouse genome provide a comprehensive platform for the comparative analysis of mammalian transcriptional regulation in differentiation and development.

...read moreread less

Abstract: This study describes comprehensive polling of transcription start and termination sites and analysis of previously unidentified full-length complementary DNAs derived from the mouse genome. We identify the 5' and 3' boundaries of 181,047 transcripts with extensive variation in transcripts arising from alternative promoter usage, splicing, and polyadenylation. There are 16,247 new mouse protein-coding transcripts, including 5154 encoding previously unidentified proteins. Genomic mapping of the transcriptome reveals transcriptional forests, with overlapping transcription on both strands, separated by deserts in which few transcripts are observed. The data provide a comprehensive platform for the comparative analysis of mammalian transcriptional regulation in differentiation and development.

...read moreread less

3,412 citations

Journal Article•DOI•

Genome analysis of multiple pathogenic isolates of Streptococcus agalactiae: Implications for the microbial “pan-genome”

[...]

Hervé Tettelin, Vega Masignani, Michael J. Cieslewicz¹, Claudio Donati, Duccio Medini, Naomi L. Ward², Samuel V. Angiuoli³, Jonathan Crabtree³, Amanda L. Jones⁴, A. Scott Durkin³, Robert T. DeBoy³, Tanja M. Davidsen³, Marirosa Mora, Maria Scarselli, Immaculada Margarit Y Ros, Jeremy Peterson³, Christopher R. Hauser³, Jaideep P. Sundaram³, William C. Nelson³, Ramana Madupu³, Lauren M. Brinkac³, Robert J. Dodson³, M. J. Rosovitz³, Steven A. Sullivan³, Sean C. Daugherty³, Daniel H. Haft³, Jeremy D. Selengut³, Michelle L. Gwinn³, Liwei Zhou³, Nikhat Zafar³, Hoda Khouri³, Diana Radune³, George Dimitrov³, Kisha Watkins³, Kevin J. B. O'Connor⁵, Shannon Smith³, Teresa Utterback³, Owen White³, Craig E. Rubens⁴, Guido Grandi, Lawrence C. Madoff¹, Dennis L. Kasper¹, John L. Telford, Michael R. Wessels¹, Rino Rappuoli, Claire M. Fraser⁶ - Show less +42 more•Institutions (6)

Harvard University¹, University of Maryland, Baltimore County², J. Craig Venter Institute³, Boston Children's Hospital⁴, Johns Hopkins University⁵, George Washington University⁶

27 Sep 2005-Proceedings of the National Academy of Sciences of the United States of America

TL;DR: The genomic sequence of six strains representing the five major disease-causing serotypes of Streptococcus agalactiae, the main cause of neonatal infection in humans, was generated and Mathematical extrapolation of the data suggests that the gene reservoir available for inclusion in the S. agalactic pan-genome is vast and that unique genes will continue to be identified even after sequencing hundreds of genomes.

...read moreread less

Abstract: The development of efficient and inexpensive genome sequencing methods has revolutionized the study of human bacterial pathogens and improved vaccine design. Unfortunately, the sequence of a single genome does not reflect how genetic variability drives pathogenesis within a bacterial species and also limits genome-wide screens for vaccine candidates or for antimicrobial targets. We have generated the genomic sequence of six strains representing the five major disease-causing serotypes of Streptococcus agalactiae, the main cause of neonatal infection in humans. Analysis of these genomes and those available in databases showed that the S. agalactiae species can be described by a pan-genome consisting of a core genome shared by all isolates, accounting for ≈80% of any single genome, plus a dispensable genome consisting of partially shared and strain-specific genes. Mathematical extrapolation of the data suggests that the gene reservoir available for inclusion in the S. agalactiae pan-genome is vast and that unique genes will continue to be identified even after sequencing hundreds of genomes.

...read moreread less

2,092 citations

Journal Article•DOI•

The Genome of the African Trypanosome Trypanosoma brucei

[...]

Matthew Berriman¹, Elodie Ghedin², Elodie Ghedin³, Christiane Hertz-Fowler¹, Gaëlle Blandin³, Hubert Renauld¹, Daniella Castanheira Bartholomeu³, Nicola Lennard¹, Elisabet Caler³, N. Hamlin¹, Brian J. Haas³, Ulrike Böhme¹, Linda Hannick³, Martin Aslett¹, Joshua Shallom³, Lucio Marcello⁴, Lihua Hou³, Bill Wickstead⁵, U. Cecilia M. Alsmark⁶, Claire Arrowsmith¹, Rebecca Atkin¹, Andrew Barron¹, Frédéric Bringaud⁷, Karen Brooks¹, Mark Carrington⁸, Inna Cherevach¹, Tracey-Jane Chillingworth¹, Carol Churcher¹, Louise Clark¹, Craig Corton¹, Ann Cronin¹, Robert L. Davies¹, Jonathon Doggett¹, Appolinaire Djikeng³, Tamara Feldblyum³, Mark C. Field⁸, Audrey Fraser¹, Ian Goodhead¹, Zahra Hance¹, David Harper¹, Barbara Harris¹, Heidi Hauser¹, Jessica B. Hostetler³, Al Ivens¹, Kay Jagels¹, David W. Johnson¹, Justin Johnson³, Kristine Jones³, Arnaud Kerhornou¹, Hean Koo³, Natasha Larke¹, Scott M. Landfear⁹, Christopher Larkin³, Vanessa Leech⁸, Alexandra Line¹, Angela Lord¹, Annette MacLeod⁴, P. Mooney¹, Sharon Moule¹, David M. A. Martin¹⁰, Gareth W. Morgan¹¹, Karen Mungall¹, Halina Norbertczak¹, Doug Ormond¹, Grace Pai³, Christopher S. Peacock¹, Jeremy Peterson³, Michael A. Quail¹, Ester Rabbinowitsch¹, Marie-Adèle Rajandream¹, Chris P Reitter⁸, Steven L. Salzberg³, Mandy Sanders¹, Seth Schobel³, Sarah Sharp¹, Mark Simmonds¹, Anjana J. Simpson³, Luke J. Tallon³, C. Michael R. Turner⁴, Andrew Tait⁴, Adrian Tivey¹, Susan Van Aken³, Danielle Walker¹, David Wanless³, Shiliang Wang³, Brian White¹, Owen White³, Sally Whitehead¹, John Woodward¹, Jennifer R. Wortman³, Mark Raymond Adams¹², T. Martin Embley⁶, Keith Gull⁵, Elisabetta Ullu¹³, J. David Barry⁴, Alan H. Fairlamb¹⁰, Fred R. Opperdoes¹⁴, Barclay G. Barrell¹, John E. Donelson¹⁵, Neil Hall³, Neil Hall¹⁶, Claire M. Fraser³, Sara E. Melville⁸, Najib M. El-Sayed², Najib M. El-Sayed³ - Show less +101 more•Institutions (16)

Wellcome Trust Sanger Institute¹, George Washington University², J. Craig Venter Institute³, University of Glasgow⁴, University of Oxford⁵, Newcastle University⁶, University of Bordeaux⁷, University of Cambridge⁸, Oregon Health & Science University⁹, University of Dundee¹⁰, Imperial College London¹¹, Case Western Reserve University¹², Yale University¹³, Université catholique de Louvain¹⁴, University of Iowa¹⁵, Wellcome Trust¹⁶

15 Jul 2005-Science

TL;DR: Comparisons of the cytoskeleton and endocytic trafficking systems of Trypanosoma brucei with those of humans and other eukaryotic organisms reveal major differences.

...read moreread less

Abstract: African trypanosomes cause human sleeping sickness and livestock trypanosomiasis in sub-Saharan Africa. We present the sequence and analysis of the 11 megabase-sized chromosomes of Trypanosoma brucei. The 26-megabase genome contains 9068 predicted genes, including ∼900 pseudogenes and ∼1700 T. brucei–specific genes. Large subtelomeric arrays contain an archive of 806 variant surface glycoprotein (VSG) genes used by the parasite to evade the mammalian immune system. Most VSG genes are pseudogenes, which may be used to generate expressed mosaic genes by ectopic recombination. Comparisons of the cytoskeleton and endocytic trafficking systems with those of humans and other eukaryotic organisms reveal major differences. A comparison of metabolic pathways encoded by the genomes of T. brucei, T. cruzi, and Leishmania major reveals the least overall metabolic capability in T. brucei and the greatest in L. major. Horizontal transfer of genes of bacterial origin has contributed to some of the metabolic differences in these parasites, and a number of novel potential drug targets have been identified.

...read moreread less

1,631 citations

Journal Article•DOI•

Genomic sequence of the pathogenic and allergenic filamentous fungus Aspergillus fumigatus

[...]

William C. Nierman¹, William C. Nierman², Arnab Pain³, Michael J. Anderson⁴, Jennifer R. Wortman¹, Jennifer R. Wortman², H. Stanley Kim², H. Stanley Kim¹, Javier Arroyo⁵, Matthew Berriman³, Keietsu Abe⁶, David B. Archer⁷, Clara Bermejo⁵, Joan W. Bennett⁸, Paul Bowyer⁴, Dan Chen², Dan Chen¹, Matthew Collins³, Richard Coulsen, Robert L. Davies³, Paul S. Dyer⁷, Mark L. Farman⁹, Nadia Fedorova², Nadia Fedorova¹, Natalie D. Fedorova², Natalie D. Fedorova¹, T. Feldblyum¹, T. Feldblyum², Reinhard Fischer¹⁰, Nigel Fosker³, Audrey Fraser³, José Luis García¹¹, María Josefa Marcos García¹², Ariette Goble³, Gustavo H. Goldman¹³, Katsuya Gomi⁶, Sam Griffith-Jones³, R. Gwilliam³, Brian J. Haas², Brian J. Haas¹, Hubertus Haas¹⁴, David Harris³, H. Horiuchi¹⁵, Jiaqi Huang¹, Jiaqi Huang², Sean Humphray³, Javier Jiménez¹², Nancy P. Keller¹⁵, H. Khouri¹, H. Khouri², Katsuhiko Kitamoto¹⁶, Tetsuo Kobayashi¹⁷, Sven Konzack¹⁰, Resham Kulkarni², Resham Kulkarni¹, Toshitaka Kumagai¹⁸, Anne Lafton¹⁹, Jean-Paul Latgé¹⁹, Weixi Li⁹, Angela Lord³, Charles Lu², Charles Lu¹, William H. Majoros², William H. Majoros¹, Gregory S. May²⁰, Bruce L. Miller²¹, Yasmin Ali Mohamoud¹, Yasmin Ali Mohamoud², María Molina⁵, Michel Monod²², Isabelle Mouyna¹⁹, Stephanie Mulligan², Stephanie Mulligan¹, Lee Murphy³, Susan O'Neil³, Ian T. Paulsen¹, Ian T. Paulsen², Miguel A. Peñalva¹¹, Mihaela Pertea², Mihaela Pertea¹, Claire Price³, Bethan L. Pritchard⁴, Michael A. Quail³, Ester Rabbinowitsch³, Neil Rawlins³, Marie Adele Rajandream³, Utz Reichard²³, Hubert Renauld³, Geoffrey D. Robson⁴, Santiago Rodríguez de Córdoba¹¹, José Manuel Rodríguez-Peña⁵, Catherine M. Ronning¹, Catherine M. Ronning², Simon Rutter³, Steven L. Salzberg², Steven L. Salzberg¹, Miguel del Nogal Sánchez¹², Juan C. Sánchez-Ferrero¹¹, David L. Saunders³, Kathy Seeger³, Rob Squares³, S. Squares³, Michio Takeuchi²⁴, Fredj Tekaia¹⁹, Geoffrey Turner²⁵, Carlos R. Vázquez de Aldana¹², J. Weidman², J. Weidman¹, Owen White², Owen White¹, John Woodward³, Jae-Hyuk Yu¹⁵, Claire M. Fraser², Claire M. Fraser¹, James E. Galagan²⁶, Kiyoshi Asai¹⁸, Masayuki Machida¹⁸, Neil Hall², Neil Hall³, Bart Barrell³, David W. Denning⁴ - Show less +117 more•Institutions (26)

22 Dec 2005-Nature

TL;DR: The Af293 genome sequence provides an unparalleled resource for the future understanding of this remarkable fungus and revealed temperature-dependent expression of distinct sets of genes, as well as 700 A. fumigatus genes not present or significantly diverged in the closely related sexual species Neosartorya fischeri, many of which may have roles in the pathogenicity phenotype.

...read moreread less

Abstract: Aspergillus fumigatus is exceptional among microorganisms in being both a primary and opportunistic pathogen as well as a major allergen. Its conidia production is prolific, and so human respiratory tract exposure is almost constant. A. fumigatus is isolated from human habitats and vegetable compost heaps. In immunocompromised individuals, the incidence of invasive infection can be as high as 50% and the mortality rate is often about 50% (ref. 2). The interaction of A. fumigatus and other airborne fungi with the immune system is increasingly linked to severe asthma and sinusitis. Although the burden of invasive disease caused by A. fumigatus is substantial, the basic biology of the organism is mostly obscure. Here we show the complete 29.4-megabase genome sequence of the clinical isolate Af293, which consists of eight chromosomes containing 9,926 predicted genes. Microarray analysis revealed temperature-dependent expression of distinct sets of genes, as well as 700 A. fumigatus genes not present or significantly diverged in the closely related sexual species Neosartorya fischeri, many of which may have roles in the pathogenicity phenotype. The Af293 genome sequence provides an unparalleled resource for the future understanding of this remarkable fungus.

...read moreread less

1,356 citations

Journal Article•DOI•

The genome sequence of Trypanosoma cruzi, etiologic agent of Chagas disease

[...]

Najib M. El-Sayed¹, Peter J. Myler², Peter J. Myler³, Daniella Castanheira Bartholomeu⁴, Daniel Nilsson⁵, Gautam Aggarwal², Anh-Nhi Tran⁵, Elodie Ghedin¹, Elizabeth A. Worthey², Arthur L. Delcher, Gaëlle Blandin⁴, Scott J. Westenberger⁶, Elisabet Caler⁴, Gustavo C. Cerqueira⁷, Carole Branche⁵, Brian J. Haas⁴, Atashi Anupama², Erik Arner⁵, Lena Åslund⁸, Philip Attipoe², Esteban J. Bontempi⁵, Frédéric Bringaud⁹, Peter Burton¹⁰, Eithon Cadag², David A. Campbell⁶, Mark Carrington¹¹, Jonathan Crabtree⁴, Hamid Darban⁵, José Franco da Silveira¹², Pieter J. de Jong¹³, Kimberly Edwards⁵, Paul T. Englund¹⁴, Gholam Fazelina², Tamara Feldblyum⁴, Marcela Ferella⁵, Alberto C.C. Frasch¹⁵, Keith Gull¹⁶, David Horn¹⁷, Lihua Hou⁴, Yiting Huang², Ellen Kindlund⁵, Michele M. Klingbeil¹⁸, Sindy Kluge⁵, Hean Koo⁴, Daniela R. Lacerda¹⁹, Mariano J. Levin²⁰, Hernan Lorenzi²⁰, Tin Louie², Carlos Renato Machado⁷, Richard McCulloch¹⁰, Alan McKenna⁵, Yumi Mizuno⁵, Jeremy C. Mottram¹⁰, Siri Nelson², Stephen Ochaya⁵, Kazutoyo Osoegawa¹³, Grace Pai⁴, Marilyn Parsons², Marilyn Parsons³, Martin Pentony², Ulf Pettersson⁸, Mihai Pop⁴, José Luis Ramírez²¹, Joel Rinta², Laura Robertson², Steven L. Salzberg, Daniel O. Sánchez¹⁵, Amber Seyler², Reuben Sunil Kumar Sharma¹¹, Jyoti Shetty⁴, Anjana J. Simpson⁴, Ellen Sisk², Martti T. Tammi²², Martti T. Tammi⁵, Rick L. Tarleton²³, Santuza M. R. Teixeira⁷, Susan Van Aken⁴, Christy Vogt², Pauline N. Ward¹⁰, Bill Wickstead¹⁶, Jennifer R. Wortman⁴, Owen White⁴, Claire M. Fraser⁴, Kenneth Stuart², Kenneth Stuart³, Björn Andersson⁵ - Show less +82 more•Institutions (23)

15 Jul 2005-Science

TL;DR: Although the Tritryp lack several classes of signaling molecules, their kinomes contain a large and diverse set of protein kinases and phosphatases; their size and diversity imply previously unknown interactions and regulatory processes, which may be targets for intervention.

...read moreread less

Abstract: Whole-genome sequencing of the protozoan pathogen Trypanosoma cruzi revealed that the diploid genome contains a predicted 22,570 proteins encoded by genes, of which 12,570 represent allelic pairs. Over 50% of the genome consists of repeated sequences, such as retrotransposons and genes for large families of surface molecules, which include trans-sialidases, mucins, gp63s, and a large novel family (>1300 copies) of mucin-associated surface protein (MASP) genes. Analyses of the T. cruzi, T. brucei, and Leishmania major (Tritryp) genomes imply differences from other eukaryotes in DNA repair and initiation of replication and reflect their unusual mitochondrial DNA. Although the Tritryp lack several classes of signaling molecules, their kinomes contain a large and diverse set of protein kinases and phosphatases; their size and diversity imply previously unknown interactions and regulatory processes, which may be targets for intervention.

...read moreread less

1,349 citations

Journal Article•DOI•

Sequencing of Aspergillus nidulans and comparative analysis with A. fumigatus and A. oryzae

[...]

James E. Galagan¹, Sarah E. Calvo¹, Christina A. Cuomo¹, Li-Jun Ma¹, Jennifer R. Wortman², Serafim Batzoglou³, Su-In Lee³, Meray Baştürkmen⁴, Christina C. Spevak⁴, John Clutterbuck⁵, Vladimir V. Kapitonov⁶, Jerzy Jurka⁶, Claudio Scazzocchio⁷, Mark L. Farman⁸, Jonathan Butler¹, Seth Purcell¹, Steve Harris⁹, Gerhard H. Braus¹⁰, Oliver W. Draht¹⁰, Silke Busch¹⁰, Christophe d'Enfert¹¹, Christiane Bouchier¹¹, Gustavo H. Goldman¹², Deborah Bell-Pedersen¹³, Sam Griffiths-Jones¹⁴, John H. Doonan¹⁵, Jae-Hyuk Yu¹⁶, Kay Vienken¹⁷, Arnab Pain¹⁴, Michael Freitag¹⁸, Eric U. Selker¹⁸, David B. Archer¹⁹, Miguel A. Peñalva²⁰, Berl R. Oakley²¹, Michelle Momany²², Toshihiro Tanaka²³, Toshitaka Kumagai²⁴, Kiyoshi Asai²⁴, Masayuki Machida²⁴, William C. Nierman²⁵, David W. Denning²⁶, Mark X. Caddick²⁷, Michael J. Hynes²⁸, Mathieu Paoletti¹⁹, Reinhard Fischer²⁹, Reinhard Fischer¹⁷, Bruce L. Miller³⁰, Paul S. Dyer¹⁹, Matthew S. Sachs⁴, Stephen A. Osmani²¹, Bruce W. Birren¹ - Show less +47 more•Institutions (30)

Broad Institute¹, J. Craig Venter Institute², Stanford University³, Oregon Health & Science University⁴, University of Glasgow⁵, Genetic Information Research Institute⁶, Institut Universitaire de France⁷, University of Kentucky⁸, University of Nebraska–Lincoln⁹, University of Göttingen¹⁰, Pasteur Institute¹¹, University of São Paulo¹², Texas A&M University¹³, Wellcome Trust Sanger Institute¹⁴, John Innes Centre¹⁵, University of Wisconsin-Madison¹⁶, Max Planck Society¹⁷, University of Oregon¹⁸, University of Nottingham¹⁹, Spanish National Research Council²⁰, Ohio State University²¹, University of Georgia²², Tokyo Institute of Technology²³, National Institute of Advanced Industrial Science and Technology²⁴, George Washington University²⁵, University of Manchester²⁶, University of Liverpool²⁷, University of Melbourne²⁸, Karlsruhe Institute of Technology²⁹, University of Idaho³⁰

22 Dec 2005-Nature

TL;DR: The aspergilli comprise a diverse group of filamentous fungi spanning over 200 million years of evolution, and a comparative study with Aspergillus fumigatus and As pergillus oryzae, used in the production of sake, miso and soy sauce, provides new insight into eukaryotic genome evolution and gene regulation.

...read moreread less

Abstract: The aspergilli comprise a diverse group of filamentous fungi spanning over 200 million years of evolution. Here we report the genome sequence of the model organism Aspergillus nidulans, and a comparative study with Aspergillus fumigatus, a serious human pathogen, and Aspergillus oryzae, used in the production of sake, miso and soy sauce. Our analysis of genome structure provided a quantitative evaluation of forces driving long-term eukaryotic genome evolution. It also led to an experimentally validated model of mating-type locus evolution, suggesting the potential for sexual reproduction in A. fumigatus and A. oryzae. Our analysis of sequence conservation revealed over 5,000 non-coding regions actively conserved across all three species. Within these regions, we identified potential functional elements including a previously uncharacterized TPP riboswitch and motifs suggesting regulation in filamentous fungi by Puf family genes. We further obtained comparative and experimental evidence indicating widespread translational regulation by upstream open reading frames. These results enhance our understanding of these widely studied fungi as well as provide new insight into eukaryotic genome evolution and gene regulation.

...read moreread less

1,297 citations

Journal Article•DOI•

Genome sequencing and analysis of Aspergillus oryzae

[...]

Masayuki Machida¹, Kiyoshi Asai¹, Motoaki Sano¹, Toshihiro Tanaka², Toshitaka Kumagai¹, Goro Terai¹, Goro Terai³, Ken Ichi Kusumoto, Toshihide Arima, Osamu Akita, Yutaka Kashiwagi, Keietsu Abe⁴, Katsuya Gomi⁴, Hiroyuki Horiuchi⁵, Katsuhiko Kitamoto⁵, Tetsuo Kobayashi⁶, Michio Takeuchi⁷, David W. Denning⁸, James E. Galagan⁹, William C. Nierman¹⁰, Jiujiang Yu¹¹, David B. Archer¹², Joan W. Bennett¹³, Deepak Bhatnagar¹¹, Thomas E. Cleveland¹¹, Natalie D. Fedorova¹⁴, Osamu Gotoh¹, Hiroshi Horikawa², Akira Hosoyama², Masayuki Ichinomiya⁵, Rie Igarashi², Kazuhiro Iwashita, Praveen R. Juvvadi⁵, Masashi Kato⁶, Yumiko Kato², Taishin Kin¹, Akira Kokubun², Hiroshi Maeda⁴, Noriko Maeyama², Jun-ichi Maruyama⁵, Hideki Nagasaki¹, Tasuku Nakajima⁴, Ken Oda, Kinya Okada¹, Ian T. Paulsen¹⁴, Kazutoshi Sakamoto, Toshihiko Sawano², Mikio Takahashi², Kumiko Takase¹, Yasunobu Terabayashi¹, Jennifer R. Wortman¹⁴, Osamu Yamada, Youhei Yamagata⁴, Hideharu Anazawa, Yoji Hata, Yoshinao Koide, Takashi Komori³, Yasuji Koyama¹⁵, Toshitaka Minetoki, Sivasundaram Suharnan, Akimitsu Tanaka, Katsumi Isono², Satoru Kuhara¹⁶, Naotake Ogasawara¹⁷, Hisashi Kikuchi² - Show less +61 more•Institutions (17)

National Institute of Advanced Industrial Science and Technology¹, National Institute of Technology and Evaluation², Intec, Inc.³, Tohoku University⁴, University of Tokyo⁵, Nagoya University⁶, Tokyo University of Agriculture and Technology⁷, University of Manchester⁸, Broad Institute⁹, George Washington University¹⁰, Agricultural Research Service¹¹, University of Nottingham¹², Tulane University¹³, J. Craig Venter Institute¹⁴, Kikkoman¹⁵, Kyushu University¹⁶, Nara Institute of Science and Technology¹⁷

22 Dec 2005-Nature

TL;DR: Specific expansion of genes for secretory hydrolytic enzymes, amino acid metabolism and amino acid/sugar uptake transporters supports the idea that A. oryzae is an ideal microorganism for fermentation.

...read moreread less

Abstract: The genome of Aspergillus oryzae, a fungus important for the production of traditional fermented foods and beverages in Japan, has been sequenced. The ability to secrete large amounts of proteins and the development of a transformation system have facilitated the use of A. oryzae in modern biotechnology. Although both A. oryzae and Aspergillus flavus belong to the section Flavi of the subgenus Circumdati of Aspergillus, A. oryzae, unlike A. flavus, does not produce aflatoxin, and its long history of use in the food industry has proved its safety. Here we show that the 37-megabase (Mb) genome of A. oryzae contains 12,074 genes and is expanded by 7-9 Mb in comparison with the genomes of Aspergillus nidulans and Aspergillus fumigatus. Comparison of the three aspergilli species revealed the presence of syntenic blocks and A. oryzae-specific blocks (lacking synteny with A. nidulans and A. fumigatus) in a mosaic manner throughout the genome of A. oryzae. The blocks of A. oryzae-specific sequence are enriched for genes involved in metabolism, particularly those for the synthesis of secondary metabolites. Specific expansion of genes for secretory hydrolytic enzymes, amino acid metabolism and amino acid/sugar uptake transporters supports the idea that A. oryzae is an ideal microorganism for fermentation.

...read moreread less

1,149 citations

Journal Article•DOI•

Multiple-laboratory comparison of microarray platforms

[...]

Rafael A. Irizarry¹, Daniel S. Warren¹, Forrest Spencer¹, Irene F. Kim¹, Shyam Biswal¹, Bryan C. Frank², Edward Gabrielson¹, Joe G.N. Garcia¹, Joel Geoghegan³, Gregory G. Germino¹, Constance A. Griffin¹, Sara C. Hilmer⁴, Eric P. Hoffman⁴, Anne E. Jedlicka¹, Ernest S. Kawasaki³, Francisco Martinez-Murillo¹, Laura Morsberger¹, Hannah Lee¹, David Petersen³, John Quackenbush², John Quackenbush⁵, Alan F. Scott¹, Michael A Wilson, Yanqin Yang¹, Shui Qing Ye¹, Wayne Yu¹ - Show less +22 more•Institutions (5)

Johns Hopkins University¹, J. Craig Venter Institute², National Institutes of Health³, George Washington University⁴, Harvard University⁵

21 Apr 2005-Nature Methods

TL;DR: A consortium of ten laboratories from the Washington, DC–Baltimore, USA, area was formed to compare data obtained from three widely used platforms using identical RNA samples to demonstrate that there are relatively large differences in data obtained in labs using the same platform, but that the results from the best-performing labs agree rather well.

...read moreread less

Abstract: Microarray technology is a powerful tool for measuring RNA expression for thousands of genes at once. Various studies have been published comparing competing platforms with mixed results: some find agreement, others do not. As the number of researchers starting to use microarrays and the number of cross-platform meta-analysis studies rapidly increases, appropriate platform assessments become more important. Here we present results from a comparison study that offers important improvements over those previously described in the literature. In particular, we noticed that none of the previously published papers consider differences between labs. For this study, a consortium of ten laboratories from the Washington, DC–Baltimore, USA, area was formed to compare data obtained from three widely used platforms using identical RNA samples. We used appropriate statistical analysis to demonstrate that there are relatively large differences in data obtained in labs using the same platform, but that the results from the best-performing labs agree rather well.

...read moreread less

897 citations

Journal Article•DOI•

Comparative genomics of trypanosomatid parasitic protozoa.

[...]

Najib M. El-Sayed¹, Peter J. Myler², Peter J. Myler³, Gaëlle Blandin⁴, Matthew Berriman⁵, Jonathan Crabtree⁴, Gautam Aggarwal³, Elisabet Caler⁴, Hubert Renauld⁵, Elizabeth A. Worthey³, Christiane Hertz-Fowler⁵, Elodie Ghedin¹, Christopher S. Peacock⁵, Daniella Castanheira Bartholomeu⁴, Brian J. Haas⁴, Anh Nhi Tran⁶, Jennifer R. Wortman⁴, U. Cecilia M. Alsmark⁷, Samuel V. Angiuoli⁴, Atashi Anupama³, Jonathan H. Badger⁴, Frédéric Bringaud⁸, Eithon Cadag³, Jane M. Carlton, Gustavo C. Cerqueira⁹, Todd Creasy⁴, Arthur L. Delcher, Appolinaire Djikeng⁴, T. Martin Embley⁷, Christopher R. Hauser⁴, Alasdair Ivens⁵, Sarah K. Kummerfeld¹⁰, José B. Pereira-Leal¹⁰, Daniel Nilsson⁶, Jeremy Peterson⁴, Steven L. Salzberg, Joshua Shallom⁴, Joana C. Silva⁴, Jaideep P. Sundaram⁴, Scott J. Westenberger, Owen White⁴, Sara E. Melville¹¹, John E. Donelson¹², Björn Andersson⁶, Kenneth Stuart³, Kenneth Stuart², Neil Hall⁵ - Show less +43 more•Institutions (12)

George Washington University¹, University of Washington², Seattle Biomed³, J. Craig Venter Institute⁴, Wellcome Trust Sanger Institute⁵, Karolinska Institutet⁶, Newcastle University⁷, Centre national de la recherche scientifique⁸, Universidade Federal de Minas Gerais⁹, Medical Research Council¹⁰, University of Cambridge¹¹, University of Iowa¹²

15 Jul 2005-Science

TL;DR: No evidence that these species are descended from an ancestor that contained a photosynthetic endosymbiont is revealed, and a conserved core proteome of about 6200 genes in large syntenic polycistronic gene clusters is revealed.

...read moreread less

Abstract: A comparison of gene content and genome architecture of Trypanosoma brucei, Trypanosoma cruzi, and Leishmania major, three related pathogens with different life cycles and disease pathology, revealed a conserved core proteome of about 6200 genes in large syntenic polycistronic gene clusters. Many species-specific genes, especially large surface antigen families, occur at nonsyntenic chromosome-internal and subtelomeric regions. Retroelements, structural RNAs, and gene family expansion are often associated with syntenic discontinuities that-along with gene divergence, acquisition and loss, and rearrangement within the syntenic regions-have shaped the genomes of each parasite. Contrary to recent reports, our analyses reveal no evidence that these species are descended from an ancestor that contained a photosynthetic endosymbiont.

...read moreread less

761 citations

Journal Article•DOI•

The Genome of the Basidiomycetous Yeast and Human Pathogen Cryptococcus Neoformans

[...]

Brendan J. Loftus¹, Eula Fung², Paola Roncaglia³, Don Rowley², Paolo Amedeo¹, Dan Bruno², Jessica Vamathevan¹, Molly Miranda², Iain J. Anderson¹, James A. Fraser⁴, Jonathan E. Allen¹, Ian Bosdet, Michael R. Brent⁵, Readman Chiu, Tamara L. Doering⁵, Maureen J. Donlin⁶, Cletus D'Souza⁷, Deborah S. Fox⁸, Deborah S. Fox⁴, Viktoriya Grinberg¹, Jianmin Fu⁹, Marilyn Fukushima², Brian J. Haas¹, James Huang⁴, Guilhem Janbon¹⁰, Steven J.M. Jones, Hean L. Koo¹, Martin Krzywinski, June Kwon-Chung¹¹, Klaus B. Lengeler⁴, Klaus B. Lengeler¹², Rama Maiti¹, Marco A. Marra, Robert E. Marra¹³, Robert E. Marra⁴, Carrie Mathewson, Thomas G. Mitchell⁴, Mihaela Pertea¹, Florenta R. Riggs¹, Steven L. Salzberg¹, Jacqueline E. Schein, Alla Shvartsbeyn¹, Heesun Shin, Martin Shumway¹, Charles A. Specht¹⁴, Bernard B. Suh¹⁵, Aaron Tenney⁵, T. Utterback¹, Brian L. Wickes, Jennifer R. Wortman¹, Natasja Wye, James W. Kronstad⁷, Jennifer K. Lodge⁶, Joseph Heitman⁴, Ronald W. Davis², Claire M. Fraser¹, Richard W. Hyman² - Show less +53 more•Institutions (15)

J. Craig Venter Institute¹, Stanford University², International School for Advanced Studies³, Duke University⁴, Washington University in St. Louis⁵, Saint Louis University⁶, University of British Columbia⁷, Boston Children's Hospital⁸, University of Texas Health Science Center at San Antonio⁹, Pasteur Institute¹⁰, National Institutes of Health¹¹, University of Düsseldorf¹², Connecticut Agricultural Experiment Station¹³, Boston University¹⁴, University of California, Santa Cruz¹⁵

25 Feb 2005-Science

TL;DR: Comparison of two phenotypically distinct strains reveals variation in gene content in addition to sequence polymorphisms between the genomes, and the genome is rich in transposons, many of which cluster at candidate centromeric regions.

...read moreread less

Abstract: Cryptococcus neoformans is a basidionnycetous yeast ubiquitous in the environment, a model for fungal pathogenesis, and an opportunistic human pathogen of global importance. We have sequenced its similar to20-megabase genome, which contains similar to6500 intron-rich gene structures and encodes a transcriptome abundant in alternatively spliced and antisense messages. The genome is rich in transposons, many of which cluster at candidate centromeric regions. The presence of these transposons may drive karyotype instability and phenotypic variation. C. neoformans encodes unique genes that may contribute to its unusual virulence properties, and comparison of two phenotypically distinct strains reveals variation in gene content in addition to sequence polymorphisms between the genomes.

...read moreread less

Journal Article•DOI•

The Wolbachia genome of Brugia malayi: endosymbiont evolution within a human pathogenic nematode.

[...]

Jeremy M. Foster, Mehul B. Ganatra, Ibrahim H. Kamal¹, Jennifer Ware, Kira S. Makarova², Natalia Ivanova³, Anamitra Bhattacharyya, Vinayak Kapatral, Sanjay Kumar, Janos Posfai, Tamas Vincze, Jessica Ingram, Laurie S. Moran, Alla Lapidus³, Marina V. Omelchenko², Nikos C. Kyrpides³, Elodie Ghedin, Shiliang Wang⁴, Eugene Goltsman³, Victor Joukov, Olga Ostrovskaya⁵, Kiryl Tsukerman, Mikhail Mazur, Donald Comb, Eugene V. Koonin², Barton E. Slatko - Show less +22 more•Institutions (5)

Ain Shams University¹, National Institutes of Health², United States Department of Energy³, J. Craig Venter Institute⁴, Case Western Reserve University⁵

29 Mar 2005-PLOS Biology

TL;DR: Analysis of this first sequenced endosymbiont genome from a filarial nematode provides insight into endosYmbionT evolution and additionally provides new potential targets for elimination of cutaneous and lymphatic human filarial disease.

...read moreread less

Abstract: Complete genome DNA sequence and analysis is presented for Wolbachia, the obligate alpha-proteobacterial endosymbiont required for fertility and survival of the human filarial parasitic nematode Brugia malayi. Although, quantitatively, the genome is even more degraded than those of closely related Rickettsia species, Wolbachia has retained more intact metabolic pathways. The ability to provide riboflavin, flavin adenine dinucleotide, heme, and nucleotides is likely to be Wolbachia's principal contribution to the mutualistic relationship, whereas the host nematode likely supplies amino acids required for Wolbachia growth. Genome comparison of the Wolbachia endosymbiont of B. malayi (wBm) with the Wolbachia endosymbiont of Drosophila melanogaster (wMel) shows that they share similar metabolic trends, although their genomes show a high degree of genome shuffling. In contrast to wMel, wBm contains no prophage and has a reduced level of repeated DNA. Both Wolbachia have lost a considerable number of membrane biogenesis genes that apparently make them unable to synthesize lipid A, the usual component of proteobacterial membranes. However, differences in their peptidoglycan structures may reflect the mutualistic lifestyle of wBm in contrast to the parasitic lifestyle of wMel. The smaller genome size of wBm, relative to wMel, may reflect the loss of genes required for infecting host cells and avoiding host defense systems. Analysis of this first sequenced endosymbiont genome from a filarial nematode provides insight into endosymbiont evolution and additionally provides new potential targets for elimination of cutaneous and lymphatic human filarial disease.

...read moreread less

Journal Article•DOI•

Patellamide A and C biosynthesis by a microcin-like pathway in Prochloron didemni, the cyanobacterial symbiont of Lissoclinum patella.

[...]

Eric W. Schmidt¹, James T. Nelson¹, David A. Rasko², Sebastian Sudek³, Jonathan A. Eisen², Margo G. Haygood³, Jacques Ravel² - Show less +3 more•Institutions (3)

University of Utah¹, J. Craig Venter Institute², University of California, San Diego³

17 May 2005-Proceedings of the National Academy of Sciences of the United States of America

TL;DR: The full sequencing and functional expression of a marine natural-product pathway from an obligate symbiont is presented, and a related cluster was identified in Trichodesmium erythraeum IMS101, an important bloom-forming cyanobacterium.

...read moreread less

Abstract: Prochloron spp. are obligate cyanobacterial symbionts of many didemnid family ascidians. It has been proposed that the cyclic peptides of the patellamide class found in didemnid extracts are synthesized by Prochloron spp., but studies in which host and symbiont cells are separated and chemically analyzed to identify the biosynthetic source have yielded inconclusive results. As part of the Prochloron didemni sequencing project, we identified patellamide biosynthetic genes and confirmed their function by heterologous expression of the whole pathway in Escherichia coli. The primary sequence of patellamides A and C is encoded on a single ORF that resembles a precursor peptide. We propose that this prepatellamide is heterocyclized to form thiazole and oxazoline rings, and the peptide is cleaved to yield the two cyclic patellamides, A and C. This work represents the full sequencing and functional expression of a marine natural-product pathway from an obligate symbiont. In addition, a related cluster was identified in Trichodesmium erythraeum IMS101, an important bloom-forming cyanobacterium.

...read moreread less

Journal Article•DOI•

Standardizing global gene expression analysis between laboratories and across platforms

[...]

Theodore Bammler¹, Richard P. Beyer¹, Sanchita Bhattacharya², Gary A. Boorman³, Abee L. Boyles⁴, Blair U. Bradford⁵, Roger E. Bumgarner¹, Pierre R. Bushel³, Kabir Chaturvedi, Dongseok Choi⁶, Michael L. Cunningham³, Shibing Deng⁵, Holly K. Dressman⁴, Rickie D. Fannin³, Fredrico M. Farin¹, Jonathan H. Freedman⁴, Rebecca C. Fry², Angel Harper, Michael C. Humble³, Patrick Hurban, Terrance J. Kavanagh¹, William K. Kaufmann⁵, Kathleen F. Kerr¹, Li Jing⁷, Jodi Lapidus⁶, Michael R. Lasarev⁶, Jianying Li³, Yi-Ju Li⁴, Edward K. Lobenhofer, Xinfang Lu⁶, Renae L. Malek⁸, Sean Milton², Srinivasa R. Nagalla⁶, Jean P. O'Malley⁶, Valerie S. Palmer⁶, Patrick Pattee⁶, Richard S. Paules³, Charles M. Perou⁵, Ken Phillips, Li-Xuan Qin¹, Yang Qiu, Sean D. Quigley¹, Matthew Rodland⁶, Ivan Rusyn⁵, Leona D. Samson², David A. Schwartz⁴, Yan Shi⁵, Jung Lim Shin⁷, Stella O. Sieber³, Susan H. Slifer⁴, Marcy C. Speer⁴, Peter S. Spencer⁶, Dean I. Sproles⁶, James A. Swenberg⁵, William A. Suk³, Robert C. Sullivan⁷, Ru Tian⁵, Raymond W. Tennant³, Signe A. Todd⁶, Charles J. Tucker³, Bennett Van Houten³, Brenda K. Weis³, Shirley Xuan², Helmut Zarbl⁷ - Show less +60 more•Institutions (8)

University of Washington¹, Massachusetts Institute of Technology², National Institutes of Health³, Duke University⁴, University of North Carolina at Chapel Hill⁵, Oregon Health & Science University⁶, Fred Hutchinson Cancer Research Center⁷, J. Craig Venter Institute⁸

21 Apr 2005-Nature Methods

TL;DR: In this paper, the authors proposed a method for standardizing global gene expression analysis between laboratories and across platforms, which can be found in Section 5.2.1.1].

...read moreread less

Abstract: Addendum: Standardizing global gene expression analysis between laboratories and across platforms

...read moreread less

Journal Article•DOI•

Patterns and Implications of Gene Gain and Loss in the Evolution of Prochlorococcus

[...]

Gregory Carl Kettler¹, Adam C. Martiny¹, Katherine H. Huang¹, Jeremy Zucker², Maureen L. Coleman¹, Sébastien Rodrigue¹, Feng Chen³, Alla Lapidus³, Steven Ferriera⁴, Justin Johnson⁴, Claudia Steglich⁵, George M. Church², Paul G. Richardson³, Sallie W. Chisholm¹ - Show less +10 more•Institutions (5)

Massachusetts Institute of Technology¹, Harvard University², United States Department of Energy³, J. Craig Venter Institute⁴, University of Freiburg⁵

01 Jan 2005-PLOS Genetics

TL;DR: In this article, the authors describe the genomes of eight newly sequenced isolates and combine them with the first four genomes for a comprehensive analysis of the core (shared by all isolates) and flexible genes of the Prochlorococcus group, and the patterns of loss and gain of the flexible genes over the course of evolution.

...read moreread less

Abstract: Prochlorococcus is a marine cyanobacterium that numerically dominates the mid-latitude oceans and is the smallest known oxygenic phototroph. Numerous isolates from diverse areas of the world’s oceans have been studied and shown to be physiologically and genetically distinct. All isolates described thus far can be assigned to either a tightly clustered high-light (HL)-adapted clade, or a more divergent low-light (LL)-adapted group. The 16S rRNA sequences of the entire Prochlorococcus group differ by at most 3%, and the four initially published genomes revealed patterns of genetic differentiation that help explain physiological differences among the isolates. Here we describe the genomes of eight newly sequenced isolates and combine them with the first four genomes for a comprehensive analysis of the core (shared by all isolates) and flexible genes of the Prochlorococcus group, and the patterns of loss and gain of the flexible genes over the course of evolution. There are 1,273 genes that represent the core shared by all 12 genomes. They are apparently sufficient, according to metabolic reconstruction, to encode a functional cell. We describe a phylogeny for all 12 isolates by subjecting their complete proteomes to three different phylogenetic analyses. For each non-core gene, we used a maximum parsimony method to estimate which ancestor likely first acquired or lost each gene. Many of the genetic differences among isolates, especially for genes involved in outer membrane synthesis and nutrient transport, are found within the same clade. Nevertheless, we identified some genes defining HL and LL ecotypes, and clades within these broad ecotypes, helping to demonstrate the basis of HL and LL adaptations in Prochlorococcus. Furthermore, our estimates of gene gain events allow us to identify highly variable genomic islands that are not apparent through simple pairwise comparisons. These results emphasize the functional roles, especially those connected to outer membrane synthesis and transport that dominate the flexible genome and set it apart from the core. Besides identifying islands and demonstrating their role throughout the history of Prochlorococcus, reconstruction of past gene gains and losses shows that much of the variability exists at the ‘‘leaves of the tree,’’ between the most closely related strains. Finally, the identification of core and flexible genes from this 12-genome comparison is largely consistent with the relative frequency of Prochlorococcus genes found in global ocean metagenomic databases, further closing the gap between our understanding of these organisms in the lab and the wild.

...read moreread less

Journal Article•DOI•

Large-scale sequencing of human influenza reveals the dynamic nature of viral genome evolution

[...]

Elodie Ghedin, Naomi Sengamalay¹, Martin Shumway¹, Jennifer Zaborsky¹, Tamara Feldblyum¹, Vik Subbu¹, David J. Spiro¹, Jeff Sitz¹, Hean Koo¹, Pavel Bolotov², Dmitry Dernovoy², Tatiana Tatusova², Yiming Bao², Kirsten St. George³, Jill Taylor³, David J. Lipman², Claire M. Fraser¹, Jeffery K. Taubenberger⁴, Steven L. Salzberg⁵, Steven L. Salzberg⁶ - Show less +16 more•Institutions (6)

J. Craig Venter Institute¹, National Institutes of Health², Wadsworth Center³, Armed Forces Institute of Pathology⁴, Research Medical Center⁵, University of Maryland, College Park⁶

20 Oct 2005-Nature

TL;DR: A new, large-scale sequencing effort to provide a more comprehensive picture of the evolution of influenza viruses and of their pattern of transmission through human and animal populations is reported, encompassing a total of 2,821,103 nucleotides.

...read moreread less

Abstract: Influenza viruses are remarkably adept at surviving in the human population over a long timescale. The human influenza A virus continues to thrive even among populations with widespread access to vaccines, and continues to be a major cause of morbidity and mortality. The virus mutates from year to year, making the existing vaccines ineffective on a regular basis, and requiring that new strains be chosen for a new vaccine. Less-frequent major changes, known as antigenic shift, create new strains against which the human population has little protective immunity, thereby causing worldwide pandemics. The most recent pandemics include the 1918 'Spanish' flu, one of the most deadly outbreaks in recorded history, which killed 30-50 million people worldwide, the 1957 'Asian' flu, and the 1968 'Hong Kong' flu. Motivated by the need for a better understanding of influenza evolution, we have developed flexible protocols that make it possible to apply large-scale sequencing techniques to the highly variable influenza genome. Here we report the results of sequencing 209 complete genomes of the human influenza A virus, encompassing a total of 2,821,103 nucleotides. In addition to increasing markedly the number of publicly available, complete influenza virus genomes, we have discovered several anomalies in these first 209 genomes that demonstrate the dynamic nature of influenza transmission and evolution. This new, large-scale sequencing effort promises to provide a more comprehensive picture of the evolution of influenza viruses and of their pattern of transmission through human and animal populations. All data from this project are being deposited, without delay, in public archives.

...read moreread less

Journal Article•DOI•

Whole-Genome Analysis of Human Influenza A Virus Reveals Multiple Persistent Lineages and Reassortment among Recent H3N2 Viruses

[...]

Edward C. Holmes¹, Elodie Ghedin², Naomi Miller², Jill Taylor³, Yiming Bao⁴, Kirsten St. George³, Bryan T. Grenfell¹, Steven L. Salzberg², Claire M. Fraser², David J. Lipman⁴, Jeffery K. Taubenberger⁵ - Show less +7 more•Institutions (5)

Pennsylvania State University¹, J. Craig Venter Institute², Wadsworth Center³, National Institutes of Health⁴, Armed Forces Institute of Pathology⁵

26 Jul 2005-PLOS Biology

TL;DR: A phylogenetic analysis of 156 complete genomes of human H3N2 influenza A viruses collected between 1999 and 2004 from New York State, United States demonstrated that multiple lineages can co-circulate, persist, and reassort in epidemiologically significant ways, and underscore the importance of genomic analyses for future influenza surveillance.

...read moreread less

Abstract: Understanding the evolution of influenza A viruses in humans is important for surveillance and vaccine strain selection. We performed a phylogenetic analysis of 156 complete genomes of human H3N2 influenza A viruses collected between 1999 and 2004 from New York State, United States, and observed multiple co-circulating clades with different population frequencies. Strikingly, phylogenies inferred for individual gene segments revealed that multiple reassortment events had occurred among these clades, such that one clade of H3N2 viruses present at least since 2000 had provided the hemagglutinin gene for all those H3N2 viruses sampled after the 2002–2003 influenza season. This reassortment event was the likely progenitor of the antigenically variant influenza strains that caused the A/Fujian/411/2002-like epidemic of the 2003–2004 influenza season. However, despite sharing the same hemagglutinin, these phylogenetically distinct lineages of viruses continue to co-circulate in the same population. These data, derived from the first large-scale analysis of H3N2 viruses, convincingly demonstrate that multiple lineages can co-circulate, persist, and reassort in epidemiologically significant ways, and underscore the importance of genomic analyses for future influenza surveillance.

...read moreread less

Journal Article•DOI•

Genome sequence of Theileria parva, a bovine pathogen that transforms lymphocytes.

[...]

Malcolm J. Gardner¹, Richard P. Bishop², Trushar Shah², Etienne P. de Villiers², Jane M. Carlton¹, Neil Hall¹, Qinghu Ren¹, Ian T. Paulsen¹, Arnab Pain³, Matthew Berriman³, Robert John Macleod Wilson⁴, Shigeharu Sato⁴, Stuart A. Ralph⁵, David J. Mann⁶, Zikai Xiong³, Shamira J. Shallom¹, Janice Weidman¹, Lingxia Jiang¹, Jeffery Lynn¹, Bruce Weaver¹, Azadeh Shoaibi¹, Alexander Domingo¹, Delia Wasawo², Jonathan Crabtree¹, Jennifer R. Wortman¹, Brian J. Haas¹, Samuel V. Angiuoli¹, Todd Creasy¹, Charles Lu¹, Charles Lu⁷, Bernard B. Suh¹, Bernard B. Suh⁸, Joana C. Silva¹, Teresa Utterback¹, Tamara Feldblyum¹, Mihaela Pertea¹, Jonathan E. Allen¹, William C. Nierman¹, Evans L. N. Taracha², Steven L. Salzberg¹, Owen White¹, Henry A. Fitzhugh², Subhash Morzaria², Subhash Morzaria⁹, J. Craig Venter, Claire M. Fraser¹, Vishvanath Nene¹ - Show less +43 more•Institutions (9)

J. Craig Venter Institute¹, International Livestock Research Institute², Wellcome Trust Sanger Institute³, Medical Research Council⁴, Pasteur Institute⁵, Imperial College London⁶, Princeton University⁷, University of California, Santa Cruz⁸, Food and Agriculture Organization⁹

01 Jul 2005-Science

TL;DR: The genome sequence of Theileria parva is reported, an apicomplexan pathogen causing economic losses to smallholder farmers in Africa, and its plastid-like genome represents the first example where all apicoplast genes are encoded on one DNA strand.

...read moreread less

Abstract: We report the genome sequence of Theileria parva, an apicomplexan pathogen causing economic losses to smallholder farmers in Africa. The parasite chromosomes exhibit limited conservation of gene synteny with Plasmodium falciparum, and its plastid-like genome represents the first example where all apicoplast genes are encoded on one DNA strand. We tentatively identify proteins that facilitate parasite segregation during host cell cytokinesis and contribute to persistent infection of transformed host cells. Several biosynthetic pathways are incomplete or absent, suggesting substantial metabolic dependence on the host cell. One protein family that may generate parasite antigenic diversity is not telomere-associated.

...read moreread less

Journal Article•DOI•

Sequencing the Genespaces of Medicago truncatula and Lotus japonicus

[...]

Nevin D. Young, Steven B. Cannon¹, Shusei Sato, Dong-Jin Kim², Douglas R. Cook², Christopher D. Town³, Bruce A. Roe⁴, Satoshi Tabata - Show less +4 more•Institutions (4)

University of Minnesota¹, University of California, Davis², J. Craig Venter Institute³, University of Oklahoma⁴

01 Apr 2005-Plant Physiology

TL;DR: Two model legumes, Medicago truncatula and Lotus japonicus, are currently targets of large-scale genome sequencing projects and the prospect of integrating genome information from Mt and Lj is exciting.

...read moreread less

Abstract: Two model legumes, Medicago truncatula ( Mt ) and Lotus japonicus ( Lj ), are currently targets of large-scale genome sequencing projects. As a result, legumes are one of few plant families with extensive genome sequence in multiple species. The prospect of integrating genome information from Mt and

...read moreread less

Journal Article•DOI•

Identification of clustered microRNAs using an ab initio prediction method

[...]

Alain Sewer¹, Nicodeme Paul¹, Pablo Landgraf², Alexei A. Aravin², Sébastien Pfeffer³, Sébastien Pfeffer², Michael J. Brownstein⁴, Thomas Tuschl², Erik van Nimwegen¹, Mihaela Zavolan¹ - Show less +6 more•Institutions (4)

University of Basel¹, Rockefeller University², Centre national de la recherche scientifique³, J. Craig Venter Institute⁴

07 Nov 2005-BMC Bioinformatics

TL;DR: This work describes a computational method for miRNA prediction and the results of its application to the discovery of novel mammalian miRNAs, and shows that although the overall miRNA content in the observed clusters is very similar across the three considered species, the internal organization of the clusters changes in evolution.

...read moreread less

Abstract: MicroRNAs (miRNAs) are endogenous 21 to 23-nucleotide RNA molecules that regulate protein-coding gene expression in plants and animals via the RNA interference pathway. Hundreds of them have been identified in the last five years and very recent works indicate that their total number is still larger. Therefore miRNAs gene discovery remains an important aspect of understanding this new and still widely unknown regulation mechanism. Bioinformatics approaches have proved to be very useful toward this goal by guiding the experimental investigations. In this work we describe our computational method for miRNA prediction and the results of its application to the discovery of novel mammalian miRNAs. We focus on genomic regions around already known miRNAs, in order to exploit the property that miRNAs are occasionally found in clusters. Starting with the known human, mouse and rat miRNAs we analyze 20 kb of flanking genomic regions for the presence of putative precursor miRNAs (pre-miRNAs). Each genome is analyzed separately, allowing us to study the species-specific identity and genome organization of miRNA loci. We only use cross-species comparisons to make conservative estimates of the number of novel miRNAs. Our ab initio method predicts between fifty and hundred novel pre-miRNAs for each of the considered species. Around 30% of these already have experimental support in a large set of cloned mammalian small RNAs. The validation rate among predicted cases that are conserved in at least one other species is higher, about 60%, and many of them have not been detected by prediction methods that used cross-species comparisons. A large fraction of the experimentally confirmed predictions correspond to an imprinted locus residing on chromosome 14 in human, 12 in mouse and 6 in rat. Our computational tool can be accessed on the world-wide-web. Our results show that the assumption that many miRNAs occur in clusters is fruitful for the discovery of novel miRNAs. Additionally we show that although the overall miRNA content in the observed clusters is very similar across the three considered species, the internal organization of the clusters changes in evolution.

...read moreread less

Journal Article•DOI•

Cell-free cloning using φ29 DNA polymerase

[...]

Clyde A. Hutchison¹, Hamilton O. Smith, Cynthia Pfannkoch, J. Craig Venter•Institutions (1)

J. Craig Venter Institute¹

29 Nov 2005-Proceedings of the National Academy of Sciences of the United States of America

TL;DR: Conditions for rolling-circle amplification (RCA) of individual DNA molecules 5–7 kb in size by >109-fold, using φ29 DNA polymerase is described, which allows cell-free cloning of individual synthetic DNA molecules that cannot be cloned in Escherichia coli, and may also speed genome sequencing by eliminating the need for biological cloning.

...read moreread less

Abstract: We describe conditions for rolling-circle amplification (RCA) of individual DNA molecules 5-7 kb in size by >10(9)-fold, using phi29 DNA polymerase. The principal difficulty with amplification of small amounts of template by RCA using phi29 DNA polymerase is "background" DNA synthesis that usually occurs when template is omitted, or at low template concentrations. Reducing the reaction volume while keeping the amount of template fixed increases the template concentration, resulting in a suppression of background synthesis. Cell-free cloning of single circular molecules by using phi29 DNA polymerase was achieved by carrying out the amplification reactions in very small volumes, typically 600 nl. This procedure allows cell-free cloning of individual synthetic DNA molecules that cannot be cloned in Escherichia coli, for example synthetic phage genomes carrying lethal mutations. It also allows cell-free cloning of genomic DNA isolated from bacteria. This DNA can be sequenced directly from the phi29 DNA polymerase reaction without further amplification. In contrast to PCR amplification, RCA using phi29 DNA polymerase does not produce mutant jackpots, and the high processivity of the enzyme eliminates stuttering at homopolymer tracts. Cell-free cloning has many potential applications to both natural and synthetic DNA. These include environmental DNA samples that have proven difficult to clone and synthetic genes encoding toxic products. The method may also speed genome sequencing by eliminating the need for biological cloning.

...read moreread less

Journal Article•DOI•

Composite genome map and recombination parameters derived from three archetypal lineages of Toxoplasma gondii

[...]

Asis Khan¹, Sonya Taylor¹, Chunlei Su¹, Aaron J. Mackey², Jon P. Boyle³, Robert H. Cole¹, Darius Glover¹, Keliang Tang¹, Ian T. Paulsen⁴, Matthew Berriman⁵, John C. Boothroyd³, E. R. Pfefferkorn⁶, Jitender P. Dubey⁷, James W. Ajioka⁸, David S. Roos², John C. Wootton⁹, L. David Sibley - Show less +13 more•Institutions (9)

Washington University in St. Louis¹, University of Pennsylvania², Stanford University³, J. Craig Venter Institute⁴, Wellcome Trust Sanger Institute⁵, Dartmouth College⁶, United States Department of Agriculture⁷, University of Cambridge⁸, National Institutes of Health⁹

01 Jan 2005-Nucleic Acids Research

TL;DR: A high frequency of closely adjacent, apparent double crossover events that may represent gene conversions and large regions of genetic homogeneity among the archetypal clonal lineages are detected, reflecting the relatively few genetic outbreeding events that have occurred since their recent origin are detected.

...read moreread less

Abstract: Toxoplasma gondii is a highly successful protozoan parasite in the phylum Apicomplexa, which contains numerous animal and human pathogens. T.gondii is amenable to cellular, biochemical, molecular and genetic studies, making it a model for the biology of this important group of parasites. To facilitate forward genetic analysis, we have developed a high-resolution genetic linkage map for T.gondii. The genetic map was used to assemble the scaffolds from a 10X shotgun whole genome sequence, thus defining 14 chromosomes with markers spaced at ∼300 kb intervals across the genome. Fourteen chromosomes were identified comprising a total genetic size of ∼592 cM and an average map unit of ∼104 kb/cM. Analysis of the genetic parameters in T.gondii revealed a high frequency of closely adjacent, apparent double crossover events that may represent gene conversions. In addition, we detected large regions of genetic homogeneity among the archetypal clonal lineages, reflecting the relatively few genetic outbreeding events that have occurred since their recent origin. Despite these unusual features, linkage analysis proved to be effective in mapping the loci determining several drug resistances. The resulting genome map provides a framework for analysis of complex traits such as virulence and transmission, and for comparative population genetic studies.

...read moreread less

Journal Article•DOI•

An atlas of human gene expression from massively parallel signature sequencing (MPSS)

[...]

C. Victor Jongeneel¹, Mauro Delorenzi², Christian Iseli², Daixing Zhou, Christian D. Haudenschild, Irina Khrebtukova, Dmitry Kuznetsov², Brian Stevenson², Robert L. Strausberg³, Andrew J. G. Simpson¹, Thomas J. Vasicek - Show less +7 more•Institutions (3)

Ludwig Institute for Cancer Research¹, Swiss Institute of Bioinformatics², J. Craig Venter Institute³

01 Jul 2005-Genome Research

TL;DR: The unbiased sampling of the human transcriptome achieved by MPSS supports the idea that most human genes have been mapped, if not functionally characterized.

...read moreread less

Abstract: We have used massively parallel signature sequencing (MPSS) to sample the transcriptomes of 32 normal human tissues to an unprecedented depth, thus documenting the patterns of expression of almost 20,000 genes with high sensitivity and specificity. The data confirm the widely held belief that differences in gene expression between cell and tissue types are largely determined by transcripts derived from a limited number of tissue-specific genes, rather than by combinations of more promiscuously expressed genes. Expression of a little more than half of all known human genes seems to account for both the common requirements and the specific functions of the tissues sampled. A classification of tissues based on patterns of gene expression largely reproduces classifications based on anatomical and biochemical properties. The unbiased sampling of the human transcriptome achieved by MPSS supports the idea that most human genes have been mapped, if not functionally characterized. This data set should prove useful for the identification of tissue-specific genes, for the study of global changes induced by pathological conditions, and for the definition of a minimal set of genes necessary for basic cell maintenance. The data are available on the Web at http://mpss.licr.org and http://sgb.lynxgen.com.

...read moreread less

Journal Article•DOI•

The sequence of rice chromosomes 11 and 12, rich in disease resistance genes and recent gene duplications.

[...]

Nathalie Choisne¹, Nadia Demange¹, Gisela Orjeda¹, Sylvie Samain¹, Angélique D'Hont², Laurence Cattolico¹, Eric Pelletier¹, Arnaud Couloux¹, Béatrice Segurens¹, Patrick Wincker¹, Claude Scarpelli¹, Jean Weissenbach¹, Marcel Salanoubat¹, Nagendra K. Singh³, Trilochan Mohapatra³, Tilak Raj Sharma³, Kishor Gaikwad³, Alok Singh³, Vivek Dalal³, Subodh K. Srivastava³, Anupam Dixit³, Ajit K. Pal³, Irfan Ahmad Ghazi³, Mahavir Yadav³, Awadhesh Pandit³, Ashutosh Bhargava³, K. Sureshbabu³, Rekha Dixit³, Harvinder Singh³, Suresh C. Swain³, Sumita Pal³, M. Ragiba³, Pradeep K. Singh³, Vibha Singhal³, Sangeeta D. Mendiratta³, Kamlesh Batra³, Saurabh Raghuvanshi⁴, Amitabh Mohanty⁴, Arvind K. Bharti⁴, Anupama Gaur⁴, Vikrant Gupta⁴, Dibyendu Kumar⁴, Ravi Vydianathan⁴, Shuba Vij⁴, Anita Kapur⁴, Parul Khurana⁴, Sulabha Sharma⁴, Paramjit Khurana⁴, Jitendra P. Khurana⁴, Akhilesh K. Tyagi⁴, Qiaoping Yuan⁵, Shu Ouyang⁵, Jia Liu⁵, Wei Zhu⁵, Aihui Wang⁵, Haining Lin⁵, John P. Hamilton⁵, Brian J. Haas⁵, Jennifer R. Wortman⁵, Kristine Jones⁵, Mary Kim⁵, Larry Overton⁵, Tamara Tsitrin⁵, Douglas Fadrosh⁵, Jayati Bera⁵, Bruce Weaver⁵, Shaohua Jin⁵, Shivani Johri⁵, Matt Reardon⁵, Hue Vuong⁵, Luke J. Tallon⁵, Susan Van Aken⁵, Matthew R. Lewis⁵, Teresa Utterback⁵, Tamara Feldblyum⁵, Victoria Zismann⁵, Stacey E. Iobst⁵, Joseph Hsiao⁵, Aymeric R. De Vazeille⁵, Steven L. Salzberg⁵, Owen White⁵, Claire M. Fraser⁵, C. Robin Buell⁵, Yeisoo Yu⁶, Teri Rambo⁶, Jennifer Currie⁶, Kristi Collura⁶, Hyeran Kim⁶, Diana Stum⁶, Wenming Wang⁶, Dave Kudrna⁶, Christopher Mueller⁶, Rod A. Wing⁶, Melissa Kramer⁷, Lori Spiegel⁷, Lidia Nascimento⁷, R. Preston⁷, Theresa Zutavern⁷, Joachim Messing⁸ - Show less +95 more•Institutions (8)

Université Paris-Saclay¹, Centre de coopération internationale en recherche agronomique pour le développement², Indian Council of Agricultural Research³, University of Delhi⁴, J. Craig Venter Institute⁵, University of Arizona⁶, Cold Spring Harbor Laboratory⁷, Rutgers University⁸

27 Sep 2005-BMC Biology

TL;DR: Based on syntenic alignments of these chromosomes, rice chromosome 11 and 12 do not appear to have resulted from a single whole-genome duplication event as previously suggested.

...read moreread less

Abstract: Background: Rice is an important staple food and, with the smallest cereal genome, serves as a reference species for studies on the evolution of cereals and other grasses Therefore, decoding its entire genome will be a prerequisite for applied and basic research on this species and all other cereals Results: We have determined and analyzed the complete sequences of two of its chromosomes, 11 and 12, which total 559 Mb (143% of the entire genome length), based on a set of overlapping clones A total of 5,993 non-transposable element related genes are present on these chromosomes Among them are 289 disease resistance-like and 28 defense-response genes, a higher proportion of these categories than on any other rice chromosome A three-Mb segment on both chromosomes resulted from a duplication 77 million years ago (mya), the most recent large-scale duplication in the rice genome Paralogous gene copies within this segmental duplication can be aligned with genomic assemblies from sorghum and maize Although these gene copies are preserved on both chromosomes, their expression patterns have diverged When the gene order of rice chromosomes 11 and 12 was compared to wheat gene loci, significant synteny between these orthologous regions was detected, illustrating the presence of conserved genes alternating with recently evolved genes Conclusion: Because the resistance and defense response genes, enriched on these chromosomes relative to the whole genome, also occur in clusters, they provide a preferred target for breeding durable disease resistance in rice and the isolation of their allelic variants The recent duplication of a large chromosomal segment coupled with the high density of disease resistance gene clusters makes this the most recently evolved part of the rice genome Based on syntenic alignments of these chromosomes, rice chromosome 11 and 12 do not appear to have resulted from a single whole-genome duplication event as previously suggested (Resume d'auteur)

...read moreread less

Journal Article•DOI•

Sequence survey of receptor tyrosine kinases reveals mutations in glioblastomas

[...]

Vikki Rand¹, Jiaqi Huang¹, Timothy B. Stockwell², Steve Ferriera², Oleksandr V. Buzko³, Samuel Levy², Dana A. Busam², Kelvin Li², Jennifer B. Edwards¹, Charles G. Eberhart¹, Kathleen M. Murphy¹, Alexia Tsiamouri², Karen Beeson², Andrew J. G. Simpson⁴, J. Craig Venter², Gregory J. Riggins¹, Robert L. Strausberg² - Show less +13 more•Institutions (4)

Johns Hopkins University¹, J. Craig Venter Institute², University of California, San Diego³, Ludwig Institute for Cancer Research⁴

04 Oct 2005-Proceedings of the National Academy of Sciences of the United States of America

TL;DR: In this article, the authors report on the sequence analysis of members of the receptor tyrosine kinase (RTK) gene family in the genomes of glioblastoma brain tumors.

...read moreread less

Abstract: It is now clear that tyrosine kinases represent attractive targets for therapeutic intervention in cancer. Recent advances in DNA sequencing technology now provide the opportunity to survey mutational changes in cancer in a high-throughput and comprehensive manner. Here we report on the sequence analysis of members of the receptor tyrosine kinase (RTK) gene family in the genomes of glioblastoma brain tumors. Previous studies have identified a number of molecular alterations in glioblastoma, including amplification of the RTK epidermal growth factor receptor. We have identified mutations in two other RTKs: (i) fibroblast growth receptor 1, including the first mutations in the kinase domain in this gene observed in any cancer, and (ii) a frameshift mutation in the platelet-derived growth factor receptor-α gene. Fibroblast growth receptor 1, platelet-derived growth factor receptor-α, and epidermal growth factor receptor are all potential entry points to the phosphatidylinositol 3-kinase and mitogen-activated protein kinase intracellular signaling pathways already known to be important for neoplasia. Our results demonstrate the utility of applying DNA sequencing technology to systematically assess the coding sequence of genes within cancer genomes.

...read moreread less

Journal Article•DOI•

Genomics: massively parallel sequencing.

[...]

Yu-Hui Rogers¹, J. Craig Venter¹•Institutions (1)

J. Craig Venter Institute¹

15 Sep 2005-Nature

TL;DR: A sequencing system has been developed that can read 25 million bases of genetic code — the entire genome of some fungi — within four hours, and may provide an alternative approach to DNA sequencing.

...read moreread less

Abstract: A sequencing system has been developed that can read 25 million bases of genetic code — the entire genome of some fungi — within four hours. The technique may provide an alternative approach to DNA sequencing. The race is on for a big prize: the job of providing the world's DNA sequencing laboratories with the successor to the ‘Sanger-based’ technology that gave us the first wave of genome sequences. One technology in the frame is that produced by 454 Life Sciences Corporation of Branford, Connecticut. Today's technology reads 67,000 base pairs per hour; this new approach is 100 times faster, reading 6 million base pairs per hour. The improved performance results from using picolitre-sized chemical reactors, enhanced light-emitting sequencing chemistries and complex informatics. Further miniaturization of the system is planned. Such leaps in technology may one day make it possible to analyse an individual's genome before designing therapy: the ultimate in personalized medicine.

...read moreread less

Journal Article•DOI•

DNA replication origins in the Schizosaccharomyces pombe genome

[...]

Jianli Dai¹, Ray-Yuan Chuang¹, Ray-Yuan Chuang², Thomas J. Kelly•Institutions (2)

Johns Hopkins University¹, J. Craig Venter Institute²

11 Jan 2005-Proceedings of the National Academy of Sciences of the United States of America

TL;DR: A stochastic model for initiation of DNA replication in the fission yeast is proposed and it is demonstrated that at least half of intergenes have potential origin activity and that the relative ability of an intergene to function as an origin is governed primarily by AT content and length.

...read moreread less

Abstract: Origins of DNA replication in Schizosaccharomyces pombe lack a specific consensus sequence analogous to the Saccharomyces cerevisiae autonomously replicating sequence (ARS) consensus, raising the question of how they are recognized by the replication machinery. Because all well characterized S. pombe origins are located in intergenic regions, we analyzed the sequence properties and biological activity of such regions. The AT content of intergenes is very high (≈70%), and runs of A's or T's occur with a significantly greater frequency than expected. Additionally, the two DNA strands in intergenes display compositional asymmetry that strongly correlates with the direction of transcription of flanking genes. Importantly, the sequence properties of known S. pombe origins of DNA replication are similar to those of intergenes in general. In functional studies, we assayed the in vivo origin activity of 26 intergenes in a 68-kb region of S. pombe chromosome 2. We also assayed the origin activity of sets of randomly chosen intergenes with the same length or AT content. Our data demonstrate that at least half of intergenes have potential origin activity and that the relative ability of an intergene to function as an origin is governed primarily by AT content and length. We propose a stochastic model for initiation of DNA replication in the fission yeast. In this model, the number of AT tracts in a given sequence is the major determinant of its probability of binding SpORC and serving as a replication origin. A similar model may explain some features of origins of DNA replication in metazoans.

...read moreread less

Journal Article•DOI•

Divergent responses of chondrocytes and endothelial cells to shear stress: Cross-talk among COX-2, the phase 2 response, and apoptosis

[...]

Zachary R. Healy¹, Norman H. Lee², Xiangqun Gao¹, Mary B. Goldring³, Paul Talalay, Thomas W. Kensler¹, Konstantinos Konstantopoulos¹ - Show less +3 more•Institutions (3)

Johns Hopkins University¹, J. Craig Venter Institute², Harvard University³

27 Sep 2005-Proceedings of the National Academy of Sciences of the United States of America

TL;DR: It is demonstrated that shear-induced cyclooxygenase (COX)-2 suppresses phosphatidylinositol 3-kinase (PI3-K) activity, which represses antioxidant response element (ARE)/NF-E2 related factor 2 (Nrf2)-mediated transcriptional response in human chondrocytes, which contributes to their apoptosis.

...read moreread less

Abstract: Fluid shear exerts anti-inflammatory and anti-apoptotic effects on endothelial cells by inducing the coordinated expression of phase 2 detoxifying and antioxidant genes. In contrast, high shear is pro-apoptotic in chondrocytes and promotes matrix degradation and cartilage destruction. We have analyzed the mechanisms regulating shear-mediated chondrocyte apoptosis by cDNA microarray technology and bioinformatics. We demonstrate that shear-induced cyclooxygenase (COX)-2 suppresses phosphatidylinositol 3-kinase (PI3-K) activity, which represses antioxidant response element (ARE)/NF-E2 related factor 2 (Nrf2)-mediated transcriptional response in human chondrocytes. The resultant decrease in antioxidant capacity of sheared chondrocytes contributes to their apoptosis. Phase 2 inducers, and to a lesser extent COX-2-selective inhibitors, negate the shear-mediated suppression of ARE-driven phase 2 activity and apoptosis. The abrogation of shear-induced COX-2 expression by PI3-K activity and/or stimulation of the Nrf2/ARE pathway suggests the existence of PI3-K/Nrf2/ARE negative feedback loops that potentially interfere with c-Jun N-terminal kinase 2 activity upstream of COX-2. Reconstructing the signaling network regulating shear-induced chondrocyte apoptosis may provide insights to optimize conditions for culturing artificial cartilage in bioreactors and for developing therapeutic strategies for arthritic disorders.

...read moreread less

Journal Article•DOI•

Identification of cancer/testis-antigen genes by massively parallel signature sequencing

[...]

Yao-Tseng Chen¹, Matthew J. Scanlan², Charis A. Venditti², Ramon Chua², Grégory Theiler³, Brian Stevenson³, Christian Iseli³, Ali O. Gure², Tom Vasicek⁴, Robert L. Strausberg⁵, C. Victor Jongeneel⁶, Lloyd J. Old², Andrew J. G. Simpson² - Show less +9 more•Institutions (6)

Cornell University¹, Memorial Sloan Kettering Cancer Center², Ludwig Institute for Cancer Research³, Illumina⁴, J. Craig Venter Institute⁵, National Center for Supercomputing Applications⁶

31 May 2005-Proceedings of the National Academy of Sciences of the United States of America

TL;DR: MPSS analysis has resulted in a significant extension of the knowledge of CT antigens, leading to the discovery of a distinctive X-linked CT-antigen gene family.

...read moreread less

Abstract: Massively parallel signature sequencing (MPSS) generates millions of short sequence tags corresponding to transcripts from a single RNA preparation. Most MPSS tags can be unambiguously assigned to genes, thereby generating a comprehensive expression profile of the tissue of origin. From the comparison of MPSS data from 32 normal human tissues, we identified 1,056 genes that are predominantly expressed in the testis. Further evaluation by using MPSS tags from cancer cell lines and EST data from a wide variety of tumors identified 202 of these genes as candidates for encoding cancer/testis (CT) antigens. Of these genes, the expression in normal tissues was assessed by RT-PCR in a subset of 166 intron-containing genes, and those with confirmed testis-predominant expression were further evaluated for their expression in 21 cancer cell lines. Thus, 20 CT or CT-like genes were identified, with several exhibiting expression in five or more of the cancer cell lines examined. One of these genes is a member of a CT gene family that we designated as CT45. The CT45 family comprises six highly similar (>98% cDNA identity) genes that are clustered in tandem within a 125-kb region on Xq26.3. CT45 was found to be frequently expressed in both cancer cell lines and lung cancer specimens. Thus, MPSS analysis has resulted in a significant extension of our knowledge of CT antigens, leading to the discovery of a distinctive X-linked CT-antigen gene family.

...read moreread less

Journal Article•DOI•

The interactive online SKY/M-FISH & CGH database and the Entrez cancer chromosomes search database: linkage of chromosomal aberrations with the genome sequence.

[...]

Turid Knutsen¹, Vasuki Gobu¹, Rodger Knaus¹, Hesed Padilla-Nash¹, Meena Augustus, Robert L. Strausberg², Ilan R. Kirsch¹, Karl Sirotkin¹, Thomas Ried¹ - Show less +5 more•Institutions (2)

National Institutes of Health¹, J. Craig Venter Institute²

01 Sep 2005-Genes, Chromosomes and Cancer

TL;DR: These resources, developed as a part of the Cancer Chromosome Aberration Project (CCAP) initiative, aid the search for new cancer‐associated genes and foster insights into the causes and consequences of genetic alterations in cancer.

...read moreread less

Abstract: To catalog data on chromosomal aberrations in cancer derived from emerging molecular cytogenetic techniques and to integrate these data with genome maps, we have established two resources, the NCI and NCBI SKY/M-FISH & CGH Database and the Cancer Chromosomes database. The goal of the former is to allow investigators to submit and analyze clinical and research cytogenetic data. It contains a karyotype parser tool, which automatically converts the ISCN short-form karyotype into an internal representation displayed in detailed form and as a colored ideogram with band overlay, and also has a tool to compare CGH profiles from multiple cases. The Cancer Chromosomes database integrates the SKY/M-FISH & CGH Database with the Mitelman Database of Chromosome Aberrations in Cancer and the Recurrent Chromosome Aberrations in Cancer database. These three datasets can now be searched seamlessly by use of the Entrez search and retrieval system for chromosome aberrations, clinical data, and reference citations. Common diagnoses, anatomic sites, chromosome breakpoints, junctions, numerical and structural abnormalities, and bands gained and lost among selected cases can be compared by use of the "similarity" report. Because the model used for CGH data is a subset of the karyotype data, it is now possible to examine the similarities between CGH results and karyotypes directly. All chromosomal bands are directly linked to the Entrez Map Viewer database, providing integration of cytogenetic data with the sequence assembly. These resources, developed as a part of the Cancer Chromosome Aberration Project (CCAP) initiative, aid the search for new cancer-associated genes and foster insights into the causes and consequences of genetic alterations in cancer.

...read moreread less