Showing papers by "Daniel H. Haft" published in 2002

Open Access

Journal Article

Genome sequence of the human malaria parasite Plasmodium falciparum

Malcolm J. Gardner¹, Neil Hall¹, Eula Fung¹, Owen White¹, Matthew Berriman¹, Richard W. Hyman¹, Jane M. Carlton¹, Arnab Pain¹, Karen E. Nelson¹, Sharen Bowman¹, Ian T. Paulsen¹, Keith D. James¹, Jonathan A. Eisen¹, Kim Rutherford¹, Steven L. Salzberg¹, Alister Craig¹, Sue Kyes¹, Man Suen Chan¹, Vishvanath Nene¹, Shamira J. Shallom¹, Bernard B. Suh¹, Jeremy Peterson¹, Samuel V. Angiuoli¹, Mihaela Pertea¹, Jonathan E. Allen¹, Jeremy D. Selengut¹, Daniel H. Haft¹, Michael W. Mather¹, Akhil B. Vaidya¹, David M. A. Martin¹, Alan H. Fairlamb¹, Martin Fraunholz¹, David S. Roos¹, Stuart A. Ralph¹, Geoffrey I. McFadden¹, Leda M. Cummings¹, G. Mani Subramanian¹, Christopher J. Mungall¹, J. Craig Venter¹, Daniel J. Carucci¹, Stephen L. Hoffman¹, Chris I. Newbold¹, Ronald W. Davis¹, Claire M. Fraser¹, Bart Barrell¹

J. Craig Venter Institute¹

03 Oct 2002-Nature

TL;DR: The genome sequence of P. falciparum clone 3D7, the most (A + T)-rich genome sequenced to date, is reported; it is being exploited in the search for new drugs and vaccines to fight malaria.

Abstract: The parasite Plasmodium falciparum is responsible for hundreds of millions of cases of malaria, and kills more than one million African children annually. Here we report an analysis of the genome sequence of P. falciparum clone 3D7. The 23-megabase nuclear genome consists of 14 chromosomes, encodes about 5,300 genes, and is the most (A + T)-rich genome sequenced to date. Genes involved in antigenic variation are concentrated in the subtelomeric regions of the chromosomes. Compared to the genomes of free-living eukaryotic microbes, the genome of this intracellular parasite encodes fewer enzymes and transporters, but a large proportion of genes are devoted to immune evasion and host-parasite interactions. Many nuclear-encoded proteins are targeted to the apicoplast, an organelle involved in fatty-acid and isoprenoid metabolism. The genome sequence provides the foundation for future studies of this organism, and is being exploited in the search for new drugs and vaccines to fight malaria.

4,312 citations

Journal Article

Genome sequence of the dissimilatory metal ion-reducing bacterium Shewanella oneidensis.

[...]

Johns Hopkins University¹, Jet Propulsion Laboratory², University of Hawaii³, University of Arizona⁴, Carnegie Institution for Science⁵, Celera Corporation⁶, University of Southern California⁷, Washington University in St. Louis⁸

07 Oct 2002-Nature Biotechnology

TL;DR: This genome sequence represents a critical step in the elucidation of the pathways for reduction (and bioremediation) of pollutants such as uranium (U) and chromium (Cr), and offers a starting point for defining this organism's complex electron transport systems and metal ion–reducing capabilities.

Abstract: Shewanella oneidensis is an important model organism for bioremediation studies because of its diverse respiratory capabilities, conferred in part by multicomponent, branched electron transport systems. Here we report the sequencing of the S. oneidensis genome, which consists of a 4,969,803-base pair circular chromosome with 4,758 predicted protein-encoding open reading frames (CDS) and a 161,613-base pair plasmid with 173 CDSs. We identified the first Shewanella lambda-like phage, providing a potential tool for further genome engineering. Genome analysis revealed 39 c-type cytochromes, including 32 previously unidentified in S. oneidensis, and a novel periplasmic [Fe] hydrogenase, which are integral members of the electron transport system. This genome sequence represents a critical step in the elucidation of the pathways for reduction (and bioremediation) of pollutants such as uranium (U) and chromium (Cr), and offers a starting point for defining this organism's complex electron transport systems and metal ion-reducing capabilities.

815 citations

Journal Article

A bacterial genome in flux: the twelve linear and nine circular extrachromosomal DNAs in an infectious isolate of the Lyme disease spirochete Borrelia burgdorferi.

Sherwood R. Casjens¹, Nanette Palmer¹, René Van Vugt¹, Wai Mun Huang¹, Brian Stevenson², Patricia A. Rosa³, Raju Lathigra⁴, Granger G. Sutton, Jeremy Peterson, Robert J. Dodson, Daniel H. Haft, Erin Hickey, Michelle L. Gwinn, Owen White, Claire M. Fraser

University of Utah¹, University of Kentucky², National Institutes of Health³, MedImmune⁴

05 Apr 2002-Molecular Microbiology

TL;DR: It is determined that Borrelia burgdorferi strain B31 MI carries 21 extrachromosomal DNA elements, the largest number known for any bacterium, and the nucleotide sequence of three linear and seven circular plasmids in this infectious isolate is reported.

Abstract: We have determined that Borrelia burgdorferi strain B31 MI carries 21 extrachromosomal DNA elements, the largest number known for any bacterium. Among these are 12 linear and nine circular plasmids, whose sequences total 610 694 bp. We report here the nucleotide sequence of three linear and seven circular plasmids (comprising 290 546 bp) in this infectious isolate. This completes the genome sequencing project for this organism; its genome size is 1 521 419 bp (plus about 2000 bp of undetermined telomeric sequences). Analysis of the sequence implies that there has been extensive and sometimes rather recent DNA rearrangement among a number of the linear plasmids. Many of these events appear to have been mediated by recombinational processes that formed duplications. These many regions of similarity are reflected in the fact that most plasmid genes are members of one of the genome's 161 paralogous gene families; 107 of these gene families, which vary in size from two to 41 members, contain at least one plasmid gene. These rearrangements appear to have contributed to a surprisingly large number of apparently non-functional pseudogenes, a very unusual feature for a prokaryotic genome. The presence of these damaged genes suggests that some of the plasmids may be in a period of rapid evolution. The sequence predicts 535 plasmid genes ≥300 bp in length that may be intact and 167 apparently mutationally damaged and/or unexpressed genes (pseudogenes). The large majority, over 90%, of genes on these plasmids have no convincing similarity to genes outside Borrelia, suggesting that they perform specialized functions.

811 citations

Journal Article

Whole-Genome Comparison of Mycobacterium tuberculosis Clinical and Laboratory Strains

Robert D. Fleischmann¹, David Alland¹, Jonathan A. Eisen¹, L. Carpenter¹, Owen White¹, Jeremy Peterson¹, Robert T. DeBoy¹, Robert J. Dodson¹, Michelle L. Gwinn¹, Daniel H. Haft¹, Erin Hickey¹, James F. Kolonay¹, William C. Nelson¹, Lowell Umayam¹, Maria D. Ermolaeva¹, Steven L. Salzberg¹, Arthur L. Delcher¹, T. Utterback¹, Janice Weidman¹, Hoda Khouri¹, John Gill¹, A. Mikula¹, William R. Bishai¹, William R. Jacobs¹, J. C. Venter¹, Claire M. Fraser¹

J. Craig Venter Institute¹

01 Oct 2002-Journal of Bacteriology

TL;DR: Results demonstrate that polymorphisms among M. tuberculosis strains are more extensive than initially anticipated, and genetic variation may have an important role in disease pathogenesis and immunity.

Abstract: Virulence and immunity are poorly understood in Mycobacterium tuberculosis. We sequenced the complete genome of the M. tuberculosis clinical strain CDC1551 and performed a whole-genome comparison with the laboratory strain H37Rv in order to identify polymorphic sequences with potential relevance to disease pathogenesis, immunity, and evolution. We found large-sequence and single-nucleotide polymorphisms in numerous genes. Polymorphic loci included a phospholipase C, a membrane lipoprotein, members of an adenylate cyclase gene family, and members of the PE/PPE gene family, some of which have been implicated in virulence or the host immune response. Several gene families, including the PE/PPE gene family, also had significantly higher synonymous and nonsynonymous substitution frequencies compared to the genome as a whole. We tested a large sample of M. tuberculosis clinical isolates for a subset of the large-sequence and single-nucleotide polymorphisms and found widespread genetic variability at many of these loci. We performed phylogenetic and epidemiological analysis to investigate the evolutionary relationships among isolates and the origins of specific polymorphic loci. A number of these polymorphisms appear to have occurred multiple times as independent events, suggesting that these changes may be under selective pressure. Together, these results demonstrate that polymorphisms among M. tuberculosis strains are more extensive than initially anticipated, and genetic variation may have an important role in disease pathogenesis and immunity.

732 citations

Journal Article

The complete genome sequence of Chlorobium tepidum TLS, a photosynthetic, anaerobic, green-sulfur bacterium.

Jonathan A. Eisen¹, Karen E. Nelson¹, Ian T. Paulsen¹, John F. Heidelberg¹, Martin Wu¹, Robert J. Dodson¹, Robert T. DeBoy¹, Michelle L. Gwinn¹, William C. Nelson¹, Daniel H. Haft¹, Erin Hickey¹, Jeremy Peterson¹, A. Scott Durkin¹, James L. Kolonay¹, Fan Yang¹, Ingeborg Holt¹, Lowell Umayam¹, Tanya Mason¹, Michael B. Brenner¹, Terrance Shea¹, Debbie S. Parksey¹, William C. Nierman¹, Tamara Feldblyum¹, Cheryl L. Hansen¹, M. Brook Craven¹, Diana Radune¹, Jessica Vamathevan¹, Hoda Khouri¹, Owen White¹, Tanja M. Gruber², K. A. Ketchum³, K. A. Ketchum¹, J. Craig Venter¹, Hervé Tettelin¹, Donald A. Bryant⁴, Claire M. Fraser¹

J. Craig Venter Institute¹, University of California, San Francisco², Celera Corporation³, Pennsylvania State University⁴

09 Jul 2002-Proceedings of the National Academy of Sciences of the United States of America

TL;DR: Phylogenomic analysis reveals likely duplications of genes involved in biosynthetic pathways for photosynthesis and the metabolism of sulfur and nitrogen as well as strong similarities between metabolic processes in C. tepidum and many Archaeal species.

Abstract: The complete genome of the green-sulfur eubacterium Chlorobium tepidum TLS was determined to be a single circular chromosome of 2,154,946 bp. This represents the first genome sequence from the phylum Chlorobia, whose members perform anoxygenic photosynthesis by the reductive tricarboxylic acid cycle. Genome comparisons have identified genes in C. tepidum that are highly conserved among photosynthetic species. Many of these have no assigned function and may play novel roles in photosynthesis or photobiology. Phylogenomic analysis reveals likely duplications of genes involved in biosynthetic pathways for photosynthesis and the metabolism of sulfur and nitrogen as well as strong similarities between metabolic processes in C. tepidum and many Archaeal species.

362 citations

Journal Article

InterPro: An integrated documentation resource for protein families, domains and functional sites

[...]

01 Sep 2002-Briefings in Bioinformatics

TL;DR: InterPro was developed as an integrated documentation resource for protein families, domains and functional sites, to rationalise the complementary efforts of the individual protein signature database projects.

Abstract: The exponential increase in the submission of nucleotide sequences to the nucleotide sequence database by genome sequencing centres has resulted in a need for rapid, automatic methods for classification of the resulting protein sequences. There are several signature and sequence cluster-based methods for protein classification, each resource having distinct areas of optimum application owing to the differences in the underlying analysis methods. In recognition of this, InterPro was developed as an integrated documentation resource for protein families, domains and functional sites, to rationalise the complementary efforts of the individual protein signature database projects. The member databases - PRINTS, PROSITE, Pfam, ProDom, SMART and TIGRFAMs - form the InterPro core. Related signatures from each member database are unified into single InterPro entries. Each InterPro entry includes a unique accession number, functional descriptions and literature references, and links are made back to the relevant member database(s). Release 4.0 of InterPro (November 2001) contains 4,691 entries, representing 3,532 families, 1,068 domains, 74 repeats and 15 sites of post-translational modification (PTMs) encoded by different regular expressions, profiles, fingerprints and hidden Markov models (HMMs). Each InterPro entry lists all the matches against SWISS-PROT and TrEMBL (2,141,621 InterPro hits from 586,124 SWISS-PROT and TrEMBL protein sequences). The database is freely accessible for text- and sequence-based searches.

344 citations

Journal Article

HMM-based databases in InterPro

Alex Bateman¹, Daniel H. Haft

Wellcome Trust Sanger Institute¹

01 Sep 2002-Briefings in Bioinformatics

TL;DR: This paper reviews the Pfam, TIGRFAMs and SMART databases, which use the profile-HMMs provided by the HMMER package to detect members of protein families.

Abstract: Protein family databases are an important resource for protein annotation and understanding protein evolution and function. In recent years hidden Markov models (HMMs) have become one of the key technologies used for detection of members of these families. This paper reviews the Pfam, TIGRFAMs and SMART databases that use the profile-HMMs provided by the HMMER package.

44 citations