scispace - formally typeset
Open AccessJournal ArticleDOI

Genome analysis of multiple pathogenic isolates of Streptococcus agalactiae: Implications for the microbial “pan-genome”

TLDR
The genomic sequence of six strains representing the five major disease-causing serotypes of Streptococcus agalactiae, the main cause of neonatal infection in humans, was generated and Mathematical extrapolation of the data suggests that the gene reservoir available for inclusion in the S. agalactic pan-genome is vast and that unique genes will continue to be identified even after sequencing hundreds of genomes.
Abstract
The development of efficient and inexpensive genome sequencing methods has revolutionized the study of human bacterial pathogens and improved vaccine design. Unfortunately, the sequence of a single genome does not reflect how genetic variability drives pathogenesis within a bacterial species and also limits genome-wide screens for vaccine candidates or for antimicrobial targets. We have generated the genomic sequence of six strains representing the five major disease-causing serotypes of Streptococcus agalactiae, the main cause of neonatal infection in humans. Analysis of these genomes and those available in databases showed that the S. agalactiae species can be described by a pan-genome consisting of a core genome shared by all isolates, accounting for ≈80% of any single genome, plus a dispensable genome consisting of partially shared and strain-specific genes. Mathematical extrapolation of the data suggests that the gene reservoir available for inclusion in the S. agalactiae pan-genome is vast and that unique genes will continue to be identified even after sequencing hundreds of genomes.

read more

Citations
More filters
Journal ArticleDOI

NCBI prokaryotic genome annotation pipeline

TL;DR: The new NCBI's Prokaryotic Genome Annotation Pipeline (PGAP) relies less on sequence similarity when confident comparative data are available, while it relies more on statistical predictions in the absence of external evidence.
Journal ArticleDOI

progressiveMauve: Multiple Genome Alignment with Gene Gain, Loss and Rearrangement

TL;DR: A new method to align two or more genomes that have undergone rearrangements due to recombination and substantial amounts of segmental gain and loss is described, demonstrating high accuracy in situations where genomes have undergone biologically feasible amounts of genome rearrangement, segmental loss and loss.
Journal ArticleDOI

IMG: the integrated microbial genomes database and comparative analysis system

TL;DR: The Integrated Microbial Genomes system serves as a community resource for comparative analysis of publicly available genomes in a comprehensive integrated context and provides tools and viewers for analyzing and reviewing the annotations of genes and genomes inA comparative context.
Journal ArticleDOI

The microbial pan-genome

TL;DR: A decade after the beginning of the genomic era, the question of how genomics can describe a bacterial species has not been fully addressed and the pan-genome, which is composed of a "core genome" containing genes present in all strains, and a "dispensable genome", might be orders of magnitude larger than any single genome.
References
More filters
Journal ArticleDOI

Identification of common molecular subsequences.

TL;DR: This letter extends the heuristic homology algorithm of Needleman & Wunsch (1970) to find a pair of segments, one from each of two long sequences, such that there is no other Pair of segments with greater similarity (homology).
Journal ArticleDOI

Diversity of the human intestinal microbial flora.

TL;DR: A majority of the bacterial sequences corresponded to uncultivated species and novel microorganisms, and significant intersubject variability and differences between stool and mucosa community composition were discovered.
Journal ArticleDOI

Whole-genome random sequencing and assembly of Haemophilus influenzae Rd.

TL;DR: An approach for genome analysis based on sequencing and assembly of unselected pieces of DNA from the whole chromosome has been applied to obtain the complete nucleotide sequence of the genome from the bacterium Haemophilus influenzae Rd.
Related Papers (5)