scispace - formally typeset
Search or ask a question

Showing papers by "Bruce W. Birren published in 2009"


Journal ArticleDOI
04 Jun 2009-Nature
TL;DR: There are significant expansions of cell wall, secreted and transporter gene families in pathogenic species, suggesting adaptations associated with virulence in Candida albicans species.
Abstract: Candida species are the most common cause of opportunistic fungal infection worldwide. Here we report the genome sequences of six Candida species and compare these and related pathogens and non-pathogens. There are significant expansions of cell wall, secreted and transporter gene families in pathogenic species, suggesting adaptations associated with virulence. Large genomic tracts are homozygous in three diploid species, possibly resulting from recent recombination events. Surprisingly, key components of the mating and meiosis pathways are missing from several species. These include major differences at the mating-type loci (MTL); Lodderomyces elongisporus lacks MTL, and components of the a1/2 cell identity determinant were lost in other species, raising questions about how mating and cell types are controlled. Analysis of the CUG leucine-to-serine genetic-code change reveals that 99% of ancestral CUG codons were erased and new ones arose elsewhere. Lastly, we revise the Candida albicans gene catalogue, identifying many new genes.

956 citations


Journal ArticleDOI
09 Oct 2009-Science
TL;DR: In this article, the authors propose a method to distinguish good from poor data sets by navigating through the databases to find the number and type of reads deposited in sequence trace repositories (and not all genomes have this available), or to identify the number of contigs or genome fragments deposited to the database.
Abstract: For over a decade, genome sequences have adhered to only two standards that are relied on for purposes of sequence analysis by interested third parties (1, 2). However, ongoing developments in revolutionary sequencing technologies have resulted in a redefinition of traditional whole-genome sequencing that requires reevaluation of such standards. With commercially available 454 pyrosequencing (followed by Illumina, SOLiD, and now Helicos), there has been an explosion of genomes sequenced under the moniker “draft”; however, these can be very poor quality genomes (due to inherent errors in the sequencing technologies, and the inability of assembly programs to fully address these errors). Further, one can only infer that such draft genomes may be of poor quality by navigating through the databases to find the number and type of reads deposited in sequence trace repositories (and not all genomes have this available), or to identify the number of contigs or genome fragments deposited to the database. The difficulty in assessing the quality of such deposited genomes has created some havoc for genome analysis pipelines and has contributed to many wasted hours. Exponential leaps in raw sequencing capability and greatly reduced prices have further skewed the time- and cost-ratios of draft data generation versus the painstaking process of improving and finishing a genome. The result is an ever-widening gap between drafted and finished genomes that only promises to continue (see the figure, page 236); hence, there is an urgent need to distinguish good from poor data sets.

370 citations


Journal ArticleDOI
TL;DR: The order and genomic arrangement of the duplicated gene pairs and their common phylogenetic origin provide evidence for an ancestral whole-genome duplication (WGD) event that resulted in the expansion of multiple gene families related to cell growth and signal transduction, as well as secreted aspartic protease and subtilase protein families, which are known fungal virulence factors.
Abstract: Rhizopus oryzae is the primary cause of mucormycosis, an emerging, life-threatening infection characterized by rapid angioinvasive growth with an overall mortality rate that exceeds 50%. As a representative of the paraphyletic basal group of the fungal kingdom called ‘‘zygomycetes,’’ R. oryzae is also used as a model to study fungal evolution. Here we report the genome sequence of R. oryzae strain 99–880, isolated from a fatal case of mucormycosis. The highly repetitive 45.3 Mb genome assembly contains abundant transposable elements (TEs), comprising approximately 20% of the genome. We predicted 13,895 protein-coding genes not overlapping TEs, many of which are paralogous gene pairs. The order and genomic arrangement of the duplicated gene pairs and their common phylogenetic origin provide evidence for an ancestral whole-genome duplication (WGD) event. The WGD resulted in the duplication of nearly all subunits of the protein complexes associated with respiratory electron transport chains, the V-ATPase, and the ubiquitin–proteasome systems. The WGD, together with recent gene duplications, resulted in the expansion of multiple gene families related to cell growth and signal transduction, as well as secreted aspartic protease and subtilase protein families, which are known fungal virulence factors. The duplication of the ergosterol biosynthetic pathway, especially the major azole target, lanosterol 14ademethylase (ERG11), could contribute to the variable responses of R. oryzae to different azole drugs, including voriconazole and posaconazole. Expanded families of cell-wall synthesis enzymes, essential for fungal cell integrity but absent in mammalian hosts, reveal potential targets for novel and R. oryzae-specific diagnostic and therapeutic treatments.

339 citations


Journal ArticleDOI
TL;DR: The results suggest that Coccidioides species are not soil saprophytes, but that they have evolved to remain associated with their dead animal hosts in soil, and that C Occidioide metabolism genes, membrane-related proteins, and putatively antigenic compounds have evolved in response to interaction with an animal host.
Abstract: While most Ascomycetes tend to associate principally with plants, the dimorphic fungi Coccidioides immitis and Coccidioides posadasii are primary pathogens of immunocompetent mammals, including humans. Infection results from environmental exposure to Coccidiodies, which is believed to grow as a soil saprophyte in arid deserts. To investigate hypotheses about the life history and evolution of Coccidioides, the genomes of several Onygenales, including C. immitis and C. posadasii; a close, nonpathogenic relative, Uncinocarpus reesii; and a more diverged pathogenic fungus, Histoplasma capsulatum, were sequenced and compared with those of 13 more distantly related Ascomycetes. This analysis identified increases and decreases in gene family size associated with a host/substrate shift from plants to animals in the Onygenales. In addition, comparison among Onygenales genomes revealed evolutionary changes in Coccidioides that may underlie its infectious phenotype, the identification of which may facilitate improved treatment and prevention of coccidioidomycosis. Overall, the results suggest that Coccidioides species are not soil saprophytes, but that they have evolved to remain associated with their dead animal hosts in soil, and that Coccidioides metabolism genes, membrane-related proteins, and putatively antigenic compounds have evolved in response to interaction with an animal host.

305 citations


Journal ArticleDOI
02 Sep 2009-PLOS ONE
TL;DR: A pipeline that enables single-cell WGA on hundreds of cells at a time while virtually eliminating non-target DNA from the reactions is described and a post-amplification normalization procedure that mitigates extreme variations in sequencing coverage associated with multiple displacement amplification is developed.
Abstract: Background: Single-cell genome sequencing has the potential to allow the in-depth exploration of the vast genetic diversity found in uncultured microbes. We used the marine cyanobacterium Prochlorococcus as a model system for addressing important challenges facing high-throughput whole genome amplification (WGA) and complete genome sequencing of individual cells. Methodology/Principal Findings: We describe a pipeline that enables single-cell WGA on hundreds of cells at a time while virtually eliminating non-target DNA from the reactions. We further developed a post-amplification normalization procedure that mitigates extreme variations in sequencing coverage associated with multiple displacement amplification (MDA), and demonstrated that the procedure increased sequencing efficiency and facilitated genome assembly. We report genome recovery as high as 99.6% with reference-guided assembly, and 95% with de novo assembly starting from a single cell. We also analyzed the impact of chimera formation during MDA on de novo assembly, and discuss strategies to minimize the presence of incorrectly joined regions in contigs. Conclusions/Significance: The methods describe in this paper will be useful for sequencing genomes of individual cells from a variety of samples.

279 citations


Journal ArticleDOI
TL;DR: The findings suggest that in addition to the duplication of the Francisella Pathogenicity Island, and acquisition of individual loci, adaptation by gene loss in the more recently emerged tularensis, holarctica, and mediasiatica subspecies occurred and was distinct from evolutionary events that differentiated these subspecies, and the novicida subspecies from a common ancestor.
Abstract: Tularemia is a geographically widespread, severely debilitating, and occasionally lethal disease in humans. It is caused by infection by a gram-negative bacterium, Francisella tularensis. In order to better understand its potency as an etiological agent as well as its potential as a biological weapon, we have completed draft assemblies and report the first complete genomic characterization of five strains belonging to the following different Francisella subspecies (subsp.): the F. tularensis subsp. tularensis FSC033, F. tularensis subsp. holarctica FSC257 and FSC022, and F. tularensis subsp. novicida GA99-3548 and GA99-3549 strains. Here, we report the sequencing of these strains and comparative genomic analysis with recently available public Francisella sequences, including the rare F. tularensis subsp. mediasiatica FSC147 strain isolate from the Central Asian Region. We report evidence for the occurrence of large-scale rearrangement events in strains of the holarctica subspecies, supporting previous proposals that further phylogenetic subdivisions of the Type B clade are likely. We also find a significant enrichment of disrupted or absent ORFs proximal to predicted breakpoints in the FSC022 strain, including a genetic component of the Type I restriction-modification defense system. Many of the pseudogenes identified are also disrupted in the closely related rarely human pathogenic F. tularensis subsp. mediasiatica FSC147 strain, including modulator of drug activity B (mdaB) (FTT0961), which encodes a known NADPH quinone reductase involved in oxidative stress resistance. We have also identified genes exhibiting sequence similarity to effectors of the Type III (T3SS) and components of the Type IV secretion systems (T4SS). One of the genes, msrA2 (FTT1797c), is disrupted in F. tularensis subsp. mediasiatica and has recently been shown to mediate bacterial pathogen survival in host organisms. Our findings suggest that in addition to the duplication of the Francisella Pathogenicity Island, and acquisition of individual loci, adaptation by gene loss in the more recently emerged tularensis, holarctica, and mediasiatica subspecies occurred and was distinct from evolutionary events that differentiated these subspecies, and the novicida subspecies, from a common ancestor. Our findings are applicable to future studies focused on variations in Francisella subspecies pathogenesis, and of broader interest to studies of genomic pathoadaptation in bacteria.

132 citations


Journal ArticleDOI
TL;DR: VAAL detected ∼98% of differences (including large insertion-deletions) between pairs of strains from three species while calling no false positives, identifying an antibiotic's site of action by identifying sequence differences between drug-sensitive strains and drug-resistant derivatives.
Abstract: This variant ascertainment algorithm, or VAAL, uses short sequence reads of haploid bacterial genomes to first locally assemble the reads and then compare these assemblies to the reference genome. This allows VAAL to detect all types of variants ranging from single-nucleotide polymorphisms to large insertions or deletions. Our variant ascertainment algorithm, VAAL, uses massively parallel DNA sequence data to identify differences between bacterial genomes with high sensitivity and specificity. VAAL detected ∼98% of differences (including large insertion-deletions) between pairs of strains from three species while calling no false positives. VAAL also pinpointed a single mutation between Vibrio cholerae genomes, identifying an antibiotic's site of action by identifying sequence differences between drug-sensitive strains and drug-resistant derivatives.

71 citations


Journal ArticleDOI
01 Feb 2009-Genetics
TL;DR: A set of single nucleotide polymorphisms (SNPs) between the reference Neurospora crassa strain Oak Ridge and the Mauriceville strain, of sufficient density to allow fine mapping of most loci, are discovered and validated.
Abstract: We report the discovery and validation of a set of single nucleotide polymorphisms (SNPs) between the reference Neurospora crassa strain Oak Ridge and the Mauriceville strain (FGSC 2555), of sufficient density to allow fine mapping of most loci. Sequencing of Mauriceville cDNAs and alignment to the completed genomic sequence of the Oak Ridge strain identified 19,087 putative SNPs. Of these, a subset was validated by cleaved amplified polymorphic sequence (CAPS), a simple and robust PCR-based assay that reliably distinguishes between SNP alleles. Experimental confirmation resulted in the development of 250 CAPS markers distributed evenly over the genome. To demonstrate the applicability of this map, we used bulked segregant analysis followed by interval mapping to locate the csp-1 mutation to a narrow region on LGI. Subsequently, we refined mapping resolution to 74 kbp by developing additional markers, resequenced the candidate gene, NCU02713.3, in the mutant background, and phenocopied the mutation by gene replacement in the WT strain. Together, these techniques demonstrate a generally applicable and straightforward approach for the isolation of novel genes from existing mutants. Data on both putative and validated SNPs are deposited in a customized public database at the Broad Institute, which encourages augmentation by community users.

59 citations


Journal ArticleDOI
TL;DR: Ultra-deep sequencing of full-length HIV-1 genomes identifies rapid viral evolution during acute infection.
Abstract: Open Access Poster presentation P09-20 LB. Ultra-deep sequencing of full-length HIV-1 genomes identifies rapid viral evolution during acute infection MR Henn1, C Boutwell3, N Lennon1, K Power3, C Malboeuf1, P Charlebois1, A Gladden3, J Levin1, M Casali1, L Philips3, A Berlin1, A Berical3, R Erlich1, S Anderson1, H Streeck3, M Kemper3, E Ryan1, Y Wang3, L Green1, K Axten3, Z Brumme3, C Brumme3, C Russ1, E Rosenberg3, H Jessen2, M Altfeld3, C Nusbaum1, B Walker3, B Birren1 and TM Allen*3

7 citations