scispace - formally typeset
Open AccessJournal ArticleDOI

PhyloSift: phylogenetic analysis of genomes and metagenomes.

Reads0
Chats0
TLDR
This work presents an approach to leverage phylogenetic analysis of metagenomic sequence data to conduct phylogeny-driven Bayesian hypothesis tests for the presence of an organism in a sample and applies new tools to analyze the phylogenetic diversity of microbial communities.
Abstract
Like all organisms on the planet, environmental microbes are subject to the forces of molecular evolution. Metagenomic sequencing provides a means to access the DNA sequence of uncultured microbes. By combining DNA sequencing of microbial communities with evolutionary modeling and phylogenetic analysis we might obtain new insights into microbiology and also provide a basis for practical tools such as forensic pathogen detection. In this work we present an approach to leverage phylogenetic analysis of metagenomic sequence data to conduct several types of analysis. First, we present a method to conduct phylogeny-driven Bayesian hypothesis tests for the presence of an organism in a sample. Second, we present a means to compare community structure across a collection of many samples and develop direct associations between the abundance of certain organisms and sample metadata. Third, we apply new tools to analyze the phylogenetic diversity of microbial communities and again demonstrate how this can be associated to sample metadata. These analyses are implemented in an open source software pipeline called PhyloSift. As a pipeline, PhyloSift incorporates several other programs including LAST, HMMER, and pplacer to automate phylogenetic analysis of protein coding and RNA sequences in metagenomic datasets generated by modern sequencing platforms (e.g., Illumina, 454).

read more

Citations
More filters
Journal ArticleDOI

CheckM: assessing the quality of microbial genomes recovered from isolates, single cells, and metagenomes

TL;DR: An objective measure of genome quality is proposed that can be used to select genomes suitable for specific gene- and genome-centric analyses of microbial communities and is shown to provide accurate estimates of genome completeness and contamination and to outperform existing approaches.
Journal ArticleDOI

Minimum information about a single amplified genome (MISAG) and a metagenome-assembled genome (MIMAG) of bacteria and archaea

Robert M. Bowers, +56 more
- 01 Jul 2018 - 
TL;DR: Two standards developed by the Genomic Standards Consortium (GSC) for reporting bacterial and archaeal genome sequences are presented, including the Minimum Information about a Single Amplified Genome (MISAG) and the Minimum information about a Metagenome-Assembled Genomes (MIMAG), including estimates of genome completeness and contamination.
Journal ArticleDOI

UBCG: Up-to-date bacterial core gene set and pipeline for phylogenomic tree reconstruction

TL;DR: This work presents the up-to-date bacterial core gene set, named UBCG, and software suites to accommodate necessary steps to generate and evaluate phylogenetic trees, successfully used to infer phylogenomic relationship of Escherichia and related taxa.
Journal ArticleDOI

Characterization of the Gut Microbiome Using 16S or Shotgun Metagenomics.

TL;DR: The two main approaches for analyzing the microbiome, 16S ribosomal RNA gene amplicons and shotgun metagenomics, are illustrated with analyses of libraries designed to highlight their strengths and weaknesses and several methods for taxonomic classification of bacterial sequences are discussed.
Journal ArticleDOI

An introduction to the analysis of shotgun metagenomic data.

TL;DR: This review describes the analytical strategies and specific tools that can be applied to metagenomic data and the considerations and caveats associated with their use and documents how metagenomes can be analyzed to quantify community structure and diversity.
References
More filters
Journal ArticleDOI

Gapped BLAST and PSI-BLAST: a new generation of protein database search programs.

TL;DR: A new criterion for triggering the extension of word hits, combined with a new heuristic for generating gapped alignments, yields a gapped BLAST program that runs at approximately three times the speed of the original.
Journal ArticleDOI

Ultrafast and memory-efficient alignment of short DNA sequences to the human genome

TL;DR: Bowtie extends previous Burrows-Wheeler techniques with a novel quality-aware backtracking algorithm that permits mismatches and can be used simultaneously to achieve even greater alignment speeds.
Journal ArticleDOI

Naïve Bayesian Classifier for Rapid Assignment of rRNA Sequences into the New Bacterial Taxonomy

TL;DR: The RDP Classifier can rapidly and accurately classify bacterial 16S rRNA sequences into the new higher-order taxonomy proposed in Bergey's Taxonomic Outline of the Prokaryotes, and the majority of the classification errors appear to be due to anomalies in the current taxonomies.
Related Papers (5)