Open Access Journal Article

pplacer: linear time maximum-likelihood and Bayesian phylogenetic placement of sequences onto a fixed reference tree

TL;DR
Pplacer is a software package for phylogenetic placement and subsequent visualization. It can place twenty thousand short reads on a reference tree of one thousand taxa per hour per processor, and it is easy to run in parallel.
Abstract
Likelihood-based phylogenetic inference is generally considered to be the most reliable classification method for unknown sequences. However, traditional likelihood-based phylogenetic methods cannot be applied to large volumes of short reads from next-generation sequencing due to computational complexity issues and lack of phylogenetic signal. "Phylogenetic placement," where a reference tree is fixed and the unknown query sequences are placed onto the tree via a reference alignment, is a way to bring the inferential power offered by likelihood-based approaches to large data sets. This paper introduces pplacer, a software package for phylogenetic placement and subsequent visualization. The algorithm can place twenty thousand short reads on a reference tree of one thousand taxa per hour per processor, has essentially linear time and memory complexity in the number of reference taxa, and is easy to run in parallel. Pplacer features calculation of the posterior probability of a placement on an edge, which is a statistically rigorous way of quantifying uncertainty on an edge-by-edge basis. It also can inform the user of the positional uncertainty for query sequences by calculating expected distance between placement locations, which is crucial in the estimation of uncertainty with a well-sampled reference tree. The software provides visualizations using branch thickness and color to represent number of placements and their uncertainty. A simulation study using reads generated from 631 COG alignments shows a high level of accuracy for phylogenetic placement over a wide range of alignment diversity, and the power of edge uncertainty estimates to measure placement confidence. Pplacer enables efficient phylogenetic placement and subsequent visualization, making likelihood-based phylogenetics methodology practical for large collections of reads; it is freely available as source code, binaries, and a web service.
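The two uncertainty measures named in the abstract can be illustrated with a small sketch. This is not pplacer's implementation. It assumes each candidate edge already has a maximized log-likelihood for a query, and it normalizes those into per-edge weights as a simpler stand-in for the Bayesian posterior probability, which would additionally integrate over attachment position and branch length under a prior. It then computes a weighted average distance between candidate placement locations. The edge names, log-likelihood values, and distances below are hypothetical.

```python
import math


def normalized_weights(log_likelihoods):
    """Turn per-edge log-likelihoods into weights that sum to 1.

    Subtracting the maximum before exponentiating avoids underflow,
    since phylogenetic log-likelihoods are large negative numbers.
    """
    peak = max(log_likelihoods.values())
    scaled = {edge: math.exp(ll - peak) for edge, ll in log_likelihoods.items()}
    total = sum(scaled.values())
    return {edge: value / total for edge, value in scaled.items()}


def expected_pairwise_distance(weights, distances):
    """Sum of w_a * w_b * d(a, b) over ordered pairs of distinct edges.

    `distances` maps frozenset({a, b}) to the tree distance between the
    placement locations on edges a and b (hypothetical input format).
    """
    total = 0.0
    for a, weight_a in weights.items():
        for b, weight_b in weights.items():
            if a != b:
                total += weight_a * weight_b * distances[frozenset((a, b))]
    return total


# Hypothetical query: two neighboring edges fit equally well, a distant one fits worse.
log_likelihoods = {"edge_3": -1200.0, "edge_4": -1200.0, "edge_9": -1210.0}
distances = {
    frozenset(("edge_3", "edge_4")): 0.02,
    frozenset(("edge_3", "edge_9")): 0.40,
    frozenset(("edge_4", "edge_9")): 0.42,
}

weights = normalized_weights(log_likelihoods)
spread = expected_pairwise_distance(weights, distances)
print(weights)
print(spread)
```

In this toy case the weight is split between two adjacent edges, so no single edge looks confident, yet the expected distance stays small because the candidates sit close together on the tree. That is the situation the abstract points to with a well-sampled reference tree: per-edge confidence alone can overstate how uncertain the placement is.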


Citations
Journal Article

MAFFT Multiple Sequence Alignment Software Version 7: Improvements in Performance and Usability

TL;DR: This version of MAFFT has several new features, including options for adding unaligned sequences into an existing alignment, adjustment of direction in nucleotide alignment, constrained alignment and parallel processing, which were implemented after the previous major update.
Journal Article

CheckM: assessing the quality of microbial genomes recovered from isolates, single cells, and metagenomes

TL;DR: An objective measure of genome quality is proposed that can be used to select genomes suitable for specific gene- and genome-centric analyses of microbial communities and is shown to provide accurate estimates of genome completeness and contamination and to outperform existing approaches.
Journal Article

Interactive Tree Of Life (iTOL) v4: recent updates and new developments.

TL;DR: The current version of iTOL v4 introduces four new dataset types, together with numerous new features, and is the first tool which supports direct visualization of Qiime 2 trees and associated annotations.
Journal Article

Interactive tree of life (iTOL) v3: an online tool for the display and annotation of phylogenetic and other trees

TL;DR: iTOL v3 is the first tool to support direct visualization of the recently proposed phylogenetic placements format, and its account system has been redesigned to simplify the management of trees in user-defined workspaces and projects.
Journal Article

Interactive Tree Of Life (iTOL) v5: an online tool for phylogenetic tree display and annotation.

TL;DR: Interactive Tree Of Life (iTOL) is an online tool for the display, manipulation and annotation of phylogenetic and other trees; it allows users to draw shapes, labels and other features directly onto the trees.
References
Journal Article

Basic Local Alignment Search Tool

TL;DR: A new approach to rapid sequence comparison, basic local alignment search tool (BLAST), directly approximates alignments that optimize a measure of local similarity, the maximal segment pair (MSP) score.
Journal Article

MRBAYES: Bayesian inference of phylogenetic trees

TL;DR: The program MRBAYES performs Bayesian inference of phylogeny using a variant of Markov chain Monte Carlo, and an executable is available at http://brahms.rochester.edu/software.html.
Journal Article

PHYLIP-Phylogeny inference package (Version 3.2)

J. Felsenstein, 01 Jan 1989
Journal Article

A simple, fast, and accurate algorithm to estimate large phylogenies by maximum likelihood.

TL;DR: This work has used extensive and realistic computer simulations to show that the topological accuracy of this new method is at least as high as that of the existing maximum-likelihood programs and much higher than the performance of distance-based and parsimony approaches.
Journal Article

RAxML-VI-HPC: maximum likelihood-based phylogenetic analyses with thousands of taxa and mixed models

TL;DR: RAxML-VI-HPC (randomized axelerated maximum likelihood for high performance computing) is a sequential and parallel program for inference of large phylogenies with maximum likelihood (ML) that has been used to compute ML trees on two of the largest alignments to date.