DADA2: High-resolution sample inference from Illumina amplicon data

doi:10.1038/NMETH.3869

Open AccessJournal ArticleDOI

DADA2: High-resolution sample inference from Illumina amplicon data

Benjamin J. Callahan, +5 more

- 01 Jul 2016 -

Nature Methods

- Vol. 13, Iss: 7, pp 581-583

TLDR

The open-source software package DADA2 for modeling and correcting Illumina-sequenced amplicon errors is presented, revealing a diversity of previously undetected Lactobacillus crispatus variants.

Abstract:

We present the open-source software package DADA2 for modeling and correcting Illumina-sequenced amplicon errors (https://github.com/benjjneb/dada2). DADA2 infers sample sequences exactly and resolves differences of as little as 1 nucleotide. In several mock communities, DADA2 identified more real variants and output fewer spurious sequences than other methods. We applied DADA2 to vaginal samples from a cohort of pregnant women, revealing a diversity of previously undetected Lactobacillus crispatus variants.

Citations

PDF

Open Access

More filters

Journal ArticleDOI

Reproducible, interactive, scalable and extensible microbiome data science using QIIME 2

Evan Bolyen, +123 more

- 01 Aug 2019 -

Nature Biotechnology

TL;DR: QIIME 2 development was primarily funded by NSF Awards 1565100 to J.G.C. and R.K.P. and partial support was also provided by the following: grants NIH U54CA143925 and U54MD012388.

...read moreread less

“Bioinformatics” 특집을 내면서

장병탁, +2 more

TL;DR: Assessment of medical technology in the context of commercialization with Bioentrepreneur course, which addresses many issues unique to biomedical products.

...read moreread less

Journal ArticleDOI

Optimizing taxonomic classification of marker-gene amplicon sequences with QIIME 2’s q2-feature-classifier plugin

Nicholas A. Bokulich, +7 more

- 17 May 2018 -

Microbiome

TL;DR: The results illustrate the importance of parameter tuning for optimizing classifier performance, and the recommendations regarding parameter choices for these classifiers under a range of standard operating conditions are made.

...read moreread less

Journal ArticleDOI

Exact sequence variants should replace operational taxonomic units in marker-gene data analysis.

Benjamin J. Callahan, +2 more

- 21 Jul 2017 -

The ISME Journal

TL;DR: It is argued that the improvements in reusability, reproducibility and comprehensiveness are sufficiently great that ASVs should replace OTUs as the standard unit of marker-gene analysis and reporting.

...read moreread less

Journal ArticleDOI

Simple statistical identification and removal of contaminant sequences in marker-gene and metagenomics data.

Nicole M Davis, +6 more

- 17 Dec 2018 -

Microbiome

TL;DR: The application of decontam to two recently published datasets corroborated and extended their conclusions that little evidence existed for an indigenous placenta microbiome and that some low-frequency taxa seemingly associated with preterm birth were contaminants.

...read moreread less

Collapse

References

PDF

Open Access

More filters

Journal ArticleDOI

Removing noise from pyrosequenced amplicons.

Christopher Quince, +3 more

- 28 Jan 2011 -

BMC Bioinformatics

TL;DR: AmpliconNoise is a development of the PyroNoise algorithm that is capable of separately removing 454 sequencing errors and PCR single base errors and a novel chimera removal program, Perseus, that exploits the sequence abundances associated with pyrosequencing data.

...read moreread less

Journal ArticleDOI

Error filtering, pair assembly and error correction for next-generation sequencing reads

Robert C. Edgar, +1 more

- 01 Nov 2015 -

Bioinformatics

TL;DR: This work demonstrates large reductions in error frequencies, especially for high-error-rate reads, by three independent means: filtering reads according to their expected number of errors, assembling overlapping read pairs and by exploiting unique sequence abundances to perform error correction.

...read moreread less

Journal ArticleDOI

Rapidly denoising pyrosequencing amplicon reads by exploiting rank-abundance distributions.

Jens Reeder, +1 more

- 01 Sep 2010 -

Nature Methods

TL;DR: A fast method for denoising pyrosequencing for community 16S rRNA analysis is developed and a 2–4 fold reduction in the number of observed OTUs is observed comparing denoised with non-denoised data.

...read moreread less

Journal ArticleDOI

Insight into biases and sequencing errors for amplicon sequencing with the Illumina MiSeq platform

Melanie Schirmer, +5 more

- 31 Mar 2015 -

Nucleic Acids Research

TL;DR: A large study on the error patterns for the MiSeq based on 16S rRNA amplicon sequencing data is conducted and it is shown that the library preparation method and the choice of primers are the most significant sources of bias and cause distinct error patterns.

...read moreread less

Journal ArticleDOI

Minimum entropy decomposition: unsupervised oligotyping for sensitive partitioning of high-throughput marker gene sequences.

A. Murat Eren, +5 more

- 17 Mar 2015 -

The ISME Journal

TL;DR: Minimum Entropy Decomposition (MED) provides a computationally efficient means to partition marker gene datasets into ‘MED nodes’, which represent homogeneous operational taxonomic units and enables sensitive discrimination of closely related organisms in marker gene amplicon datasets without relying on extensive computational heuristics and user supervision.

...read moreread less

Related Papers (5)

phyloseq: an R package for reproducible interactive analysis and graphics of microbiome census data.

Paul J. McMurdie, +1 more

- 22 Apr 2013 -

PLOS ONE

The SILVA ribosomal RNA gene database project: improved data processing and web-based tools

Christian Quast, +7 more

- 28 Nov 2012 -

Nucleic Acids Research

Cutadapt removes adapter sequences from high-throughput sequencing reads

Marcel Martin

- 02 May 2011 -

EMBnet.journal

Moderated estimation of fold change and dispersion for RNA-seq data with DESeq2

Michael I. Love, +3 more

- 05 Dec 2014 -

Genome Biology

DADA2: High-resolution sample inference from Illumina amplicon data

Citations

Reproducible, interactive, scalable and extensible microbiome data science using QIIME 2

“Bioinformatics” 특집을 내면서

Optimizing taxonomic classification of marker-gene amplicon sequences with QIIME 2’s q2-feature-classifier plugin

Exact sequence variants should replace operational taxonomic units in marker-gene data analysis.

Simple statistical identification and removal of contaminant sequences in marker-gene and metagenomics data.

References

Removing noise from pyrosequenced amplicons.

Error filtering, pair assembly and error correction for next-generation sequencing reads

Rapidly denoising pyrosequencing amplicon reads by exploiting rank-abundance distributions.

Insight into biases and sequencing errors for amplicon sequencing with the Illumina MiSeq platform

Minimum entropy decomposition: unsupervised oligotyping for sensitive partitioning of high-throughput marker gene sequences.

Related Papers (5)

phyloseq: an R package for reproducible interactive analysis and graphics of microbiome census data.

The SILVA ribosomal RNA gene database project: improved data processing and web-based tools

QIIME allows analysis of high-throughput community sequencing data.

Cutadapt removes adapter sequences from high-throughput sequencing reads

Moderated estimation of fold change and dispersion for RNA-seq data with DESeq2

Trending Questions (2)