scispace - formally typeset
Search or ask a question
Journal ArticleDOI

Every base matters: assessing small subunit rRNA primers for marine microbiomes with mock communities, time series and global field samples

01 May 2016-Environmental Microbiology (Environ Microbiol)-Vol. 18, Iss: 5, pp 1403-1414
TL;DR: It is shown that beyond in silico predictions, testing with mock communities and field samples is important in primer selection, and a single mismatch can strongly bias amplification, but even perfectly matched primers can exhibit preferential amplification.
Abstract: Summary Microbial community analysis via high-throughput sequencing of amplified 16S rRNA genes is an essential microbiology tool. We found the popular primer pair 515F (515F-C) and 806R greatly underestimated (e.g. SAR11) or overestimated (e.g. Gammaproteobacteria) common marine taxa. We evaluated marine samples and mock communities (containing 11 or 27 marine 16S clones), showing alternative primers 515F-Y (5′-GTGYCAGCMGCCGCGGTAA) and 926R (5′-CCGYCAATTYMTTTRAGTTT) yield more accurate estimates of mock community abundances, produce longer amplicons that can differentiate taxa unresolvable with 515F-C/806R, and amplify eukaryotic 18S rRNA. Mock communities amplified with 515F-Y/926R yielded closer observed community composition versus expected (r2 = 0.95) compared with 515F-Y/806R (r2 ∼ 0.5). Unexpectedly, biases with 515F-Y/806R against SAR11 in field samples (∼4–10-fold) were stronger than in mock communities (∼2-fold). Correcting a mismatch to Thaumarchaea in the 515F-C increased their apparent abundance in field samples, but not as much as using 926R rather than 806R. With plankton samples rich in eukaryotic DNA (> 1 μm size fraction), 18S sequences averaged ∼17% of all sequences. A single mismatch can strongly bias amplification, but even perfectly matched primers can exhibit preferential amplification. We show that beyond in silico predictions, testing with mock communities and field samples is important in primer selection.
Citations
More filters
Journal ArticleDOI
01 Nov 2017-Nature
TL;DR: A meta-analysis of microbial community samples collected by hundreds of researchers for the Earth Microbiome Project is presented, creating both a reference database giving global context to DNA sequence data and a framework for incorporating data from future studies, fostering increasingly complete characterization of Earth’s microbial diversity.
Abstract: Our growing awareness of the microbial world’s importance and diversity contrasts starkly with our limited understanding of its fundamental structure. Despite recent advances in DNA sequencing, a lack of standardized protocols and common analytical frameworks impedes comparisons among studies, hindering the development of global inferences about microbial life on Earth. Here we present a meta-analysis of microbial community samples collected by hundreds of researchers for the Earth Microbiome Project. Coordinated protocols and new analytical methods, particularly the use of exact sequences instead of clustered operational taxonomic units, enable bacterial and archaeal ribosomal RNA gene sequences to be followed across multiple studies and allow us to explore patterns of diversity at an unprecedented scale. The result is both a reference database giving global context to DNA sequence data and a framework for incorporating data from future studies, fostering increasingly complete characterization of Earth’s microbial diversity.

1,676 citations

Journal ArticleDOI
25 Feb 2016
TL;DR: Modification of modified 16S rRNA gene and internal transcribed spacer (ITS) primers for archaea/bacteria and fungi with nonaquatic samples demonstrated that two recently modified primer pairs that target taxonomically discriminatory regions of bacterial and fungal genomic DNA do not introduce new biases when used on a variety of sample types.
Abstract: Designing primers for PCR-based taxonomic surveys that amplify a broad range of phylotypes in varied community samples is a difficult challenge, and the comparability of data sets amplified with varied primers requires attention. Here, we examined the performance of modified 16S rRNA gene and internal transcribed spacer (ITS) primers for archaea/bacteria and fungi, respectively, with nonaquatic samples. We moved primer bar codes to the 5' end, allowing for a range of different 3' primer pairings, such as the 515f/926r primer pair, which amplifies variable regions 4 and 5 of the 16S rRNA gene. We additionally demonstrated that modifications to the 515f/806r (variable region 4) 16S primer pair, which improves detection of Thaumarchaeota and clade SAR11 in marine samples, do not degrade performance on taxa already amplified effectively by the original primer set. Alterations to the fungal ITS primers did result in differential but overall improved performance compared to the original primers. In both cases, the improved primers should be widely adopted for amplicon studies. IMPORTANCE We continue to uncover a wealth of information connecting microbes in important ways to human and environmental ecology. As our scientific knowledge and technical abilities improve, the tools used for microbiome surveys can be modified to improve the accuracy of our techniques, ensuring that we can continue to identify groundbreaking connections between microbes and the ecosystems they populate, from ice caps to the human body. It is important to confirm that modifications to these tools do not cause new, detrimental biases that would inhibit the field rather than continue to move it forward. We therefore demonstrated that two recently modified primer pairs that target taxonomically discriminatory regions of bacterial and fungal genomic DNA do not introduce new biases when used on a variety of sample types, from soil to human skin. This confirms the utility of these primers for maintaining currently recommended microbiome research techniques as the state of the art.

1,222 citations


Cites background from "Every base matters: assessing small..."

  • ...(4) 806r Modified GGACTACNVGGGTWTCTAAT Apprill et al....

    [...]

  • ...Additionally, further analyses, such as with mock communities (4), could evaluate the extent to which differences in these outliers stem from systematic biases of particular taxa....

    [...]

  • ...(4) ITS1f CTTGGTCATTTAGAGGAAGTAA Gardes and Bruns (6) ITS2 GCTGCGTTCTTCATCGATGC White et al....

    [...]

  • ...(4) demonstrated improved phylogenetic resolution using the 515f-926r construct; additionally, the percentage of eukaryotic taxa amplified by this construct indicated that in marine or other non-host-associated environments, this construct may be a particularly attractive choice for producing amplicons from all three domains....

    [...]

  • ...Specifically, additional degeneracy was added to the 515f primer to reduce bias against Crenarchaeota/Thaumarchaeota (4) and to the 806r primer to minimize the bias against the SAR11 clade (5)....

    [...]

Journal ArticleDOI
TL;DR: In this short review, key information from the literature pertaining to each processing step is described, and consequently, general recommendations for future 16S rRNA gene metabarcoding experiments are made.
Abstract: The development and continuous improvement of high-throughput sequencing platforms have stimulated interest in the study of complex microbial communities Currently, the most popular sequencing approach to study microbial community composition and dynamics is targeted 16S rRNA gene metabarcoding To prepare samples for sequencing, there are a variety of processing steps, each with the potential to introduce bias at the data analysis stage In this short review, key information from the literature pertaining to each processing step is described, and consequently, general recommendations for future 16S rRNA gene metabarcoding experiments are made

382 citations


Cites background from "Every base matters: assessing small..."

  • ...the sampling and library preparation processes to be identified (42, 47, 59, 60)....

    [...]

Journal ArticleDOI
TL;DR: Suggestions for optimizing experimental design and avoiding known pitfalls are presented, organized in the typical order in which studies are carried out, to help researchers in this exciting field advance their studies efficiently while avoiding errors.
Abstract: Research on the human microbiome has yielded numerous insights into health and disease, but also has resulted in a wealth of experimental artifacts. Here, we present suggestions for optimizing experimental design and avoiding known pitfalls, organized in the typical order in which studies are carried out. We first review best practices in experimental design and introduce common confounders such as age, diet, antibiotic use, pet ownership, longitudinal instability, and microbial sharing during cohousing in animal studies. Typically, samples will need to be stored, so we provide data on best practices for several sample types. We then discuss design and analysis of positive and negative controls, which should always be run with experimental samples. We introduce a convenient set of non-biological DNA sequences that can be useful as positive controls for high-volume analysis. Careful analysis of negative and positive controls is particularly important in studies of samples with low microbial biomass, where contamination can comprise most or all of a sample. Lastly, we summarize approaches to enhancing experimental robustness by careful control of multiple comparisons and to comparing discovery and validation cohorts. We hope the experimental tactics summarized here will help researchers in this exciting field advance their studies efficiently while avoiding errors.

374 citations


Cites background from "Every base matters: assessing small..."

  • ...Many studies have used positive controls comprised of mixtures of cultured organisms (“mock communities”) [23, 96, 111] or known mixtures of free DNA (“mock DNA” samples) [88, 112, 113], both of which make useful controls....

    [...]

Journal ArticleDOI
TL;DR: The potential of eDNA to inform on the breadth of biodiversity present in a tropical marine environment is investigated, and the sensitivity and low cost of e DNA metabarcoding are advocated, urging this approach to be rapidly integrated into biomonitoring programs.
Abstract: Effective marine management requires comprehensive data on the status of marine biodiversity. However, efficient methods that can document biodiversity in our oceans are currently lacking. Environmental DNA (eDNA) sourced from seawater offers a new avenue for investigating the biota in marine ecosystems. Here, we investigated the potential of eDNA to inform on the breadth of biodiversity present in a tropical marine environment. Directly sequencing eDNA from seawater using a shotgun approach resulted in only 0.34% of 22.3 million reads assigning to eukaryotes, highlighting the inefficiency of this method for assessing eukaryotic diversity. In contrast, using ‘tree of life’ (ToL) metabarcoding and 20-fold fewer sequencing reads, we could detect 287 families across the major divisions of eukaryotes. Our data also show that the best performing ‘universal’ PCR assay recovered only 44% of the eukaryotes identified across all assays, highlighting the need for multiple metabarcoding assays to catalogue biodiversity. Lastly, focusing on the fish genus Lethrinus, we recovered intra- and inter-specific haplotypes from seawater samples, illustrating that eDNA can be used to explore diversity beyond taxon identifications. Given the sensitivity and low cost of eDNA metabarcoding we advocate this approach be rapidly integrated into biomonitoring programs.

315 citations

References
More filters
Journal ArticleDOI
TL;DR: The sensitivity of the commonly used progressive multiple sequence alignment method has been greatly improved and modifications are incorporated into a new program, CLUSTAL W, which is freely available.
Abstract: The sensitivity of the commonly used progressive multiple sequence alignment method has been greatly improved for the alignment of divergent protein sequences. Firstly, individual weights are assigned to each sequence in a partial alignment in order to down-weight near-duplicate sequences and up-weight the most divergent ones. Secondly, amino acid substitution matrices are varied at different alignment stages according to the divergence of the sequences to be aligned. Thirdly, residue-specific gap penalties and locally reduced gap penalties in hydrophilic regions encourage new gaps in potential loop regions rather than regular secondary structure. Fourthly, positions in early alignments where gaps have been opened receive locally reduced gap penalties to encourage the opening up of new gaps at these positions. These modifications are incorporated into a new program, CLUSTAL W which is freely available.

63,427 citations

Journal ArticleDOI
TL;DR: An advanced version of the Molecular Evolutionary Genetics Analysis software, which currently contains facilities for building sequence alignments, inferring phylogenetic histories, and conducting molecular evolutionary analysis, is released, which enables the inference of timetrees, as it implements the RelTime method for estimating divergence times for all branching points in a phylogeny.
Abstract: We announce the release of an advanced version of the Molecular Evolutionary Genetics Analysis (MEGA) software, which currently contains facilities for building sequence alignments, inferring phylogenetic histories, and conducting molecular evolutionary analysis. In version 6.0, MEGA now enables the inference of timetrees, as it implements the RelTime method for estimating divergence times for all branching points in a phylogeny. A new Timetree Wizard in MEGA6 facilitates this timetree inference by providing a graphical user interface (GUI) to specify the phylogeny and calibration constraints step-by-step. This version also contains enhanced algorithms to search for the optimal trees under evolutionary criteria and implements a more advanced memory management that can double the size of sequence data sets to which MEGA can be applied. Both GUI and command-line versions of MEGA6 can be downloaded from www.megasoftware.net free of charge.

37,956 citations

01 Jan 2013
TL;DR: The Molecular Evolutionary Genetics Analysis (MEGA) software as discussed by the authors provides facilities for building sequence alignments, inferring phylogenetic histories, and conducting molecular evolutionary analysis, including the inference of timetrees.
Abstract: We announce the release of an advanced version of the Molecular Evolutionary Genetics Analysis (MEGA) software, which currently contains facilities for building sequence alignments, inferring phylogenetic histories, and conducting molecular evolutionary analysis. In version 6.0, MEGA now enables the inference of timetrees, as it implements the RelTime method for estimating divergence times for all branching points in a phylogeny. A new Timetree Wizard in MEGA6 facilitates this timetree inference by providing a graphical user interface (GUI) to specify the phylogeny and calibration constraints step-by-step. This version also contains enhanced algorithms to search for the optimal trees under evolutionary criteria and implements a more advanced memory management that can double the size of sequence data sets to which MEGA can be applied. Both GUI and command-line versions of MEGA6 can be downloaded from www. megasoftware.net free of charge.

30,478 citations

Journal ArticleDOI
TL;DR: An overview of the analysis pipeline and links to raw data and processed output from the runs with and without denoising are provided.
Abstract: Supplementary Figure 1 Overview of the analysis pipeline. Supplementary Table 1 Details of conventionally raised and conventionalized mouse samples. Supplementary Discussion Expanded discussion of QIIME analyses presented in the main text; Sequencing of 16S rRNA gene amplicons; QIIME analysis notes; Expanded Figure 1 legend; Links to raw data and processed output from the runs with and without denoising.

28,911 citations

Journal ArticleDOI
TL;DR: The extensively curated SILVA taxonomy and the new non-redundant SILVA datasets provide an ideal reference for high-throughput classification of data from next-generation sequencing approaches.
Abstract: SILVA (from Latin silva, forest, http://www.arb-silva.de) is a comprehensive web resource for up to date, quality-controlled databases of aligned ribosomal RNA (rRNA) gene sequences from the Bacteria, Archaea and Eukaryota domains and supplementary online services. The referred database release 111 (July 2012) contains 3 194 778 small subunit and 288 717 large subunit rRNA gene sequences. Since the initial description of the project, substantial new features have been introduced, including advanced quality control procedures, an improved rRNA gene aligner, online tools for probe and primer evaluation and optimized browsing, searching and downloading on the website. Furthermore, the extensively curated SILVA taxonomy and the new non-redundant SILVA datasets provide an ideal reference for high-throughput classification of data from next-generation sequencing approaches.

18,256 citations

Related Papers (5)