scispace - formally typeset
Open AccessJournal ArticleDOI

Ecological Patterns of nifH Genes in Four Terrestrial Climatic Zones Explored with Targeted Metagenomics Using FrameBot, a New Informatics Tool

TLDR
To accurately detect and correct frameshifts caused by indel sequencing errors, FrameBot was developed, a tool for frameshift correction and nearest-neighbor classification, and its accuracy was compared to that of two other rapid frameshIFT correction tools.
Abstract
Biological nitrogen fixation is an important component of sustainable soil fertility and a key component of the nitrogen cycle. We used targeted metagenomics to study the nitrogen fixation-capable terrestrial bacterial community by targeting the gene for nitrogenase reductase ( nifH ). We obtained 1.1 million nifH 454 amplicon sequences from 222 soil samples collected from 4 National Ecological Observatory Network (NEON) sites in Alaska, Hawaii, Utah, and Florida. To accurately detect and correct frameshifts caused by indel sequencing errors, we developed FrameBot, a tool for frameshift correction and nearest-neighbor classification, and compared its accuracy to that of two other rapid frameshift correction tools. We found FrameBot was, in general, more accurate as long as a reference protein sequence with 80% or greater identity to a query was available, as was the case for virtually all nifH reads for the 4 NEON sites. Frameshifts were present in 12.7% of the reads. Those nifH sequences related to the Proteobacteria phylum were most abundant, followed by those for Cyanobacteria in the Alaska and Utah sites. Predominant genera with nifH sequences similar to reads included Azospirillum , Bradyrhizobium , and Rhizobium , the latter two without obvious plant hosts at the sites. Surprisingly, 80% of the sequences had greater than 95% amino acid identity to known nifH gene sequences. These samples were grouped by site and correlated with soil environmental factors, especially drainage, light intensity, mean annual temperature, and mean annual precipitation. FrameBot was tested successfully on three ecofunctional genes but should be applicable to any. IMPORTANCE High-throughput phylogenetic analysis of microbial communities using rRNA-targeted sequencing is now commonplace; however, such data often allow little inference with respect to either the presence or the diversity of genes involved in most important ecological processes. To study the gene pool for these processes, it is more straightforward to assess the genes directly responsible for the ecological function (ecofunctional genes). However, analyzing these genes involves technical challenges beyond those seen for rRNA. In particular, frameshift errors cause garbled downstream protein translations. Our FrameBot tool described here both corrects frameshift errors in query reads and determines their closest matching protein sequences in a set of reference sequences. We validated this new tool with sequences from defined communities and demonstrated the tool’s utility on nifH gene fragments sequenced from soils in well-characterized and major terrestrial ecosystem types.

read more

Citations
More filters
Journal ArticleDOI

Optimizing taxonomic classification of marker-gene amplicon sequences with QIIME 2’s q2-feature-classifier plugin

TL;DR: The results illustrate the importance of parameter tuning for optimizing classifier performance, and the recommendations regarding parameter choices for these classifiers under a range of standard operating conditions are made.
Journal ArticleDOI

FunGene: the functional gene pipeline and repository

TL;DR: The Functional Gene Pipeline and Repository offers databases of many common ecofunctional genes and proteins, as well as integrated tools that allow researchers to browse these collections and choose subsets for further analysis, build phylogenetic trees, test primers and probes for coverage, and download aligned sequences.
Journal ArticleDOI

Microbial functional genes involved in nitrogen fixation, nitrification and denitrification in forest ecosystems

TL;DR: In this article, the abundance and community structure of functional genes involved in the biogeochemical cycling of N in forest soils offers an approach to directly link microbial groups to soil characteristics and ecosystem processes.
Journal ArticleDOI

Colonic Butyrate-Producing Communities in Humans: an Overview Using Omics Data.

TL;DR: It is demonstrated that butyrate producers establish themselves within the first year of life and display high abundances in adults regardless of origin and results from longitudinal analyses propose that diversity supports functional stability during ordinary life disturbances and during interventions such as antibiotic treatment.
Journal ArticleDOI

Uncovering the trimethylamine-producing bacteria of the human gut microbiota

TL;DR: A diagnostic framework was developed that enabled the quantification and comprehensive characterization of the TMA-producing potential in human fecal samples and provides crucial information for the development of specific treatment strategies to restrain TMA producers and limit their proliferation.
References
More filters
Journal ArticleDOI

Gene-targeted-metagenomics reveals extensive diversity of aromatic dioxygenase genes in the environment.

TL;DR: Gene-targeted metagenomics and pyrosequencing to aromatic dioxygenase genes to obtain greater sequence depth than possible by other methods and provides deeper insights into genes potentially important in environmental processes to better understand their ecology, functional differences and evolutionary origins.
Journal ArticleDOI

Moss-cyanobacteria associations as biogenic sources of nitrogen in boreal forest ecosystems

TL;DR: Reports on the drivers of N2 fixation in feather moss-cyanobacteria associations are synthesized to gain a deeper understanding of their role for ecosystem-N-cycling and the fate of the fixed N2 in mosses remains unresolved.
Journal ArticleDOI

Three distinct clades of cultured heterocystous cyanobacteria constitute the dominant N2-fixing members of biological soil crusts of the Colorado Plateau, USA.

TL;DR: Of the nifH sequence-types detected in soil crusts of the Colorado Plateau, 89% were most closely related to n ifH signature sequences from cyanobacteria of the order Nostocales, and two clades are well-represented by phylogenetically and morphologically coherent strains, corresponding to the descriptions of Nostoc commune and Scytonema hyalinum.
Journal ArticleDOI

Photosynthetic bradyrhizobia from Aeschynomene spp. are specific to stem-nodulated species and form a separate 16S ribosomal DNA restriction fragment length polymorphism group.

TL;DR: It is concluded that photosynthetic Aeschynomene nodule isolates share the ability to nodulate particular stem-nodulated species and form a separate subbranch on the Bradyrhizobium rRNA lineage, distinct from B. japonicum and B. elkanii.
Journal ArticleDOI

HMM-FRAME: accurate protein domain classification for metagenomic sequences containing frameshift errors

TL;DR: HMM-FRAME provides a complementary protein domain classification tool to conventional profile HMM-based methods for data sets containing frameshifts and achieved high error detection sensitivity and specificity in a data set with annotated errors.
Related Papers (5)