scispace - formally typeset
Open AccessJournal ArticleDOI

Ecological Patterns of nifH Genes in Four Terrestrial Climatic Zones Explored with Targeted Metagenomics Using FrameBot, a New Informatics Tool

TLDR
To accurately detect and correct frameshifts caused by indel sequencing errors, FrameBot was developed, a tool for frameshift correction and nearest-neighbor classification, and its accuracy was compared to that of two other rapid frameshIFT correction tools.
Abstract
Biological nitrogen fixation is an important component of sustainable soil fertility and a key component of the nitrogen cycle. We used targeted metagenomics to study the nitrogen fixation-capable terrestrial bacterial community by targeting the gene for nitrogenase reductase ( nifH ). We obtained 1.1 million nifH 454 amplicon sequences from 222 soil samples collected from 4 National Ecological Observatory Network (NEON) sites in Alaska, Hawaii, Utah, and Florida. To accurately detect and correct frameshifts caused by indel sequencing errors, we developed FrameBot, a tool for frameshift correction and nearest-neighbor classification, and compared its accuracy to that of two other rapid frameshift correction tools. We found FrameBot was, in general, more accurate as long as a reference protein sequence with 80% or greater identity to a query was available, as was the case for virtually all nifH reads for the 4 NEON sites. Frameshifts were present in 12.7% of the reads. Those nifH sequences related to the Proteobacteria phylum were most abundant, followed by those for Cyanobacteria in the Alaska and Utah sites. Predominant genera with nifH sequences similar to reads included Azospirillum , Bradyrhizobium , and Rhizobium , the latter two without obvious plant hosts at the sites. Surprisingly, 80% of the sequences had greater than 95% amino acid identity to known nifH gene sequences. These samples were grouped by site and correlated with soil environmental factors, especially drainage, light intensity, mean annual temperature, and mean annual precipitation. FrameBot was tested successfully on three ecofunctional genes but should be applicable to any. IMPORTANCE High-throughput phylogenetic analysis of microbial communities using rRNA-targeted sequencing is now commonplace; however, such data often allow little inference with respect to either the presence or the diversity of genes involved in most important ecological processes. To study the gene pool for these processes, it is more straightforward to assess the genes directly responsible for the ecological function (ecofunctional genes). However, analyzing these genes involves technical challenges beyond those seen for rRNA. In particular, frameshift errors cause garbled downstream protein translations. Our FrameBot tool described here both corrects frameshift errors in query reads and determines their closest matching protein sequences in a set of reference sequences. We validated this new tool with sequences from defined communities and demonstrated the tool’s utility on nifH gene fragments sequenced from soils in well-characterized and major terrestrial ecosystem types.

read more

Citations
More filters
Journal ArticleDOI

Hypoxia and Inactivity Related Physiological Changes (Constipation, Inflammation) Are Not Reflected at the Level of Gut Metabolites and Butyrate Producing Microbial Community: The PlanHab Study

TL;DR: The observed progressive increase in two immunological intestinal markers suggested that the transition from healthy physiological state toward the developed symptoms of low magnitude obesity-related syndromes was primarily driven by the onset of inactivity (lack of exercise in NBR) and significantly alleviated by exercise, despite hypoxia (HAmb).
Journal ArticleDOI

Diazotrophic microbial community and abundance in acidic subtropical natural and re-vegetated forest soils revealed by high-throughput sequencing of nifH gene

TL;DR: In this article, the nifH gene was chosen to detect the diazotrophic microorganisms with high-throughput sequencing from five acidic forest soils, including three natural forests and two re-vegetated forests.
Journal ArticleDOI

Identification of Syntrophobacteraceae as major acetate-degrading sulfate reducing bacteria in Italian paddy soil.

TL;DR: Syntrophobacteraceae affiliated species were identified as the major acetotrophic sulfate reducers (SRB) in Italian paddy soil, and the identification of these SRB as dominant acetate degraders well explained the scenarios of competition between SRB and acetoclastic methanogens as observed in rice paddy soils.
Journal ArticleDOI

Reduction in Methane Emissions From Acidified Dairy Slurry Is Related to Inhibition of Methanosarcina Species.

TL;DR: The results suggest that reduction of CH4 missions achieved by acidification was due to an inhibition of the growth and activity of Methanosarcina species.
Journal ArticleDOI

Distribution of bacterial polycyclic aromatic hydrocarbon (PAH) ring-hydroxylating dioxygenases genes in oilfield soils and mangrove sediments explored by gene-targeted metagenomics

TL;DR: In this study, gene-targeted metagenomics was first used to investigate the diversity of PAH-degrading bacterial communities in oilfield soils and mangrove sediments, and showed that higher diversity ofPAH- Degrading bacteria in the studied samples was revealed by gene- targeted meetagenomics than traditional clone library analysis.
References
More filters
Journal ArticleDOI

Basic Local Alignment Search Tool

TL;DR: A new approach to rapid sequence comparison, basic local alignment search tool (BLAST), directly approximates alignments that optimize a measure of local similarity, the maximal segment pair (MSP) score.
Journal ArticleDOI

The Pfam protein families database

TL;DR: The definition and use of family-specific, manually curated gathering thresholds are explained and some of the features of domains of unknown function (also known as DUFs) are discussed, which constitute a rapidly growing class of families within Pfam.
Journal ArticleDOI

A general method applicable to the search for similarities in the amino acid sequence of two proteins

TL;DR: A computer adaptable method for finding similarities in the amino acid sequences of two proteins has been developed and it is possible to determine whether significant homology exists between the proteins to trace their possible evolutionary development.
Journal ArticleDOI

Pfam: the protein families database.

TL;DR: Pfam as discussed by the authors is a widely used database of protein families, containing 14 831 manually curated entries in the current version, version 27.0, and has been updated several times since 2012.
Journal ArticleDOI

Species assemblages and indicator species:the need for a flexible asymmetrical approach

TL;DR: A new and simple method to find indicator species and species assemblages characterizing groups of sites, and a new way to present species-site tables, accounting for the hierarchical relationships among species, is proposed.
Related Papers (5)