scispace - formally typeset
Open AccessJournal ArticleDOI

Recovery of nearly 8,000 metagenome-assembled genomes substantially expands the tree of life

TLDR
The recovery of 7,903 bacterial and archaeal metagenome-assembled genomes increases the phylogenetic diversity represented by public genome repositories and provides the first representatives from 20 candidate phyla.
Abstract
Challenges in cultivating microorganisms have limited the phylogenetic diversity of currently available microbial genomes. This is being addressed by advances in sequencing throughput and computational techniques that allow for the cultivation-independent recovery of genomes from metagenomes. Here, we report the reconstruction of 7,903 bacterial and archaeal genomes from >1,500 public metagenomes. All genomes are estimated to be ≥50% complete and nearly half are ≥90% complete with ≤5% contamination. These genomes increase the phylogenetic diversity of bacterial and archaeal genome trees by >30% and provide the first representatives of 17 bacterial and three archaeal candidate phyla. We also recovered 245 genomes from the Patescibacteria superphylum (also known as the Candidate Phyla Radiation) and find that the relative diversity of this group varies substantially with different protein marker sets. The scale and quality of this data set demonstrate that recovering genomes from metagenomes provides an expedient path forward to exploring microbial dark matter.

read more

Citations
More filters
Journal ArticleDOI

Tenuifilum thalassicum gen. nov., sp. nov., a novel moderate thermophilic anaerobic bacterium from a Kunashir Island shallow hot spring representing a new family Tenuifilaceae fam. nov. in the class Bacteroidia.

TL;DR: A novel anaerobic moderately thermophilic bacterium, designated strain 38H-strT, was isolated from a 12m deep hot spring of the Kunashir Island shore and represents the first genus and species Tenuifilum thalassicum gen. nov.
Journal ArticleDOI

Genome-resolved metagenome and metatranscriptome analyses of thermophilic composting reveal key bacterial players and their metabolic interactions.

TL;DR: In this article, 60 mostly high-quality metagenome-assembled genomes (MAGs) recovered from time-series samples of two thermophilic composting cells, of which 47 are potentially new bacterial species; 24 of those did not have any hits in two public MAG datasets at the 95% average nucleotide identity level.
Journal ArticleDOI

Genomic Analysis of the Yet-Uncultured Binatota Reveals Broad Methylotrophic, Alkane-Degradation, and Pigment Production Capacities.

TL;DR: A global analysis of genomes belonging to Binatota (UBP10), a globally distributed, yet-uncharacterized bacterial phylum, highlights the consistent encountering of incomplete biosynthetic pathways in microbial genomes, a phenomenon necessitating careful assessment when assigning putative functions based on a set-threshold of pathway completion.
Journal ArticleDOI

DiTing: A pipeline to infer and compare biogeochemical pathways from metagenomic and metatranscriptomic data

TL;DR: DiTing as discussed by the authors is a tool wrapper to infer and compare biogeochemical pathways among a set of given metagenomic or metatranscriptomic reads in one step, based on the Kyoto Encyclopedia of Genes and Genomes (KEGG) and a manually created DMSP cycling gene database.
Journal ArticleDOI

Evolutionary Ecology of Natural Comammox Nitrospira Populations

TL;DR: The microbial interactions and evolutionary processes affecting comammox Nitrospira, a recently discovered bacterial type capable of performing the whole nitrification process, are identified and it is revealed that evolutionary processes are more affected by habitat type than by species identity.
References
More filters
Journal ArticleDOI

Fast and accurate short read alignment with Burrows–Wheeler transform

TL;DR: Burrows-Wheeler Alignment tool (BWA) is implemented, a new read alignment package that is based on backward search with Burrows–Wheeler Transform (BWT), to efficiently align short sequencing reads against a large reference sequence such as the human genome, allowing mismatches and gaps.
Journal ArticleDOI

BLAST+: architecture and applications.

TL;DR: The new BLAST command-line applications, compared to the current BLAST tools, demonstrate substantial speed improvements for long queries as well as chromosome length database sequences.
Journal ArticleDOI

tRNAscan-SE: a program for improved detection of transfer RNA genes in genomic sequence.

TL;DR: A program is described, tRNAscan-SE, which identifies 99-100% of transfer RNA genes in DNA sequence while giving less than one false positive per 15 gigabases.
Journal ArticleDOI

Database resources of the National Center for Biotechnology Information

TL;DR: In addition to maintaining the GenBank(R) nucleic acid sequence database, the National Center for Biotechnology Information (NCBI) provides data analysis and retrieval resources for the data in GenBank and other biological data made available through NCBI’s website.
Related Papers (5)