scispace - formally typeset
Open AccessJournal ArticleDOI

Biosynthetic potential of the global ocean microbiome

TLDR
This paper investigated the diversity and novelty of biosynthetic gene clusters in the ocean by integrating around 10,000 microbial genomes from cultivated and single cells with more than 25,000 newly reconstructed draft genomes from more than 1,000 seawater samples.
Abstract
Abstract Natural microbial communities are phylogenetically and metabolically diverse. In addition to underexplored organismal groups 1 , this diversity encompasses a rich discovery potential for ecologically and biotechnologically relevant enzymes and biochemical compounds 2,3 . However, studying this diversity to identify genomic pathways for the synthesis of such compounds 4 and assigning them to their respective hosts remains challenging. The biosynthetic potential of microorganisms in the open ocean remains largely uncharted owing to limitations in the analysis of genome-resolved data at the global scale. Here we investigated the diversity and novelty of biosynthetic gene clusters in the ocean by integrating around 10,000 microbial genomes from cultivated and single cells with more than 25,000 newly reconstructed draft genomes from more than 1,000 seawater samples. These efforts revealed approximately 40,000 putative mostly new biosynthetic gene clusters, several of which were found in previously unsuspected phylogenetic groups. Among these groups, we identified a lineage rich in biosynthetic gene clusters (‘ Candidatus Eudoremicrobiaceae’) that belongs to an uncultivated bacterial phylum and includes some of the most biosynthetically diverse microorganisms in this environment. From these, we characterized the phospeptin and pythonamide pathways, revealing cases of unusual bioactive compound structure and enzymology, respectively. Together, this research demonstrates how microbiomics-driven strategies can enable the investigation of previously undescribed enzymes and natural products in underexplored microbial groups and environments.

read more

Content maybe subject to copyright    Report

Citations
More filters
Journal ArticleDOI

MIBiG 3.0: a community-driven effort to annotate experimentally validated biosynthetic gene clusters

TL;DR: The Minimum Information about a Biosynthetic Gene Clustering (MIBiG) as discussed by the authors is a standardised data format that describes the minimally required information to uniquely characterise a BGC.
Journal ArticleDOI

Mirusviruses link herpesviruses to giant viruses

TL;DR: This paper carried out a phylogeny-guided genome-resolved metagenomic survey of the sunlit oceans and discovered plankton-infecting relatives of herpesviruses that form a putative new phylum dubbed Mirusviricota.
Peer ReviewDOI

Plankton-infecting relatives of herpesviruses clarify the evolutionary trajectory of giant viruses

TL;DR: A phylogeny-guided genome-resolved metagenomic survey of the sunlit oceans exposed a major, previously undescribed clade of large eukaryotic DNA viruses which exposed a putative new phylum dubbed ‘Mirusviricota’, which provides missing links in the evolution of both animal herpesviruses from tailed prokaryotic viruses and giant varidnaviruses from smaller relatives.
Journal ArticleDOI

MAGNETO: An Automated Workflow for Genome-Resolved Metagenomics

TL;DR: MAGNETO is presented, an automated workflow dedicated to MAG reconstruction, which includes a fully-automated coassembly step informed by optimal clustering of metagenomic distances, and implements complementary genome binning strategies, for improving MAG recovery.
Journal ArticleDOI

proGenomes3: approaching one million accurately and consistently annotated high-quality prokaryotic genomes

TL;DR: Progenomes3 as mentioned in this paper is a database of 907 388 high-quality genomes containing 4 billion genes that passed stringent criteria and have been consistently annotated using multiple functional and taxonomic databases including mobile genetic elements and biosynthetic gene clusters.
References
More filters
Journal ArticleDOI

Fast and accurate short read alignment with Burrows–Wheeler transform

TL;DR: Burrows-Wheeler Alignment tool (BWA) is implemented, a new read alignment package that is based on backward search with Burrows–Wheeler Transform (BWT), to efficiently align short sequencing reads against a large reference sequence such as the human genome, allowing mismatches and gaps.
Journal ArticleDOI

MUSCLE: multiple sequence alignment with high accuracy and high throughput

TL;DR: MUSCLE is a new computer program for creating multiple alignments of protein sequences that includes fast distance estimation using kmer counting, progressive alignment using a new profile function the authors call the log-expectation score, and refinement using tree-dependent restricted partitioning.
Journal ArticleDOI

MAFFT Multiple Sequence Alignment Software Version 7: Improvements in Performance and Usability

TL;DR: This version of MAFFT has several new features, including options for adding unaligned sequences into an existing alignment, adjustment of direction in nucleotide alignment, constrained alignment and parallel processing, which were implemented after the previous major update.
Journal ArticleDOI

KEGG: Kyoto Encyclopedia of Genes and Genomes

TL;DR: The Kyoto Encyclopedia of Genes and Genomes (KEGG) as discussed by the authors is a knowledge base for systematic analysis of gene functions in terms of the networks of genes and molecules.
Journal ArticleDOI

featureCounts: an efficient general-purpose program for assigning sequence reads to genomic features

TL;DR: FeatureCounts as discussed by the authors is a read summarization program suitable for counting reads generated from either RNA or genomic DNA sequencing experiments, which implements highly efficient chromosome hashing and feature blocking techniques.
Related Papers (5)