antiSMASH: rapid identification, annotation and analysis of secondary metabolite biosynthesis gene clusters in bacterial and fungal genome sequences

doi:10.1093/NAR/GKR466

Open Access, Journal Article

Marnix H. Medema, +9 more

- 01 Jul 2011 -

Nucleic Acids Research

- Vol. 39, Iss: suppl_2 (Web Server issue), pp W339-W346

TLDR

This work presents the first comprehensive pipeline capable of identifying biosynthetic loci covering the whole range of known secondary metabolite compound classes, and integrates or cross-links all previously available secondary-metabolite specific gene analysis methods in one interactive view.

Abstract:

Bacterial and fungal secondary metabolism is a rich source of novel bioactive compounds with potential pharmaceutical applications as antibiotics, anti-tumor drugs or cholesterol-lowering drugs. To find new drug candidates, microbiologists are increasingly relying on sequencing genomes of a wide variety of microbes. However, rapidly and reliably pinpointing all the potential gene clusters for secondary metabolites in dozens of newly sequenced genomes has been extremely challenging, due to their biochemical heterogeneity, the presence of unknown enzymes and the dispersed nature of the necessary specialized bioinformatics tools and resources. Here, we present antiSMASH (antibiotics & Secondary Metabolite Analysis Shell), the first comprehensive pipeline capable of identifying biosynthetic loci covering the whole range of known secondary metabolite compound classes (polyketides, non-ribosomal peptides, terpenes, aminoglycosides, aminocoumarins, indolocarbazoles, lantibiotics, bacteriocins, nucleosides, beta-lactams, butyrolactones, siderophores, melanins and others). It aligns the identified regions at the gene cluster level to their nearest relatives from a database containing all other known gene clusters, and integrates or cross-links all previously available secondary-metabolite specific gene analysis methods in one interactive view. antiSMASH is available at http://antismash.secondarymetabolites.org.

Citations

antiSMASH 5.0: updates to the secondary metabolite genome mining pipeline

Kai Blin, +8 more

- 02 Jul 2019 -

Nucleic Acids Research

TL;DR: antiSMASH 5 adds detection rules for clusters encoding the biosynthesis of acyl-amino acids, β-lactones, fungal RiPPs, RaS-RiPPs, polybrominated diphenyl ethers, C-nucleosides, PPY-like ketones and lipolanthines, and provides more detailed predictions for type II polyketide synthase-encoding gene clusters.

antiSMASH 3.0—a comprehensive resource for the genome mining of biosynthetic gene clusters

Tilmann Weber, +14 more

- 01 Jul 2015 -

Nucleic Acids Research

TL;DR: antiSMASH is a web server and stand-alone tool for the automatic genomic identification and analysis of biosynthetic gene clusters, available at http://antismash.org.

antiSMASH 4.0-improvements in chemistry prediction and gene cluster boundary identification.

Kai Blin, +17 more

- 03 Jul 2017 -

Nucleic Acids Research

TL;DR: The thoroughly updated antiSMASH version 4 is presented, which adds several novel features, including prediction of gene cluster boundaries using the ClusterFinder method or the newly integrated CASSIS algorithm, improved substrate specificity prediction for non-ribosomal peptide synthetase adenylation domains based on the new SANDPUMA algorithm, and several updated and improved usability features.

antiSMASH 6.0: improving cluster detection and comparison capabilities.

Kai Blin, +7 more

- 05 Dec 2021 -

Nucleic Acids Research

TL;DR: antiSMASH is the most widely used tool for detecting and characterising biosynthetic gene clusters (BGCs) in bacteria and fungi; this paper presents the updated version 6.

Best practices for analysing microbiomes.

Rob Knight, +25 more

- 01 Jul 2018 -

Nature Reviews Microbiology

TL;DR: This Review focuses on recent findings that suggest that operational taxonomic unit-based analyses should be replaced with new methods that are based on exact sequence variants, methods for integrating metagenomic and metabolomic data, and issues surrounding compositional data analysis.

References

MUSCLE: multiple sequence alignment with high accuracy and high throughput

Robert C. Edgar

- 01 Mar 2004 -

Nucleic Acids Research

TL;DR: MUSCLE is a new computer program for creating multiple alignments of protein sequences that includes fast distance estimation using kmer counting, progressive alignment using a new profile function the authors call the log-expectation score, and refinement using tree-dependent restricted partitioning.

The Pfam protein families database

Marco Punta, +15 more

- 01 Jan 2012 -

Nucleic Acids Research

TL;DR: The definition and use of family-specific, manually curated gathering thresholds are explained and some of the features of domains of unknown function (also known as DUFs) are discussed, which constitute a rapidly growing class of families within Pfam.

BLAST+: architecture and applications.

Christiam Camacho, +6 more

- 15 Dec 2009 -

BMC Bioinformatics

TL;DR: The new BLAST command-line applications, compared to the current BLAST tools, demonstrate substantial speed improvements for long queries as well as chromosome length database sequences.

FastTree 2--approximately maximum-likelihood trees for large alignments.

Morgan N. Price, +3 more

- 10 Mar 2010 -

PLOS ONE

TL;DR: Improvements to FastTree are described that improve its accuracy without sacrificing scalability, and FastTree 2 allows the inference of maximum-likelihood phylogenies for huge alignments.

Pfam: the protein families database.

Robert D. Finn, +12 more

- 01 Jan 2014 -

Nucleic Acids Research

TL;DR: Pfam is a widely used database of protein families, containing 14 831 manually curated entries in the current version, 27.0, and has been updated several times since 2012.

Related Papers (5)

Basic Local Alignment Search Tool

Stephen F. Altschul, +4 more

- 01 Oct 1990 -

Journal of Molecular Biology

Complete genome sequence of the model actinomycete Streptomyces coelicolor A3(2)

Stephen D. Bentley, +43 more

- 09 May 2002 -

Nature

antiSMASH 3.0—a comprehensive resource for the genome mining of biosynthetic gene clusters

The RAST Server: Rapid Annotations using Subsystems Technology

SPAdes: A New Genome Assembly Algorithm and Its Applications to Single-Cell Sequencing