antiSMASH: rapid identification, annotation and analysis of secondary metabolite biosynthesis gene clusters in bacterial and fungal genome sequences
Marnix H. Medema, Kai Blin, Peter Cimermancic, Victor de Jager, Piotr Zakrzewski, Michael A. Fischbach, Tilmann Weber, Eriko Takano, Rainer Breitling
TL;DR: This work presents the first comprehensive pipeline capable of identifying biosynthetic loci covering the whole range of known secondary metabolite compound classes, and integrates or cross-links all previously available secondary-metabolite specific gene analysis methods in one interactive view.
Abstract:
Bacterial and fungal secondary metabolism is a rich source of novel bioactive compounds with potential pharmaceutical applications as antibiotics, anti-tumor drugs or cholesterol-lowering drugs. To find new drug candidates, microbiologists are increasingly relying on sequencing genomes of a wide variety of microbes. However, rapidly and reliably pinpointing all the potential gene clusters for secondary metabolites in dozens of newly sequenced genomes has been extremely challenging, due to their biochemical heterogeneity, the presence of unknown enzymes and the dispersed nature of the necessary specialized bioinformatics tools and resources. Here, we present antiSMASH (antibiotics & Secondary Metabolite Analysis Shell), the first comprehensive pipeline capable of identifying biosynthetic loci covering the whole range of known secondary metabolite compound classes (polyketides, non-ribosomal peptides, terpenes, aminoglycosides, aminocoumarins, indolocarbazoles, lantibiotics, bacteriocins, nucleosides, beta-lactams, butyrolactones, siderophores, melanins and others). It aligns the identified regions at the gene cluster level to their nearest relatives from a database containing all other known gene clusters, and integrates or cross-links all previously available secondary-metabolite specific gene analysis methods in one interactive view. antiSMASH is available at http://antismash.secondarymetabolites.org.
Citations
Journal Article
antiSMASH 5.0: updates to the secondary metabolite genome mining pipeline
Kai Blin, Simon Shaw, Katharina Steinke, Rasmus Villebro, Nadine Ziemert, Sang Yup Lee, Marnix H. Medema, Tilmann Weber
TL;DR: antiSMASH 5 adds detection rules for clusters encoding the biosynthesis of acyl-amino acids, β-lactones, fungal RiPPs, RaS-RiPPs, polybrominated diphenyl ethers, C-nucleosides, PPY-like ketones and lipolanthines, and provides more detailed predictions for type II polyketide synthase-encoding gene clusters.
Journal Article
antiSMASH 3.0—a comprehensive resource for the genome mining of biosynthetic gene clusters
Tilmann Weber, Kai Blin, Srikanth Duddela, Daniel Krug, Hyun Uk Kim, Robert E. Bruccoleri, Sang Yup Lee, Michael A. Fischbach, Rolf Müller, Wolfgang Wohlleben, Rainer Breitling, Eriko Takano, Marnix H. Medema
TL;DR: antiSMASH is a web server and stand-alone tool for the automatic genomic identification and analysis of biosynthetic gene clusters, available at http://antismash.org.
Journal Article
antiSMASH 4.0-improvements in chemistry prediction and gene cluster boundary identification.
Kai Blin, Thomas Wolf, Marc G. Chevrette, Xiaowen Lu, Christopher J. Schwalen, Satria A. Kautsar, Hernando G. Suarez Duran, Emmanuel L. C. de los Santos, Hyun Uk Kim, Mariana Nave, Jeroen S. Dickschat, Douglas A. Mitchell, Ekaterina Shelest, Rainer Breitling, Eriko Takano, Sang Yup Lee, Tilmann Weber, Marnix H. Medema
TL;DR: The thoroughly updated antiSMASH version 4 adds several novel features, including prediction of gene cluster boundaries using the ClusterFinder method or the newly integrated CASSIS algorithm, and improved substrate specificity prediction for non-ribosomal peptide synthetase adenylation domains based on the new SANDPUMA algorithm. Several usability features have also been updated and improved.
Journal Article
antiSMASH 6.0: improving cluster detection and comparison capabilities.
Kai Blin, Simon Shaw, Alexander M. Kloosterman, Zach Charlop-Powers, Gilles P. van Wezel, Marnix H. Medema, Tilmann Weber
TL;DR: antiSMASH is the most widely used tool for detecting and characterising biosynthetic gene clusters (BGCs) in bacteria and fungi; this paper presents the updated version 6.
Journal Article
Best practices for analysing microbiomes.
Rob Knight, Alison Vrbanac, Bryn C. Taylor, Alexander A. Aksenov, Chris Callewaert, Justine W. Debelius, Antonio Gonzalez, Tomasz Kosciolek, Laura-Isobel McCall, Daniel McDonald, Alexey V. Melnik, James T. Morton, Jose Navas, Robert A. Quinn, Jon G. Sanders, Austin D. Swafford, Luke R. Thompson, Anupriya Tripathi, Zhenjiang Zech Xu, Jesse R. Zaneveld, Qiyun Zhu, J. Gregory Caporaso, Pieter C. Dorrestein
TL;DR: This Review focuses on recent findings suggesting that operational taxonomic unit-based analyses should be replaced with new methods based on exact sequence variants, on methods for integrating metagenomic and metabolomic data, and on issues surrounding compositional data analysis.
References
Journal Article
MUSCLE: multiple sequence alignment with high accuracy and high throughput
TL;DR: MUSCLE is a new computer program for creating multiple alignments of protein sequences that includes fast distance estimation using k-mer counting, progressive alignment using a new profile function the authors call the log-expectation score, and refinement using tree-dependent restricted partitioning.
Journal Article
The Pfam protein families database
Marco Punta, Penny Coggill, Ruth Y. Eberhardt, Jaina Mistry, John Tate, Chris Boursnell, Ningze Pang, Kristoffer Forslund, Goran Ceric, Jody Clements, Andreas Heger, Liisa Holm, Erik L. L. Sonnhammer, Sean R. Eddy, Alex Bateman, Robert D. Finn
TL;DR: The definition and use of family-specific, manually curated gathering thresholds are explained and some of the features of domains of unknown function (also known as DUFs) are discussed, which constitute a rapidly growing class of families within Pfam.
Journal Article
BLAST+: architecture and applications.
Christiam Camacho, George Coulouris, Vahram Avagyan, Ning Ma, Jason S. Papadopoulos, Kevin Bealer, Thomas L. Madden
TL;DR: Compared with the current BLAST tools, the new BLAST command-line applications demonstrate substantial speed improvements for long queries as well as for chromosome-length database sequences.
Journal Article
FastTree 2--approximately maximum-likelihood trees for large alignments.
TL;DR: Changes to FastTree are described that improve its accuracy without sacrificing scalability; FastTree 2 allows the inference of maximum-likelihood phylogenies for huge alignments.
Journal Article
Pfam: the protein families database.
Robert D. Finn, Alex Bateman, Jody Clements, Penelope Coggill, Ruth Y. Eberhardt, Sean R. Eddy, Andreas Heger, Kirstie Hetherington, Liisa Holm, Jaina Mistry, Erik L. L. Sonnhammer, John Tate, Marco Punta
TL;DR: Pfam is a widely used database of protein families. The current version, 27.0, contains 14 831 manually curated entries, and the database has been updated several times since 2012.
Related Papers (5)
Complete genome sequence of the model actinomycete Streptomyces coelicolor A3(2)
Stephen D. Bentley, Keith F. Chater, Ana Cerdeño-Tárraga, Gregory L. Challis, Nicholas R. Thomson, Keith D. James, David Harris, Michael A. Quail, H. M. Kieser, D. Harper, Alex Bateman, Steve D.M. Brown, Govind Chandra, Carton W. Chen, Mark O. Collins, Ann Cronin, Andrew G. Fraser, Arlette Goble, J. Hidalgo, T. Hornsby, S. Howarth, Chih-Hung Huang, Tobias Kieser, L. Larke, Lee Murphy, Karen Oliver, Susan O'Neil, Ester Rabbinowitsch, Marie-Adèle Rajandream, Kim Rutherford, Simon Rutter, Kathy Seeger, David L. Saunders, Sarah Sharp, R. Squares, S. Squares, K. Taylor, T. Warren, Andreas Wietzorrek, John Woodward, Bart Barrell, Julian Parkhill, David A. Hopwood
The RAST Server: Rapid Annotations using Subsystems Technology
Ramy K. Aziz, Daniela Bartels, Aaron A. Best, Matthew DeJongh, Terrence Disz, Robert Edwards, Kevin Formsma, Svetlana Gerdes, Elizabeth M. Glass, Michael Kubal, Folker Meyer, Gary J. Olsen, Robert Olson, Andrei L. Osterman, Ross Overbeek, Leslie Klis McNeil, Daniel Paarmann, Tobias Paczian, Bruce Parrello, Gordon D. Pusch, Claudia I. Reich, Rick Stevens, Olga Vassieva, Veronika Vonstein, Andreas Wilke, Olga Zagnitko