MAFFT: a novel method for rapid multiple sequence alignment based on fast Fourier transform
Reads0
Chats0
TLDR
A simplified scoring system is proposed that performs well for reducing CPU time and increasing the accuracy of alignments even for sequences having large insertions or extensions as well as distantly related sequences of similar length.Abstract:
A multiple sequence alignment program, MAFFT, has been developed. The CPU time is drastically reduced as compared with existing methods. MAFFT includes two novel techniques. (i) Homologous regions are rapidly identified by the fast Fourier transform (FFT), in which an amino acid sequence is converted to a sequence composed of volume and polarity values of each amino acid residue. (ii) We propose a simplified scoring system that performs well for reducing CPU time and increasing the accuracy of alignments even for sequences having large insertions or extensions as well as distantly related sequences of similar length. Two different heuristics, the progressive method (FFT-NS-2) and the iterative refinement method (FFT-NS-i), are implemented in MAFFT. The performances of FFT-NS-2 and FFT-NS-i were compared with other methods by computer simulations and benchmark tests; the CPU time of FFT-NS-2 is drastically reduced as compared with CLUSTALW with comparable accuracy. FFT-NS-i is over 100 times faster than T-COFFEE, when the number of input sequences exceeds 60, without sacrificing the accuracy.read more
Citations
More filters
Journal ArticleDOI
Craniofacial divergence and ongoing adaptation via the hedgehog pathway.
TL;DR: Alleles of the hedgehog pathway receptor Patched1 (Ptch1) gene are responsible for adaptive variation in the shape of the lower jaw both within and among genera of Lake Malawi cichlid fish.
Journal ArticleDOI
Winding up the molecular clock in the genus carabus (coleoptera: Carabidae): assessment of methodological decisions on rate and node age estimation
TL;DR: The combination of several genes is proposed as the best strategy to minimise both the idiosyncratic behaviors of individual markers and the effect of analytical aspects in rate and age estimations as well as other methodological issues potentially affecting rate estimation.
Journal ArticleDOI
Botryosphaeriaceae occurring on native Syzygium cordatum in South Africa and their potential threat to Eucalyptus
TL;DR: Results of this study illustrate that species of the Botryosphaeriaceae, including N. mangiferae, were more pathogenic on the Eucalyptus clone than on S. cordatum, while B. dothidea and L. gonubiensis were the least pathogenic.
Journal ArticleDOI
101 Dothideomycetes genomes: A test case for predicting lifestyles and emergence of pathogens.
Sajeet Haridas,R. Albert,R. Albert,M. Binder,J. Bloem,Kurt LaButti,Asaf Salamov,Bill Andreopoulos,Scott E. Baker,Kerrie Barry,Gerald F. Bills,B. H. Bluhm,Charles H. Cannon,Raúl Castanera,Raúl Castanera,David E. Culley,Christopher Daum,David Ezra,J.B. González,Bernard Henrissat,Bernard Henrissat,Bernard Henrissat,Alan Kuo,C. Liang,Anna Lipzen,François Lutzoni,Jon K. Magnuson,Stephen J. Mondo,Stephen J. Mondo,Matt Nolan,Robin A. Ohm,Robin A. Ohm,Jasmyn Pangilinan,Hee-Jin Park,Lucía Ramírez,Manuel Alfaro,Hui Sun,Andrew Tritt,Yuko Yoshinaga,L.-H. Zwiers,B.G. Turgeon,Stephen B. Goodwin,Joseph W. Spatafora,Pedro W. Crous,Igor V. Grigoriev,Igor V. Grigoriev +45 more
TL;DR: This study presents the first large-scale, whole-genome comparison of 101 Dothideomycetes introducing 55 newly sequenced species and classified fungi into lifestyle classes with >95 % accuracy and identified a small number of gene families that positively correlated with these distinctions.
Journal ArticleDOI
Developing an in silico minimum inhibitory concentration panel test for Klebsiella pneumoniae.
Marcus Nguyen,Marcus Nguyen,Marcus Nguyen,Thomas Brettin,Thomas Brettin,S. Wesley Long,S. Wesley Long,James M. Musser,James M. Musser,Randall J. Olsen,Randall J. Olsen,Robert Olson,Robert Olson,Maulik Shukla,Maulik Shukla,Rick Stevens,Rick Stevens,Fangfang Xia,Fangfang Xia,Hyunseung Yoo,Hyunseung Yoo,James J. Davis,James J. Davis +22 more
TL;DR: This study shows that machine learning can be used to build a complete in silico MIC prediction panel for K. pneumoniae and provides a framework for building MIC prediction models for other pathogenic bacteria.
References
More filters
Journal ArticleDOI
Gapped BLAST and PSI-BLAST: a new generation of protein database search programs.
Stephen F. Altschul,Thomas L. Madden,Alejandro A. Schäffer,Jinghui Zhang,Zheng Zhang,Webb Miller,David J. Lipman +6 more
TL;DR: A new criterion for triggering the extension of word hits, combined with a new heuristic for generating gapped alignments, yields a gapped BLAST program that runs at approximately three times the speed of the original.
Journal ArticleDOI
Clustal w: improving the sensitivity of progressive multiple sequence alignment through sequence weighting, position-specific gap penalties and weight matrix choice
TL;DR: The sensitivity of the commonly used progressive multiple sequence alignment method has been greatly improved and modifications are incorporated into a new program, CLUSTAL W, which is freely available.
Journal ArticleDOI
A simple method for estimating evolutionary rates of base substitutions through comparative studies of nucleotide sequences.
TL;DR: Some examples were worked out using reported globin sequences to show that synonymous substitutions occur at much higher rates than amino acid-altering substitutions in evolution.
Book
Numerical Recipes in C: The Art of Scientific Computing
TL;DR: Numerical Recipes: The Art of Scientific Computing as discussed by the authors is a complete text and reference book on scientific computing with over 100 new routines (now well over 300 in all), plus upgraded versions of many of the original routines, with many new topics presented at the same accessible level.
Journal ArticleDOI
Improved tools for biological sequence comparison.
TL;DR: Three computer programs for comparisons of protein and DNA sequences can be used to search sequence data bases, evaluate similarity scores, and identify periodic structures based on local sequence similarity.