MAFFT: a novel method for rapid multiple sequence alignment based on fast Fourier transform
Reads0
Chats0
TLDR
A simplified scoring system is proposed that performs well for reducing CPU time and increasing the accuracy of alignments even for sequences having large insertions or extensions as well as distantly related sequences of similar length.Abstract:
A multiple sequence alignment program, MAFFT, has been developed. The CPU time is drastically reduced as compared with existing methods. MAFFT includes two novel techniques. (i) Homologous regions are rapidly identified by the fast Fourier transform (FFT), in which an amino acid sequence is converted to a sequence composed of volume and polarity values of each amino acid residue. (ii) We propose a simplified scoring system that performs well for reducing CPU time and increasing the accuracy of alignments even for sequences having large insertions or extensions as well as distantly related sequences of similar length. Two different heuristics, the progressive method (FFT-NS-2) and the iterative refinement method (FFT-NS-i), are implemented in MAFFT. The performances of FFT-NS-2 and FFT-NS-i were compared with other methods by computer simulations and benchmark tests; the CPU time of FFT-NS-2 is drastically reduced as compared with CLUSTALW with comparable accuracy. FFT-NS-i is over 100 times faster than T-COFFEE, when the number of input sequences exceeds 60, without sacrificing the accuracy.read more
Citations
More filters
Journal ArticleDOI
Development of an Inactivated Vaccine Candidate, BBIBP-CorV, with Potent Protection against SARS-CoV-2.
Hui Wang,Yuntao Zhang,Baoying Huang,Wei Deng,Yaru Quan,Wenling Wang,Wenbo Xu,Yuxiu Zhao,Na Li,Jin Zhang,Hongyang Liang,Linlin Bao,Yanfeng Xu,Ling Ding,Weimin Zhou,Hong Gao,Jiangning Liu,Peihua Niu,Li Zhao,Wei Zhen,Hui Fu,Yu Shouzhi,Zhang Zhengli,Guangxue Xu,Changgui Li,Zhiyong Lou,Miao Xu,Chuan Qin,Guizhen Wu,George F. Gao,Wenjie Tan,Xiaoming Yang +31 more
TL;DR: Two-dose immunizations using 2 μg/dose of BBIBP-CorV provided highly efficient protection against SARS-CoV-2 intratracheal challenge in rhesus macaques, without detectable antibody-dependent enhancement of infection.
Journal ArticleDOI
Extensive sampling of basidiomycete genomes demonstrates inadequacy of the white-rot/brown-rot paradigm for wood decay fungi
Robert Riley,Asaf Salamov,Daren W. Brown,László Nagy,Dimitrios Floudas,Benjamin W. Held,Anthony Levasseur,Vincent Lombard,Emmanuelle Morin,Robert Otillar,Erika Lindquist,Hui Sun,Kurt LaButti,Jeremy Schmutz,Dina Jabbour,Hong Luo,Scott E. Baker,Antonio G. Pisabarro,Jonathan D. Walton,Robert A. Blanchette,Bernard Henrissat,Francis Martin,Daniel Cullen,David S. Hibbett,Igor V. Grigoriev +24 more
TL;DR: The results indicate that the prevailing paradigm of white rot vs. brown rot does not capture the diversity of fungal wood decay mechanisms, and suggest a continuum rather than a dichotomy between the white-rot and brown-rot modes of wood decay.
Journal ArticleDOI
The ASTRAL Compendium in 2004.
John-Marc Chandonia,Gary C. Hon,Nigel S. Walker,Loredana Lo Conte,Patrice Koehl,Michael Levitt,Steven E. Brenner,Steven E. Brenner +7 more
TL;DR: The ASTRAL Compendium provides several databases and tools to aid in the analysis of protein structures, particularly through the use of their sequences, and all SCOP domains are now made available as PDB-style coordinate files as well as sequences.
Journal ArticleDOI
Case Study: Prolonged Infectious SARS-CoV-2 Shedding from an Asymptomatic Immunocompromised Individual with Cancer.
Victoria A. Avanzato,Victoria A. Avanzato,M. Jeremiah Matson,M. Jeremiah Matson,Stephanie N. Seifert,Rhys Pryce,Brandi N. Williamson,Sarah L. Anzick,Kent D. Barbian,Seth D. Judson,Elizabeth R. Fischer,Craig Martens,Thomas A. Bowden,Emmie de Wit,Francis X. Riedo,Vincent J. Munster +15 more
TL;DR: The data indicate that certain immunocompromised patients may shed infectious virus for longer durations than previously recognized, and detection of subgenomic RNA is recommended in persistently SARS-CoV-2 positive individuals as a proxy for shedding of infectious virus.
Journal ArticleDOI
M-Coffee : Combining multiple sequence alignment methods with T-Coffee
TL;DR: M-Coffee is a meta-method for assembling multiple sequence alignments (MSA) by combining the output of several individual methods into one single MSA that is robust to variations in the choice of constituent methods and reasonably tolerant to duplicate MSAs.
References
More filters
Journal ArticleDOI
Gapped BLAST and PSI-BLAST: a new generation of protein database search programs.
Stephen F. Altschul,Thomas L. Madden,Alejandro A. Schäffer,Jinghui Zhang,Zheng Zhang,Webb Miller,David J. Lipman +6 more
TL;DR: A new criterion for triggering the extension of word hits, combined with a new heuristic for generating gapped alignments, yields a gapped BLAST program that runs at approximately three times the speed of the original.
Journal ArticleDOI
Clustal w: improving the sensitivity of progressive multiple sequence alignment through sequence weighting, position-specific gap penalties and weight matrix choice
TL;DR: The sensitivity of the commonly used progressive multiple sequence alignment method has been greatly improved and modifications are incorporated into a new program, CLUSTAL W, which is freely available.
Journal ArticleDOI
A simple method for estimating evolutionary rates of base substitutions through comparative studies of nucleotide sequences.
TL;DR: Some examples were worked out using reported globin sequences to show that synonymous substitutions occur at much higher rates than amino acid-altering substitutions in evolution.
Book
Numerical Recipes in C: The Art of Scientific Computing
TL;DR: Numerical Recipes: The Art of Scientific Computing as discussed by the authors is a complete text and reference book on scientific computing with over 100 new routines (now well over 300 in all), plus upgraded versions of many of the original routines, with many new topics presented at the same accessible level.
Journal ArticleDOI
Improved tools for biological sequence comparison.
TL;DR: Three computer programs for comparisons of protein and DNA sequences can be used to search sequence data bases, evaluate similarity scores, and identify periodic structures based on local sequence similarity.