MAFFT: a novel method for rapid multiple sequence alignment based on fast Fourier transform
Reads0
Chats0
TLDR
A simplified scoring system is proposed that performs well for reducing CPU time and increasing the accuracy of alignments even for sequences having large insertions or extensions as well as distantly related sequences of similar length.Abstract:
A multiple sequence alignment program, MAFFT, has been developed. The CPU time is drastically reduced as compared with existing methods. MAFFT includes two novel techniques. (i) Homologous regions are rapidly identified by the fast Fourier transform (FFT), in which an amino acid sequence is converted to a sequence composed of volume and polarity values of each amino acid residue. (ii) We propose a simplified scoring system that performs well for reducing CPU time and increasing the accuracy of alignments even for sequences having large insertions or extensions as well as distantly related sequences of similar length. Two different heuristics, the progressive method (FFT-NS-2) and the iterative refinement method (FFT-NS-i), are implemented in MAFFT. The performances of FFT-NS-2 and FFT-NS-i were compared with other methods by computer simulations and benchmark tests; the CPU time of FFT-NS-2 is drastically reduced as compared with CLUSTALW with comparable accuracy. FFT-NS-i is over 100 times faster than T-COFFEE, when the number of input sequences exceeds 60, without sacrificing the accuracy.read more
Citations
More filters
Journal ArticleDOI
Genomic monitoring of SARS-CoV-2 uncovers an Nsp1 deletion variant that modulates type I interferon response.
Jing-wen Lin,Chao Tang,Han-cheng Wei,Baowen Du,Chuan. Chen,Minjin Wang,Yongzhao Zhou,Ming-xia Yu,Lu Yang Cheng,Lu Yang Cheng,Suvi Kuivanen,Natacha S. Ogando,Lev Levanov,Yuancun Zhao,Chang-ling Li,Ran Zhou,Zhidan Li,Yiming Zhang,Ke Sun,Chengdi Wang,Li Chen,Xia Xiao,Xiuran Zheng,Sha-sha Chen,Zhen Zhen Zhou,Ruirui Yang,Dan Ning Zhang,Mengying Xu,Junwei Song,Danrui Wang,Yupeng Li,ShiKun Lei,Wanqin Zeng,Qingxin Yang,Ping He,Yaoyao Zhang,Lifang Zhou,Ling Ya Cao,Feng Luo,Huayi Liu,Huayi Liu,Liping Wang,Liping Wang,Fei Ye,Ming Zhang,Mengjiao Li,Wei Fan,Xinqiong Li,Kaiju Li,Bowen Ke,Jiannan Xu,Huiping Yang,Shusen He,Ming Pan,Yichen Yan,Yi Zha,Lingyu Jiang,Changxiu Yu,Yingfen Liu,Zhiyong Xu,Qingfeng Li,Yongmei Jiang,Jiufeng Sun,Wei Hong,Hongping Wei,Guangwen Lu,Olli Vapalahti,Yunzi Luo,Yunzi Luo,Yuquan Wei,Thomas R. Connor,Wenjie Tan,Eric J. Snijder,Teemu Smura,Weimin Li,Jia Geng,Binwu Ying,Lu Chen +77 more
TL;DR: In this paper, the SARS-CoV-2 virus, the causative agent of COVID-19, is undergoing constant mutation and the authors utilized an integrative approach combining epidemiology, virus genome sequencing, clinical phenotyping, and experimental validation to locate mutations of clinical importance.
Journal ArticleDOI
A structural model for microtubule minus-end recognition and protection by CAMSAP proteins
Joseph Atherton,Kai Jiang,Marcel M Stangier,Yanzhang Luo,Shasha Hua,Klaartje Houben,Jolien J. E. van Hooff,Jolien J. E. van Hooff,Agnel Praveen Joseph,Guido Scarabelli,Barry J. Grant,Anthony J. Roberts,Maya Topf,Michel O. Steinmetz,Michel O. Steinmetz,Marc Baldus,Carolyn A. Moores,Anna Akhmanova +17 more
TL;DR: This work finds that the CAMSAP C-terminal CKK domain is widely present among eukaryotes and autonomously recognizes microtubule minus ends and proposes that minus- end-specific features of the interprotofilament interface at this site serve as the basis for CKK's minus-end preference.
Journal ArticleDOI
MUSiCC: a marker genes based framework for metagenomic normalization and accurate profiling of gene abundances in the microbiome
TL;DR: This work introduces an alternative normalization paradigm, MUSiCC, which combines universal single-copy genes with machine learning methods to correct biases and to obtain an accurate and biologically meaningful measure of gene abundances.
Journal ArticleDOI
HmtDB, a Human Mitochondrial Genomic Resource Based on Variability Studies Supporting Population Genetics and Biomedical Research
Marcella Attimonelli,Matteo Accetturo,Monica Santamaria,Daniela Lascaro,Gaetano Scioscia,Graziano Pappadà,Luigi Russo,Luigi Zanchetta,Mila Tommaseo-Ponzetta +8 more
TL;DR: The HmtDB project will contribute towards completing and/or refining haplogroup classification and revealing the real pathogenic potential of mitochondrial mutations, on the basis of variability estimation.
Posted ContentDOI
Patterns of within-host genetic diversity in SARS-CoV-2
Gerry Tonkin-Hill,Inigo Martincorena,Roberto Amato,Andrew R. J. Lawson,Moritz Gerstung,Ian Johnston,David K. Jackson,Naomi R Park,Stefanie V Lensing,Michael A. Quail,Sónia Gonçalves,Cristina V. Ariani,Michael Spencer Chapman,William L Hamilton,Luke W. Meredith,Grant Hall,Aminu S Jahun,Yasmin Chaudhry,Myra Hosmillo,Malte L Pinckert,Iliana Georgana,Anna Yakovleva,Laura G Caller,Sarah L Caddy,Theresa Feltwell,Fahad A Khokhar,Charlotte J. Houldcroft,Martin D. Curran,Surendra Parmar,Alex Alderton,Rachel Nelson,Ewan Harrison,John Sillitoe,Stephen D. Bentley,Jeffrey C. Barrett,M. Estée Török,Ian Goodfellow,Cordelia Langford,Dominic P. Kwiatkowski,Dominic P. Kwiatkowski +39 more
TL;DR: In this paper, the authors describe the patterns of within-host diversity in 1,181 SARS-CoV-2 samples sequenced to high depth in duplicate and identify multiple putative examples of co-infection.
References
More filters
Journal ArticleDOI
Gapped BLAST and PSI-BLAST: a new generation of protein database search programs.
Stephen F. Altschul,Thomas L. Madden,Alejandro A. Schäffer,Jinghui Zhang,Zheng Zhang,Webb Miller,David J. Lipman +6 more
TL;DR: A new criterion for triggering the extension of word hits, combined with a new heuristic for generating gapped alignments, yields a gapped BLAST program that runs at approximately three times the speed of the original.
Journal ArticleDOI
Clustal w: improving the sensitivity of progressive multiple sequence alignment through sequence weighting, position-specific gap penalties and weight matrix choice
TL;DR: The sensitivity of the commonly used progressive multiple sequence alignment method has been greatly improved and modifications are incorporated into a new program, CLUSTAL W, which is freely available.
Journal ArticleDOI
A simple method for estimating evolutionary rates of base substitutions through comparative studies of nucleotide sequences.
TL;DR: Some examples were worked out using reported globin sequences to show that synonymous substitutions occur at much higher rates than amino acid-altering substitutions in evolution.
Book
Numerical Recipes in C: The Art of Scientific Computing
TL;DR: Numerical Recipes: The Art of Scientific Computing as discussed by the authors is a complete text and reference book on scientific computing with over 100 new routines (now well over 300 in all), plus upgraded versions of many of the original routines, with many new topics presented at the same accessible level.
Journal ArticleDOI
Improved tools for biological sequence comparison.
TL;DR: Three computer programs for comparisons of protein and DNA sequences can be used to search sequence data bases, evaluate similarity scores, and identify periodic structures based on local sequence similarity.