MAFFT: a novel method for rapid multiple sequence alignment based on fast Fourier transform
Reads0
Chats0
TLDR
A simplified scoring system is proposed that performs well for reducing CPU time and increasing the accuracy of alignments even for sequences having large insertions or extensions as well as distantly related sequences of similar length.Abstract:
A multiple sequence alignment program, MAFFT, has been developed. The CPU time is drastically reduced as compared with existing methods. MAFFT includes two novel techniques. (i) Homologous regions are rapidly identified by the fast Fourier transform (FFT), in which an amino acid sequence is converted to a sequence composed of volume and polarity values of each amino acid residue. (ii) We propose a simplified scoring system that performs well for reducing CPU time and increasing the accuracy of alignments even for sequences having large insertions or extensions as well as distantly related sequences of similar length. Two different heuristics, the progressive method (FFT-NS-2) and the iterative refinement method (FFT-NS-i), are implemented in MAFFT. The performances of FFT-NS-2 and FFT-NS-i were compared with other methods by computer simulations and benchmark tests; the CPU time of FFT-NS-2 is drastically reduced as compared with CLUSTALW with comparable accuracy. FFT-NS-i is over 100 times faster than T-COFFEE, when the number of input sequences exceeds 60, without sacrificing the accuracy.read more
Citations
More filters
Patent
Antibodies and other molecules that bind b7-h1 and pd-1
TL;DR: In this paper, the use of antibody fragments and their antigen-binding fragments in the diagnosis and treatment of cancer and other diseases is discussed. But the present invention relates to antibodies and their antigathering fragments and to other molecules that are capable of immunospecifically binding to B7-H1 or PD-1.
Journal ArticleDOI
Fern genomes elucidate land plant evolution and cyanobacterial symbioses
Fay-Wei Li,Fay-Wei Li,Paul Brouwer,Lorenzo Carretero-Paulet,Shifeng Cheng,Jan de Vries,Pierre-Marc Delaux,Ariana N. Eily,Nils Koppers,Li-Yaung Kuo,Zheng Li,Mathew Simenc,Ian Small,Eric K. Wafula,Stephany Angarita,Michael S. Barker,Andrea Bräutigam,Claude W. dePamphilis,Sven B. Gould,Prashant S. Hosmani,Yao Moan Huang,Bruno Huettel,Yoichiro Kato,Xin Liu,Steven Maere,Rose McDowell,Lukas A. Mueller,Klaas G.J. Nierop,Stefan A. Rensing,Tanner A. Robison,Carl J. Rothfels,Erin M. Sigel,Yue Song,Prakash R. Timilsena,Yves Van de Peer,Yves Van de Peer,Hongli Wang,Per K.I. Wilhelmsson,Paul G. Wolf,Xun Xu,Joshua P. Der,Henriette Schluepmann,Gane Ka-Shu Wong,Kathleen M. Pryer +43 more
TL;DR: The genomes of two fern species, Azolla filiculoides and Salvinia cucullata, are reported and insights into fern-specific whole-genome duplications, f Fern-specific insect-resistant gene evolution and fern–cyanobacterial symbiosis are provided.
Journal ArticleDOI
Phylogenomics and a posteriori data partitioning resolve the Cretaceous angiosperm radiation Malpighiales
Zhenxiang Xi,Brad R. Ruhfel,Brad R. Ruhfel,Hanno Schaefer,Hanno Schaefer,André M. Amorim,M. Sugumaran,Kenneth J. Wurdack,Peter K. Endress,Merran L. Matthews,Peter F. Stevens,Sarah Mathews,Charles C. Davis +12 more
TL;DR: It is found that commonly used a priori approaches for partitioning concatenated data in maximum likelihood analyses, by gene or by codon position, performed poorly relative to the use of partitions identified a posteriori using a Bayesian mixture model.
Journal ArticleDOI
Metagenomic study of the oral microbiota by Illumina high-throughput sequencing
Vladimir Lazarevic,Katrine Whiteson,Susan M. Huse,David Hernandez,Laurent Farinelli,Magne Osteras,Jacques Schrenzel,Patrice Francois +7 more
TL;DR: The V5 hypervariable region of the 16S ribosomal RNA (rRNA) gene is identified as a short region providing reliable identification of bacterial sequences available in public databases such as the Human Oral Microbiome Database, and several taxa not yet discovered in these types of samples are identified.
Journal ArticleDOI
Coast-to-Coast Spread of SARS-CoV-2 during the Early Epidemic in the United States.
Joseph R. Fauver,Mary E. Petrone,Emma B. Hodcroft,Emma B. Hodcroft,Kayoko Shioda,Hanna Y. Ehrlich,Alexander Watts,Chantal B.F. Vogels,Anderson F. Brito,Tara Alpert,Anthony Muyombwe,Jafar Razeq,Randy Downing,Nagarjuna R. Cheemarla,Anne L. Wyllie,Chaney C. Kalinich,Isabel M. Ott,Joshua Quick,Nicholas J. Loman,Karla M. Neugebauer,Alexander L. Greninger,Alexander L. Greninger,Keith R. Jerome,Keith R. Jerome,Pavitra Roychoudhury,Pavitra Roychoudhury,Hong Xie,Lasata Shrestha,Meei Li Huang,Meei Li Huang,Virginia E. Pitzer,Akiko Iwasaki,Akiko Iwasaki,Saad B. Omer,Kamran Khan,Kamran Khan,Isaac I. Bogoch,Richard A. Martinello,Ellen F. Foxman,Marie L. Landry,Richard A. Neher,Richard A. Neher,Albert I. Ko,Nathan D. Grubaugh +43 more
TL;DR: It is shown that early SARS-CoV-2 transmission in Connecticut was likely driven by domestic introductions, and the risk of domestic importation to Connecticut exceeded that of international importation by mid-March regardless of the estimated effects of federal travel restrictions.
References
More filters
Journal ArticleDOI
Gapped BLAST and PSI-BLAST: a new generation of protein database search programs.
Stephen F. Altschul,Thomas L. Madden,Alejandro A. Schäffer,Jinghui Zhang,Zheng Zhang,Webb Miller,David J. Lipman +6 more
TL;DR: A new criterion for triggering the extension of word hits, combined with a new heuristic for generating gapped alignments, yields a gapped BLAST program that runs at approximately three times the speed of the original.
Journal ArticleDOI
Clustal w: improving the sensitivity of progressive multiple sequence alignment through sequence weighting, position-specific gap penalties and weight matrix choice
TL;DR: The sensitivity of the commonly used progressive multiple sequence alignment method has been greatly improved and modifications are incorporated into a new program, CLUSTAL W, which is freely available.
Journal ArticleDOI
A simple method for estimating evolutionary rates of base substitutions through comparative studies of nucleotide sequences.
TL;DR: Some examples were worked out using reported globin sequences to show that synonymous substitutions occur at much higher rates than amino acid-altering substitutions in evolution.
Book
Numerical Recipes in C: The Art of Scientific Computing
TL;DR: Numerical Recipes: The Art of Scientific Computing as discussed by the authors is a complete text and reference book on scientific computing with over 100 new routines (now well over 300 in all), plus upgraded versions of many of the original routines, with many new topics presented at the same accessible level.
Journal ArticleDOI
Improved tools for biological sequence comparison.
TL;DR: Three computer programs for comparisons of protein and DNA sequences can be used to search sequence data bases, evaluate similarity scores, and identify periodic structures based on local sequence similarity.