Open Access Journal Article

MAFFT: a novel method for rapid multiple sequence alignment based on fast Fourier transform

TLDR
A simplified scoring system is proposed that performs well for reducing CPU time and increasing the accuracy of alignments even for sequences having large insertions or extensions as well as distantly related sequences of similar length.
Abstract
A multiple sequence alignment program, MAFFT, has been developed. The CPU time is drastically reduced as compared with existing methods. MAFFT includes two novel techniques. (i) Homologous regions are rapidly identified by the fast Fourier transform (FFT), in which an amino acid sequence is converted to a sequence composed of volume and polarity values of each amino acid residue. (ii) We propose a simplified scoring system that performs well for reducing CPU time and increasing the accuracy of alignments even for sequences having large insertions or extensions as well as distantly related sequences of similar length. Two different heuristics, the progressive method (FFT-NS-2) and the iterative refinement method (FFT-NS-i), are implemented in MAFFT. The performances of FFT-NS-2 and FFT-NS-i were compared with other methods by computer simulations and benchmark tests; the CPU time of FFT-NS-2 is drastically reduced as compared with CLUSTALW with comparable accuracy. FFT-NS-i is over 100 times faster than T-COFFEE, when the number of input sequences exceeds 60, without sacrificing the accuracy.
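The FFT step in (i) can be illustrated with a minimal sketch. It assumes each residue maps to a (volume, polarity) pair, correlates the two resulting numeric signals between a pair of sequences using an FFT, and reports the lag with the highest combined correlation. The `PROPS` table holds placeholder numbers chosen for illustration only, not the values used by MAFFT, and the function names `fft`, `correlate`, and `best_lag` are hypothetical. This is not the authors' implementation: a correlation peak only suggests an offset between similar regions, and finding where those regions lie along the sequences is a separate step not attempted here.

```python
import cmath

# Placeholder (volume, polarity) pairs, roughly centred on zero.
# ASSUMPTION: illustrative numbers only, NOT the values used by MAFFT.
PROPS = {
    "A": (-1.0, 0.0),
    "K": (1.0, 1.3),
    "D": (-0.4, 1.6),
    "L": (0.8, -1.2),
    "W": (2.0, -1.0),
}


def fft(x, invert=False):
    """Recursive radix-2 FFT (unnormalised); len(x) must be a power of two."""
    n = len(x)
    if n == 1:
        return list(x)
    even = fft(x[0::2], invert)
    odd = fft(x[1::2], invert)
    sign = 1 if invert else -1
    out = [0j] * n
    for k in range(n // 2):
        t = cmath.exp(sign * 2j * cmath.pi * k / n) * odd[k]
        out[k] = even[k] + t
        out[k + n // 2] = even[k] - t
    return out


def correlate(a, b):
    """Return c with c[k] = sum_n a[n] * b[n + k], computed via FFT.

    Inputs are zero-padded to a power of two >= len(a) + len(b), so
    negative lags k appear (wrapped) at index len(c) + k.
    """
    size = 1
    while size < len(a) + len(b):
        size *= 2
    fa = fft([complex(v) for v in a] + [0j] * (size - len(a)))
    fb = fft([complex(v) for v in b] + [0j] * (size - len(b)))
    # For real input, correlation in the frequency domain is conj(A) * B.
    prod = [x.conjugate() * y for x, y in zip(fa, fb)]
    return [v.real / size for v in fft(prod, invert=True)]


def best_lag(seq1, seq2):
    """Lag k maximising the summed volume + polarity correlation.

    A positive k means the similar region starts k positions later in seq2.
    """
    vol1, pol1 = zip(*(PROPS[r] for r in seq1))
    vol2, pol2 = zip(*(PROPS[r] for r in seq2))
    total = [v + p for v, p in zip(correlate(vol1, vol2), correlate(pol1, pol2))]
    k = max(range(len(total)), key=lambda i: total[i])
    # Indices below len(seq2) are non-negative lags; the rest are wrapped.
    return k if k < len(seq2) else k - len(total)


if __name__ == "__main__":
    print(best_lag("WKDL", "AAWKDL"))
```

With these placeholder values, `best_lag("WKDL", "AAWKDL")` returns 2, the offset at which the shared segment lines up. The FFT computes all lags in O(N log N) time instead of the O(N^2) needed to compute each lag directly, which is the source of the speed-up the abstract refers to.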


Citations
Journal Article

'Glomus intraradices DAOM197198', a model fungus in arbuscular mycorrhiza research, is not Glomus intraradices.

TL;DR: It is concluded that the AM fungi with the identifiers DAOM197198 and BEG195 are not G. intraradices, but fall in a clade that contains the recently described species G. irregulare.
Journal Article

Arabidopsis WUSCHEL Is a Bifunctional Transcription Factor That Acts as a Repressor in Stem Cell Regulation and as an Activator in Floral Patterning

TL;DR: It is demonstrated here that the Arabidopsis thaliana protein WUSCHEL (WUS), which regulates the maintenance of stem cell populations in shoot meristems, is a bifunctional transcription factor that acts mainly as a repressor but becomes an activator when involved in the regulation of the AGAMOUS (AG) gene.
Journal Article

How to describe a cryptic species? Practical challenges of molecular taxonomy

TL;DR: Three previously valid Pontohedyle species are characterized based on four genetic markers and nine cryptic new species are formally described applying molecular taxonomy, based on diagnostic nucleotides in DNA sequences of the four markers.
Journal Article

Application of the MAFFT sequence alignment program to large data—reexamination of the usefulness of chained guide trees

TL;DR: This work used HomFam, ContTest and OXFam to evaluate several methods enabled in MAFFT and found that methods 3 and 4 increased the benchmark scores more consistently than method 2 for the three datasets, suggesting that they are safer to use.
Journal Article

DIALIGN-TX: greedy and progressive approaches for segment-based multiple sequence alignment

TL;DR: DIALIGN-TX is presented, a substantial improvement of DIALIGN-T that combines the previous greedy algorithm with a progressive alignment approach and produces significantly better alignments, especially on globally related sequences, without increasing the CPU time and memory consumption exceedingly.
References
Journal Article

Gapped BLAST and PSI-BLAST: a new generation of protein database search programs.

TL;DR: A new criterion for triggering the extension of word hits, combined with a new heuristic for generating gapped alignments, yields a gapped BLAST program that runs at approximately three times the speed of the original.
Journal Article

CLUSTAL W: improving the sensitivity of progressive multiple sequence alignment through sequence weighting, position-specific gap penalties and weight matrix choice

TL;DR: The sensitivity of the commonly used progressive multiple sequence alignment method has been greatly improved and modifications are incorporated into a new program, CLUSTAL W, which is freely available.
Journal Article

A simple method for estimating evolutionary rates of base substitutions through comparative studies of nucleotide sequences.

TL;DR: Some examples were worked out using reported globin sequences to show that synonymous substitutions occur at much higher rates than amino acid-altering substitutions in evolution.
Book

Numerical Recipes in C: The Art of Scientific Computing

TL;DR: Numerical Recipes: The Art of Scientific Computing is a complete text and reference book on scientific computing, with over 100 new routines (now well over 300 in all), upgraded versions of many of the original routines, and many new topics presented at the same accessible level.
Journal Article

Improved tools for biological sequence comparison.

TL;DR: Three computer programs for comparisons of protein and DNA sequences can be used to search sequence data bases, evaluate similarity scores, and identify periodic structures based on local sequence similarity.