scispace - formally typeset
Open AccessJournal ArticleDOI

Improved tools for biological sequence comparison.

Reads0
Chats0
TLDR
Three computer programs for comparisons of protein and DNA sequences can be used to search sequence data bases, evaluate similarity scores, and identify periodic structures based on local sequence similarity.
Abstract
We have developed three computer programs for comparisons of protein and DNA sequences. They can be used to search sequence data bases, evaluate similarity scores, and identify periodic structures based on local sequence similarity. The FASTA program is a more sensitive derivative of the FASTP program, which can be used to search protein or DNA sequence data bases and can compare a protein sequence to a DNA sequence data base by translating the DNA data base as it is searched. FASTA includes an additional step in the calculation of the initial pairwise similarity score that allows multiple regions of similarity to be joined to increase the score of related sequences. The RDF2 program can be used to evaluate the significance of similarity scores using a shuffling method that preserves local sequence composition. The LFASTA program can display all the regions of local similarity between two sequences with scores greater than a threshold, using the same scoring parameters and a similar alignment algorithm; these local similarities can be displayed as a "graphic matrix" plot or as individual alignments. In addition, these programs have been generalized to allow comparison of DNA or protein sequences based on a variety of alternative scoring matrices.

read more

Citations
More filters
Journal ArticleDOI

The Enolase Superfamily: A General Strategy for Enzyme-Catalyzed Abstraction of the α-Protons of Carboxylic Acids†

TL;DR: A superfamily of enzymes related by their ability to catalyze the abstraction of the alpha-proton of a carboxylic acid to form an enolic intermediate is discovered, and the established and deduced structure-function relationships in the superfamily allow the prediction that other apparent members of the family for which no catalytic functions have yet been assigned will also perform chemistry involving abstraction of their alpha-protons.
Journal ArticleDOI

A large domain common to sperm receptors (Zp2 and Zp3) and TGF-β type III receptor

TL;DR: The results lead to the hypothesis that this type of domain has a common tertiary structure and that there is a functional similarity in the recognition mechanism of the sperm receptor system and the TGF‐β receptor complex.
Journal ArticleDOI

Characterization of the cobalamin (vitamin B12) biosynthetic genes of Salmonella typhimurium.

TL;DR: Differences in the DNA sequence for the promoter region and the proximal 17.1 kb of the cob operon suggest that the cobalamin biosynthetic pathways differ between the two organisms.
Journal ArticleDOI

A contig assembly program based on sensitive detection of fragment overlaps.

TL;DR: An effective computer program for assembling DNA fragments, the contig assembly program (CAP), has been developed and the performance tests of the program on fragment data from genomic sequencing projects produced satisfactory results.
Journal ArticleDOI

ACLAME: A CLAssification of Mobile genetic Elements, update 2010

TL;DR: The ACLAME database is dedicated to the collection, analysis and classification of sequenced mobile genetic elements (MGEs, in particular phages and plasmids), and numerous tools for in-depth analysis of the results have been added.
References
More filters
Journal ArticleDOI

A general method applicable to the search for similarities in the amino acid sequence of two proteins

TL;DR: A computer adaptable method for finding similarities in the amino acid sequences of two proteins has been developed and it is possible to determine whether significant homology exists between the proteins to trace their possible evolutionary development.
Journal ArticleDOI

Identification of common molecular subsequences.

TL;DR: This letter extends the heuristic homology algorithm of Needleman & Wunsch (1970) to find a pair of segments, one from each of two long sequences, such that there is no other Pair of segments with greater similarity (homology).
Journal ArticleDOI

Rapid and sensitive protein similarity searches

TL;DR: An algorithm was developed which facilitates the search for similarities between newly determined amino acid sequences and sequences already available in databases and increases sensitivity by giving high scores to those amino acid replacements which occur frequently in evolution.
Journal ArticleDOI

Rapid similarity searches of nucleic acid and protein data banks.

TL;DR: An algorithm for the global comparison of sequences based on matching k-tuples of sequence elements for a fixed k results in substantial reduction in the time required to search a data bank when compared with prior techniques of similarity analysis, with minimal loss in sensitivity.
Related Papers (5)