scispace - formally typeset
Open AccessJournal ArticleDOI

Improved tools for biological sequence comparison.

Reads0
Chats0
TLDR
Three computer programs for comparisons of protein and DNA sequences can be used to search sequence data bases, evaluate similarity scores, and identify periodic structures based on local sequence similarity.
Abstract
We have developed three computer programs for comparisons of protein and DNA sequences. They can be used to search sequence data bases, evaluate similarity scores, and identify periodic structures based on local sequence similarity. The FASTA program is a more sensitive derivative of the FASTP program, which can be used to search protein or DNA sequence data bases and can compare a protein sequence to a DNA sequence data base by translating the DNA data base as it is searched. FASTA includes an additional step in the calculation of the initial pairwise similarity score that allows multiple regions of similarity to be joined to increase the score of related sequences. The RDF2 program can be used to evaluate the significance of similarity scores using a shuffling method that preserves local sequence composition. The LFASTA program can display all the regions of local similarity between two sequences with scores greater than a threshold, using the same scoring parameters and a similar alignment algorithm; these local similarities can be displayed as a "graphic matrix" plot or as individual alignments. In addition, these programs have been generalized to allow comparison of DNA or protein sequences based on a variety of alternative scoring matrices.

read more

Citations
More filters
Journal ArticleDOI

Classification of bacteria isolated from a medieval wall painting

TL;DR: Six bacterial strains were isolated from a damaged medieval wall painting by a polyphasic approach, including analysis of respiratory isoprenoid quinones, polar lipids, fatty acids, polyamines, cell wall diamino acids and sugars from whole cell hydrolysates, and partial 16S rDNA sequence analysis.
Journal ArticleDOI

Modeling water molecules in protein-ligand docking using GOLD.

TL;DR: A novel approach to score water mediation and displacement in the protein-ligand docking program GOLD, where a constant penalty, sigma(p), representing the loss of rigid-body entropy, is added for water molecules that are switched on, hence rewarding water displacement.
Journal ArticleDOI

Characterization of a candidate bcl-1 gene.

TL;DR: Analysis of expression of bcl-1 in an extensive panel of human cell lines showed it to be widely expressed except in lymphoid or myeloid lineages, which may provide a molecular basis for distinct modes of cell cycle control in different mammalian tissues.
Journal ArticleDOI

Identification of the gene causing mucolipidosis type IV

TL;DR: The identification of a new gene in this human chromosomal region in which MLIV-specific mutations were identified is reported here, and positional cloning was an alternative to identify the MLIV gene.
Journal ArticleDOI

Reducing storage requirements for biological sequence comparison

TL;DR: A simple and elegant method in which only a small fraction of seeds, called 'minimizers', needs to be stored, which can speed up string-matching computations by a large factor while missing only aSmall fraction of the matches found using all seeds.
References
More filters
Journal ArticleDOI

A general method applicable to the search for similarities in the amino acid sequence of two proteins

TL;DR: A computer adaptable method for finding similarities in the amino acid sequences of two proteins has been developed and it is possible to determine whether significant homology exists between the proteins to trace their possible evolutionary development.
Journal ArticleDOI

Identification of common molecular subsequences.

TL;DR: This letter extends the heuristic homology algorithm of Needleman & Wunsch (1970) to find a pair of segments, one from each of two long sequences, such that there is no other Pair of segments with greater similarity (homology).
Journal ArticleDOI

Rapid and sensitive protein similarity searches

TL;DR: An algorithm was developed which facilitates the search for similarities between newly determined amino acid sequences and sequences already available in databases and increases sensitivity by giving high scores to those amino acid replacements which occur frequently in evolution.
Journal ArticleDOI

Rapid similarity searches of nucleic acid and protein data banks.

TL;DR: An algorithm for the global comparison of sequences based on matching k-tuples of sequence elements for a fixed k results in substantial reduction in the time required to search a data bank when compared with prior techniques of similarity analysis, with minimal loss in sensitivity.
Related Papers (5)