Improved tools for biological sequence comparison.
Reads0
Chats0
TLDR
Three computer programs for comparisons of protein and DNA sequences can be used to search sequence data bases, evaluate similarity scores, and identify periodic structures based on local sequence similarity.Abstract:
We have developed three computer programs for comparisons of protein and DNA sequences. They can be used to search sequence data bases, evaluate similarity scores, and identify periodic structures based on local sequence similarity. The FASTA program is a more sensitive derivative of the FASTP program, which can be used to search protein or DNA sequence data bases and can compare a protein sequence to a DNA sequence data base by translating the DNA data base as it is searched. FASTA includes an additional step in the calculation of the initial pairwise similarity score that allows multiple regions of similarity to be joined to increase the score of related sequences. The RDF2 program can be used to evaluate the significance of similarity scores using a shuffling method that preserves local sequence composition. The LFASTA program can display all the regions of local similarity between two sequences with scores greater than a threshold, using the same scoring parameters and a similar alignment algorithm; these local similarities can be displayed as a "graphic matrix" plot or as individual alignments. In addition, these programs have been generalized to allow comparison of DNA or protein sequences based on a variety of alternative scoring matrices.read more
Citations
More filters
Journal ArticleDOI
Classification of bacteria isolated from a medieval wall painting
Petra Altenburgera,Petra Altenburgera,Peter Kämpferb,Peter Kämpferb,Athanasios Makristathisc,Athanasios Makristathisc,Werner Lubitza,Werner Lubitza,Hans-Jürgen Bussea,Hans-Jürgen Bussea +9 more
TL;DR: Six bacterial strains were isolated from a damaged medieval wall painting by a polyphasic approach, including analysis of respiratory isoprenoid quinones, polar lipids, fatty acids, polyamines, cell wall diamino acids and sugars from whole cell hydrolysates, and partial 16S rDNA sequence analysis.
Journal ArticleDOI
Modeling water molecules in protein-ligand docking using GOLD.
Marcel L. Verdonk,Gianni Chessari,Jason C. Cole,Michael J. Hartshorn,Christopher W. Murray,J. Willem M. Nissink,Richard David Taylor,Robin Taylor +7 more
TL;DR: A novel approach to score water mediation and displacement in the protein-ligand docking program GOLD, where a constant penalty, sigma(p), representing the loss of rigid-body entropy, is added for water molecules that are switched on, hence rewarding water displacement.
Journal ArticleDOI
Characterization of a candidate bcl-1 gene.
TL;DR: Analysis of expression of bcl-1 in an extensive panel of human cell lines showed it to be widely expressed except in lymphoid or myeloid lineages, which may provide a molecular basis for distinct modes of cell cycle control in different mammalian tissues.
Journal ArticleDOI
Identification of the gene causing mucolipidosis type IV
Ruth Bargal,Nili Avidan,Edna Ben-Asher,Zvia Olender,Marcia Zeigler,Ayala Frumkin,Annick Raas-Rothschild,Gustavo Glusman,Doron Lancet,Gideon Bach +9 more
TL;DR: The identification of a new gene in this human chromosomal region in which MLIV-specific mutations were identified is reported here, and positional cloning was an alternative to identify the MLIV gene.
Journal ArticleDOI
Reducing storage requirements for biological sequence comparison
TL;DR: A simple and elegant method in which only a small fraction of seeds, called 'minimizers', needs to be stored, which can speed up string-matching computations by a large factor while missing only aSmall fraction of the matches found using all seeds.
References
More filters
Journal ArticleDOI
A general method applicable to the search for similarities in the amino acid sequence of two proteins
TL;DR: A computer adaptable method for finding similarities in the amino acid sequences of two proteins has been developed and it is possible to determine whether significant homology exists between the proteins to trace their possible evolutionary development.
Journal ArticleDOI
Identification of common molecular subsequences.
TL;DR: This letter extends the heuristic homology algorithm of Needleman & Wunsch (1970) to find a pair of segments, one from each of two long sequences, such that there is no other Pair of segments with greater similarity (homology).
Journal ArticleDOI
Rapid and sensitive protein similarity searches
TL;DR: An algorithm was developed which facilitates the search for similarities between newly determined amino acid sequences and sequences already available in databases and increases sensitivity by giving high scores to those amino acid replacements which occur frequently in evolution.
Journal ArticleDOI
Rapid similarity searches of nucleic acid and protein data banks.
W. J. Wilbur,David J. Lipman +1 more
TL;DR: An algorithm for the global comparison of sequences based on matching k-tuples of sequence elements for a fixed k results in substantial reduction in the time required to search a data bank when compared with prior techniques of similarity analysis, with minimal loss in sensitivity.