Multiple sequence alignment with hierarchical clustering
Reads0
Chats0
TLDR
An algorithm is presented for the multiple alignment of sequences, either proteins or nucleic acids, that is both accurate and easy to use on microcomputers, based on the conventional dynamic-programming method of pairwise alignment.Abstract:
An algorithm is presented for the multiple alignment of sequences, either proteins or nucleic acids, that is both accurate and easy to use on microcomputers. The approach is based on the conventional dynamic-programming method of pairwise alignment. Initially, a hierarchical clustering of the sequences is performed using the matrix of the pairwise alignment scores. The closest sequences are aligned creating groups of aligned sequences. Then close groups are aligned until all sequences are aligned in one group. The pairwise alignments included in the multiple alignment form a new matrix that is used to produce a hierarchical clustering. If it is different from the first one, iteration of the process can be performed. The method is illustrated by an example: a global alignment of 39 sequences of cytochrome c.read more
Citations
More filters
Journal ArticleDOI
Amino acid substitution matrices from protein blocks
TL;DR: This work has derived substitution matrices from about 2000 blocks of aligned sequence segments characterizing more than 500 groups of related proteins, leading to marked improvements in alignments and in searches using queries from each of the groups.
Journal ArticleDOI
Deciphering key features in protein structures with the new ENDscript server
Xavier Robert,Patrice Gouet +1 more
TL;DR: This major upgrade has been fully re-engineered to enhance speed, accuracy and usability with interactive 3D visualization of ENDscript 2 and ESPript 3 to handle a large number of data with reduced computation time.
Journal ArticleDOI
Comparative Protein Structure Modeling Using MODELLER
Narayanan Eswar,Ben Webb,Marc A. Marti-Renom,Mallur S. Madhusudhan,David Eramian,Min-Yi Shen,Ursula Pieper,Andrej Sali +7 more
TL;DR: This unit describes how to calculate comparative models using the program MODELLER and discusses all four steps of comparative modeling, frequently observed errors, and some applications.
Journal ArticleDOI
A classification of glycosyl hydrolases based on amino acid sequence similarities.
TL;DR: With the steady increase in sequence and structural data, it is suggested that the enzyme classification system should perhaps be revised.
Journal ArticleDOI
Comparative protein structure modeling of genes and genomes
Marc A. Marti-Renom,Ashley C. Stuart,Andras Fiser,Roberto Sanchez,Francisco Melo,Andrej Sali +5 more
TL;DR: There is a need to develop an automated, rapid, robust, sensitive, and accurate comparative modeling pipeline applicable to whole genomes and to encourage new kinds of applications for the many resulting models, based on their large number and completeness at the level of the family, organism, or functional network.