scispace - formally typeset
Open AccessJournal ArticleDOI

Multiple sequence alignment with hierarchical clustering

Florence Corpet
- 25 Nov 1988 - 
- Vol. 16, Iss: 22, pp 10881-10890
Reads0
Chats0
TLDR
An algorithm is presented for the multiple alignment of sequences, either proteins or nucleic acids, that is both accurate and easy to use on microcomputers, based on the conventional dynamic-programming method of pairwise alignment.
Abstract
An algorithm is presented for the multiple alignment of sequences, either proteins or nucleic acids, that is both accurate and easy to use on microcomputers. The approach is based on the conventional dynamic-programming method of pairwise alignment. Initially, a hierarchical clustering of the sequences is performed using the matrix of the pairwise alignment scores. The closest sequences are aligned creating groups of aligned sequences. Then close groups are aligned until all sequences are aligned in one group. The pairwise alignments included in the multiple alignment form a new matrix that is used to produce a hierarchical clustering. If it is different from the first one, iteration of the process can be performed. The method is illustrated by an example: a global alignment of 39 sequences of cytochrome c.

read more

Citations
More filters
Journal ArticleDOI

Amino acid substitution matrices from protein blocks

TL;DR: This work has derived substitution matrices from about 2000 blocks of aligned sequence segments characterizing more than 500 groups of related proteins, leading to marked improvements in alignments and in searches using queries from each of the groups.
Journal ArticleDOI

Deciphering key features in protein structures with the new ENDscript server

TL;DR: This major upgrade has been fully re-engineered to enhance speed, accuracy and usability with interactive 3D visualization of ENDscript 2 and ESPript 3 to handle a large number of data with reduced computation time.
Journal ArticleDOI

Comparative Protein Structure Modeling Using MODELLER

TL;DR: This unit describes how to calculate comparative models using the program MODELLER and discusses all four steps of comparative modeling, frequently observed errors, and some applications.
Journal ArticleDOI

A classification of glycosyl hydrolases based on amino acid sequence similarities.

TL;DR: With the steady increase in sequence and structural data, it is suggested that the enzyme classification system should perhaps be revised.
Journal ArticleDOI

Comparative protein structure modeling of genes and genomes

TL;DR: There is a need to develop an automated, rapid, robust, sensitive, and accurate comparative modeling pipeline applicable to whole genomes and to encourage new kinds of applications for the many resulting models, based on their large number and completeness at the level of the family, organism, or functional network.
Related Papers (5)