Multiple sequence alignment with hierarchical clustering

doi:10.1093/NAR/16.22.10881

Florence Corpet

- 25 Nov 1988 -

Nucleic Acids Research

- Vol. 16, Iss: 22, pp 10881-10890

Abstract:

An algorithm is presented for the multiple alignment of sequences, either proteins or nucleic acids, that is both accurate and easy to use on microcomputers. The approach is based on the conventional dynamic-programming method of pairwise alignment. Initially, a hierarchical clustering of the sequences is performed using the matrix of the pairwise alignment scores. The closest sequences are aligned first, creating groups of aligned sequences. Close groups are then aligned until all sequences are aligned in one group. The pairwise alignments included in the multiple alignment form a new matrix that is used to produce a new hierarchical clustering. If this clustering differs from the first one, the process can be iterated. The method is illustrated by an example: a global alignment of 39 sequences of cytochrome c.
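
The first stage described in the abstract (pairwise dynamic-programming scores feeding a hierarchical clustering that fixes the order in which sequences and groups are aligned) can be sketched roughly as follows. This is an illustrative sketch, not the paper's implementation: the function names `nw_score`, `average_score` and `merge_order`, the match/mismatch/gap values, and the choice of average linkage are all assumptions made here for the example. The sketch only derives the merge order; it does not perform the group-to-group alignments or the iteration step.

```python
from itertools import combinations

def nw_score(a, b, match=1, mismatch=-1, gap=-2):
    """Global pairwise alignment score by dynamic programming.

    Scoring values are placeholders (assumption), not the paper's parameters.
    Only the score is computed, so two rows of the DP table suffice.
    """
    prev = [j * gap for j in range(len(b) + 1)]
    for i in range(1, len(a) + 1):
        curr = [i * gap]
        for j in range(1, len(b) + 1):
            diag = prev[j - 1] + (match if a[i - 1] == b[j - 1] else mismatch)
            curr.append(max(diag, prev[j] + gap, curr[j - 1] + gap))
        prev = curr
    return prev[-1]

def average_score(c1, c2, score):
    """Mean pairwise score between two clusters (average linkage, an assumption)."""
    total = sum(score[frozenset((i, j))] for i in c1 for j in c2)
    return total / (len(c1) * len(c2))

def merge_order(seqs):
    """Return the order in which sequences and groups would be aligned.

    Each entry is a pair of clusters, given as tuples of sequence indices.
    The most similar pair (highest score) is merged at every step.
    """
    score = {
        frozenset((i, j)): nw_score(seqs[i], seqs[j])
        for i, j in combinations(range(len(seqs)), 2)
    }
    clusters = [(i,) for i in range(len(seqs))]
    order = []
    while len(clusters) > 1:
        x, y = max(combinations(clusters, 2),
                   key=lambda pair: average_score(pair[0], pair[1], score))
        clusters.remove(x)
        clusters.remove(y)
        clusters.append(x + y)
        order.append((x, y))
    return order

if __name__ == "__main__":
    toy = ["ACGT", "ACGA", "TTTT", "TTTA"]  # toy input, not data from the paper
    for left, right in merge_order(toy):
        print(left, "+", right)
```

In a full progressive aligner, each entry of the returned order would be replaced by an actual alignment of the two sequences or groups, and the scores of the pairwise alignments implied by the final multiple alignment would be clustered again to decide whether another pass is worthwhile.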

Citations

Amino acid substitution matrices from protein blocks

Steven Henikoff, +1 more

- 15 Nov 1992 -

Proceedings of the National Academy of S...

TL;DR: This work has derived substitution matrices from about 2000 blocks of aligned sequence segments characterizing more than 500 groups of related proteins, leading to marked improvements in alignments and in searches using queries from each of the groups.

Deciphering key features in protein structures with the new ENDscript server

Xavier Robert, +1 more

- 01 Jul 2014 -

Nucleic Acids Research

TL;DR: ENDscript 2, a major upgrade, has been fully re-engineered to enhance speed, accuracy and usability with interactive 3D visualization, and uses ESPript 3 to handle large amounts of data with reduced computation time.

Comparative Protein Structure Modeling Using MODELLER

Narayanan Eswar, +7 more

- 01 Nov 2007 -

Current protocols in protein science

TL;DR: This unit describes how to calculate comparative models using the program MODELLER and discusses all four steps of comparative modeling, frequently observed errors, and some applications.

A classification of glycosyl hydrolases based on amino acid sequence similarities.

Bernard Henrissat

- 01 Dec 1991 -

Biochemical Journal

TL;DR: With the steady increase in sequence and structural data, it is suggested that the enzyme classification system should perhaps be revised.

Comparative protein structure modeling of genes and genomes

Marc A. Marti-Renom, +5 more

- 01 Jan 2000 -

Annual Review of Biophysics and Biomolec...

TL;DR: There is a need to develop an automated, rapid, robust, sensitive, and accurate comparative modeling pipeline applicable to whole genomes and to encourage new kinds of applications for the many resulting models, based on their large number and completeness at the level of the family, organism, or functional network.
