Nucleic Acids Res. Nov 25, 1988; 16(22): 10881–10890.
PMCID: PMC338945

Multiple sequence alignment with hierarchical clustering.

Abstract

An algorithm is presented for the multiple alignment of sequences, either proteins or nucleic acids, that is both accurate and easy to use on microcomputers. The approach is based on the conventional dynamic-programming method of pairwise alignment. Initially, a hierarchical clustering of the sequences is performed using the matrix of pairwise alignment scores. The closest sequences are aligned first, creating groups of aligned sequences. Close groups are then aligned with each other until all sequences are aligned in one group. The pairwise alignments included in the multiple alignment form a new score matrix, which is used to produce a new hierarchical clustering. If this clustering differs from the first one, the process can be iterated. The method is illustrated by an example: a global alignment of 39 sequences of cytochrome c.
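
The first two steps described in the abstract (pairwise dynamic-programming scores, then a hierarchical clustering that fixes the order in which sequences and groups are aligned) can be sketched in Python as follows. This is a minimal illustration under stated assumptions, not the program described in the article: the function names `nw_score` and `guide_tree`, the scoring values (match +1, mismatch -1, linear gap -1), and the use of average linkage are all choices made here for the example. The group-to-group alignment and the iteration step are not shown.

```python
from itertools import combinations


def nw_score(a, b, match=1, mismatch=-1, gap=-1):
    """Global (Needleman-Wunsch) alignment score with a linear gap penalty.

    The scoring values are illustrative assumptions, not the article's.
    """
    # prev[j] holds the best score for aligning the processed prefix of a with b[:j].
    prev = [j * gap for j in range(len(b) + 1)]
    for i in range(1, len(a) + 1):
        cur = [i * gap]
        for j in range(1, len(b) + 1):
            sub = match if a[i - 1] == b[j - 1] else mismatch
            cur.append(max(prev[j - 1] + sub,   # align a[i-1] with b[j-1]
                           prev[j] + gap,       # gap in b
                           cur[j - 1] + gap))   # gap in a
        prev = cur
    return prev[-1]


def guide_tree(seqs):
    """Cluster sequences by pairwise score; return the merge order as nested tuples.

    Average linkage is an assumption made for this sketch.
    """
    # Matrix of pairwise alignment scores, keyed by unordered index pair.
    score = {frozenset(p): nw_score(seqs[p[0]], seqs[p[1]])
             for p in combinations(range(len(seqs)), 2)}

    # Each cluster is (subtree, list of member indices).
    clusters = [(i, [i]) for i in range(len(seqs))]

    def linkage(x, y):
        # Mean pairwise score between the members of two clusters.
        total = sum(score[frozenset((i, j))] for i in x[1] for j in y[1])
        return total / (len(x[1]) * len(y[1]))

    while len(clusters) > 1:
        # Merge the two closest clusters (highest mean score).
        x, y = max(combinations(clusters, 2), key=lambda p: linkage(*p))
        clusters = [c for c in clusters if c is not x and c is not y]
        clusters.append(((x[0], y[0]), x[1] + y[1]))
    return clusters[0][0]


if __name__ == "__main__":
    seqs = ["ACGTACGT", "ACGTACGA", "TTTTGGGG", "TTTTGGGC"]
    print(guide_tree(seqs))  # prints ((0, 1), (2, 3))
```

In the returned tree, each inner tuple is a pair of sequences or groups to be aligned together, read from the innermost pairs outward. In the toy input the two similar pairs are joined first and the resulting groups are joined last.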



Articles from Nucleic Acids Research are provided here courtesy of Oxford University Press
