MUSTANG: a multiple structural alignment algorithm

Proteins. 2006 Aug 15;64(3):559-74. doi: 10.1002/prot.20921.

Abstract

Multiple structural alignment is a fundamental problem in structural genomics. In this article, we define a reliable and robust algorithm, MUSTANG (MUltiple STructural AligNment AlGorithm), for the alignment of multiple protein structures. Given a set of protein structures, the program constructs a multiple alignment using the spatial information of the C(alpha) atoms in the set. Broadly based on the progressive pairwise heuristic, this algorithm gains accuracy through novel and effective refinement phases. MUSTANG reports the multiple sequence alignment and the corresponding superposition of structures. Alignments generated by MUSTANG are compared with several handcurated alignments in the literature as well as with the benchmark alignments of 1033 alignment families from the HOMSTRAD database. The performance of MUSTANG was compared with DALI at a pairwise level, and with other multiple structural alignment tools such as POSA, CE-MC, MALECON, and MultiProt. MUSTANG performs comparably to popular pairwise and multiple structural alignment tools for closely related proteins, and performs more reliably than other multiple structural alignment methods on hard data sets containing distantly related proteins or proteins that show conformational changes.

Publication types

  • Comparative Study

MeSH terms

  • Algorithms*
  • Amino Acid Sequence
  • Computational Biology
  • Databases, Protein
  • Globins / chemistry
  • Models, Molecular
  • Molecular Sequence Data
  • Protein Structure, Secondary
  • Protein Structure, Tertiary
  • Reproducibility of Results
  • Sequence Alignment / methods*
  • Sequence Homology, Amino Acid
  • Serine Endopeptidases / chemistry
  • Software
  • Structural Homology, Protein

Substances

  • Globins
  • Serine Endopeptidases