Efficient constrained multiple sequence alignment with performance guarantee

J Bioinform Comput Biol. 2005 Feb;3(1):1-18. doi: 10.1142/s0219720005000977.

Abstract

The constrained multiple sequence alignment problem is to align a set of sequences of maximum length n subject to a given constrained sequence, which arises from some knowledge of the structure of the sequences. This paper presents new algorithms for this problem, which are more efficient in terms of time and space (memory) than the previous algorithms, and with a worst-case guarantee on the quality of the alignment. Saving the space requirement by a quadratic factor is particularly significant as the previous O(n4)-space algorithm has limited application due to its huge memory requirement. Experiments on real data sets confirm that our new algorithms show improvements in both alignment quality and resource requirements.

Publication types

  • Comparative Study
  • Evaluation Study
  • Research Support, Non-U.S. Gov't
  • Validation Study

MeSH terms

  • Algorithms*
  • Conserved Sequence
  • Ribonucleases / analysis*
  • Ribonucleases / chemistry*
  • Sequence Alignment / methods*
  • Sequence Analysis / methods*
  • Sequence Homology

Substances

  • Ribonucleases