Structured motifs search

J Comput Biol. 2005 Oct;12(8):1065-82. doi: 10.1089/cmb.2005.12.1065.

Abstract

In this paper, we describe an algorithm for the localization of structured models, i.e. sequences of (simple) motifs and distance constraints. It basically combines standard pattern matching procedures with a constraint satisfaction solver, and it has the ability, not present in similar tools, to search for partial matches. A significant feature of our approach, especially in terms of efficiency for the application context, is that the (potentially) exponentially many solutions to the considered problem are represented in compact form as a graph. Moreover, the time and space necessary to build the graph are linear in the number of occurrences of the component patterns.

MeSH terms

  • Algorithms*
  • Base Sequence*
  • Cluster Analysis*
  • Computational Biology
  • DNA Transposable Elements / genetics
  • Genome
  • Information Storage and Retrieval
  • Models, Genetic
  • Numerical Analysis, Computer-Assisted
  • Pattern Recognition, Automated*
  • Sequence Alignment
  • Sequence Analysis, DNA*
  • Software

Substances

  • DNA Transposable Elements