SCARNA: fast and accurate structural alignment of RNA sequences by matching fixed-length stem fragments

Bioinformatics. 2006 Jul 15;22(14):1723-9. doi: 10.1093/bioinformatics/btl177. Epub 2006 May 11.

Abstract

Motivation: The functions of non-coding RNAs are strongly related to their secondary structures, but secondary structure prediction from a single sequence is known to be unreliable. To analyze a new non-coding RNA whose exact secondary structure is unknown, we therefore have to collect similar RNA sequences that share a common secondary structure. Consequently, sequence comparison in the search for similar RNAs should consider not only sequence similarity but also potential secondary structure. Sankoff's algorithm predicts the common secondary structure of the sequences, but it is computationally too expensive for large-scale analyses. Because we often want to compare a large number of cDNA sequences or to search for similar RNAs in whole genome sequences, much faster algorithms are required.

Results: We propose a new method for comparing RNA sequences based on structural alignment of fixed-length fragments of stem candidates. The implemented software, SCARNA (Stem Candidate Aligner for RNAs), is fast enough to be applied to long sequences in large-scale analyses. The accuracy of its alignments is better than or comparable to that of much slower existing algorithms.
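To make the idea of a fixed-length stem fragment concrete, the following is a minimal sketch, not SCARNA's implementation. The abstract does not say how stem candidates are selected, so this sketch substitutes a plain complementarity check: it lists every pair of length-k fragments in one sequence that could base-pair in antiparallel orientation. The function name `stem_fragments` and the parameters `k` and `min_loop` are hypothetical and chosen only for illustration.

```python
# Illustrative only: enumerate fixed-length stem fragment candidates by
# simple complementarity. This is an assumption, not the SCARNA method.

# Canonical Watson-Crick pairs plus the G-U wobble pair.
PAIRS = {("A", "U"), ("U", "A"), ("G", "C"), ("C", "G"), ("G", "U"), ("U", "G")}


def stem_fragments(seq, k=3, min_loop=3):
    """Return (i, j) start positions (0-based) such that seq[i:i+k] can
    pair antiparallel with seq[j:j+k], leaving at least min_loop
    unpaired bases between the two fragments."""
    seq = seq.upper().replace("T", "U")  # accept DNA-style input
    n = len(seq)
    found = []
    for i in range(n - k + 1):
        # The 3' fragment starts at least min_loop bases after the 5' fragment ends.
        for j in range(i + k + min_loop, n - k + 1):
            # Base i+t of the 5' fragment pairs with base j+k-1-t of the 3' fragment.
            if all((seq[i + t], seq[j + k - 1 - t]) in PAIRS for t in range(k)):
                found.append((i, j))
    return found


print(stem_fragments("GGGAAAACCC"))  # [(0, 7)]
```

In this toy hairpin, the fragment GGG at position 0 pairs with CCC at position 7. A structural aligner in the spirit of the abstract would then match such fragments between two sequences rather than aligning single bases; that matching step is not shown here.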

Availability: The SCARNA web server, with a graphical structural alignment viewer, is available at http://www.scarna.org/.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Algorithms*
  • Artificial Intelligence
  • Base Sequence
  • Molecular Sequence Data
  • Pattern Recognition, Automated / methods*
  • RNA / genetics*
  • Sequence Alignment / methods*
  • Sequence Analysis, RNA / methods*
  • Sequence Homology, Nucleic Acid
  • Software*

Substances

  • RNA