Bioinformatics. 2007 Aug 1;23(15):1883-91. Epub 2007 May 30.

RNA Sampler: a new sampling based algorithm for common RNA secondary structure prediction and structural alignment.

Author information

Department of Genetics, Washington University, School of Medicine, St. Louis, MO 63110, USA. xingxu@ural.wustl.edu

Abstract

MOTIVATION:

Non-coding RNA genes and RNA structural regulatory motifs play important roles in gene regulation and other cellular functions. They are often characterized by specific secondary structures that are critical to their functions and are often conserved in phylogenetically or functionally related sequences. Predicting common RNA secondary structures in multiple unaligned sequences remains a challenge in bioinformatics research.

METHODS AND RESULTS:

We present a new sampling-based algorithm to predict common RNA secondary structures in multiple unaligned sequences. The algorithm finds the common structure between two sequences by probabilistically sampling aligned stems according to their stem conservation, which is calculated from intrasequence base pairing probabilities and intersequence base alignment probabilities. It iteratively updates these probabilities based on the sampled structures and then recalculates stem conservation using the updated probabilities. The iterative process terminates when the sampled structures converge. We extend the algorithm to multiple sequences with a consistency-based method, which iteratively incorporates and reinforces consistent structure information from pairwise comparisons into consensus structures. The algorithm places no restriction on predicting pseudoknots. In extensive testing on real sequence data, our algorithm outperformed other leading RNA structure prediction methods in both sensitivity and specificity while running reasonably fast. It also generated better structural alignments than other programs for sequences spanning a wide range of identities; these alignments more accurately represent the conservation of RNA secondary structure.
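The pairwise sampling step described above can be illustrated with a toy sketch. This is not the RNA Sampler implementation (which is written in C), and the abstract does not give the scoring formula. Here the stem conservation score is assumed, purely for illustration, to be the product of the two base pairing probabilities and the alignment probability. The names `stem_conservation`, `overlaps`, and `sample_structure`, the dictionary layout, and all numeric values are hypothetical. The iterative probability update and the convergence test are omitted.

```python
import random


def stem_conservation(pair_prob_a, pair_prob_b, align_prob):
    """Toy conservation score for one aligned stem.

    Assumption: the score is the product of the intrasequence base pairing
    probabilities in sequences A and B and the intersequence alignment
    probability. The real scoring function may differ.
    """
    return pair_prob_a * pair_prob_b * align_prob


def overlaps(s, t):
    """Two aligned stems conflict if they reuse a base in either sequence."""
    return bool(s["pos_a"] & t["pos_a"]) or bool(s["pos_b"] & t["pos_b"])


def sample_structure(stems, rng):
    """Sample mutually compatible aligned stems, one at a time.

    Each draw picks a remaining stem with probability proportional to its
    score, then discards every stem that shares a base with the pick.
    Crossing stems are not excluded, so pseudoknots remain possible.
    """
    remaining = list(stems)
    chosen = []
    while remaining:
        weights = [s["score"] for s in remaining]
        if sum(weights) <= 0:
            break
        pick = rng.choices(remaining, weights=weights, k=1)[0]
        chosen.append(pick)
        remaining = [
            s for s in remaining if s is not pick and not overlaps(s, pick)
        ]
    return chosen


# Hypothetical candidate stems: S1 and S2 compete for the same bases,
# while S3 is compatible with both.
stems = [
    {"name": "S1",
     "pos_a": frozenset({1, 2, 3, 10, 11, 12}),
     "pos_b": frozenset({2, 3, 4, 11, 12, 13}),
     "score": stem_conservation(0.9, 0.8, 0.9)},
    {"name": "S2",
     "pos_a": frozenset({3, 4, 5, 8, 9, 10}),
     "pos_b": frozenset({4, 5, 6, 9, 10, 11}),
     "score": stem_conservation(0.4, 0.5, 0.6)},
    {"name": "S3",
     "pos_a": frozenset({15, 16, 17, 22, 23, 24}),
     "pos_b": frozenset({16, 17, 18, 23, 24, 25}),
     "score": stem_conservation(0.7, 0.7, 0.8)},
]

sampled = sample_structure(stems, random.Random(0))
sampled_names = sorted(s["name"] for s in sampled)
print(sampled_names)
```

In this toy input every sampled structure contains S3 plus exactly one of S1 or S2, with S1 favored because of its higher score. In the full method, the sampled structures would feed back into the pairing and alignment probabilities before the next round of sampling.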

AVAILABILITY:

The algorithm is implemented in a C program, RNA Sampler, which is available at http://ural.wustl.edu/software.html

PMID: 17537756
DOI: 10.1093/bioinformatics/btm272
[Indexed for MEDLINE]