Format

Send to

Choose Destination

An Accurate Scalable Template-based Alignment Algorithm.

Author information

1
Institute for Cellular and Molecular Biology The University of Texas at Austin Austin, TX, USA dpgardn@utexas.edu.
2
Texas Advanced Computing Center The University of Texas at Austin Austin, TX, USA xwj@tacc.utexas.edu.
3
Department of Computer Sciences The University of Texas at Austin Austin, TX, USA miranker@cs.utexas.edu.
4
Microsoft Corporation Redmond, WA, USA stuarto@microsoft.com.
5
Center for Computational Biology and Bioinformatics The University of Texas at Austin Austin, TX, USA.

Abstract

The rapid determination of nucleic acid sequences is increasing the number of sequences that are available. Inherent in a template or seed alignment is the culmination of structural and functional constraints that are selecting those mutations that are viable during the evolution of the RNA. While we might not understand these structural and functional, template-based alignment programs utilize the patterns of sequence conservation to encapsulate the characteristics of viable RNA sequences that are aligned properly. We have developed a program that utilizes the different dimensions of information in rCAD, a large RNA informatics resource, to establish a profile for each position in an alignment. The most significant include sequence identity and column composition in different phylogenetic taxa. We have compared our methods with a maximum of eight alternative alignment methods on different sets of 16S and 23S rRNA sequences with sequence percent identities ranging from 50% to 100%. The results showed that CRWAlign outperformed the other alignment methods in both speed and accuracy. A web-based alignment server is available at http://www.rna.ccbb.utexas.edu/SAE/2F/CRWAlign.

KEYWORDS:

RNA sequence alignment; comparative analysis; phylogenetic-based alignment; template-based alignment

Supplemental Content

Full text links

Icon for PubMed Central
Loading ...
Support Center