J Comput Biol. 2009 Jan;16(1):1-18. doi: 10.1089/cmb.2008.0137.

Exact calculation of distributions on integers, with application to sequence alignment.

Author information

  • Center for Bioinformatics, Wadsworth Center, New York State Department of Health, Albany, New York, USA. leen@cs.rpi.edu

Abstract

Computational biology is replete with high-dimensional discrete prediction and inference problems. Dynamic programming recursions can be applied to several of the most important of these, including sequence alignment, RNA secondary-structure prediction, phylogenetic inference, and motif finding. In these problems, attention is frequently focused on some scalar quantity of interest, a score, such as an alignment score or the free energy of an RNA secondary structure. In many cases, the score is naturally defined on the integers, such as a count of the number of pairing differences between two sequence alignments, or else an integer score has been adopted for computational reasons, such as in the test of significance of motif scores. The probability distribution of the score under an appropriate probabilistic model is of interest, for example in tests of significance of motif scores or in the calculation of Bayesian confidence limits around an alignment. Here we present three algorithms for calculating the exact distribution of a score of this type; then, in the context of pairwise local sequence alignments, we apply the approach to find the alignment score distribution and Bayesian confidence limits.
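To illustrate what an exact distribution of an integer score can look like in practice, the following is a minimal sketch of one standard technique: convolving per-position score distributions for a motif scored against an independent background model. It is not one of the three algorithms of the paper, which the abstract does not describe. The function names `score_distribution` and `tail_probability`, the toy two-letter alphabet, and the score values are all assumptions made for illustration.

```python
from collections import defaultdict
from fractions import Fraction


def score_distribution(position_scores, background):
    """Exact distribution of the total integer score of a random sequence.

    position_scores: list of dicts, one per motif position, mapping each
        letter to its integer score at that position (hypothetical input).
    background: dict mapping each letter to its probability; positions are
        assumed independent and identically distributed.
    Returns a dict mapping each reachable total score to its probability.
    """
    dist = {0: Fraction(1)}  # before any position is scored, the total is 0
    for scores in position_scores:
        nxt = defaultdict(Fraction)  # missing keys start at Fraction(0)
        for total, p in dist.items():
            for letter, q in background.items():
                # Convolve the running distribution with this position's scores.
                nxt[total + scores[letter]] += p * q
        dist = dict(nxt)
    return dist


def tail_probability(dist, threshold):
    """P(score >= threshold), the quantity used in a significance test."""
    return sum((p for s, p in dist.items() if s >= threshold), Fraction(0))


# Toy example: two-letter alphabet, two-position motif (assumed values).
background = {"A": Fraction(1, 2), "C": Fraction(1, 2)}
motif = [{"A": 2, "C": -1}, {"A": 0, "C": 1}]
dist = score_distribution(motif, background)
p_value = tail_probability(dist, 2)
```

Because the scores are integers, the number of distinct totals grows at most linearly with motif length, so the table stays small even though the number of sequences grows exponentially. Using `Fraction` keeps the probabilities exact; floating-point values could be substituted when exactness is not needed.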

PMID: 19119992 [PubMed - indexed for MEDLINE]
PMCID: PMC2858568
Free PMC Article