Format

Send to

Choose Destination
See comment in PubMed Commons below
Proc Natl Acad Sci U S A. 1994 May 24;91(11):4625-8.

Rapid and accurate estimates of statistical significance for sequence data base searches.

Author information

1
Department of Mathematics, University of Southern California, Los Angeles 90089-1113.

Abstract

A central question in sequence comparison is the statistical significance of an observed similarity. For local alignment containing gaps to optimize sequence similarity this problem has so far not been solved mathematically. Using as a basis the Chen-Stein theory of Poisson approximation, we present a practical method to approximate the probability that a local alignment score is a result of chance alone. For a set of similarity scores and gap penalties only one simulation of random alignments needs to be calculated to derive the key information allowing us to estimate the significance of any alignment calculated under this setting. We present applications to data base searching and the analysis of pairwise and self-comparisons of proteins.

PMID:
8197109
PMCID:
PMC43840
[Indexed for MEDLINE]
Free PMC Article
PubMed Commons home

PubMed Commons

0 comments

    Supplemental Content

    Full text links

    Icon for HighWire Icon for PubMed Central
    Loading ...
    Support Center