Format

Send to

Choose Destination
See comment in PubMed Commons below
Protein Sci. 1995 Aug;4(8):1587-95.

Finding flexible patterns in unaligned protein sequences.

Author information

1
Department of Informatics, University of Bergen, HIB, Norway.

Abstract

We present a new method for the identification of conserved patterns in a set of unaligned related protein sequences. It is able to discover patterns of a quite general form, allowing for both ambiguous positions and for variable length wildcard regions. It allows the user to define a class of patterns (e.g., the degree of ambiguity allowed and the length and number of gaps), and the method is then guaranteed to find the conserved patterns in this class scoring highest according to a significance measure defined. Identified patterns may be refined using one of two new algorithms. We present a new (nonstatistical) significance measure for flexible patterns. The method is shown to recover known motifs for PROSITE families and is also applied to some recently described families from the literature.

PMID:
8520485
PMCID:
PMC2143188
DOI:
10.1002/pro.5560040817
[Indexed for MEDLINE]
Free PMC Article
PubMed Commons home

PubMed Commons

0 comments
How to join PubMed Commons

    Supplemental Content

    Full text links

    Icon for Wiley Icon for PubMed Central
    Loading ...
    Support Center