Display Settings:

Format

Send to:

Choose Destination
    Protein Sci. 1997 Mar;6(3):698-705.

    Embedding strategies for effective use of information from multiple sequence alignments.

    Source

    Howard Hughes Medical Institute, Basic Sciences Division, Fred Hutchinson Cancer Research Center, Seattle, Washington 98104, USA. henikoff@howard.fhcrc.org

    Abstract

    We describe a new strategy for utilizing multiple sequence alignment information to detect distant relationships in searches of sequence databases. A single sequence representing a protein family is enriched by replacing conserved regions with position-specific scoring matrices (PSSMs) or consensus residues derived from multiple alignments of family members. In comprehensive tests of these and other family representations, PSSM-embedded queries produced the best results overall when used with a special version of the Smith-Waterman searching algorithm. Moreover, embedding consensus residues instead of PSSMs improved performance with readily available single sequence query searching programs, such as BLAST and FASTA. Embedding PSSMs or consensus residues into a representative sequence improves searching performance by extracting multiple alignment information from motif regions while retaining single sequence information where alignment is uncertain.

    PMID:
    9070452
    [PubMed - indexed for MEDLINE]
    PMCID:
    PMC2143675
    Free PMC Article

      Supplemental Content

      Icon for John Wiley & Sons, Inc. Icon for PubMed Central

      Save items

      loading

      Recent activity

      Your browsing activity is empty.

      Activity recording is turned off.

      Turn recording back on

      See more...
      Write to the Help Desk