IEEE/ACM Trans Comput Biol Bioinform. 2011 Mar-Apr;8(2):441-51. doi: 10.1109/TCBB.2009.82.

Discriminative motif finding for predicting protein subcellular localization.

Author information

  • Carnegie Mellon University, Pittsburgh, PA 15213, USA. thlin@cs.cmu.edu

Abstract

Many methods have been described to predict the subcellular location of proteins from sequence information. However, most of these methods either rely on global sequence properties or use a set of known protein targeting motifs to predict protein localization. Here, we develop and test a novel method that identifies potential targeting motifs using a discriminative approach based on hidden Markov models (discriminative HMMs). These models search for motifs that are present in one compartment but absent in other, nearby compartments by utilizing a hierarchical structure that mimics the protein sorting mechanism. We show that both discriminative motif finding and the hierarchical structure improve localization prediction on a benchmark data set of yeast proteins. The identified motifs can be mapped to known targeting motifs, and they are more conserved than the average protein sequence. Using our motif-based predictions, we can identify potential errors in the location annotations of some proteins in public databases. A software implementation and the data set described in this paper are available from http://murphylab.web.cmu.edu/software/2009_TCBB_motif/.
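
The discriminative idea in the abstract (motifs present in one compartment but absent in nearby ones) can be illustrated with a toy sketch. This is not the discriminative HMM method of the paper: it swaps in plain k-mer counting as a stand-in, and the function names, the scoring rule, and the sequences are all hypothetical, invented for illustration only.

```python
from collections import Counter


def kmer_presence(sequences, k):
    """For each k-mer, count how many sequences contain it at least once."""
    counts = Counter()
    for seq in sequences:
        # A set ensures each sequence contributes at most once per k-mer.
        counts.update({seq[i:i + k] for i in range(len(seq) - k + 1)})
    return counts


def discriminative_kmers(target, siblings, k=4, top=3):
    """Rank k-mers by how much more often they occur in the target
    compartment than in the sibling (nearby) compartments.

    Score = fraction of target sequences containing the k-mer
            minus fraction of sibling sequences containing it.
    This simple difference is an assumption of the sketch, not the
    objective used in the paper.
    """
    pos = kmer_presence(target, k)
    neg = kmer_presence(siblings, k)
    scores = {
        kmer: pos[kmer] / len(target) - neg[kmer] / len(siblings)
        for kmer in pos
    }
    # Highest score first; ties broken alphabetically for determinism.
    return sorted(scores.items(), key=lambda kv: (-kv[1], kv[0]))[:top]


# Made-up toy sequences, not taken from the paper's yeast data set.
nuclear = ["MAKKRKVSG", "GSPKKRKLE", "TTKKRKAAD"]
cytosolic = ["MASGLLEDT", "GSTLLEAAD", "MTTSGAEDL"]

print(discriminative_kmers(nuclear, cytosolic, k=4, top=1))
# prints [('KKRK', 1.0)]
```

In a hierarchical setting of the kind the abstract describes, the `siblings` argument would hold only the sequences from compartments adjacent to the target in the sorting hierarchy, rather than all other proteins.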

PMID: 21233524 [PubMed - indexed for MEDLINE]
PMCID: PMC3050600 (free PMC article)
