Format

Send to

Choose Destination
J Comput Biol. 2009 May;16(5):639-57. doi: 10.1089/cmb.2008.0176.

Conditional graphical models for protein structural motif recognition.

Author information

1
IBM T.J. Watson Research Center, Yorktown Heights, New York 10598, USA. liuya@us.ibm.com

Abstract

Determining protein structures is crucial to understanding the mechanisms of infection and designing drugs. However, the elucidation of protein folds by crystallographic experiments can be a bottleneck in the development process. In this article, we present a probabilistic graphical model framework, conditional graphical models, for predicting protein structural motifs. It represents the structure characteristics of a structural motif using a graph, where the nodes denote the secondary structure elements, and the edges indicate the side-chain interactions between the components either within one protein chain or between chains. Then the model defines the optimal segmentation of a protein sequence against the graph by maximizing its "conditional" probability so that it can take advantages of the discriminative training approach. Efficient approximate inference algorithms using reversible jump Markov Chain Monte Carlo (MCMC) algorithm are developed to handle the resulting complex graphical models. We test our algorithm on four important structural motifs, and our method outperforms other state-of-art algorithms for motif recognition. We also hypothesize potential membership proteins of target folds from Swiss-Prot, which further supports the evolutionary hypothesis about viral folds.

PMID:
19432536
DOI:
10.1089/cmb.2008.0176
[Indexed for MEDLINE]

Supplemental Content

Loading ...
Support Center