Format

Send to

Choose Destination
Bioinformatics. 2012 Sep 15;28(18):i416-i422. doi: 10.1093/bioinformatics/bts416.

Multiple instance learning of Calmodulin binding sites.

Author information

1
Department of Computer Science, Colorado State University, Fort Collins, CO 80523-1873, USA.

Abstract

MOTIVATION:

Calmodulin (CaM) is a ubiquitously conserved protein that acts as a calcium sensor, and interacts with a large number of proteins. Detection of CaM binding proteins and their interaction sites experimentally requires a significant effort, so accurate methods for their prediction are important.

RESULTS:

We present a novel algorithm (MI-1 SVM) for binding site prediction and evaluate its performance on a set of CaM-binding proteins extracted from the Calmodulin Target Database. Our approach directly models the problem of binding site prediction as a large-margin classification problem, and is able to take into account uncertainty in binding site location. We show that the proposed algorithm performs better than the standard SVM formulation, and illustrate its ability to recover known CaM binding motifs. A highly accurate cascaded classification approach using the proposed binding site prediction method to predict CaM binding proteins in Arabidopsis thaliana is also presented.

AVAILABILITY:

Matlab code for training MI-1 SVM and the cascaded classification approach is available on request.

CONTACT:

fayyazafsar@gmail.com or asa@cs.colostate.edu.

PMID:
22962461
PMCID:
PMC3436843
DOI:
10.1093/bioinformatics/bts416
[Indexed for MEDLINE]
Free PMC Article

Supplemental Content

Full text links

Icon for Silverchair Information Systems Icon for PubMed Central
Loading ...
Support Center