Format

Send to

Choose Destination
PLoS Comput Biol. 2008 Aug 8;4(8):e1000146. doi: 10.1371/journal.pcbi.1000146.

Classifying RNA-binding proteins based on electrostatic properties.

Author information

1
Faculty of Biology, Technion-Israel Institute of Technology, Haifa, Israel.

Erratum in

  • PLoS Comput Biol. 2008 Aug;4(8). doi: 10.1371/annotation/be4eb7dd-4092-49c1-a619-e8eeba40ed67.

Abstract

Protein structure can provide new insight into the biological function of a protein and can enable the design of better experiments to learn its biological roles. Moreover, deciphering the interactions of a protein with other molecules can contribute to the understanding of the protein's function within cellular processes. In this study, we apply a machine learning approach for classifying RNA-binding proteins based on their three-dimensional structures. The method is based on characterizing unique properties of electrostatic patches on the protein surface. Using an ensemble of general protein features and specific properties extracted from the electrostatic patches, we have trained a support vector machine (SVM) to distinguish RNA-binding proteins from other positively charged proteins that do not bind nucleic acids. Specifically, the method was applied on proteins possessing the RNA recognition motif (RRM) and successfully classified RNA-binding proteins from RRM domains involved in protein-protein interactions. Overall the method achieves 88% accuracy in classifying RNA-binding proteins, yet it cannot distinguish RNA from DNA binding proteins. Nevertheless, by applying a multiclass SVM approach we were able to classify the RNA-binding proteins based on their RNA targets, specifically, whether they bind a ribosomal RNA (rRNA), a transfer RNA (tRNA), or messenger RNA (mRNA). Finally, we present here an innovative approach that does not rely on sequence or structural homology and could be applied to identify novel RNA-binding proteins with unique folds and/or binding motifs.

PMID:
18716674
PMCID:
PMC2518515
DOI:
10.1371/journal.pcbi.1000146
[Indexed for MEDLINE]
Free PMC Article

Supplemental Content

Full text links

Icon for Public Library of Science Icon for PubMed Central
Loading ...
Support Center