Annotating the protein-RNA interaction sites in proteins using evolutionary information and protein backbone structure

J Theor Biol. 2012 Nov 7:312:55-64. doi: 10.1016/j.jtbi.2012.07.020. Epub 2012 Jul 31.

Abstract

RNA-protein interactions play important roles in various biological processes. The precise detection of RNA-protein interaction sites is very important for understanding essential biological processes and annotating the function of the proteins. In this study, based on various features from amino acid sequence and structure, including evolutionary information, solvent accessible surface area and torsion angles (φ, ψ) in the backbone structure of the polypeptide chain, a computational method for predicting RNA-binding sites in proteins is proposed. When the method is applied to predict RNA-binding sites in three datasets: RBP86 containing 86 protein chains, RBP107 containing 107 proteins chains and RBP109 containing 109 proteins chains, better sensitivities and specificities are obtained compared to previously published methods in five-fold cross-validation tests. In order to make further examination for the efficiency of our method, the RBP107 dataset is used as training set, RBP86 and RBP109 datasets are used as the independent test sets. In addition, as examples of our prediction, RNA-binding sites in a few proteins are presented. The annotated results are consistent with the PDB annotation. These results show that our method is useful for annotating RNA binding sites of novel proteins.

Keywords: PSSM profile; RNA-binding sites; Solvent accessible surface area; Support vector machine; Torsion angles (φ, ψ).

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Evolution, Molecular*
  • Molecular Sequence Annotation / methods*
  • RNA / chemistry*
  • RNA / genetics*
  • RNA-Binding Proteins / chemistry*
  • RNA-Binding Proteins / genetics*

Substances

  • RNA-Binding Proteins
  • RNA