Format

Send to

Choose Destination
Bioinformatics. 2016 Oct 1;32(19):2936-46. doi: 10.1093/bioinformatics/btw361. Epub 2016 Jun 17.

STRUM: structure-based prediction of protein stability changes upon single-point mutation.

Author information

1
School of Computer Science and Technology, Soochow University, Suzhou, Jiangsu 215006, China Department of Computational Medicine and Bioinformatics, 100 Washtenaw Avenue, Ann Arbor, MI 48109, USA.
2
School of Computer Science and Technology, Soochow University, Suzhou, Jiangsu 215006, China Collaborative Innovation Center of Novel Software Technology and Industrialization, Nanjing, Jiangsu, China.
3
Department of Computational Medicine and Bioinformatics, 100 Washtenaw Avenue, Ann Arbor, MI 48109, USA Department of Biological Chemistry, 100 Washtenaw Avenue, Ann Arbor, MI 48109, USA.

Abstract

MOTIVATION:

Mutations in human genome are mainly through single nucleotide polymorphism, some of which can affect stability and function of proteins, causing human diseases. Several methods have been proposed to predict the effect of mutations on protein stability; but most require features from experimental structure. Given the fast progress in protein structure prediction, this work explores the possibility to improve the mutation-induced stability change prediction using low-resolution structure modeling.

RESULTS:

We developed a new method (STRUM) for predicting stability change caused by single-point mutations. Starting from wild-type sequences, 3D models are constructed by the iterative threading assembly refinement (I-TASSER) simulations, where physics- and knowledge-based energy functions are derived on the I-TASSER models and used to train STRUM models through gradient boosting regression. STRUM was assessed by 5-fold cross validation on 3421 experimentally determined mutations from 150 proteins. The Pearson correlation coefficient (PCC) between predicted and measured changes of Gibbs free-energy gap, ΔΔG, upon mutation reaches 0.79 with a root-mean-square error 1.2 kcal/mol in the mutation-based cross-validations. The PCC reduces if separating training and test mutations from non-homologous proteins, which reflects inherent correlations in the current mutation sample. Nevertheless, the results significantly outperform other state-of-the-art methods, including those built on experimental protein structures. Detailed analyses show that the most sensitive features in STRUM are the physics-based energy terms on I-TASSER models and the conservation scores from multiple-threading template alignments. However, the ΔΔG prediction accuracy has only a marginal dependence on the accuracy of protein structure models as long as the global fold is correct. These data demonstrate the feasibility to use low-resolution structure modeling for high-accuracy stability change prediction upon point mutations.

AVAILABILITY AND IMPLEMENTATION:

http://zhanglab.ccmb.med.umich.edu/STRUM/ CONTACT: qiang@suda.edu.cn and zhng@umich.edu

SUPPLEMENTARY INFORMATION:

Supplementary data are available at Bioinformatics online.

PMID:
27318206
PMCID:
PMC5039926
DOI:
10.1093/bioinformatics/btw361
[Indexed for MEDLINE]
Free PMC Article

Supplemental Content

Full text links

Icon for Silverchair Information Systems Icon for PubMed Central
Loading ...
Support Center