Format

Send to

Choose Destination
J Theor Biol. 2018 Aug 7;450:86-103. doi: 10.1016/j.jtbi.2018.04.026. Epub 2018 Apr 18.

Predicting protein submitochondrial locations by incorporating the pseudo-position specific scoring matrix into the general Chou's pseudo-amino acid composition.

Author information

1
College of Mathematics and Physics, Qingdao University of Science and Technology, Qingdao 266061, China; Artificial Intelligence and Biomedical Big Data Research Center, Qingdao University of Science and Technology, Qingdao 266061, China.
2
College of Information Science and Technology, Qingdao University of Science and Technology, Qingdao 266061, China.
3
Shandong Province Key Laboratory of Wisdom Mining Information Technology, Shandong University of Science and Technology, Qingdao 266590, China. Electronic address: yjpeng@sdust.edu.cn.
4
College of Mathematics and Physics, Qingdao University of Science and Technology, Qingdao 266061, China; Artificial Intelligence and Biomedical Big Data Research Center, Qingdao University of Science and Technology, Qingdao 266061, China. Electronic address: yubin@qust.edu.cn.

Abstract

Mitochondrion is important organelle of most eukaryotes and play an important role in participating in various life activities of cells. However, some functions of mitochondria can only be achieved in specific submitochondrial location, the study of submitochondrial locations will help to further understand the biological function of protein, which is a hotspot in proteomics research. In this paper, we propose a new method for protein submitochondrial locations prediction. Firstly, the features of protein sequence are extracted by combining Chou's pseudo-amino acid composition (PseAAC) and pseudo-position specific scoring matrix (PsePSSM). Then the extracted feature information is denoised by two-dimensional (2-D) wavelet denoising. Finally, the optimal feature vectors are input to the SVM classifier to predict the protein submitochondrial locations. We obtained the ideal prediction results by jackknife test and compared with other prediction methods. The results indicate that the proposed method is significantly better than the existing research results, which can provide a new method to predict protein locations in other organelles. The source code and all datasets are available at https://github.com/QUST-BSBRC/PseAAC-PsePSSM-WD/ for academic use.

KEYWORDS:

Machine learning; Pseudo-amino acid composition; Pseudo-position specific scoring matrix; Submitochondrial locations; Support vector machine; Two-dimensional wavelet denoising

PMID:
29678694
DOI:
10.1016/j.jtbi.2018.04.026

Supplemental Content

Full text links

Icon for Elsevier Science
Loading ...
Support Center