Format

Send to

Choose Destination
Hum Mutat. 2019 Sep;40(9):1235-1242. doi: 10.1002/humu.23785. Epub 2019 Jun 18.

Predicting the change of exon splicing caused by genetic variant using support vector regression.

Author information

1
School of Data and Computer Science, Sun Yat-sen University, Guangzhou, China.
2
Sun Yat-sen Memorial Hospital, Sun Yat-sen University, Guangzhou, China.
3
Key Laboratory of Machine Intelligence and Advanced Computing (Sun Yat-sen University), Ministry of Education, Guangzhou, China.

Abstract

Alternative splicing can be disrupted by genetic variants that are related to diseases like cancers. Discovering the influence of genetic variations on the alternative splicing will improve the understanding of the pathogenesis of variants. Here, we developed a new approach, PredPSI-SVR to predict the impact of variants on exon skipping events by using the support vector regression. From the sequence of a particular exon and its flanking regions, 42 comprehensive features related to splicing events were extracted. By using a greedy feature selection algorithm, we found eight features contributing most to the prediction. The trained model achieved a Pearson correlation coefficient (PCC) of 0.570 in the 10-fold cross-validation based on the training data set provided by the "vex-seq" challenge of the 5th Critical Assessment of Genome Interpretation. In the blind test also held by the challenge, our prediction ranked the 2nd with a PCC of 0.566 that demonstrates the robustness of our method. A further test indicated that the PredPSI-SVR is helpful in prioritizing deleterious synonymous mutations. The method is available on https://github.com/chenkenbio/PredPSI-SVR.

KEYWORDS:

Vex-seq; alternative splicing; splice site motif; support vector machine; synonymous mutation

PMID:
31070294
PMCID:
PMC6744342
[Available on 2020-09-01]
DOI:
10.1002/humu.23785

Supplemental Content

Full text links

Icon for Wiley
Loading ...
Support Center