Format

Send to

Choose Destination
Bioinformatics. 2016 Oct 1;32(19):2947-55. doi: 10.1093/bioinformatics/btw362. Epub 2016 Jun 17.

The structural effects of mutations can aid in differential phenotype prediction of beta-myosin heavy chain (Myosin-7) missense variants.

Author information

1
Institute of Structural and Molecular Biology, Division of Biosciences, University College London, London WC1E 6BT, UK.
2
Institute of Cardiovascular Science, UCL, London, UK.
3
Complejo Hospitalario Universitario de A Coruña, Insituto de Investigación Biomédica, Coruña, Spain.

Abstract

MOTIVATION:

High-throughput sequencing platforms are increasingly used to screen patients with genetic disease for pathogenic mutations, but prediction of the effects of mutations remains challenging. Previously we developed SAAPdap (Single Amino Acid Polymorphism Data Analysis Pipeline) and SAAPpred (Single Amino Acid Polymorphism Predictor) that use a combination of rule-based structural measures to predict whether a missense genetic variant is pathogenic. Here we investigate whether the same methodology can be used to develop a differential phenotype predictor, which, once a mutation has been predicted as pathogenic, is able to distinguish between phenotypes-in this case the two major clinical phenotypes (hypertrophic cardiomyopathy, HCM and dilated cardiomyopathy, DCM) associated with mutations in the beta-myosin heavy chain (MYH7) gene product (Myosin-7).

RESULTS:

A random forest predictor trained on rule-based structural analyses together with structural clustering data gave a Matthews' correlation coefficient (MCC) of 0.53 (accuracy, 75%). A post hoc removal of machine learning models that performed particularly badly, increased the performance (MCC = 0.61, Acc = 79%). This proof of concept suggests that methods used for pathogenicity prediction can be extended for use in differential phenotype prediction.

AVAILABILITY AND IMPLEMENTATION:

Analyses were implemented in Perl and C and used the Java-based Weka machine learning environment. Please contact the authors for availability.

CONTACTS:

andrew@bioinf.org.uk or andrew.martin@ucl.ac.uk

SUPPLEMENTARY INFORMATION:

Supplementary data are available at Bioinformatics online.

PMID:
27318203
DOI:
10.1093/bioinformatics/btw362
[Indexed for MEDLINE]

Supplemental Content

Full text links

Icon for Silverchair Information Systems
Loading ...
Support Center