Format

Send to

Choose Destination
Mol Biol Evol. 2013 Jun;30(6):1252-7. doi: 10.1093/molbev/mst037. Epub 2013 Mar 5.

Evolutionary balancing is critical for correctly forecasting disease-associated amino acid variants.

Author information

1
Center for Evolutionary Medicine and Informatics, Biodesign Institute, Arizona State University, USA.

Abstract

Computational predictions have become indispensable for evaluating the disease-related impact of nonsynonymous single-nucleotide variants discovered in exome sequencing. Many such methods have their roots in molecular evolution, as they use information derived from multiple sequence alignments. We show that the performance of current methods (e.g., PolyPhen-2 and SIFT) is improved significantly by optimizing their statistical models on evolutionarily balanced training data, where equal numbers of positive and negative controls within each evolutionary conservation class are used. Evolutionary balancing significantly reduces the false-positive rates for variants observed at highly conserved sites and false-negative rates for variants observed at fast evolving sites. Use of these improved methods enables more accurate forecasting when concordant diagnosis from multiple methods is regarded as a more reliable indicator of the prediction. Applied to a large exome variation data set, we find that the current methods produce concordant predictions for less than half of the population variants. These advances are implemented in a web resource for use in practical applications (www.mypeg.info, last accessed March 13, 2013).

KEYWORDS:

computational prediction; evolutionary medicine; nonsynonymous single nucleotide variant

PMID:
23462317
PMCID:
PMC3649672
DOI:
10.1093/molbev/mst037
[Indexed for MEDLINE]
Free PMC Article

Supplemental Content

Full text links

Icon for Silverchair Information Systems Icon for PubMed Central
Loading ...
Support Center