Motivation: It is hoped that advances in our knowledge in disease genomics will contribute to personalized medicine such as individualized preventive strategies or early diagnoses of diseases. With the growth of genome-wide association studies (GWAS) in the past decade, how far have we reached this goal? In this study we explored the predictive ability of polygenic risk scores (PRSs) derived from GWAS for a range of complex disease and traits.
Results: We first proposed a new approach to evaluate predictive performances of PRS at arbitrary P -value thresholds. The method was based on corrected estimates of effect sizes, accounting for possible false positives and selection bias. This approach requires no distributional assumptions and only requires summary statistics as input. The validity of the approach was verified in simulations. We explored the predictive power of PRS for ten complex traits, including type 2 diabetes (DM), coronary artery disease (CAD), triglycerides, high- and low-density lipoprotein, total cholesterol, schizophrenia (SCZ), bipolar disorder (BD), major depressive disorder and anxiety disorders. We found that the predictive ability of PRS for CAD and DM were modest (best AUC = 0.608 and 0.607) while for lipid traits the prediction R-squared ranged from 16.1 to 29.8%. For psychiatric disorders, the predictive power for SCZ was estimated to be the highest (best AUC 0.820), followed by BD. Predictive performance of other psychiatric disorders ranged from 0.543 to 0.585. Psychiatric traits tend to have more gradual rise in AUC when significance thresholds increase and achieve the best predictive power at higher P -values than cardiometabolic traits.
Contact: hcso@cuhk.edu.hk ; pcsham@hku.hk.
Supplementary information: Supplementary data are available at Bioinformatics online.
© The Author 2016. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com