Exploring the predictive power of polygenic scores derived from genome-wide association studies: a study of 10 complex traits

Bioinformatics. 2017 Mar 15;33(6):886-892. doi: 10.1093/bioinformatics/btw745.

Abstract

Motivation: It is hoped that advances in our knowledge in disease genomics will contribute to personalized medicine such as individualized preventive strategies or early diagnoses of diseases. With the growth of genome-wide association studies (GWAS) in the past decade, how far have we reached this goal? In this study we explored the predictive ability of polygenic risk scores (PRSs) derived from GWAS for a range of complex disease and traits.

Results: We first proposed a new approach to evaluate predictive performances of PRS at arbitrary P -value thresholds. The method was based on corrected estimates of effect sizes, accounting for possible false positives and selection bias. This approach requires no distributional assumptions and only requires summary statistics as input. The validity of the approach was verified in simulations. We explored the predictive power of PRS for ten complex traits, including type 2 diabetes (DM), coronary artery disease (CAD), triglycerides, high- and low-density lipoprotein, total cholesterol, schizophrenia (SCZ), bipolar disorder (BD), major depressive disorder and anxiety disorders. We found that the predictive ability of PRS for CAD and DM were modest (best AUC = 0.608 and 0.607) while for lipid traits the prediction R-squared ranged from 16.1 to 29.8%. For psychiatric disorders, the predictive power for SCZ was estimated to be the highest (best AUC 0.820), followed by BD. Predictive performance of other psychiatric disorders ranged from 0.543 to 0.585. Psychiatric traits tend to have more gradual rise in AUC when significance thresholds increase and achieve the best predictive power at higher P -values than cardiometabolic traits.

Contact: hcso@cuhk.edu.hk ; pcsham@hku.hk.

Supplementary information: Supplementary data are available at Bioinformatics online.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Bipolar Disorder / diagnosis
  • Bipolar Disorder / genetics
  • Coronary Artery Disease / diagnosis
  • Coronary Artery Disease / genetics
  • Depressive Disorder, Major / diagnosis
  • Depressive Disorder, Major / genetics
  • Diabetes Mellitus, Type 2 / diagnosis
  • Diabetes Mellitus, Type 2 / genetics
  • Genetic Predisposition to Disease*
  • Genome-Wide Association Study / methods*
  • Genomics / methods*
  • Humans
  • Multifactorial Inheritance*
  • Polymorphism, Single Nucleotide
  • Risk
  • Schizophrenia / diagnosis
  • Schizophrenia / genetics