Comparison between linear and non-parametric regression models for genome-enabled prediction in wheat

G3 (Bethesda). 2012 Dec;2(12):1595-605. doi: 10.1534/g3.112.003665. Epub 2012 Dec 1.

Abstract

In genome-enabled prediction, parametric, semi-parametric, and non-parametric regression models have been used. This study assessed the predictive ability of linear and non-linear models using dense molecular markers. The linear models were linear on marker effects and included the Bayesian LASSO, Bayesian ridge regression, Bayes A, and Bayes B. The non-linear models (this refers to non-linearity on markers) were reproducing kernel Hilbert space (RKHS) regression, Bayesian regularized neural networks (BRNN), and radial basis function neural networks (RBFNN). These statistical models were compared using 306 elite wheat lines from CIMMYT genotyped with 1717 diversity array technology (DArT) markers and two traits, days to heading (DTH) and grain yield (GY), measured in each of 12 environments. It was found that the three non-linear models had better overall prediction accuracy than the linear regression specification. Results showed a consistent superiority of RKHS and RBFNN over the Bayesian LASSO, Bayesian ridge regression, Bayes A, and Bayes B models.

Keywords: GenPred; Shared data resources.

Publication types

  • Comparative Study
  • Research Support, Non-U.S. Gov't

MeSH terms

  • Bayes Theorem
  • Genome, Plant*
  • Genotype
  • Linear Models*
  • Neural Networks, Computer
  • Nonlinear Dynamics*
  • Phenotype
  • Triticum / genetics*