Format

Send to

Choose Destination
Genet Sel Evol. 2015 May 6;47:38. doi: 10.1186/s12711-015-0116-6.

Optimization of genomic selection training populations with a genetic algorithm.

Author information

1
Department of Plant Breeding & Genetics, Cornell University, Ithaca, NY, USA. da346@cornell.edu.
2
Department of Plant Breeding & Genetics, Cornell University, Ithaca, NY, USA. ji66@cornell.edu.
3
Robert W. Holley Center for Agriculture and Health, USDA-ARS, Ithaca, NY, USA. jeanluc.jannink@ars.usda.gov.

Abstract

In this article, we imagine a breeding scenario with a population of individuals that have been genotyped but not phenotyped. We derived a computationally efficient statistic that uses this genetic information to measure the reliability of genomic estimated breeding values (GEBV) for a given set of individuals (test set) based on a training set of individuals. We used this reliability measure with a genetic algorithm scheme to find an optimized training set from a larger set of candidate individuals. This subset was phenotyped to create the training set that was used in a genomic selection model to estimate GEBV in the test set. Our results show that, compared to a random sample of the same size, the use of a set of individuals selected by our method improved accuracies. We implemented the proposed training selection methodology on four sets of data on Arabidopsis, wheat, rice and maize. This dynamic model building process that takes genotypes of the individuals in the test sample into account while selecting the training individuals improves the performance of genomic selection models.

PMID:
25943105
PMCID:
PMC4422310
DOI:
10.1186/s12711-015-0116-6
[Indexed for MEDLINE]
Free PMC Article

Supplemental Content

Full text links

Icon for BioMed Central Icon for PubMed Central
Loading ...
Support Center