Display Settings:

Format

Send to:

Choose Destination
See comment in PubMed Commons below
Genetics. 2009 Dec;183(4):1545-53. doi: 10.1534/genetics.109.104935. Epub 2009 Oct 12.

Reliability of genomic predictions across multiple populations.

Author information

  • 1Biosciences Research Division, Department of Primary Industries Victoria, University of Melbourne, Bundoora 3083, Australia. sander.de.roos@crv4all.com

Abstract

Genomic prediction of future phenotypes or genetic merit using dense SNP genotypes can be used for prediction of disease risk, forensics, and genomic selection of livestock and domesticated plant species. The reliability of genomic predictions is their squared correlation with the true genetic merit and indicates the proportion of the genetic variance that is explained. As reliability relies heavily on the number of phenotypes, combining data sets from multiple populations may be attractive as a way to increase reliabilities, particularly when phenotypes are scarce. However, this strategy may also decrease reliabilities if the marker effects are very different between the populations. The effect of combining multiple populations on the reliability of genomic predictions was assessed for two simulated cattle populations, A and B, that had diverged for T = 6, 30, or 300 generations. The training set comprised phenotypes of 1000 individuals from population A and 0, 300, 600, or 1000 individuals from population B, while marker density and trait heritability were varied. Adding individuals from population B to the training set increased the reliability in population A by up to 0.12 when the marker density was high and T = 6, whereas it decreased the reliability in population A by up to 0.07 when the marker density was low and T = 300. Without individuals from population B in the training set, the reliability in population B was up to 0.77 lower than in population A, especially for large T. Adding individuals from population B to the training set increased the reliability in population B to close to the same level as in population A when the marker density was sufficiently high for the marker-QTL linkage disequilibrium to persist across populations. Our results suggest that the most accurate genomic predictions are achieved when phenotypes from all populations are combined in one training set, while for more diverged populations a higher marker density is required.

PMID:
19822733
[PubMed - indexed for MEDLINE]
PMCID:
PMC2787438
Free PMC Article

Images from this publication.See all images (8)Free text

F igure  1.—
F igure  2.—
F igure  3.—
F igure  4.—
F igure  5.—
F igure  6.—
F igure  7.—
F igure  8.—
PubMed Commons home

PubMed Commons

0 comments
How to join PubMed Commons

    Supplemental Content

    Full text links

    Icon for HighWire Icon for PubMed Central
    Loading ...
    Write to the Help Desk