Format

Send to

Choose Destination
Bioessays. 2013 Sep;35(9):780-6. doi: 10.1002/bies.201300014. Epub 2013 Jul 9.

SNP ascertainment bias in population genetic analyses: why it is important, and how to correct it.

Author information

1
Departments of Biology and Genetics, University of Pennsylvania, Philadelphia, PA, USA. lachance.joseph@gmail.com

Abstract

Whole genome sequencing and SNP genotyping arrays can paint strikingly different pictures of demographic history and natural selection. This is because genotyping arrays contain biased sets of pre-ascertained SNPs. In this short review, we use comparisons between high-coverage whole genome sequences of African hunter-gatherers and data from genotyping arrays to highlight how SNP ascertainment bias distorts population genetic inferences. Sample sizes and the populations in which SNPs are discovered affect the characteristics of observed variants. We find that SNPs on genotyping arrays tend to be older and present in multiple populations. In addition, genotyping arrays cause allele frequency distributions to be shifted towards intermediate frequency alleles, and estimates of linkage disequilibrium are modified. Since population genetic analyses depend on allele frequencies, it is imperative that researchers are aware of the effects of SNP ascertainment bias. With this in mind, we describe multiple ways to correct for SNP ascertainment bias.

KEYWORDS:

African hunter-gatherers; SNP ascertainment bias; human genetics; population genetics; whole genome sequencing

PMID:
23836388
PMCID:
PMC3849385
DOI:
10.1002/bies.201300014
[Indexed for MEDLINE]
Free PMC Article

Supplemental Content

Full text links

Icon for Wiley Icon for PubMed Central
Loading ...
Support Center