Obtaining unbiased estimates of tagging SNP performance

Ann Hum Genet. 2006 Mar;70(Pt 2):254-61. doi: 10.1111/j.1529-8817.2005.00212.x.

Abstract

The use of tagging SNPs (tSNPs) as a cost-effective means of capturing genetic diversity is widespread. However, the quality of the tSNPs selected is dependent on the initial sample in which they are characterized. If the initial marker set is too sparse the tSNPs chosen will capture less information than a naïve analysis suggests. A simple method has been proposed that should provide a better estimate of the performance of tSNPs. It is shown here that this approach is both unbiased and accurate, even for small numbers of typed markers. The effect of unknown phase is also investigated and it is shown that, excepting very small samples, this has little effect on the accuracy of the method.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Humans
  • Polymorphism, Single Nucleotide*