Five common SNP-based NR2E1 haplotypes account for the majority of chromosomes examined for global diversity. (a) The SNP-based haplotypes for both chromosomes of every individual are illustrated. Each row represents one chromosome. Each column represents one variable site, the number of which is indicated above each column (see Fig. 1b). Black boxes indicate the major allele; white boxes represent the minor allele. Coriell Cell Repositories ID codes are indicated. ‘SNP’ refers to single nucleotide polymorphism (i.e. single nucleotide substitutions with minor allele frequencies ≥1%). ‘Prob’ is the probability of haplotype assignment, where 1.00 = 100% probable (i.e. individual is either homozygous at all sites or a heterozyote for only one site). MB, Mbuti; BK, Biaka; AA, African-American; CH, Cheyenne; MA, Mayan; QU, Quechua; KA, Karitiana; IP, Indo-Pakistani; CN, Chinese; JA, Japanese; IT, Italian; RU, Russian; NE, Northern European; IC, Icelandic; AJ, Ashkenazi Jewish; DA, Druze Arab; PA, Pacific Islanders; ME, Melanesian. (b) Estimated population haplotype frequencies of the 13 most frequent SNP-based NR2E1 haplotypes. ‘−’ indicates that the haplotype is absent from the population. ‘1’ and ‘0’ represent present and absence of TC indel, respectively. (c) The frequency (y-axis) of the CA-repeat allele (x-axis) with the five most common NR2E1 haplotypes (z-axis) is plotted for the global diversity population.