Logo of ajhgLink to Publisher's site
Am J Hum Genet. 2007 Dec; 81(6): 1119–1132.
Published online 2007 Oct 15. doi:  10.1086/522235
PMCID: PMC2276347

A Genomewide Association Study of Skin Pigmentation in a South Asian Population


We have conducted a multistage genomewide association study, using 1,620,742 single-nucleotide polymorphisms to systematically investigate the genetic factors influencing intrinsic skin pigmentation in a population of South Asian descent. Polymorphisms in three genes—SLC24A5, TYR, and SLC45A2—yielded highly significant replicated associations with skin-reflectance measurements, an indirect measure of melanin content in the skin. The associations detected in these three genes, in an additive manner, collectively account for a large fraction of the natural variation of skin pigmentation in a South Asian population. Our study is the first to interrogate polymorphisms across the genome, to find genetic determinants of the natural variation of skin pigmentation within a human population.

Humans possess an impressive range of skin pigmentation, both within and between populations. This diversity is highly correlated with geographical location, indicating that environmental factors as well as genetics strongly influence skin color. The predominant environmental variable affecting skin pigmentation is sunlight, and it is certain that skin pigments play an important role in both protecting DNA from the effects of UV irradiation14 and influencing the availability of UV radiation for the synthesis of necessary compounds, such as vitamin D.57 Epidemiological studies in humans show that skin pigmentation is a polygenic quantitative trait with high heritability,810 and, with the direct correlation between skin pigmentation and incident UV exposure, it has long been postulated that it is a trait under intense selective pressure.1114 Because of its striking diversity and potential evolutionary importance as a highly selected trait, human skin pigmentation is of great scientific interest. Over the past 100 years, pigmentary mutants in model organisms and human pigmentation disorders have been the main source for the discovery of genes involved in skin color. More than 100 pigmentation genes have been identified in mouse alone, most with identified human orthologs, and at least 18 genes are currently listed in OMIM as being involved in human albinism.15,16 However, until very recently, few direct studies to identify the genes responsible for the normal range of human skin pigmentation have been reported13,1726; in addition, all these studies have investigated the role of only known pigmentation-disorder genes in normal skin-color variation. Therefore, many questions about the loci responsible for the natural diversity of human skin pigmentation have remained unanswered: how many genes are involved, are the different gene effects additive, are the genes involved in skin pigmentation the same across different ethnic populations, and are there still-undiscovered pigmentation genes?

With the availability of the entire human genomic sequence in 2001,27,28 the identification of millions of SNPs across the genome,2931 and the development of high-throughput genotyping technologies, the tools were available for investigation of the genetic components controlling human skin pigmentation with use of a high-density genomewide association study. In the present study, we applied a three-tiered methodology of quantitative pooled genotyping followed by individual genotyping of associated SNPs in original and replicate population sets, as has been successfully used elsewhere.32

In all, 1,620,742 SNPs across the genome were measured for allele-frequency differences between DNA pools made from the 20% tails of the skin-pigmentation distribution, as objectively measured by reflectance spectroscopy in a South Asian population. The top 30,000 SNPs (~2%) with the largest allele-frequency differences between the pools were individually genotyped in the original population sample of 737 individuals, followed by individual genotyping of associated SNPs in an independent replicate sample of 231 individuals of South Asian ancestry. Despite confounding of phenotype-genotype associations by population stratification, we found three genes with replicated genomewide significance that together account for a large fraction of the skin-pigmentation variation in the South Asian population.

Material and Methods

Sample Selection

Approval for the study was obtained from ethics review boards in the United Kingdom. After a complete description of the study was given to the subjects, written informed consent was obtained. Recruitment of volunteers of South Asian descent was conducted at >50 different sites across the United Kingdom. For each qualifying subject, either both parents or all four grandparents were born in India, Pakistan, Bangladesh, or Sri Lanka. Reported ancestry was used to group subjects by region into eastern (mainly Bangladesh), northwestern (mainly Gujarat and Punjab), southern (mainly Sri Lanka), or mixed origin. Reported ancestry was defined for each individual from the grandparental information if known and from the parental information otherwise. If the country and state of birth of all four grandparents were the same, then the reported ancestry of the volunteer was defined similarly; this was also the case if there was missing information from one grandparent but the origin of the other three grandparents was the same. Furthermore, if information from both maternal and/or paternal grandparents was missing, then maternal and/or paternal information was used in its place. Subjects with disagreements about reported ancestry were categorized as “mixed.” Both males and females aged [gt-or-equal, slanted]18 years were included. Volunteers were excluded if they reported consumption of oral dietary foods or supplements or the use of topical skin ointments on the measurement areas that are designed to change skin color. Other exclusion criteria included pigmentation disorders or any current skin disease of any type anywhere on the body.

Phenotype and DNA Collection

Skin pigmentation was measured with a Minolta chromameter with use of the Commission Internationale de l’Eclairage L*a*b* color system. The L* value, which measures skin reflectance, or lightness, ranges from 0 to 100, where 0 is the darkest and 100 is the lightest skin pigmentation. Skin reflectance was measured on three relatively hairless sites per arm—the sun-exposed lower dorsal forearm, the sun-protected inner volar forearm, and the sun-protected inner arm above the elbow—giving six measurements per volunteer. The highest of the six L* values was defined as maxL*. Skin chromameter measurements for 98 randomly selected individuals of South Asian ancestry living in the United Kingdom were used to estimate the distribution of maxL* within the South Asian population (fig. 1). Individuals were selected for the first phase of our association study by targeted recruitment in the upper and lower quintiles of maxL* values determined as described above. Targeted recruitment of individuals was conducted by prescreening volunteers for skin color by visual assessment. More than 2,800 volunteers were enrolled in the study and had their skin reflectance measured, with blood samples taken from >1,100 individuals for the isolation of DNA. Genomic DNA was extracted from whole blood with use of the Qiagen DNA isolation kit, in accordance with the manufacturer’s instructions. The sample recruited for the pooled genotyping phase of the study (cohort 1) consists of 395 individuals with maxL* values [less-than-or-eq, slant]56 and 383 individuals with maxL* values [gt-or-equal, slanted]63, who were classified as “L” for having low reflectance and as “H” for having high reflectance, respectively. For the replication population (cohort 2), 119 L and 116 H individuals were recruited.

Figure  1.
Histogram of maximum L* values (maxL*) in volunteers of South Asian ancestry. The maxL* values were obtained from 98 randomly selected individuals of South Asian ancestry living in the United Kingdom. The maxL* values for the 20th and 80th percentiles ...

Population-Structure Analysis

The genetic ancestry of the individuals in cohort 1 was examined by individually genotyping them on a set of 312 SNPs, referred to as “genomic control” (GC) SNPs, which are spaced approximately uniformly across the human autosomes.33 Population structure within the study population was evaluated using the structure program34 with models having 1–4 ancestral clusters. For each model, subsets of the L and H individuals were selected so as to construct pools matched on mean cluster membership, as described elsewhere.32 Simple χ2 tests for association with skin reflectance were performed for these GC SNPs. Any residual inflation in test statistics was assessed in comparison with the experimental error of allele-frequency determination in pooled genotyping.32

For the association analysis of individual genotypes, both population cohorts used in this study were assessed jointly by use of principal-components analysis (PCA)35 of standardized genotypes on the GC SNPs. The presence of population structure increases the fraction of genotypic variance that is accounted for by the first few principal components. The genotypes and phenotypes are subsequently adjusted for these to project out the contributions of ancestry to the genotypes. To assess the significance of the eigenvalues corresponding to the first several principal components, the PCA was repeated for 100 permutations of the genotype data in which SNP genotypes were scrambled, so as to eliminate all SNP-SNP correlations.

Pooled Genotyping on Oligonucleotide Arrays

From cohort 1, 286 L individuals and 285 H individuals were used to construct two DNA pools, L and H, respectively, for the estimation of 1,620,742 SNP allele-frequency differences between the low- and high-reflectance groups, with the use of methods described elsewhere.32,36 For pooled genotyping, each DNA pool was amplified in quadruplicate with the use of 266,851 long-range PCRs; each PCR replicate was then pooled, labeled, hybridized to 228 high-density arrays, stained, and detected as described elsewhere.36

Determination of Pooled Allele-Frequency Estimates

Estimates of the pooled allele frequency, equation M1, were computed from fluorescence intensities on the arrays, as described elsewhere.32 For each SNP, we obtained eight independent measurements of equation M2, four for each DNA pool. The allele-frequency difference between the L and H pools, Δequation M3, was determined from the average for each pool. We also used an independently derived genomewide haplotype map35 to obtain a better estimate of the allele-frequency difference for some SNPs, using a method described elsewhere.32 In brief, within each haplotype block across the genome, linear regression was used to solve for frequency differences of the underlying common haplotype patterns from the measured estimates of Δequation M4. A good quality of fit (P<.05 for the F test) was considered to be indicative of conformance between the genetic structure of the population in our study and the independently derived haplotype map. For such haplotype-conforming SNPs, the regression provided improved estimates of allele-frequency differences, called fitted Δequation M5, by effectively averaging over redundant information within each haplotype block. These linear regressions within haplotype blocks also allowed for the elimination of some redundancy in the SNP set selected for subsequent individual genotyping.

The selection of SNPs for further assessment with individual genotyping used a ranking by the magnitude of the estimated allele-frequency difference |Δequation M6|. For haplotype-conforming SNPs, a lower threshold was employed on the fitted |Δequation M7|, on the basis of the fact that estimates of allele-frequency differences are better for these SNPs. This approach to selecting SNPs from pooled genotype data for subsequent individual genotyping has been explained in detail elsewhere.32

Individual Genotyping on Oligonucleotide Arrays

High-density oligonucleotide arrays for individual genotyping were designed as described elsewhere,30 with each SNP interrogated by 40 distinct 25-mer probes. DNA samples were amplified by short-range multiplex PCR and were labeled, hybridized to the arrays, stained, and detected as described elsewhere.33 The individual genotypes were determined by clustering measurements from multiple scans in the two-dimensional space defined by reference and alternate perfect-match trimmed mean intensities, as described elsewhere.37,38 Quality-control filters were applied to both the DNA samples and the SNPs. DNA samples with call rates <0.75 or that showed evidence of misidentification were excluded from analysis. SNPs genotyped in <0.8 of the DNA samples were excluded from further analysis. Deviations from Hardy-Weinberg equilibrium (HWE) are sometimes indicative of problems with genotype clustering; however, such deviations could also legitimately arise because of population structure or association with the trait of interest. Therefore, SNPs that did not follow HWE at a level of P<.001 were discarded in most cases, with exceptions for SNPs that were also statistically significant at a level of P[less-than-or-eq, slant].001 on a simple trend test for association with skin reflectance corrected with GC.39 For these potentially associated SNPs, the clustering was visually inspected, and SNPs were excluded from further analysis if problems were detected.

Single-SNP Association Tests

Association tests were performed with logistic regression, with covariates to account for sex and population structure, with the use of likelihood-ratio tests to assess the significance of association with SNP genotypes. Population structure was represented by the first several principal components of the genotype matrix for the set of individually genotyped GC SNPs (see the “Population-Structure Analysis” section). Sex is a known confounder with skin pigmentation24 that was not explicitly matched for in selecting study participants; thus, it was also included as a covariate in logistic regression. The model for our primary single-SNP association test can be written as

equation image

where P(H) is the likelihood of membership in the H group, P(L) is the likelihood of membership in the L group, γ is a factor denoting sex, π1–π5 are the first five principal components determined as described in the “Population-Structure Correction in the Association Analysis” section, and g is the SNP genotype of the individual, coded as g=0,1,2 on the basis of the count of an arbitrarily designated allele. This linear coding of genotypes corresponds to a multiplicative model of risk. A likelihood-ratio test was used to test for association of skin reflectance with genotype, comparing the model above with a null model without the genotype term,

equation image

We tested for residual systematic inflation in the test statistics by applying this likelihood-ratio test to all samples taken together (cohorts 1 and 2) for the GC SNPs that were polymorphic in the study population and that passed our quality filters for individual genotyping. Other features of interest in the genotype data (dominance, epistasis, and independence of associations) were modeled by adding corresponding terms to the logistic-regression models.40

Independence of Associations

Cohort 2 samples were genotyped for 408 SNPs in the region on chromosome 15 between positions 45,524,265 and 48,470,992 (National Center for Biotechnology Information [NCBI] build 36), where many SNPs showed significant associations with the phenotype in cohort 1. To assess the independence of these SNP associations, an association analysis was performed conditioning on the genotype of the strongest associated SNP in this region, rs1426654. The conditional association for each other SNP in this region was assessed with a likelihood-ratio test that compared the logistic-regression model,

equation image

with the null model,

equation image

where g represents the genotypes of the SNP being assessed and gtop represents the genotypes of rs1426654. An analogous analysis with the roles of g and gtop reversed was also performed.

Dominance and Interactions

For the three primary SNPs that were found to be strongly associated with skin reflectance, dominance effects and epistasis were investigated by adding appropriate terms to the logistic-regression model. To test for dominance effects, genotypes of these SNPs were represented by two variables, gadd and gdom, and the genotypes AA, Aa, and aa (where A and a represent the two SNP alleles, arbitrarily designated) were coded as gadd=(-1,0,1) and gdom=(-0.5,0.5,-0.5).40 Thus, gdom represents deviations from a multiplicative risk model. A logistic-regression model was built on the cohort 2 samples, and a likelihood-ratio test was used to evaluate the significance of the association with gdom.

equation image

was compared with

equation image

To test for interactions between pairs of SNPs, we constructed indicator variables of possible combinations of genotypes at the two loci. Denoting the alleles at SNP 1 by a and A and those at SNP 2 by b and B, the indicator variables are IaAbB, IaABB, IAAbB, and IAABB, where, for example, IaAbB=1 for an individual whose genotype is aA on SNP 1 and bB on SNP 2. These terms were added to a base model that has additive terms gadd,1, gadd,2, and gadd,3 for the three SNPs. The interaction between each pair of SNPs was then assessed using a likelihood-ratio test, comparing the model

equation image


equation image


Determination of the Phenotypic Groups

The population of South Asia, which, in this study, includes India, Pakistan, Bangladesh, and Sri Lanka, has a wide natural variation in skin pigmentation and consists of the most genetically diverse population outside of Africa,41 which makes it well suited for the investigation of the complex genetic trait of human skin pigmentation. Human skin color is largely determined by two biological components, melanin and hemoglobin, and one major environmental component, exposure to UV light.42 Melanin is the major determining factor in intrinsic skin pigmentation43; therefore, we were most interested in identifying genes that affect melanin. The melanin contribution to skin pigmentation can be quantitatively measured and distinguished from redness caused by hemoglobin and/or inflammation by reflectance spectrophotometry with use of the L* value from the L*a*b* color system.44 Therefore, the highest L* value (maxL*) from hairless, sun-protected sites on the arm was used to define intrinsic skin reflectance in this study, thereby controlling for confounding effects due to sun exposure and hair. To empirically determine the top 20% and bottom 20% of the skin-reflectance distribution in the South Asian population, 98 randomly selected individuals of South Asian ancestry living in the United Kingdom were recruited and had skin chromameter measurements taken. From the plot of the distribution of the maxL* values in this set (fig. 1), individuals with maxL* values [less-than-or-eq, slant]56 were classified as belonging to the low-reflectance quintile (L) with the darkest skin pigmentation, and those with maxL* values [gt-or-equal, slanted]63 were classified as belonging to the high-reflectance (H) quintile with the lightest skin pigmentation of the distribution. Targeted recruitment of individuals in these tails resulted in the collection of DNA from 395 and 383 subjects with low and high skin reflectance, respectively, for the pooled genotyping phase of the study (cohort 1).

Population Structure and Construction of Balanced Pools

The genetic diversity within South Asian populations potentially introduces significant confounding in an association study because of population structure.45 To control for spurious associations due to systematic differences in ancestry between the L and H groups, we composed the two DNA pools to be used in the pooled genotyping with individuals who were matched for ancestry. The genetic ancestry of the 395 L and 383 H individuals was investigated by individually genotyping them for a set of 312 unlinked GC SNPs. Of the 312 GC SNPs, 294 yielded high-quality genotype data and were analyzed for population structure by use of the structure program,34 with the use of models with 1–4 clusters. The results yielded good evidence of more than one distinct genetic population cluster in our set of 778 individuals but yielded little support for more than two population clusters. The inferred cluster-membership values for a two-cluster model correlate with reported geographical ancestry. Individuals of southern (mainly Sri Lankan) and eastern (mainly Bangladeshi) origin have a similar distribution of admixture, which is distinct from individuals of northwestern (mainly Punjabi) origin (fig. 2).

Figure  2.
Histogram of admixture proportions in cohort 1 for the geographical ancestry groups. Admixture proportions were determined for 778 individuals in cohort 1 with use of the structure program for a two-cluster model, with genotypes from 294 GC SNPs. Reported ...

Using the same genotype data set of 294 GC SNPs and a method described elsewhere,32,33 we found significant population stratification between the tails of the skin-reflectance distribution. This result was not unexpected, since there was a clear bias in the recruitment of individuals for cohort 1 from the three geographical regions with respect to the reflectance tails (table 1). Under the null (one-cluster) model of no population structure, for which the L and H groups are assumed to be well matched, we observed a large excess of small P values in χ2 tests for association with skin color (table 2), and a global test for population stratification based on the sum of χ2 statistics46 was highly statistically significant (P<10-40). Using the inferred ancestry values obtained from the structure program with the two-cluster model, we composed the two phenotypic pools, using a subset of individuals such that the average proportions of genetic ancestry were similar between the groups and as many individuals as possible were retained. This matching required us to exclude ~25% of the original individuals but reduced the population stratification considerably, as can be measured by a reduction in the sum of χ2 statistics versus the unmatched groups (table 2). Although the residual stratification in the matched groups is statistically significant (P=3.9×10-5), the effect of a sum statistic of 400 is comparable to additional experimental error of ~1% in the pooled allele-frequency measurements. This error size is smaller than the actual level of experimental error. On the basis of these results, we constructed the DNA pools from 286 L and 285 H individuals from cohort 1. Recruitment of individuals for the replication population, cohort 2, was designed to reduce the population stratification bias with respect to skin reflectance, with the use of reported geographical ancestry as a marker for the genetic ancestry profiles we detected in cohort 1 (fig. 2).

Table 1.
Cohort Distribution for Regional Ancestry and Skin Reflectance[Note]
Table 2.
Measures of Population Stratification for Skin-Reflectance Pools of Different Compositions

Pooled Genotyping Results

We attempted to estimate the allele-frequency difference, Δequation M8, between the L and H pools for 1,620,742 SNPs across the human genome. Individual measurements of equation M9 were excluded when we could identify specific experimental errors—saturated signal intensities, inconsistent hybridization patterns, or low signal-to-background ratios. We excluded SNPs for which more than one equation M10 measurement per pool failed these quality-control criteria. SNPs for which the SE of Δequation M11 was >0.035 were also excluded, so that the set of SNPs having the largest absolute Δequation M12 were not dominated by a subset of measurements with very high experimental variance. “Fitted Δequation M13” values, where information from Perlegen’s haplotype map is used to improve pooled allele-frequency estimates, were generated for a fraction of the SNPs, called “haplotype-conforming” SNPs. These fitted Δequation M14 values provide more-accurate estimates than do the measured Δequation M15 values of individual SNPs, by exploiting correlations among SNPs within haplotype blocks. After all the data-quality filters were applied, we had high-quality Δequation M16 or fitted Δequation M17 estimates for 1,502,205 SNPs.

Individual Genotyping for Cohort 1

SNPs were ranked on the basis of absolute Δequation M18 or fitted Δequation M19, from largest to smallest, and 30,000 were chosen for individual genotyping on the basis of the capacity of a high-density oligonucleotide array. Selected SNPs from the pooled genotyping results had a fitted equation M20 or a equation M21. A lower threshold was used for haplotype-conforming SNPs, because their consistency with the haplotype map provides additional evidence of allele-frequency differences at those positions. An additional set of 66 candidate SNPs was selected to provide denser coverage of variants in genes known to be involved in the pigmentation process (table 3), and 312 GC SNPs used to test for population structure were also included in the array. Short-range PCR assays were successfully designed for 30,045 SNPs, 98.9% of those selected for all categories, and were genotyped with the 778 individuals of cohort 1. After application of quality filters to the genotypes and the DNA samples, a total of 25,928 SNPs for 737 individuals, 363 L and 374 H, were used for the association analysis, including 292 polymorphic GC SNPs. These included individuals who had been excluded from the balanced pools constructed for cohort 1; for, unlike pooled genotyping, with individual genotypes we can explicitly model population structure in the association analysis, and, therefore, the inclusion of all individuals maximizes our power to detect associations.

Table 3.
Candidate SNPs

Population-Structure Correction in the Association Analysis

Since we observed significant population stratification between the tails of the skin-reflectance distribution in this South Asian population, we assessed and adjusted for population structure in the association analysis, using the genotypes of our GC panel of 312 SNPs. Population structure was modeled jointly for both population cohorts used in this study, with use of a PCA of the standardized genotypes on the 292 polymorphic GC SNPs that passed our quality filters. The significance of the eigenvalues corresponding to the first several principal components was assessed by repeating the PCA on 100 scrambled realizations of the SNP genotype data. The largest three eigenvalues for the true data set were larger than the largest eigenvalue in any of the permuted data sets (P<.01); the largest five eigenvalues in the true data set were observed in fewer than half the permuted data sets. In addition, four of the top five principal components show significant association with reported ancestry regions in an analysis of variance (PC1, P<1.0×10-16; PC2, P=.19; PC3, P=2.9×10-3; PC4, P=2.1×10-3; PC5, P=5.9×10-8), indicating that these axes of variation likely reflect true features of population structure. We also investigated the relationship between skin reflectance (classified as “L” or “H”) and the top 10 principal components, using logistic regression with sex as a covariate. The first and fifth components showed significant associations (P=4.6×10-22 and P=1.7×10-3, respectively). On the basis of these findings, we adjusted all association analyses for the first five principal components in addition to sex. After this adjustment, we tested for evidence of remaining population structure bias in the association analysis, using the 292 polymorphic GC SNPs that passed quality filters. One SNP (rs8041414) was located in a region on chromosome 15 showing the strongest association with skin reflectance. The results for the remaining 291 SNPs did not reveal a significant inflation in single-SNP test statistics for association with skin reflectance (mean deviance 1.0232; P=.37). Thus, no further corrections for population structure were deemed necessary in association tests for other SNPs in cohort 1 or cohort 2.

Association Results for Cohort 1

Single-SNP tests for association with skin reflectance were performed with logistic regression, including sex and five principal components of the GC SNP genotypes as covariates. Likelihood-ratio tests were used to assess the significance of associations with SNP genotypes. In total, we tested 1,502,205 SNPs for association with skin pigmentation. Although there is likely to be some linkage disequilibrium among the SNPs within this population, it could not be evaluated from our pooled genotyping allele-frequency estimates. Therefore, we employed a conservative Bonferroni threshold for significance of association, α=0.05/1,502,205=3.3×10-8, and found 42 SNPs associated at this level of significance across the genome (table 4). Surprisingly, 39 of the 42 associated SNPs are all located within a single 2.4-Mb chromosomal region of 15q21.1-21.2, between chromosomal positions 45,774,265 and 48,220,992. The strongest SNP association for the entire genome is in this region, with an allele-frequency difference of 45% between the H and L reflectance groups, yielding a P value of 1.0×10−50. This SNP, rs1834640, is located 21 kb from the closest gene, SLC24A5. Of the three SNPs with genomewide significance located outside the chromosomal 15q region, two are located within the TYR gene on chromosome 11; one is in an intron (rs12295166), and the other is a nonsynonymous polymorphism (rs1042602 [p.S192Y]) and is the only SNP in this set of 42 associations that was selected for individual genotyping as a candidate SNP (table 4). The last associated SNP, rs16891982, is also a nonsynonymous polymorphism (p.L374F) in the gene SLC45A2 on chromosome 5.

Table 4.
Associated SNPs Showing Genomewide Significance in Cohort 1

Replication of Associations in Cohort 2

All of the 42 genomewide-significant SNPs in cohort 1 were individually genotyped with an independent replicate population of 235 individuals (cohort 2). After quality filters were applied, genotypes of the 42 SNPs in 115 H and 116 L individuals were used in the association analysis, where single-SNP likelihood-ratio tests were performed in the same manner as were those for cohort 1. A majority of the associated SNPs, 32 (76%) of 42, yielded P values with a significance threshold of <.05 (table 5). These included the nonsynonymous polymorphisms in TYR and SLC45A2, as well as 30 SNPs in the 2.4-Mb region on chromosome 15. As was seen in cohort 1, the most significant association among these SNPs in cohort 2 was for rs1834640 on 15q21.1, with P=3.28×10-15 and a 36% allele-frequency difference between the high- and low-reflectance groups. Treating the two cohorts as a single population of 968 individuals in the association analysis, instead of independently, yielded different statistical conclusions for only four SNPs, the intronic TYR SNP rs12295166 and three chromosome 15 SNPs, which show genomewide significance in the joint analysis but failed to reach statistical significance by the previous analysis (table 5).

Table 5.
Results for Associated SNPs in Cohort 2

Independent Associations on Chromosome 15 in the Vicinity of SLC24A5

To determine whether the large number of SNP associations observed within the 15q21.1-21.2 region were due to the presence of multiple independently associated SNPs or to one association having a large effect, we individually genotyped a dense set of SNPs within this region for cohort 2. The range for SNP selection was extended by 0.25 Mb in both directions from the boundaries established by significant cohort 1 associations in the region—that is, from positions 45,524,265 to 48,470,992 on chromosome 15. A total of 408 SNPs were successfully assayed, and association test results for this set of SNPs revealed several other significant associations across the region (table 6 and fig. 3). The most significant association in this region was observed for rs1426654, a nonsynonymous polymorphism (p.A111T) in SLC24A5, with P=1.06×10-18 and a 39% allele-frequency difference between the high- and low-reflectance groups. This SNP is in strong linkage disequilibrium with the most significant SNP from the genomewide scan, rs1834640 (R2=0.93), indicating that the two probably represent a single associated locus.

Figure  3.
Association signals, linkage disequilibrium, and genes in the region 45,524,265–48,470,992 on chromosome 15q21. Negative log10 association P values (found by logistic regression and likelihood-ratio tests) are displayed for cohorts 1 and 2 and ...
Table 6.
Association Test Results for a 2.4-Mb Region on Chromosome 15 for Cohort 2

Furthermore, by performing an association analysis conditional on the genotypes of rs1426654 in cohort 2, we observed that none of the other SNPs rise above the Bonferroni threshold for significance of conditional association in the region (α=0.05/407=1.23×10-4) (fig. 3 and table 6). When the conditional analysis was performed in reverse, by evaluation of the association of rs1426654 in cohort 2 conditional on each of the other 407 SNPs in this region in turn, the rs1426654 association was found to be significant at the genomewide level (P<3.3×10-8) for all SNPs but rs1834640, with which it is in the strongest linkage disequilibrium (P=.003 for association of skin reflectance with rs1426654 conditional on rs1834640; P=.73 for association with rs1834640 conditional on rs1426654). Thus, the data are consistent with the existence of a single strong primary association at rs1426654 in the SLC24A5 gene, and the multiplicity of apparent single-SNP associations in this region can be explained by the pattern of linkage disequilibrium with this SNP (fig. 3). Because of the lack of genomewide individual genotyping data for a South Asian population, we are unable to compare the extent of linkage disequilibrium in this region with that in the rest of the genome. Of the 407 SNPs genotyped in this region, only rs1834640 is in high linkage disequilibrium with rs1426654 (R2>0.9), and 3 more SNPs are in moderate linkage disequilibrium (R2>0.5) (fig. 3). Although we cannot rule out other associations of small effect within this chromosomal region, the evidence points strongly to a single association of large effect size, located at or in strong disequilibrium with the nonsynonymous SNP rs1426654 within SLC24A5.

Dominance and Interactions among Three Nonsynonymous Polymorphisms

The single-SNP association analyses with cohorts 1 and 2 identified three nonsynonymous polymorphisms in three genes—rs1426654 (p.A111T) in SLC24A5, rs16891982 (p.L374F) in SLC45A2, and rs1042602 (p.S192Y) in TYR—that show genomewide significance for association with skin pigmentation, on the basis of a multiplicative model of risk. Under a simple additive allele-risk model, the squared correlation (R2) between the dichotomously defined skin-reflectance trait and the genotypes is directly related to the fraction of the variance of skin reflectance accounted for by each SNP. The primary associated SNP, rs1426654 in SLC24A5, has the largest effect on skin reflectance of the three SNPs, with R2=0.33, compared with R2=0.036 for rs16891982 in SLC45A2 and R2=0.025 for rs1042602 in TYR.

We examined whether there is evidence within the cohort 2 data of deviations from a multiplicative model of risk, such as a dominance effect or interactions among the loci. We have very good power to detect dominant or recessive inheritance for rs1426654, which has a high minor-allele frequency (49% in the L group and 10% in the H group) and a very large effect on skin reflectance. However, our power to detect deviations from additive inheritance for the other two SNP associations is much lower because of the smaller size of their effect. Likelihood-ratio tests were used to assess the significance of an added dominance term, but none of the tests showed significant deviations from multiplicative risk (P=.58 for rs1426654, P=.20 for rs16891982, and P=.91 for rs1042602), which provides no evidence to support dominant or recessive inheritance for any of the three SNPs.

We next explored whether the three SNPs collectively explain more of the trait than does the combination of their single-SNP associations. Interactions were modeled within the framework of logistic regressions by adding indicator variables corresponding to specific genotype combinations for pairs of SNPs. We tested each of the three pairs separately and found no evidence for pairwise interactions (rs1426654-rs16891982, P=.77; rs1426654-rs1042602, P=.38; rs16891982-rs1042602, P=.12). Therefore, our data are most consistent with a simple model for skin pigmentation comprising additive contributions from alleles at the three associated nonsynonymous polymorphisms. Because our study design used the tails of the skin-reflectance distribution, we cannot directly determine the overall fraction of the trait variance in the population that is accounted for by the associated SNPs. Nevertheless, it is evident that these three markers collectively account for a large fraction of the wide variability of skin pigmentation within South Asians.


This is the first genomewide association study, to our knowledge, to investigate the genetic determinants of normal skin pigmentation within a human population. With the use of objective quantitative measurements of melanin content, our study clearly identifies three loci—TYR, SLC45A2, and SLC24A5—that contribute to the natural variation in skin pigmentation within a South Asian population. Within each of these genes, we found polymorphisms that met genomewide significance on association tests, with the associations replicated in a second South Asian cohort. The contributions of these polymorphisms to skin pigmentation were found to be independent and additive across genes, with no evidence of dominant or recessive effects at any of these loci. Collectively, these three genes account for a large fraction of the wide, naturally occurring variation in skin pigmentation in South Asians.

Strikingly, our primary associated SNP in SLC24A5 may explain >30% of the variance of the dichotomously defined skin reflectance in our study population. As a result of the interplay between the strength of this single-SNP association and linkage disequilibrium with SNPs in its vicinity, several other significant associations were observed in a 2.4-Mb region on chromosome 15q21. Although we cannot completely exclude the possibility that multiple real independent associations exist in this chromosomal 15 region, our conditional analysis shows unambiguously that the association signals on all SNPs in the region are far less significant than those for the primary associated SNP in SLC24A5.

This is also one of the first genomewide association studies to focus on a South Asian population. This population is known to be highly genetically diverse, and, because ancestry is known to be highly confounded with skin pigmentation, the treatment of population structure was crucial to our association analysis. Our analysis of population structure was based on the individual genotypes of 968 individuals with 292 unlinked SNPs. Notwithstanding the limitations imposed by the size of this data set on the detection of structure,48 the results for these 292 SNPs indicated that we were able to adequately correct for stratification. It is encouraging that, despite the use of a study population that is genetically diverse and a trait that is strongly confounded with ancestry, we were able to detect three true SNP associations.

At the time of this study, mutations in two of these genes, TYR and SLC45A2, were known to be involved in different forms of oculocutaneous albinism (OCA1 [MIM 203100] and OCA4 [MIM 606574]),49,50 yet their contribution to the normal range of skin pigmentation between and within populations was just beginning to be studied. The remaining locus, SLC24A5, which displays the largest genetic effect on pigmentation status in the South Asian population studied here, was an unknown pigmentation gene during our study. The SLC24A5 gene has since been shown to play an important role in pigmentation, because a mutation in this gene was identified as the genetic basis of the hypopigmentation phenotype in golden zebrafish.22 During the past few years, several candidate-gene admixture studies on a cohort of African Americans and African Caribbeans has shown that the same three nonsynonymous polymorphisms that we found in our South Asian cohort—rs1042602 in TYR, rs16891982 in SLC45A2, and rs1426654 in SLC24A5—are significantly associated with melanin content measured by reflectometry.13,21,22,24 When taken together with the results of our study, the evidence is compelling that these three loci are a large component of the genetic influence of constitutive pigmentation in these diverse populations.

The importance of these three particular SNPs—rs1042602 in TYR, rs16891982 in SLC45A2, and rs1426654 in SLC24A5—to skin-pigmentation variation within other populations is likely to be highly variable across the globe. All three of the SNPs are essentially monomorphic in African and East Asian populations and thus are unlikely to contribute significantly to skin-pigmentation variation within those groups.13,25,29,30 For populations of European ancestry, the contributions of these three SNPs to intrapopulation skin-pigmentation variation are quite distinct from each other. The SLC24A5 SNP rs1426654 is nearly fixed for one allele in all European groups.11,13,22,29,30,51 In comparison, the TYR SNP rs1042602 is highly polymorphic in all European populations, but, despite its variability, previous studies found that this SNP was not significantly associated with skin-reflectance measurements in a white population.13,21,24,29 In contrast to the TYR SNP, the SLC45A2 SNP rs16891982 displays a gradient of increasing minor-allele frequency in European populations from north to south13,26,52 and has been significantly associated with subjectively measured skin color in a population of European ancestry.19 It is important to distinguish the involvement of these three particular SNPs in skin-pigmentation variation within populations from the potential contribution of the three loci (TYR, SLC24A4, and SLC45A2). For example, two functional promoter variants and another nonsynonymous polymorphism in SLC45A2, rs26722 (p.E272K), have also been significantly associated with subjectively measured skin color in a population of European ancestry.20 Whereas the promoter variants in SLC45A2 were not included in our study, rs26722 was assayed in the present study but showed weaker evidence of association with skin-reflectance measurements in our South Asian population (joint cohort association P=.02), as compared with the significantly associated rs16891982 SNP.

Other genes have been implicated either directly or indirectly with pigmentation-related traits in previous work. Several reported associations are based on the examination of markers in pigmentation candidate genes for allele-frequency differences between populations with known skin-pigmentation differences1113,51 or are associations with a different qualitative measure of pigmentation, such as the Fitzpatrick skin type and subjective classification of skin color by visual assessment,5355 all of which make direct comparisons with our results difficult. In contrast, other associations reported for skin pigmentation used quantitative measures of reflectance similar to those in our study17,18,21,23,24; therefore, our results can be compared directly with those studies. A modest association has been found between skin pigmentation from reflectance measurements for SNP rs1800404 within the gene OCA221 in an African American population (P=.01). Although rs1800404 was not assayed in our study, the closest OCA2 SNP that was included in our study, rs1037208, located 4,414 bp from rs1800404, yielded the most significant joint cohort association P value (P=5.6×10-5) of several OCA2 SNPs that we assayed with individual genotyping. On the basis of International HapMap data for the CEU sample (Utah residents with ancestry from northern and western Europe),29 these two SNPs are in modest linkage disequilibrium with one another (R2=0.35). Although we cannot establish beyond a doubt that these two SNPs correspond to a single locus in our population, it is quite plausible that they are highly correlated in South Asians; therefore, our results may support the prior association of OCA2 and skin pigmentation. Another study found an association with skin reflectance in a Tibetan population under an epistatic model (P=.01) for a pair of SNPs in OCA2 and MC1R17rs12910433 and a nonsynonymous polymorphism, rs2228479 (p.V92M), respectively. Both these SNPs were individually genotyped in cohort 1 in our study, and, whereas the OCA2 SNP rs12910433 showed no evidence of association (P=.3), the MC1R SNP rs2228479 had the lowest P value (P=6.8×10-4) of several MC1R SNPs that were individually genotyped. There is no evidence of interactions between these two SNPs in our study population (P=.91). Lastly, one SNP in the 3′ UTR of the ASIP gene, rs6058017, has been associated with darker skin color in female African Americans18 (P<.001). This SNP was included in our pooled study but showed no allele-frequency difference between the low- and high-reflectance groups (equation M22), and attempts to individually genotype this SNP in cohort 1 were unsuccessful. Since the previous association showed significant association with only female, but not male, skin pigmentation, our pooled genotyping strategy that used mixed sexes would not have the power to find such an association.

It is important to note that the application of a very strict Bonferroni correction of 1,502,205 independent tests, combined with the pooled genotyping design, greatly reduces the power of this study to detect real SNP associations with small effects on skin pigmentation; thus, it is possible that loci other than TYR, SLC45A2, and SLC24A5 may affect skin-pigmentation variation within the South Asian population. Furthermore, as was mentioned earlier in this discussion, several studies comparing allele frequencies of candidate skin-pigmentation SNPs across multiple global populations present evidence supporting an independent genetic mechanism for the lighter skin pigmentation of East Asians that is distinct from genetic factors influencing lighter pigmentation in Europeans.12,13,25 Therefore, it is likely that loci other than the three identified in this study influence skin pigmentation in other global populations.


We thank Cathryn Lewis, C. V. Natraj, Geoffrey Probert, Pushker Sona, Ian Scott, Michael Barratt, Simon Alaluf, and Nicholas Holmberg, for useful discussions. We thank all Perlegen Sciences employees who provided technical assistance and scientific discussion for this project and manuscript. We also thank the volunteers who donated their blood and time, without which this study would not have been possible.

Web Resources

The URLs for data presented herein are as follows:

Online Mendelian Inheritance in Man (OMIM), http://www.ncbi.nlm.nih.gov/Omim/ (for OCA1 and OCA4)


1. Armstrong BK, Kricker A, English DR (1997) Sun exposure and skin cancer. Australas J Dermatol 38:S1–S6 [PubMed] [Cross Ref]10.1111/j.1440-0960.1997.tb01115.x
2. Palmer JS, Duffy DL, Box NF, Aitken JF, O’Gorman LE, Green AC, Hayward NK, Martin NG, Sturm RA (2000) Melanocortin-1 receptor polymorphisms and risk of melanoma: is the association explained solely by pigmentation phenotype? Am J Hum Genet 66:176–186 [PMC free article] [PubMed]
3. Rees JL (2004) The genetics of sun sensitivity in humans. Am J Hum Genet 75:739–751 [PMC free article] [PubMed]
4. Robbins AH (1991) Biological perspectives on human pigmentation. Cambridge University Press, Cambridge, United Kingdom
5. Branda RF, Eaton JW (1978) Skin color and nutrient photolysis: an evolutionary hypothesis. Science 201:625–626 [PubMed] [Cross Ref]10.1126/science.675247
6. Chaplin G (2004) Geographic distribution of environmental factors influencing human skin coloration. Am J Phys Anthropol 125:292–302 [PubMed] [Cross Ref]10.1002/ajpa.10263
7. Murray FG (1934) Pigmentation, sunlight, and nutritional disease. Am Anthropol 36:438–44510.1525/aa.1934.36.3.02a00100 [Cross Ref]
8. Clark P, Stark AE, Walsh RJ, Jardine R, Martin NG (1981) A twin study of skin reflectance. Ann Hum Biol 8:529–541 [PubMed] [Cross Ref]10.1080/03014468100005371
9. Frisancho AR, Wainwright R, Way A (1981) Heritability and components of phenotypic expression in skin reflectance of Mestizos from the Peruvian lowlands. Am J Phys Anthropol 55:203–208 [PubMed] [Cross Ref]10.1002/ajpa.1330550207
10. Harrison GA, Owen JJ (1964) Studies on the inheritance of human skin colour. Ann Hum Genet 28:27–37 [PubMed] [Cross Ref]10.1111/j.1469-1809.1964.tb00457.x
11. Izagirre N, Garcia I, Junquera C, de la Rua C, Alonso S (2006) A scan for signatures of positive selection in candidate loci for skin pigmentation in humans. Mol Biol Evol 23:1697–1706 [PubMed] [Cross Ref]10.1093/molbev/msl030
12. Myles S, Somel M, Tang K, Kelso J, Stoneking M (2007) Identifying genes underlying skin pigmentation differences among human populations. Hum Genet 120:613–621 [PubMed] [Cross Ref]10.1007/s00439-006-0256-4
13. Norton HL, Kittles RA, Parra E, McKeigue P, Mao X, Cheng K, Canfield VA, Bradley DG, McEvoy B, Shriver MD (2006) Genetic evidence for the convergent evolution of light skin in Europeans and East Asians. Mol Biol Evol 24:710–722 [PubMed] [Cross Ref]10.1093/molbev/msl203
14. Relethford JH (1997) Hemispheric difference in human skin color. Am J Phys Anthropol 104:449–457 [PubMed] [Cross Ref]10.1002/(SICI)1096-8644(199712)104:4<449::AID-AJPA2>3.0.CO;2-N
15. Bennett DC, Lamoreux ML (2003) The color loci of mice—a genetic century. Pigment Cell Res 16:333–344 [PubMed] [Cross Ref]10.1034/j.1600-0749.2003.00067.x
16. Sturm RA, Teasdale RD, Box NF (2001) Human pigmentation genes: identification, structure and consequences of polymorphic variation. Gene 277:49–62 [PubMed] [Cross Ref]10.1016/S0378-1119(01)00694-1
17. Akey JM, Wang H, Xiong M, Wu H, Liu W, Shriver MD, Jin L (2001) Interaction between the melanocortin-1 receptor and P genes contributes to inter-individual variation in skin pigmentation phenotypes in a Tibetan population. Hum Genet 108:516–520 [PubMed] [Cross Ref]10.1007/s004390100524
18. Bonilla C, Boxill LA, Donald SA, Williams T, Sylvester N, Parra EJ, Dios S, Norton HL, Shriver MD, Kittles RA (2005) The 8818G allele of the agouti signaling protein (ASIP) gene is ancestral and is associated with darker skin color in African Americans. Hum Genet 116:402–406 [PubMed] [Cross Ref]10.1007/s00439-004-1251-2
19. Graf J, Hodgson R, van Daal A (2005) Single nucleotide polymorphisms in the MATP gene are associated with normal human pigmentation variation. Hum Mutat 25:278–284 [PubMed] [Cross Ref]10.1002/humu.20143
20. Graf J, Voisey J, Hughes I, van Daal A (2007) Promoter polymorphisms in the MATP (SLC45A2) gene are associated with normal human skin color variation. Hum Mutat 28:710–717 [PubMed] [Cross Ref]10.1002/humu.20504
21. Hoggart CJ, Parra EJ, Shriver MD, Bonilla C, Kittles RA, Clayton DG, McKeigue PM (2003) Control of confounding of genetic associations in stratified populations. Am J Hum Genet 72:1492–1504 [PMC free article] [PubMed]
22. Lamason RL, Mohideen MA, Mest JR, Wong AC, Norton HL, Aros MC, Jurynec MJ, Mao X, Humphreville VR, Humbert JE, et al (2005) SLC24A5, a putative cation exchanger, affects pigmentation in zebrafish and humans. Science 310:1782–1786 [PubMed] [Cross Ref]10.1126/science.1116238
23. Naysmith L, Waterston K, Ha T, Flanagan N, Bisset Y, Ray A, Wakamatsu K, Ito S, Rees JL (2004) Quantitative measures of the effect of the melanocortin 1 receptor on human pigmentary status. J Invest Dermatol 122:423–428 [PubMed] [Cross Ref]10.1046/j.0022-202X.2004.22221.x
24. Shriver MD, Parra EJ, Dios S, Bonilla C, Norton H, Jovel C, Pfaff C, Jones C, Massac A, Cameron N, et al (2003) Skin pigmentation, biogeographical ancestry and admixture mapping. Hum Genet 112:387–399 [PubMed]
25. Soejima M, Koda Y (2007) Population differences of two coding SNPs in pigmentation-related genes SLC24A5 and SLC45A2. Int J Legal Med 121:36–39 [PubMed] [Cross Ref]10.1007/s00414-006-0112-z
26. Yuasa I, Umetsu K, Harihara S, Kido A, Miyoshi A, Saitou N, Dashnyam B, Jin F, Lucotte G, Chattopadhyay PK, et al (2006) Distribution of the F374 allele of the SLC45A2 (MATP) gene and founder-haplotype analysis. Ann Hum Genet 70:802–811 [PubMed] [Cross Ref]10.1111/j.1469-1809.2006.00261.x
27. Lander ES, Linton LM, Birren B, Nusbaum C, Zody MC, Baldwin J, Devon K, Dewar K, Doyle M, FitzHugh W, et al (2001) Initial sequencing and analysis of the human genome. Nature 409:860–921 [PubMed] [Cross Ref]10.1038/35057062
28. Venter JC, Adams MD, Myers EW, Li PW, Mural RJ, Sutton GG, Smith HO, Yandell M, Evans CA, Holt RA, et al (2001) The sequence of the human genome. Science 291:1304–1351 [PubMed] [Cross Ref]10.1126/science.1058040
29. International HapMap Consortium (2005) A haplotype map of the human genome. Nature 437:1299–1320 [PMC free article] [PubMed] [Cross Ref]10.1038/nature04226
30. Hinds DA, Stuve LL, Nilsen GB, Halperin E, Eskin E, Ballinger DG, Frazer KA, Cox DR (2005) Whole-genome patterns of common DNA variation in three human populations. Science 307:1072–1079 [PubMed] [Cross Ref]10.1126/science.1105436
31. Sachidanandam R, Weissman D, Schmidt SC, Kakol JM, Stein LD, Marth G, Sherry S, Mullikin JC, Mortimore BJ, Willey DL, et al (2001) A map of human genome sequence variation containing 1.42 million single nucleotide polymorphisms. Nature 409:928–933 [PubMed] [Cross Ref]10.1038/35057149
32. Hinds DA, Seymour AB, Durham K, Banerjee P, Ballinger DG, Milos PM, Cox DR, Thompson JF, Frazer KA (2004) Application of pooled genotyping to scan candidate regions for association with HDL cholesterol levels. Hum Genomics 1:421–434 [PMC free article] [PubMed]
33. Hinds DA, Stokowski RP, Patil N, Konvicka K, Kershenobich D, Cox DR, Ballinger DG (2004) Matching strategies for genetic association studies in structured populations. Am J Hum Genet 74:317–325 [PMC free article] [PubMed]
34. Pritchard JK, Stephens M, Donnelly P (2000) Inference of population structure using multilocus genotype data. Genetics 155:945–959 [PMC free article] [PubMed]
35. Price AL, Patterson NJ, Plenge RM, Weinblatt ME, Shadick NA, Reich D (2006) Principal components analysis corrects for stratification in genome-wide association studies. Nat Genet 38:904–909 [PubMed] [Cross Ref]10.1038/ng1847
36. Patil N, Berno AJ, Hinds DA, Barrett WA, Doshi JM, Hacker CR, Kautzer CR, Lee DH, Marjoribanks C, McDonough DP, et al (2001) Blocks of limited haplotype diversity revealed by high-resolution scanning of human chromosome 21. Science 294:1719–1723 [PubMed] [Cross Ref]10.1126/science.1065573
37. Bierut LJ, Madden PA, Breslau N, Johnson EO, Hatsukami D, Pomerleau OF, Swan GE, Rutter J, Bertelsen S, Fox L, et al (2007) Novel genes identified in a high-density genome wide association study for nicotine dependence. Hum Mol Genet 16:24–35 [PMC free article] [PubMed] [Cross Ref]10.1093/hmg/ddl441
38. Saccone SF, Hinrichs AL, Saccone NL, Chase GA, Konvicka K, Madden PA, Breslau N, Johnson EO, Hatsukami D, Pomerleau O, et al (2007) Cholinergic nicotinic receptor genes implicated in a nicotine dependence association study targeting 348 candidate genes with 3713 SNPs. Hum Mol Genet 16:36–49 [PMC free article] [PubMed] [Cross Ref]10.1093/hmg/ddl438
39. Devlin B, Roeder K (1999) Genomic control for association studies. Biometrics 55:997–1004 [PubMed] [Cross Ref]10.1111/j.0006-341X.1999.00997.x
40. Cordell HJ, Clayton DG (2002) A unified stepwise regression procedure for evaluating the relative effects of polymorphisms within a gene using case/control or family data: application to HLA in type 1 diabetes. Am J Hum Genet 70:124–141 [PMC free article] [PubMed]
41. Majumder PP (1998) People of India: biological diversity and affinities. Evol Anthropol 6:100–11010.1002/(SICI)1520-6505(1998)6:3<100::AID-EVAN4>3.0.CO;2-I [Cross Ref]
42. Barsh GS (2003) What controls variation in human skin color? PLoS Biol 1:E27 [PMC free article] [PubMed] [Cross Ref]10.1371/journal.pbio.0000027
43. Rees JL (2003) Genetics of hair and skin color. Annu Rev Genet 37:67–90 [PubMed] [Cross Ref]10.1146/annurev.genet.37.110801.143233
44. Alaluf S, Atkins D, Barrett K, Blount M, Carter N, Heath A (2002) The impact of epidermal melanin on objective measurements of human skin colour. Pigment Cell Res 15:119–126 [PubMed] [Cross Ref]10.1034/j.1600-0749.2002.1o072.x
45. Sengupta S, Farheen S, Mukherjee N, Dey B, Mukhopadhyay B, Sil SK, Prabhakaran N, Ramesh A, Edwin D, Usha Rani MV, et al (2004) DNA sequence variation and haplotype structure of the ICAM1 and TNF genes in 12 ethnic groups of India reveal patterns of importance in designing association studies. Ann Hum Genet 68:574–587 [PubMed] [Cross Ref]10.1046/j.1529-8817.2003.00126.x
46. Pritchard JK, Rosenberg NA (1999) Use of unlinked genetic markers to detect population stratification in association studies. Am J Hum Genet 65:220–228 [PMC free article] [PubMed]
47. Stein LD, Mungall C, Shu S, Caudy M, Mangone M, Day A, Nickerson E, Stajich JE, Harris TW, Arva A, et al (2002) The Generic Genome Browser: a building block for a model organism system database. Genome Res 12:1599–1610 [PMC free article] [PubMed] [Cross Ref]10.1101/gr.403602
48. Patterson N, Price AL, Reich D (2006) Population structure and eigenanalysis. PLoS Genet 2:e190 [PMC free article] [PubMed] [Cross Ref]10.1371/journal.pgen.0020190
49. Giebel LB, Strunk KM, King RA, Hanifin JM, Spritz RA (1990) A frequent tyrosinase gene mutation in classic, tyrosinase-negative (type IA) oculocutaneous albinism. Proc Natl Acad Sci USA 87:3255–3258 [PMC free article] [PubMed] [Cross Ref]10.1073/pnas.87.9.3255
50. Newton JM, Cohen-Barak O, Hagiwara N, Gardner JM, Davisson MT, King RA, Brilliant MH (2001) Mutations in the human orthologue of the mouse underwhite gene (uw) underlie a new form of oculocutaneous albinism, OCA4. Am J Hum Genet 69:981–988 [PMC free article] [PubMed]
51. Lao O, de Gruijter JM, van Duijn K, Navarro A, Kayser M (2007) Signatures of positive selection in genes associated with human skin pigmentation as revealed from analyses of single nucleotide polymorphisms. Ann Hum Genet 71:354–369 [PubMed] [Cross Ref]10.1111/j.1469-1809.2006.00341.x
52. Nakayama K, Fukamachi S, Kimura H, Koda Y, Soemantri A, Ishida T (2002) Distinctive distribution of AIM1 polymorphism among major human populations with different skin color. J Hum Genet 47:92–94 [PubMed] [Cross Ref]10.1007/s100380200007
53. Duffy DL, Montgomery GW, Chen W, Zhao ZZ, Le L, James MR, Hayward NK, Martin NG, Sturm RA (2007) A three–single-nucleotide polymorphism haplotype in intron 1 of OCA2 explains most human eye-color variation. Am J Hum Genet 80:241–252 [PMC free article] [PubMed]
54. Flanagan N, Healy E, Ray A, Philips S, Todd C, Jackson IJ, Birch-Machin MA, Rees JL (2000) Pleiotropic effects of the melanocortin 1 receptor (MC1R) gene on human pigmentation. Hum Mol Genet 9:2531–2537 [PubMed] [Cross Ref]10.1093/hmg/9.17.2531
55. Valverde P, Healy E, Jackson I, Rees JL, Thody AJ (1995) Variants of the melanocyte-stimulating hormone receptor gene are associated with red hair and fair skin in humans. Nat Genet 11:328–330 [PubMed] [Cross Ref]10.1038/ng1195-328

Articles from American Journal of Human Genetics are provided here courtesy of American Society of Human Genetics
PubReader format: click here to try


Related citations in PubMed

See reviews...See all...

Cited by other articles in PMC

See all...


Recent Activity

Your browsing activity is empty.

Activity recording is turned off.

Turn recording back on

See more...