Principal component analyses of substructure in a diverse set of subjects of European descent. A graphic representation of the first two PCs, based on analysis with >250,000 SNPs, is shown. The color code indicates the population group of each subject. The subjects included Adygei (ADY, 12 subjects), Ashkenazi Jewish American (AJA, 40 subjects), Basque (BAS, 12 subjects), Bedouin (BDN, 23 subjects), CEPH European American (CEU, 48 subjects), Druze (20 subjects), Eastern European American (EEUR, 11 subjects), German American (GERM, 17 subjects), Greek American (GRK, 7 subjects), Hungarian American (HUN, 4 subjects), Irish (IRISH, 84 subjects), Italian American (ITN, 20 subjects), northern Italian (ITN_N, 13 subjects), Dutch American (NETH, 3 subjects), Orcadian (ORC, 14 subjects), Palestinian (PAL, 22 subjects), Russian (RUS, 13 subjects), Sardinian (SARD, 28 subjects), Scandinavian American (SCAN, 6 subjects), Spanish (SPAIN, 12 subjects), Swedish (SWED, 591 subjects), Tuscan (TUSC, 8 subjects), and United Kingdom American (UK, 5 subjects). Each subject in a color-coded country or ethnic group had consistent origin information for all four grandparents. The total number of individuals in this analysis was 4,446. (A) European Americans (EURA) without four-grandparent origin information are shown (both NYCP and CHOP subjects). (B) and (C) show the distribution of the EURA subjects from NYCP (1,873 subjects) and CHOP (1,488 subjects), respectively.

