![]() | ![]() |
Formats:
|
||||||||||||||||||||||
Copyright © 2005 by The American Society of Human Genetics. All rights reserved. An Algorithm to Construct Genetically Similar Subsets of Families with the Use of Self-Reported Ethnicity Information 1Department of Biostatistics, University of Michigan, Ann Arbor Address for correspondence and reprints: Dr. Andrew Skol, Department of Biostatistics, University of Michigan, 1420 Washington Heights, Ann Arbor, MI 48109-2029. E-mail: askol/at/umich.edu *VA Cooperative Study 366 Investigators are listed in the acknowledgments. Received March 31, 2005; Accepted June 15, 2005. This article has been cited by other articles in PMC.Abstract We present a simple algorithm that uses self-reported ethnicity information, pedigree structure, and affection status to group families into genetically more homogeneous subsets. This algorithm should prove useful to researchers who wish to perform genetic analyses on more-homogeneous subsets when they suspect that ignoring heterogeneity could lead to false-positive results or loss of power. We applied our algorithm to the self-reported ethnicity information of 159 families from the Veterans Affairs Cooperative Study of schizophrenia. We compared these estimates of population membership with those obtained using the program structure in an analysis of 378 microsatellite markers. We found excellent concordance between family classifications determined using self-reported ethnicity information and our algorithm and those determined using genetic marker data and structure; 158 of the 159 families had concordant classifications. In addition, the degree of admixture estimated using our algorithm and self-reported ethnicity information correlated well with that predicted using the genotype information. Introduction Many genetic studies of psychiatric illness and other complex disorders are performed with samples from ethnically stratified populations. Linkage and case-control association analyses are sensitive to population heterogeneity of disease etiology and marker allele frequencies (Curtis and Sham 1996; Deng 2001). Using an averaged allele-frequency distribution can result in an overestimate or underestimate of the degree of sharing between affected relatives when parental genotypes are not available (Curtis and Sham 1996). Given possible heterogeneity between populations, it may be advisable to perform analyses separately for the families from each population, in addition to performing a joint analysis in which all families are analyzed together. In genetic studies of psychiatric disease, a common instrument used to determine a subject’s psychiatric diagnosis is the Diagnostic Interview for Genetic Studies (DIGS) (Nurnberger et al. 1994). The interview includes a question regarding the ethnicity of the subject’s parents (appendix A). Generally, studies that have used the DIGS or a similar instrument and have performed analyses separately by ethnic or racial group have not disclosed how they defined or determined group membership of families, but they likely used imprecise definitions and ad hoc methods to assign them (Cloninger et al. 1998; Funke et al. 2004). Here, we propose an algorithm that uses the information from the DIGS, family structure, affection status, and a simple decision rule to determine group membership for each family for the purposes of genetic analysis. The effectiveness of using self-reported ethnicity data to generate genetically homogeneous samples is as dependent on the subjects’ interpretation of the interviewer’s ethnicity question as it is on how genetically heterogeneous the ethnicities are. Sankar and Cho (2002) offer a useful discussion on the need for researchers, when they are performing studies that incorporate ethnicity as an explanatory variable, to understand the factors that influence which ethnic groups a subject identifies with. To assess the performance of our algorithm, we applied it to data from a recently completed genetic linkage study of schizophrenia (Tsuang et al. 2000; Faraone et al., in press). We compared the estimates of ancestral background obtained by using our algorithm and the DIGS data with those obtained by using the program structure and the analysis of data on 378 microsatellite markers (Pritchard et al. 2000; Falush et al. 2003), which estimates population membership with the use of genotype data. When we used a simple rule that classified a family as European American (EA) or African American (AA) when the probability of the family belonging to that population was estimated to be >50%, 158 of the 159 families were classified concordantly by our algorithm and by structure (κ=0.99). In addition, families that were estimated to be admixed by our algorithm were also shown to have substantial admixture by structure analysis. Methods The purpose of our algorithm is to divide samples into genetically more homogenous groups of families. This goal was motivated by a desire to perform linkage and family-based association analyses of our Veterans Affairs (VA) schizophrenia data. When parental genotypes are unavailable, evidence for linkage is sensitive to misspecification of parental allele frequencies (Curtis and Sham 1996; Deng 2001), hence the importance of separating families into homogeneous subsets. Populations that historically have been separated by large distances or geographical boundaries are often genetically heterogeneous, and the longer these populations have been isolated and the more varied their environments, the more heterogeneous they are likely to be (Zhivotovsky et al. 2003). The ethnic or racial groups that individuals are operationally assigned to in epidemiological contexts are usually correlated with the continental region of a person’s ancestors in the immediate pre-Columbian era, and so the percentages of ancestry tracing to different continental regions are reasonable variables to use to try to construct more-homogenous subsets of families. Our algorithm summarizes self-reported ethnicity information from family members into a family-specific ancestral-group classification in four steps. First, we convert each pedigree member’s response to the DIGS ethnicity question into maternal and paternal ancestry classification vectors. Second, we estimate the ancestry classification vectors of the pedigree’s founders from the information of their descendants. Third, we calculate the family ancestry classification vector as a weighted average of the founders’ vectors, where a founder’s weight is the sum of the kinship coefficients (Wright 1984) between the founder and affected individuals. Fourth, we assign a summary family ancestry based on values in the family’s classification vector. We present our algorithm with the assumption that the ethnicity information is coming from the DIGS, since this is a likely source of diagnostic and ethnicity information for psychiatric genetic studies. It should be straightforward to adjust our algorithm to use information from other sources of self-reported ethnicity. Step 1: Convert DIGS Ethnicity Responses into Vectors of Ancestry Proportions The DIGS question regarding ethnic makeup is given in appendix A. The question asks about the ethnicity of the subject’s parents. We store the subject’s responses to the question in two vectors, one for each parent. The elements of these vectors contain, in principle, the probability that a randomly chosen gene from the parent traces to a particular continental population. Since our VA sample is almost exclusively of European or African origin, we categorized the ethnicities listed in appendix A into AA (15), EA (1–7), and other (OT) (8–14 and 16). Each respondent is assigned a maternal and paternal ancestry vector, each composed of the proportion of responses corresponding to AA, EA, and OT. Each pedigree member that did not complete the DIGS is assigned a single ancestry vector initialized to [0, 0, 0]. Unlike those who complete the DIGS, these nonrespondents do not require maternal and paternal ancestry vectors; rather, they require only a single vector, since information inferred about their ancestral makeup is an amalgam of their parents’ information and cannot be decomposed into parent-specific information. Figure 1
Step 2: Infer the Founders' Ancestry Vectors In this step, we estimate the ancestry vectors of the founders from the information provided by their descendants. We repeat the following five actions until the founders are reached. First, we identify all individuals with parents but no offspring in the current version of the pedigree. The second, third, fourth, and fifth actions are performed only on these individuals. Second, the ancestry vectors are normalized, if necessary, by dividing each vector element by the sum of the elements; this will not be required for individuals with no offspring in the original pedigree or those who completed the DIGS. Third, each ancestry vector is multiplied by a weight. For our sample, affected individuals, because of possible impaired cognition, communication, and social interaction, are considered to have less reliable responses and are given smaller weights than unaffected individuals. We assign weight 1 to the ancestry vectors of unaffected offspring and weight 0.001 to those of affected offspring. This scheme allows use of the affected offspring responses when no unaffected offspring information is available and virtually ignores this information when responses from unaffected offspring are available. Fourth, for each individual who completed the DIGS, the maternal (or paternal) ancestry vector is added to the mother’s (or father’s) vector, provided that the mother (or father) did not complete the DIGS. For example, if a mother and her son both completed the DIGS, then the information the son reports about his mother’s ethnicity is ignored, and only the information the mother provided is used. For individuals who do not complete the DIGS, their ancestry vector is added to their mother’s (or father’s) vector, provided that she (or he) did not complete the DIGS. Note that a respondent, who has both maternal and paternal ancestry vectors, provides parent-specific ethnicity information, whereas a nonrespondent, who has only a single race vector, provides, at best, an average of his or her parents’ ethnicity information, for both parents. Fifth, the individuals processed from the current pedigree are removed, and we return to the first action. When the founders are reached, their ancestry vectors are normalized. If a founder completed the DIGS, the maternal and paternal ancestry vectors are averaged. We demonstrate this procedure for the pedigree in figure 1 In the example, affected individuals’ information was downweighted. Had we chosen to weight each individual’s information equally, the information provided by individual 4 would have been weighted the same as that from her brother, individual 5. This would have given classification vectors [.75, 0, .25] and [1, 0, 0] for founders 1 and 2, respectively, rather than [1, 0, 0] for each. Step 3: Calculate the Family’s Ancestry Classification Vector We calculate the family’s classification vector as a weighted average of the founders’ ancestry vectors. For weights, we sum the kinship coefficients between the founder and all affected family members. The kinship coefficient between relatives x and y ( xy) is the probability that an allele selected at random from an arbitrary locus in x and an allele selected at random from the same locus in y are identical by descent. For a family with F founders and A affected pedigree members, we write the family’s race classification vector as
ij and ri can be found in figure 1Step 4: Assign Family Ancestry We assign a family to an ancestry if the family’s vector element for that ancestry exceeds a cutoff value. For our VA study, we assigned an ancestry of AA (or EA) if the AA (or EA) element of the family ancestry vector was >0.50. The ancestry vector of the family in figure 1 Application We applied our algorithm to the data of 159 families from our VA Cooperative Studies schizophrenia-study sample. The families had an average of 2.7 affected individuals (range 2–6) and 4.6 genotyped individuals (range 2–14) per family. The markers used were primarily from the ABI Prism Linkage Mapping set, version 2.5, HD5 (Applied Biosystems; see Web Resources). Comparison with structure The program structure (Pritchard et al. 2000; Falush et al. 2003) implements a Bayesian approach to estimate the proportion of each individual’s genome derived from each of K populations, where K is user specified. Also, structure estimates the joint distribution of the marker allele frequencies of the K populations, the admixture of each individual, and the population origin of each allele by use of Markov chain–Monte Carlo simulation. Details of the algorithm can be found in Falush et al. (2003). We ran structure on our autosomal genome scan data of 378 microsatellite markers, using K=2 and K=3. We ran the analyses once using all 732 genotyped individuals from the 159 families and again using only the 187 genotyped founders. The use of only founders is consistent with structure’s assumption that sample individuals are unrelated. The use of all individuals takes advantage of all available information and so is better able to overcome structure’s prior distribution that individuals are an equal admixture of the K populations, but it overestimates the actual information present. When we used all genotyped individuals in the structure analysis, we estimated family race vectors by averaging the estimated proportion of population membership for the affected members. When we used only founders, we estimated family ancestry vectors using the estimates from structure and our algorithm as described above. Results Table 1 displays the frequency with which the most common ethnicity responses were given. Figure 2
Figure 3
We next compared the family race vector estimates from our algorithm with the use of the self-reported data to those estimated by structure with the use of genotype information (table 4 and fig. 4
Discussion We presented a simple algorithm that transforms self-reported population affiliation, pedigree structure, affection status, and perceived validity of the self-reported information into a family ancestry classification. Our algorithm should prove useful to researchers who wish to perform linkage analyses or other family-based analyses when population heterogeneity is a potential factor that could lead to excess false positives or loss of power. Although our algorithm was designed for ethnicity data reported from the DIGS, in which subjects report their parents’ ethnicities, it can easily be adapted to other information formats, such as those in which individuals report their own ethnicities. If such data are collected, our algorithm could be implemented by assuming that the individual responded that both parents’ ethnicities are the same as his or her own. If a large amount of genotype information is already available from the families, our algorithm can be used in concert with structure as we demonstrated above. If no or little genotype information is available and a subset of families with ethnicity information is available to choose from (e.g., National Institute of Mental Health [NIMH] Schizophrenia Genetics Initiative; see Web Resources), our algorithm could be applied to select a collection of families that have a desired distribution of ethnic backgrounds. We found almost perfect concordance between population assignments determined with the use of our algorithm and those determined with structure; 158 of 159 families were classified concordantly (κ=0.99). We also discovered that, for predominantly AA families, the amount of admixture predicted from the self-reported information was consistent with that found by structure (Spearman correlation coefficient = 0.85). In addition, structure's estimate that AA individuals have, on average, 18% EA admixture is relatively consistent with the findings of Parra et al. (1998) and Shriver et al. (2003), who found that an AA sample from the Washington, D.C., area had genetic contribution estimates of 79%, 18%, and 3% from AA, EA, and Native American populations, respectively. In contrast, for predominantly EA families, we found little correlation (r=0.06). Figure 4 Our algorithm likely performed as well as it did in part because the major groups into which we classified ethnicities—AA and EA—are quite genetically disparate. We also expect our algorithm might work well with Hispanic American data. Distinguishing more subtle ethnic differences may prove challenging and will likely depend on how recently the groups of interest became isolated or admixed. For example, it is possible that individuals from some Hispanic population can accurately report the amount of Native American and European admixture of their ancestors (Bonilla et al. 2004). In this instance, our algorithm should perform well. However, it is less clear how our algorithm would perform if we were trying to discern the subtle regional ethnic subdivisions of the Icelandic population (Helgason et al. 2005). Subjects from this population are less likely to be aware of these more subtle divisions and so would likely provide less precise responses to ethnicity questions that try to capture the ethnic differences that exist between regions. Additionally, there is likely to be much less genetic variation between subjects from the different regions, leading to groups that, even if respondents report accurately, would be difficult to distinguish, even with methods such as those employed by structure (Rosenberg et al. 2002). There are a number of ways in which our algorithm could be extended or modified. We could weight offspring information on the basis of the number of generations since the information was initially reported, where information from more recent generations would be weighted more heavily than information originating from more distant generations. This would penalize information coming from distant relatives—a reasonable action, given that half the information about an individual is lost in each generation. Recall that, in our algorithm, information about a father (or mother) provided by his (or her) offspring would be ignored if the father (or mother) provided information about his (or her) own parents. One could argue that the offspring’s information should be used. This could be accommodated by adding weighted race vectors of the offspring to the parent’s race vector. However, we feel that ignoring the offspring’s information is logical, since, in general, self-reported information has better reliability than “secondhand” information. Further, as noted above, parents report information about both of their parents, whereas offspring report about the aggregate of their parents. One scenario in which this approach would be prudent is when an affected parent and at least one of his or her unaffected children complete the DIGS. This would then allow the information from the unaffected child to supersede, or at least contribute, to the possibly less reliable information from the parent. We assigned families to AA and EA populations on the basis of the simple rule that the AA or EA element of the family ancestry vector must exceed 0.5. If researchers wish to be more discriminating, they could increase the threshold for being assigned to an ancestral group to a value >0.5. The decision rule could also employ a measure of the variance of the family ancestry vector using a cross-validation–type method. We chose to calculate a weighted average of the founders’ classification vectors to determine a family-specific ancestral group. An alternative to identifying an ancestry for each family is to determine the most likely population to which each founder belongs. This would allow us to use population allele-frequency estimates that are appropriate for each founder and, in principle, would allow an even more accurate calculation of the pedigree likelihood. However, no widely used linkage analysis package allows the use of different allele-frequency estimates for a pedigree’s founders. Our algorithm could also be implemented to estimate family-averaged phenotypes. This could be useful if an investigator wishes to separate families on the basis of some phenotype before performing another type of analysis. Ordered subset analysis (OSA) (Hauser et al. 2004) is one such analysis. OSA sorts families by a suspected confounding variable, and a dichotomy in the evidence for linkage between those with high values of the variable and those with low values is tested. In summary, we have developed a simple, practical, and flexible algorithm that allows researchers to assign family-specific ancestries when only self-reported ethnicity information is available, such as when the DIGS is used. In addition to including the relationships among family members, the algorithm can incorporate the confidence the investigator has in the accuracy of the collected information. Further, this algorithm is not restricted to ethnicity data—it can also be applied to any quantitative or categorical traits. Acknowledgments Veterans Affairs Cooperative Study 366 investigators are C. Baldwin, S. Bingham, J. Collins, S. V. Faraone, S. L. Haverstock, T. Keith, F. Mena, A. S. Menon, S. Prabhudesai, F. Sautter, G. D. Schellenberg, D. W. Tsuang, M. T. Tsuang, D. Weiss, and K. A. Young. We thank Jodi vanden Eng, for her valuable discussions and input at the early stages of this project, and Jeff Long, for his careful reading of our manuscript and valuable comments, which resulted in a more thoughtful handling of ethnicity and race issues. We gratefully acknowledge support from National Institutes of Health grant HG00376 (to M.B.), the Department of Veterans Affairs Cooperative Studies Program, and a Veterans Affairs Merit Review. Appendix A: DIGS Ethnicity Question Question: “What is the ethnic background of your biological parents?” Interviewer instruction: “Code up to four ethnicities on maternal and paternal sides, if possible.” Note that interviewers were directed to ask the question as written and to provide explanations and prompting if necessary. For example, if the respondent replies “white” or “black,” then the interviewer might say, “Let’s start with your mother; what is her ethnic background? What is the ethnic background of her parents? Do you know the ethnic background of their parents?” These questions would then be repeated for the paternal side. Multiple entries of the same ethnicity value within the mother’s (or father’s) response column implies that both of the mother’s (or father's) parents identified themselves, at least partially, with this ethnicity. Response was recorded with the following ethnicities: 01 = Anglo-Saxon 02 = Northern European (e.g., Norwegian) 03 = West European (e.g., French, German) 04 = East European, Slavic 05 = Russian 06 = Mediterranean 07 = Ashkenazi Jew 08 = Sephardic Jew 09 = Hispanic (not Puerto Rican) 10 = Puerto Rican Hispanic 11 = Mexican Hispanic 12 = Asian 13 = Arab 14 = Native American/Alaskan Native 15 = African American 16 = Other, Specify:_______UU = Unknown MOTHER FATHERa)_______ e)_______b)_______ f)_______c)_______ g)_______d)_______ h)_______Web Resources The URLs for data presented herein are as follows: Applied Biosystems, http://home.appliedbiosystems.com/ (for ABI Prism Linkage Mapping set, version 2.5, HD5). NIMH Schizophrenia Genetics Initiative, http://zork.wustl.edu/nimh/sz.html. References Bonilla C, Parra EJ, Pfaff CL, Dios S, Marshall JA, Hamman RF, Ferrell RE, Hoggart CL, McKeigue PM, Shriver MD (2004) Admixture in the Hispanics of the San Luis Valley, Colorado, and its implications for complex trait gene mapping. Ann Hum Genet 68:139–153 [PubMed] doi: 10.1046/j.1529-8817.2003.00084.x. Cloninger RC, Kaufmann CA, Faraone SV, Malaspina D, Svrakic DM, Harkavy-Friedman J, Suarez BK, Matise TC, Shore D, Lee H, Hampe CL, Wynne D, Drain C, Markel PD, Zambuto CT, Schmitt K, Tsuang MT (1998) Genome-wide search for schizophrenia susceptibility loci: the NIMH genetics initiative and millennium consortium. Am J Med Genet 81:275–281 [PubMed] doi: 10.1002/(SICI)1096-8628(19980710)81:4<275::AID-AJMG1>3.0.CO;2-T. Curtis D, Sham PC (1996) Population stratifications can cause false positive linkage results if founders are untyped. Ann Hum Genet 60:261–263 [PubMed] Deng HW (2001) Population admixture may appear to mask, change or reverse genetic effects of genes underlying complex traits. Genetics 159:1319–1323 [PubMed] Falush D, Stephens M, Pritchard JK (2003) Inference of population structure using multilocus genotype data: linked loci and correlated allele frequencies. Genetics 164:1567–1587 [PubMed] Faraone SV, Skol AD, Tsuang DW, Young KA, Haverstock SL, Prabhudesai S, Mena F, Menon AS, Leong L, Sautter F, Baldwin C, Bingham S, Weiss D, Collins J, Keith T, vanden Eng JL, Boehnke M, Tsuang MT, Schellenberg GD. Genome scan of schizophrenia families in a large veterans affairs cooperative study sample: evidence for linkage to 18p11.32 and for racial heterogeneity on chromosomes 6 and 14. Am J Hum Genet (in press). Funke B, Finn CT, Plocik AM, Lake S, DeRosse P, Kane JM, Kucherlapati R, Malhotra AK (2004) Association of the DTNBP1 locus with schizophrenia in a U.S. population. Am J Hum Genet 75:891–898 [PubMed] Hauser ER, Watanabe RM, Duren WL, Bass MP, Langefeld CD, Boehnke M (2004) Ordered subset analysis in genetic linkage mapping of complex traits. Genet Epidemiol 27:53–63 [PubMed] doi: 10.1002/gepi.20000. Helgason A, Yngvadottir B, Hrafnkelsson B, Gulcher J, Stefansson K (2005) An Icelandic example of the impact of population structure on association studies. Nat Genet 37:90–95 [PubMed] Nurnberger JI, Blehar MC, Kaufmann CA, Yorkcooler C, Simpson SG, Harkavy-Friedman J, Severe JB, Malaspina D, Reich T, Miller M, Bowman ES, Depaulo JR, Cloninger CR, Robinson G, Modlin S, Gershon ES, Maxwell E, Guroff JJ, Kirch D, Wynne D, Berg K, Tsuang MT, Faraone SV, Pepple JR, Ritz AL (1994) Diagnostic interview for genetic studies—rationale, unique features, and training. Arch Gen Psychiatry 51:849–859 [PubMed] Parra EJ, Marcini A, Akey J, Martinson J, Batzer MA, Cooper R, Forrester T, Allison DB, Deka R, Ferrell RE, Shriver MD (1998) Estimating African American admixture proportions by use of population-specific alleles. Am J Hum Genet 63:1839–1851 [PubMed] Pritchard JK, Stephens M, Donnelly PJ (2000) Inference of population structure using multilocus genotype data. Genetics 155:945–959 [PubMed] Risch N, Burchard E, Ziv E, Hua T (2002) Categorization of humans in biological research: genes, race and disease. Genome Biol 3:2007.1–2007.12 [PubMed] Rosenberg NA, Pritchard JK, Weber JL, Cann HM, Kidd KK, Zhivotovsky LA, Feldman MW (2002) Genetic structure of human populations. Science 298:2381–2385 [PubMed] doi: 10.1126/science.1078311. Sankar P, Cho MK (2002) Toward a new vocabulary of human genetic variation. Science 298:1337–1338 [PubMed] doi: 10.1126/science.1074447. Shriver MD, Parra EJ, Dios S, Bonilla C, Norton H, Jovel C, Pfaff C, Jones C, Massac A, Cameron N, Baron A, Jackson T, Argyropoulos G, Jin L, Hoggart CJ, McKeigue PM, Kittles RA (2003) Skin pigmentation, biogeographical ancestry and admixture mapping. Hum Genet 112:387–399 [PubMed] Tsuang MT, Faraone SV, Bingham S, Young K, Prabhudesai S, Haverstock SL, Mena F, Menon AS, Pepple J, Johnson J, Baldwin C, Weiss D, Collins J (2000) Department of Veterans Affairs Cooperative Studies Program genetic linkage study of schizophrenia: ascertainment methods and sample description. Am J Med Genet 96:342–347 [PubMed] doi: 10.1002/1096-8628(20000612)96:3<342::AID-AJMG21>3.0.CO;2-J. Wright S (1984) Evolution and the genetics of populations, volume 2: theory of gene frequencies. University of Chicago Press, Chicago. Zhivotovsky LA, Rosenberg NA, Feldman MW (2003) Features of evolution and expansion of modern humans, inferred from genomewide microsatellite markers. Am J Hum Genet 72:1171–1186 [PubMed] |
PubMed related articles
Your browsing activity is empty. Activity recording is turned off. |
|||||||||||||||||||||
Ann Hum Genet. 1996 May; 60(Pt 3):261-3.
[Ann Hum Genet. 1996]Genetics. 2001 Nov; 159(3):1319-23.
[Genetics. 2001]Arch Gen Psychiatry. 1994 Nov; 51(11):849-59; discussion 863-4.
[Arch Gen Psychiatry. 1994]Am J Med Genet. 1998 Jul 10; 81(4):275-81.
[Am J Med Genet. 1998]Am J Hum Genet. 2004 Nov; 75(5):891-8.
[Am J Hum Genet. 2004]Science. 2002 Nov 15; 298(5597):1337-8.
[Science. 2002]Am J Med Genet. 2000 Jun 12; 96(3):342-7.
[Am J Med Genet. 2000]Genetics. 2000 Jun; 155(2):945-59.
[Genetics. 2000]Genetics. 2003 Aug; 164(4):1567-87.
[Genetics. 2003]Ann Hum Genet. 1996 May; 60(Pt 3):261-3.
[Ann Hum Genet. 1996]Genetics. 2001 Nov; 159(3):1319-23.
[Genetics. 2001]Am J Hum Genet. 2003 May; 72(5):1171-86.
[Am J Hum Genet. 2003]Genetics. 2000 Jun; 155(2):945-59.
[Genetics. 2000]Genetics. 2003 Aug; 164(4):1567-87.
[Genetics. 2003]Am J Hum Genet. 1998 Dec; 63(6):1839-51.
[Am J Hum Genet. 1998]Hum Genet. 2003 Apr; 112(4):387-99.
[Hum Genet. 2003]Genome Biol. 2002 Jul 1; 3(7):comment2007.
[Genome Biol. 2002]Ann Hum Genet. 2004 Mar; 68(Pt 2):139-53.
[Ann Hum Genet. 2004]Nat Genet. 2005 Jan; 37(1):90-5.
[Nat Genet. 2005]Science. 2002 Dec 20; 298(5602):2381-5.
[Science. 2002]Genet Epidemiol. 2004 Jul; 27(1):53-63.
[Genet Epidemiol. 2004]