![]() | ![]() |
Formats:
|
||||||||||||||||||||||||||
Copyright © 2008 by the Genetics Society of America Testing for Archaic Hominin Admixture on the X Chromosome: Model Likelihoods for the Modern Human RRM2P4 Region From Summaries of Genealogical Topology Under the Structured Coalescent *ARL Division of Biotechnology, †Department of Ecology and Evolutionary Biology and ‡Department of Anthropology, University of Arizona, Tucson, Arizona 85721, §Department of Animal and Human Biology, University of Rome “La Sapienza,” and Istituto Italiano di Antropologia, 00185 Rome, Italy and **Department of Anthropology, University of Michigan, Ann Arbor, Michigan 48109 1Corresponding author: ARL Division of Biotechnology, Life Sciences South, University of Arizona, Tucson, AZ 85721. E-mail: mfh/at/u.arizona.edu Communicating editor: N. Takahata Received August 13, 2007; Accepted November 11, 2007. This article has been cited by other articles in PMC.Abstract A 2.4-kb stretch within the RRM2P4 region of the X chromosome, previously sequenced in a sample of 41 globally distributed humans, displayed both an ancient time to the most recent common ancestor (e.g., a TMRCA of ~2 million years) and a basal clade composed entirely of Asian sequences. This pattern was interpreted to reflect a history of introgressive hybridization from archaic hominins (most likely Asian Homo erectus) into the anatomically modern human genome. Here, we address this hypothesis by resequencing the 2.4-kb RRM2P4 region in 131 African and 122 non-African individuals and by extending the length of sequence in a window of 16.5 kb encompassing the RRM2P4 pseudogene in a subset of 90 individuals. We find that both the ancient TMRCA and the skew in non-African representation in one of the basal clades are essentially limited to the central 2.4-kb region. We define a new summary statistic called the minimum clade proportion (pmc), which quantifies the proportion of individuals from a specified geographic region in each of the two basal clades of a binary gene tree, and then employ coalescent simulations to assess the likelihood of the observed central RRM2P4 genealogy under two alternative views of human evolutionary history: recent African replacement (RAR) and archaic admixture (AA). A molecular-clock-based TMRCA estimate of 2.33 million years is a statistical outlier under the RAR model; however, the large variance associated with this estimate makes it difficult to distinguish the predictions of the human origins models tested here. The pmc summary statistic, which has improved power with larger samples of chromosomes, yields values that are significantly unlikely under the RAR model and fit expectations better under a range of archaic admixture scenarios. FOSSIL, archaeological, and genetic data all lend support to the hypothesis that Homo sapiens originated in Africa (McBrearty and Brooks 2000; McDougall et al. 2005; Garrigan and Hammer 2006). With the acceptance of the role of Africa in our species' origin, there is now increasing interest in the question of how the ancestral population that gave rise to anatomically modern humans (AMH) was structured. Did AMH emerge from a single, isolated African deme or from a subdivided ancestral population with gene flow among subpopulations? A related question is whether the expanding AMH population completely replaced or interbred with then contemporaneous archaic populations such as Neanderthals and H. erectus (Eswaran 2002; Templeton 2002; Garrigan et al. 2005a,b; Plagnol and Wall 2006; Wall and Hammer 2006). Early studies of nonrecombining regions such as mtDNA and the Y chromosome were consistent with the hypothesis of a single origin followed by complete replacement, sometimes referred to as the recent African replacement (RAR) model. While many of the more recently published DNA sequencing studies of X-linked and autosomal loci are also concordant with this RAR model, a growing number are not (Evans et al. 2006; Garrigan and Hammer 2006). Garrigan et al. (2005b) published one of the first studies to posit recent admixture between AMH and an archaic human population. A resequencing study of 2.4 kb of the ribonucleotide reductase M2 pseudogene 4 (RRM2P4) in a sample of 41 globally diverse humans identified an unusual pattern of nucleotide polymorphism compared with most of the human genome. The reconstructed gene tree revealed two clades of allelic sequences that were estimated to have diverged ~2 million years ago (MYA). One clade with very little sequence variation was specific to Asians, while the other more diverse clade of RRM2P4 sequences resembled a pattern typical of human variation and was globally distributed. By genotyping a diagnostic SNP in a large sample of humans, the divergent “Asian clade” was shown to be frequent in Southeast Asians and nearly absent in sub-Saharan Africans. The greater genealogical depth in Asia led to the hypothesis that RRM2P4 is a genomic remnant of introgressive hybridization from an Asian archaic population (H. erectus) into AMH groups expanding from Africa. However, despite the increasing availability of genomic data, the uniqueness of such deep times to the most recent common ancestor (TMRCA) and of gene trees with non-African basal clades remains unclear. Importantly, we still do not understand the likelihood of such genealogies under either the RAR model or models involving admixture with archaic humans. Garrigan et al.'s original RRM2P4 study was limited by a paucity of sequence data and primarily qualitative analyses. Here, we extend earlier research by sequencing the 2.4-kb RRM2P4 region in 131 African and 122 non-African individuals. We also determine the extent of sequence that exhibits the unusual pattern of polymorphism by selectively resequencing three DNA fragments totaling 5.6 kb within a 16.5-kb window flanking RRM2P4. We infer the likelihood of the observed RRM2P4 genealogy using a suite of summary statistics and Monte Carlo coalescent simulations under the RAR model and a range of archaic admixture (AA) models (Nordborg 2000; Plagnol and Wall 2006). The RAR models are parameterized by approximate Bayesian computation (ABC) conditioned on resequence data from an additional data set of 19 neutral, unlinked X-chromosomal loci. This resequence data set represents 12 Mb from 19 regions of the X chromosome, which are unlinked from genes (and each other) by medium to high rates of recombination (≥1.0 cM/Mb). These loci, which represent selectively neutral X-chromosomal diversity, will be published elsewhere. However, using these data, we show here that simple RAR models often produce TMRCA values that are similar to that of RRM2P4, while genealogies with skewing of basal clade membership toward non-Africans remain statistical outliers. SUBJECTS AND METHODS Regions sequenced: Resequence data for the RRM2P4 locus were generated within a “trio” design (Figure 1
Sampling: The RRM2P4 central region was sequenced in 131 African and 122 non-African individuals (panel A). These samples were chosen without prior information about RRM2P4 lineage status. DNA samples representing Mandenka from Senegal (n = 16), Biaka Pygmies from the Central African Republic (n = 16), Khoisan from Namibia (n = 9), French Basque (n =16), Han Chinese (n =16), and Nasioi from Bougainville (n = 16) were purchased from the Centre d'Etude du Polymorphisme Humain (Cann et al. 2002). Samples of Baka Pygmies from Cameroon (n = 23) were provided by Giovanni Destro-Bisol and the Dogon from Mali (n = 32) were provided by Beverly Strassmann. Samples from the Dinka of southern Sudan (n = 21) were collected in Tucson, Arizona, with informed consent. Samples from three Siberian populations, the Selkups (n = 32), Forest Nentsi (n = 28), and Tundra Nentsi (n = 3), were described previously (Karafet et al. 2002). Non-population-based samples (n = 25) from the Y chromosome consortium cell lines (Y Chromosome Consortium 2002) were also included in this panel. Resequence data for the full trio were generated in a second panel of 42 African and 48 non-African individuals (panel B) from three African groups (Khoisan, Mandenka, and Biaka) and three non-African groups (French Basque, Han, and Nasioi) (samples as described above). All sampling protocols were approved by the Human Subjects Committee at the University of Arizona and by the institutions of all collaborators who provided DNA samples. Recombination analysis and genetree dating: Rates of linkage disequilibrium across the sequence were determined using LDhat (McVean and Spencer 2006). The RRM2P4 central region shows only limited signs of recombination, and a most parsimonious tree was reconstructed by breaking low-frequency reticulations. The TMRCA of the tree and the age of its polymorphisms were estimated with Genetree (Griffiths 2007). Genetree employs a full maximum-likelihood method that is based on the standard coalescent (Kingman 1982) and assumes an infinite-sites mutational model. Likelihood surfaces for the population mutation rate, θ, and the population growth rate, β, were generated under a panmictic single-deme model and an island model of population structure. TMRCA values were inferred from maximum-likelihood parameterizations under both models. Demographic models: Summaries of the RRM2P4 central region were compared with values obtained from simulations under a structured coalescent (Nordborg 1997; Hudson 2002) to determine the likelihood of the observed genealogy. We employed a framework for human demography similar to that developed by Plagnol and Wall (2006), but modified (see below) to yield the number of segregating sites, S, consistent with an independent data set of 19 X chromosome loci sequenced in the same individuals (panel B). We present results from two demographic models: a two-deme RAR model and an AA scenario similar to Nordborg's (2000) isolation and admixture model (Figure 2
The RAR model depicts two panmictic, exponentially growing Wright–Fisher demes representing African and non-African populations. Growth begins in the African deme 80,000 years ago from a single ancestral population (Ne = 104) and continues until it reaches its current effective size. A small group (Ne = 500) splits from the African deme 60,000 years ago to form the non-African deme. This subgroup experiences a bottleneck for 5000 years before expanding exponentially to its current effective size. Modern effective sizes and the intercontinental migration rate were parameterized by approximate Bayesian computation (see below). The AA model incorporates instantaneous admixture from an ancestral hominin source into the RAR model described above. An ancestral hominin population with constant effective population size of 103 splits from the ancestors of modern humans at 2 MYA. We considered a range of admixture rates (0–5%) from ancestral hominins to modern humans in Asia, coupled with a series of admixture times (10–55 KY before present). We could not infer an optimized AA model because the paucity of candidate loci is not sufficient both for model training and for subsequent statistical testing. Model fitting via approximate Bayesian computation: We parameterized the RAR model to reflect what is known about deep human demography and, consequently, to produce simulated data sets that mimic real genomic data sets. Parameter inference rapidly becomes computationally intractable at high-dimensional state spaces, such as those associated with complex demographic models. Therefore, we fixed some demographic parameters that have been inferred elsewhere, e.g., the time of onset of population growth and non-African bottleneck size (Plagnol and Wall 2006, Figure 2 Essentially, we generated 104 coalescent simulations for each of 105 different sets of demographic parameters, Θ = {N0, m}, that were drawn randomly from two uniform distributions, Test statistics: We calculated the approximate likelihoods of two summary statistics: the TMRCA and the minimum proportion of Africans in one of the two basal clades. Values were inferred from 105 replicates under all demographic models. We note that these tests are conservative, because both summaries are determined directly from coalescent genealogies. Resolution of the underlying genealogy is constrained by S for real data sets (cf. Nordborg 2000). The distribution of TMRCA values was extracted from the output of Hudson's (2002) ms using custom software (code available on request). Here, we also define a new summary statistic, the minimum clade proportion (pmc), which characterizes the minimum quantity of individuals from a specified geographic region or ethnic group (such as Africans) in each of the two basal clades of a binary gene tree (Figure 3
RESULTS Patterns of DNA sequence variation within and around RRM2P4: We sequenced 2.4 kb of the central RRM2P4 segment, which encompasses the processed pseudogene (Figure 1 0.001; Table 1). Also, the central subregion still shows minimal evidence of recombination despite a sixfold increase in the size of the data set (supplemental Figure 1 at http://www.genetics.org/supplemental/). As before, this low level of recombination permitted partial reconstruction of a single nonreticulating gene tree for the central region (Figure 4
We used both molecular-clock and coalescent approaches to estimate the TMCRA of the RRM2P4 central subregion gene tree. Outgroup comparisons reveal an average of 22 nucleotide substitutions between all human and chimpanzee central RRM2P4 region sequences. Given an average of 8.53 nucleotide differences observed between the two human RRM2P4 lineages (i.e., the average number of mutations between sequences across the base of the human gene tree), we estimate that the two deepest human clades diverged ~2.33 MYA (assuming a 6-MYA human–chimpanzee divergence time). We also inferred the TMRCA of the RRM2P4 central subregion using a full maximum-likelihood method under both a panmictic and an island model. These models yielded TMRCA values of 1.24 and 2.88 MYA, which bracket the molecular-clock date (data not shown). To determine the length of sequence within and around the RRM2P4 locus that shows the unusual genealogical features, we sequenced two additional fragments of 1725 and 1601 bp that flank the central region (Figure 1 RAR model parameters: The RAR model was parameterized by ABC. Because the model cannot be parameterized on the test locus (here, RRM2P4), we conditioned the model on a separate training data set of 19 independent X chromosome noncoding regions. These resequence data are too extensive for detailed description here and are the subject of a separate publication. However, basic summaries of these 19 loci relevant to the current analysis are presented in supplemental Table 3 at http://www.genetics.org/supplemental/. The two-deme RAR model was parameterized by generating 105 random N0 and m values (see full description in subjects and methods) and accepting the 0.01% that best matched the number of segregating sites observed in 19 independent X chromosome noncoding loci (supplemental Figure 3 at http://www.genetics.org/supplemental/). The optimal parameters were inferred as a modern effective size, N0, of 12,300 (range 12,000–12,500) and an intercontinental migration rate per generation, m, of 3.62 × 10−9 (range 6.75 × 10−10–8.24 × 10−9). Using parameter values at the extremes of these ranges had little effect on the following statistical analyses (our unpublished data). Importantly, the parameterized RAR model produces simulated data sets that yield summaries (such as growth rates and effective sizes) that are consistent with other demographic inferences (e.g., from Genetree). Although there is strictly no way to assess the true history of our samples, this best-fit RAR model is a reasonable first approximation for human demographic history, as reconstructed from the X chromosome. Assessing the uniqueness of the RRM2P4 genealogy: The uniqueness of the RRM2P4 central region was estimated using a simulation-based summary-likelihood approach under the RAR and AA models (Figure 2
The geographical distribution of lineages on the central RRM2P4 genealogy is also skewed: one of the two basal clades (Figure 4
DISCUSSION Garrigan et al. (2005b) described a 2.4-kb region on the X chromosome with unusual genealogical structure in a sample of 41 humans: a deep TMRCA and a basal clade composed entirely of Asian (n = 3) sequences. This differs from most genealogies observed to date, where African individuals dominate at least one of the two basal clades (Labuda et al. 2000; Takahata et al. 2001; Satta and Takahata 2004). Garrigan et al. (2005b) also genotyped a single diagnostic SNP to test for the presence of clade A in a larger number of samples (n = 570 from 17 globally distributed populations). They discovered a decreasing frequency gradient centered on southern China (where the clade A is present >50%) and extremely low frequencies of the “Asian” divergent lineage in Europe, the Middle East, and Africa. (See Figure 1 We generated resequencing data 5′ and 3′ of the central 2.4-kb region in a panel of 90 individuals to see whether the pattern originally described by Garrigan et al. (2005b) extended farther along the X chromosome. We found that a strong recombination hotspot almost completely decouples the central and 3′ regions, and despite linkage disequilibrium with the 5′ end of the sequence, the pattern of higher Asian diversity for the central region was not found for the other two regions. While it is possible that this reduced diversity may simply reflect the stochastic nature of the mutational process given the relatively short length of sequence examined (~2 kb), we suggest that the lineage history of the central region has been at least partly decoupled from those of the 5′ and 3′ regions through recombination. BLAST results show that the RRM2P4 processed pseudogene has sequence conservation to the rhesus macaque (Macaca mulatta). Therefore, the high diversity of the central region does not result from increased mutation at the time of pseudogene insertion because human variation traces back only to the Homo lineage (i.e., the TMRCA of human haplotypes is much more recent than the insertion time). Central region diversity is also unlikely to reflect paralogous gene conversion because the RRM2P4 sequence has no close matches to other regions in the human genome. Finally, increased central region diversity is unlikely to reflect linkage to a gene under long-term balancing selection. The nearest 5′ gene is SPANX-N2, which encodes a protein of uncharacterized function. SPANX-N2 is 580 kb, or ~1.8 cM, distant from the RRM2P4 pseudogene, and the intervening region contains at least three hotspots of ~16 cM/Mb (each similar to the hotspot 3′ of the RRM2P4 locus). While we cannot exclude the possibility that an unknown 5′ functional variant may be linked to the RRM2P4 locus, the disparity in diversity between the RRM2P4 5′ and central regions makes linkage with a third selected locus located farther 5′ unlikely. In sum, it appears that polymorphisms that clearly define the unusual RRM2P4 genealogy are found only within the stretch of noncoding DNA associated with the pseudogene (Figure 1 Our sixfold larger database of central region sequences does not result in a substantial change in the topology of the gene tree described by Garrigan et al. (2005b) or in the partitioning of its two deepest branches. However, we did observe several new lineages at low frequency and discovered some novel evidence of recombination. Of the 21 individuals identified with the less frequent basal lineage, only one was from Africa. This Dogon individual from Mali was the same African individual identified as carrying a clade A lineage on the basis of the SNP-based genotyping assay in Garrigan et al. (2005b). Our extended sequence database identified only a single new polymorphism in clade A and 11 new polymorphisms in the more diverse globally distributed clade B (compare Figure 1 Our simulations show that the 2.33-million-year molecular-clock-based TMRCA estimate is a statistical outlier under the RAR model (Figure 5, a and b We also considered the observation of Garrigan et al. (2005b) that Asian samples are overrepresented in one of the two basal clades of the RRM2P4 tree. This pattern continued to hold even after increasing the size of our DNA sequence data set. To assess how unusual this aspect of the RRM2P4 genealogy is under alternative models of human evolutionary history, we defined a new summary statistic, pmc, which quantifies the skew in the proportion of individuals from two populations among the two basal clades of a gene tree. This summary statistic is applicable to RRM2P4 because the central region is essentially tree-like. We observed pmc = 0.048 for the central region of RRM2P4, which is significantly unlikely under the RAR model (P = 0.031, Figure 5, c and d While there are limitations with both TMRCA and pmc for distinguishing predictions of the RAR and AA models, they do represent independent summaries of the data and, thus, complement one another. As already mentioned, the power of these two test statistics depends on different aspects of sampling. Variance in the estimate of the TMRCA is improved by longer sequences of the region with tree-like ancestry (in the case of RRM2P4 this is limited by the small central region and flanking recombination), but only slightly by increasing the sample size. On the other hand, estimates of pmc can be improved by increasing the sample size, because the variance of pmc decreases approximately as the inverse of the sample size (analyses not shown). If the pmc is genuinely an outlier under the RAR model, increasing the number of individuals sampled increases the power to reject RAR. Indeed, when we use the SNP data of Garrigan et al. (2005b), which included 177 Africans and 393 non-Africans, we reject the RAR model with greater confidence (pmc = 0.0189, P = 0.014). Further evidence in support of an archaic admixture model awaits analysis of additional loci exhibiting genealogical properties similar to the central RRM2P4 region. Several candidates have already been identified (Harding et al. 1997; Ziętkiewicz et al. 2003; Stefansson et al. 2005; Shimada et al. 2007), but most lack rigorous statistical analyses under a range of demographic models, including ancient admixture alternatives. The frequency at which we expect to find introgressed regions depends largely on the amount of admixture between the two archaic populations (Wall 2000). Moreover, unless admixture was recent and involved highly divergent populations, the power to detect archaic admixture is low (Nordborg 2000). In the case of RRM2P4, divergence may have started at the time of separation of H. ergaster/H. erectus populations in Africa ~2 MYA (Anton and Swisher 2004). Yet, the length of the divergent sequence is short, possibly as a result of the nearby recombination hotspot or because admixture occurred in the more distant past and recombination has subsequently broken down the admixed chromosome. In any case, identifying longer sequences with greater divergence would allow for more sophisticated tests of archaic admixture. For example, Wall (2000) suggested a number of summary statistics that are based on both the level of divergence between two clades and the amount of recombination between them. Recent population structure is another factor that may affect the probability of sampling a locus with a genealogy showing signs of archaic admixture. As pointed out by Nordborg (2000), population structure may actually increase the power to detect archaic admixture if we sample sufficiently among demes, because we would expect the introgressed alleles to still be present in the area of the world where admixture took place. In the case of RRM2P4, individuals carrying the less frequent divergent lineage are concentrated in East Asia, suggesting that admixture may have occurred at the Asian end of the global distribution of human populations. On the other hand, it is important to point out that current population structure is unlikely to reflect ancient patterns directly. Following a demic expansion, what was once subdivision between two African populations may now appear as structure between African and non-African populations. For loci with more ancient TMRCA, there is an increase in power to detect archaic admixture even if it occurred at more ancient times (Nordborg 2000). This means that for RRM2P4, which has an ancient TMRCA, we cannot be confident about where archaic admixture may have occurred geographically. In this regard, it is interesting to note that a growing number of loci have been discovered with two deeply divergent lineages where both the major and the minor types are present only in African populations (Barreiro et al. 2005; Garrigan et al. 2005a; Hayakawa et al. 2006). This supports models in which anatomically modern humans descend from a structured ancestral African population (Garrigan and Hammer 2006). We find some support that elevated admixture among highly divergent African subpopulations just prior to the recent African expansion could explain the pattern of polymorphism at RRM2P4 (supplemental Figure 4 at http://www.genetics.org/supplemental/), but note that this model has little power to explain why RRM2P4 clade A lineages are geographically restricted to East Asia today. For now, this locus represents a genealogical history that is most consistent with recent admixture from an archaic hominin population in Asia. Acknowledgments We thank Zahra Mobasher (University of Arizona) for excellent technical assistance and David Morales (University of Arizona) for helpful discussion. This research forms part of the HOMINID project, a genomic resequence study funded by National Science Foundation grant BCS-0423670. APPENDIX Here, we define a new summary statistic, the minimum clade proportion (pmc). In its simplest form, this statistic characterizes the proportion of individuals from a specified group (e.g., Africans) in each of the two basal clades of a binary gene tree. The minimum clade proportion can thus be defined as
The variance of pmc decreases approximately as the inverse of the sample size; i.e., increasing the number of sampled individuals increases the power to reject the null model (our unpublished simulation results). Furthermore, the power of this test depends on sampling sizes as follows. Consider that we have N1 individuals from the specified group, N1 + N2 total individuals, and an observed pmc = λ. Also, given an arbitrarily chosen clade, p1 is that clade's frequency in the specific group, whereas p2 is the frequency of that clade in the remaining individuals. For a given hypothesis H
We emphasize that the likelihood of an observed pmc depends strongly on the demographic model underlying the null hypothesis. In practice, the probability that pmc ≤ λ must be determined by coalescent simulation. Finally, we note that the pmc statistic can be generalized to any subset of chromosome copies, K, and any number of clades, C (indexed by i), from a data set, N. Necessary conditions include
References
|
PubMed related articles
Your browsing activity is empty. Activity recording is turned off. |
|||||||||||||||||||||||||
J Hum Evol. 2000 Nov; 39(5):453-563.
[J Hum Evol. 2000]Nature. 2005 Feb 17; 433(7027):733-6.
[Nature. 2005]Nat Rev Genet. 2006 Sep; 7(9):669-80.
[Nat Rev Genet. 2006]Nature. 2002 Mar 7; 416(6876):45-51.
[Nature. 2002]Genetics. 2005 Aug; 170(4):1849-56.
[Genetics. 2005]Mol Biol Evol. 2005 Feb; 22(2):189-92.
[Mol Biol Evol. 2005]PLoS Genet. 2006 Jul; 2(7):e105.
[PLoS Genet. 2006]Mol Biol Evol. 2005 Feb; 22(2):189-92.
[Mol Biol Evol. 2005]Science. 2002 Apr 12; 296(5566):261-2.
[Science. 2002]Hum Biol. 2002 Dec; 74(6):761-89.
[Hum Biol. 2002]Genome Res. 2002 Feb; 12(2):339-48.
[Genome Res. 2002]Curr Opin Genet Dev. 2006 Dec; 16(6):624-9.
[Curr Opin Genet Dev. 2006]Genetics. 1997 Aug; 146(4):1501-14.
[Genetics. 1997]Bioinformatics. 2002 Feb; 18(2):337-8.
[Bioinformatics. 2002]PLoS Genet. 2006 Jul; 2(7):e105.
[PLoS Genet. 2006]Am J Phys Anthropol. 2005 Oct; 128(2):415-23.
[Am J Phys Anthropol. 2005]PLoS Genet. 2006 Jul; 2(7):e105.
[PLoS Genet. 2006]Genetics. 2002 Dec; 162(4):2025-35.
[Genetics. 2002]Bioinformatics. 2002 Feb; 18(2):337-8.
[Bioinformatics. 2002]Mol Biol Evol. 2005 Feb; 22(2):189-92.
[Mol Biol Evol. 2005]Mol Biol Evol. 2005 Feb; 22(2):189-92.
[Mol Biol Evol. 2005]Mol Biol Evol. 2005 Feb; 22(2):189-92.
[Mol Biol Evol. 2005]Genetics. 2000 Oct; 156(2):799-808.
[Genetics. 2000]Mol Biol Evol. 2001 Feb; 18(2):172-83.
[Mol Biol Evol. 2001]Mol Ecol. 2004 Apr; 13(4):877-86.
[Mol Ecol. 2004]Mol Biol Evol. 2005 Feb; 22(2):189-92.
[Mol Biol Evol. 2005]Mol Biol Evol. 2005 Feb; 22(2):189-92.
[Mol Biol Evol. 2005]Genetics. 2002 May; 161(1):447-59.
[Genetics. 2002]J Genet. 2003 Apr-Aug; 82(1-2):7-12.
[J Genet. 2003]Mol Biol Evol. 2005 Feb; 22(2):189-92.
[Mol Biol Evol. 2005]Genetics. 1993 Feb; 133(2):375-87.
[Genetics. 1993]Mol Biol Evol. 2005 Feb; 22(2):189-92.
[Mol Biol Evol. 2005]Mol Biol Evol. 2005 Feb; 22(2):189-92.
[Mol Biol Evol. 2005]Am J Hum Genet. 1997 Apr; 60(4):772-89.
[Am J Hum Genet. 1997]Am J Hum Genet. 2003 Nov; 73(5):994-1015.
[Am J Hum Genet. 2003]Nat Genet. 2005 Feb; 37(2):129-37.
[Nat Genet. 2005]Mol Biol Evol. 2007 Mar; 24(3):687-98.
[Mol Biol Evol. 2007]Genetics. 2000 Mar; 154(3):1271-9.
[Genetics. 2000]Am J Hum Genet. 2005 Nov; 77(5):869-86.
[Am J Hum Genet. 2005]Genetics. 2005 Aug; 170(4):1849-56.
[Genetics. 2005]Genetics. 2006 Feb; 172(2):1139-46.
[Genetics. 2006]Nat Rev Genet. 2006 Sep; 7(9):669-80.
[Nat Rev Genet. 2006]