![]() | ![]() |
Formats:
|
||||||||||||||
Copyright © 1999, The National Academy of Sciences Evolution Inaugural Article Constructing primate phylogenies from ancient retrovirus sequences Department of Molecular Microbiology, Tufts University School of Medicine, Boston, MA 02111 †Present address: Division of Microbiology, New England Regional Primate Research Center, Harvard Medical School, Southborough, MA 01772-9102. ‡To whom reprint requests should be addressed at: Tufts University School of Medicine, Department of Molecular Biology and Microbiology, 136 Harrison Ave., Boston, MA 02111. E-mail: JCoffin_PAR/at/OPAL.Tufts.edu. This contribution is part of the special series of Inaugural Articles by members of the National Academy of Sciences elected on April 27, 1999.Contributed by John M. Coffin Accepted July 6, 1999. This article has been cited by other articles in PMC.Abstract The genomes of modern humans are riddled with thousands of endogenous retroviruses (HERVs), the proviral remnants of ancient viral infections of the primate lineage. Most HERVs are nonfunctional, selectively neutral loci. This fact, coupled with their sheer abundance in primate genomes, makes HERVs ideal for exploitation as phylogenetic markers. Endogenous retroviruses (ERVs) provide phylogenetic information in two ways: (i) by comparison of integration site polymorphism and (ii) by orthologous comparison of evolving, proviral, nucleotide sequence. In this study, trees are constructed with the noncoding long terminal repeats (LTRs) of several ERV loci. Because the two LTRs of an ERV are identical at the time of integration but evolve independently, each ERV locus can provide two estimates of species phylogeny based on molecular evolution of the same ancestral sequence. Moreover, tree topology is highly sensitive to conversion events, allowing for easy detection of sequences involved in recombination as well as correction for such events. Although other animal species are rich in ERV sequences, the specific use of HERVs in this study allows comparison of trees to a well established phylogenetic standard, that of the Old World primates. HERVs, and by extension the ERVs of other species, constitute a unique and plentiful resource for studying the evolutionary history of the Retroviridae and their animal hosts. Retroviruses are unique among RNA viruses in their ability to integrate DNA copies of their genomes into the genome of the infected cell. On occasion, integration takes place in a germ-line cell, giving rise to an endogenous retrovirus (ERV), which can be inherited by the offspring of the infected host, and may eventually become fixed in the gene pool of the host population (1). The genomes of vertebrate species contain dozens to thousands of ERV sequences (2), some of which were acquired in evolutionarily recent times, whereas others derive from “ancient” times, as indicated by their identical site of integration in more than one species (1, 3, 4). Typically, ancient proviruses have sustained numerous point mutations, deletions, and insertions, rendering them incapable of expressing virus. No biologically active viruses have been associated with the ancient proviruses. Despite their abundance in vertebrate genomes, and some other especially useful features described below, ERVs have rarely been exploited as phylogenetic markers (5–10). In a few instances integration site polymorphisms have served as a source of phylogenetic signal (6), or as markers for linkage analysis (11), but the usefulness of orthologous ERV nucleotide sequences has never been fully explored. Here we report the application of ancient human endogenous retrovirus (HERV) sequences to phylogenetic analysis on a time scale spanning recent primate evolution. HERVs can be organized into at least a dozen distinct groups, which vary in size from one to thousands of members (1, 12). Cross-hybridization and PCR studies consistently reveal that most HERV families are also found in other primates, including apes and Old World monkeys (OWMs) (12–19). Many HERVs, including the ones used in this study, are the result of integration events that took place between 5 and 50 million years ago, as indicated by the distribution of specific proviruses at the same integration sites (or “loci”) among related species. The evolution of primates has been the subject of intense study for well over a century, providing a well established phylogenetic consensus with which to compare and evaluate the performance of ERVs as phylogenetic markers. METHODS Preparation of Genomic DNA Samples. Human DNA samples came from the National Institute of General Medical Sciences Human Genetic Mutant Cell Repository. Great-ape genomic DNA was extracted from whole blood (Yerkes Regional Primate Research Center) or immortalized cell lines, including WES (chimpanzee), ROK (gorilla), PUTI (orangutan), MLA144 (gibbon), and COS-7 (African green monkey) cells (American Type Culture Collection; Manassas, VA). Additional blood samples were purchased from the New England Regional Primate Research Center (Southborough, MA) and included rhesus macaque, cynomolgus macaque, baboon, marmoset, and spider monkey samples. Genomic DNA was extracted from cells by digestion with SDS/Pronase and extraction with phenol or from whole blood by using the QIAamp Blood Kit (Qiagen, Santa Clarita, CA.). PCR and Sequencing. All PCR mixtures contained 200–400 ng of genomic DNA, 1.5–2.0 mM MgCl2, 200 μM each dNTP, 0.2 μM each primer, and 4 units of Taq DNA polymerase (Perkin–Elmer). HERV-K(HML6.17) (GenBank accession nos. U60268 and U60269). Primers for amplifying the 5′ long terminal repeat (LTR) from human, chimpanzee, bonobo, gorilla, and gibbon were HML1 (5′-TTGCCTTCTCAAGACAATATGGGC-3′) and HML2 (5′-AGGCGCTGACCTCATGTGCGC-3′). The 5′ LTR from orangutan, African green monkey, baboon, cynomolgus macaque, and rhesus macaque was amplified with HML7 (5′-GGGCAAAGTGAGTACATTTAGTGG-3′) and HML8 (5′-GTTCAGACTGTCGCCTGTTTGAG-3′). The 3′ LTR from human, chimpanzee, bonobo, gorilla, orangutan, and African green monkey was amplified with HML5 (5′-GTCTCTGTCTTCTGTTTATAGTCTG-3′) and HML6 (5′-GTTTTGTGCCTTTACAACATAGGTAC-3′); from baboon, cynomolgus macaque, and rhesus macaque, with HML5 and HML4 (5′-ATAGGTACTAACTTAACTTCAACAAC-3′). Screening for cellular target sequence in marmoset and squirrel monkey used primers HML1 or HML7 (forward) and HML4 or HML6 (reverse). RTVL-Ia (accession nos. M92067 and M34038). The 5′ LTR was amplified from chimpanzee, bonobo, gorilla, orangutan, and gibbon with primers RTVLIa-1 (5′-CCTAAAGGTGAATTATCACAAAATACT-3′) and RTVLIa-5 (5′-GGACAATGTTTTCCCTGTAT-3′). The 5′ LTR of African green monkey was amplified with RTVLIa-8 (5′-TACTAGAAATAACCACATAAGTGTTT-3′) and RTVLIa-5. The 3′ LTR was amplified from chimpanzee, bonobo, gorilla, and orangutan by using RTVLIa-3 (5′-AAGACCCAAGTAGAATAACAGAGCC-3′) and RTVLIa-7 (5′-CCTCTACTTCTTGAAATTTTCC-3′); the 3′ LTR was amplified from gibbon and African green monkey by using RTVLIa-3 and RTVLIa-2 (5′-TTTCCATAATCAAAGATTCCTTAAAT-3′). Published RTVL-H sequences were used to query databases for RTVL-H loci with flanking sequences suitable for designing PCR primers. RTVL-Ha is an RTVL-H element spanning nucleotides 14602–20350 of cosmid clone N28H9 (accession no. Z71183). 5′ LTRs were amplified from human, chimpanzee, bonobo, and gorilla with RTVLH-1 (5′-CTGACCGATGCTGACAATGGC-3′) and RTVLH-2 (5′-CTCACGGAGCAAAGAACAGGAGG-3′). The 3′ LTRs of the same species were amplified with primers RTVLH-3 (5′-GAAACATCGCCCATTCTCTCTCC-3′) and RTVLH-4 (5′-GATCAATGGCAGGTTTTCAACCTC-3′). The uninterrupted cellular sequence was amplified from orangutan and African green monkey by using primers RTVLH-1 and RTVLH-4. RTVL-Hb is between bases 75–5817 of human bacterial artificial chromosome (BAC) clone RG341D10 (accession no. AC002530). The 5′ LTRs were amplified by using RTVLHb-1 (5′-GCTCTAAGTTAATTTATAGGTCAC-3′) and RTVLHb-2 (5′-CGATCCGAGTCACGGCACC-3′); the 3′ LTRs were amplified with RTVLHb-3 (5′-CTAATCCCGCTTGAAGCAGCC-3′) and RTVLHb-4 (5′-AACAGCCCTGCATGAAGTCAC-3′). HERV-K18 (M12853 and M12852). Primers were derived from the HERV-K18 published flanking sequence (HERVK-1 and HERVK-4) conserved HERV-K family gag and env sequences (HERVK-2 and HERVK-3) (internal sequences of HERV-K18 have not been published). The 5′ LTR was amplified from human, chimpanzee, bonobo, and gorilla with primers HERVK-1 (5′-TGGCATGTACAACATAAGCGGAATC-3′) and HERVK-2 (5′-CTCCACGTTGGGCACCAGATG-3′). The 3′ LTR was amplified from the same species by using primers HERVK-3 (5′-TGTCTGTTGTTAGTCTGCAGGTGTACC-3′) and HERVK-4 (5′-CTGTGATTACCGCCTTACAGGATTTC-3′). PCR products were gel purified and sequenced directly on both strands by using the ABI Prism Dye-Terminator Cycle Sequencing Ready Reaction Kit and analyzed with an ABI 373 Stretch automated sequencer (Perkin–Elmer). Each of the proviruses used in this study maps to a different human chromosome, according to information available in the sequence databases, or by PCR screening of monochromosomal human × rodent somatic cell hybrids (data not shown). RESULTS AND DISCUSSION Building Phylogenetic Trees from ERV LTR Sequences Endogenous retrovirus loci provide no less than three sources of phylogenetic signal, which can be used in complementary fashion to obtain much more information than simple distance estimates of homologous sequences. First, the distribution of provirus-containing loci among taxa dates the insertion. Given the size of vertebrate genomes (>1 × 109 bp) and the random nature of retroviral integration (22, 23), multiple integrations (and subsequent fixation) of ERV loci at precisely the same location are highly unlikely (24). Therefore, an ERV locus shared by two or more species is descended from a single integration event and is proof that the species share a common ancestor into whose germ line the original integration took place (14). Furthermore, integrated proviruses are extremely stable: there is no mechanism for removing proviruses precisely from the genome, without leaving behind a solo LTR or deleting chromosomal DNA. The distribution of an ERV among related species also reflects the age of the provirus: older loci are found among widely divergent species, whereas younger proviruses are limited to more closely related species. In theory, the species distribution of a set of known integration sites can be used to construct phylogenetic trees in a manner similar to restriction fragment length polymorphism (RFLP) analysis. Second, as with other sequence-based phylogenetic analyses, mutations in a provirus that have accumulated since the divergence of the species provide an estimate of the genetic distance between the species. Because, for any given provirus, it is highly unlikely that there will be selection for or against any specific sequence, it is safe to assume that the rate of accumulation of mutations approximates the rate of their occurrence, with appropriate corrections for reversion. Analysis of closely related proviruses integrated at different sites should also reveal regional differences in mutation rates. Third, sequence divergence between the LTRs at the ends of a given provirus provides an important and unique source of phylogenetic information. The LTRs are created during reverse transcription to regenerate cis-acting elements required for integration and transcription. Because of the mechanism of reverse transcription, the two LTRs must be identical at the time of integration, even if they differed in the precursor provirus (Fig. (Fig.11
Assuming that the LTRs of an ERV are evolving independently, at approximately the same rate, and in the absence of rearrangement events, a phylogenetic tree containing 5′ and 3′ LTRs derived from the same ERV locus is predicted to have a topology similar to that depicted in Fig. Fig.11 Species Distribution of HERV Loci Fig. Fig.11 Evolution of HERV LTR Sequences For each HERV locus, the amplified LTRs from each species were directly sequenced, and the aligned sequences were used to generate phylogenetic trees (Fig. (Fig.2).2
Fig. Fig.22
Alternatively, the topology of the tree in Fig. Fig.22 (i) The proviruses are derived from two independent integration events (xenology). This possibility would require two nearly identical viruses (differing by no more than 11 substitutions within the LTRs) to integrate into precisely the same nucleotide position in two different lineages—a highly unlikely possibility. A similarly unlikely variation on this possibility is independent integrations into very similar cellular target sequences. (ii) In one of the two lineages, HERV-K18 was largely replaced by recombination with a separate (but nearly identical) provirus. Such recombination would have been restricted to sequences within the provirus, as the flanking cellular sequences are identical in both lineages. It should be noted that there are hundreds of HERV-K LTR sequences within the primate genome (31–34). The HERV-K(C4) LTR sequences (Fig. (Fig.22 The RTVL-Ia tree (Fig. (Fig.22 The trees in Fig. Fig.22 The tree in Fig. Fig.22 Nucleotide Substitution Patterns Some authors have suggested that methyl-CG deamination has evolved as a specific defense against colonization of the genome by ERVs (38). Existence of such a mechanism should be manifest as a bias toward C G → T A transitions within CG dinucleotides. Tracing the pattern and direction of shared derived substitutions on each of the HERV trees revealed such a bias. Table 1 shows the distribution of C G → T A changes among the C/G-containing dinucleotides in the ancestral LTR sequence. In every case, the number of C G → T A substitutions per CG dinucleotide is 5- to 10-fold higher than for any of the six other dinucleotide contexts. Indeed, over 40% of the total C → T and G → A transitions are attributable to CG changes, despite the fact that CG is much less frequent than any other dinucleotide. This imbalance is consistent with methyl-CG deamination as a mechanism for generating C G → T A transitions (39); however, the issue of whether a mechanism for promoting CG deamination has evolved specifically as a defense against ERVs requires a careful comparison of the substitution patterns in ERV sequences with those of other nuclear DNA markers.
Estimating the Time of Integration The genetic distance between the 5′ and 3′ LTRs of an ERV reflects mutations accumulated since the time of integration and should therefore be proportional to the age of the provirus. HERV-KC4, HERV-KHML6.17, and RTVL-Ia are found in both OWMs and hominoids, which are estimated to have last shared a common ancestor over 31 million years ago. By contrast, HERV-K18, RTVL-Ha, and RTVL-Hb are found only in humans, chimpanzees, and gorillas, which are thought to have diverged around 5 million years ago (40–42). To estimate the age of each provirus the human/chimpanzee distances from each tree were used to calibrate the rate of molecular evolution at each locus (Table 2). The most recent common ancestor of humans and chimpanzees lived approximately 4.5 million years ago (40–42), so dividing the distance between the human and chimpanzee sequences (substitutions per site) by this number gives rates ranging from 2.3 to 5.0 × 10−9 substitutions per site per year. These numbers are similar to the estimated rates of evolution for pseudogenes and noncoding regions of mammalian genes (43–45). Applying each rate to the divergence between the 5′ and 3′ LTRs of the same locus gives integration times consistent with estimates based on species distribution (Table 2).
A number of authors have pointed out that molecular clock calibrations are subject to a wide margin of error, and are usually based on imprecise estimates of divergence dates (46–48). The calculations in Table 2 are therefore only rough estimates of absolute time, but they are nonetheless useful for comparing the relative ages and rates of evolution of different HERV loci. Conclusions The study reported here is, to our knowledge, the first to take advantage of special properties of retroelements to provide insight into evolutionary mechanisms. The HERVs analyzed above include six unlinked loci, representing five unrelated HERV sequence families. Except where noted, these sequences gave trees that were consistent with the well established phylogeny of the old world primates, including OWMs, apes, and humans. Within this time scale genetic distances were less than 10% for all orthologous comparisons, and correction for multiple substitutions did not significantly alter branch lengths or tree topologies (data not shown). As with other nuclear DNA sequences, analyses of older phylogenetic relationships by using ERVs are likely to require such corrections. One surprising result is the high frequency of conversion we observed. Indeed, only two of the six loci analyzed had suffered no such events in any lineage. Solo LTRs, which arise by recombinational deletion of the intervening viral genes, and which are found by the thousands in the genomes of many animal species, are further evidence for high frequency of recombination involving ERV sequences (1, 49–51). The mechanism that gives rise to such events is unlikely to be provirus-specific, but probably reflects the likelihood of conversion among any repeated, nuclear DNA sequences. Because many ERVs belong to multicopy families, it is also possible that interlocus recombination gives rise to concerted evolution among some of these loci. This latter mechanism may explain the rather confusing topology of the RTVL-Hb tree (Fig. (Fig.22 The use of LTR-to-LTR divergence to estimate insertion times has been reported previously (8, 15, 18, 25), but such studies invariably ignore the possibility of sequence conversion. Only one report (25) discussed the concern that sequence conversion between LTRs can result in an underestimate of insertion time, and suggested that conversion events should be detectable as deletions or alterations of the sequences flanking the LTRs. However, most of the loci analyzed in our study have clearly undergone conversion/recombination, yet none of these events resulted in loss or alteration of flanking sequences (data not shown). Phylogenetic analysis using HERV LTR sequences gives rise to trees with a predictable topology, on which is superimposed the phylogeny of the host taxa, and allows ready detection of conversion events. Once aberrant sequences are identified, they can be eliminated from an analysis, and the remaining sequences can be used to calculate insertion times, delineate substitution patterns, and decipher host phylogeny. Because ERVs are abundant within the genomes of many animal species, including (but not limited to) plants, insects, mollusks, fish, rodents, domestic pets, and livestock, the ERV approach can be applied to an endless variety of phylogenetic puzzles (1, 2). Acknowledgments We dedicate this paper to the memory of Igor Slobodkin, Ph.D. We thank Steve O’Brien for helpful comments and discussion. J.M.C. was supported by National Cancer Institute Grant R35CA44385 and is a Research Professor of the American Cancer Society. W.E.J. was supported by National Institutes of Health Training Grants T32GM07310 and T32AI07422. Footnotes Data deposition: The sequences reported in this paper have been deposited in the GenBank database (accession nos. 172454–172500). References 1. Boeke J D, Stoye J P. In: Retroviruses. Coffin J M, Hughes S H, Varmus H E, editors. Plainview, NY: Cold Spring Harbor Lab. Press; 1997. pp. 343–436. 2. Herniou E, Martin J, Miller K, Cook J, Wilkinson M, Tristem M. J Virol. 1998;72:5955–5966. [PubMed] 3. Coffin J M. In: RNA Tumor Viruses. Weiss R, Teich N, Varmus H, Coffin J M, editors. Plainview, NY: Cold Spring Harbor Lab. Press; 1982. pp. 1109–1204. 4. Coffin J M. In: Fundamental Virology. Fields B N, Knipe D M, Howley P M, editors. Philadelphia: Lippincott-Raven; 1996. pp. 763–844. 5. Benveniste R E, Todaro G J. Nature (London). 1976;261:101–108. [PubMed] 6. Atchley W R, Fitch W M. Science. 1991;254:554–558. [PubMed] 7. Hillis D M, Bull J J. Science. 1991;254:528. [PubMed] 8. Shih A, Coutavas E E, Rush M G. Virology. 1991;182:495–502. [PubMed] 9. van der Kuyl A C, Dekker J T, Goudsmit J. J Virol. 1995;69:7877–7887. [PubMed] 10. Atchley W R, Fitch W. Mol Biol Evol. 1993;10:1150–1169. [PubMed] 11. Frankel W N, Rudy C, Coffin J M, Huber B T. Nature (London). 1991;349:526–528. [PubMed] 12. Wilkinson D A, Mager D L, Leong J C. In: The Retroviridae. Levy J, editor. Vol. 3. New York: Plenum; 1994. pp. 465–536. 13. Mariani-Costantini R, Horn T M, Callahan R. J Virol. 1989;63:4982–4985. [PubMed] 14. Maeda N, Kim H S. Genomics. 1990;8:671–683. [PubMed] 15. Dangel A W, Baker B J, Mendoza A R, Yu C Y. Immunogenetics. 1995;42:41–52. [PubMed] 16. Tassabehji M, Strachan T, Anderson M, Campbell R D, Collier S, Lako M. Nucleic Acids Res. 1994;22:5211–5217. [PubMed] 17. Steinhuber S, Brack M, Hunsmann G, Schwelberger H, Dierich M P, Vogetseder W. Human Genet. 1995;96:188–192. [PubMed] 18. Mager D L, Freeman J D. Virology. 1995;213:395–404. [PubMed] 19. Medstrand P, Mager D L, Yin H, Dietrich U, Blomberg J. J Gen Virol. 1997;78:1731–1744. [PubMed] 20. Thompson J D, Higgins D G, Gibson T J. Nucleic Acids Res. 1994;22:4673–4680. [PubMed] 21. Swofford D L. paup*, Phylogenetic Analysis Using Parsimony (*and other methods), version 4. Sunderland, MA: Sinauer; 1998. 22. Varmus H E, Swanstrom R. In: RNA Tumor Viruses. Weiss R, Teich N M, Varmus H E, Coffin J M, editors. Plainview, NY: Cold Spring Harbor Lab. Press; 1984. pp. 369–512. 23. Brown P O. In: Retroviruses. Coffin J M, Hughes S H, Varmus H E, editors. Plainview, NY: Cold Spring Harbor Laboratory Press; 1997. 24. Withers-Ward E S, Kitamura Y, Barnes J P, Coffin J M. Genes Dev. 1994;8:1473–1487. [PubMed] 25. SanMiguel P, Gaut B, Tikhonov A, Nakajima Y, Bennetzen J L. Nat Genet. 1998;20:43–45. [PubMed] 26. Purvis A. Philos Trans R Soc London B. 1995;348:405–421. [PubMed] 27. Napier J R, Napier P H. The Natural History of the Primates. Cambridge, MA: MIT Press; 1985. 28. Fleagle J G. Primate Adaptation and Evolution. San Diego: Academic; 1988. 29. Ruvolo M. Mol Biol Evol. 1997;14:248–265. [PubMed] 30. Ruvolo M, Disotell T R, Allard M W, Brown W M, Honeycutt R L. Proc Natl Acad Sci USA. 1991;88:1570–1574. [PubMed] 31. Ono M. J Virol. 1986;58:937–944. [PubMed] 32. Ono M, Kawakami M, Takezawa T. Nucleic Acids Res. 1987;15:8725–8737. [PubMed] 33. Meese E, Gottert E, Zang K D, Sauter M, Schommer S, Mueller-Lantzsch N. Cytogenet Cell Genet. 1996;72:40–42. [PubMed] 34. Löwer R, Löwer J, Kurth R. Proc Natl Acad Sci USA. 1996;93:5177–5184. [PubMed] 35. Klein J, O’hUigin C, Figueroa F, Mayer W E, Klein D. Mol Biol Evol. 1993;10:48–59. [PubMed] 36. Mager D L, Henthorn P S. Proc Natl Acad Sci USA. 1984;81:7510–7514. [PubMed] 37. Mager D L, Freeman J D. J Virol. 1987;61:4060–4066. [PubMed] 38. Yoder J, Walsh C, Bestor T. Trends Genet. 1997;13:335–340. [PubMed] 39. Krawczak M, Cooper D N. In: Human Genome Evolution. Jackson M S, Strachan T, Dover G, editors. Oxford: BIOS Scientific; 1996. pp. 1–33. 40. Gingerich P D. Yearbook Phys Anthropol. 1984;27:57–72. 41. Pilbeam D R. Sci Am. 1984;250(3):84–96. [PubMed] 42. Takahata N, Satta Y. Proc Natl Acad Sci USA. 1997;94:4811–4815. [PubMed] 43. Li W-H, Wu C-I, Luo C-C. Mol Biol Evol. 1985;2:150–174. [PubMed] 44. Miyata T. In: Molecular Evolution, Protein Polymorphism, and the Neutral Theory. Kimura M, editor. Tokyo: Scientific Societies Press; 1982. pp. 233–266. 45. Nei M. Molecular Evolutionary Genetics. New York: Columbia Univ. Press; 1987. 46. Hillis D M, Mable B K, Moritz C. In: Molecular Sytematics. Hillis D M, Moritz C, Mable B K, editors. Sunderland, MA: Sinauer; 1996. pp. 515–543. 47. Avise J C. Molecular Markers, Natural History and Evolution. New York: Chapman & Hall; 1994. 48. Arnason U, Gullberg A, Janke A. J Mol Evol. 1998;47:718–727. [PubMed] 49. Copeland N G, Hutchison K W, Jenkins N A. Cell. 1983;33:379–387. [PubMed] 50. Stoye J P, Fenner S, Greenoak G E, Moran C, Coffin J M. Cell. 1988;54:383–391. [PubMed] 51. Mager D L, Goodchild N L. Am J Human Genet. 1989;45:848–854. [PubMed] |
PubMed related articles
Your browsing activity is empty. Activity recording is turned off. |
|||||||||||||
J Virol. 1998 Jul; 72(7):5955-66.
[J Virol. 1998]Nature. 1976 May 13; 261(5556):101-8.
[Nature. 1976]Mol Biol Evol. 1993 Nov; 10(6):1150-69.
[Mol Biol Evol. 1993]Science. 1991 Oct 25; 254(5031):554-8.
[Science. 1991]Nature. 1991 Feb 7; 349(6309):526-8.
[Nature. 1991]J Gen Virol. 1997 Jul; 78 ( Pt 7)():1731-44.
[J Gen Virol. 1997]Nucleic Acids Res. 1994 Nov 11; 22(22):4673-80.
[Nucleic Acids Res. 1994]Genes Dev. 1994 Jun 15; 8(12):1473-87.
[Genes Dev. 1994]Genomics. 1990 Dec; 8(4):671-83.
[Genomics. 1990]Virology. 1991 Jun; 182(2):495-502.
[Virology. 1991]Immunogenetics. 1995; 42(1):41-52.
[Immunogenetics. 1995]Virology. 1995 Nov 10; 213(2):395-404.
[Virology. 1995]Nat Genet. 1998 Sep; 20(1):43-5.
[Nat Genet. 1998]Philos Trans R Soc Lond B Biol Sci. 1995 Jun 29; 348(1326):405-21.
[Philos Trans R Soc Lond B Biol Sci. 1995]Mol Biol Evol. 1997 Mar; 14(3):248-65.
[Mol Biol Evol. 1997]Proc Natl Acad Sci U S A. 1991 Feb 15; 88(4):1570-4.
[Proc Natl Acad Sci U S A. 1991]J Virol. 1986 Jun; 58(3):937-44.
[J Virol. 1986]Proc Natl Acad Sci U S A. 1996 May 28; 93(11):5177-84.
[Proc Natl Acad Sci U S A. 1996]Immunogenetics. 1995; 42(1):41-52.
[Immunogenetics. 1995]Nucleic Acids Res. 1994 Dec 11; 22(24):5211-7.
[Nucleic Acids Res. 1994]Mol Biol Evol. 1993 Jan; 10(1):48-59.
[Mol Biol Evol. 1993]Proc Natl Acad Sci U S A. 1984 Dec; 81(23):7510-4.
[Proc Natl Acad Sci U S A. 1984]J Virol. 1987 Dec; 61(12):4060-6.
[J Virol. 1987]Virology. 1995 Nov 10; 213(2):395-404.
[Virology. 1995]Trends Genet. 1997 Aug; 13(8):335-40.
[Trends Genet. 1997]Proc Natl Acad Sci U S A. 1997 Apr 29; 94(9):4811-5.
[Proc Natl Acad Sci U S A. 1997]Mol Biol Evol. 1985 Mar; 2(2):150-74.
[Mol Biol Evol. 1985]J Mol Evol. 1998 Dec; 47(6):718-27.
[J Mol Evol. 1998]Cell. 1983 Jun; 33(2):379-87.
[Cell. 1983]Am J Hum Genet. 1989 Dec; 45(6):848-54.
[Am J Hum Genet. 1989]Virology. 1991 Jun; 182(2):495-502.
[Virology. 1991]Immunogenetics. 1995; 42(1):41-52.
[Immunogenetics. 1995]Virology. 1995 Nov 10; 213(2):395-404.
[Virology. 1995]Nat Genet. 1998 Sep; 20(1):43-5.
[Nat Genet. 1998]J Virol. 1998 Jul; 72(7):5955-66.
[J Virol. 1998]J Virol. 1986 Jun; 58(3):937-44.
[J Virol. 1986]Genomics. 1990 Dec; 8(4):671-83.
[Genomics. 1990]Proc Natl Acad Sci U S A. 1984 Dec; 81(23):7510-4.
[Proc Natl Acad Sci U S A. 1984]J Virol. 1987 Dec; 61(12):4060-6.
[J Virol. 1987]Proc Natl Acad Sci U S A. 1997 Apr 29; 94(9):4811-5.
[Proc Natl Acad Sci U S A. 1997]