![]() | ![]() |
Formats:
|
||||||||||||
Transposable elements donate lineage-specific regulatory sequences to host genomes a National Center for Biotechnology Information, National Institutes of Health, Bethesda, MD b School of Information and Library Science, The University of North Carolina at Chapel Hill, Chapel Hill, NC (USA) Request reprints from I. King Jordan, National Center for Biotechnology Information, National Institutes of Health, 8600 Rockville Pike, Bldg 38A, Room N511M, Bethesda, MD 20894 (USA), telephone: 301-594-5714; fax: 301-435-7794, e-mail: jordan/at/ncbi.nlm.nih.gov The publisher's final edited version of this article is available at Cytogenet Genome Res. See other articles in PMC that cite the published article.Abstract The evolutionary implications of transposable element (TE) influences on gene regulation are explored here. An historical perspective is presented to underscore the importance of TE influences on gene regulation with respect to both the discovery of TEs and the early conceptualization of their potential impact on host genome evolution. Evidence that points to a role for TEs in host gene regulation is reviewed, and comparisons between genome sequences are used to demonstrate the fact that TEs are particularly lineage-specific components of their host genomes. Consistent with these two properties of TEs, regulatory effects and evolutionary specificity, human-mouse genome wide sequence comparisons reveal that the regulatory sequences that are contributed by TEs are exceptionally lineage specific. This suggests a particular mechanism by which TEs may drive the diversification of gene regulation between evolutionary lineages. Historical perspective Controlling elements The influence of transposable elements (TEs) on gene regulation has been apparent for as long as these genetic elements have been known to exist. In fact, the discovery of TEs was predicated upon their ability to regulate the expression patterns of the genes of the host organisms in which they reside. Barbara McClintock (1984) originally referred to the mobile genetic elements that she discovered in maize as “controlling elements” based on their ability to control the expression of genes involved in pigmentation. Beginning in 1944, McClintock observed many cases of variegation, in other words differences in the pattern of expression, for the distribution of chlorophyll among maize seedling leaves. Importantly, McClintock noticed that distinct chlorophyll patterns were localized to discrete sectors and that these sectors occurred in adjacent pairs where each member of the pair was the reciprocal of the other with respect to their pigmentation. Similar observations were made for patterns of gain and loss of genetic markers on maize kernels (Fig. 1A
Despite McClintock’s standing as a highly respected geneticist and the volume of evidence that she presented, the implications of these findings were not widely appreciated or even accepted until much later. In her recollections of this period, McClintock has attributed the initial reluctance of the scientific community to embrace her conclusions to two aspects of her discovery, both of which were particularly difficult to reconcile with the understanding of genetics that existed at that time (McClintock, 1987). First and foremost, the notion of mobile genetic elements implied a dynamic genome that was radically at odds with the prevailing notion of a static genome based on the “beads on a string” model of chromosomal organization. Somewhat less obviously, at that time even the basic concept that the expression of genes was developmentally regulated was generally not conceived of and would not become widely appreciated until more than a decade later with the publication of the classic work of Jacob and Monod (1961). Of course, the significance of McClintock’s work would come to be fully appreciated in time, and a reflection on the path to her discovery may be taken colloquially to suggest that the very essence of TEs is tied to their ability to influence patterns of host gene regulation. COT curves Another critical early line of research that underscored the potential influence of repetitive DNA on gene regulation was founded on studies of the kinetics of DNA reassociation pioneered by Roy Britten and colleagues (Britten and Kohne, 1968). In short, they observed that the rate of DNA reassociation for relatively large eukaryotic genomes was much more rapid than would be expected if all or even most of the genomic DNA was single copy. Reassociation kinetics were visualized on so-called COT curves where the fraction of reassociated DNA was plotted against COT, a parameter that is equal to the product of the DNA concentration in the solution times the time of incubation (moles of DNA times seconds per liter). Careful examinations of these plots revealed distinct fractions of genomic DNA that reassociate at different rates, and these fractions were inferred to represent different classes of genomic DNA consisting of more (relatively rapidly reassociating) or less (more slowly reassociating) repetitive DNA (Fig. 1B The significance of this experimental work was of course the novel demonstration of the prevalence of repetitive DNA in eukaryotic genomes. Fortunately however, Britten and colleagues did not stop there. They considered the preponderance of repetitive DNA with respect to their interest in both evolutionary theory and gene regulation and hypothesized at length on the significance of repetitive DNA to the evolution of regulatory differences. In fact, their theoretical work of that era represented one of the strongest assertions to date of the importance of regulatory changes driving evolutionary diversification. Britten and Davidson (1969) articulated a detailed model on the genomic architecture of regulatory networks and suggested that repetitive DNA may influence gene expression patterns by providing binding sites for regulatory factors in the 5′ regions of genes. Further elaboration of this model placed even more of an emphasis on the role of repetitive DNA in gene regulation and demonstrated how repetitive sequences could move in the genome and serve as source of evolutionary variation in regulatory patterns. In their model, repetitive sequences were considered to move via chromosomal rearrangement and not transposition per se (Britten and Davidson, 1971). Of course, the precise nature of repetitive DNA was unknown at the time as was the preponderance of TE sequences among this fraction of genomic DNA. However, the predictions of Britten and colleagues were subsequently born out in a number of cases where TEs were demonstrated to alter expression patterns by providing cis-regulatory sequences after insertion into the vicinity of a host gene (Britten, 1996a). Examples of TE influences on gene regulation Molecular evidence Over the last 15 years, an abundance of experimental evidence has accumulated that directly points to the contribution of repetitive DNA to gene regulation. This evidence consists largely of examples where TEs have been shown to contribute to the regulation of a host gene by providing cis-regulatory sequences that interact with host trans factors. Interestingly, the vast majority of these cases were uncovered fortuitously in the sense that the investigators were not out to assess the role of TEs in gene regulation but rather were seeking to understand the molecular basis of the regulatory properties of the particular system that they were working on. The first example of this kind came from the study of the sex-limited protein (Slp) encoding gene in mouse (Stavenhagen and Robins, 1988). Slp is one of two tandem genes and is closely related to the adjacent C4 gene that encodes the fourth component of complement. Apparently, after the duplication of these two genes an endogenous retrovirus (ERV) inserted upstream of the Slp gene and this insertion resulted in an altered expression pattern for Slp which in turn drove the functional divergence of the protein (van den Berg et al., 1992). Unlike the C4 gene, Slp is expressed only in males due to androgen dependence conferred by androgen response elements found in the long terminal repeat of the ERV (Adler et al., 1992, 1993). Pursuant to his interest in the relationship between repetitive DNA and gene regulation, Britten reviewed a number of such cases where insertions of TEs have resulted in fixed novel regulatory patterns and established four criteria for the identification of convincing examples: 1 – the presence of a known TE sequence in the gene region, 2 – evidence that the insertion has been present long enough to be fixed, 3 – evidence that part of the TE sequence participates in the regulation of the nearby gene and 4 – evidence that the gene encodes some function (Britten, 1996a, b; 1997). By 1997, Britten was able to find more than 20 examples that conformed to all four of these criteria and many more similar examples have been uncovered since that time. For instance, a number of cases where human TEs can be shown to serve as promoters for adjacent genes have recently been identified (Landry et al., 2001, 2002; Medstrand et al., 2001; Dunn et al., 2003). The most extensive literature survey to date of TE contributions to host gene regulation identified almost 80 cases where regulatory elements of vertebrate genes are derived from TEs (Brosius, 1999). In addition to serving as promoter and enhancer sequences for nearby genes, TE insertions have also been shown to influence host gene expression by providing alternative splice sites (Varagona et al., 1992; Feuchter-Murthy et al., 1993; Baban et al., 1996; Davis et al., 1998) and polyadenylation sites (Goodchild et al., 1992; Sugiura et al., 1992; Mager et al., 1999). Alu elements may be particularly prone to providing alternative splice sites to host genes and being incorporated into mRNA sequences as a result (Makalowski et al., 1994; Sorek et al., 2002; Lev-Maor et al., 2003). Genomic evidence The accumulation of genomic sequence data has led to a number of efforts to systematically assess the contribution of TEs to gene regulation. These studies have consisted of computer-based inquiries that rely on large scale analyses of sequence data. The earliest examples of these types of studies were conducted on plant genome sequences; investigators interested in the relationship between TEs and plant genes took the novel approach of computationally searching plant gene sequences for the presence of TEs. An initial survey of maize and barley gene sequences revealed that quite a few members of one specific TE family – Tourist, a miniature inverted repeat element (MITE) – were inserted in the regions just flanking genes or in intron sequences (Bureau and Wessler, 1992). This observation suggested that these elements may be often associated with genes, and this was confirmed with more extensive analyses that revealed the frequent association of Tourist elements with genes from a number of different cereal grass genomes (Bureau and Wessler, 1994a; Bureau et al., 1996). Evidence that these TE-gene associations may include functionally significant cases was supplied by studies that revealed that MITEs had contributed regulatory sequences such as cis-binding sites and polyadenylation signals to host genes (Bureau and Wessler, 1994b; Wessler et al., 1995). The recent availability of complete eukaryotic genome sequences provided increased opportunities to systematically evaluate the contribution of TEs to host gene regulatory sequences. For example, the majority of retrotransposons discovered in the complete genome sequence of Caenorhabditis elegans were found to be located in close proximity to host gene sequences suggesting that they may contribute to the regulation of these genes (Ganko et al., 2003). In addition, a survey of the retrotransposons of the fission yeast Schizosaccharomyces pombe revealed that these elements were disproportionately associated with pol II promoters in complete genome sequence (Bowen et al., 2003). TE sequences make up a much greater fraction of vertebrate genomes and studies of the human genome in particular have underscored the substantial contribution of TEs to regulatory sequences. For example, the initial analysis of the human genome sequence revealed that hundreds of transcriptional terminator sequences were donated by one class of retrotransposon alone (Lander et al., 2001). Subsequently more detailed analyses revealed the extent to which human regulatory sequences are derived from TEs. For instance, a survey of human genome sequences found that almost 25% of proximal promoter sequences (i.e. 500 bp upstream of the transcription start site) as well as numerous 5′ and 3′ untranslated regions (UTRs) contained TE derived sequences (Jordan et al., 2003). Clearly, as was the case with the plant sequences studied earlier, there is a strong association between TE sequences and gene sequences in the human genome. However, this fact alone does not necessarily imply functionally relevant relationships where TEs provide working regulatory sequences to host genes; the association of TEs with promoters may simply be due to the prevalence of TE-derived sequences in the human genome at large. To address this issue, experimentally characterized human regulatory sequences were mapped to their gene sequences to examine whether they may have been donated by TEs. When experimentally characterized regulatory sequences were evaluated, it was shown that TEs have donated sequences to both cis-binding regulatory elements that act in a gene-specific manner as well as scaffold/matrix attachment regions (S/MARs) and locus control regions that exert their regulatory effects in a more global manner (Jordan et al., 2003). A subsequent genomic scale analysis of human and mouse sequences confirmed the abundance of TE-derived sequences in regulatory regions and the donation of experimentally characterized regulatory elements by TEs (van de Lagemaat et al., 2003). Interestingly, this study also found that TEs were found more often in the regulatory regions of genes that are rapidly evolving and those with relatively narrow phylogenetic distributions (i.e. those that are mammalian specific). For example, genes involved in immune suppression and those involved in the response to external stimuli were particularly enriched for TE sequences. These observations were taken to suggest that TEs may have contributed substantially to the evolutionary diversification of mammalian genomes presumably by generating lineage-specific patterns of gene regulation. Lineage-specific regulatory sequences contributed by TEs TEs are lineage-specific TEs may be the most lineage-specific elements of eukaryotic genomes. For instance, a recent comparison of a single 12-megabase (Mb) genomic region among 12 vertebrate species indicated that the distribution of different TE types differed greatly within and between vertebrate lineages (Thomas et al., 2003). Among the nine mammalian species examined in this study, species-specific TE insertions account for the majority of size differences seen between lineages. In addition, when the complete mouse genome sequence was compared to that of the human, it was shown that mouse-specific TEs made up 87.0% of all mouse TEs (32.4% of the mouse genome) and human-specific TEs accounted for 51.9% of all human TEs (24.4% of the human genome) (Waterston et al., 2002). In other words, mouse lineage-specific TEs have contributed well over 800 Mb of DNA to the mouse genome and human lineage-specific TEs make up over 700 Mb of the human genome. On the other hand, the same comparison revealed that only 1% of mouse protein coding genes do not have any human homolog and only 20% of mouse genes do not have a direct 1:1 human ortholog (i.e. are not descended from precisely the same ancestral gene). TEs are clearly far more lineage-specific than the host genes of these two mammals. Even more remarkably, TE insertions can generate substantial genomic fractions over much shorter periods of evolutionary time than have elapsed since the human-mouse divergence. Comparison of several primate genomic sequences suggests that transposition rates vary widely across lineages and that the human lineage has experienced a particularly high rate of retrotransposition (Liu et al., 2003). This has led to a TE generated expansion of over 500 Mb in the human lineage over the last 50 million years and an increase of 30 Mb in the human lineage just since the divergence from chimpanzee ~6 million years ago. When these findings are considered with respect to the influence of TEs on gene regulation, it suggests that TEs may exert regulatory effects in a way that is most likely to cause differences between evolutionary lineages. The implications for this aspect of TE influence on gene regulation with respect to methods used to identify regulatory sequences are explored below. Also, in support of the notion that TEs may contribute to lineage-specific regulatory differences, data on the lineage-specific contributions that TEs make to human regulatory sequences are presented. Phylogenetic footprinting may overlook TEs Recently, a sustained effort based on the comparative analysis of genomic sequence data has been made to improve methods for the prediction of cis-regulatory sites in genomic DNA. This approach is known as “phylogenetic footprinting” and it rests on the plausible assumption that functionally important regions of genomic DNA will evolve more slowly than non-important regions due to the effects of purifying selection (Gumucio et al., 1992; Zhang and Gerstein, 2003). From this it follows that when non-coding sequences are compared between species, functionally important regulatory sequences (e.g. cis-binding sites) will be characterized by anomalously low levels of sequence divergence. This method has been employed to identify putative regulatory sequences in a number of different systems (McCue et al., 2002; Boffelli et al., 2003; Kellis et al., 2003; Lenhard et al., 2003). It is worth noting that, at this time, relatively little is known about the pattern of non-coding sequence evolution. The notion that functionally important regulatory sites will be more conserved than neighboring nonfunctional sequences is entirely reasonable and consistent with what is known about molecular evolution (Li, 1997), but it is still mostly an assumption. Only recently have investigators begun to study the pattern of noncoding sequence evolution with respect to the location of known regulatory sites and the results do not entirely support the phylogenetic footprinting rationale. There is evidence that the rate of evolution for noncoding DNA at regulatory sites is lower than the rate of evolution for the surrounding, presumably nonfunctional, noncoding sequence (Dermitzakis and Clark, 2002; Moses et al., 2003). However, several recent studies indicate that there is a rapid evolutionary turnover of regulatory sites, which suggests that the phylogenetic footprinting approach may yield numerous false negatives. For example, when Drosophila pseudoobscura genomic sequences were compared to D. virilis sequences, only 50% of the known regulatory regions were found to be located in sequences that are conserved between the two species (Alkema and Wasserman, 2003). A comparative analysis of genomic sequence from 12 vertebrates fared only slightly better with respect to the identification of functionally validated regulatory elements; in this case, 63% of the known regulatory elements were shown to be located in conserved sequence regions (Thomas et al., 2003). Another analysis of regulatory sequence evolution compared experimentally characterized transcription factor binding sites between human and rodent genomes and found extensive sequence variation at these sites (Dermitzakis and Clark, 2002). Based on this survey, 32–40% of human functional sites were estimated to be nonfunctional in rodents. It appears that the assumption that functionally important regulatory sites will be highly conserved is not always true, and one can expect that phylogenetic footprinting will yield numerous false negatives as a result. An approach that employs the same rationale as that of phylogenetic footprinting has recently been used to evaluate the potential contribution of TEs to functionally important non-coding sequences in mammalian genomes (Silva et al., 2003). In this study, orthologous intergenic regions were compared between the human and mouse genomes. Consistent with the fact that they are very lineage specific, TEs were shown to make up 40–60% of the regions with low similarity between species and only 20% of the regions of high similarity. However, certain families of elements, namely MIR a family of DNA-type elements and L2 a family of LINE-like elements, were found to be common within the conserved segments. From this observation, it was inferred that these ancient conserved TE sequences have been under purifying selection based on some functional utility that they provide to their hosts. Remarkably, the recruitment of these TEs to perform some function that benefits their hosts was shown to be quite common having occurred two times or more for each host gene examined. As demonstrated by the study of Silva et al. (2003), TEs clearly make up an important component of functionally important noncoding DNA. However, the problem of false positives discussed above with respect to phylogenetic footprinting would seem to be even more exacerbated for TEs. Because TEs are so lineage specific, they should be expected to rarely show up as conserved regions in sequence alignments between species. Below, cross-species comparisons of experimentally characterized regulatory sites are shown to suggest that TE-contributed regulatory sequences are far more lineage specific and much less conserved than regulatory sites that are not derived from TEs. Thus, TEs appear to be particularly likely to contribute to the generation of lineage-specific regulatory elements and as such may play a role in driving the diversification between evolutionary lineages. Evidence of TE contributions to lineage-specific regulatory sequences One way to assess the contributions of TEs to regulatory sequences is to search for their presence among promoter sequence regions proximal to host genes. Indeed, a number of studies have inferred a possible role for TEs in gene regulation based on their proximity to host genes (Bureau and Wessler, 1992, 1994b; Bureau et al., 1996; Ganko et al., 2003; Jordan et al., 2003; van de Lagemaat et al., 2003). To this end, we have surveyed the proximal promoter sequences of 4,737 human transcripts for the presence of TE sequences as well as for low complexity repetitive sequences. Full-length human transcript sequences were taken from the database of transcriptional start sites (Suzuki et al., 2002) and proximal promoter regions from the transcripts that mapped unambiguously to the human genome sequence (National Center for Biotechnology, build 33, ftp://ftp.ncbi.nih.gov/genomes/H_sapiens/) were used for analysis. Each of these proximal promoter sequences consists of nucleotides from −2,000 bp to +1,000 bp with respect to the transcriptional start site. Promoter sequences were analyzed with the RepeatMasker program (http://ftp.genome.washington.edu/RM/RepeatMasker.html) to determine the location of TEs and low complexity repetitive sequences (Fig. 2
One problem with the approach described above is that it is difficult if not impossible to definitively claim a role for TEs in host gene regulation based simply on their presence in sequence regions that are involved in regulation. In the case of the human genome for instance, the presence of TEs in host gene regulatory regions may simply reflect their abundance in the genome. Indeed, the pattern of TE insertions in the human promoter regions is consistent with this possibility, and even suggests that most TE insertions in promoter regions are actually deleterious and selected against. This is because the density of TE sequences in human promoters is greatest in the most distal regions and steadily declines closer to the start site of transcription (Fig. 2A One way to make definitive inferences about the contribution of TEs to regulatory sequences is to start with experimentally characterized sites that are known to contribute to the regulation of host genes and then search for cases where such sites can be shown to have been donated by TEs. This approach has been employed successfully to identify TE-derived cis-regulatory sequences as well as TE-derived S/MARs that regulate gene expression in a more global manner (Jordan et al., 2003; van de Lagemaat et al., 2003). We combine a similar approach here, employing the identification of experimentally characterized cis-regulatory sites that overlap with TE sequences, with human-mouse sequence comparisons to evaluate the level of evolutionary conservation of regulatory sites that have been derived from TEs. The TRANSFAC database (Matys et al., 2003) was used to identify experimentally characterized human regulatory sequences. The data that were taken from TRANSFAC (professional version 7.1) are cis-binding sites that have been identified with a number of different experimental procedures including footprinting, gel-shift assays, promoter deletion experiments and mutagenesis. A total of 1,145 of these cis-regulatory sites were mapped to the complete human genome sequence (National Center for Biotechnology, build 33, ftp://ftp.ncbi.nih.gov/genomes/H_sapiens/). The locations of the regulatory sites in the human genome sequence were compared to the location of TE sequences detected using the program Repeat-Masker (http://ftp.genome.washington.edu/RM/RepeatMasker.html). A total of 38 cases where experimentally characterized regulatory sites overlapped with TE-derived sequences were identified in this way (Table 1 and Fig. 3
Conclusion TEs are perhaps the most lineage-specific elements of eukaryotic genomes and they are known to contribute regulatory sequences that control the expression of host genes. Taken together, these facts suggest that TE-derived regulatory sequences may be particularly lineage specific. A comparison of human and mouse genome sequences with respect to the location of TE-derived regulatory sequences suggests that this is indeed the case. This result is consistent with a recent survey that showed TEs to be more prevalent in the UTRs of relatively unconserved human genes (van de Lagemaat et al., 2003). Thus, the activity of TEs may provide one specific mechanism that drives the regulatory diversification of host genome evolutionary lineages. References
|
PubMed related articles
Your browsing activity is empty. Activity recording is turned off. |
|||||||||||
Science. 1984 Nov 16; 226(4676):792-801.
[Science. 1984]J Mol Biol. 1961 Jun; 3():318-56.
[J Mol Biol. 1961]Science. 1968 Aug 9; 161(841):529-40.
[Science. 1968]Nature. 2001 Feb 15; 409(6822):860-921.
[Nature. 2001]Nature. 2002 Dec 5; 420(6915):520-62.
[Nature. 2002]Science. 1969 Jul 25; 165(891):349-57.
[Science. 1969]Q Rev Biol. 1971 Jun; 46(2):111-38.
[Q Rev Biol. 1971]Mol Phylogenet Evol. 1996 Feb; 5(1):13-7.
[Mol Phylogenet Evol. 1996]Cell. 1988 Oct 21; 55(2):247-54.
[Cell. 1988]Proc Natl Acad Sci U S A. 1992 Nov 15; 89(22):10711-5.
[Proc Natl Acad Sci U S A. 1992]Proc Natl Acad Sci U S A. 1992 Dec 15; 89(24):11660-3.
[Proc Natl Acad Sci U S A. 1992]Mol Cell Biol. 1993 Oct; 13(10):6326-35.
[Mol Cell Biol. 1993]Mol Phylogenet Evol. 1996 Feb; 5(1):13-7.
[Mol Phylogenet Evol. 1996]Proc Natl Acad Sci U S A. 1996 Sep 3; 93(18):9374-7.
[Proc Natl Acad Sci U S A. 1996]Gene. 1997 Dec 31; 205(1-2):177-82.
[Gene. 1997]Genomics. 2001 Aug; 76(1-3):110-6.
[Genomics. 2001]Mol Biol Evol. 2002 Nov; 19(11):1934-42.
[Mol Biol Evol. 2002]Plant Cell. 1992 Jul; 4(7):811-20.
[Plant Cell. 1992]Nucleic Acids Res. 1993 Jan 11; 21(1):135-43.
[Nucleic Acids Res. 1993]Genomics. 1996 May 1; 33(3):463-72.
[Genomics. 1996]Genetics. 1998 Nov; 150(3):1105-14.
[Genetics. 1998]Gene. 1992 Nov 16; 121(2):287-94.
[Gene. 1992]Plant Cell. 1992 Oct; 4(10):1283-94.
[Plant Cell. 1992]Proc Natl Acad Sci U S A. 1994 Feb 15; 91(4):1411-5.
[Proc Natl Acad Sci U S A. 1994]Proc Natl Acad Sci U S A. 1996 Aug 6; 93(16):8524-9.
[Proc Natl Acad Sci U S A. 1996]Plant Cell. 1994 Jun; 6(6):907-16.
[Plant Cell. 1994]Curr Opin Genet Dev. 1995 Dec; 5(6):814-21.
[Curr Opin Genet Dev. 1995]Mol Biol Evol. 2003 Nov; 20(11):1925-31.
[Mol Biol Evol. 2003]Genome Res. 2003 Sep; 13(9):1984-97.
[Genome Res. 2003]Nature. 2001 Feb 15; 409(6822):860-921.
[Nature. 2001]Trends Genet. 2003 Feb; 19(2):68-72.
[Trends Genet. 2003]Trends Genet. 2003 Oct; 19(10):530-6.
[Trends Genet. 2003]Nature. 2003 Aug 14; 424(6950):788-93.
[Nature. 2003]Nature. 2002 Dec 5; 420(6915):520-62.
[Nature. 2002]Genome Res. 2003 Mar; 13(3):358-68.
[Genome Res. 2003]Mol Cell Biol. 1992 Nov; 12(11):4919-29.
[Mol Cell Biol. 1992]J Biol. 2003; 2(2):11.
[J Biol. 2003]Genome Res. 2002 Oct; 12(10):1523-32.
[Genome Res. 2002]Science. 2003 Feb 28; 299(5611):1391-4.
[Science. 2003]Nature. 2003 May 15; 423(6937):241-54.
[Nature. 2003]Mol Biol Evol. 2002 Jul; 19(7):1114-21.
[Mol Biol Evol. 2002]BMC Evol Biol. 2003 Aug 28; 3():19.
[BMC Evol Biol. 2003]Genome Biol. 2003; 4(7):327.
[Genome Biol. 2003]Nature. 2003 Aug 14; 424(6950):788-93.
[Nature. 2003]Genet Res. 2003 Aug; 82(1):1-18.
[Genet Res. 2003]Genet Res. 2003 Aug; 82(1):1-18.
[Genet Res. 2003]Plant Cell. 1992 Oct; 4(10):1283-94.
[Plant Cell. 1992]Plant Cell. 1994 Jun; 6(6):907-16.
[Plant Cell. 1994]Proc Natl Acad Sci U S A. 1996 Aug 6; 93(16):8524-9.
[Proc Natl Acad Sci U S A. 1996]Mol Biol Evol. 2003 Nov; 20(11):1925-31.
[Mol Biol Evol. 2003]Trends Genet. 2003 Feb; 19(2):68-72.
[Trends Genet. 2003]Trends Genet. 2003 Feb; 19(2):68-72.
[Trends Genet. 2003]Trends Genet. 2003 Oct; 19(10):530-6.
[Trends Genet. 2003]Nucleic Acids Res. 2003 Jan 1; 31(1):374-8.
[Nucleic Acids Res. 2003]Genome Res. 2003 Jan; 13(1):103-7.
[Genome Res. 2003]Nucleic Acids Res. 2003 Jan 1; 31(1):51-4.
[Nucleic Acids Res. 2003]Genet Res. 2003 Aug; 82(1):1-18.
[Genet Res. 2003]Trends Genet. 2003 Feb; 19(2):68-72.
[Trends Genet. 2003]Trends Genet. 2003 Oct; 19(10):530-6.
[Trends Genet. 2003]