![]() | ![]() |
Formats:
|
||||||||||||||||||||||||
Copyright © 2004, Genetics Society of America Duplicative and Conservative Transpositions of Larval serum protein 1 Genes in the Genus Drosophila Departament de Genètica i de Microbiologia, Universitat Autònoma de Barcelona, 08193 Bellaterra (Barcelona), Spain 1Corresponding author: Departament de Genètica i de Microbiologia, Facultat de Ciències-Edifici C, Universitat Autònoma de Barcelona, 08193 Bellaterra (Barcelona), Spain. E-mail: alfredo.ruiz/at/uab.es Communicating editor: M. Veuille Received December 17, 2003; Accepted June 1, 2004. This article has been cited by other articles in PMC.Abstract Interspecific comparative molecular analyses of transposed genes and their flanking regions can help to elucidate the time, direction, and mechanism of gene transposition. In the Drosophila melanogaster genome, three Larval serum protein 1 (Lsp1) genes (α, β and γ) are present and each of them is located on a different chromosome, suggesting multiple transposition events. We have characterized the molecular organization of Lsp1 genes in D. buzzatii, a species of the Drosophila subgenus and in D. pseudoobscura, a species of the Sophophora subgenus. Our results show that only two Lsp1 genes (β and γ) exist in these two species. The same chromosomal localization and genomic organization, different from that of D. melanogaster, is found in both species for the Lsp1β and Lsp1γ genes. Overall, at least two duplicative and two conservative transpositions are necessary to explain the present chromosomal distribution of Lsp1 genes in the three Drosophila species. Clear evidence for implication of snRNA genes in the transposition of Lsp1β in Drosophila has been found. We suggest that an ectopic exchange between highly similar snRNA sequences was responsible for the transposition of this gene. We have also identified the putative cis-acting regulatory regions of these genes, which seemingly transposed along with the coding sequences. SEQUENCE analysis of genomes has revealed that gene transposition contributes significantly to the reorganization of eukaryotic genomes. Gene transposition refers to the movement of relatively small genomic segments, containing one or a few genes, from one chromosomal position to another. This movement may be accompanied or not by the duplication of the genomic segment, two processes that may be denoted as duplicative and conservative transposition, respectively. In nematodes, gene transposition seems to be the most frequent kind of genome rearrangement (Coghlan and Wolfe 2002) whereas duplications of chromosomal segments encompassing a few genes followed by differential gene loss is a common cause of gene order changes in yeasts (Llorente et al. 2000; Fischer et al. 2001). In plants, repeated rounds of large-scale genome duplication followed by selective gene loss are the main factors in genome evolution. Chromosomal rearrangements were thought to be only a minor factor in the divergence of plant genomes (Ku et al. 2000). However, when more detailed comparisons were performed, many chromosomal rearrangements were found. For example, the comparison of genome sequences of rice to orthologous regions from other grass species revealed that numerous local rearrangements, including transpositions of single genes to different chromosomes, have occurred (Bennetzen and Ma 2003). Transposition has also played a significant role in the evolution of the mammalian genome. Segmental duplications originated from the duplicative transposition of small portions of chromosomal material represent ~5% of the human genome (Eichler 2001; Lander et al. 2001; Bailey et al. 2002) and at least 1.2% of the mouse genome (Cheung et al. 2003). Some of the duplicated segments in the human genome are associated with rapid gene innovation and chromosomal rearrangement in the genomes of man and the great apes (Samonte and Eichler 2001; Armengol et al. 2003; Locke et al. 2003). In Drosophila, segmental duplications seem to be rare in comparison to the number found in mammalian genomes (Lander et al. 2001; Celniker et al. 2002). Also, detailed analyses by in situ hybridization show that the gene content of chromosomal elements is generally conserved and suggest that gene transpositions are relatively scarce (González et al. 2002; Ranz et al. 2003) in relation to paracentric inversions, which have been traditionally considered as the chief type of chromosomal change (Krimbas and Powell 1992; Powell 1997). However, recent sequence analyses comparing five different Drosophila species point to similar numbers of inversions and gene transpositions (Bergman et al. 2002) and a number of new genes originated by retroposition has been unveiled (Betrán et al. 2002). Diverse molecular mechanisms (sometimes poorly understood) may be responsible for gene transposition events. A common mechanism for gene transposition is retroposition, which implies reverse transcription of RNA and insertion of the resulting cDNA into a different genome site. In humans, the long interspersed element (LINE) L1 often associates 3′ flanking DNA as a read-through transcript and carries the non-L1 sequence to a new genomic location, a process termed L1-mediated transduction (Moran et al. 1999; Lander et al. 2001). Seemingly the LINE machinery can also act in trans to cellular RNA substrates giving rise to the trans-mobilization of genomic DNA, processed pseudogenes, and occasionally new functional genes (Esnault et al. 2000; Betrán et al. 2002; Ejima and Yang 2003; Long et al. 2003). Another mechanism of transposition is transposon-mediated excision and insertion of genomic segments. For instance, in D. melanogaster, Folback elements flanking relatively large genomic segments are able to transport these segments to sites far away in the genome, forming the so-called “giant transposons” (Chia et al. 1985; Lovering et al. 1991). Excision and insertion of these giant transposons is mediated by homologous recombination involving the Foldback sequences at the transposon termini. Transposable elements seem to be implicated also in the origin of segmental duplications in humans. Duplication junctions have been found to be enriched for Alu short interspersed element sequences with a significant proportion of all segmental duplications ending within Alu sequences (Bailey et al. 2003). This observation suggests Alu-Alu homologous recombination as the most likely mechanism for these rearrangements. A similar mechanism has previously been shown to generate small duplications, deletions, and inversions in diverse organisms. Larval serum protein 1 (Lsp1) genes provide one of the few examples of gene transposition in the genus Drosophila. In D. melanogaster, each of the three Lsp1 genes is located on a different chromosome: Lsp1α in chromosome X, Lsp1β in chromosomal arm 2L, and Lsp1γ in chromosomal arm 3L (Roberts and Evans-Roberts 1979; Smith et al. 1981). These chromosomal arms correspond to Muller's elements A, B, and D, respectively (Table 1). Brock and Roberts (1983) mapped Lsp1 genes by in situ hybridization in nine other species of the Sophophora subgenus (including D. pseudoobscura) and five species of the Drosophila subgenus (including D. hydei as a representative species of the repleta group). In the melanogaster subgroup species they were able to localize the three Lsp1 genes: Lsp1α on element A, Lsp1β on element B, and Lsp1γ on element D. In all the other species, both Lsp1α and Lsp1β hybridized to the same polytene band of Muller's element E, suggesting a gene exchange between elements. No hybridization of the Lsp1γ gene was observed although γ-like proteins were detected with specific antibodies. To determine Lsp1 gene number, Brock and Roberts (1983) performed Southern analyses and concluded that at least two genes, one α-like and one β-like, were present in all the species analyzed. Their data also suggested that the ancestor of the genus Drosophila probably had its Lsp1 genes on element E. Recently, we localized by in situ hybridization Lsp1α in chromosome 2 (Muller's element E) of D. repleta and D. buzzatii (González et al. 2002; Ranz et al. 2003), corroborating their results.
In this work, Lsp1 genes and their flanking sequences have been cloned and sequenced in D. buzzatii, a species belonging to the repleta group of the Drosophila subgenus, which diverged from D. melanogaster 40–62 MYA (Beverly and Wilson 1984; Russo et al. 1995). In addition, the genome sequence of D. pseudoobscura (available at http://www.hgsc.bcm.tmc.edu) has been searched and Lsp1 genes have been annotated in this species, which diverged from D. melanogaster ~30 MYA (Throckmorton 1975) and belongs to the same subgenus. The aims of this study are: (i) to determine beyond doubt the number and localization of Lsp1 genes in these two Drosophila species because Southern analyses and in situ hybridization results for members of gene families may be misleading (Bachtrog and Charlesworth 2003); (ii) to ascertain the number and type of transposition events undergone by Lsp1 genes during the evolution of the genus Drosophila; (iii) to uncover the molecular mechanism of transposition, in particular to test the hypothesis of an involvement of transposable elements; and (iv) to identify putative regulatory sequences of Lsp1 genes and determine whether these regulatory sequences transposed along with the coding sequences or were recruited ex novo at the new chromosomal location. MATERIALS AND METHODS Drosophila stocks: Two lines of D. buzzatii were used: line j-19 from Ticucho (Argentina) is fixed for chromosomal arrangement 2j and line jq7-4 from Otamendi (Argentina) is fixed for arrangement 2jq7. The genome sequence of D. pseudoobscura comes from inbred line MV2-25 (http://www.hgsc.bcm.tmc.edu). Screening of genomic libraries: Two different lambda genomic libraries were screened by plaque hybridization: the j-19 library (Cáceres et al. 2001) and the jq7-4 library (Casals et al. 2003). Both libraries were previously amplified as described in Sambrook et al. (1989). DNA from positive phages was digested and the resulting fragments were subcloned into Bluescript II SK vector (Stratagene, La Jolla, CA) after gel purification. The j-19 library was screened with a 0.7-kb BamHI fragment of the D. melanogaster Lsp1α gene (Brock and Roberts 1983). Six positive phages were recovered and one of them, λj-19/8, was partially sequenced and found to contain the 5′ region of Lsp1β (Figure 1a
Southern analysis: Southern hybridization was performed as described in Sambrook et al. (1989). Probes were labeled by random primer with digoxygenin-11-dUTP and hybridization was carried out overnight in standard buffer with 50% formamide at 42° for homologous probes and at 37° for heterologous probes. Stringency washes were performed with 2× SSC 0.1% SDS and 0.1× SSC 0.1% SDS solutions at 68° and 50° for homologous and heterologous hybridizations, respectively. DNA sequencing and sequence analysis: Sequences were obtained with an ABI 373 A (Perkin Elmer, Norwalk, CT) automated DNA sequencer using M13 universal forward and reverse primers. A few internal primers were also designed when necessary. Nucleotide sequences were assembled using GeneToolLite software (BioTools). Similarity searches in the GenBank/EMBL and in the Drosophila pseudoobscura genome project (available at http://www.hgsc.bcm.tmc.edu) databases were performed using blastn and fasta3 programs. Multiple sequence alignments were obtained with ClustalW (Thompson et al. 1994) and DiAlign (Morgenstern 1999). PAML software (Yang 1997) was used to estimate the number of synonymous (dS) and nonsynonymous (dN) substitutions per site. This software avoids using reconstructed ancestral sequences to estimate dS and dN for lineages in a phylogeny by using a maximum likelihood approach. Different codon-based likelihood models that allow for different dN/dS ratios among evolutionary lineages can be devised. The models can then be compared to test the neutral prediction that the dN/dS ratio is identical among lineages (Yang and Bielawski 2000). In situ hybridization: In situ hybridization of DNA probes was carried out as described in Montgomery et al. (1987). Hybridization temperature was 37°. Probes were labeled with biotin-16-dUTP by nick translation and detection was done using the ABC-Elite kit from Vector Laboratories (Burlingame, CA). Hybridization signals were localized using the cytological maps of D. buzzatii (Ruiz and Wasserman 1993) and photographs were taken with a phase contrast Nikon Optiphot-2 microscope at ×600 magnification. RESULTS Lsp1 gene number: Screening of two genomic libraries of D. buzzatii with different probes allowed us to isolate clones containing the Lsp1β and Lsp1γ genes (see materials and methods). To determine the number of Lsp1 genes present in the D. buzzatii genome, a Southern analysis was performed (see Figure S1 available as online supplementary material at http://www.genetics.org/supplemental/). Genomic DNA from D. buzzatii and D. melanogaster (as control) was digested with EcoRI and HindIII and hybridized with the same three probes used to screen the D. buzzatii libraries (see materials and methods). The D. melanogaster Lsp1α probe hybridized to 6- and 4.1-kb fragments of D. melanogaster genomic DNA corresponding to Lsp1α and Lsp1β, respectively (Brock and Roberts 1983). In D. buzzatii this probe hybridized to a single fragment of 3.1 kb corresponding to Lsp1β. The D. buzzatii Lsp1β probe hybridized to 6- and 4.1-kb fragments of D. melanogaster corresponding to Lsp1α and Lsp1β, respectively (Brock and Roberts 1983), and a 3.1-kb fragment of D. buzzatii genomic DNA corresponding to Lsp1β. Finally, the D. buzzatii Lsp1γ probe hybridized to the 4.1-kb fragment corresponding to Lsp1β and to 1.8- and 1.4-kb fragments both corresponding to Lsp1γ of D. melanogaster (Brock and Roberts 1983) and also with a 3.1-kb fragment corresponding to Lsp1β and a 1-kb fragment corresponding to Lsp1γ of D. buzzatii. Overall, the results of Southern analyses indicated that only two genes, Lsp1β and Lsp1γ, are present in the D. buzzatii genome. No Lsp1α gene was detected by Southern analysis in the D. buzzatii genome. Similarity searches against the D. pseudoobscura genome sequence database were carried out (Table 2). Two sequences with significant similarity were found with Lsp1β. The most similar sequence, that included in contig 815_contig 5737, was considered to correspond to the D. pseudoobscura Lsp1β ortholog. When the database was searched with the coding sequence of Lsp1γ as query the same two hits were recovered. This time the most similar sequence included in contig 1500_contig 3546 was considered to correspond to the D. pseudoobscura ortholog of Lsp1γ. No new sequences were identified when D. melanogaster Lsp1α sequence was used as query, the only significant hits being those previously identified as Lsp1β and Lsp1γ. We conclude that only Lsp1β and Lsp1γ are present in the D. pseudoobscura genome.
Organization of the Lsp1β and Lsp1γ genomic regions in D. buzzatii and D. pseudoobscura: In D. buzzatii, the Lsp1β gene was localized by in situ hybridization, using the λj-19/25 phage as probe to polytene band D3c of chromosome 2 (see Figure S2 at http://www.genetics.org/supplemental/). This chromosome is homologous to chromosomal arm 3R of D. melanogaster (Muller's element E; Table 1). Overall, 9464 bp including the entire D. buzzatii Lsp1β gene and its flanking regions were sequenced (accession no. AY561258). In this species, Lsp1β is flanked by CG12492 and Pellino (Figure 1a In D. pseudoobscura, the Lsp1β gene is included within contig 815_contig 5737, which has been putatively assigned to chromosome XR, homologous to D. melanogaster chromosomal arm 3L (Muller's element D; Table 1). As in D. buzzatii, Lsp1β is flanked by CG12492 and Pellino but in D. pseudoobscura there are three snRNA:U1 genes between Lsp1β and Pellino (Figure 1b In situ hybridization of the λjq7-4/27 phage, containing the entire Lsp1γ coding sequence, to the polytene chromosomes of D. buzzatti allowed us to map this gene to band C2g of chromosome 3 (see Figure S2 at http://www.genetics.org/supplemental/). This chromosome is homologous to chromosomal arm 2L of D. melanogaster (Muller's element B; Table 1). In total, 10,817 nucleotides including the entire Lsp1γ gene and its flanking regions have been sequenced in D. buzzatii (accession no. AY561259). Upstream of Lsp1γ, 4.5 kb from the ATG codon, there is a Leucyl transfer RNA (Leu-tRNA) gene (Figure 2a In D. pseudoobscura, Lsp1γ is found within contig 1500_contig 3546, which belongs to chromosome 4, homologous to chromosomal arm 2L of D. melanogaster (Muller's element B; Table 1). Analysis of the D. pseudoobscura genomic sequence revealed that 4.9 kb upstream from Lsp1γ there is an Aspartic acid transfer RNA (Asp-tRNA). Both Lsp1γ and Asp-tRNA are nested inside the first intron of the gene Sema-1a (Figure 2b Molecular structure of Lsp1β and Lsp1γ genes in D. buzzatii and D. pseudoobscura: In the two species, both Lsp1β and Lsp1γ are made up of two exons separated by a small intron (Table 3). The alignment of D. buzzatii, D. pseudoobscura, and D. melanogaster nucleotide sequences shows a 79.4% nucleotide identity for Lsp1β and 75.7% nucleotide identity for Lsp1γ. For both genes, the intron is placed in the same precise site in the three species and has a similar length (63–71 nucleotides for Lsp1β and 57–65 nucleotides for Lsp1γ) but shows little nucleotide conservation other than the donor (G·T·A/G·A·G·T/C) and acceptor (C·A·G) splice sites (Delaney et al. 1986).
Both Lsp1β and Lsp1γ possess a TATA box, which is localized in D. buzzatii and D. pseudoobscura at the same nucleotide position as in D. melanogaster (−32 to −26). Sequence similarity extends for several nucleotides on either side of the TATA box (Figure 3
We searched for putative regulatory sequences in the 5′ regions of Lsp1 genes in D. buzzatii and D. pseudoobscura following the criteria devised by Bergman and Kreitman (2001). Comparison of the 5′ ends of Lsp1β led to the identification of three conserved sequences starting at sites −189, −126, and −683 of D. buzzatii (Table 4, b, c, and d). Another three conserved sequences starting at sites −379, −181, and −71 of D. buzzatii were found in the 5′ region of Lsp1γ (Table 4, e, f, and g). Molecular evolution of the Lsp1 genes: The coding sequence of Lsp1β and Lsp1γ from D. buzzatii, D. melanogaster, and D. pseudoobscura and that of Lsp1α from D. melanogaster were aligned using ClustalW. The number of synonymous and nonsynonymous substitutions per site for pairwise comparisons between the three species were estimated using maximum likelihood methods (Yang 1997). The dN/dS ratios for Lsp1β (0.0226–0.0449) are similar to those for Lsp1γ (0.0209–0.0512). Comparisons between Lsp1α of D. melanogaster with Lsp1β (0.0470–0.0545) and Lsp1γ (0.0367–0.0386) of the other two species also yielded similar results (Table 5). Overall, the dN/dS ratios were low, suggesting a relatively high degree of functional constraint of these genes in the three species analyzed.
Two different methods, neighbor joining and the unweighted pair group method using arithmetic averages (UPGMA), were used to construct phylogenetic trees using PHYLIP software (Felsenstein 1989). In addition to the seven above-mentioned sequences, we included in the trees those of arylphorin, an Lsp1-like gene of Calliphora vicina (Naumann and Scheller 1991), and Lsp2 of D. melanogaster (Adams et al. 2000). Alternative models for the evolution of the Lsp genes were then tested using maximum likelihood methods (Yang 1997; Bielawski and Yang 2003). First we tested for the constancy of evolution rates by comparing both trees: that produced with the UPGMA method, which assumes a molecular clock, and that built with the neighbor-joining method, assuming no clock. In both cases a single dN/dS ratio for all lineages was considered. The difference between the likelihood of both trees was significant (2Δl = 32.64; 7 d.f.; P < 0.005) indicating that the model assuming no clock provides a significantly better fit to the data (Figure 4
DISCUSSION Lsp1 gene number: In D. melanogaster the LSP-1 protein is made up of three subunits encoded by three Lsp1 genes: Lsp1α, Lsp1β and Lsp1γ (Roberts and Evans-Roberts 1979). However, in D. buzzatii, a species of the Drosophila subgenus, and in D. pseudoobscura, a species of the Sophophora subgenus, only two Lsp1 genes seem to be present: Lsp1β and Lsp1γ. Two different genomic libraries of D. buzzatii have been screened using as probes a fragment of D. melanogaster Lsp1α and a fragment of D. buzzatii Lsp1β or Lsp1γ. Overall, four different library screenings have been carried out and in every case all positive clones contained either Lsp1β or Lsp1γ. No Lsp1α gene was found. To corroborate this result, genomic DNA was digested with restriction enzymes and hybridized with the same three probes used to screen the libraries and again the results were in agreement with the existence of only two genes, Lsp1β and Lsp1γ, in the D. buzzatii genome. Also, the D. pseudoobscura genome has been recently sequenced to approximately sevenfold coverage and is available at http://www.hgsc.bcm.tmc.edu. Similarity searches against this database have been performed, allowing us to annotate Lsp1β and Lsp1γ but not Lsp1α. Thus, our results indicate that Lsp1α is not present in D. buzzatii or D. pseudoobscura. The most parsimonious explanation for this observation is that the duplicative transposition that gave rise to Lsp1α took place in the lineage leading to D. melanogaster after the divergence of the D. pseudoobscura lineage ~30 MYA (but see below). The fact that D. melanogaster Lsp1α is not dosage compensated although it is X linked is in agreement with a recent duplicative transposition onto this chromosome (Roberts and Evans-Roberts 1979). Brock and Roberts (1983) reported that in eight species of the Drosophila and Sophophora subgenera, Lsp1α and Lsp1β map to the same polytene band in chromosomal element E except in D. pseudoobscura, where they observed an extra signal in element B (see Table 1). They concluded that there were at least two genes, one α-like and one β-like, given that Lsp1γ could not be localized by in situ hybridization. Contrasting results have been obtained in this work. We have unambiguously localized Lsp1β to chromosomal element E and Lsp1γ gene to element B in both D. buzzatii and D. pseudoobscura. According to our results the extra signal in D. pseudoobscura element B likely corresponds to Lsp1γ. A plausible explanation for their results is that Lsp1α and Lsp1β probes were cross-hybridizing to Lsp1β as is suggested by the fact that the signals obtained with the Lsp1β probe were stronger than those obtained with the Lsp1α probes (Brock and Roberts 1983). Likewise, our previous in situ hybridization of D. melanogaster Lsp1α to D. buzzatii (and D. repleta) chromosomes (González et al. 2002; Ranz et al. 2003) must be reinterpreted as due to cross-hybridization with Lsp1β (see Figure S2 at http://www.genetics.org/supplemental/). Larval serum proteins belong to the hemocyanin superfamily and are thought to act as storage proteins that provide amino acids and energy during nonfeeding periods of immature or adult development (Burmester et al. 1998). In D. melanogaster LSP-1 is a heterohexamer of randomly associated subunits encoded by Lsp1α, Lsp1β, and Lsp1γ genes. The lack of one subunit in D. buzzatii and D. pseudoobscura does not imply that the LSP-1 protein will not be functional. In fact, it has been reported that an inbred stock of D. melanogaster lacking the γ-chain is viable under laboratory conditions, suggesting that a subunit specific function for the LSP-1 monomers does not exist (Brock and Roberts 1980). The polypeptides encoded by Lsp1β and Lsp1γ of D. melanogaster, D. pseudoobscura, and D. buzzatii and by Lsp1α of D. melanogaster were aligned with the ClustalW algorithm (see Figure S3 at http://www.genetics.org/supplemental/). These seven sequences have a similar length (772–789 aa), 49% of the amino acids are identical, and 33% of the amino acid substitutions are conservative or semiconservative. However, different subunits can accumulate different specific amino acids and this could allow the organism to modulate the availability of these amino acids in different developmental processes (Massey et al. 1997). The three subunits of the LSP-1 protein of D. melanogaster, as well as similar proteins in other Diptera, are enriched in aromatic amino acids (Burmester et al. 1998). Aromatic residues are thought to serve as precursors for quinones, which play a role in cuticle hardening during metamorphosis (Burmester et al. 1998). The polypeptides coded by Lsp1α and Lsp1β are also enriched in methionine but not those encoded by Lsp1γ. The same pattern is observed in D. pseudoobscura and D. buzzatii. The role of methionine in Drosophila development is not clear (Massey et al. 1997). Gene duplication is considered a major force in gene family expansion and gene innovation. After duplication, one copy may become silenced (nonfunctionalization) or assume a novel function (neofunctionalization) or both copies may split the multiple functions of the ancestral gene (subfunctionalization; Lynch and Conery 2000). Gene amplification of highly expressed functions often lead to highly conserved paralogs in microbial genomes (Hooper and Berg 2003). The latter seems to be the case of the Lsp1 genes in Drosophila. The protein coded by these genes is accumulated to high levels by feeding larvae (Massey et al. 1997) and gene amplification has not resulted in different discernible functions. Genome organization and number of transposition events: In both D. buzzatii and D. pseudoobscura, Lsp1β is flanked by CG12492 and Pellino genes on chromosomal element E (Figure 1 In D. buzzatii and D. pseudoobscura, Lsp1γ is flanked by a tRNA and both genes are located inside the first intron of Sema-1a on element B (Figure 2 Given that Lsp1β and Lsp1γ are present in the three species studied in different chromosomes, the duplicative transposition originating these two genes must have occurred before the divergence of these species (40–62 MYA). The chromosomal localization of the ancestral Lsp1 gene in the genus Drosophila is unknown. There is no reason to believe that this chromosome was element E as suggested by Brock and Roberts (1983) because in both D. buzzatii and D. pseudoobscura Lsp1β is located on element E and Lsp1γ is located on element B. Therefore, the localization of the ancestral Lsp1 gene could be any of these two elements. In any case, to explain the current localization of these two genes in D. melanogaster at least two conservative transpositions are needed (see above). Another duplicative transposition gave rise to D. melanogaster Lsp1α. The absence of this gene in D. pseudoobscura indicates that this transposition likely occurred after the divergence between D. melanogaster and D. pseudoobscura. However, the neighbor-joining tree of Lsp1 sequences (Figure 4
Mechanism of transposition: As stated before, the position of the unique intron of Lsp1β and Lsp1γ and the 5′ putative regulatory sequences are conserved in the three species analyzed. This allows us to rule out retroposition (Betrán et al. 2002) as the mechanism of transposition. The analysis of the flanking regions of both genes provides no evidence that transposition had been mediated by transposable elements (TEs). A single TE copy was found in the original position of Lsp1β in D. melanogaster (Figure 1c We propose that an ectopic exchange between snRNA:U1 sequences mediated the Lsp1β transposition in Drosophila. Three observations indicate that snRNA:U1 genes are implied in this transposition. First, they are present in the original location of chromosomal element E as well as in the destination site of element B. Second, snRNA:U1 genes present a high level of nucleotide identity among the three species (98%) suggesting that they might act as substrates for ectopic exchanges. Finally, the snRNA:U1 gene represents the downstream boundary of the transposed chromosomal segment (Figure 1 The evidence for the implication of tRNA genes in the transposition of Lsp1γ is weak. The tRNA genes found near Lsp1γ in D. buzzatii and D. pseudoobscura belong to a different isoacceptor type and no tRNA gene was found near Lsp1γ in D. melanogaster. In addition these tRNAs lie outside the transposed chromosomal segment. Therefore, an involvement of tRNA genes in the transposition of Lsp1γ is unlikely and in this case the mechanism remains uncertain. 5′ noncoding conserved sequences: Transposition implies the localization of the transposed gene in a new genomic environment. The probability of success of a transposition should be higher when it includes the regulatory regions than when it does not. To be functional, in the latter case the transposed gene would need to recruit new regulatory regions (Betrán et al. 2002). Comparative sequence analysis allows the identification of conserved DNA sequences in noncoding regions that are considered putative cis-acting regulatory elements (Bergman and Kreitman 2001). Comparison of the 5′ ends of Lsp1α, Lsp1β and Lsp1γ genes of D. melanogaster led to the identification of two such conserved sequences (Delaney et al. 1986). These sequences are also conserved in Lsp1β (Table 4, b and c) and Lsp1γ (Table 4, e and f) in the three species analyzed. These two regions are more conserved between Lsp1α and Lsp1β than between Lsp1β and Lsp1γ, which is in agreement with the origin of Lsp1α from a duplication of Lsp1β (Smith et al. 1981). Two conserved sequences not previously described in D. melanogaster have been also identified. One is exclusive of Lsp1β, the other is exclusive of Lsp1γ (Table 4, d and g), and both are highly conserved in the three species analyzed. All conserved sequences except that located at −181 of D. buzzatii Lsp1γ fulfill the requirements used by Bergman and Kreitman (2001) to identify noncoding conserved blocks. The mean size of these five blocks is 13 bp, similar to the modal size of 11 bp reported by these authors, and three of them are included in the conserved blocks that they have described. They follow the pattern described for cis-regulatory elements in Drosophila, i.e., highly conserved sequences separated by unalignable gaps (Bergman and Kreitman 2001). The 5′-untranslated regions of the RNA show two highly conserved sequences common to the three Lsp1 genes in the three species analyzed (Figure 3 Molecular evolution: Maximum likelihood methods of phylogenetic inference were used to test alternative models for the evolution of Lsp1 genes (Yang 1997). Figure 4 The ratio of synonymous (silent, dS) to nonsynonymous (amino acid changing, dN) substitution rates is a measure of selective pressure on a protein: if amino acid changes are neutral dN/dS = 1, if they are mostly deleterious dN/dS < 1, and if they offer a selective advantage dN/dS > 1 (Yang and Bielawski 2000; Bielawski and Yang 2003). A model assuming free dN/dS ratios for different lineages provided a significantly better fit to the data than a model with a single ratio for all lineages. This led us to test if a model assuming four different dN/dS ratios, one background ratio and one ratio for each of the three lineages leading to D. melanogaster Lsp1 genes, was significantly different from the single-ratio model. According to our observations, the Lsp1 genes have transposed in these three lineages, and we wanted to test if the change of chromosomal location has affected the molecular evolution of these genes. Again the results were significant, indicating that the model assuming four different dN/dS ratios fits better to the data. The estimates obtained for the dN/dS ratios in this model were 0.0518 for the background ratio and 0.0769, 0.0598, and 0.0231 for the lineages leading to Lsp1α, Lsp1β, and Lsp1γ of D. melanogaster, respectively. These results suggest that changes in the selection regime (degree of functional constraint) are associated with Lsp1 transpositions. However dN/dS ratios do not show a consistent pattern, being higher than the background ratio for Lsp1α and lower than the background ratio for Lsp1γ. In addition, the background ratio is itself heterogeneous, indicating that other factors besides transposition have an effect on the molecular evolution of Lsp1 genes. Acknowledgments We thank L. Sánchez (Centro de Investigaciones Biológicas, Consejo Superior de Invesigaciones Cientifícas, Madrid) for providing the D. melanogaster Lsp1α clone and E. Hasson (Universidad de Buenos Aires) for D. buzzatii stocks. We also thank B. Negre for technical advice and helpful discussion. This work was supported by grant BMC2002-01708 from the Dirección General de Investigación (Ministerio de Ciencia y Tecnología, Spain) awarded to A.R. Notes References
|
PubMed related articles
Your browsing activity is empty. Activity recording is turned off. |
|||||||||||||||||||||||
Genome Res. 2002 Jun; 12(6):857-67.
[Genome Res. 2002]Genome Res. 2001 Dec; 11(12):2009-19.
[Genome Res. 2001]Proc Natl Acad Sci U S A. 2000 Aug 1; 97(16):9121-6.
[Proc Natl Acad Sci U S A. 2000]Curr Opin Plant Biol. 2003 Apr; 6(2):128-33.
[Curr Opin Plant Biol. 2003]Trends Genet. 2001 Nov; 17(11):661-9.
[Trends Genet. 2001]Nature. 2001 Feb 15; 409(6822):860-921.
[Nature. 2001]Genetics. 2002 Jul; 161(3):1137-54.
[Genetics. 2002]Evolution. 2003 Jun; 57(6):1325-35.
[Evolution. 2003]Genome Res. 2002 Dec; 12(12):1854-9.
[Genome Res. 2002]Science. 1999 Mar 5; 283(5407):1465;1467.
[Science. 1999]Nature. 2001 Feb 15; 409(6822):860-921.
[Nature. 2001]Nat Genet. 2000 Apr; 24(4):363-7.
[Nat Genet. 2000]Genome Res. 2002 Dec; 12(12):1854-9.
[Genome Res. 2002]Hum Mol Genet. 2003 Jun 1; 12(11):1321-8.
[Hum Mol Genet. 2003]Nature. 1979 Aug 23; 280(5724):691-2.
[Nature. 1979]Cell. 1981 Feb; 23(2):441-9.
[Cell. 1981]Genetics. 2002 Jul; 161(3):1137-54.
[Genetics. 2002]Evolution. 2003 Jun; 57(6):1325-35.
[Evolution. 2003]Mol Biol Evol. 1995 May; 12(3):391-404.
[Mol Biol Evol. 1995]Genetics. 2003 Jul; 164(3):1237-40.
[Genetics. 2003]Genome Res. 2001 Aug; 11(8):1353-64.
[Genome Res. 2001]Mol Biol Evol. 2003 May; 20(5):674-85.
[Mol Biol Evol. 2003]Nucleic Acids Res. 1994 Nov 11; 22(22):4673-80.
[Nucleic Acids Res. 1994]Bioinformatics. 1999 Mar; 15(3):211-8.
[Bioinformatics. 1999]Comput Appl Biosci. 1997 Oct; 13(5):555-6.
[Comput Appl Biosci. 1997]Trends Ecol Evol. 2000 Dec 1; 15(12):496-503.
[Trends Ecol Evol. 2000]Genet Res. 1987 Feb; 49(1):31-41.
[Genet Res. 1987]Heredity. 1993 Jun; 70 ( Pt 6)():582-96.
[Heredity. 1993]J Mol Biol. 1986 May 5; 189(1):1-11.
[J Mol Biol. 1986]Genome Res. 2001 Aug; 11(8):1335-45.
[Genome Res. 2001]Comput Appl Biosci. 1997 Oct; 13(5):555-6.
[Comput Appl Biosci. 1997]Biochem Biophys Res Commun. 1991 Jun 28; 177(3):963-72.
[Biochem Biophys Res Commun. 1991]Science. 2000 Mar 24; 287(5461):2185-95.
[Science. 2000]Comput Appl Biosci. 1997 Oct; 13(5):555-6.
[Comput Appl Biosci. 1997]J Struct Funct Genomics. 2003; 3(1-4):201-12.
[J Struct Funct Genomics. 2003]Nature. 1979 Aug 23; 280(5724):691-2.
[Nature. 1979]Genetics. 2002 Jul; 161(3):1137-54.
[Genetics. 2002]Evolution. 2003 Jun; 57(6):1325-35.
[Evolution. 2003]J Mol Evol. 1998 Jul; 47(1):93-108.
[J Mol Evol. 1998]Eur J Biochem. 1980 May; 106(1):129-35.
[Eur J Biochem. 1980]Eur J Biochem. 1997 Apr 1; 245(1):199-207.
[Eur J Biochem. 1997]Science. 2000 Nov 10; 290(5494):1151-5.
[Science. 2000]Mol Biol Evol. 2003 Jun; 20(6):945-54.
[Mol Biol Evol. 2003]J Mol Evol. 1988; 27(3):250-60.
[J Mol Evol. 1988]Proc Natl Acad Sci U S A. 1991 Nov 15; 88(22):10203-7.
[Proc Natl Acad Sci U S A. 1991]Genome Res. 2002 Dec; 12(12):1854-9.
[Genome Res. 2002]Nature. 2003 May 15; 423(6937):241-54.
[Nature. 2003]Am J Hum Genet. 2003 Oct; 73(4):823-34.
[Am J Hum Genet. 2003]Genome Res. 2002 Dec; 12(12):1854-9.
[Genome Res. 2002]Genome Res. 2001 Aug; 11(8):1335-45.
[Genome Res. 2001]J Mol Biol. 1986 May 5; 189(1):1-11.
[J Mol Biol. 1986]Cell. 1981 Feb; 23(2):441-9.
[Cell. 1981]Dev Biol. 1984 Mar; 102(1):206-15.
[Dev Biol. 1984]EMBO J. 1985 Dec 30; 4(13B):3789-3795.
[EMBO J. 1985]J Mol Biol. 1986 May 5; 189(1):13-24.
[J Mol Biol. 1986]Comput Appl Biosci. 1997 Oct; 13(5):555-6.
[Comput Appl Biosci. 1997]Trends Ecol Evol. 2000 Dec 1; 15(12):496-503.
[Trends Ecol Evol. 2000]J Struct Funct Genomics. 2003; 3(1-4):201-12.
[J Struct Funct Genomics. 2003]Genome Res. 2001 Aug; 11(8):1335-45.
[Genome Res. 2001]