![]() | ![]() |
Formats:
|
||||||||||||||||||||||||||||||||||||||||||||||||||
Copyright : © 2008 Crocker et al. This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited. Evolution Acts on Enhancer Organization to Fine-Tune Gradient Threshold Readouts Department of Biological Sciences, Dartmouth College, Hanover, New Hampshire, United States of America Michael B Eisen, Academic Editor University of California, Berkeley, United States of America * To whom correspondence should be addressed. E-mail: albert.erives/at/dartmouth.edu Received April 25, 2008; Accepted September 16, 2008. This article has been cited by other articles in PMC.Abstract The elucidation of principles governing evolution of gene regulatory sequence is critical to the study of metazoan diversification. We are therefore exploring the structure and organizational constraints of regulatory sequences by studying functionally equivalent cis-regulatory modules (CRMs) that have been evolving in parallel across several loci. Such an independent dataset allows a multi-locus study that is not hampered by nonfunctional or constrained homology. The neurogenic ectoderm enhancers (NEEs) of Drosophila melanogaster are one such class of coordinately regulated CRMs. The NEEs share a common organization of binding sites and as a set would be useful to study the relationship between CRM organization and CRM activity across evolving lineages. We used the D. melanogaster transgenic system to screen for functional adaptations in the NEEs from divergent drosophilid species. We show that the individual NEE modules across a genome in any one lineage have independently evolved adaptations to compensate for lineage-specific developmental and/or genomic changes. Specifically, we show that both the site composition and the site organization of NEEs have been finely tuned by distinct, lineage-specific selection pressures in each of the three divergent species that we have examined: D. melanogaster, D. pseudoobscura, and D. virilis. Furthermore, by precisely altering the organization of NEEs with different morphogen gradient threshold readouts, we show that CRM organizational evolution is sufficient for explaining changes in enhancer activity. Thus, evolution can act on CRM organization to fine-tune morphogen gradient threshold readouts over a wide dynamic range. Our study demonstrates that equivalence classes of CRMs are powerful tools for detecting lineage-specific adaptations by gene regulatory sequences. Author Summary The regulatory control of genes allows an organism to generate a diversity of cell types throughout its body. Gene regulation involves specialized DNA sequences called transcriptional enhancers that increase the expression of genes in specific places and times. Enhancers contain clusters of specific DNA sequences that are uniquely recognized by DNA binding proteins, whose activities are also regulated in space and time. The critical role that DNA enhancers play in generating the diversity of cell types within a single organism suggests that changes in these DNA sequences may also underlie the diversity of organismal forms produced by evolution. However, few examples linking specific changes in enhancer sequences to functional adaptations have been documented. We studied a group of neuro-embryonic enhancers that turn on a certain group of genes in different fruit fly species that have been diverging from each other for ~50 million years. Each species has experienced unique changes in its protein-coding sequences, gene regulatory sequences, egg morphology, and developmental timing. We found that the organizational spacing between the protein binding sites in these enhancers has evolved in a manner that is consistent with functional adaptations compensating for the dynamic and idiosyncratic evolutionary history of each species. Introduction The state of a biological cell can be defined by the combined transcriptional status of each gene in a genome. Developmental systems specify cell state by regulating transitions between states. The regulatory logic for these state transitions is encoded in cis-regulatory DNA sequences, which specify the transcriptional activity of each gene [1,2]. Each gene may be controlled by multiple locus-specific, independently acting cis-regulatory modules (CRMs), which function as transcriptional enhancers, silencers, and insulators [3,4]. Such a set of CRMs can function collectively to sculpt a robust, complex spatiotemporal expression pattern [5]. Because of the critical role that CRMs play in specifying the transcriptional states of a cell, they have been proposed to be a primary target of natural selection [6–20]. Nonetheless, the relative importance of cis-regulatory versus protein-coding evolution has been debated because of a relative deficit of specific examples of functional CRM evolution [21]. Some important functional and evolutionary properties of CRMs have been elucidated. For example, enhancers possess switch-like properties that respond to well-defined physiological conditions [22–25], and generally enhancers can drive expression of a heterologous locus when placed almost anywhere into that locus [2]. Each CRM itself is a DNA segment of around 200 to 400 bp long that is composed of clustered binding sites for cooperative and competitive trans-acting factors that interact with the DNA. The elements constituting a CRM can arise rapidly anywhere in a gene locus in response to selection [9,17]. Furthermore, slightly deleterious mutations of binding sites in a CRM can be stabilized by the selection of compensatory sites elsewhere in the same CRM [26]. All of these properties of CRMs clearly establish that the evolutionary histories of such DNA sequences are unlike the evolution of protein-coding sequences. However, little is currently known about how evolutionary forces operate on the internal structure of CRMs simply because the organizational constraints of such sequences have not been fully explored. To address the role of organizational constraints in CRM evolution, we have used the sequenced genomes for three different Drosophila species [27], which have been diverging for ~50 million years [28,29]. These lineages have experienced divergent evolutionary pressures related to lineage-specific ecological life histories. Specific morphological differences include egg developmental morphology (e.g., size and shape of egg [30], composition of dorsal respiratory appendages in the egg chamber [31,32]), and embryonic developmental timing. Additionally, each lineage has experienced divergent genomic evolution as a result of differences in mutational processes. For instance, differences have been documented in insertion and deletion rates [33–35], as well as specific chromosomal inversions and transposition events [29]. Thus, Drosophila provides a powerful model system for studying how developmental suites of genes still manage to produce the basic body plan of a fly despite divergent processes affecting embryogenesis and genome composition. In this study, we show how a class of equivalent developmental CRMs track evolutionary change in different Drosophila lineages. These CRMs act as neurogenic ectoderm enhancers (NEEs) and function to drive gene expression in the early embryonic neuroectoderm before gastrulation has commenced [36]. The NEEs map to unrelated loci: the rhomboid (rho) locus, which encodes a serine protease; the vein (vn) locus, which encodes an epidermal growth factor receptor ligand; the ventral neurons defective (vnd) locus, which encodes an NK-2 class homeobox transcription factor; and the brinker (brk) locus, which encodes a dipteran-specific helix-turn-helix repressor. The NEEs from these loci are located variably in either upstream or intronic positions and do not share sequence homology indicative of a common evolutionary origin. Each NEE has independently evolved an organized cluster of common binding sites defined by three sequence signatures in D. melanogaster [36]. First, there are one to two pairs of a Dorsal binding sites closely juxtaposed (<20 bp) to a CA-core E-box motif, which is variably bound in different cells by either Twist basic helix-loop-helix (bHLH) complexes or the Snail C2H2 zinc-finger repressor [37]. Synergistic activation by Dorsal and Twist at specific positions along the Dorsal morphogen concentration gradient has been well documented [38–40]. Second, there is a unidirectionally oriented site, the μ motif, which is situated at a relatively fixed distance from the Dorsal–Twist pair, and which resembles the binding site for another co-activator, Dorsal interacting protein-3 (Dip3) [41–43]. Third, there is a unidirectionally oriented site, which is a composition of overlapping binding sites for the Notch signaling effector Suppressor of Hairless [Su(H)] and Dorsal. Other nuclear factors may also operate at NEE motifs in distinct territories along the dorsal/ventral (D/V) axis [44]. These diverse motifs co-occur in a 240–320 bp window defining each NEE [36]. Here, we show that selection acts on the organization of NEE binding sites to fine-tune the threshold readouts along the Dorsal concentration gradient. Results Identification of Multiple Drosophilid NEEs We identified NEE-type sequences across the D. melanogaster, D. pseudoobscura, and D. virilis genomes in order to determine how a set of coordinately regulated gene loci co-evolve in a given lineage. We found that the NEE signatures of paired Dorsal–Twist binding sites (5′-SGGAAADYCSS and 5′-CACATGT, respectively) and a Su(H) site overlapping a separate Dorsal site (5′-CGTGGGAAAWDCSM, Su(H) site underlined) were present together in a single CRM across many loci (Figures 1
From these three genomes, we cloned and assayed in transgenic stage 5(2) D. melanogaster embryos all NEE sequences from these species, which comprised five NEEs from D. melanogaster, five NEEs from D. virilis, and four NEEs from D. pseudoobscura, making a total of 14 distinct NEE-like sequences (Figure 1 To verify the endogenous expression patterns of NEE-bearing genes in these three species, we performed whole-mount antisense RNA in situ hybridization experiments using species-specific probes. Because the developmental timing of embryogenesis differs in the different species, we focused on one developmental time point corresponding to embryonic stage 5(2), when the embryo is midway through cellularization (Figure S1). At this point, the cell walls are 50% elongated and are easily identifiable under bright field microscopy. For D. melanogaster embryos growing at 25 °C this corresponds to ~2 h 45 min after egg deposition. For D. virilis embryos growing at 25 °C, stage 5(2) corresponds to ~5 h after egg deposition (Figure S1). However, in all three systems, NEE-driven reporters show earlier activity in late stage 4. This early pattern of activity in late stage 4 sometimes includes faint staining in the mesoderm that disappears by stage 5(2), at which point the lateral stripes are at their most robust and most reproducible levels. We found that the expression patterns of orthologous genes across different species were significantly more alike than the expression patterns of NEE-bearing genes within the same species (Figure 1
Rapid Divergence of NEE Sequences Despite the similar patterns of endogenous expression between NEE-bearing orthologs, the amount of sequence divergence among these 14 enhancers is such that no two orthologous NEEs are more than ~60% identical as a result of numerous substitutions, insertions, and deletions. In some cases, the amount of identity between orthologous enhancers is as low as ~44%. This sequence divergence could represent both neutral drift processes and/or positive selection operating in each lineage. Although these NEE sequences share certain signatures, some of these are undoubtedly examples of stabilizing selection creating de novo sites that compensate for sites lost by mutation. We find that this has occurred for all types of NEE motifs. For example, overlapping Su(H)/Dorsal motifs are not always present in the same location in some species for the vnd and rho NEEs (details 3 and 12 in Figure 2 Not all NEE-bearing loci in one species are necessarily NEE-bearing in other species. We have found that the D/V patterning gene sog, which encodes a chordin-like inhibitor of the BMP/dpp signaling pathway, is an NEE-bearing locus in the D. melanogaster lineage but not in the D. pseudoobscura and D. virilis lineages (Figures S2 and S3). Previous work identified an intronic lateral stripe enhancer in the D. melanogaster locus (LSE in Figure S2E) [46]. This enhancer was identified by its cluster of multiple Dorsal binding site but lacks Su(H), E-box and μ motifs. We find that this intronic enhancer is not as well conserved across species as the upstream NEE-like sequences. Moreover, the D. melanogaster sog NEE drives the broadest lateral stripe of expression of all the other NEEs we have tested, spanning 15 nuclei across the entire embryo (see Figure 1 What: Lineage-Specific Thresholds for Morphogen Gradients An unknown portion of the substitution, insertion, and deletion mutations observed in the NEE cis sequences may be lineage-specific adaptations that stabilize changes occurring in trans. To address this question, we assayed individual enhancers from all three species in transgenic stage 5(2) D. melanogaster embryos, with multiple lines per enhancer to ensure reproducibility. This assay effectively decouples lineage-specific changes in trans from changes in cis by testing all enhancers in the same trans-environment of D. melanogaster. If these enhancers have evolved to compensate for lineage-specific changes in the trans regulatory environment, then we should observe similar directional changes for the entire equivalence class from one lineage when tested in transgenic D. melanogaster embryos. Interestingly, despite similar functional outputs of orthologous NEEs in the context of their native genomes (Figure 1
The vnd enhancers, which produce the narrowest stripes of the NEE modules tested, showed the smallest differences in relative expression patterns when assayed in D. melanogaster (Figure 3 Similar results across NEE orthologs were obtained when we measured lacZ transcript intensity levels along the D/V axis using confocal microscopy. By aligning the sharp border of snail expression we see similar differences in stripe width (Figure 5 How: Precise Organization Controls NEE Function As the D/V patterning system is known to be mediated primarily by Dorsal and Twist proteins, we decided to investigate the configuration of their binding sites in all of the enhancers and relate these in turn to their widths of expression across the lateral regions of the embryo. We found, first, with a few exceptions, that the CA-core E-box motif 5′-CACATGT is remarkably constant across the NEE sequences of all three species. Second, the Dorsal site occasionally sustains some point mutations. Third, there appear to have been many insertions and deletions that have adjusted the spacing between these two sites. Thus, the changes from all three types of variables (Twist sites, Dorsal sites, and their spacing) have served to alter the spacing in most cases, and occasionally to alter the number and quality of paired Dorsal and Twist sites (see Figure 2 To test this hypothesis of genome-wide threshold adaptations, we decided to alter specific NEE sequences that differed only slightly in their Dorsal and Twist binding motif configuration relative to another NEE sequence that nonetheless differed greatly in the span of expression along the D/V axis. Of relevance, we also noted that the broadest NEE transgenes had a spacing between the Dorsal site and the adjacent E-box close to 7–12 bp (compare Figure 4 For example, the Drosophila brk NEEs have a conserved organization consisting of a central invariant Dorsal site flanked on either side by invariant CA-core E-box motifs (5′-CACATGT) (Figure 2
In a similar example, the Drosophila melanogaster vn and sog NEEs possess the same Dorsal motif, which otherwise tends to vary at other loci (Figure 6 Thus, both the natural range of NEE configurations within each genome and across all three genomes, together with our functional manipulation of NEE configurations, confirm that the Dorsal site/E-box configuration controls the precise extent of D/V expression by extending the dorsal border of expression, where concentration of key activators is limiting. Thus, not only is there is an optimal organization for maximum affinity, but there is also a range of affinities that is exploited by natural selection to maintain precise threshold readouts of the concentration gradient of a developmental morphogen. Why: Lineage-Specific Changes in the D/V Patterning System We next investigated diverse possibilities for lineage-specific selective pressures that might have caused NEEs across each genome to functionally adapt in similar directions via changes in spacing between the binding sites of cooperative activators. Such reasons might help explain why D. pseudoobscura NEEs have the weakest and narrowest stripes of expression in D. melanogaster embryos, while D. virilis NEEs have the strongest and widest stripes of expression when NEEs from all three species are tested in D. melanogaster embryos (Figure S8). First, amino acid substitutions have occurred in the known NEE transactivators Dorsal and Twist (Figure S4, and unpublished data). Some enhancer evolution could therefore conceivably be due to stabilizing selection for changes in the trans factors themselves. Relative to the D. melanogaster Dorsal peptide sequence, these changes include a few non-synonymous substitutions in the DNA-binding REL homology domain (RHD, underlined sequence in Figure S4), as well as several non-synonymous substitutions and peptide indels in the non-DNA-binding regions. Interestingly, some of these amino acid substitutions in the DNA-binding domain correspond to known mutations that either reduce or augment Dorsal–Twist synergistic activation [52]. A lysine (K) to leucine (L) change in the D. pseudoobscura Dorsal RHD corresponds to a position that augments activation when mutated to alanine (A) in D. melanogaster Dorsal (see M7 in Figure S4). Both are changes of a basic side-chain to an aliphatic one. Such changes in D. pseudoobscura Dorsal might allow the evolution of weaker target NEEs. Remarkably, another mutation in the D. virilis Dorsal RHD corresponds to a position that reduces activation when mutated in D. melanogaster Dorsal (see M23 in Figure S4). Such a change in Dorsal might necessitate the evolution of stronger D. virilis NEEs. Future studies will investigate Dorsal protein and NEE co-evolution. A second potential reason for genome-wide adaptations could also be changes in the protein expression levels. Staining with polyclonal antibodies made to D. melanogaster Dorsal and Twist factors reveals ventral to dorsal nuclear concentration gradients in all three species, with detectably slightly narrower nuclear Dorsal concentration gradients in D. virilis and broader nuclear Dorsal concentration gradients in D. pseudoobscura relative to the D. melanogaster gradients (Figures S5 and S6). The ratio of intensities for dorsal cytoplasmic levels of Dorsal antigen versus ventral nuclear levels are qualitatively similar across species and indicate that the shapes or profiles of the nuclear concentration gradient are comparable, even though the absolute intensities may not be comparable (Figure S7). Nonetheless, if the Dorsal morphogen gradient really is augmented in the smaller D. pseudoobscura embryos, and reduced in the larger D. virilis embryos, such changes would be consistent with the NEEs adapting to lineage-specific concentration readouts (Figure S8). We also see a third potential reason for lineage-specific threshold readouts related to genome evolution (Table 1). Flow cytometry analyses of Drosophila genome sizes have confirmed a diverse range of sizes from 130 Mb for D. mojavensis up to 364 Mb for D. virilis [27]. D. melanogaster and D. pseudoobscura have intermediate genome sizes of 200 Mb and 193 Mb, respectively [27]. Of interest, we find that the total number ND = NAG/A of estimated NEE-style Dorsal motifs (5′-SGGAAABYCCH), where NA is the number of motifs found in the unfiltered assembly of size A in a genome of size G, is relatively constant across all three genomes, the total number NE of NEE-style CA-core E-boxes (5′-CACATGT) is 2-fold greater in D. virilis than in D. melanogaster (Table 1). We also note that the relative increase of this motif is a secondary trend related to a simple expansion (D. virilis) or compaction (D. melanogaster) of 5′-CACA repeats occurring primarily in euchromatic regions of the genome (unpublished data). In general, longer microsatellites have been documented in D. virilis than in D. melanogaster [35]. However, a potential effect of a 2-fold greater number of genomic 5′-CACATGT motifs might be to reduce the effective free nuclear concentration gradient of Twist bHLH complexes in D. virilis relative to D. melanogaster via background sequence sequestration [53]. Such an effect would also be consistent with the observed adaptive trend to a lower concentration threshold readout in D. virilis than in D. melanogaster (Figure S8B). Thus there are potentially several lineage-specific changes affecting the activity or profile of the dorsal/ventral morphogen system that would necessitate the observed genome-wide adaptations in downstream target enhancers. Discussion In this study we demonstrated the effectiveness of studying gene regulatory evolution in the context of a CRM equivalence class consisting of all genomic sequences that regulate nearly identical patterns of activity by interacting with a common set of transcription factors. Equivalence classes of CRMs represent molecular examples of parallelisms, which can be defined as similar, homoplastic patterns of evolutionary innovation constrained by a restrictive set of available regulatory mechanisms [54]. For several important reasons, generalization of CRM logic for a class of functionally and mechanistically equivalent CRMs is difficult to obtain from the phylogenetic study of a single CRM. First, insufficient time for divergence away from an ancestral CRM can leave much superficial similarity among orthologous CRMs. Second, the persistence of initial organizational constraints present in the ancestral CRM can obscure possible alternative configurations of the class-defining cis elements. Third, novel lineage-specific evolutionary adaptations will impede any method that relies purely on phylogenetic conservation. We used the equivalence class of NEEs from several species to show that selection acts on the organization of enhancers to fine-tune their output. In this case, NEEs have evolved in parallel to adapt to changes in a developmental morphogen gradient, whose activity levels have shifted in different directions in different lineages, necessitating compensatory changes in the cis components of downstream targets (Figure S8). Such stabilizing selection, encoded in the configuration of Dorsal binding sites and Twist-binding CA-core E-boxes, reflects species-specific thresholds of activation (compare θDm, θDp, and θDv in Figure S8). Thus, the D. pseudoobscura enhancers have evolved to respond to higher morphogen concentrations than those in D. melanogaster (θDp > θDm), while the D. virilis enhancers have evolved to respond to lower concentrations than those in D. melanogaster (θDv < θDm). This stabilizing selection is revealed only when finely tuned NEEs are tested in the exogenous concentration gradient of a different species, in this case the reference species D. melanogaster. It should not be surprising that small changes in the linkage between the Dorsal and Twist complex binding sites could have such a dramatic effect on the D/V range of expression. First, evolutionary changes in the dorsal border of neuroectodermal gene expression are a simple readout of the most limiting amount of nuclear Dorsal, further limited by limiting amounts of Dorsal target proteins working as Dorsal co-factors, such as Twist bHLH complexes [39,40,55]. Second, recent studies on the Bicoid morphogen gradient, which simultaneously patterns the anterior/posterior axis, suggest that its precision is pushed to the physical limits imposed by the stochasticity of molecules [56–58]. Therefore, precision in the morphogen gradients patterning the embryonic axes should indicate a precision in CRM readouts, as we have shown here for one class of D/V enhancers and others have indicated for diverse anterior/posterior enhancers [59,60]. It should also not be surprising that it is the site linkage that is adjusted by natural selection rather than the quality of the binding sites. Quantitative modeling of the classical Dorsal morphogen system has placed heavy emphasis on the quality of binding sites as determinants of differential threshold readouts by target enhancers in the mesoderm, mesectoderm, and neuroectoderm [51]. However, we do not believe this is entirely at odds with our results showing evolutionary modification of NEE activity via organizational linkage because it may indicate that within the neuroectodermal territory, where Dorsal protein is present in limiting amounts, binding sites are likely to have already evolved to be high affinity sites. This would indicate that in this embryonic territory, extra affinity can only be achieved by optimizing positional linkage between these cooperatively binding factors. This in turn implies that these same factors have relatively fixed steric dimensions. Another potential locus and/or cause of selection lies in the protein-coding sequences of the morphogens themselves, as has been shown for other trans factors [61,62]. Both Dorsal and Twist are used as combinatorial inputs for other regulons in other tissues (e.g., mesoderm and mesectoderm) and the pleiotropic consequences of changes to either protein–protein interaction motifs or DNA-binding domains might limit the number of possibilities for such changes. Additionally, such protein-coding changes may not effectively or precisely target the Dorsal–Twist interactions where their amounts are limiting and/or affect the interaction in the continuously graded fashion that we have documented. Nevertheless, stabilizing selection for changes in the Dorsal peptide sequence itself or other trans factors could be explored by future trans complementation assays. However, in many cases, it may be difficult to disentangle evolutionary cause and effect between co-evolving loci throughout the genome because sequence changes may be either the initiating causes or the products of selection for developmental homeostasis. In summary, the coordinate changes in both sequence and activity shown by the neuroectodermal enhancers from each species provide strong evidence for functionally adaptive cis-regulatory evolution. The number of fixed nucleotide changes corresponding to these parallel molecular adaptations occurring across a genome is minimal, and corresponds to a few indels between Dorsal and Twist sites across the NEEs in a single genome. Occasionally, new Dorsal and Twist sites with optimal spacing are presumed to have been selected in individual species for the vnd, rho, and sog NEEs while in others only the spacing has changed between existing sites. Our results show that natural selection can act with ease to fine-tune CRM organization and thus calibrate enhancer activity over a wide dynamic range. As such, CRM organization may represent a large and unexplored locus of stored adaptive information. Materials and Methods Fly stocks. D. melanogaster strain w1118 was used for P-element transformations of all reporter constructs. D. virilis and D. pseudoobscura were obtained from the Tucson Drosophila Stock Center. Cloning and in situ hybridization. D. melanogaster, D. virilis, and D. pseudoobscura embryos were collected, and subsequently fixed. Hybridization with digoxigenin-labeled antisense RNA probes was conducted as previously described [63]. Fluorescent multiplex in situ hybridization methods were performed as previously described [64]. Briefly, primary antibodies were used to detect fluorescein isothiocyanate- and digoxigenin-labeled antisense RNA probes (used 1:400, Invitrogen), followed by detection of primary antibodies using secondary antibodies labeled with Alexa Fluor dyes (used 1:500, Invitrogen). Images of fluorescently labeled embryos were acquired on a Nikon Eclipse 80i scanning confocal microscope with a 20× objective lens. Sum projections of confocal stacks were assembled and plot profiles of the RNA transcripts were analyzed using the ImageJ software. Anti-sense endogenous probes were created by PCR amplification from genomic DNA with a T7 RNA polymerase promoter included on the reverse primer (see Supplemental methods for all primer pairs). DNA fragments for injection were cloned into the [-42EvelacZ]-pCaSpeR vector and introduced into the D. melanogaster as described previously [3]. Between three and seven independent transgenic lines were obtained for each construct: Dm brk NEE (657 bp), Dp brk NEE (859 bp), Dv brk NEE (744 bp), Dm rho NEE (871 bp), Dp rho NEE (843 bp), Dv rho NEE (726 bp), Dm vn NEE (919 bp), Dp vn NEE (858 bp), Dv vn NEE (836 bp), Dm vnd NEE (1020 bp), Dp vnd NEE (1305 bp), Dv vnd NEE (1093 bp), Dm sog NEE (550 bp), and Dv sog NEE (871 bp). Immunochemistry. For protein expression experiments, Drosophila embryos for all species were fixed in 3.7% formaldehyde for 20 min at room temperature. Polyclonal rabbit anti-Dorsal, guinea pig anti-Dorsal, and rabbit anti-Twist antibodies were used for primary detection. Secondary anti-rabbit antibodies conjugated to FITC (Roche Applied Sciences) and anti-guinea pig conjugated to TRITC (Sigma-Aldrich) were used for visualization. Bioinformatics. Whole-genome scans for sequences matching enhancer models were conducted using multiple techniques to verify results. These methods included searching fly genomes using scripts written in PYTHON and as well as local-alignment-based methods, primarily the VISTA suite of whole genome alignments. Dialign2 was used for additional sequence alignment and to determine nucleic acid identity. DNA construct cloning and injection. The enhancer sequences used in this study have been deposited in GenBank (http://www.ncbi.nlm.nih.gov/Genbank/index.html) with accession numbers FJ169871–FJ169884. D. melanogaster and D. virilis DNA fragments containing identified enhancer elements were amplified from genomic DNA with the following primer pairs (lowercase letters denote flanking primer sequence used to introduce the restriction site indicated in brackets): Dm rho: [BsaI] ccgcataattcgagaccCAGTTAAGTGAGTCGCTTTCAGG, ccgcataattcgagaccTAGATAGATATACCCATCCTGGCC; Dv rho: [BsaI] ccgcataattcgagaccCTGGGAAGTTGCACGAGAGACGC, ccgcataattcgagaccGAGAAACTCTTCTGGCACAACGC; Dp rho: [EcoRI] ccgcggaattcACTAGTGGAAGCTGCTCTACAGACGCG, ccgcggaattcACTAGTCACACACAGCGAAGCACTGAGA; Dm vnd: [EcoRI] ccgcggaattcGGAAGATTGGGCGTTGAAAGC, ccgcggaattcCGGCCATTCACACGATTGACACA; Dv vnd: [Mef1] ccgcgcaattgCTGTTTGGCCTGGCTGGC, ccgcgcaattgATGGCCGGAAAGCAACACAATGG; Dp vnd: [BsaI] ccgcataattcgagaccGTATTTCGAAGATGCATTTGTTTGC, ccgcataattcgagaccTGAATGGCCGGAAGGTCCAACAAG; Dm vn: [BsaI] ccgcataattcgagaccCAGTTCTGGATCTTCCGAATCACC, ccgcataattcgagaccAAATTTGTAGCCAGCGGCGACG; Dv vn: [EcoRI] ccgcggaattcCATATGTTGCCCCTTTGCTGTTGC, ccgcataattcgagaccAAATTTGTAGCCAGCGGCGACG; Dp vn: [BsaI] ccgcataattcgagaccTGTGGGGCAATATTTTCTTTTTAGC, ccgcataattcgagaccCTAAAAATGCAACTCCAACTTGTCTG; Dm brk: [EcoRI] ccgcggaattcTTGGTCGGAAAATACCTGCGC, ccgcggaattcATTGTGTGGCGTTAGAAAGATATGG; Dv brk: [EcoRI] ccgcggaattcTGTCCGGGCTTATGGATCG, ccgcggaattcTGCATTATCCGTGCTAAGTTTGGG; Dp brk: ccgcTCTAGAAAAATGCCGAACAGGTACGTCG, ccgcTCTAGAAAATCATATCCTAACCCCATCTGGG; Dm sog NEE: [EcoRI] ccgccgaattcTGTTTATGGCAGCCAATTGATGCCGA, ccgccgaattcgatgatctagaatcgcacggagag. The brk and rho enhancers were mutagenized using overlap extension PCR with the following primers: Dm brk D/T1 (letters flanking deletion underlined): GGCACAGGCACACATGTGTGTTTGTGAACGGGAAAGCCCCATTTT; Dm brk D/T2 (letters flanking deletion underlined): GCCCCATTTTAAAGCTGGCCCAACGGCAACACATGTTCATGTTAG; Dm vn-1 (letters flanking deletion underlined): GGACAGGTAACGGGCCACATGTCTGCCGGAAATTCCCCGTTGACCCCTG; Dm vn+1 (inserted letter underlined): GGACAGGTAACGGGCCACATGTCTGAGCCGGAAATTCCCCGTTGACCCCTG; Dm vn+2 (inserted letters underlined): GGACAGGTAACGGGCCACATGTCTGATGCCGGAAATTCCCCGTTGACCCCTG. Probes for whole-mount in situ hybridization. The following primer pairs were used to amplify probes for each of the indicated genes from each species with the T7 promoter sequence indicated): Dm rho: ATCTGGGCTATGCTCTCTACACC, T7-TTAACTGCAAACGGTAACGATAACG; Dv rho: GCCGTCTACACGCAGTACTTCG, T7-CATTTGTTTACACGTTTCGGCCCG; Dp rho: TTGCCATCTTCGCCTACGATCG, T7-GCTTAGGAGACACCCAAGTCG; Dm vn: CTTTGCGGCACCCACCGTTTT, T7-TCCACTCACTATAATTTTCGCTCAC Dv vn: GAGTAGAAGATATATGCGTATGAGC, T7-GTTCACAGCCATTTTAACTGCTTCG; Dp vn: AATTTTGGAGGACGCCATGTATGCG, T7-ACCGATTCCATCACCGAGTGG; Dm brk: GAAATACAACATTCACCGCCGG, T7-TCAGGATTGGCACTTGTATTGGC; Dv brk: ATTTTTGTTACTTCCAAGACGACGG, T7-TATCGAACTTGTTCGCTGTTATCC; Dp brk: CAGCAACCACAGTCCTAACGC, T7-ATCAGGTTGTCGTTGGAGACG; Dm vnd: TCAGTTTGGAATGTGTAGAGTGCGC, T7-GGATAAAAGGCGGCTGGTAGG; Dv vnd: AATGTTTAGAGTGCGGCTGACAACG, T7-TCCAAGGGGGCAGCAATATGG; Dp vnd: TGTCCTACACCTACATCGGTTCC,TCTAGCAGTATTAGGGCCACC; Dm sog: GGAAATGAAGTCCATGTACACCACC, T7-TCTCGTACACCTTGTTGACCACC; Dv sog: GAGGAGATGAAGTCCATGTACACG, T7-CCGTTCTCGTAGATCTTGTTGACG; Dp sog: ATAGTATGTCCCATGCCTCACCG,TTCTGCTCTGGCGAATTTTAAAGC; Dm cg8117: ccgccCTAAAAATTGTGCTTTCCGGTTTCG, T7-ATTTACATATTGTGGAAGCCAACGG; Dm cg8119: ccgccAACAAGTATCCGACCAACAATCTGG, T7-TCACGTCAGCAGCTTCTTCTCC. Figure S1: Staging of Drosophila Embryos (A) The average amount of time to develop to embryonic stages 3, 4, 5, and 6 at 25 °C was plotted for D. melanogaster (blue diamonds) and D. virilis (orange squares). Initial NEE-driven transgene activity is first detectable in mid to late stage 4. (B) Embryonic stages corresponding to time points measured in (A). Stage 5 can be subdivided into stage 5(1) of incipient cellularization , stage 5(2) of mid-cellularization, and stage 5(3) of recently completed cellularization. These stages can be easily measured by following the leading edge cellularization (yellow arrows). For the purposes of this study activities were compared at stage 5(2), when cellularization is 50% complete. (847 KB TIF) Click here for additional data file.(847K, tif) Figure S2: Identification of an NEE in the Drosophila sog Locus Endogenous expression patterns of the sog loci (E) in D. melanogaster (A) and D. virilis (C) are recapitulated by an upstream NEE-like sequence found at both loci (B, D). A previously described but poorly conserved lateral stripe enhancer (LSE) in the D. melanogaster sog intron drives a slightly narrower stripe of expression relative to the upstream NEE. In situ hybridization experiments for two other adjacent genes (CG8117 and CG8119) failed to show lateral stripes of expression in D. melanogaster and are not as well conserved in D. virilis (unpublished data). (1.89 MB TIF) Click here for additional data file.(1.8M, tif) Figure S3: Evolution of the Upstream Neuroectodermal Enhancer of the Drosophila sog Locus (A) Comparison of a 2-kb window around the upstream NEE and NEE-like sequences from D. melanogaster (y-axis) and D. virilis (x-axis) reveals minimal sequence homology in a standard dot plot or graph matrix. Both sequences are functional in D. melanogaster embryos. (B) More extensive blocks of identity are evident at the Su(H)/Dorsal motif, and at the linked Dorsal and TA-core E-box (5′-CATATG) motifs (red, blue, and green dotted lines, respectively, in (A)). (C) Conservation of elements in upstream sog NEE-like sequence. A separate pair of linked Dorsal and Twist (5′-CACATGT) binding motifs is found at the D. melanogaster sequence (blue and green boxes, respectively) (C). (471 KB TIF) Click here for additional data file.(471K, tif) Figure S4: Dorsal Peptide Evolution in Divergent Drosophilids Multiple amino acid substitutions (red letters) have occurred in the Dorsal peptide sequence in each of the three lineages examined in this study: D. melanogaster (Dm), D. pseudoobscura (Dp), and D. virilis (Dv). A few of these changes have occurred in the DNA-binding RHD (underlined sequence). A few of these substitutions occur in positions that are known to either diminish (green “class I” mutation) or augment (red “class II” mutation) activation in Dorsal–Twist synergistic activation assays [52]. Others positions in the RHD have had no effects in such assays when mutated (black mutations) or have not been mutated (“?”). Mutation numbers (M7, M9, M21, and M23) refer to their original description [52]. (1.06 MB TIF) Click here for additional data file.(1.0M, tif) Figure S5: Dorsal Nuclear Concentration Gradient in Divergent Drosophilids Dorsal antibody staining (A, D, and G) and DAPI (4',6-diamidino-2-phenylindole) counter-staining (B, E, and H) in D. melanogaster (A, B, and C), D. pseudoobscura (D, E, and F), and D. virilis (G–I) stage 5 cellularizing embryos. Dissected cross-sections in D. melanogaster (C), D. pseudoobscura (F), and D. virilis (J) embryos show broader D. pseudoobscura (F) and narrower D. virilis (J) nuclear Dorsal concentration gradients than in D. melanogaster (C). (1.40 MB TIF) Click here for additional data file.(1.3M, tif) Figure S6: Co-Expression of Dorsal and Twist Proteins in D. melanogaster and D. virilis Embryos Double-staining with Dorsal (A, C, E, and I) and Twist (B, D, F, J) antibodies in stage 5 cellularizing D. melanogaster (A, B, E–H) and D. virilis (C, D, I–L) embryos. (A–D) Lateral views of double-stained embryos are depicted. (E–G, I–K) Ventro-lateral views of double-stained embryos are depicted. (G, K) Merged Dorsal (red) and Twist (green) signals are co-expressed in the yellow nuclei. (H, L) Bright field views showing cellularizing embryos at comparable stages in the two species. (3.77 MB TIF) Click here for additional data file.(3.6M, tif) Figure S7: Cytoplasmic and Nuclear Concentrations of Dorsal Antigen (Close-up of ventral (A, C, and E) and dorsal (B, D, and F) nuclei of early stage 5 D. melanogaster (A, B), D. pseudoobscura (C, D), and D. virilis (E, F) embryos showing transition of signal intensity from the nucleus (thick arrow, ventral surface) to the cytoplasm (thin arrow, dorsal surface). (2.03 MB TIF) Click here for additional data file.(1.9M, tif) Figure S8: Model for Lineage-Specific Evolution of Morphogen Threshold Readouts (A) Nuclear concentration gradients for Dorsal for D. melanogaster and D. pseudoobscura embryos. (B) Nuclear concentration gradients for Dorsal for D. melanogaster and D. virilis embryos. Lineage-specific changes in the absolute levels of the free Dorsal and Twist nuclear concentration gradients (y-axis; solid and dotted curves) necessitated compensatory adaptations at the level of NEE specificity for lineage-specific thresholds (θ) below which NEE activity ceases along the D/V axis (x-axis). D. pseudoobscura NEEs are adapted to a higher concentration of Dorsal morphogen (θDp) in the smaller D. pseudoobscura embryos, such that when they are each tested in transgenic D. melanogaster embryos, they produce narrower stripes of expression (small red arrow) than D. melanogaster genes, which are adapted to θDm. Conversely, we propose that D. virilis NEEs are adapted to a lower concentration thresholds for Dorsal and Twist morphogens θDv as a result of changes in the Dorsal gradient as well as potential genome Twist sequestration effects, such that when they are each tested in transgenic D. melanogaster embryos, they produce broader stripes of expression (large red arrow) than D. melanogaster genes. (552 KB TIF) Click here for additional data file.(552K, tif) Table S1: List of Enhancer Fragments Used in This Study Binding sites are underlined and in capital letters using the color scheme shown in Figure 2 (36 KB DOC) Click here for additional data file.(36K, doc) Acknowledgments The authors thank M. McPeek, T. Jack, and A. Kern for comments on the manuscript, and A. Heimberg and K. Peterson for helpful discussions. We thank A. Lavanway for help with confocal microscopy. Rabbit anti-Dorsal and anti-Twist antibodies were provided by M. Biggin. Guinea pig anti-Dorsal was provided by M. Levine. Abbreviations
Footnotes ¤ Current address: Department of Biological Sciences, Florida State University, Tallahassee, Florida, United States of America Author contributions. AE conceived and directed the work. JC conducted the whole-genome bioinformatic analyses, constructed all of the transgenic animals, and performed in situ hybridization assays. AE and JC analyzed all the data. YT assisted with characterization of the sog locus. AE wrote the paper. All authors commented on the manuscript. Funding. The authors received no specific funding for this study. Competing interests. The authors have declared that no competing interests exist. References
|
PubMed related articles
Your browsing activity is empty. Activity recording is turned off. |
|||||||||||||||||||||||||||||||||||||||||||||||||
Development. 1997 May; 124(10):1851-64.
[Development. 1997]EMBO J. 1992 Nov; 11(11):4047-57.
[EMBO J. 1992]Genes Dev. 1991 May; 5(5):827-39.
[Genes Dev. 1991]Dev Cell. 2003 Apr; 4(4):509-19.
[Dev Cell. 2003]Q Rev Biol. 1971 Jun; 46(2):111-38.
[Q Rev Biol. 1971]Science. 1992 Sep 18; 257(5077):1682-5.
[Science. 1992]Genes Dev. 1996 Jan 15; 10(2):185-95.
[Genes Dev. 1996]Nature. 2005 Feb 3; 433(7025):481-7.
[Nature. 2005]Nature. 2006 Apr 20; 440(7087):1050-3.
[Nature. 2006]Development. 1998 Mar; 125(5):949-58.
[Development. 1998]Nature. 2007 Nov 8; 450(7167):203-18.
[Nature. 2007]PLoS Biol. 2007 Nov 6; 5(11):e310.
[PLoS Biol. 2007]Proc Natl Acad Sci U S A. 2005 Dec 20; 102(51):18403-7.
[Proc Natl Acad Sci U S A. 2005]Trends Genet. 2005 Jun; 21(6):346-55.
[Trends Genet. 2005]Mech Dev. 2003 Aug; 120(8):897-907.
[Mech Dev. 2003]Proc Natl Acad Sci U S A. 2004 Mar 16; 101(11):3851-6.
[Proc Natl Acad Sci U S A. 2004]Proc Natl Acad Sci U S A. 2004 Mar 16; 101(11):3851-6.
[Proc Natl Acad Sci U S A. 2004]Genes Dev. 1994 Aug 1; 8(15):1829-38.
[Genes Dev. 1994]Genes Dev. 1992 Sep; 6(9):1728-39.
[Genes Dev. 1992]Cell. 1993 Mar 12; 72(5):741-52.
[Cell. 1993]Gene. 2002 Oct 16; 299(1-2):173-84.
[Gene. 2002]Development. 1998 Mar; 125(5):949-58.
[Development. 1998]PLoS Comput Biol. 2006 Oct; 2(10):e130.
[PLoS Comput Biol. 2006]Proc Natl Acad Sci U S A. 2002 Jan 22; 99(2):763-8.
[Proc Natl Acad Sci U S A. 2002]Dev Biol. 2003 Oct 15; 262(2):335-49.
[Dev Biol. 2003]Dev Cell. 2004 Aug; 7(2):229-40.
[Dev Cell. 2004]Proc Natl Acad Sci U S A. 2005 Apr 5; 102(14):4966-71.
[Proc Natl Acad Sci U S A. 2005]Dev Biol. 2003 Oct 15; 262(2):335-49.
[Dev Biol. 2003]Dev Cell. 2004 Aug; 7(2):229-40.
[Dev Cell. 2004]Mol Cell Biol. 2002 Jul; 22(14):5089-99.
[Mol Cell Biol. 2002]Nature. 2007 Nov 8; 450(7167):203-18.
[Nature. 2007]Mol Biol Evol. 2000 Nov; 17(11):1641-6.
[Mol Biol Evol. 2000]J Biol Chem. 2007 Jul 20; 282(29):20868-76.
[J Biol Chem. 2007]Genes Dev. 1991 Oct; 5(10):1881-91.
[Genes Dev. 1991]Cell. 1993 Mar 12; 72(5):741-52.
[Cell. 1993]Genes Dev. 1992 Aug; 6(8):1518-30.
[Genes Dev. 1992]Cell. 2007 Jul 13; 130(1):153-64.
[Cell. 2007]Nature. 2007 Jul 26; 448(7152):420-1.
[Nature. 2007]Proc Natl Acad Sci U S A. 2005 Apr 5; 102(14):4966-71.
[Proc Natl Acad Sci U S A. 2005]PLoS Biol. 2004 Dec; 2(12):e398.
[PLoS Biol. 2004]Nature. 2002 Feb 21; 415(6874):914-7.
[Nature. 2002]Dev Biol. 1998 Feb 15; 194(2):213-25.
[Dev Biol. 1998]Science. 2004 Aug 6; 305(5685):846.
[Science. 2004]EMBO J. 1992 Nov; 11(11):4047-57.
[EMBO J. 1992]Mol Cell Biol. 2002 Jul; 22(14):5089-99.
[Mol Cell Biol. 2002]