![]() | ![]() |
Formats:
|
||||||||||||||||||||||||||||||||||||||||||||||||||||
Copyright : © 2005 Vermaak et al. This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. Positive Selection Drives the Evolution of rhino, a Member of the Heterochromatin Protein 1 Family in Drosophila 1 Fred Hutchinson Cancer Research Center, Seattle, Washington, United States of America 2 Howard Hughes Medical Institute, Basic Sciences Division, Fred Hutchinson Cancer Research Center, Seattle, Washington, United States of America Andy G Clark, Editor Cornell University, United States of America *To whom correspondence should be addressed. E-mail: hsmalik/at/fhcrc.org Received February 7, 2005; Accepted May 13, 2005. This article has been cited by other articles in PMC.Abstract Heterochromatin comprises a significant component of many eukaryotic genomes. In comparison to euchromatin, heterochromatin is gene poor, transposon rich, and late replicating. It serves many important biological roles, from gene silencing to accurate chromosome segregation, yet little is known about the evolutionary constraints that shape heterochromatin. A complementary approach to the traditional one of directly studying heterochromatic DNA sequence is to study the evolution of proteins that bind and define heterochromatin. One of the best markers for heterochromatin is the heterochromatin protein 1 (HP1), which is an essential, nonhistone chromosomal protein. Here we investigate the molecular evolution of five HP1 paralogs present in Drosophila melanogaster. Three of these paralogs have ubiquitous expression patterns in adult Drosophila tissues, whereas HP1D/rhino and HP1E are expressed predominantly in ovaries and testes respectively. The HP1 paralogs also have distinct localization preferences in Drosophila cells. Thus, Rhino localizes to the heterochromatic compartment in Drosophila tissue culture cells, but in a pattern distinct from HP1A and lysine-9 dimethylated H3. Using molecular evolution and population genetic analyses, we find that rhino has been subject to positive selection in all three domains of the protein: the N-terminal chromo domain, the C-terminal chromo-shadow domain, and the hinge region that connects these two modules. Maximum likelihood analysis of rhino sequences from 20 species of Drosophila reveals that a small number of residues of the chromo and shadow domains have been subject to repeated positive selection. The rapid and positive selection of rhino is highly unusual for a gene encoding a chromosomal protein and suggests that rhino is involved in a genetic conflict that affects the germline, belying the notion that heterochromatin is simply a passive recipient of “junk DNA” in eukaryotic genomes. Synopsis Eukaryotic genomes are organized into good and bad neighborhoods. In fruit fly genomes, most genes are found in euchromatin—good neighborhoods that tend to be amenable to gene expression and deficient in selfish mobile elements. Conversely, heterochromatic regions are deficient in genes but chock full of mobile genetic elements, both dead and alive. Cells expend considerable effort to maintain this organization, to prevent bad neighborhoods from exerting their negative influence on the rest of the genome. At the forefront of this organization are the HP1 proteins, which are involved in the compaction and silencing of heterochromatic sequences. First discovered in Drosophila, HP1 proteins have been subsequently found in virtually all fungi, plants, and animals. Most HP1 proteins evolve under stringent evolutionary pressures, suggesting that they lack any discriminatory power in their action. However, a recent paper by Vermaak finds that one of the five HP1 encoding genes in Drosophila genomes, rhino, bucks the trend and evolves rapidly. rhino is predominantly expressed in ovaries, which is where many mobile elements are also active. Their results suggest that rhino has been constantly evolving to police a particularly dynamic, novel compartment in heterochromatin with exquisite specificity. Thus, instead of a genomic wasteyard that genes shun and where transposons go to die, heterochromatin now appears to have been shaped by a constant struggle for evolutionary dominance. Introduction Repetitive DNA sequences can constitute large parts of many genomes (approximately 30% in human and fly genomes) and are involved in fundamental cellular processes [1–3]. For example, centromeres in higher eukaryotes consist of large, repetitive regions required for accurate chromosome segregation during each cell division [4]. Heterochromatin flanks the centromere and is also essential for segregation [5–7]. It is composed largely of repetitive DNA and transposable elements and their relics, but can contain genes important for fertility and viability [8,9]. Transcriptionally silent heterochromatin can influence the expression of not only mobile elements embedded in heterochromatin, but also euchromatic genes [6,10–12]. Given the importance of heterochromatin, it is not surprising that perturbation of heterochromatic proteins is associated with cancer and other diseases [13,14]. The study of repetitive heterochromatic DNA lags far behind that of euchromatic regions because heterochromatin is hard to sequence and manipulate experimentally. Even when DNA sequence is available, the underlying evolutionary forces that shape patterns of rapidly changing repetitive sequences and chromosomal architecture are hard to discern. A complementary approach is to study the evolution of protein components that associate with repetitive DNA instead of studying the DNA directly. These protein components have been well studied, especially in Drosophila genomes [15–18]. Using a similar strategy, the discovery of positive selection acting on the proteins that bind centromeric DNA has led to the centromere-drive hypothesis that may account for the sequence complexity of centromeres [19–21]. Here, we examine the evolutionary pressures that shape proteins that bind heterochromatic DNA. Heterochromatin protein 1 (HP1) is a ubiquitous component of heterochromatin that is the best available surrogate to study heterochromatin complexity. HP1 was first identified in flies [18,22] and is present in most eukaryotes where it is required for maintenance of most aspects of the heterochromatic state [6,10,11,23]. HP1 consists of a N-terminal chromo domain, a hinge region, and a C-terminal chromo shadow (or simply “shadow”) domain that structurally resembles the chromo domain and mediates homodimerization [16,22,24–26]. The chromo domain binds to histone H3 tails methylated at lysine 9 (H3K9me), a covalent modification associated with heterochromatin maintenance and transcriptional silencing [10,11,27,28] and can directly influence the targeting of HP1 in vivo [29]. Multiple HP1-like genes, which may have different functions, can be found in the same genome. In vertebrates, for example, there are at least three HP1-like genes (HP1α, HP1β, and HP1γ) that each encode proteins with distinct localization patterns, despite being about 65% identical [22,30–33]. Drosophila melanogaster contains five genes with HP1-like domain organization. We undertook a molecular evolutionary study of these HP1 paralogs in Drosophila, aiming to use them as a surrogate for studying heterochromatic DNA evolution. HP1A (or Su[Var]205) was the first of these to be identified. This HP1A gene encodes the prototypic HP1 protein required for heterochromatin maintenance [18,34]. The functions of the other four HP1 proteins are unknown. However, HP1B and HP1C differ from HP1A in their chromatin localization [35], suggesting that their function is not redundant with HP1A. The fourth HP1-like protein, HP1D/Rhino (hereafter referred to as “Rhino”), was discovered in a screen for female sterile mutants [36] whereas we identified the fifth, HP1E, using bioinformatic criteria in this study. rhino mutants display a variety of late-stage eggshell defects, among them the fused dorsal appendages for which the gene was named [36]. Careful characterization of mutant egg chambers revealed several defects [36]. First, nurse cells failed to undergo a higher-order chromatin structure reorganization from a “five-blob” state to a dispersed state at stage 5. Second, although transcript levels of several patterning genes were unaffected, transcripts of key patterning genes such as gurken and oskar were mislocalized. Furthermore, Gurken protein synthesis was delayed in early egg chambers and germaria, and Gurken protein showed aberrant accumulation in later egg chambers [36]. Unlike other HP1 proteins, Rhino is expressed predominantly during oogenesis [36]. Its unusual expression pattern suggested that the evolutionary constraints on rhino might more accurately reflect pressures on heterochromatin in the female germline, relatively free from constraints imposed during somatic expression. In this report, we show that tagged Rhino protein localizes to distinct foci within the heterochromatic domain of tissue culture cells. Remarkably, we find that all three domains of Rhino show strong evidence of recurrent positive selection. Such positive selection implies that rhino is involved in a heritable and recurrent genetic conflict, raising the intriguing possibility that heterochromatin itself might represent a paleontological record of this genetic conflict. Results HP1 Paralogs in Drosophila Genomes D. melanogaster contains five HP1-like genes, defined as such because they all encode an N-terminal chromo domain and a C-terminal shadow domain (Figure 1
Among the HP1 paralogs, the HP1D/rhino gene appears to be particularly rapidly evolving. In phylogenetic analyses, both the rhino chromo and shadow domains appear to have evolved far more rapidly (Figure 1 rhino Is Expressed Predominantly in Ovaries Previous Northern blot analysis had detected a 1.6 kb rhino mRNA in female flies, early embryos, and ovary, but not in male flies and rhino mutants [36]. In situ analysis showed that the rhino transcript was present both within the germline and somatic cells of the ovary [36]. However, an abundant and much larger band on the Northern blot did not show the same restricted expression pattern. This band was also present in RNA made from rhi2 mutant flies suggesting that it did not contain rhino transcript. In order to further delineate the expression pattern of this unusual HP1 gene, we used RT-PCR to assess the presence of rhino mRNA in male or female flies and in different tissues, because it provides a more sensitive assay that complements the previous Northern analysis (Figure 2
We confirmed the predominant expression of rhino in D. melanogaster ovaries, although low levels of transcript could also be detected in testis, head, and faintly in carcass, likely below detection limits for Northern analysis (Figure 2 Rhino Localization in D. melanogaster Cells The localization of protein products of three HP1 genes have been tested so far in Drosophila tissue culture cells. Only HP1A was found to localize predominantly to heterochromatin, whereas HP1C localized to euchromatin and HP1B to both euchromatin and heterochromatin [35]. Therefore, we decided to first study the localization pattern of Rhino to determine whether it localized to heterochromatin. Drosophila S2 interphase cells have a DAPI-dense staining area that helps demarcate cytological boundaries of heterochromatin, although it is worth noting that DAPI does not stain all heterochromatic DNA, owing to sequence-dependent DNA-binding preference [38]. H3K4me is an excellent cytological marker for euchromatin, whereas H3K9me marks heterochromatin [10,11]. The localization patterns of green fluorescent protein (GFP) fused to HP1A, HP1B, or HP1C and expressed in tissue culture cells were previously shown to be faithful representations of the localization of the endogenous proteins by antibody staining [35]. We therefore expressed rhino as a C-terminal GFP fusion protein in Drosophila S2 cells, followed by immunostaining with antibodies to HP1A, HP1B, HP1C, or specific modifications of histone H3 (Figure 3
The localization pattern of Rhino-GFP differed from that of HP1A, -B, and -C in interphase tissue culture cells (Figure 3 Molecular Evolution of rhino: Positive Selection of the Hinge and Chromo Shadow Domains The indication that rhino may be a rapidly evolving HP1 (see Figure 1 Rapid evolution of HP1s may be attributed to relaxed constraint, allowing sequence changes to accumulate, especially if different gene copies are functionally redundant. Alternatively, amino acid replacement changes may confer a selective advantage, in which case they would be expected to accumulate at a rate higher than expected under neutral evolution (positive selection). To evaluate whether any of the HP1s are undergoing such positive selection between the closely related D. melanogaster and D. simulans species, we performed a 100-bp sliding window analysis of the number of replacement changes per site (dN) compared to the number of synonymous changes per site (dS) (Figure 4
Sliding window dN/dS analyses suggest that rhino is subject to positive selection. To follow up on this initial observation, we undertook a more detailed study in D. melanogaster and D. simulans. We used PCR to obtain rhino sequence from 17 strains of D. melanogaster and 11 strains of D. simulans. DNA sequence changes were categorized as replacement (R) or synonymous (S) (Table S1). Changes were further classified as either fixed between species (f) or polymorphic within species (p) (Table S1). Under a neutral evolutionary model, the ratio of replacement to synonymous changes that have been fixed between species (Rf:Sf) is expected to be roughly the same as the ratio for polymorphic changes (Rp:Sp) (McDonald-Kreitman test) [39]. We did not find a significant deviation from neutrality when the entire rhino sequence was considered (Table 1, entire coding region, p = 0.13). However, a sliding window analysis clearly showed that the observed fixed replacement changes far exceeded those expected under neutral evolution in the C terminal part of the protein (Figure 5
rhino Evolution in Other Drosophila Species Is the positive selection of rhino limited to the melanogaster species group? To address this question, we identified D. pseudoobscura rhino by synteny with D. melanogaster; rhino is contained within an intron of another gene in both species. We used RT-PCR to confirm the predicted splice sites for rhino from the obscura species group. D. pseudoobscura rhino is very different in length (317 vs. 418 encoded amino acids) and sequence from D. melanogaster rhino (see Figure 1
We find that rhino is evolving at an unprecedented rate for an HP1. The hinge regions cannot be unambiguously aligned between different species groups or in some cases not even within the same species group. We did not detect any significant similarity of the Rhino hinge regions to other proteins or motifs, yet all the hinge regions share certain sequence features, most noticeably long runs of serines as well as proline- and glutamine-rich sequences (Figure 6 Positive Selection of rhino Chromo Domain Phylogenetic analyses (see Figure 1 We used a DNA sequence alignment of the rhino gene corresponding to the encoded chromo domain from different Drosophila species. The corresponding amino acid sequence alignment is shown in Figure 7
We have already shown that the shadow domain is under strong positive selection between D. melanogaster and D. simulans. To find out if some codons of the shadow domain have also been under continuous positive selection, we carried out a PAML analysis. A tree based on the shadow domain amino acid alignment also recapitulates Drosophila phylogeny (Figure 7 PAML analyses like these are very useful to highlight codons that have been repeatedly subject to positive selection [43,50]; however, they do run a risk of false positives. This is somewhat ameliorated in our dataset because the tree lengths are of moderate value (Table 2). Similar tree lengths have been shown by simulations to have a significantly lower risk of false positives [51]. Nonetheless, the true test for the significance of these positively selected residues will come from functional assays on Rhino function and localization. Discussion In this paper, we have undertaken an evolutionary study of HP1-like proteins, with the ultimate aim of discerning the selective pressures that act on heterochromatin. We have found that Rhino, the only HP1 paralog that is expressed predominantly in ovaries, encodes a protein that has a unique localization pattern in S2 cells. Although it is excluded from the euchromatic compartment, the Rhino protein does not overlap with HP1A or H3K9me. This immediately suggests that H3K9me or HP1A does not mark all Drosophila heterochromatin, and that Rhino has a uniquely different specificity for a previously unappreciated compartment in heterochromatin. It has not been easy to discern the molecular function of rhino from mutant phenotypes in eggshell defects. Possibilities range from a role for Rhino in gross chromatin structural changes, to transcriptional or translational regulation and even microtubule organization in the oocyte [36]. Despite our current lack of knowledge about the molecular function of rhino, the fact that mutations are female sterile, point to its importance to proper oogenesis [36]. HP1A, which is far better understood, is an essential gene. Such chromosomal proteins serving crucial functions are expected to be under strong evolutionary constraints and purifying selection. Although this is true for four of the five D. melanogaster HP1s including HP1A (see Figure 4 Can co-evolutionary pressures explain the positive selection acting on rhino? For instance, rhino might be continually “catching up” to mutations in interacting proteins required for its function. We believe this is unlikely, because mutations that compromise a required interaction are likely to be culled out of the population by purifying selection, long before a chance compensatory mutation in rhino can occur. A second possibility is that the positive selection of rhino may be driven by changes in the regulation of key genes between two species. Although we cannot formally rule out such a scenario, it appears unlikely to explain the relatively constant positive selection that we have seen for approximately 25 million years of Drosophila evolution. Positive selection need not involve rhino's “normal” function, whatever that may be, but rather underlie a second and unrelated “defense” function of rhino. In such a scenario the positive selection on rhino would be driven by a recurrent intracellular conflict that yields a selection advantage to the “winner.” Genes encoding proteins involved in direct host–parasite interactions are often subject to positive selection. In this case, changes that are beneficial for the parasite (to evade interactions for instance) will be followed by selection favoring changes in the host proteins (that restore interactions). Thus, two antagonistic entities locked in genetic conflict face repeated episodes of positive selection, only to arrive at the same quasi-steady state, a scenario formalized as the “Red Queen” hypothesis [52]. rhino may be subject to the same kind of genetic conflict that occurs intracellularly. It is especially intriguing that the only HP1 we have found to be subject to positive selection is expressed predominantly in ovaries ([36] and Figure 2 Under the first model, rhino participates in suppressing “selfish” behavior of centromeres, which can compete to maximize their transmission advantage in female meiosis, where only one of four meiotic products is destined to become the egg [21]. We have previously proposed that this kind of drive can have deleterious consequences for male meiosis and is likely to be suppressed either by centromeric proteins altering their DNA-binding specificity [19,20] or by heterochromatin proteins evolving to limit centromere boundaries, and thereby limiting “strength” [4,21,53]. Similar selective pressures have been previously proposed to result in deleterious mutations in the nod chromokinesin in D. melanogaster [54]. rhino may represent another repressor of the drive by directly or indirectly influencing centromere strength. A second model is that positive selection on rhino is a direct result of genetic conflict between rhino and mobile genetic elements. Although we have no evidence to support this hypothesis, it is attractive for several reasons. Transposable elements can evolve rapidly and differ significantly between Drosophila species, including D. melanogaster and D. simulans [55,56]. Rhino-GFP localizes to the heterochromatic region of the nucleus (see Figure 3 Whatever is driving the positive selection of rhino, mutations in any of Rhino's three domains appear to be selected to give rhino the upper hand in the current round of competition. The chromo and related shadow domains are very versatile interaction domains that can influence binding to DNA, RNA, and proteins [63]. The hinge domain can also strongly influence localization of HP1-like proteins [64,65]. Future experiments will address the functional role of the three amino acids under recurrent positive selection in the chromo and shadow domains (Figure 7 Our results complement previous findings that other proteins that bind heterochromatin appear to be among the most rapidly evolving proteins in an unbiased screen in Drosophila [67–68], although this does not appear to be the result of positive selection [69]. Polymorphisms in heterochromatin-binding proteins can have direct effects on non-disjunction frequencies [54,70,71]. Similarly, although HP1A, -B, and -C appear to be conserved and evolving under purifying selection, HP1 evolution (in both sequence and gene copy number; see Figure 1 Materials and Methods Sequences from Drosophila and databases and RT-PCR. Drosophila species and strains (Table S1) were obtained from the Drosophila stock center (currently in Tucson, Arizona) and genomic DNA was prepared by standard methods [19]. The rhino locus was amplified using PCR Supermix High Fidelity (Invitrogen, Carlsbad, California, United States) with the primers indicated in Table S2. PCR products were either sequenced directly or following Topo-TA cloning (Invitrogen). RNA was prepared from whole male or female flies or different tissues (head, ovary, testis, or carcass) using a kit (Qiagen RNeasy; Qiagen, Valencia, California, United States) and cleared of genomic DNA by DNase I digestion (Ambion DNA-free; Ambion, Austin, Texas, United States). RNA concentrations were measured from various tissues, and the same amount of total RNA was used as template in the RT-PCR analysis. RT-PCR (Invitrogen) to evaluate the presence of rhino mRNA was carried out using Dmid1f and Dmid2b primers (Table S2) that span the rhino intron, along with actin-42A primers [72] as a control. For D. bipectinata, primers dv15 and dv230 that span the rhino intron were used. RT-PCR and sequencing was carried out to confirm the predicted splice-site positions for rhino from D. simulans (strain 2), D. bipectinata, and D. miranda. Splice sites for rhino from other species were predicted using Berkeley Drosophila Genome Project Splice site predictor (http://www.fruitfly.org/seq_tools/splice.html). All sequences have been deposited in Genbank (accession numbers AY944308–AY944358, Table S2). Sequence analysis. Sequences were assembled using DNA Strider [73]. Clustal_X [74] was used to obtain pairwise or multiple alignments and to generate formatted files for further analysis. Pairwise sequence alignments used for dN/dS analysis were hand edited, using the amino acid sequence as a guide to place indels. For instance, there is an 80 amino acid length difference between the D. melanogaster and D. simulans hinge regions. These regions cannot be compared in tests for positive selection. Pairwise dN and dS comparisons and confidence values were calculated using the K-estimator software [75,77]. Sliding window size was arbitrarily chosen as 100 bases with 35 base steps for all pairwise dN/dS comparisons. Confidence interval estimates were calculated using Monte Carlo simulations, taking into account (1) dN and dS values, (2) the number of codons, (3) transition: transversion ratio, and (4) GC content and amino acid composition. Thus, K-estimator [75] at least takes into account most of the confounding variables that are known to give false positives in terms of dN/dS. We also present a dN/dS analysis using the reconstructed hypothetical ancestors to all the D. melanogaster and D. simulans rhino sequences (Figure S2). The DNASP software package [77] was used to perform several tests for positive selection using genomic sequence of rhino from 17 strains of D. melanogaster and 11 strains of D. simulans. The Fu and Li [40], Tajima's D [41], and Hudson-Kreitman-Aguade [42] tests were carried out on the complete sequence, including the intron, whereas the McDonald-Kreitman test [39] was carried out on coding regions only (1,209 total positions with indels removed). Fixed replacement changes in the chromo and chromo shadow domains were polarized using D. yakuba and D. teissieri sequences as outgroups, but we could not unambiguously polarize all changes in the hinge region. The expected fixed replacement changes (Rfexpected) shown in Figure 4 Neighbor-joining phylogenetic trees were constructed using the PAUP software, version 4.0b10 [79] and appropriate Clustal_X multiple alignments of either the chromo or chromo shadow domains. A total of 1,000 replicates were carried out for bootstrapping. Maximum likelihood analysis was performed with the PAML software package [43] in separate analyses for multiple alignments of the chromo domain and the shadow domains (the rapid evolution of the hinge in both size and sequence precluded its comparison in such a multiple alignment). Codons that were repeatedly subject to positive selection were identified using N sites models (M1, M7) that do not permit positive selection compared to models (M2, M8) that permit sites to evolve under positive selection. The strength of positive selection was calculated by comparing twice the log likelihood difference (M2 vs. M1, M8 vs. M7) in a chi-square test with two degrees of freedom. Codons that were identified as having evolved under positive selection with high posterior probabilities (p > 0.95) were highlighted on a three-dimensional structure of the respective domains and visualized using the Cn3D software (version 4.0) [80]. Plasmid constructs. A plasmid for expressing rhino as a C-terminal GFP fusion protein under control of the hsp70 heat shock promoter (HSRhiGFP) was constructed as follows: rhino coding sequence flanked by XbaI and NotI restriction enzyme sites was amplified by RT-PCR from D. melanogaster (Canton S) using primers KcRhiF and KcRhiB (Table S1). The PCR product was digested and cloned into a modified heat shock expression plasmid [81] that had been digested with XbaI and EagI and phosphatase treated to yield the rhino open reading frame followed by a six amino acid linker and GFP. Correct cloning was verified by sequencing. An N-terminal fusion protein of a biotin recognition peptide (MAGGLNDIFEAQKIEWHEDTGGS) to rhino (BLRPRhi) was constructed as follows: Primers dv99 and dv100 were used to amplify rhino coding sequence with flanking NotI and BamHI restriction enzyme sites from the HSRhiGFP plasmid. The PCR fragment was TA cloned and the sequence verified before digestion of the TA clone and subcloning of the gel-isolated fragment into a BLRP expression vector with a metallotheionine promoter [82,83]. A plasmid (pBirA) expressing the Escherichia coli biotin ligase enzyme (BirA) from a metallotheionine promoter was a gift from Takehito Furuyama. Cell culture, transfection, and immunostaining. S2 cells (Invitrogen, D-mel2) were maintained in serum-free insect media (Invitrogen) supplemented with 90 ml/l of 200 mM L-Glutamine (Sigma, St. Louis, Missouri, United States). Twenty micrograms of the HSRhiGFP plasmid was transfected as previously described [81]. Cells were heat shocked for 1 h on the next day and allowed to recover for 2 h before immunostaining [84]. In the case of the BLRPrhino construct, 10 μg of plasmid DNA were co-transfected with 10 μg of pBirA plasmid that contains the biotin ligase under control of a metallotheionine promoter. After overnight incubation, cells were induced for 3 h with 500 μM CuSO4, added directly to the media, followed by immunostaining. HP1A, HP1B, and HP1C antibodies have been previously described [35]. Antibodies to H3K9me or H3K4me were purchased from Upstate Biotech (Waltham, Massachusetts, United States). Monoclonal mouse anti-Fibrillarin antibody was purchased from Encor Biotechnology Inc (Alachua, Florida, United States). All antibodies, including the secondary Texas-red fluorescently labeled goat anti-rabbit or anti-mouse antibodies (Amersham, Piscataway, New Jersey, United States), were used at a dilution of 1/200, with the exception of the anti-fibrillarin antibody that was used at 1/500. Images of nuclei were obtained and de-convolved using the Deltavision software (Applied Precision, Issaquah, Washington, United States). Figure S1: Rhino-GFP Localization in Drosophila S2 Cells These additional images of Rhino-GFP show a localization pattern that is distinct from HP1A, H3K4me, and H3K9me. In addition, an N-terminal biotinylated-tagged Rhino protein shows the same localization pattern as that of the C-terminal GFP-tagged Rhino protein. (5.2 MB PDF) Click here for additional data file.(5.1M, pdf) Figure S2: A Sliding Window dN/ dS Analysis Only those changes that were found to have been fixed differences between D. melanogaster and D. simulans were used. All intraspecific polymorphisms were eliminated for this analysis. Compared to Figure 4 (203 KB PDF) Click here for additional data file.(204K, pdf) Table S1: All Polymorphisms within the Coding Region of the rhino Gene in D. melanogaster and D. simulans Are Shown Changes are highlighted as being either fixed (f) between species or polymorphic (p) within species, as replacement (R) or synonymous (s) changes. Fixed changes were polarized using an outgroup species to changes along either the D. melanogaster (m) or D. simulans (s) lineages. Many changes could not be unambiguously polarized. (34 KB DOC) Click here for additional data file.(35K, doc) Table S2: List of Primers Used and Accession Numbers of Sequences Obtained in This Study (36 KB XLS) Click here for additional data file.(37K, xls) Accession Numbers The Flybase (http://flybase.bio.indiana.edu) accession numbers of the genes discussed in this paper are rhino (CG10683) and HP1E (CG8120). New sequences obtained during the course of this study have been deposited in Genbank under the accession numbers AY944308–AY944358. The Molecular Modeling Database (MMDB; http://www.ncbi.nlm.nih.gov/Structure/MMDB/mmdb.shtml) accession numbers of the proteins discussed in this paper are H3K9me (19011, PDB 1KNE) and HP1β shadow domain dimer (13286, PDB 1DZ1). Acknowledgments We thank the Drosophila stock center (Tucson, Arizona) for various Drosophila species stocks and Judy O'Brien for maintenance of Drosophila stocks. We are grateful to Kami Ahmad and Celeste Berg for useful discussions throughout this project, and Jiro Yasuhara, Barbara Wakimoto, Sara Sawyer, and Julie Kerns for comments on the manuscript. We also gratefully acknowledge Terri Bryson for help with maintenance of Drosophila tissue culture cells, and Takehito Furuyama for the BLRP and pBirA plasmids. This work was supported by a Damon Runyon Cancer Postdoctoral Fellowship (DV), by the Howard Hughes Medical Institute (SH), startup funds from the Fred Hutchinson Cancer Research Center, and a Scholar Award from the Sidney Kimmel Foundation (HSM). HSM is an Alfred P. Sloan Fellow in Computational and Evolutionary Molecular Biology. Abbreviations
Footnotes Competing interests. The authors have declared that no competing interests exist. Author contributions. DV and HSM conceived and designed the experiments. DV performed the experiments. DV and HSM analyzed the data. DV and SH contributed reagents/materials/analysis tools. DV, SH, and HSM wrote the paper. Citation: Vermaak D, Henikoff S, Malik HS (2005) Positive selection drives the evolution of rhino, a member of the heterochromatin protein 1 family in Drosophila. PLoS Genet 1(1): e9. References
|
PubMed related articles
Your browsing activity is empty. Activity recording is turned off. |
|||||||||||||||||||||||||||||||||||||||||||||||||||
Curr Opin Genet Dev. 2004 Dec; 14(6):680-5.
[Curr Opin Genet Dev. 2004]Curr Opin Genet Dev. 2003 Dec; 13(6):651-8.
[Curr Opin Genet Dev. 2003]Curr Opin Genet Dev. 2002 Dec; 12(6):711-8.
[Curr Opin Genet Dev. 2002]Dev Cell. 2001 Aug; 1(2):165-77.
[Dev Cell. 2001]Science. 2001 Dec 21; 294(5551):2539-42.
[Science. 2001]Development. 2003 May; 130(9):1817-24.
[Development. 2003]Genetics. 1992 Jun; 131(2):345-52.
[Genetics. 1992]Genetics. 2001 Mar; 157(3):1293-8.
[Genetics. 2001]Science. 2001 Aug 10; 293(5532):1098-102.
[Science. 2001]Genetics. 1992 Jun; 131(2):345-52.
[Genetics. 1992]Curr Opin Genet Dev. 2000 Apr; 10(2):204-10.
[Curr Opin Genet Dev. 2000]Biol Cell. 2004 Oct; 96(8):631-7.
[Biol Cell. 2004]Cell. 2002 Feb 22; 108(4):489-500.
[Cell. 2002]Science. 2003 Aug 8; 301(5634):798-802.
[Science. 2003]Curr Opin Genet Dev. 2000 Apr; 10(2):204-10.
[Curr Opin Genet Dev. 2000]Chromosoma. 2001 Feb; 109(8):536-44.
[Chromosoma. 2001]Cytogenet Cell Genet. 2000; 90(3-4):279-84.
[Cytogenet Cell Genet. 2000]Genetics. 1992 Jun; 131(2):345-52.
[Genetics. 1992]Mol Cell Biol. 1986 Nov; 6(11):3862-72.
[Mol Cell Biol. 1986]Genetics. 2001 Nov; 159(3):1117-34.
[Genetics. 2001]Genetics. 2001 Nov; 159(3):1117-34.
[Genetics. 2001]Science. 2000 Mar 24; 287(5461):2185-95.
[Science. 2000]Genetics. 2001 Nov; 159(3):1117-34.
[Genetics. 2001]Genetics. 2001 Nov; 159(3):1117-34.
[Genetics. 2001]Mol Cell Biol. 2001 Apr; 21(7):2555-69.
[Mol Cell Biol. 2001]J Biomol Struct Dyn. 1994 Feb; 11(4):849-67.
[J Biomol Struct Dyn. 1994]Cell. 2002 Feb 22; 108(4):489-500.
[Cell. 2002]Science. 2003 Aug 8; 301(5634):798-802.
[Science. 2003]Nature. 1991 Jun 20; 351(6328):652-4.
[Nature. 1991]Genetics. 1993 Mar; 133(3):693-709.
[Genetics. 1993]Genetics. 1989 Nov; 123(3):585-95.
[Genetics. 1989]Genetics. 1993 Mar; 133(3):693-709.
[Genetics. 1993]Genetics. 1987 May; 116(1):153-9.
[Genetics. 1987]Genetics. 1989 Nov; 123(3):585-95.
[Genetics. 1989]Comput Appl Biosci. 1997 Oct; 13(5):555-6.
[Comput Appl Biosci. 1997]Proc Natl Acad Sci U S A. 2002 Feb 5; 99(3):1449-54.
[Proc Natl Acad Sci U S A. 2002]Science. 2002 Mar 15; 295(5562):2080-3.
[Science. 2002]EMBO J. 2000 Apr 3; 19(7):1587-97.
[EMBO J. 2000]Curr Biol. 2000 May 4; 10(9):517-25.
[Curr Biol. 2000]Curr Biol. 2000 Jan 13; 10(1):27-30.
[Curr Biol. 2000]Comput Appl Biosci. 1997 Oct; 13(5):555-6.
[Comput Appl Biosci. 1997]Genetics. 2004 Oct; 168(2):1041-51.
[Genetics. 2004]Mol Biol Evol. 2002 Jun; 19(6):950-8.
[Mol Biol Evol. 2002]Genetics. 2001 Nov; 159(3):1117-34.
[Genetics. 2001]Nature. 1991 Jun 20; 351(6328):652-4.
[Nature. 1991]Comput Appl Biosci. 1997 Oct; 13(5):555-6.
[Comput Appl Biosci. 1997]Genetics. 2001 Nov; 159(3):1117-34.
[Genetics. 2001]Science. 2001 Aug 10; 293(5532):1098-102.
[Science. 2001]Genetics. 2001 Mar; 157(3):1293-8.
[Genetics. 2001]J Biol. 2004; 3(4):18.
[J Biol. 2004]Curr Opin Genet Dev. 2002 Dec; 12(6):711-8.
[Curr Opin Genet Dev. 2002]Trends Biotechnol. 2002 Mar; 20(3):89-92.
[Trends Biotechnol. 2002]Genetics. 1990 Feb; 124(2):339-55.
[Genetics. 1990]Genome Biol. 2002; 3(12):RESEARCH0084.
[Genome Biol. 2002]Chromosoma. 2002 Jul; 111(2):114-25.
[Chromosoma. 2002]Curr Opin Genet Dev. 2004 Dec; 14(6):680-5.
[Curr Opin Genet Dev. 2004]Science. 2003 Aug 8; 301(5634):798-802.
[Science. 2003]Gene. 2001 Sep 5; 275(1):19-29.
[Gene. 2001]Chromosoma. 2005 Feb; 113(7):370-84.
[Chromosoma. 2005]EMBO J. 2003 Jun 16; 22(12):3164-74.
[EMBO J. 2003]Genetica. 2003 Mar; 117(2-3):127-34.
[Genetica. 2003]Nat Cell Biol. 2003 Jan; 5(1):82-4.
[Nat Cell Biol. 2003]Genetics. 1999 Dec; 153(4):1717-29.
[Genetics. 1999]Genetics. 1999 Aug; 152(4):1605-14.
[Genetics. 1999]J Cell Biol. 1995 Nov; 131(4):833-43.
[J Cell Biol. 1995]Genetics. 2001 Mar; 157(3):1293-8.
[Genetics. 2001]Dev Genet. 1990; 11(1):15-26.
[Dev Genet. 1990]Methods Mol Biol. 1994; 25():181-94.
[Methods Mol Biol. 1994]Nucleic Acids Res. 1997 Dec 15; 25(24):4876-82.
[Nucleic Acids Res. 1997]Bioinformatics. 1999 Sep; 15(9):763-4.
[Bioinformatics. 1999]Bioinformatics. 2003 Dec 12; 19(18):2496-7.
[Bioinformatics. 2003]Bioinformatics. 2003 Dec 12; 19(18):2496-7.
[Bioinformatics. 2003]Genetics. 1993 Mar; 133(3):693-709.
[Genetics. 1993]Genetics. 1989 Nov; 123(3):585-95.
[Genetics. 1989]Genetics. 1987 May; 116(1):153-9.
[Genetics. 1987]Nature. 1991 Jun 20; 351(6328):652-4.
[Nature. 1991]Comput Appl Biosci. 1997 Oct; 13(5):555-6.
[Comput Appl Biosci. 1997]Trends Biochem Sci. 2000 Jun; 25(6):300-2.
[Trends Biochem Sci. 2000]Mol Cell Biol. 2002 Nov; 22(21):7553-61.
[Mol Cell Biol. 2002]Proc Natl Acad Sci U S A. 2003 Jun 24; 100(13):7480-5.
[Proc Natl Acad Sci U S A. 2003]Protein Sci. 1999 Apr; 8(4):921-9.
[Protein Sci. 1999]Mol Cell Biol. 2002 Nov; 22(21):7553-61.
[Mol Cell Biol. 2002]Proc Natl Acad Sci U S A. 2000 Jan 18; 97(2):716-21.
[Proc Natl Acad Sci U S A. 2000]Mol Cell Biol. 2001 Apr; 21(7):2555-69.
[Mol Cell Biol. 2001]Bioinformatics. 1999 Sep; 15(9):763-4.
[Bioinformatics. 1999]Science. 2002 Mar 15; 295(5562):2080-3.
[Science. 2002]EMBO J. 2000 Apr 3; 19(7):1587-97.
[EMBO J. 2000]