![]() | ![]() |
Formats:
|
||||||||||||||||||
Copyright © 2009 by Cold Spring Harbor Laboratory Press Lessons from X-chromosome inactivation: long ncRNA as guides and tethers to the epigenome Howard Hughes Medical Institute, Department of Molecular Biology, Massachusetts General Hospital, Department of Genetics, Harvard Medical School, Boston, Massachusetts 02114, USA 1Correspondence.E-MAIL lee/at/molbio.mgh.harvard.edu; FAX (617) 726-6893. Freely available online through the Genes & Development Open Access option. Abstract Transcriptome studies are revealing that the eukaryotic genome actively transcribes a diverse repertoire of large noncoding RNAs (ncRNAs), many of which are unannotated and distinct from the small RNAs that have garnered much attention in recent years. Why are they so pervasive, and do they have a function? X-chromosome inactivation (XCI) is a classic epigenetic phenomenon associated with many large ncRNAs. Here, I provide a perspective on how XCI is achieved in mice and suggest how this knowledge can be applied to the rest of the genome. Emerging data indicate that long ncRNAs can function as guides and tethers, and may be the molecules of choice for epigenetic regulation: First, unlike proteins and small RNAs, large ncRNAs remain tethered to the site of transcription, and can therefore uniquely direct allelic regulation. Second, ncRNAs command a much larger sequence space than proteins, and can therefore achieve very precise spatiotemporal control of development. These properties imply that long noncoding transcripts may ultimately rival small RNAs and proteins in their versatility as epigenetic regulators, particularly for locus- and allele-specific control. Keywords: X-chromosome inactivation, chromatin modifier, dosage compensation, epigenetic regulation, noncoding RNA Over the past several years, large-scale analyses of the mouse and human genomes have revealed that, while only 1.5% of the genome carries protein-coding information, 60%–80% of its nucleotides are transcribed. According to Unigene (Wheeler et al. 2008), there are 84,000 transcripts in the human genome, of which only 20,000–25,000 are associated with protein-coding genes. According to FANTOM3 (Carninci et al. 2005), 62% of the mouse genome is transcribed to make 181,000 transcripts. These large noncoding RNAs (ncRNAs) are often developmentally regulated, sometimes conserved, and frequently next to genes known to be subject to tight transcriptional control. A very small minority of the RNAs has previously described functions as catalysts and structural RNAs, distinctly different from the small effectors of the RNAi pathway (Cech 2009; Ghildiyal and Zamore 2009; Sharp 2009). A few also have been characterized as transcriptional scaffolds, a platform on which protein factors can be recruited to chromatin (Cam et al. 2009; Moazed 2009). However, the vast majority of these transcripts, which can range in size from 100 nucleotides (nt) to >100 kb and are apparently pervasive, has no obvious function. Some are antisense to known protein-coding genes (Katayama et al. 2005; He et al. 2008), whereas others originate in promoter regions and intergenic space (Claverie 2005; Kapranov et al. 2007a,b; Guttman et al. 2009; Mercer et al. 2009). In the post-genomic era, why so much cellular energy should be spent on RNA production has captured the imagination. Are they merely spurious transcripts—wasteful biproducts of genomic activity? Or do they encode useful information and serve crucial functions in the epigenome? Although it is now very fashionable to study “macroRNAs,” knowledge of RNA behemoths dates back decades and has been pursued intensively by those studying the once-unusual epigenetic phenomena of genomic imprinting (Sleutels and Barlow 2002; Edwards and Ferguson-Smith 2007; Wan and Bartolomei 2008) and X-chromosome inactivation (XCI) (Lyon 1961; Wutz 2003; Lucchesi et al. 2005; Masui and Heard 2006; Payer and Lee 2008). Discovery of the 17-kb Xist RNA in 1991 marked the beginning of a long fascination with regulatory ncRNAs in this field (Borsani et al. 1991; Brown et al. 1991a, 1992; Brockdorff et al. 1992). Discovery of a second transcript—the 40-kb Tsix antisense RNA (Lee and Lu 1999; Lee et al. 1999a)—brought early recognition that untranslated RNAs may come to dominate regulation of XCI. Today, the Xist and Tsix RNA pair serves as a paradigm for understanding sense–antisense relationships in eukaryotes and for long-range chromatin control. Around the “X-inactivation center” (Xic) (Brown et al. 1991b; Lee et al. 1996; Simmler et al. 1996; Willard 1996; Chureau et al. 2002), more than seven distinct ncRNA loci have now been identified, and it is clear that Xist and Tsix are not the only ones with regulatory properties. For years, the unique aspects of Xist and Tsix were thought to be evolutionary deviations—possibly “X-centric” and relevant only for unusual epigenetic phenomena such as sex-chromosome inactivation and imprinting. Below, I argue that lessons learned from the X can be applied elsewhere. In short vignettes, I illustrate models for ncRNA control of X and propose the idea of RNA as the molecule of choice for locus-specific and allelic control. I alert the reader to the fact that, for every aspect of XCI, there are many worthy contemporary models and that the goal of the following dissertation is not to review all of them, but rather to elaborate on current thinking about the role of ncRNA. I kindly refer interested parties to published works elsewhere, as cited, for in-depth discussion of alternative viewpoints. X-inactivation, checkpoints, and ncRNA XCI is the mechanism of dosage compensation in mammals by which one X chromosome is transcriptionally silenced in the female sex to ensure that XX and XY individuals have equivalent X-linked gene dosage (Lyon 1961; Wutz 2003; Lucchesi et al. 2005; Masui and Heard 2006; Payer and Lee 2008). Two forms of XCI occur in eutherian mammals. During preimplantation development, dosage compensation is imprinted to occur exclusively on the paternal X chromosome (XP) (Takagi 1974; Huynh and Lee 2003; Okamoto et al. 2004). Around the time of uterine implantation (mouse embryonic day 4.6–6.5 [E4.5–E6.5]), “imprinted XCI” is retained in the extraembryonic tissues (placenta) of the developing embryo. However, in the epiblast lineage (embryo proper), the imprint is erased, XP undergoes reactivation (Mak et al. 2004), and a second form of XCI takes hold. In “random XCI,” the two Xs in each female cell have an equal chance of being inactivated (Lyon 1961). Random XCI is controlled by a specialized X-linked region that is replete with ncRNA (Fig. 1
These ncRNAs collaborate to ensure orderly progression through the many crucial “checkpoints” of XCI (Fig. 3
Following the “counting” checkpoint, a choosing mechanism randomly selects one X chromosome as Xa and the other as Xi (Avner and Heard 2001; Boumil and Lee 2001). I favor the idea that “choice” occurs in a mutually exclusive manner to ensure that every event results in a nucleus with exactly one Xa and one Xi (Lee 2002, 2005). An alternative viewpoint posits that chromosome choice is not determined purposefully but is the outcome of a “stochastic” process—one that leads to the survival of cells that, by chance, selected one Xa and one Xi, and one that leads to the death of others that made the inappropriate selection of two Xa or two Xi (Monkhorst et al. 2008,2009). Still another model suggests that choice may be predetermined long before XCI actually initiates (Mlynarczyk-Evans et al. 2006). I refer readers to the cited works for further elaboration of these alternative models. Hereafter, discussion will focus on the preferred view that allelic choice occurs through a tightly controlled mechanism during XCI to achieve mutual exclusion of Xa and Xi without incurring cell death. In my opinion, the precision with which choice is determined implies the existence of a second checkpoint—in the form of cross-talking or a feedback loop—to guarantee distinct fates of the two Xs (Lee 2005). Because the two female Xs are thought to be equivalent in the pre-XCI state, how their epigenetic symmetry is broken at the onset of cell differentiation, especially when the Xs are bathed in the same nucleoplasm, remains one of the big questions in the field. Below, I argue that somatic “pairing” facilitates symmetry break, and that the same ncRNAs, Xite and Tsix, play crucial roles during this process. Once chosen, the Xi-elect must be distinctly marked from Xa-elect and initiate the process of chromosome-wide silencing in a strictly cis-limited fashion. The strict cis requirement presents several conceptual difficulties for the female cell. First, how is the initial silencer recruited only to one Xic when Xi and Xa-elect reside in the same nuclear milieu following the pairing and choice checkpoints? Second, how can silencing spread in an allele-specific fashion in cis along 150 Mb of sequence? As for all other checkpoints, the answers may lie in ncRNA: in this case, in the two overlapping transcripts, RepA (Zhao et al. 2008) and Xist RNA (Brown et al. 1992). X-chromosome pairing controlled by transcription of ncRNA Diverse as they are, ncRNAs of the Xic have been proposed to act in cis; i.e., on the chromosome that synthesizes them. Although cis-acting genes dominate the Xic, one viewpoint suggests that Xic function must also extend in trans (Marahrens 1999; Lee 2005). The requirement to select Xa and Xi in a mutually exclusive manner implies a means of interchromosomal communication or feedback to ensure that no cell befalls the lethal outcome of creating two Xa or two Xi. The idea of cross-talk was first supported by a homozygous knockout of Tsix, which led to a state in which choice became chaotic, with some female cells displaying two Xi, one Xi, or no Xi at all (Lee 2005). This result suggested that the noncoding Tsix gene, already known to be a negative regulator of Xist and a requirement for designating Xa (Lee and Lu 1999), must also play a role in the mutually exclusive choice of Xa and Xi. The subsequent discovery of the transient pairing of Xs in ES cells led to the realization that physical association provides a unique opportunity for the Xs to communicate directly (Anguera et al. 2006; Bacher et al. 2006; Carrel 2006; Xu et al. 2006). By fluorescence in situ hybridization (FISH), it was observed that, at the onset of cell differentiation and just prior to the initiation of chromosome-wide silencing, the Xs briefly make contact at the Xic. Before cell differentiation, the Xs are epigenetically equivalent and both express Xist RNA at very low levels. The pairing event then brings the two Xs together and leads to the establishment of asymmetry, as Xist RNA grows to engulf Xi while simultaneously disappearing from Xa. Genetic or pharmacological interference with pairing results in aberrant XCI patterns and loss of cell viability (Xu et al. 2006, 2007). The discovery of X–X interactions in ES cells has offered a convenient genetic model to understand homologous chromosome pairing, a phenomenon previously thought to be excluded in the mammalian soma. Pairing elements reside within a 15-kb region encompassing Xite and the 5′ end of Tsix (Fig. 1 Another common thread among the pairing-competent transgene fragments is the occurrence of cryptic and minor RNA polymerase II (Pol II) promoters with the potential to drive transcription independently of the major promoters (Sado et al. 2001; Ogawa and Lee 2003; Stavropoulos et al. 2005; Cohen et al. 2007; Xu et al. 2007). This observation raised the possibility that RNA may direct pairing. Although there is currently no direct evidence for the involvement of RNA per se, inhibiting Pol II activity using actinomycin D exerted discrete effects on the pairing process (Xu et al. 2007). Treating cells for 2.0–4.0 h abrogated X–X pairing, suggesting that pairing requires new transcription. Interestingly, time-course analysis indicated that treating with actinomycin D for as little as 0.5 h was sufficient to disrupt new pair formation but did not affect cohesion of previously formed pairs. Approximately 50% of preformed pairs still remained together at 1.0 h post-treatment, implying a pairing half-life of no more than 0.5–1.0 h. Together, these data argue that homologous X-chromosome interaction requires transcription of Tsix and Xite and, moreover, insinuate that the ncRNAs themselves may be involved in pairing. It therefore has been proposed that protein and RNA may collaborate to bring the Xs together and facilitate the breaking of chromosomal symmetry (Fig. 4
Binary state of X specified by antisense RNA By this model, the binary state of X is directly regulated by allele-specific expression patterns of Tsix (Lee and Lu 1999). How does Tsix act as the molecular switch and how does it specify Xa and Xi fates? Genetic analyses have shown that loss of Tsix expression on the Xi-elect enables transcriptional activation of Xist. Forced expression of the antisense RNA on an X chromosome renders that X incapable of expressing Xist (Luikenhuis et al. 2001; Stavropoulos et al. 2001), and deletion or truncation of the antisense RNA results in constitutively elevated Xist expression (Lee and Lu 1999; Sado et al. 2001; Morey et al. 2004; Shibata and Lee 2004; Ohhata et al. 2008). Conversely, continued expression of Tsix is required to maintain Xa activity during female cell differentiation. Chromatin immunoprecipitation (ChIP) analyses have revealed that changes in chromatin structure at or around the Xic are highly associated with events of XCI (Heard et al. 2001; Navarro et al. 2005, 2006; Sado et al. 2005; Sun et al. 2006). While differences in experimental systems have led to slightly different conclusions in some of these studies, most agreed that Tsix's effect on Xa and Xi involves changing the chromatin state of the Xist locus (Fig. 5
Opposite events occur on Xa-elect. During cell differentiation, Xa-elect continues to express Tsix, which in turn maintains the Tsix/Xist domain in the euchromatic state enriched for H3-K4 methylation and H4 acetylation. This euchromatic state paradoxically correlates with Xist repression. By RNA ChIP, it was shown that Tsix RNA occurs in a specific complex containing the de novo methyltransferase Dnmt3a at the Xist promoter (Sun et al. 2006). Allele-specific analysis indicated that DNA methylation occurs predominantly on the Xist allele of Xa-elect (Sado et al. 2005; Sun et al. 2006), presumably induced in cis by the bound Tsix RNA. Thus, Tsix RNA elicits two specific effects on Xa-elect: First, the RNA (or the act of transcribing it) directs euchromatic modifications to the Xist locus and, in doing so, apparently prevents the activation of the Xist allele in cis even as cell differentiation signals trigger XCI on Xi-elect. Second, it associates with Dnmt3a at the Xist promoter and facilitates de novo CpG methylation and stable silencing of Xa's Xist allele. These findings have led to the idea of RNA-directed DNA methylation (RDDM) and transcriptional gene silencing (TGS) on the X in mammals (Sun et al. 2006). Although both phenomena are well-established in yeast and plants (Volpe et al. 2002; Cam et al. 2009; Martienssen et al. 2008) and much less so in mammals (Morris et al. 2004; Kim et al. 2006; Moazed 2009), recent evidence indicated that Xist and Tsix RNAs can duplex in vivo, and that the long duplexes are processed to small RNAs during XCI (Ogawa et al. 2008). The small RNAs were 21–42 nt in size and occurred in the complementary regions of Tsix and Xist in ES cells, as well as within the promoter and 5′ end of Xist. In Dicer (Dcr)-deficient ES cells, production of the small RNAs was dramatically reduced during cell differentiation, suggesting that Dcr may be involved, directly or indirectly, in cleaving long duplexes to small RNAs. The possibility of endogenous siRNAs originating from overlapping long duplexes has also been proposed recently for mammalian oocytes (Tam et al. 2008). Intriguingly, when Dcr is deficient, female ES cells displayed reduced DNA methylation at the Xist promoter and, at the same time, elevated steady-state levels of Xist RNA in undifferentiated ES cells (Nesterova et al. 2008; Ogawa et al. 2008; Kanellopoulou et al. 2009). Because Dcr has pleiotropic effects on many aspects of cell physiology, indirect effects on Dnmt3a must be considered (Nesterova et al. 2008) alongside indirect effects of poor cell differentiation on XCI (Ogawa et al. 2008; Kanellopoulou et al. 2009). Nonetheless, the findings provide tantalizing, albeit preliminary evidence for RNAi's involvement in regulating DNA methylation and expression of Xist. For example, small RNAs produced from Xist and Tsix RNA may direct DNA methylation of the Xist promoter and silence it via an RNAi-mediated pathway. Silencing factors targeted to the X by ncRNA Xi-elect undergoes a cascade of changes that culminate in the outward spread of the ncRNA away from the Xic to blanket the rest of the X chromosome. Recent analysis has provided clues to the initial series of events. When Tsix RNA is down-regulated from Xi-elect, one of the first changes to appear is a concentration of H3-K27me3 at the 5′ end of Xist, concurrently with enrichment for Polycomb-repressive complex 2 (PRC2) (Sun et al. 2006; Zhao et al. 2008; BK Sun and JT Lee, unpubl.), the enzyme complex responsible for trimethylating H3-K27 (Cao et al. 2002; Czermin et al. 2002; Kuzmichev et al. 2002; Muller et al. 2002). PRC2 belongs to the Polycomb group (PcG) family of proteins that were originally discovered in fruit flies and are now known to have genome-wide repressive functions that are conserved from flies to mammals (Ringrose and Paro 2004; Schwartz and Pirrotta 2008). Intriguingly, the changes at the 5′ end of Xist occur prior to Xist up-regulation, but are followed rapidly by Xist transcriptional induction upon the initiation of cell differentiation (Sun et al. 2006). The concentration of PRC2 at the 5′ end of Xist is intriguing because recruitment of PRC2 and H3-K27me3 are two of the earliest changes to occur on the X following Xist up-regulation (Plath et al. 2003; Silva et al. 2003; Kohlmaier et al. 2004). The question was raised whether the 5′ end of Xist could function as a PRC2 “nucleation site” akin to Polycomb response elements (PRE) in Drosophila melanogaster that have remained elusive in mammals (Ringrose and Paro 2004; Schwartz and Pirrotta 2008). If so, what might be recruiting PRC2 to this region? Given the abundance of ncRNA within the Xic, could an as yet unidentified ncRNA be the missing link? Answers to these questions were facilitated by discovery of a novel 1.6-kb ncRNA located within the 5′ end of Xist (Zhao et al. 2008). Called “RepA,” this independent transcription unit encompasses Xist's Repeat A, a conserved motif composed of 7.5 tandem repeats of two stem–loop structures (Wutz et al. 2002). Genetic analyses have revealed that this repeat element is required for the initiation of X-chromosome silencing, as an X chromosome deleted for the repeat cannot initiate XCI (Wutz et al. 2002; Hoki et al. 2009). Prior to XCI, RepA RNA is produced from both Xs and occurs at a steady level of approximately six to seven copies per female cell—approximately twice the level of full-length Xist RNA in the pre-XCI state (Zhao et al. 2008). During XCI, RepA continues to be produced only from Xi. Unlike Xist RNA, however, RepA RNA is not up-regulated 100-fold during cell differentiation, and RNA FISH of autosomally produced RepA suggests that it localizes to the Xic without spreading along the X in cis. Most importantly, RepA RNA was found to interact directly with PRC2 (Zhao et al. 2008). RNA immunoprecipitations (RIPs) using antibodies against two of PRC2's subunits, Ezh2 and Suz12, demonstrated that RepA RNA is complexed with PRC2 in vivo. Gel shift analysis using RepA oligos showed that a 28-nt conserved stem–loop structure interacts directly with the catalytic subunit Ezh2 in vitro. When a tet-inducible RepA transgene is placed at an ectopic autosomal location, its expression leads to increased recruitment of PRC2. Thus, RepA alone, without full-length Xist, is sufficient to recruit PRC2. Furthermore, the experiment suggests that transcription of RepA or the ncRNA itself is important, as the effect of recruitment is only observed when cells were treated with doxycyline (which results in expression of the RepA transgene). Additional analysis using shRNAs to destroy the RNA without compromising transcription demonstrated that the act of recruiting can be attributed to the ncRNA itself. On the basis of these findings, it has been posited that RepA is the RNA cofactor that initially recruits PRC2 to the X chromosome. When RepA RNA is knocked down by shRNA, Xist RNA clusters and H3-K27me3 foci could not form on Xi-elect. By contrast, knockdown of a nonoverlapping region of Xist exon 1 displayed a lesser effect, suggesting that the effect of RepA shRNA resulted from loss of RepA RNA rather than Xist RNA. Knocking down Eed or Ezh2 protein also compromised Xist induction and H3-K27me3 of Xi-elect. These data argued that the PRC2–RepA complex's initial function may be to trimethylate H3-K27 locally at the 5′ end of Xist and to create the heterochromatic patch that has been proposed to be essential for Xist transactivation (Sun et al. 2006). Previously, PRC2 seemed an unlikely binding partner for the Repeat A motif, as some studies reported no effects on PRC2 recruitment when Repeat A was deleted (Plath et al. 2003; Kohlmaier et al. 2004). However, one of the studies actually found that PRC2 recruitment dropped precipitously in the Repeat A mutants (Kohlmaier et al. 2004), supporting the idea that RepA is required to recruit PRC2. Other studies have also suggested that mutations in Eed do not affect the initiation of XCI (Kalantry and Magnuson 2006; Schoeftner et al. 2006). It is noteworthy that endogenous regulation of Xist activation was bypassed by fusion to a tet-inducible promoter in some studies (Kohlmaier et al. 2004; Schoeftner et al. 2006) (so that the effect on H3-K27me3 on the Xist promoter would have been masked). The lack of effect seen with both Eed mutants may indicate the functional redundancy between Polycomb proteins, as proposed by Schoeftner et al. (2006). Tsix regulation of PRC2–RepA function By pretreating RIP products with various nucleases, it was learned that the RNA bound by PRC2 possess both single-stranded (ss) and double-stranded (ds) character (Zhao et al. 2008). The RNA was destroyed in the presence of RNase I and RNase V1, which digest ssRNA and dsRNA respectively, whereas pretreatment with RNase H and DNase I had no effect on the RNA's ability to be reverse-transcribed and amplified by PCR. By inference, the bound transcript may be one RNA strand containing a secondary structure formed by intramolecular base-pairing, such as that computationally predicted to occur within the tandem elements of the Repeat A motif (Wutz et al. 2002). Alternatively, the bound RNA may be a duplex formed by intermolecular base-pairing between two complementary single strands, such as would occur should RepA and its antisense counterpart, Tsix, anneal to each other in vivo. Indeed, in vivo RNase protection assays have shown that dsRNA are detected in pre-XCI ES cells (Ogawa et al. 2008). Moreover, when RIP analysis was performed in a manner that could discern strandedness, both forward and reverse strands were detected (Zhao et al. 2008). Gel shift analysis confirmed that PRC2 could directly bind either sense or antisense RNA alone. Thus, PRC2 unexpectedly incorporates both RepA and Tsix RNA, although it is not known if each PRC2 complex binds both strands or if binding of sense and antisense RNAs is mutually exclusive. These findings provoked ideas about how the PRC2–RepA complex might be regulated. Because Tsix is an established antagonist of Xist, Tsix RNA may directly control the function of PRC2–RepA in several ways. First, given that Tsix RNA could bind both RepA RNA and PRC2, Tsix RNA could prevent formation of the PRC2–RepA complex by titrating RepA away from PRC2, especially if the binding of Tsix or RepA RNA to PRC2 were mutually exclusive. Because Tsix RNA is present in a 10-fold to 100-fold molar excess over RepA/Xist RNA during the pre-XCI stage in vivo (Shibata and Lee 2003), binding to Tsix RNA would be thermodynamically favored. Instead of blocking PRC2 binding to RepA, Tsix RNA could also interfere with the loading of precomplexed PRC2–RepA to chromatin or, alternatively, with the complex's catalytic activity on chromatin. Notably, some PRC2 already can be found to bind RepA in the pre-XCI state, although the extent to which this occurs is not known. RIP and ChIP studies showed that the RNA–protein interaction and the complex's contact with chromatin are biochemically separable (Zhao et al. 2008). While bound to RNA in the pre-XCI state, PRC2 was not enriched at the 5′ end of Xist and H3-K27me3 levels were accordingly low. At this time, Tsix RNA levels were high. Only upon cell differentiation and the initiation of XCI—when Tsix RNA was down-regulated from Xi-elect—did PRC2 bind Xist chromatin and methylate H3-K27. In my model (Fig. 6
One of the most puzzling questions in the Polycomb field has been how the complexes are recruited to their target destinations. The idea of an RNA cofactor for PcG proteins has been suspected for quite some time, beginning with the curious observation that the stability of PcG complexes may be affected by RNase treatment (R Paro, pers. comm.). In Drosophila, PcG complexes are known to contain sequence-specific DNA-binding proteins such as Zeste, Pipsqueak (PSQ), and Pho, which could target PcG complexes to the genome (Ringrose and Paro 2004; Schwartz and Pirrotta 2008). Because mammalian complexes do not necessarily associate with such proteins, how the mammalian proteins can be targeted in a sequence-specific manner has been a point of major interest. It has been hypothesized that specific ncRNA may be the missing link that directs the chromatin modifier to its genomic target. Why long ncRNAs make excellent guides and tethers for cis regulation By virtue of the fact that long ncRNA remains tethered to its parent locus during the act of transcription, large RNAs may be the molecule of choice for regulatory systems challenged by the need to act in cis. No member of the proteome can subsume this function, as memory of allelic origin is always lost once mRNA exits the nucleus and is translated to protein. I argue that such ncRNAs are necessarily lengthy, with a 5′ business end that binds protein partners as soon as it is synthesized and a transcriptionally lagging 3′ end that therefore tethers the RNA to chromatin through Pol II during the act of transcription (Fig. 7
Because RNA is also inherently sequence-specific and is transcribed in a developmentally specific manner, RNA would also be an ideal regulator of spatial and temporal specificity during development. Transcription factors and DNA-binding proteins usually interact with an entire network of genes (e.g., Oct4 binds an 8mer consensus that appears many times over in the genome), and therefore rarely specify a single location in the genome. In contrast, long ncRNA can identify a unique address. For instance, Tsix and RepA RNAs occur only once in the genome and are therefore uniquely positioned to attract specific chromatin modifiers to that location. With a virtually infinite number of unique addresses that can be specified by a combination of RNA length and nucleotide permutations, sequence space for the long ncRNA transcriptome must ultimately far exceed that of the proteome. Here, I describe the potential for long transcripts to function as tethers and guides that recruit chromatin modifiers—and even entire chromosomes (Bacher et al. 2006; Xu et al. 2006; Zhang et al. 2007)—to a specific location in the genome. RNA conventionally has been regarded as information carriers, translating genetic code to protein sequence. The idea of RNA guidance takes this nucleic acid beyond its traditional role of messenger in the transition from genotype to phenotype and moves it into a more dynamic arena of reshaping the epigenome. The principles of RNA guidance need not be confined to the X. Indeed, there is very recent evidence of PRC2 being associated either directly or indirectly with other ncRNAs, such as HOTAIR (Rinn et al. 2007) or Kcnqt1ot1 (Pandey et al. 2008). There is also no reason to think that RNA guides would be restricted to Polycomb complexes. ChIP using antibodies against G9a (Nagano et al. 2008) and Dnmt3a (Sun et al. 2006) have also hinted at RNA in their complexes, although it remains to be seen whether there is direct RNA–protein interaction or whether, instead, the RNAs interact with the proteins indirectly via the underlying chromatin. A general RNA-based mechanism would rationally explain how a limited set of chromatin modifiers, which often lack subunits with sequence-specific DNA-binding activity but curiously possess putative RNA-binding domains (Denisenko et al. 1998; Bernstein and Allis 2005; Bernstein et al. 2006), could be directed to the mammalian genome in a spatially and temporally unique manner. These unique properties of long ncRNA may explain why evolution, at the expense of protein-coding genes, has placed so many ncRNA genes at the Xic. It seems likely that strategies employed by the X will become recurrent themes throughout the epigenome, as indeed interchromosomal interactions are not confined to the X (LaSalle and Lalande 1996; Spilianakis et al. 2005; Lomvardas et al. 2006), many autosomal genes are now known to possess antisense partners (Katayama et al. 2005; He et al. 2008), and a large number of long ncRNAs are being discovered across the genome (Claverie 2005; Kapranov et al. 2007a,b; Guttman et al. 2009; Mercer et al. 2009). Even within the Xic, ncRNA's full capabilities have yet to be revealed, as most of the seven known ncRNA loci still remain to be studied. It seems probable that long ncRNAs ultimately will rival small RNAs and proteins in versatility. With its unique ability to function in cis and its inherent command of a large sequence space, long ncRNA may prove to be the molecule of choice for many challenges presented by epigenetic regulation. Acknowledgments I am grateful to all laboratory members for many valuable discussions, and especially thank Brian del Rosario, Daniel Kim, Stefan Pinter, Sha Sun, and Jing Zhao for thoughtful comments on the manuscript. I am an investigator of the HHMI. Footnotes Article is online at http://www.genesdev.org/cgi/doi/10.1101/gad.1811209. References
|
PubMed related articles
Your browsing activity is empty. Activity recording is turned off. |
|||||||||||||||||
Nucleic Acids Res. 2008 Jan; 36(Database issue):D13-21.
[Nucleic Acids Res. 2008]Science. 2005 Sep 2; 309(5740):1559-63.
[Science. 2005]Cell. 2009 Feb 20; 136(4):599-602.
[Cell. 2009]Nat Rev Genet. 2009 Feb; 10(2):94-108.
[Nat Rev Genet. 2009]Cell. 2009 Feb 20; 136(4):577-80.
[Cell. 2009]Curr Opin Cell Biol. 2007 Jun; 19(3):281-9.
[Curr Opin Cell Biol. 2007]Adv Genet. 2008; 61():207-23.
[Adv Genet. 2008]Nature. 1961 Apr 22; 190():372-3.
[Nature. 1961]Bioessays. 2003 May; 25(5):434-42.
[Bioessays. 2003]Annu Rev Genet. 2005; 39():615-51.
[Annu Rev Genet. 2005]Nature. 1961 Apr 22; 190():372-3.
[Nature. 1961]Bioessays. 2003 May; 25(5):434-42.
[Bioessays. 2003]Annu Rev Genet. 2005; 39():615-51.
[Annu Rev Genet. 2005]Cold Spring Harb Symp Quant Biol. 2006; 71():419-28.
[Cold Spring Harb Symp Quant Biol. 2006]Annu Rev Genet. 2008; 42():733-72.
[Annu Rev Genet. 2008]Nature. 1991 Jan 3; 349(6304):82-4.
[Nature. 1991]Cell. 1996 Jul 12; 86(1):83-94.
[Cell. 1996]Hum Mol Genet. 1996 Nov; 5(11):1713-26.
[Hum Mol Genet. 1996]Cell. 1996 Jul 12; 86(1):5-7.
[Cell. 1996]Genome Res. 2002 Jun; 12(6):894-908.
[Genome Res. 2002]Nature. 1979 Sep 27; 281(5729):311-3.
[Nature. 1979]Science. 2008 Sep 19; 321(5896):1693-5.
[Science. 2008]Nature. 2009 Jul 2; 460(7251):128-32.
[Nature. 2009]Cell. 1994 Jun 3; 77(5):639-50.
[Cell. 1994]Nat Genet. 1998 Jul; 19(3):249-53.
[Nat Genet. 1998]Nat Rev Genet. 2001 Jan; 2(1):59-67.
[Nat Rev Genet. 2001]Hum Mol Genet. 2001 Oct 1; 10(20):2225-32.
[Hum Mol Genet. 2001]Nat Genet. 2002 Sep; 32(1):195-200.
[Nat Genet. 2002]Science. 2005 Jul 29; 309(5735):768-71.
[Science. 2005]Cell. 2008 Feb 8; 132(3):410-21.
[Cell. 2008]Science. 2005 Jul 29; 309(5735):768-71.
[Science. 2005]Science. 2008 Oct 31; 322(5902):750-6.
[Science. 2008]Cell. 1992 Oct 30; 71(3):527-42.
[Cell. 1992]Genes Dev. 1999 Oct 15; 13(20):2624-32.
[Genes Dev. 1999]Science. 2005 Jul 29; 309(5735):768-71.
[Science. 2005]Cell. 1999 Oct 1; 99(1):47-57.
[Cell. 1999]Cold Spring Harb Symp Quant Biol. 2006; 71():429-37.
[Cold Spring Harb Symp Quant Biol. 2006]Nat Cell Biol. 2006 Mar; 8(3):293-9.
[Nat Cell Biol. 2006]Science. 2006 Feb 24; 311(5764):1107-9.
[Science. 2006]Science. 2006 Feb 24; 311(5764):1149-52.
[Science. 2006]Nat Genet. 2007 Nov; 39(11):1390-6.
[Nat Genet. 2007]Science. 2006 Feb 24; 311(5764):1149-52.
[Science. 2006]Nat Cell Biol. 2006 Mar; 8(3):293-9.
[Nat Cell Biol. 2006]Nat Genet. 2007 Nov; 39(11):1390-6.
[Nat Genet. 2007]Oncogene. 1990 Dec; 5(12):1743-53.
[Oncogene. 1990]Cell. 1999 Aug 6; 98(3):387-96.
[Cell. 1999]Development. 2001 Apr; 128(8):1275-86.
[Development. 2001]Mol Cell. 2003 Mar; 11(3):731-43.
[Mol Cell. 2003]Mol Cell Biol. 2005 Apr; 25(7):2757-69.
[Mol Cell Biol. 2005]Dev Cell. 2007 Jan; 12(1):57-71.
[Dev Cell. 2007]Nat Genet. 2007 Nov; 39(11):1390-6.
[Nat Genet. 2007]Science. 2006 Feb 24; 311(5764):1149-52.
[Science. 2006]Nat Genet. 2007 Nov; 39(11):1390-6.
[Nat Genet. 2007]Phys Rev Lett. 2007 Mar 9; 98(10):108104.
[Phys Rev Lett. 2007]Nature. 2009 Jul 2; 460(7251):128-32.
[Nature. 2009]PLoS Comput Biol. 2007 Nov; 3(11):e210.
[PLoS Comput Biol. 2007]Cold Spring Harb Symp Quant Biol. 2006; 71():429-37.
[Cold Spring Harb Symp Quant Biol. 2006]Cell. 1999 Oct 1; 99(1):47-57.
[Cell. 1999]Mol Cell Biol. 2001 Dec; 21(24):8512-20.
[Mol Cell Biol. 2001]Proc Natl Acad Sci U S A. 2001 Aug 28; 98(18):10232-7.
[Proc Natl Acad Sci U S A. 2001]Development. 2001 Apr; 128(8):1275-86.
[Development. 2001]EMBO J. 2004 Feb 11; 23(3):594-604.
[EMBO J. 2004]Cell. 2001 Dec 14; 107(6):727-38.
[Cell. 2001]Genes Dev. 2005 Jun 15; 19(12):1474-84.
[Genes Dev. 2005]Genes Dev. 2006 Oct 15; 20(20):2787-92.
[Genes Dev. 2006]Dev Cell. 2005 Jul; 9(1):159-65.
[Dev Cell. 2005]Mol Cell. 2006 Mar 3; 21(5):617-28.
[Mol Cell. 2006]Mol Cell. 2006 Mar 3; 21(5):617-28.
[Mol Cell. 2006]Mol Cell. 2006 Mar 3; 21(5):617-28.
[Mol Cell. 2006]Dev Cell. 2005 Jul; 9(1):159-65.
[Dev Cell. 2005]Mol Cell. 2006 Mar 3; 21(5):617-28.
[Mol Cell. 2006]Science. 2002 Sep 13; 297(5588):1833-7.
[Science. 2002]Cell. 2009 Feb 20; 136(4):610-4.
[Cell. 2009]Cold Spring Harb Symp Quant Biol. 2008; 73():265-71.
[Cold Spring Harb Symp Quant Biol. 2008]Science. 2004 Aug 27; 305(5688):1289-92.
[Science. 2004]Mol Cell. 2006 Mar 3; 21(5):617-28.
[Mol Cell. 2006]Science. 2008 Oct 31; 322(5902):750-6.
[Science. 2008]Science. 2002 Nov 1; 298(5595):1039-43.
[Science. 2002]Cell. 2002 Oct 18; 111(2):185-96.
[Cell. 2002]Genes Dev. 2002 Nov 15; 16(22):2893-905.
[Genes Dev. 2002]Science. 2003 Apr 4; 300(5616):131-5.
[Science. 2003]Dev Cell. 2003 Apr; 4(4):481-95.
[Dev Cell. 2003]PLoS Biol. 2004 Jul; 2(7):E171.
[PLoS Biol. 2004]Annu Rev Genet. 2004; 38():413-43.
[Annu Rev Genet. 2004]Curr Opin Cell Biol. 2008 Jun; 20(3):266-73.
[Curr Opin Cell Biol. 2008]Science. 2008 Oct 31; 322(5902):750-6.
[Science. 2008]Nat Genet. 2002 Feb; 30(2):167-74.
[Nat Genet. 2002]Development. 2009 Jan; 136(1):139-46.
[Development. 2009]Science. 2008 Oct 31; 322(5902):750-6.
[Science. 2008]Mol Cell. 2006 Mar 3; 21(5):617-28.
[Mol Cell. 2006]Science. 2003 Apr 4; 300(5616):131-5.
[Science. 2003]PLoS Biol. 2004 Jul; 2(7):E171.
[PLoS Biol. 2004]PLoS Genet. 2006 May; 2(5):e66.
[PLoS Genet. 2006]EMBO J. 2006 Jul 12; 25(13):3110-22.
[EMBO J. 2006]Science. 2008 Oct 31; 322(5902):750-6.
[Science. 2008]Nat Genet. 2002 Feb; 30(2):167-74.
[Nat Genet. 2002]Science. 2008 Jun 6; 320(5881):1336-41.
[Science. 2008]Hum Mol Genet. 2003 Jan 15; 12(2):125-36.
[Hum Mol Genet. 2003]Science. 2008 Oct 31; 322(5902):750-6.
[Science. 2008]Science. 2008 Oct 31; 322(5902):750-6.
[Science. 2008]Mol Cell. 2006 Mar 3; 21(5):617-28.
[Mol Cell. 2006]Science. 2008 Oct 31; 322(5902):750-6.
[Science. 2008]Annu Rev Genet. 2004; 38():413-43.
[Annu Rev Genet. 2004]Curr Opin Cell Biol. 2008 Jun; 20(3):266-73.
[Curr Opin Cell Biol. 2008]Nat Cell Biol. 2006 Mar; 8(3):293-9.
[Nat Cell Biol. 2006]Science. 2006 Feb 24; 311(5764):1149-52.
[Science. 2006]Cell. 2007 May 18; 129(4):693-706.
[Cell. 2007]Cell. 2007 Jun 29; 129(7):1311-23.
[Cell. 2007]Mol Cell. 2008 Oct 24; 32(2):232-46.
[Mol Cell. 2008]Science. 1996 May 3; 272(5262):725-8.
[Science. 1996]Nature. 2005 Jun 2; 435(7042):637-45.
[Nature. 2005]Cell. 2006 Jul 28; 126(2):403-13.
[Cell. 2006]Science. 2005 Sep 2; 309(5740):1564-6.
[Science. 2005]Science. 2008 Dec 19; 322(5909):1855-7.
[Science. 2008]