Logo of plosgenPLoS GeneticsSubmit to PLoSGet E-mail AlertsContact UsPublic Library of Science (PLoS)View this Article
PLoS Genet. 2008 Apr; 4(4): e1000056.
Published online 2008 Apr 25. doi:  10.1371/journal.pgen.1000056
PMCID: PMC2289841

Small RNA-Directed Epigenetic Natural Variation in Arabidopsis thaliana

Joseph R. Ecker, Editor


Progress in epigenetics has revealed mechanisms that can heritably regulate gene function independent of genetic alterations. Nevertheless, little is known about the role of epigenetics in evolution. This is due in part to scant data on epigenetic variation among natural populations. In plants, small interfering RNA (siRNA) is involved in both the initiation and maintenance of gene silencing by directing DNA methylation and/or histone methylation. Here, we report that, in the model plant Arabidopsis thaliana, a cluster of ∼24 nt siRNAs found at high levels in the ecotype Landsberg erecta (Ler) could direct DNA methylation and heterochromatinization at a hAT element adjacent to the promoter of FLOWERING LOCUS C (FLC), a major repressor of flowering, whereas the same hAT element in ecotype Columbia (Col) with almost identical DNA sequence, generates a set of low abundance siRNAs that do not direct these activities. We have called this hAT element MPF for Methylated region near Promoter of FLC, although de novo methylation triggered by an inverted repeat transgene at this region in Col does not alter its FLC expression. DNA methylation of the Ler allele MPF is dependent on genes in known silencing pathways, and such methylation is transmissible to Col by genetic crosses, although with varying degrees of penetrance. A genome-wide comparison of Ler and Col small RNAs identified at least 68 loci matched by a significant level of ∼24 nt siRNAs present specifically in Ler but not Col, where nearly half of the loci are related to repeat or TE sequences. Methylation analysis revealed that 88% of the examined loci (37 out of 42) were specifically methylated in Ler but not Col, suggesting that small RNA can direct epigenetic differences between two closely related Arabidopsis ecotypes.

Author Summary

Phenotypic variation has been mainly attributed to their differences in genetic materials, i.e., the DNA sequence. The advances in Epigenetics in past decades has revealed it as a fundamental mechanism that could inheritably influence gene function without change in DNA sequence, but by modulating chemical modifications on DNA itself (methylation), or on histone proteins, which package the DNA further into nucleosome. Nevertheless, the roles of epigenetic regulation in natural variation were not explored much because of the limitation in high-throughput analytical tools. A recent study in model plant Arabidopsis showed that there are many DNA methylation polymorphisms between the two ecotypes. In plant, a subset of RNA named small interfering RNA (siRNA), is capable of triggering the epigenetic modifications on DNA or histone at their target region with complementary nucleotide sequences. Here, we took a view from the small RNA side and by applying molecular and bioinformatic approaches we showed that the same region could be led to a different epigenetic status because of the difference in their corresponding small RNA abundance and between the two closely related Arabidopsis ecotypes, suggesting that there could be small RNA-directed epigenetic differences among natural populations.


Epigenetics, defined as the study of heritable alteration in gene expression without changes in DNA sequence, has greatly expanded our understanding of inheritance [1]. A recent study of DNA methylation by tiling array analysis of Arabidopsis Chromosome 4 in Col and Ler showed that although transposable elements (TEs) are often methylated, the methylation in the transcribed regions of genes is highly polymorphic between these two ecotypes [2]. Although epigenetic differences could potentially contribute to evolution [3][5], studies of evolution and natural variation have still been focused mainly on sequence variation, and little is known about the role of epigenetic machinery in these processes. This is primarily due to the lack of evidence for epigenetic natural variation between populations.

Small interfering RNAs (siRNAs), as a key player in the epigenetic machinery, have been well documented for their general role in gene silencing at both the transcriptional and post-transcriptional levels [6],[7]. In Arabidopsis, ∼24 nt siRNAs can direct DNA methylation (RNA-directed DNA methylation, RdDM) and chromatin remodeling at their target loci [8]. In the RdDM process, ∼24 nt siRNAs are incorporated into ARGONAUTE 4 (AGO4)-containing complexes and further guide the DOMAINS REARRANGED METHYLTRANSFERASE 2 (DRM2) to de novo methylate their target DNA [9],[10]; once established, the non-CG methylation could be maintained by DRM2 and/or CHROMOMETHYLASE 3 (CMT3) in a locus-specific manner, and the CG methylation by METHYLTRANSFERASE 1 (MET1) [11]. Recent advances in high-throughput sequencing techniques have enabled the thorough exploration of the small RNAs populations [12][16]. Therefore, together with the complete genome sequence, we are able to directly examine whether there are regions specifically matched by siRNAs that differ among ecotypes, a situation that could lead to epigenetic natural variation.

FLC, a MADS box transcription factor, is a major repressor of the transition to flowering in Arabidopsis, and many genes coordinately function in flowering time control by regulating the amount of FLC transcript [17]. In addition, allelic variation at FLC, both genetic [18][21] and epigenetic [22],[23], contributes to the differences in flowering time and vernalization response among accessions, which makes FLC a classic locus for the study of natural variation in Arabidopsis. Previous studies have shown that in Ler, a 1224 base pair (bp) nonautonomous Mutator-like transposable element (TE) inserted in the first intron of FLC (FLC-TE-Ler) [19] was methylated and heterochromatic under the direction of ∼24 nt siRNAs generated by homologous TEs, and mutation of HUA ENHANCER 1 (HEN1) in Ler (hen1-1), a key component in small RNA biogenesis [7], released the transcriptional silencing of FLC-Ler [22].

In this study, we discovered a cluster of ∼24 nt siRNAs that are present at high levels in the ecotype Ler and that could direct DNA methylation and heterochromatinization adjacent to FLC promoter [24]. However siRNAs matching to the same region in Col are of low abundance and cannot direct DNA methylation. Furthermore, from comparisons between Ler and Col of small RNA data produced by high-throughput sequencing, we identified at least 68 loci that are matched by significant levels of ∼24 nt siRNAs, and 88% are methylated in Ler but not Col from a set of 42 loci that were examined.. Although siRNA clusters are often heavily methylated [25] and a large proportion of the methylation polymorphisms between Col and Ler are not associated with small RNAs [2], our data reveal that there could still be considerable small RNA-directed epigenetic natural variation between two ecotypes of Arabidopsis.


A Region Adjacent to the Promoter of FLC is Methylated in Ler but not Col

In addition to the previously described Mutator-like transposable element (TE) inserted in the first intron of FLC [19] in Ler, we found that a region located adjacent to the promoter of the FLC was specifically methylated in Ler but not in Col (Figure 1A). We named this region MPF (Methylated region near Promoter of FLC). Restriction enzymes including AciI, HpyCH4 IV and Fnu4HI, which are sensitive to CpG methylation, were able to cut outside of the MPF but not within this region in Ler (Figure 1). Notably different from the TE inserted in FLC-Ler, the MPF of Ler and Col share almost identical sequences (Figure S1). Bisulfite sequencing of MPF (B1 region, Figure 2A) revealed that a small region of less than 100 bp was exhibited a very high level of asymmetric methylation (also called CHH methylation, where H represents A, C or T) (Figure 2C). This region also demonstrated extensive CpG and CNG (where N is any nucleotide) methylation (Figure 2C). In addition, no DNA methylation was found outside the MPF (the B2 and B3 regions, Figure 2A) in Ler (data not shown) or the MPF in Col (Figure 3A) by bisulfite sequencing.

Figure 1
DNA Methylation Analysis of the FLC Promoter by Southern Blots.
Figure 2
RNA-directed DNA Methylation and Heterochromatinization at the MPF.
Figure 3
Methylation Analysis of MPF and FLC-TE.

High Levels of MPF-siRNAs in Ler, but not Low Levels in Col, Direct DNA Methylation and Heterochromatinization at MPF

Since asymmetric methylation is the hallmark of RdDM [26], we decided to verify whether there are corresponding siRNAs matching to this methylated region in Ler. Because no methylation was found at the MPF in Col, we speculated that there would be no small RNAs matching to this region. However, four 17 nt tags with very low abundances (approximately two transcripts per quarter-million, TPQ) were found in the Col-derived small RNA massively parallel signature sequencing (MPSS) datasets [12]. These small RNAs precisely matched both strands of the highly asymmetrically methylated region within MPF (Figure 2B). We performed a small RNA Northern blot hybridization to verify these small RNA in Col and Ler. By using an LNA (locked nucleic acid) modified oligonucleotide probe (Figure 2B) and a large amounts of RNA enriched for small RNAs (see materials and method for more details), we found that siRNAs complementary to this probe (MPF-siRNAs) were more abundant in Ler than in Col (Figure 2D). Published high-throughput small RNA 454 sequencing datasets from Ler [15] confirmed our RNA gel blot results. In those data, six unique 23 to 24 nt small RNAs were found matching to a region of <50 bp at the MPF, in exactly the same region as the Col-derived MPF-siRNAs (Figure 2B). Analyses of additional Col-derived 454 small RNA data [16],[27] didn't identify any MPF-matching small RNAs, possibly due to lower sequencing depth compared to that of the MPSS data. We performed chromatin immunoprecipitation (ChIP) experiments and demonstrated that the MPF in Ler was enriched in H3K9me2, a characteristic of heterochromatin, in comparison to Col (Figure 2E). These data suggest that the high levels of MPF-siRNAs in Ler could trigger DNA methylation and heterochromatinization at MPF whereas the lower levels in Col might not be sufficient.

Methylation at MPF Is Sensitive to Deficiency in RdDM

Next, we investigated methylation at the MPF using silencing pathway mutants in either a Ler background or in lines that had been backcrossed to Ler to have the homozygote FLC-Ler allele. These mutants included hen1-1, cmt3-7, ago4-1, kryptonite-2 (kyp, a histone H3K9 methyltransferase, also known as SUVH4, can affect the DNA methylation at some loci[28][30], and drm2 5×Ler (homozygous drm2 backcrossed five times to Ler). Methylation at MPF was sensitive to the deficiency in the RdDM machinery: all mutants tested, with the exception of kyp-2, completely relieved methylation in all three sequence contexts at MPF (Figure 3A and Figure S2A). Although KYP has been reported to control CNG methylation together with CMT3 [26],[30], the methylation at MPF was independent of its function, perhaps because MPF at several hundred base pairs is too small for KYP to maintain the positive feed back between DNA methylation and chromatin modification [30]. Alternatively, in addition to KYP, the heterochromatic feature of this region might be redundantly controlled by other two histone H3K9 methyltransferases, SUVH5 and SUVH6 [31]. In addition, methylation of the nearby TE insertion (Figure 3B and Figure S2C) was also sensitive to ago4-1 and hen1-1 (Figure 3B). However, none of these mutants released all DNA methylation at AtSN1, a retroelement which also undergoes RdDM [26] (Figure 3C). Moreover, AGO4 complementation [15] could not restore DNA methylation at the MPF in ago4-1 (data not shown). This situation resembles the FWA locus whose methylation, once lost in ddm1(decrease in DNA methylation 1) mutant, is not recovered again even in the presence of wild type DDM1 [32]. The MPF in hen1-4, a strong hen1 allele in the Col background, had an identical methylation pattern to Col (Figure 1). Also, the identical methylation pattern of the miRNA deficient mutant dcl1-9 [7] to Ler at MPF (Figure S2B) ruled out the possibility that the restricted methylation at MPF is directed by miRNAs [33]. These observations were substantially different from prior analyses of silenced loci, at which DNA methylation was often affected in certain but never all sequence contexts by mutants in the RdDM pathway [26].

Methylation at MPF Is Independent of the TE Insertion Nearby

Since MPF is methylated and it is near to the TE insertion in FLC-Ler, it was of interest to investigate whether the methylation at MPF is induced by the TE. We examined the methylation status of MPF in several accessions that are also reported to contain transposable elements inserted in the first intron of FLC (Figure S3A) [19],[20]. These were tested by McrBC-PCR [34] (for Bd-0, JI-1, Stw-0, Kin-0 (CS1273), and Gr-3) and bisulfite sequencing (for Da(1)-12). Although the MPF is methylated in Bd-0, JI-1 and Kin-0 (CS1273), it remains unmethylated in Stw-0, Gr-3 and Da(1)-12 (Figure S3B, and data not shown for Da(1)-12) indicating that the TE insertions nearby are dispensable for the methylation at MPF.

A previous study using 27 Arabidopsis accessions showed that the FLC-TE in Ler was also detected in Dijon-G and Di-2 (Figure S3A) but was absent in the closely related Landsberg-0 or Di-1 [18]. McrBC-PCR analysis showed that MPF is methylated in all four of these accessions, even in those without the FLC-TE insertion (Figure S3C), which further confirmed that the methylation at MPF is independent of the TE insertion nearby.

Origin of MPF-siRNAs

To study the origin of the MPF-siRNAs, we found that a 220 bp sequence at MPF is absent in one Kin-0 accession (CS6755, different from the Kin-0 (CS1273) accession mentioned above that contains a methylated MPF). Further analysis revealed that this difference is caused by the insertion of a non-autonomous hAT element [35] with the typical 8 bp TSD (target site duplication) and short terminal inverted repeats (TIRs) (Figure 4 and Figure S1). However, MPF-siRNAs in Ler are probably not derived from other hAT elements because those MPF-siRNAs with the full length information from 454 sequencing in Ler [15] have only one match (at MPF) in the genome; also, genomic Southern blot hybridization revealed that Ler do not contains extra copy of this hAT element comparing to Col (Figure S4). Therefore, the MPF-siRNAs are probably generated from MPF itself.

Figure 4
Structure of MPF in Three Accessions.

Methylation State at MPF in Ler Is Transmissible to Col by Genetic Crossing but with Extensive Diversity in the F1

In paramutation, the silenced paramutagenic lines are able to confer the active state of the paramutable lines, and make them become paramutagenic [36]. To test whether the methylated state at MPF in Ler is transmissible, we performed bisulfite sequencing to investigate the DNA methylation status in four F1 lines from the crosses of both Col ♀×Ler ♂ and Ler ♀×Col ♂, with the single nucleotide polymorphisms (SNPs) at MPF (Figure S1) used to distinguish the Col and Ler derived sequencing results (Figure 5A). In addition, twenty-four more lines from reciprocal crosses were tested for their MPF methylation by real-time McrBC-PCR (Figure 5B). These experiments revealed extensive diversity in the methylation status of MPF in each individual line in the F1 generation. This diversity could be summarized in the following way: 1) in some lines, the MPF-siRNAs from Ler are able to trigger the de novo methylation at Col-derived MPF; 2) in some other lines, not only the Col-derived MPF remains unmethylated, the Ler-derived MPF could even lose its methylation; 3) there are also cases in which the Ler-derived MPF remains methylated and Col-derived MPF remains unmethylated, just like their ancestors; therefore the MPF is semi-methylated in the whole plant.

Figure 5
DNA Methylation Analysis in the F1 Heterozygous Plants from the Reciprocal Crosses between Col and Ler.

De novo Methylation at MPF Does Not Alter the Flowering Behavior of Col

The 1.2 kb FLC-TE, when inserted into a Col FLC genomic construct, is sufficient to cause reduced expression of FLC in the transgenic lines [19], therefore, it is unclear whether the MPF has any functional relevance in FLC expression. Interestingly, FLC-Ler could strongly suppress the late flowering phenotype induced by FRIGIDA (FRI) and luminidependens (ld), but remains moderately sensitive to other mutants that up-regulate FLC like fca, fve, and fpa [37]. Recently, SUPPRESSOR OF FRI4 (SUF4) has been shown to bind to the promoter of FLC and directly interact with FRI and LD [38]. Moreover, FLC-Ler is again sensitive to FRI in a hen1-1 background [22] suggesting reversible epigenetic alteration might account for this weak response.

To address the role of the epigenetic variation at MPF in flowering time control, we used an RNAi approach to artificially methylate MPF in Col, the ecotype in which MPF is originally unmethylated. All transgenic plants used for further analyses had been tested for their successful de novo methylation at MPF by McrBC PCR (data not shown). Both flowering time and FLC expression analysis showed that de novo methylation at MPF does not alter the flowering behavior of wild type Col (Figure S5). However, since Col is an early flowering ecotype and its FLC expression level is relative low, we can not rule not the possibility that MPF may play a more prominent role in some late flowering backgrounds with higher FLC levels, like FRI or ld.

Genome-Wide Identification of ∼24 nt siRNAs Directed Epigenetic Natural Variation

The identification of MPF-siRNAs in Ler- but not Col-derived small RNA data made us wonder whether other loci are differentially and specifically matched by ∼24 nt siRNAs in these ecotypes. Because the MPSS small RNA sequencing data are not readily comparable with the 454 data (due to length differences in the sequencing reads), the small RNA datasets we used for a genome-wide identification are all 454 sequencing data, derived from two recent studies: 247,318 unique small RNA sequences from Col [16]and 25,981 unique small RNA sequences from Ler [15]. Also, to balance the enrichment of longer siRNAs in the sequencing results of AGO4 precipitated pool from Ler [15], we only selected for further analyses the siRNA reads of length no less than 23 nt, hence most of the miRNAs and short sRNAs are discarded from both the Col and Ler datasets. Since only the Col genome sequence is complete and the number of sequenced Col derived siRNAs is much greater than that of Ler, in this study, we only analyzed the regions matched by clusters of siRNAs present specifically in Ler, to exclude the interference of genetic alteration and also for higher reliability (please see materials and methods for details about the bioinformatic analysis). The unique siRNA sequences over 23 nt from both Col and Ler were mapped to the genome, respectively, and hits were counted in windows of 100 bp. Although the majority of the ∼24 nt small RNA clusters are conserved between Col and Ler (data not shown), after combining the overlapping regions, 68 unique loci were identified (including the MPF, locus #57; Table S1). These all shared the characteristic that they were matched by at least three distinct siRNAs within 300 bp in Ler but there were no hits in 1500 bp around the same region in Col (see Figure 6 for an example). Most of these loci are MPF-like, in that the siRNA matches are restricted to a small region (Figure S6), and their distribution in the genome is quite dispersed (Figure S7). Twenty-two loci are within known genes, and the other 46 are in intergenic regions (Table S2). An search of methylation data in Col (http://signal.salk.edu/cgi-bin/methylome) [25] demonstrated that all of these loci except locus #60 (located in a highly methylated region longer than several hundred kb, Table S1) were clearly lacking methylation; in addition, 28 loci contain repeat-associated sequences with one end beginning close to or within the small RNA matching region, and 15 loci had matching MPSS small RNA tags [12] (Table S1). We had also searched the website of DNA methylation information on the fourth chromosome in both Ler and Col background (http://chromatin.cshl.edu/cgi-bin/gbrowse/epivariation/) [2]. For the 13 loci (#44∼56) we identified on the fourth chromosome, six loci are found with methylation signals in their data: five loci (#46, 49, 52, 54, 55) are found specifically methylated in Ler as expected; one locus (#53) is methylated in both ecotypes but with a much higher methylation signal in Ler comparing to Col. Overall, our results are well supported by the two independent studies on epigenomics and epigenetic natural variation [2],[25].

Figure 6
Illustration of the Strategy for Identifying Loci Matched by Significant Level of ∼24 nt siRNA Specifically in Ler using Chromosome 3 as an Example.

We investigated the methylation pattern of locus #10 as an example using bisulfite sequencing. Extensive methylation was found in Ler (Figure S8), whereas the same region in Col remained unmethylated (data not shown). Other eight randomly selected loci were tested using methylation sensitive McrBC-PCR, and all of them, even those with the minimal number of three unique siRNAs, were methylated in Ler but not Col (Figure S9). Furthermore, we tested the methylation status of 44 loci (in which 42 have successful amplification results), including all the loci on Chromosome I and II,, by real-time McrBC-PCR (Figure 7A). From these analyses, 88% of the loci (37 out of 42) were found to be specifically methylated in Ler but not Col, and no locus was found only methylated in Col, strongly supporting the role of ∼24 nt siRNA in triggering epigenetic natural variation (Figure 7B).

Figure 7
DNA Methylation Analysis of 44 loci Varied in Small RNA Abundance between Col and Ler using Real-time McrBC-PCR.

For the features of these 68 loci showing evidence of small RNA-directed variation in DNA methylation, we looked at the genes either corresponding to or adjacent to these loci within less than 1 kb distance of flanking sequence. Among the 64 genes identified (some intergenic loci did not have flanking genes within 1 kb upstream and downstream), 22 genes were found matched by genic siRNA clusters; 18 genes contained siRNA clusters in their 5′ region and 24 genes with clusters in 3′ regions (Table S2). Among the 22 genic regions, six were transposable elements, consistent with the role of transposable element in epigenetic regulation [39]. Moreover, many of these genes are reported or predicted to have important functions (Table S2). Therefore, additional investigation of these genes may help us to understand the role of epigenetic alteration in evolution and natural variation.


Natural variation is a fundamental aspect of biology, and the implications of natural variation for deciphering the genetics of complex agricultural traits have been widely used. Recent progress in epigenetics has revealed mechanisms that can heritably regulate gene function without alteration of primary nucleotide sequences. Although the importance of epigenetic natural variation have become more and more noticed [3],[5], the role of epigenetic regulation in evolution has been less well studied due in part to limitations in the techniques used for the investigation of epigenetic variation among natural populations. Recently, substantial improvements in high-throughput analysis approaches have made it possible for the effective detection of variation in DNA methylation, histone modifications and small RNA abundances [2], [12][16],[25],[40]. Small RNAs that can target DNA methylation and chromatin modifications have been proposed as a potential source in inherited epigenetic differences [3], and the latest techniques offer rapid and relatively inexpensive means for the profiling of small RNAs. In this study, we discovered that a hAT element adjacent to the promoter of FLC, which we named MPF, is methylated and heterochromatic in Ler but not Col because of their differences in the abundance of corresponding siRNAs. Furthermore, by comparisons between Ler and Col of publicly available small RNA data produced by high-throughput sequencing [15],[16], we identified at least 68 loci that are matched by significant levels of ∼24 nt siRNAs, and 88% examined loci are methylated specifically in Ler but not Col. Our data reveal that there could be a considerable amount of small RNA-directed epigenetic natural variation between two ecotypes of Arabidopsis.

Although we identified dozens of loci, this analysis is still far from saturating. A Sadhu element (At2g10410), which was reported to be epigenetically silenced in Ler and other 18 strains but highly expressed in Col, did not show up among the 68 loci [41]; although bisulfite sequencing revealed that this element contains CNG and asymmetric methylation in Ler, which is presumably siRNA-directed to some extent [41]. Furthermore, hundreds of additional loci with one or two hits specifically in Ler (data not shown) may also be silent; these may be better characterized when additional Ler small RNA and genome sequence data become available.

Two examples of siRNA-associated, naturally-occurring epigenetic variation have been well studied in plants, including the phosphoribosylanthranilate isomerase (PAI) gene family in Arabidopsis and paramutation in maize [36]. In some Arabidopsis ecotypes, two PAI genes form an inverted repeats that may generate siRNAs and silence related members in the same gene family [42]. Paramutation, the allele-dependent transfer of heritable silencing state from one allele to another [36], is associated with another type of repeats, the tandem repeats. MEDIATOR OF PARAMUTATION 1 (MOP1) [43], whose deficiency disrupts paramutation, is an ortholog of the Arabidopsis RDR2 (RNA Dependent RNA polymerase 2), an essential component of RNAi machinery [6]. Notably, epigenetic variation at the MPF is quite different from these two cases: first, neither inverted- nor tandem-repeats features were found at MPF or elsewhere in the genome with similar sequence; second, the level of MPF-siRNAs is high in Ler and low in Col, instead of all-or-none; third, the restricted location of MPF-siRNAs is markedly different from the dispersed distribution of siRNAs from most inverted or tandem repeats [12].

Although paramutation phenomenon had been well documented, the details of how the silencing signal is transmitted from one allele to the other in the F1 heterozygote are still less understood. In our study, the diverse methylation status among individuals in F1 generation of the reciprocal crosses from Col×Ler indicate that there might be a reprogramming stage shortly after fertilization, in which the DNA or chromatin are open to modifiers like the MPF-siRNA containing RISC (RNA induced silencing complex) from Ler. However, this open stage must be very short, and when it is over, the epigenetic state, no matter active or silenced, will be maintained in the following developmental processes, so that the unmethylated state of Col-derived MPF and the methylated state of Ler-derived MPF could well maintained in Ler ♀×Col ♂line #2 (Figure S5A).

Thus far, the function of ∼24 nt siRNAs in plants has mainly been ascribed a role in silencing transposable elements and repeat-associated sequences [39]. Thus, it is unclear how Ler and Col, both with the functional RNAi machinery, might acquire many siRNA-directed epigenetically variable loci. One characteristic of MPF-siRNAs, their very restricted location (all matching to a region less than 50 bp), may confer on them more flexibility than other, larger silent loci.

Genetic variability (due to insertion, deletion and point mutation) occurs stochastically, at very low frequency, primarily irreversibly and is often recessive. In contrast, heritable epigenetic variability may be more appropriate to regulate, rather than disrupt or create, gene function, and thus may be an ideal or more dynamic force for evolutionary change of gene regulation.

Materials and Methods

Plant Materials

The Bd-0 (CS962), JI-1 (CS1248), Stw-0 (CS1538), Gr-3 (CS1202), Kin-0 (CS1273, CS6755), Da(1)-12 (CS917), Dijon-G (CS910), Di-1 (CS1108), Di-2 (CS1110), and La-0 (CS1299) accessions of Arabidopsis were acquired from ABRC; hen1-1 (Ler background), hen1-4 (Col background), and dcl1-9 mutants were described before [22]; cmt3-7, kyp-2, ago4-1, and drm2 5×Ler were generous gifts from Steve Jacobsen at UCLA. The AGO4 complementation lines were kindly provided by Gregory J. Hannon at CSHL and Yijun Qi at NIBS.

Small RNA Northern Blot

RNAs were extracted from 20-day-old, soil-grown plants. 32P end-labeled LNA probe was used for hybridization. Total RNAs were extracted using Trizol solution (Invitrogen) from 20-d-old soil-grown plants and dissolved in RNase free water. Small sized RNAs were enriched by adding the same volume of 8M LiCl and centrifuging at 12,000rpm for 30 min at 4°C. RNA filter hybridizations were carried out as previously described [44]. LNA probe [45] was used for hybridization (5′- cgagcAgtGgcGgatCcaaga-3′; uppercases represent modified nucleotides).

Chromatin Immunoprecipitation (ChIP) Assays

The ChIP assays were performed using 20-d-old soil-grown plants and as previously described [46]. Antibodies against H3K9me1 (07-450), H3K9me2 (07-441) and H3K9me3 (07-442) were from Upstate Biotechnology.

Construction of RNAi Vector

The genomic DNA from Col was used as a template for PCR amplification using the primer pairs (CX2004: ctcgagATTTTTGTGGTAATATATATATA and CX2005: agatctACATCAATCCAAGTTCAAGC, carrying the XhoI and BglII sites, respectively). The PCR products were sequentially inserted into pUCC-RNAi vector using the XhoI/BglII and BamHI/SalI sites for both the sense and antisense orientations. The stem-loop structured fragment was cut off and further cloned into a modified pCambia1302 vector (pCambia1302-LX-1) and used for plant transformation (XF718). All transgenic plants used for further analyses had been tested for their successful de novo methylation at MPF.

DNA Methylation Analysis: Southern Blot, Bisulfite Sequencing, and McrBC-PCR

Genomic DNA was isolated from rosette leaves of 4-week-old, soil-grown plants. Southern blots was performed as previously described [22] using PCR products amplified from FLC promoter as the probe (Figure 1). Bisulfite sequencing experiments were performed as previously described [47]. Primers with one end in FLC-TE and the other in FLC were designed to specifically amplify the FLC-TE and exclude other TEs in the genome. Only the cytosines within TE were counted for methylation analysis of FLC-TE in Figure 3. McrBC-PCR experiments were performed as previously described [34],[47], Equal amounts of McrBC-digested and non-digested DNA were used for PCR amplification. Real-time McrBC-PCR was performed to quantitatively measure the methylation level. The primer information for these experiments could be found in Supporting Information (Text S1).


After discarding smaller (<23 nt) and redundant sequences, 247,318 unique small RNA sequences in Col and 25,981 unique small RNA sequences in Ler were used for further analysis. All these siRNAs were mapped to the Col genome by BLAST [48] and PERL scripts, and the numbers of perfect matches were counted per 100 bp. Next, regions contain more than 3 hits within 300 bp in Ler but no hits in 1.5 kb at the same region in Col (Figure 6) were filtered out and overlapping regions were artificially combined. Col derived small RNA dataset was downloaded from NCBI GEO (GSE5228), and Ler derived small RNA sequences from NCBI GenBank (DQ927324-DQ972825). The Arabidopsis genome (Col) information was provided by TIGR (release version 5). Gene positions were annotated according to TAIR's SeqViewer data. Tandem gene duplication information was provided by TIGR (tandem_gene_duplicates.Arab_R5).

Supporting Information

Figure S1

Sequence Alignment of MPF Region in Col and Ler. Gray shades indicate the polymorphism; green box indicates the hAT element insertion; red region indicates the TSDs (Target Site duplication); blue region indicates the TIRs (Terminal Inverted Repeats).

(10.02 MB DOC)

Figure S2

Bisulfite Sequencing Analysis of DNA Methylation at the MPF in kyp-2 (A), dcl1-9 (B), and FLC-TE in Ler (C). The x axis represents the position of the cytosines within the sequencing region; n indicates the number of the sequenced clones. The B4 region spans the junction between TE (white box) and the first intron of FLC (gray box). Only the cytosines within TE were counted for methylation analysis of FLC-TE in Figure 3.

(9.01 MB TIF)

Figure S3

DNA Methylation Analysis of MPF among Arabidopsis Accessions using McrBC-PCR. (A) Summary of the TE insertions at the first intron of FLC in different ecotypes. The number under each accession represents the length of the TE insertion. (B) Accessions reported to contain transposable element inserted in the first intron of FLC. (C) Accessions that are closely related to Ler. Di-1 and La-0 do not contain the FLC-TE insertion. TE (methylated) and Actin (unmethylated) serve as controls for the McrBC-PCR assay.

(7.89 MB TIF)

Figure S4

Genomic Southern Blot Analysis for the Copy Number of hAT Element in Col and Ler. Genomic DNAs from both Col and Ler were digested by EcoR V, Hpa II and Nco I. A 160 bp region within the hAT element was PCR amplified and used as the probe for hybridization.

(13.48 MB TIF)

Figure S5

Target DNA Methylation to MPF in Col using RNAi Approach. (A) A diagram shows the 202 bp fragment used for the construction of the RNAi vector. (B) Flowering time analysis for the RNAi transgenic lines (T0 generation); each individual transgenic line was confirmed for their de novo methylation at MPF. (C) FLC expression analysis by real-time RT-PCR using the seedlings of one T2 transgenic line (homozygote for the transgene) which had been confirmed for its methylation at MPF.

(9.20 MB TIF)

Figure S6

Cluster Analysis. Small RNA hits were counted per 100 bp of a 1.5 kb range in Ler at the 68 loci identified in this study that have no less than 3 unique 24 nt siRNA matches within 300 bp (show in the central) and meanwhile no hits in a 1.5 kb region in Col (Figure 4).

(7.40 MB TIF)

Figure S7

Genome-wide Distribution of the 68 loci. Black bars represents loci with 3 to 5 hits within 300 bp; blue bars represents loci with 6 to 8 hits within 300 bp; red bars represents loci with more than 9 hits within 300 bp. Black rectangles represent the centromeric region.

(10.03 MB TIF)

Figure S8

RNA-directed DNA Methylation at Locus #10. (A) The siRNAs matched to this region. (B) Bisulfite sequencing results summarized in different sequence contexts; the x axis represents the position of the cytosines within the sequencing region; n indicates the number of the sequenced clones. The color coding of the cytosines in (A) matches the legend in (B).

(4.26 MB TIF)

Figure S9

DNA Methylation Analysis using McrBC-PCR. McrBC cuts at methylated sites in the template DNA, therefore resulting in attenuated PCR products for methylated loci; however, the PCR amplification of unmethylated loci will not be affected by McrBC digestion. (A) “Locus” represents the locus number tested from among the 68 loci that passed our filters; “hits” represents the unique siRNA hits within each 300 bp region. Locus #60 with the methylation signal in Col (Table S1) is also methylated in Ler. (B) The negative (Actin) and positive (MPF and FLC-TE) controls for McrBC-PCR. The 1.2 kb methylated FLC-TE is only present in Ler, therefore the PCR products (using primers matched to FLC on both sides of the TE but not within itself) from Ler derived samples are 1.2 kb larger than those from Col derived samples.

(6.99 MB TIF)

Table S1

The 68 Loci Identified in this Study.

(0.12 MB DOC)

Table S2

Basic Information of the Genes Corresponding or Adjacent to siRNA Clusters.

(0.11 MB DOC)

Text S1

Primer sequences.

(0.13 MB DOC)


We thank Steve Jacobsen for providing seeds of various mutants; Gregory J. Hannon and Yijun Qi for the AGO4 complementation lines; Ning Jiang at MSU for discussion of the nature of the insertion at MPF; Tong Ren at USTC for the careful revision of the manuscript.


The authors have declared that no competing interests exist.

This work was supported by National Basic Research Program of China (grant no. 2005CB522400), by Chinese Academy of Sciences (grant no. CXTD-S2005-2) to X.C, National Natural Science Foundation of China (grant nos. 30325015, 30430410 and 30621001) to X.C; the Meyers lab is supported by awards from the US National Science Foundation Plant Genome Research Program.


1. Goldberg AD, Allis CD, Bernstein E. Epigenetics: a landscape takes shape. Cell. 2007;128:635–638. [PubMed]
2. Vaughn MW, Tanurd Ic M, Lippman Z, Jiang H, Carrasquillo R, et al. Epigenetic Natural Variation in Arabidopsis thaliana. PLoS Biol. 2007;5:e174. [PMC free article] [PubMed]
3. Richards EJ. Inherited epigenetic variation–revisiting soft inheritance. Nat Rev Genet. 2006;7:395–401. [PubMed]
4. Rapp RA, Wendel JF. Epigenetics and plant evolution. New Phytol. 2005;168:81–91. [PubMed]
5. Rando OJ, Verstrepen KJ. Timescales of genetic and epigenetic inheritance. Cell. 2007;128:655–668. [PubMed]
6. Zaratiegui M, Irvine DV, Martienssen RA. Noncoding RNAs and gene silencing. Cell. 2007;128:763–776. [PubMed]
7. Vaucheret H. Post-transcriptional small RNA pathways in plants: mechanisms and regulations. Genes Dev. 2006;20:759–771. [PubMed]
8. Matzke MA, Birchler JA. RNAi-mediated pathways in the nucleus. Nat Rev Genet. 2005;6:24–35. [PubMed]
9. Cao X, Jacobsen SE. Role of the arabidopsis DRM methyltransferases in de novo DNA methylation and gene silencing. Curr Biol. 2002;12:1138–1144. [PubMed]
10. Cao X, Aufsatz W, Zilberman D, Mette MF, Huang MS, et al. Role of the DRM and CMT3 methyltransferases in RNA-directed DNA methylation. Curr Biol. 2003;13:2212–2217. [PubMed]
11. Cao X, Jacobsen SE. Locus-specific control of asymmetric and CpNpG methylation by the DRM and CMT3 methyltransferase genes. Proc Natl Acad Sci U S A. 2002;99(Suppl 4):16491–16498. [PMC free article] [PubMed]
12. Lu C, Tej SS, Luo S, Haudenschild CD, Meyers BC, et al. Elucidation of the small RNA component of the transcriptome. Science. 2005;309:1567–1569. [PubMed]
13. Henderson IR, Zhang X, Lu C, Johnson L, Meyers BC, et al. Dissecting Arabidopsis thaliana DICER function in small RNA processing, gene silencing and DNA methylation patterning. Nat Genet. 2006;38:721–725. [PubMed]
14. Kasschau KD, Fahlgren N, Chapman EJ, Sullivan CM, Cumbie JS, et al. Genome-wide profiling and analysis of Arabidopsis siRNAs. PLoS Biol. 2007;5:e57. [PMC free article] [PubMed]
15. Qi Y, He X, Wang XJ, Kohany O, Jurka J, et al. Distinct catalytic and non-catalytic roles of ARGONAUTE4 in RNA-directed DNA methylation. Nature. 2006;443:1008–1012. [PubMed]
16. Rajagopalan R, Vaucheret H, Trejo J, Bartel DP. A diverse and evolutionarily fluid set of microRNAs in Arabidopsis thaliana. Genes Dev. 2006;20:3407–3425. [PMC free article] [PubMed]
17. Baurle I, Dean C. The timing of developmental transitions in plants. Cell. 2006;125:655–664. [PubMed]
18. Gazzani S, Gendall AR, Lister C, Dean C. Analysis of the molecular basis of flowering time variation in Arabidopsis accessions. Plant Physiol. 2003;132:1107–1114. [PMC free article] [PubMed]
19. Michaels SD, He Y, Scortecci KC, Amasino RM. Attenuation of FLOWERING LOCUS C activity as a mechanism for the evolution of summer-annual flowering behavior in Arabidopsis. Proc Natl Acad Sci U S A. 2003;100:10102–10107. [PMC free article] [PubMed]
20. Lempe J, Balasubramanian S, Sureshkumar S, Singh A, Schmid M, et al. Diversity of flowering responses in wild Arabidopsis thaliana strains. PLoS Genet. 2005;1:109–118. [PMC free article] [PubMed]
21. Shindo C, Aranzana MJ, Lister C, Baxter C, Nicholls C, et al. Role of FRIGIDA and FLOWERING LOCUS C in determining variation in flowering time of Arabidopsis. Plant Physiol. 2005;138:1163–1173. [PMC free article] [PubMed]
22. Liu J, He Y, Amasino R, Chen X. siRNAs targeting an intronic transposon in the regulation of natural flowering behavior in Arabidopsis. Genes Dev. 2004;18:2873–2878. [PMC free article] [PubMed]
23. Shindo C, Lister C, Crevillen P, Nordborg M, Dean C. Variation in the epigenetic silencing of FLC contributes to natural variation in Arabidopsis vernalization response. Genes Dev. 2006;20:3079–3083. [PMC free article] [PubMed]
24. Sheldon CC, Conn AB, Dennis ES, Peacock WJ. Different regulatory regions are required for the vernalization-induced repression of FLOWERING LOCUS C and for the epigenetic maintenance of repression. Plant Cell. 2002;14:2527–2537. [PMC free article] [PubMed]
25. Zhang X, Yazaki J, Sundaresan A, Cokus S, Chan SW, et al. Genome-wide high-resolution mapping and functional analysis of DNA methylation in arabidopsis. Cell. 2006;126:1189–1201. [PubMed]
26. Chan SW, Henderson IR, Jacobsen SE. Gardening the genome: DNA methylation in Arabidopsis thaliana. Nat Rev Genet. 2005;6:351–360. [PubMed]
27. Gustafson AM, Allen E, Givan S, Smith D, Carrington JC, et al. ASRP: the Arabidopsis Small RNA Project Database. Nucleic Acids Res. 2005;33:D637–640. [PMC free article] [PubMed]
28. Jackson JP, Lindroth AM, Cao X, Jacobsen SE. Control of CpNpG DNA methylation by the KRYPTONITE histone H3 methyltransferase. Nature. 2002;416:556–560. [PubMed]
29. Malagnac F, Bartee L, Bender J. An Arabidopsis SET domain protein required for maintenance but not establishment of DNA methylation. Embo J. 2002;21:6842–6852. [PMC free article] [PubMed]
30. Tran RK, Zilberman D, de Bustos C, Ditt RF, Henikoff JG, et al. Chromatin and siRNA pathways cooperate to maintain DNA methylation of small transposable elements in Arabidopsis. Genome Biol. 2005;6:R90. [PMC free article] [PubMed]
31. Ebbs ML, Bender J. Locus-specific control of DNA methylation by the Arabidopsis SUVH5 histone methyltransferase. Plant Cell. 2006;18:1166–1176. [PMC free article] [PubMed]
32. Kakutani T. Genetic characterization of late-flowering traits induced by 2DNA hypomethylation mutation in Arabidopsis thaliana. Plant J. 1997;12:1447–1451. [PubMed]
33. Bao N, Lye KW, Barton MK. MicroRNA binding sites in Arabidopsis class III HD-ZIP mRNAs are required for methylation of the template chromosome. Dev Cell. 2004;7:653–662. [PubMed]
34. Rabinowicz PD, Palmer LE, May BP, Hemann MT, Lowe SW, et al. Genes and transposons are differentially methylated in plants, but not in mammals. Genome Res. 2003;13:2658–2664. [PMC free article] [PubMed]
35. Rubin E, Lithwick G, Levy AA. Structure and evolution of the hAT transposon superfamily. Genetics. 2001;158:949–957. [PMC free article] [PubMed]
36. Chandler VL, Stam M. Chromatin conversations: mechanisms and implications of paramutation. Nat Rev Genet. 2004;5:532–544. [PubMed]
37. Sanda SL, Amasino RM. Interaction of FLC and late-flowering mutations in Arabidopsis thaliana. Mol Gen Genet. 1996;251:69–74. [PubMed]
38. Kim S, Choi K, Park C, Hwang HJ, Lee I. SUPPRESSOR OF FRIGIDA4, encoding a C2H2-Type zinc finger protein, represses flowering by transcriptional activation of Arabidopsis FLOWERING LOCUS C. Plant Cell. 2006;18:2985–2998. [PMC free article] [PubMed]
39. Slotkin RK, Martienssen R. Transposable elements and the epigenetic regulation of the genome. Nat Rev Genet. 2007;8:272–285. [PubMed]
40. Zhang X, Henderson IR, Lu C, Green PJ, Jacobsen SE. Role of RNA polymerase IV in plant small RNA metabolism. Proc Natl Acad Sci U S A. 2007;104:4536–4541. [PMC free article] [PubMed]
41. Rangwala SH, Elumalai R, Vanier C, Ozkan H, Galbraith DW, et al. Meiotically stable natural epialleles of Sadhu, a novel Arabidopsis retroposon. PLoS Genet. 2006;2:e36. [PMC free article] [PubMed]
42. Bender J. DNA methylation and epigenetics. Annu Rev Plant Biol. 2004;55:41–68. [PubMed]
43. Alleman M, Sidorenko L, McGinnis K, Seshadri V, Dorweiler JE, et al. An RNA-dependent RNA polymerase is required for paramutation in maize. Nature. 2006;442:295–298. [PubMed]
44. Liu B, Li P, Li X, Liu C, Cao S, et al. Loss of Function of OsDCL1 Affects MicroRNA Accumulation and Causes Developmental Defects in Rice. Plant Physiol. 2005;139:296–305. [PMC free article] [PubMed]
45. Valoczi A, Hornyik C, Varga N, Burgyan J, Kauppinen S, et al. Sensitive and specific detection of microRNAs by northern blot analysis using LNA-modified oligonucleotide probes. Nucleic Acids Res. 2004;32:e175. [PMC free article] [PubMed]
46. Deng W, Liu C, Pei Y, Deng X, Niu L, et al. Involvement of the Histone Acetyltransferase AtHAC1 in the Regulation of Flowering Time via Repression of FLOWERING LOCUS C in Arabidopsis. Plant Physiol. 2007;143:1660–1668. [PMC free article] [PubMed]
47. Ding Y, Wang X, Su L, Zhai J, Cao S, et al. SDG714, a histone H3K9 methyltransferase, is involved in Tos17 DNA methylation and transposition in rice. Plant Cell. 2007;19:9–22. [PMC free article] [PubMed]
48. Altschul SF, Gish W, Miller W, Myers EW, Lipman DJ. Basic local alignment search tool. J Mol Biol. 1990;215:403–410. [PubMed]

Articles from PLoS Genetics are provided here courtesy of Public Library of Science
PubReader format: click here to try


Save items

Related citations in PubMed

See reviews...See all...

Cited by other articles in PMC

See all...


Recent Activity

Your browsing activity is empty.

Activity recording is turned off.

Turn recording back on

See more...