![]() | ![]() |
Formats:
|
||||||||||||||||||||||||
Copyright © 2004, American Society of Plant Biologists Institute for Integrative Genome Biology and Department of Botany and Plant Sciences, University of California, Riverside, California 92521 1To whom correspondence should be addressed. E-mail jian-kang.zhu/at/ucr.edu; fax 909-827-7115. Received March 22, 2004; Accepted June 3, 2004. This article has been cited by other articles in PMC.Abstract MicroRNAs (miRNAs) and short interfering RNAs (siRNAs) are small noncoding RNAs that have recently emerged as important regulators of mRNA degradation, translational repression, and chromatin modification. In Arabidopsis thaliana, 43 miRNAs comprising 15 families have been reported thus far. In an attempt to identify novel and abiotic stress regulated miRNAs and siRNAs, we constructed a library of small RNAs from Arabidopsis seedlings exposed to dehydration, salinity, or cold stress or to the plant stress hormone abscisic acid. Sequencing of the library and subsequent analysis revealed 26 new miRNAs from 34 loci, forming 15 new families. Two of the new miRNAs from three loci are members of previously reported miR171 and miR319 families. Some of the miRNAs are preferentially expressed in specific tissues, and several are either upregulated or downregulated by abiotic stresses. Ten of the miRNAs are highly conserved in other plant species. Fifty-one potential targets with diverse function were predicted for the newly identified miRNAs based on sequence complementarity. In addition to miRNAs, we identified 102 other novel endogenous small RNAs in Arabidopsis. These findings suggest that a large number of miRNAs and other small regulatory RNAs are encoded by the Arabidopsis genome and that some of them may play important roles in plant responses to environmental stresses as well as in development and genome maintenance. INTRODUCTION Whereas the roles of proteins as regulatory factors in development and adaptation to the environment are well known, the involvement of regulatory small RNA molecules is just emerging (Carrington and Ambros, 2003). MicroRNAs (miRNAs) and short interfering RNAs (siRNAs) are two classes of small noncoding RNAs with important roles in the regulation of gene expression in plants and animals. miRNAs are an abundant class of tiny RNAs that are often found to be conserved evolutionarily across species boundaries and are capable of regulating the expression of protein-coding genes in multicellular eukaryotes (Bartel, 2004). miRNAs are ~20 to 24 nucleotides in length, single-stranded noncoding RNAs derived predominantly from intergenic regions. miRNA genes are transcribed as long pre-miRNA transcripts ranging from 70 to 300 nucleotides that form fold-back structures in which mature miRNAs reside in either the 5′ or 3′ arm and are processed by DICER (Ambros et al., 2003a; Bartel and Bartel, 2003; Bartel, 2004). miRNAs are known to silence genes posttranscriptionally by targeting cognate mRNAs for degradation (Llave et al., 2002a; Palatnik et al., 2003; Tang et al., 2003) or translation repression (Lee et al., 1993; Reinhart et al., 2000; Aukerman and Sakai, 2003; Chen, 2004). The miRNA profile of Caenorhabditis elegans is probably the most complete, with the number of validated miRNA genes being ~95 (Ambros et al., 2003b; Lim et al., 2003). The total number of miRNAs in each organism is unknown but is estimated to represent ~1% of the number of coding genes (Grad et al., 2003; Lai et al., 2003; Lim et al., 2003; Bartel, 2004). This prediction is based on the fact that miRNAs are derived from evolutionarily conserved hairpin precursor RNAs. The total number of miRNAs in a plant is unknown but could also be ~1% of the total coding genes (~250 to 300 miRNAs in Arabidopsis thaliana). Three groups have independently reported miRNAs from Arabidopsis (Llave et al., 2002b; Park et al., 2002; Reinhart et al., 2002) and yet the number of miRNAs identified appears to be far from saturation. So far, only 43 miRNAs falling into 15 families have been reported in Arabidopsis (Bartel and Bartel, 2003; Bartel, 2004). Several of these miRNAs were demonstrated to play critical roles in leaf or flower development by targeting relevant developmental regulators for mRNA cleavage or translational repression (Aukerman and Sakai, 2003; Palatnik et al., 2003; Chen, 2004; Juarez et al., 2004; Kidner and Martienssen, 2004). Consistent with the important roles of miRNAs in the development of plants, the mutants dcl1, hen1-1, and hyl1 that are affected in miRNA accumulation all have severe developmental defects (Lu and Federoff, 2000; Chen et al., 2002; Golden et al., 2002; Schauer et al., 2002; Han et al., 2004; Vazquez et al., 2004). Phenotypic defects in these mutants could be attributed, at least in part, to blocking or lowered levels of miRNA production. siRNAs are double-stranded 21- to 26-nucleotide small RNAs derived from the processing of long double-stranded RNAs (Waterhouse et al., 2001; Plasterk, 2002). siRNAs were originally discovered in plants undergoing transgene silencing or infection with viruses (Hamilton and Baulcombe, 1999; Zamore et al., 2000). Endogenous small RNAs with characteristic features of siRNAs have been cloned from Arabidopsis (Llave et al., 2002b; Reinhart et al., 2002). Double-stranded RNA may be derived from the transcription of inverted-repeat loci because of converging promoters or may be the product of host- or viral-encoded RNA-dependent RNA polymerases (Dalmay et al., 2000; Mourrain et al., 2000; Aravin et al., 2001; Grishok et al., 2001; Sijen et al., 2001). siRNAs assemble in complexes termed RNA-induced silencing complexes and target homologous RNA sequences for endonucleolytic cleavage, which is referred to as RNA interference in animals, posttranscriptional gene silencing in plants, and quelling in fungi (Hamilton and Baulcombe, 1999; Hammond et al., 2000; Zamore et al., 2000). The biological roles of siRNA-mediated RNA silencing include protection of the genome against mobile DNA elements (Ketting et al., 1999; Tabara et al., 1999; Wu-Scharf et al., 2000) and resistance against viruses (Voinnet, 2001; Waterhouse et al., 2001). In addition to PTGS, siRNAs may be involved in transcriptional gene silencing by directing DNA methylation and chromatin modification (Mette et al., 2000; Aufsatz et al., 2002; Gong et al., 2002; Hamilton et al., 2002; Volpe et al., 2002; Zilberman et al., 2003). A distinguishing feature of plants is that they are sessile and thus have to cope with, rather than move to avoid, adverse environments. Plants have evolved sophisticated mechanisms to adapt to environmental stresses (Zhu, 2002). Abiotic stresses, such as drought, salinity, and cold, regulate the expression of thousands of genes in plants at both transcriptional and posttranscriptional levels. We are interested in exploring potential roles of regulatory small RNAs in abiotic stress tolerance and gene regulation. This study was performed with two objectives. One is to identify new miRNAs and siRNAs in Arabidopsis, given that only a small number of these small RNAs have been reported, and thus there is likely more to be found. The other objective is to discover miRNAs that might be regulated by abiotic stresses. We constructed a library of small RNAs from seedlings treated with cold, dehydration, salinity, and the plant stress hormone abscisic acid (ABA). Through library sequencing and analysis, we found 15 new families of miRNAs and many novel putative siRNAs. Some of the miRNAs are preferentially expressed in certain tissues, and several are either upregulated or downregulated by abiotic stresses. Our results have important implications for gene regulation under abiotic stresses and also contribute significantly to the long-term goal of having a complete profile of miRNAs and siRNAs in Arabidopsis. RESULTS Identification of 15 New Families of miRNAs from Stress-Treated Seedlings To identify miRNAs that might be regulated by abiotic stress, the expression of known Arabidopsis miRNAs under dehydration, salt, cold, and ABA treatments was tested by RNA gel blot analysis. However, we did not find substantial stress regulation on any of the known miRNAs (see Supplemental Figure C online; data not shown). Therefore, to identify novel miRNAs and miRNAs that might be regulated by abiotic stresses, we constructed a library of small RNA species from pooled seedlings exposed to cold, dehydration, salinity, and ABA. Small RNAs (15 to 26 bp) were recovered by size fractionation on denaturing polyacrylamide gels, eluted from the gel, ligated to 3′ and 5′ adapters, amplified, cloned, and sequenced. A BLAST search against the Arabidopsis database revealed that a major portion (90%) of the ~2500 cloned sequences appears to be breakdown products from rRNAs, tRNAs, and small nuclear RNAs. The proportion (9%) of putative regulatory small RNAs is slightly lower than published results, presumably because the library was made from seedlings subjected to different abiotic stresses that may cause enhanced breakdown of noncoding RNAs (rRNAs, tRNAs, and nucleolar RNAs). Several clones matched to chloroplast or mitochondrial sequences and may represent degradation or possibly regulatory products of organellar RNAs. A few clones do not have perfect matches to the Arabidopsis genome sequence and appear to be derived from RNAs of fungal/bacterial/viral origin. Thirteen previously published families of miRNAs from Arabidopsis are represented in our sequences. These include miR156, miR157, miR158, miR159, miR161, miR163, miR166, miR168, miR169, miR171, miR172, miR173, and miR319/miRJAW (Llave et al., 2002b; Park et al., 2002; Reinhart et al., 2002; Palatnik et al., 2003). The frequency of any one miRNA appearing in our sequenced library varied from 1 to 30 times (Table 1; see Supplemental Table 1 online).
Importantly, we identified many new miRNAs and putative siRNAs. The ability of RNA corresponding to the genomic sequences surrounding the clones, to fold into hairpin miRNA precursors, was applied to distinguish miRNAs from other potential small regulatory RNAs. This criterion, together with phylogenetic conservation of mature miRNAs, is now generally accepted as evidence for the existence of a miRNA (Ambros et al., 2003a). We analyzed miRNA precursors of up to 300 nucleotides for hairpin structures using the mfold program (see Supplemental Figure A online) (Zuker, 2003). The analysis revealed that we have cloned 21 new miRNAs from Arabidopsis that correspond to 29 loci (Table 1). Furthermore, on analyzing the Arabidopsis genome, we were able to predict three homologs of the cloned miRNAs miR399b and miR399f based on sequence similarity, the capacity of their precursor sequences to adopt hairpin structures, and their conservation in rice (Oryza sativa) (Figure 1F
Genomic Organization of the miRNA Genes Twenty of the miRNAs identified in this study, including the five predicted miRNAs, correspond to single copy loci in the Arabidopsis genome, whereas the other six miRNAs (miR399b and c, miR393a and b, miR398b and c, miR405a, b, c, and d, miR171b and c, miR389a.1, and miR389b.1) correspond to multiple loci (two to four) that are scattered in the genome (Table 1). Out of the 34 loci, 15, 9, and 5 come from chromosome 2, 1, and 5, respectively, and the remaining five loci are represented in chromosomes 3 and 4. Based on sequence homology and predicted fold-back structures, we were able to identify some miRNA homologs in rice, Lotus, Medicago, and Populus. Thirty-two miRNA loci matched with intergenic regions, whereas two loci correspond to either an intron or 3′ untranslated region (UTR) of a gene. miR408 is located in the antisense polarity to the 3′ UTR of At2g47020 (Table 1). miR402 originates from the intron of gene At1g77230, in the same orientation as the pre-mRNA. Small RNAs corresponding to the introns or exons of pre-mRNAs in the same polarity may be miRNAs or may simply be general degradation products. The miRNAs can be ascertained by the capacity of their precursors to form hairpin structures, together with expression studies. The miR402 precursor sequence has the ability to adopt a long and typical fold-back structure (see Supplemental Figure A online). In addition, this miRNA can be detected as a discrete band of 22 bp on RNA gel blots (Figure 5E
miR399 and its homologs correspond to six genomic loci (a, b, c, d, e, and f; b and c are identical in sequence but come from two different loci [Table 1]) in Arabidopsis with identical sequences or only a slight change in sequence. A sequence identical to miR399f is also found in rice, but the rice precursor sequence does not appear to form a fold-back structure; therefore, it cannot be ascertained as miRNA. However, rice has nine other loci with sequences that differ from the Arabidopsis miR399f in only one or two nucleotides (Figure 1F
Although miR389 has three loci in Arabidopsis, only two loci (on chromosomes 1 and 2) are able to adopt fold-back structures (Figures 3A and 3B
Conserved miRNA Gene Families Based on sequence similarity, the miRNAs were grouped into families. Five of the miRNA families (miR399, miR393, miR398, miR171b, and miR408) are highly conserved in rice (Table 1, Figure 1 One miRNA (miR319c) was represented by five clones in our library, and the sequence is highly similar to the previously cloned miR319/miRJAW and miR159 (Reinhart et al., 2002; Palatnik et al., 2003). miRJAW has two genomic loci, one each on chromosomes 3 and 5 (Palatnik et al., 2003). However, miR319c is derived from a distinct locus on chromosome 2 with one nucleotide change. miR319c and miRJAW have the same size (20 nucleotides), and the only difference is that miR319c has an additional U at its 5′ end. The presence of an additional U in miR319c makes it more similar to miR159 because both have three uridines at the 5′ end. This may explain why we detected two equally strong signals at 20- and 21-nucleotide size positions using miR319c as a probe (Figure 5G Predicted Targets of the miRNAs To predict potential targets of the miRNAs identified in this study, we searched various databases for Arabidopsis mRNAs with sequence complementarity to the miRNAs, allowing up to three mismatches without any gap (Rhoades et al., 2002) and treating G:U pair as a match. In some cases, there is no predictable target with three or fewer mismatches, and the best match is regarded as a putative target (Table 2). Complementarity to the 5′ end of the miRNA has been given due importance while predicting a target (Bartel, 2004). This complementarity requirement includes a 7-nucleotide perfect or near perfect match near the 5′ terminus of the miRNA (Lai et al., 2003; Lewis et al., 2003; Stark et al., 2003; Doench and Sharp, 2004). The 3′ region of the miRNA is less critical (Bartel, 2004) but can modulate its activity under certain circumstances (Doench and Sharp, 2004).
Our analysis predicted 51 potential targets for 21 of the newly identified miRNAs (Table 2). The plant miRNA target sites are predominantly found within the protein-coding segment of the mRNAs (Rhoades et al., 2002), whereas animal miRNAs appear to primarily target the 3′ UTRs (Enright et al., 2003; Lewis et al., 2003; Stark et al., 2003). Consistent with the earlier findings in Arabidopsis (Rhoades et al., 2002), 47 of our predicted target sites fall in the open reading frames (ORFs). Five of the predicted target sites lie in the 3′ UTR regions (At1g07650 [miR404], At1g12010 and At5g49870 [miR407a], At1g54870 [miR407b], and At2g47020 [miR408]). Interestingly, two miRNAs (miR399 and miR398) exhibit complementarity to the 5′ UTR regions of their respective targets. miR399 appears to target four sites in the 5′ UTR of At2g33770 (a putative ubiquitin conjugating enzyme E2) (Figure 4
We found that if more than one mRNA is predicted as the target of a miRNA, these targets tend to belong to the same gene family. For instance, miR400 is predicted to target more than 10 genes of the pentatricopeptide repeat (PPR) protein family; miR393 appears to target five genes belonging to the TIR1 (an F-box protein important for signaling by the phytohormone auxin) family; miR406 has the potential to target two members of spliceosome-related genes; miR397 appears to target two members of the laccase family; and miR389a.1 is capable of targeting four unknown proteins in the same family (Table 2). miR319c, a homolog of miR319/miRJAW, is likely to target the Teosinite branched1, cycloidea, and proliferating cell nuclear antigen factors family of transcription factors (Palatnik et al., 2003). miR399 family members are predicted to target the same gene, At2g33770 (ubiquitin conjugating enzyme E2). miRNAs that are represented by more than one member may recognize a consensus target sequence and, hence, might act on a common target or they might have tissue/stage-specific roles. In addition, different members of an miRNA family with slight changes in nucleotide sequence might also target different genes. For example, miR399b miRNA displays complementarity with a gene encoding a vesicle-associated membrane protein (At4g00170) in addition to the common target (At2g33770) of this family of miRNAs. miR397a is likely to target members of the laccase family, whereas another family member, miR397b, displays better complementarity with another gene (At3g60250, a casein kinase). Therefore, a slight change in miRNA sequence may alter its target preferences, distinguishing its role from that of other members of the same family. The previously reported Arabidopsis miR161 is predicted to target PPR genes At1g06580(3), At1g62720(3), and At1g62670(3) (numbers in parenthesis are mismatches) (Rhoades et al., 2002). We have discovered an miRNA, miR400, that is different from miR161 and yet it displays perfect or near perfect complementarity with some of the predicted targets of miR161 [At1g06580(0), At1g62720(1), and At1g62670(1)]. This suggests that two completely different miRNAs may target the same gene. The attack on one gene by more than one miRNA could be a matter of cooperative regulation. Alternatively, it may indicate a redundant control of target gene expression by multiple miRNA genes or the deployment of different miRNAs for the regulation of a target in a developmental- or tissue-specific manner. Indeed, miR161 and miR400 have different spatial expression patterns. miR161 is strongly expressed in the stem and inflorescence (Reinhart et al., 2002), whereas miR400 is almost below the detection limit in stem and inflorescence but is moderately expressed in leaves and roots (Figure 5H Some of the miRNA complementary sites are highly conserved in different members of the same gene family in Arabidopsis or in mRNA targets from different plant species (Table 3). For instance, miR393 complementary sites are highly conserved in F-box proteins in Arabidopsis, rice, and maize (Zea mays) (Table 3). miR397 is predicted to target laccases, and the target sites are conserved in Arabidopsis, rice, tobacco (Nicotiana tabacum), and Populus (Table 3). miR408 may target mRNAs encoding plantacyanins, and the complementary sites are conserved in rice, maize, chickpea (Cicer arietinum), and spinach (Spinacia oleracea) (Table 3). The complementary sites of miR389a.1 are conserved in mRNAs that encode proteins belonging to the same family (Table 3). The complementary sites are also conserved in spliceosomal proteins, which are predicted to be targeted by miR406. The conservation in the target sequence is observed not only at the amino acid level but also at the nucleotide level, including sequences at codon position 3 (Table 3). Furthermore, the sequence conservation in the target genes is markedly higher in the section corresponding to the 5′ end of the miRNAs (Table 3).
Tissue and Developmental Expression Patterns of miRNAs Preferential expression of an miRNA in specific tissues or developmental stages may suggest a role in development for the miRNA. We investigated the tissue distribution of the newly identified miRNAs by RNA gel blot analysis on total RNA samples from various tissues of mature plants as well as young seedlings. We were able to detect a signal of the expected size for 13 miRNAs identified through cloning (Figure 5 Most of the previously known miRNAs in Arabidopsis are strongly expressed in floral tissues (Llave et al., 2002b; Park et al., 2002; Reinhart et al., 2002). This pattern is mirrored by miR319c, which is expressed at moderate levels in leaves, roots, and young seedlings but is very strongly expressed in stems and inflorescence (Figure 5G However, other miRNAs in this study have quite divergent expression patterns. The most striking is that of miR398b, which could not be detected in the inflorescence (Figure 5A We also examined the expression of the miRNAs in dcl1-9 and hen1-1 mutant seedlings and compared the mutants to their wild-type background (Landsberg erecta ecotype). Consistent with the important roles of DCL1 (Reinhart et al., 2002) and HEN1 (Park et al., 2002) in miRNA accumulation, most of the miRNAs show substantially reduced expression in hen1-1, and several display reduced expression in dcl1-9 (Figure 5 Abiotic Stress Regulation of miRNAs To test whether the expression of any of the miRNAs is regulated by abiotic stresses, RNA gel blot analysis was performed on 2-week-old seedlings grown on MS-agar plates without stress treatment or treated with cold (0°C for 24 h), 100-μM ABA for 3 h, or dehydration (seedlings in agar plates were exposed to dry air), or seedlings were transferred to a blotting paper saturated with 300-mM NaCl for 5 h. We found that the expression of several of the miRNAs is either upregulated or downregulated by one or more of the stress treatments (Figure 6
Other Novel Endogenous Small RNAs Large censuses of small RNA cDNA libraries have revealed a significant class of other 21- to 22-nucleotide RNAs (Reinhart and Bartel, 2002; Ambros et al., 2003b; Aravin et al., 2003). These small RNAs differ from miRNAs in that they do not arise from hairpin precursors. Nevertheless, they display a strong size bias that suggests regulated processing by a Dicer-like activity. Furthermore, unlike some miRNAs, these small RNAs are generally not conserved phylogenetically. The recent discovery of a large class of tiny noncoding RNAs from C. elegans that share some features with miRNAs is an example for exceptional cases like this (Ambros et al., 2003b). These tiny noncoding RNAs are not even conserved within Caenorhabditis species (Ambros et al., 2003b). Another class of small RNAs, repeat-associated silencing RNAs, has been reported from Drosophila (Aravin et al., 2003). Repeat-associated silencing RNAs were noticed to be derived potentially from genomic regions rich in repeats (Aravin et al., 2003). In this study, we have identified 102 endogenous small RNAs (putative siRNAs) that cannot be designated as miRNAs (Table 4). The putative siRNAs represent a heterogeneous class with respect to size and origin. The size ranged between 15 and 25 nucleotides, although the majority are in the range of 20 to 24 nucleotides. The small RNAs do not exhibit nucleotide preferences at their 5′ end. The majority have a single locus in the Arabidopsis genome, although some (28) correspond to multiple loci (2 to 64). Twelve of the small RNAs corresponding to multiple loci appear to be derived from repeat-rich regions or centromeric regions (Table 4). Two small RNAs (P22-H04 and P43-A12) are derived from putative retrotransposons. The vast majority is found in intergenic regions, but some match to coding regions and introns in the antisense polarity. We consider those in the sense polarity as possible degradation products. Some of the putative siRNAs have perfect complementarity with transposases, LOB, AIG1, 23s rRNA, F-box protein, or pre-mRNA splicing factor. Some siRNAs display complementarity with their putative target genes with one to three mismatches. For instance, P78-C02 is complementary to At2g27152 (AAO3, a drought stress-inducible ABA biosynthesis enzyme) with two mismatches, whereas P52-B05 is complementary with a putative lipoxygenase with three mismatches. Some of these small RNAs may actually be miRNAs but are not considered as miRNAs at this time because the precursor sequences cannot adopt a fold-back structure using the mfold program. We were unable to predict targets for the majority of the putative siRNAs identified. These small RNAs might target promoter regions of protein coding genes for transcriptional gene silencing (Park et al., 2002; Ambros et al., 2003b; Finnegan and Matzke, 2003; Bartel, 2004). The existence of diverse and vast numbers of endogenous small RNAs suggest that they may play important roles in the silencing of endogenous genes, in addition to their proposed roles in defense against transposons and viral invasion (Voinnet, 2002; Ambros et al., 2003b).
DISCUSSION Small noncoding RNAs have recently emerged as important regulators of both transcriptional and posttranscriptional gene silencing. The identification of the entire set of small RNAs from an organism is of fundamental importance and would lay the foundation for understanding gene regulation involving small RNAs. The endeavor is as important as mining genes that code for proteins. The establishment of a comprehensive list of miRNAs from any organism will be instrumental for not only gene regulation studies but also for genome organization (e.g., transposon activity and chromatin assembly), phylogenetic comparison, comparative studies of development, and other evolutionary analysis. By sequencing a library of small RNAs from seedlings treated with different abiotic stresses, we are able to add 15 new families of miRNAs to the previously published 15 families in Arabidopsis. In addition, we identified new members of the previously reported miR171 and miR319 families (Table 1, Figure 1 Some miRNAs are encoded by single copy genes, others by multiple genes. Most miRNA genes come from intergenic regions, but a sizable minority of miRNAs (about one-quarter of the human miRNA genes) occurs in the introns of protein coding genes, preferentially in the same orientation as the mRNAs (Bartel, 2004). All miRNAs identified in this study are intergenic in origin with two exceptions: miR402 is derived from the first intron of At1g77230, and miR408 is in the antisense polarity to the 3′ UTR of At2g47020. The majority of the miRNAs described in this study are located in the 3′ arm of the fold-back structure (Table 1). Most miRNAs have a 5′ U preference that is similar to earlier reports (Bartel and Bartel, 2003; Bartel, 2004), although a few miRNAs begin with a 5′ A or C. Analysis of the thermodynamic profiles of miRNAs from plants revealed that the strand having low internal stability at the 5′ end is retained within the RNA-induced silencing complex (Khvorova et al., 2003). Some of the miRNAs reported in this study conform to the features described by Khvorova et al. (2003) and Schwarz et al. (2003), although some show considerable variability (see Supplemental Table 2 online). Ten of the miRNAs that we have identified have clear orthologs or homologs in rice, Lotus, Medicago, or Populus, suggesting that they conduct important and perhaps conserved functions. On the other hand, many of the miRNAs are not conserved. These nonconserved miRNAs may play more species-specific roles. Computational prediction of putative targets for miRNAs in plants has been facilitated by their perfect or near perfect complementarity to target genes (Rhoades et al., 2002). By contrast, bioinformatic approaches to identifying mRNA targets for animal miRNAs are complicated by the partial complementarity between an miRNA and its target (Lewis et al., 2003). Sixty-one targets, predominantly transcription factors, were predicted previously in Arabidopsis (Rhoades et al., 2002; Bartel and Bartel, 2003). The predicted targets that are not transcription factors include PPR mRNAs by miR161, DCL-1 by miR162, and Argonaute by miR168 (Rhoades et al., 2002). In this study, we were able to predict 51 genes that are likely targeted by the miRNAs described here (Table 2). These targets include metabolic enzymes (laccases and cytochrome c oxidase), enzymes in the ubiquitination pathway (ubiquitin conjugating enzyme E2 and TIR1/ubiquitin ligase), transcription factors (Scarecrow-like transcription factors), signal transduction components (protein kinase, casein kinase, and leucine-rich repeat–transmembrane [LRR-TM] kinase), and genes involved in RNA processing (spliceosomal proteins and PPR proteins), protein synthesis (peptide chain release factor), and transcriptional gene silencing [DNA glycosylase and Su(var)3-9, Enhancer-of-zeste, Trithorax (SET) domain proteins]. Our predicted targets appear to be involved in more diverse biological processes and functions and not predominantly in transcriptional regulation as predicted in earlier studies (Rhoades et al., 2002). It was hypothesized that a complementary site in the coding region of an mRNA may lead to the cleavage of the target, whereas target sites located in 3′ UTRs might attenuate translation (Bartel, 2004). Consistent with this view, Arabidopsis miRNAs that target ORFs have largely been shown to be involved in the cleavage of target mRNAs (Llave et al., 2002a; Kasschau et al., 2003; Palatnik et al., 2003; Xie et al., 2003; Vazquez et al., 2004). Perhaps as an exception, Arabidopsis miR172 represses translation despite having a complementary site within the ORF of its target mRNA, AP2 (Aukerman and Sakai, 2003; Chen, 2004). The complementary sites for the animal miRNAs have been predicted to reside in the 3′ UTRs of target genes (Lewis et al., 2003; Bartel, 2004). This bias might reflect a mechanistic preference for translational repression over target cleavage. Only in a few cases are 3′ UTRs predicted as the target sites for plant miRNAs. For example, the complementary sites for Arabidopsis miRNAs, miR156/157, miR156, and miR169, reside in the 3′ UTRs of At1g53160, At2g33810, At1g17590, and At1g54160, respectively (Rhoades et al., 2002). The target sites for five of the predicted targets described here are within the 3′ UTR regions. Until now, no miRNA from animals or plants has been predicted to target the 5′ UTR of a gene. In this work, we find that two miRNA families (miR399 and miR398) are likely to target 5′ UTRs. miR398 family members are complementary to the 5′ UTR of At3g15640 (cytochrome c oxidase). The miR399 has multiple complementary sites (four in Arabidopsis; Figure 4 The targeting of the 3′ UTR of At2g47020 by miR408 is particularly interesting. The translation of At2g47020 mRNA may be repressed by miR408. At2g47020 is predicted to encode a peptide chain release factor important in mRNA translation. This suggests that the translational machinery may be targeted by an miRNA for translational regulation. The targeting of a ROS1-like DNA glycosylase and a SET domain protein by miR402 and miR407 miRNAs, respectively, is also very interesting. ROS1 is a DNA glycosylase/lyase required for the repression of siRNA-triggered transcriptional gene silencing by DNA demethylation (Gong et al., 2002). SET domain proteins are also important in transcriptional gene silencing by functioning in histone Lys methylation (Malagnac et al., 2002). These, together with previous evidence for the targeting of DCL1 by an miRNA (Xie et al., 2003), point to an important role of miRNAs in the regulation of machineries involved in gene silencing. The concentration and activities of many cellular proteins are regulated by the ubiquitin pathway. Ubiquitination requires a cascade of three enzymatic activities for activating (E1), conjugating (E2), and ligating (E3) ubiquitin to a substrate. In this study, we have identified two miRNAs (miR399 and miR393) that may target components (ubiquitin conjugating enzyme E2 and ubiquitin ligase/TIR1) of the ubiquitination pathway. The result suggests a role for miRNAs in regulating the process of ubiquitination. Fungal laccases are well known for their roles in morphogenesis, pathogenicity, and lignin degradation (Mayer and Christopher, 2002). The physiological roles of higher plant laccases are relatively unknown despite their existence as multigene families (Mayer and Christopher, 2002). The identification of miRNAs (miR397 family) that can target laccases provides a unique tool to investigate the function of laccases in higher plants. Similarly, the identification of miR389a.1 that is predicted to target four unknown proteins in the same family (Tables 2 and 3) will be valuable for investigating the function of these proteins. Overexpression of miR389a.1 or miR389b.1 may be an effective way to silence these genes (either by cleavage or translational repression) and thus reveal their function. Several of the miRNAs discovered in this study are either upregulated or downregulated by abiotic stresses, suggesting that they may be involved in stress-responsive gene expression and stress adaptation. Consistent with this notion, the Arabidopsis hen1-1 and dcl1-9 mutants that are impaired in the production of some miRNAs are hypersensitive to abiotic stresses (our unpublished data). Stress-induced or upregulated miRNAs are expected to target negative regulators of stress responses or positive regulators of processes that are inhibited by stresses (e.g., cell division and expansion). On the other hand, stress downregulated miRNAs may repress the expression of positive regulators and/or stress upregulated genes. The significance of miR389a.1 miRNA downregulation by ABA and several stresses is unclear because this miRNA appears to target several genes encoding unknown proteins. The stress upregulation of miR393, which targets TIR1, suggests that stress may cause increased TIR1 mRNA degradation or translational repression. Because TIR1 is a positive regulator of auxin signaling by promoting the degradation of Aux/IAA proteins through ubiquitination (Dharmasiri and Estelle, 2002), the miRNA inhibition of TIR1 would downregulate auxin signaling and seedling growth. Thus, the upregulation of miR393 may contribute to the inhibition of plant growth under stress conditions. Future analysis of TIR1 mRNA and protein levels in plants with altered miR393 expression, or expressing miR393-resistant forms of TIR1 mRNA, will help to test this hypothesis. The identification of endogenous siRNAs provides insights into the complexity of RNA-guided regulation. In this study, we have also uncovered a large number (102) of putative endogenous siRNAs that were not described previously, and these are distributed across all five chromosomes. We tested several of these small RNAs but were unsuccessful in detecting them on RNA gel blots, likely reflecting their low abundance. Unlike miRNAs, only a few of these small RNAs were represented by more than one clone in the sequenced library. These putative siRNAs are not conserved in other plants, which suggests species-specific endogenous RNA silencing mechanisms. The identification of unique intergenic regions as the sites of origin for most of the siRNAs indicated that they might be expressed independently. However, where multiple loci were identified for an siRNA, the origin was ambiguous. In plants, siRNAs have been suggested to mediate repression of retroelements via histone methylation as well as asymmetric DNA methylation (Zilberman et al., 2003) and to mediate systemic silencing (Hamilton et al., 2002). Our finding that several of the siRNAs might target Athila-related genes suggests that the taming of transposable elements is of critical importance for the plant genome. The production of siRNAs involved in controlling the mobility of transposable elements may also operate in Drosophila (Aravin et al., 2003). In addition, our identification of small RNAs that might target putative LRR-TM kinase, 23s rRNA, AAO3, P5CDH, lipoxygenase, or LOB genes raises the possibility that a wide range of protein coding and noncoding genes may be subjected to silencing by endogenous siRNAs. Several of the miRNAs reported in this study (miR402, miR403, miR397a, miR171b, and a predicted miRNA, miR389b.2) have also been identified by an NSF 2010 project, which can be accessed through the Arabidopsis Small RNA Project Web site of the Carrington lab (http://cgrb.orst.edu/smallRNA/). Despite the sequencing by the NSF 2010 project of several libraries from different genetic backgrounds and tissues, the overlap with ours is minimal. This suggests that the cloning of miRNAs in Arabidopsis is not saturated, and more miRNAs have yet to be identified. Therefore, continued efforts in the future are needed to identify the complete set of miRNAs and other small RNAs from Arabidopsis. METHODS Cloning of miRNAs from Arabidopsis Total RNA was isolated from different samples (Arabidopsis thaliana ecotype Columbia; untreated, treated with cold stress 0°C for 24 h, 300 mM NaCl for 5 h, dehydration for 10 h, or 100-μM ABA for 3 h). Cloning of miRNAs was performed as described (Elbashir et al., 2001; Llave et al., 2002a). In brief, ~600 μg of RNA was resolved through two lanes on a denaturing 15% polyacrylamide gel. Labeled DNA oligonucleotides were used as size standards. A gel fragment spanning the size range of 15 to 26 nucleotides was excised, and RNA was eluted overnight with 0.4 M NaCl at 4°C. The RNA was recovered by ethanol precipitation, dephosphorylated, and again ethanol precipitated. Then, the small RNAs were ligated sequentially to 5′ and 3′ RNA/DNA chimeric oligonucleotide adapters (Dharmacon Research, Boulder, CO). The 3′ adapter oligonucleotide (5′-pUUUaaccgcatccttctcx-3′; uppercase, RNA; lowercase, DNA; p, phosphate; x, inverted deoxythymidine) (Elbashir et al., 2001) possessing a 5′ monophosphate and a 3′ inverted deoxythymidine to prevent self ligation, was then ligated to the dephosphorylated small RNA. The ligation product was recovered from the gel, 5′ phosphorylated, and the RNA was recovered by ethanol precipitation. Next, the 5′ adapter (5′tactaatacgactcactAAA; uppercase, RNA; lowercase, DNA) was ligated to the phosphorylated ligation product as described above. The ligation products from the second ligation reaction were excised and eluted from the gel. Reverse transcription reaction was performed using the RT primer (5′-TTTTCTGCAGAAGGATGCGGTTAAA-3′; bold, PstI site). This was followed by PCR using the reverse (RT primer) and forward (5′-AAACCATGGTACTAATACGACTCACTAAA-3′; bold, NcoI site) primers. The PCR product was purified by phenol/chloroform extraction, ethanol precipitated, and digested with PstI and NcoI. After additional phenol/chloroform extraction and ethanol precipitation, the digested products were ligated to a PstI-NcoI–linearized pGEM-T Easy vector using T4 DNA ligase. The inserts from individual colonies were amplified by T7 and SP6 primers and sequenced. The sequences were subsequently processed and used for BLAST analysis against the Arabidopsis sequences in The Arabidopsis Information Resource database (http://www.arabidopsis.org/) and other plant sequences in the National Center for Biotechnology Information database (http://www.ncbi.nlm.nih.gov/BLAST/). Prediction of Fold-Back Structures Fold-back secondary structures were predicted with the computer program RNA fold, using the Zuker algorithm (Zuker, 2003). The cloned RNA was folded with flanking sequence in three contexts, either 300 bp of upstream sequence plus 20 to 30 bp of downstream sequence or vice versa or 150 bp upstream and 150 bp downstream of the cloned sequence. RNA Gel Blot Analysis Total RNA was isolated from 2-week-old seedlings (Columbia ecotype) grown on MS nutrient-agar plates (untreated, treated with cold stress 0°C for 24 h, 300 mM NaCl for 5 h, dehydration for 10 h, or 100-μM ABA for 3 h). Total RNA was also isolated from different tissues (stem, leaf, inflorescence, and root) using the TRIZOL reagent (Invitrogen, Life Technologies, Carlsbad, CA). Forty micrograms of total RNA was loaded per lane and resolved on a denaturing 15% polyacrylamide gel and transferred electrophoretically to Hybond N+ (Amersham, Piscataway, NJ) membranes. Membranes were UV cross-linked and baked for 1 h at 80°C. DNA oligonucleotides complementary to small RNA sequences were end labeled with γ-32P-ATP using T4 polynucleotide kinase (Invitrogen). Blots were prehybridized for at least 1 h and hybridized overnight using PerfectHYB Plus buffer (Sigma, St. Louis, MO) at 38°C. Blots were washed three times (two times with 2× SSC [1× SSC is 0.15 M NaCl and 0.015 M sodium citrate] and 0.1% SDS for 20 min and once with 1× SSC and 0.1% SDS for 20 min) at 50°C. The membranes were briefly air dried and then exposed to a phosphorimager or autoradiographed. [Supplemental Data]
Acknowledgments We thank Xuemei Chen for providing hen1-1 and dcl1-9 seeds and Becky Stevenson for technical assistance during the course of this work. We also thank Shou-Wei Ding and Andre Jaggendorf for critical reading of the manuscript. We thank Anastasia Khavorova and Annaleen Vermeulen for their help in analyzing thermodynamic profiles of miRNAs. This work was supported by National Institutes of Health Grant R01GM0707501 and National Science Foundation Grant IBN-9808398 to J.K.Z. Notes The author responsible for distribution of materials integral to the findings presented in this article in accordance with the Instructions for Authors (www.plantcell.org) is: Jian-Kang Zhu (jian-kang.zhu/at/ucr.edu). Online version contains Web-only data.Article, publication date, and citation information can be found at www.plantcell.org/cgi/doi/10.1105/tpc.104.022830. References
|
PubMed related articles
Your browsing activity is empty. Activity recording is turned off. |
|||||||||||||||||||||||
Science. 2003 Jul 18; 301(5631):336-8.
[Science. 2003]Cell. 2004 Jan 23; 116(2):281-97.
[Cell. 2004]RNA. 2003 Mar; 9(3):277-9.
[RNA. 2003]Plant Physiol. 2003 Jun; 132(2):709-17.
[Plant Physiol. 2003]Science. 2002 Sep 20; 297(5589):2053-6.
[Science. 2002]Curr Biol. 2003 May 13; 13(10):807-18.
[Curr Biol. 2003]Genes Dev. 2003 Apr 15; 17(8):991-1008.
[Genes Dev. 2003]Mol Cell. 2003 May; 11(5):1253-63.
[Mol Cell. 2003]Genome Biol. 2003; 4(7):R42.
[Genome Biol. 2003]Cell. 2004 Jan 23; 116(2):281-97.
[Cell. 2004]Nature. 2001 Jun 14; 411(6839):834-42.
[Nature. 2001]Science. 2002 May 17; 296(5571):1263-5.
[Science. 2002]Science. 1999 Oct 29; 286(5441):950-2.
[Science. 1999]Cell. 2000 Mar 31; 101(1):25-33.
[Cell. 2000]Plant Cell. 2002 Jul; 14(7):1605-19.
[Plant Cell. 2002]Annu Rev Plant Biol. 2002; 53():247-73.
[Annu Rev Plant Biol. 2002]Plant Cell. 2002 Jul; 14(7):1605-19.
[Plant Cell. 2002]Curr Biol. 2002 Sep 3; 12(17):1484-95.
[Curr Biol. 2002]Genes Dev. 2002 Jul 1; 16(13):1616-26.
[Genes Dev. 2002]Nature. 2003 Sep 18; 425(6955):257-63.
[Nature. 2003]RNA. 2003 Mar; 9(3):277-9.
[RNA. 2003]Nucleic Acids Res. 2003 Jul 1; 31(13):3406-15.
[Nucleic Acids Res. 2003]Genes Dev. 2002 Jul 1; 16(13):1616-26.
[Genes Dev. 2002]Genes Dev. 2002 Jul 1; 16(13):1616-26.
[Genes Dev. 2002]Nature. 2003 Sep 18; 425(6955):257-63.
[Nature. 2003]Cell. 2002 Aug 23; 110(4):513-20.
[Cell. 2002]Cell. 2004 Jan 23; 116(2):281-97.
[Cell. 2004]Genome Biol. 2003; 4(7):R42.
[Genome Biol. 2003]Cell. 2003 Dec 26; 115(7):787-98.
[Cell. 2003]Genes Dev. 2004 Mar 1; 18(5):504-11.
[Genes Dev. 2004]Cell. 2002 Aug 23; 110(4):513-20.
[Cell. 2002]Genome Biol. 2003; 5(1):R1.
[Genome Biol. 2003]Cell. 2003 Dec 26; 115(7):787-98.
[Cell. 2003]Nature. 2003 Sep 18; 425(6955):257-63.
[Nature. 2003]Cell. 2002 Aug 23; 110(4):513-20.
[Cell. 2002]Genes Dev. 2002 Jul 1; 16(13):1616-26.
[Genes Dev. 2002]Trends Biochem Sci. 2000 Feb; 25(2):46-7.
[Trends Biochem Sci. 2000]Mol Cell. 2003 May; 11(5):1253-63.
[Mol Cell. 2003]Genes Dev. 2003 Apr 15; 17(8):991-1008.
[Genes Dev. 2003]Proc Natl Acad Sci U S A. 2004 Jan 6; 101(1):360-5.
[Proc Natl Acad Sci U S A. 2004]Plant Cell. 2002 Jul; 14(7):1605-19.
[Plant Cell. 2002]Curr Biol. 2002 Sep 3; 12(17):1484-95.
[Curr Biol. 2002]Genes Dev. 2002 Jul 1; 16(13):1616-26.
[Genes Dev. 2002]Genes Dev. 2002 Jul 1; 16(13):1616-26.
[Genes Dev. 2002]Curr Biol. 2002 Sep 3; 12(17):1484-95.
[Curr Biol. 2002]Science. 2002 Sep 13; 297(5588):1831.
[Science. 2002]Curr Biol. 2003 May 13; 13(10):807-18.
[Curr Biol. 2003]Dev Cell. 2003 Aug; 5(2):337-50.
[Dev Cell. 2003]Curr Biol. 2002 Sep 3; 12(17):1484-95.
[Curr Biol. 2002]Curr Biol. 2003 May 13; 13(10):807-18.
[Curr Biol. 2003]J Cell Sci. 2003 Dec 1; 116(Pt 23):4689-93.
[J Cell Sci. 2003]Cell. 2004 Jan 23; 116(2):281-97.
[Cell. 2004]Curr Opin Plant Biol. 2002 Oct; 5(5):444-51.
[Curr Opin Plant Biol. 2002]Cell. 2004 Jan 23; 116(2):281-97.
[Cell. 2004]Plant Physiol. 2003 Jun; 132(2):709-17.
[Plant Physiol. 2003]Cell. 2003 Oct 17; 115(2):209-16.
[Cell. 2003]Cell. 2003 Oct 17; 115(2):199-208.
[Cell. 2003]Cell. 2002 Aug 23; 110(4):513-20.
[Cell. 2002]Cell. 2003 Dec 26; 115(7):787-98.
[Cell. 2003]Plant Physiol. 2003 Jun; 132(2):709-17.
[Plant Physiol. 2003]Cell. 2004 Jan 23; 116(2):281-97.
[Cell. 2004]Science. 2002 Sep 20; 297(5589):2053-6.
[Science. 2002]Dev Cell. 2003 Feb; 4(2):205-17.
[Dev Cell. 2003]Nature. 2003 Sep 18; 425(6955):257-63.
[Nature. 2003]Curr Biol. 2003 Apr 29; 13(9):784-9.
[Curr Biol. 2003]Cell. 1993 Dec 3; 75(5):843-54.
[Cell. 1993]Cell. 1993 Dec 3; 75(5):855-62.
[Cell. 1993]Curr Biol. 2003 Dec 2; 13(23):R925-36.
[Curr Biol. 2003]Cell. 2004 Jan 23; 116(2):281-97.
[Cell. 2004]Genes Dev. 2003 Feb 15; 17(4):438-42.
[Genes Dev. 2003]Cell. 2002 Dec 13; 111(6):803-14.
[Cell. 2002]EMBO J. 2002 Dec 16; 21(24):6842-52.
[EMBO J. 2002]Curr Biol. 2003 Apr 29; 13(9):784-9.
[Curr Biol. 2003]Plant Mol Biol. 2002 Jun-Jul; 49(3-4):401-9.
[Plant Mol Biol. 2002]Science. 2003 Jan 31; 299(5607):716-9.
[Science. 2003]EMBO J. 2002 Sep 2; 21(17):4671-9.
[EMBO J. 2002]Dev Cell. 2003 Aug; 5(2):337-50.
[Dev Cell. 2003]Genes Dev. 2001 Jan 15; 15(2):188-200.
[Genes Dev. 2001]Science. 2002 Sep 20; 297(5589):2053-6.
[Science. 2002]Nucleic Acids Res. 2003 Jul 1; 31(13):3406-15.
[Nucleic Acids Res. 2003]Nucleic Acids Res. 1994 Nov 11; 22(22):4673-80.
[Nucleic Acids Res. 1994]