![]() | ![]() |
Formats:
|
||||||||||||||||||||||
Copyright © 2009 RNA Society Small RNA analysis in Petunia hybrida identifies unusual tissue-specific expression patterns of conserved miRNAs and of a 24mer RNA 1Institute of Molecular and Cellular Biology, The University of Leeds, Leeds LS2 9JT, United Kingdom 2Centre for Plant Science, The University of Leeds, Leeds LS2 9JT, United Kingdom 3These authors contributed equally to this work.
Reprint requests to: Peter Meyer, Centre for Plant Science, The University of Leeds, Leeds LS2 9JT, United Kingdom; e-mail: p.meyer/at/leeds.ac.uk; fax: 0044-113-3433144. Received December 15, 2008; Accepted February 24, 2009. Freely available online through the open access option. Abstract Two pools of small RNAs were cloned from inflorescences of Petunia hybrida using a 5′-ligation dependent and a 5′-ligation independent approach. The two libraries were integrated into a public website that allows the screening of individual sequences against 359,769 unique clones. The library contains 15 clones with 100% identity and 53 clones with one mismatch to miRNAs described for other plant species. For two conserved miRNAs, miR159 and miR390, we find clear differences in tissue-specific distribution, compared with other species. This shows that evolutionary conservation of miRNA sequences does not necessarily include a conservation of the miRNA expression profile. Almost 60% of all clones in the database are 24-nucleotide clones. In accordance with the role of 24mers in marking repetitive regions, we find them distributed across retroviral and transposable element sequences but other 24mers map to promoter regions and to different transcript regions. For one target region we observe tissue-specific variation of matching 24mers, which demonstrates that, as for 21mers, 24mer concentrations are not necessarily identical in different tissues. Asymmetric distribution of a putative novel miRNA in the two libraries suggests that the cloning method can be selective for the representation of certain small RNAs in a collection. Keywords: microRNA, siRNA, small RNA, database, petunia INTRODUCTION Small noncoding RNAs play important roles in plant development, environmental response, antiviral defense, and genome structure. The most important classes of small RNAs are microRNAs (miRNAs) and the different forms of small interfering (si)RNAs, trans-acting (ta) siRNAs, natural antisense (nat) siRNAs, and heterochromatin siRNAs. Mature ~21-nucleotide (nt)-long miRNAs control regulatory genes, especially transcription factors (Rhoades et al. 2002). They are processed by RNAseIII-like enzymes from noncoding primary precursor transcripts with local stem–loop structures (Chen 2005). miRNAs guide the degradation of target transcripts via endonucleolytic slicer activity (Baumberger and Baulcombe 2005) or act to translationally repress mRNAs (Brodersen et al. 2008). miRNAs and their targets are usually evolutionarily conserved, which suggests an ancient origin of miRNAs in the plant lineage (Bartel and Bartel 2003). miRNAs share many features with siRNAs, but in contrast to miRNAs, siRNAs are processed from double-stranded RNAs, frequently requiring RNA-dependent RNA polymerase activity and different RNAseIII-type enzymes. In Arabidopsis thaliana, miRNAs are predominantly processed by DICER-LIKE 1 (DCL1), viral siRNAs, and nat-siRNAs by DCL2, heterochromatin siRNAs by DCL3, and tasiRNAs by DCL4 (Xie et al. 2004; Gasciolli et al. 2005). tasiRNAs derive from single-stranded tasiRNA precursor RNAs (TAS) that were converted into a double-strand by RNA-dependent RNA polymerase 6 (RDR6) (Peragine et al. 2004), and which are cleaved in a phased manner by DCL4. DCL1 is also required for tasiRNA production, as certain miRNAs set the phase for the production of tasiRNAs (Allen et al. 2005). DCL1 is also involved in nat-si RNA production. Primary 24-nt nat-siRNAs derive from DCL2 cleavage of dsRNAs formed between a sense and an overlapping natural antisense transcript. The 24-nt RNA directs the cleavage of the sense transcript and sets the phase for production of secondary 21-nt nat-siRNAs by DCL1 in combination with RDR6 (Borsani et al. 2005). Twenty-four-nucleotide heterochromatin siRNAs are components of the RNA directed DNA methylation (RdDM) pathway that regulate histone modification and heterochromatin formation. An atypical RNA polymerase, NUCLEAR RNA POLYMERASE D 1A (NRPD1A), produces RNA single strands as substrates for RNA-dependent RNA polymerase2 (RDR2). The resulting dsRNA is cleaved by DICER-like3 (DCL3) into 24mers of sense and antisense orientation, which are incorporated into a silencing complex that directs modification of cytosine and histone marks to homologous regions (Pontes et al. 2006) (Li et al. 2006). While 21–24mers represent the dominant class of small RNAs, larger RNAs have also been described. An antisense-specific 30nt RNA was detected in the 3′ region of the Arabidopsis FLC gene (Swiezewski et al. 2007), and a ~35-nt antisense RNA was found in the 3′ region of the petunia Sho gene (Zubko and Meyer 2007). Small RNAs have been sequenced for a number of plant species, especially those for which a fully sequenced genome or an extensive DNA database was available (Rajagopalan et al. 2006; Fahlgren et al. 2007; Pilcher et al. 2007; Yao et al. 2007; Moxon et al. 2008). Genomic information for Petunia hybrida is rather limited, but comparison of a petunia small RNA database with small RNAs in other species and with sequence data for petunia genes and ESTs allowed us to identify conserved and putative new microRNAs as well as an unusual class of 24mers associated with nonheterochromatic regions. RESULTS Cloning of small RNAs We prepared two types of small RNA libraries from floral bud RNA. Clones in batch 1 were prepared after adapter ligation to the 5′ and 3′ end of the RNA. As this method requires target RNAs with a 5′monophosphate, we used a 5′-ligation-independent method for batch 2 to include RNAs with a modified 5′ end. For both batches, RNA fractions in size ranges around 21–25 nt and 33–37 nt were excised and combined. From a total of 925,405 sequences for which adaptor sequences could be identified, 397,899 unique clones were extracted, from which 38,130 were removed, which represented rRNA, tRNA, snRNA, mitochondrial, or chloroplast gene sequences (Table 1).
Both batches contain a similar size distribution of small RNAs, with 24mers representing the largest class in batch 1 (60%) and batch 2 (54%). A total of 2.3% of batch 2 clones and 1.1% of batch 1 clones are 21mers. Seven percent of batch 2 clones and 4% of batch 1 clones are 22mer. With 2.3%, 34mers and 35mers are more frequent in batch 1 compared with batch 2 with 1.4% (Fig. 1
The petunia small RNA BLAST website The 359,769 small RNA sequences of batch 1 and batch 2 were entered into a website that allows the screening of DNA sequences for perfect or partial matches to small RNA sequences. Users can submit DNA sequences of any length that can be compared with batch 1 and batch 2 clones, either separately or jointly. Search parameters include maximum and minimum length of the matching region, number of aligned clones to be displayed, orientation of the aligned clones, and selection of clones with 100% identity or less (http://www.petunia_smrna.leeds.ac.uk/). The output graphic displays all small RNAs that fulfill the search criteria. Along the query sequence, each hit is illustrated as a box in one of four colors that define its batch origin and its orientation. For each box, the name of the clone, its length and its representation frequency in the small RNA library are listed. The graphic display is followed by a list of matching clones, their normalized bit score and expect value (E-value), and by an alignment for each clone with the query sequence. Matches to known microRNAs We identified 15 clones with perfect matches to microRNAs found in other species (Table 2; Supplemental File 1) and 53 clones with one mismatch to known microRNAs (Table 3; Supplemental File 2). A similarity screen of these clones against petunia sequences identified a hit for mirBL locus AM489765, which encodes a conserved miR169 microRNA (Cartolano et al. 2007). No other homologous petunia loci could be identified for the other 21mer sequences listed in Tables 2 and 3. We tested the distribution of three conserved microRNAs in different tissues. miR159 is relatively uniformly expressed in all tissue types, expression of miR166 is significantly reduced in flower buds and miR390 is mainly detectable in floral tissue (Fig. 2
To test if our database contained additional, petunia-specific microRNAs, we searched for petunia clones with similar features to the mirBL locus (Fig. 3A
Only a fraction of small RNA clones match known petunia sequences, and these can be examined for secondary structure features. Without a matching petunia sequence, miRNA candidates can only be tested experimentally. Assuming that the frequent representation of 21mer or 22mer clones in the database is an indicators for a potential microRNA, we selected the 21mer EMUNAOL02FR3EL as a candidate. In the database, we find seven copies of this clone as well as 61 22mers that contain the 21mer sequence plus an additional nucleotide. Hybridization to an antisense probe of EMUNAOL02FR3EL confirmed the presence of a class of homologous 21–22mer in floral tissue and leaves (Fig. 4 Analysis of 24mer clones A BLAST search for the 208,721 24mer present in the library identified 3757 hits with a 100% identity to petunia sequences (Supplemental File 4). As expected, 24mer clones match sense and antisense sequences of moderately repeated sequences such as transposable elements, retrotransposons, and viral sequences (Fig. 5A,B
To test the presence of local 24mers experimentally, a 27-nt region between position 386 and 412 of EST887452 was selected, for which the library contains 41 24mer matches in sense or antisense orientation (Fig. 6A
DISCUSSION We used a 5′-ligation-dependent and a 5′-ligation-independent method to clone small RNAs from petunia flower buds. RNA cleavage by DICER generates primary small RNAs with 5′ monophosphates that can be cloned by 5′-adaptor ligation, while secondary RNAs that are synthesized by an RNA-directed RNA Polymerase (RdRP) have a triphosphate 5′ terminus and require a 5′-ligation-independent cloning method (Pak and Fire 2007). The two libraries were combined into one database that can be screened for perfect or near-perfect matches to sequences submitted to a public website. A color code allows the identification of the origin and orientation of each clone. Compared with other genomes, petunia sequence data are relatively limited, which restricts the search for MIR loci or miRNA targets. For the 68 clones with perfect or near-perfect matches to known microRNAs, only one matching MIR locus, mirBL, could be identified. The mirBL locus served as a model for the identification of putative novel MIR loci. Like mirBL, two EST sequences contain a distinct region with a potential stem–loop structure and matches to small RNAs, which are all in sense orientation and predominantly 21-nt long. However, the presence of a homologous 21mer could only be demonstrated for one of the two candidates. This confirms that in silico prediction of MIR loci or small RNA target loci can still have a high false-positive rate and must be validated experimentally (Moxon et al. 2008). Experimental validation also demonstrated that the tissue-specific expression profile for conserved microRNAs can vary among different species. At least two of the three conserved microRNAs that we tested show a tissue-specific distribution. MiR390 is strongly expressed in floral tissues but not detectable in root tissue. This distribution is a marked contrast to miR390 expression in rice, which is much stronger in roots than in inflorescences (Sunkar et al. 2005). This illustrates that, although certain microRNAs are conserved among different species, their tissue-specific expression level can differ significantly. MiR390 has been cloned from Arabidopsis (Gustafson et al. 2005) and rice (Sunkar et al. 2005), and is present in fern and moss species as well (Axtell and Bartel 2005). Two Arabidopsis miRNAs, miR173 and miR390, target primary TAS transcripts encoding ta-siRNAs. Cleavage of TAS transcripts recruits RDR6 that synthesises a dsRNA substrate for ta-siRNA biogenesis (Allen et al. 2005). The TAS3 transcript upon which miR390 acts is conserved in Arabidopsis and rice, along with the corresponding mRNA targets of the miR390-dependent TAS3 siRNAs. This conservation suggests that setting the phasing of ta-siRNAs is an ancient function of miRNA in plants (Allen et al. 2005). However, rice contains a conventional mRNA target for miR390 (encoding a receptor-like kinase) (Sunkar et al. 2005), while in Arabidopsis an auxin response factor mRNA (ARF3/ETTIN and ARF4) is targeted by the ta-siRNAs generated after miR390-directed cleavage of the siRNA precursor transcript. It has therefore been suggested that miR390 may have dual roles regulating both mRNAs and loci encoding ta-siRNAs (Bartel 2005). The differences in tissue-specific expression of miR390 in petunia and other species indicate an additional level of flexibility in the evolution of miR390. The tissue-specific expression profile of miR159 also differs from its expression in Arabidopsis. While in Arabidopsis miR159 is strongest in flowers but hardly detectable in roots (Achard et al. 2004), miR159 levels in roots and floral tissues are similar in petunia. Tissue-specific variation of miRNA expression in different species may therefore be a more common phenomenon than expected. We identified two potentially new miRNA with strongest representation in flower buds, the tissue from which small RNAs had been cloned. One of these microRNA candidates matches VUA594 (gene bank gi91064442), a 272-bp EST from petals extracted from buds at maximum elongation. The database contains a total of 74 21–25-nt clones matching the 100–132-bp region of the VUA594 sense transcript. Thirteen clones are 21mer and 22 clones are 22mer, all of which exclusively derive from batch 2 (Fig. 3C Twenty-four mers represent the largest group of small RNAs in both batches, and the majority of these clones very likely mark target regions for RNA directed DNA methylation. The RdDM pathway controls methylation of many, although not all, transposons and DNA repeats located in constitutive or facultative heterochromatin, or dispersed within euchromatic regions (Pontier et al. 2005). Retrotransposon promoters and LTRs are also preferential RdDM targets (Huettel et al. 2006). It is therefore not surprising to detect matches to 24mer RNAs in petunia transposon and retrotransposon, or in a copy of an integrated viral genome. More surprising is the match of 24mers to individual small regions in EST sequences, and to coding regions or UTRs. There is no indication for the presence of repeat elements at these 24mer matches, but in the absence of a complete petunia genome analysis, we cannot exclude that at least some of the 24mer sequences have been generated from unspecified repetitive regions that share small homologies with the nonrepetitive loci. But even if the local 24mer target regions were false-positive matches with 24mers originating from heterochromatic regions, one would not expect a tissue-specific variation as it was observed for EST887452-specific 24mers. At least for 24mer clones with variable representation in different tissues, it is more likely that these derive from transcripts of the matching euchromatic regions. Other examples for 24-nt siRNAs matching euchromatic loci, include the NaCl-inducible nat-siRNA SRO5-P5CDH (Borsani et al. 2005) and a 24-nt RNA corresponding to the 3′ reverse strand of the FLOWERING LOCUS C (FLC) (Swiezewski et al. 2007). The SRO5–P5CDH 24-nt siRNA sets the phase for the production of secondary 21-nt nat-siRNAs by DCL1, and the FLC 24-nt siRNA is part of a RNAi-mediated chromatin modification process under the control of RdRM pathway functions. The lack of phased 21mers, makes it very unlikely that the EST887452-specific 24mers represent nat-siRNAs. Tissue-specific variation is a common feature between EST887452-specific 24mers and the FLC 24-nt siRNA, but while FLC siRNA is specific for the complementary strand, EST887452-specific 24mers comprise both sense and antisense specific 24mers. In this respect, they resemble a pool of 24 nt-long sense and antisense RNAs associated with the petunia Sho locus (Zubko and Meyer 2007). However, unlike Sho-specific 24mers, EST887452-specific 24mers map to a very small transcript region. Despite the limited number of petunia sequences that small RNAs can be compared with, our analysis has provided some surprising results about the tissue-specific representation of some small RNA types. The petunia small RNA website will hopefully stimulate a wider comparison of our database with unpublished and novel petunia sequences. MATERIALS AND METHODS Cloning and sequencing of small RNA Low molecular-weight RNAs were isolated from petunia flower buds as described (Hamilton and Baulcombe 1999). Cloning of small RNAs followed a published protocol (Arazi et al. 2005) with some modification. Small RNAs were size fractionated by 15% denaturing polyacrylamide gel, eluted from gel in 0.3 M NaCl, purified, and sequentially ligated to 5′ and 3′ RNA–DNA chimeric oligonucleotide adapters. Reverse transcription was performed using Superscript II RT and antisense PCR3′ primer. Resulting cDNAs were amplified by PCR using PCR5′ and PCR3′ primers tailed with sequence A and B, which are required for 454 sequencing (Supplemental File 5). PCR fragments were purified and sequenced by 454 Life Sciences (Margulies et al. 2005) using the Amplicon library preparation procedure (http://www.454.com/downloads/protocols/Guide_To_Amplicon_Sequencing.pdf). For 5′ independent cloning, small RNAs were first ligated to a 3′RNA–DNA adapter. The ligation products were reverse transcribed and the cDNA was ligated to a cDNA adapter, followed by PCR amplification. Small RNA expression analysis using Northern blot For hybridization analysis, 25 μg of small RNA was separated on a 15% denaturing polyacrylamide gel, and transferred onto a Hybond Nx membrane (Amersham). The Decade Marker (Ambion) radiolabeled by γ32P[ATP] was used as a size control. Cross-linking of RNA to a nylon membrane was performed using a soluble carbodiimide (Pall et al. 2007). Hybridization was done in ULTRAhyb-Oligo hybridization buffer (Ambion) according to the manufacturer's recommendation. Production of final petunia RNA sequences Deep sequencing was done by 454 Life Sciences (www.454.com). Raw sequence data for batch 1 and batch 2 were processed in standard fasta format files using Perl scripts and the BioPerl SearchIO package (http://www.bioperl.org/wiki/Module:Bio SearchIO). Known adaptor sequences were located contiguously to the petunia RNA sequence at the beginning and end of the sequence. Adaptor sequences were found to have a far lower sequence variability at the ends nearest the petunia RNA (referred to below as the “petunia RNA end”). Therefore, sequences that showed a perfect alignment to the last 10 bases of the petunia RNA end of both of the adaptors were deemed to have been detected and the adaptors were removed. Otherwise, the last 19 bases of the petunia RNA end of the adaptor were aligned using dynamic programming (Needleman and Wunsch 1970), and adaptors were deemed to be detected if the alignment had no more than two mismatches to the known adaptor. A small minority of sequences in batch 1 was found to be concatemers of two RNA sequences flanked by adaptors at each end and containing a central pair. These sequences were split at the junction between the two central adaptors and the resulting sequences were then identified by the additional suffixes part1 and part2. Adaptor sequences were removed and the resulting sequences saved to fasta format files. Nonredundant sequence sets were then created by identifying groups of identical sequences and removing duplicated sequences by the use of an associative array in Perl.The nonredundant sequences were then aligned using the BLAST algorithm with sequences in the rRNA database SILVA (Pruesse et al. 2007), Arabidopsis thaliana tRNA (http://gtrnadb.ucsc.edu/), mitochondrial, and chloroplast genes (http://www.arabidopsis.org/). Petunia RNA sequences with >90% sequence identity to any of the above sequences were removed. Finally, to remove snRNA and any sequences missed by the above processes, the remaining sequences were aligned with sequences in GenBank (Benson et al. 2007), and any perfect matches to rRNA, tRNA, snRNA, mitochondrial, or chloroplast genes were removed. This produced the final sets of petunia RNA sequences, which were used for the rest of this study. Identification of known miRNAs in petunia RNAs Known miRNAs were downloaded from miRBASE (http://microrna.sanger.ac.uk/sequences/ftp.shtml) and petunia RNA sequences that showed perfect matches or one mismatch to the known miRNA were extracted using the BLAST algorithm. Secondary structure analysis For secondary structure analysis, sequences were submitted to the RNA fold webserver (Gruber et al. 2008). Website development The website is based on the NCBI blastall program (ftp://ftp.ncbi.nih.gov/blast/) with graphics in the website being produced using the BioPerl Graphics package (http://www.bioperl.org/wiki/Module:Bio Graphics).100% matches of 21 and 24 base pair petunia RNAs to petunia genes Petunia sequences were extracted from GenBank and aligned using BLAST with the petunia RNA sequences of length 21 and 24 base pairs. Petunia RNA sequences that showed a 100% match to any of the petunia genes were then recorded. SUPPLEMENTAL MATERIAL Supplemental material can be found at http://www.rnajournal.org. ACKNOWLEDGMENTS This work was supported by the Epigenome Network of Excellence (LSHG-CT-2004-503433). We thank Binbin Liu for IT support. EST VUA594 was submitted to the petunia EST collection by Francesca Quattrocchio, Vrije Universiteit, Amsterdam, Netherlands. Footnotes Article published online ahead of print. Article and publication date are at http://www.rnajournal.org/cgi/doi/10.1261/rna.1517209. REFERENCES
|
PubMed related articles
Your browsing activity is empty. Activity recording is turned off. |
|||||||||||||||||||||
Cell. 2002 Aug 23; 110(4):513-20.
[Cell. 2002]FEBS Lett. 2005 Oct 31; 579(26):5923-31.
[FEBS Lett. 2005]Proc Natl Acad Sci U S A. 2005 Aug 16; 102(33):11928-33.
[Proc Natl Acad Sci U S A. 2005]Science. 2008 May 30; 320(5880):1185-90.
[Science. 2008]Plant Physiol. 2003 Jun; 132(2):709-17.
[Plant Physiol. 2003]PLoS Biol. 2004 May; 2(5):E104.
[PLoS Biol. 2004]Curr Biol. 2005 Aug 23; 15(16):1494-500.
[Curr Biol. 2005]Genes Dev. 2004 Oct 1; 18(19):2368-79.
[Genes Dev. 2004]Cell. 2005 Apr 22; 121(2):207-21.
[Cell. 2005]Cell. 2005 Dec 29; 123(7):1279-91.
[Cell. 2005]Cell. 2006 Jul 14; 126(1):79-92.
[Cell. 2006]Cell. 2006 Jul 14; 126(1):93-106.
[Cell. 2006]Proc Natl Acad Sci U S A. 2007 Feb 27; 104(9):3633-8.
[Proc Natl Acad Sci U S A. 2007]Plant J. 2007 Dec; 52(6):1131-9.
[Plant J. 2007]Genes Dev. 2006 Dec 15; 20(24):3407-25.
[Genes Dev. 2006]PLoS One. 2007 Feb 14; 2(2):e219.
[PLoS One. 2007]Planta. 2007 Aug; 226(3):709-17.
[Planta. 2007]Genome Biol. 2007; 8(6):R96.
[Genome Biol. 2007]Genome Res. 2008 Oct; 18(10):1602-9.
[Genome Res. 2008]Nat Genet. 2007 Jul; 39(7):901-5.
[Nat Genet. 2007]Science. 2007 Jan 12; 315(5809):241-4.
[Science. 2007]Genome Res. 2008 Oct; 18(10):1602-9.
[Genome Res. 2008]Plant Cell. 2005 May; 17(5):1397-411.
[Plant Cell. 2005]Nucleic Acids Res. 2005 Jan 1; 33(Database issue):D637-40.
[Nucleic Acids Res. 2005]Plant Cell. 2005 May; 17(5):1397-411.
[Plant Cell. 2005]Plant Cell. 2005 Jun; 17(6):1658-73.
[Plant Cell. 2005]Cell. 2005 Apr 22; 121(2):207-21.
[Cell. 2005]Nat Struct Mol Biol. 2005 Jul; 12(7):569-71.
[Nat Struct Mol Biol. 2005]Development. 2004 Jul; 131(14):3357-65.
[Development. 2004]Genes Dev. 2005 Sep 1; 19(17):2030-40.
[Genes Dev. 2005]EMBO J. 2006 Jun 21; 25(12):2828-36.
[EMBO J. 2006]Cell. 2005 Dec 29; 123(7):1279-91.
[Cell. 2005]Proc Natl Acad Sci U S A. 2007 Feb 27; 104(9):3633-8.
[Proc Natl Acad Sci U S A. 2007]Plant J. 2007 Dec; 52(6):1131-9.
[Plant J. 2007]Science. 1999 Oct 29; 286(5441):950-2.
[Science. 1999]Plant J. 2005 Sep; 43(6):837-48.
[Plant J. 2005]Nature. 2005 Sep 15; 437(7057):376-80.
[Nature. 2005]Nucleic Acids Res. 2007; 35(8):e60.
[Nucleic Acids Res. 2007]J Mol Biol. 1970 Mar; 48(3):443-53.
[J Mol Biol. 1970]Nucleic Acids Res. 2007; 35(21):7188-96.
[Nucleic Acids Res. 2007]Nucleic Acids Res. 2007 Jan; 35(Database issue):D21-5.
[Nucleic Acids Res. 2007]Nucleic Acids Res. 2008 Jul 1; 36(Web Server issue):W70-4.
[Nucleic Acids Res. 2008]