![]() | ![]() |
Formats:
|
||||||||||||||||
Copyright © 2007 RNA Society Hypothesis: RNA editing of microRNA target sites in humans? 1Department of Chemistry, Princeton University, Princeton, New Jersey 08544, USA 2Department of Ecology and Evolutionary Biology, Princeton University, Princeton, New Jersey 08544, USA 3
Present address: Department of Ecology and Evolution, University of Chicago, 1101 E. 57th Street, Chicago, IL 60637, USA
Reprint requests to: Laura F. Landweber, Department of Ecology and Evolutionary Biology, Princeton University, Princeton, New Jersey 08544, USA; e-mail: lfl/at/princeton.edu; fax: (609) 258-7892. Received September 6, 2006; Accepted December 18, 2006. This article has been cited by other articles in PMC.Abstract Adenosine (A) to inosine (I) RNA editing occurs widely in the human transcriptome, and a large proportion of editing sites are within untranslated regions (UTRs). MicroRNAs (miRNAs), an abundant class of regulatory genes, specify the expression of a large number of target genes by pairing to their 3′ UTRs. To study the interplay between these two post-transcriptional events, we developed a computational pipeline to integrate sequence and miRNA tissue specificity data. The results show that some A-to-I RNA editing positions have a potential to block the miRNA:target recognition, although further computational simulation suggests that RNA editing tends to avoid miRNA target sites in general. We propose that a small proportion of RNA editing events may provide an additional layer of control on miRNA-mediated repression. Further investigation is needed to elucidate the functional effect of these special RNA editing events. Keywords: A-to-I RNA editing, microRNA repression, microRNA target sites, tissue specificity INTRODUCTION RNA editing generates RNA diversity through the post-transcriptional modification of single nucleotides in pre-mRNA (Maas et al. 2003). Adenosine (A) to inosine (I) modification by adenosine deaminases (ADARs), with inosine acting as guanosine (G) during translation, appears to be the most common type of nuclear RNA editing in eukaryotes (Bass 2002). Several recent studies have reported that A-to-I editing occurs widely in the human transcriptome (at least 2% of publicly available mRNAs), and most editing sites reside in Alu repetitive elements, which are typically 300 nucleotides (nt) long and comprise >10% of the human genome (Athanasiadis et al. 2004; Kim et al. 2004; Levanon et al. 2004, 2005; Eisenberg et al. 2005). Because the vast majority of edited nucleotides are within intronic or untranslated regions (UTRs), the functional consequence of RNA editing is often hard to infer. To date, except for a few well-studied cases (Higuchi et al. 1993; Burns et al. 1997; Rueter et al. 1999), the biological role of much A-to-I editing is still under debate. MicroRNAs (miRNAs) are endogenous ~22-nt noncoding RNAs that can play an important role in the regulation of gene expression by pairing to the messages of protein-coding genes, thereby specifying mRNA cleavage or repression of productive translation (Ambros 2004; Bartel 2004; Zamore and Haley 2005). In vertebrates, miRNA genes are one of the most abundant classes of regulatory genes (~1% of all the genes) (Lim et al. 2003, 2005; Bartel 2004; Bartel and Chen 2004). Recent studies have indicated that target recognition by one miRNA is mainly achieved by pairing to the nucleotides at positions 2–7 of the miRNA (the so-called “miRNA seed”), and often a single mutation in the miRNA seed match of the target site can lead to total abolishment of effective repression (Lewis et al. 2003; Bartel 2004; Brennecke et al. 2005; Stark et al. 2005). In fact, several computational methods have been developed to predict miRNA target genes based on this principle, suggesting that 20%–30% of protein-coding genes are under selective pressure to contain conserved 7-nt matches to miRNA seed regions, and that these regions are likely to be miRNA targets (Lewis et al. 2003, 2005; Rajewsky 2006). Very recently, a study further suggested that nonconserved 7-nt matches are also able to mediate repression when exposed to the miRNA, implying a widespread impact of miRNAs on gene expression (Farh et al. 2005). (For simplicity, throughout the rest of this paper we often refer to these 7-nt matches as miRNA target sites, recognizing that they are necessarily biological target sites.) Recently, A-to-I editing of miRNA precursor molecules has been reported (Luciano et al. 2004; Blow et al. 2006); however, the potential role of the editing on miRNA target genes remains unexplored. Given the wide occurrence of both A-to-I editing and miRNA target sites in 3′-UTR regions, we wonder whether some editing events can interrupt (or create) functional target sites, thus adding another layer of control to miRNA-mediated repression. Therefore, by integrating sequence and miRNA tissue specificity data, we developed a computational pipeline to predict miRNA target sites subject to RNA editing. Intriguingly, we found that some edited positions lie in putative miRNA target sites. Our work calls attention to the layers of complexity that govern human gene regulation, and more experimental work is needed to further elucidate the functional effect of these RNA editing events. RESULTS We first identified a set of highly specific A-to-I editing sites in the human genome, using the edited transcripts from two previous large-scale studies (Athanasiadis et al. 2004; Kim et al. 2004). As shown in Figure 1
Since a recent study has suggested that a 7-nt target site (no matter whether it is conserved or not) is often sufficient to mediate suppression when exposed to the cognate miRNA (Farh et al. 2005), we further incorporated miRNA tissue specificity data to identify the editing-interrupted miRNA target sites whose cognate miRNA (family) is coexpressed in the same tissue in which RNA editing occurs. As a result, we found two target sites interrupted by three editing positions (Table 2). The pipeline for our computational analysis is shown in Figure 3
In addition, our study also revealed that ~200 A-to-I editing positions have a potential to create a new miRNA target site, if we assume that inosine works as well as guanine in miRNA:target recognition (Supplemental Data file 3). DISCUSSION Based on our sequence analysis, we first identified a set of A-to-I editing positions that can interrupt 7-nt matches to miRNA seed regions. Most of them should have no functional effect because (1) the 7-nt matches either are not real miRNA target sites or (2) they are miRNA target sites but function in different tissues relative to RNA editing. However, we cannot rule out the possibility that a few RNA editing positions may interrupt functional miRNA target sites and thus block miRNA:target recognition. To elucidate this dual-level regulation, it will be necessary to verify the miRNA target sites and evaluate the functional effect of the specific RNA editing within them. In particular, two types of miRNA target sites that we identified seem most appealing to examine: (1) the interrupted miRNA target sites that have been conserved in the long history of eutherian evolution; and (2) the target sites whose cognate miRNA is coexpressed in the tissue where RNA editing takes place. Undoubtedly, the possible RNA editing of miRNA target sites that this study reveals is only the tip of the iceberg. First, we note that only a small proportion of the human transcriptome is available so far. Second, only those transcripts with extensive A-to-I editing can be confidently identified by the computational and statistical methods. Third, due to the incomplete annotation of transcripts in the public database and the limitation of miRNA tissue specificity data, only a small number of tissues could presently be investigated. If some RNA editing events really interrupt functional miRNA target sites, they may represent some very specific cases of coordinated regulation. One possible scenario (Fig. 4
MATERIALS AND METHODS To identify RNA editing positions relevant to miRNA target sites, we first compiled a set of mRNA transcripts under A-to-I editing from two recent large-scale studies that used both statistical and experimental approaches to identify edited transcripts (Athanasiadis et al. 2004; Kim et al. 2004). Second, we downloaded the mRNA sequences and their genomic locations from the UCSC genome browser and aligned the mRNAs with the reference human genome (hg18). Then, RNA editing positions were inferred from the A-to-G discrepancies between the genomic and mRNA sequences. As a result, based on 2454 edited mRNAs, our data set includes 28,311 nonredundant A-to-I RNA editing positions. Next, we identified the editing positions within the 3′-UTR regions that were defined by well-annotated RefSeq genes. Finally, for each of these editing positions, we extracted its local genomic and mRNA sequences and evaluated whether the RNA editing has a potential to interrupt (or create) a miRNA target site (a 7- or 8-nt match to a miRNA seed region) using the TargetScanS program (Lewis et al. 2005). For this purpose, we used the same set of 73 conserved miRNA families as in Farh et al. (2005) and obtained the miRNA sequences from the miRBase sequence database (Griffiths-Jones 2004). To evaluate the biological significance of RNA editing positions interrupting a potential miRNA target site, we developed a local clustering and sampling (LCS) algorithm to simulate the random occurrence of RNA editing in 3′ UTRs. Because RNA editing positions tend to be clustered in the sequence, our LCS algorithm maintained this clustering feature in the randomization, instead of treating each position independently. First, we defined two RNA editing positions as neighbors if they were within a small distance (10 nt), and then we used the single-linkage algorithm to group editing positions into clusters. That is, if editing positions A and B are neighbors and positions B and C are neighbors, we group editing positions A, B, and C together into a cluster, regardless of whether or not positions A and C are neighbors. This process assigns each editing position to one cluster only. Second, for each cluster of editing positions, we defined a sampling window, which includes the region from the editing position at the 5′ end to the one at the 3′ end in the cluster plus five flanking nucleotides at each side. Third, for each editing cluster, given the number of editing positions within the cluster, we randomly chose the same number of adenosine positions from the sampling window as pseudoediting positions. Then, given the pseudoediting positions and the presence of miRNA target sites (7-mer seed matches regardless of whether conserved or not) in 3′ UTRs, we calculated the total number of positions interrupting a miRNA target site over all the clusters. This step was repeated 1000 times to generate an empirical background distribution (the number of editing positions interrupting a miRNA target site versus frequency). Based on this distribution, we derived the probability of the number of miRNA-target-interrupting editing positions observed in the real data set. By using this algorithm, we not only controlled the number of RNA editing positions in each 3′ UTR in the randomization, but also maintained (1) the spatial clustering of editing positions, (2) the local nucleotide composition (e.g., GC%), and (3) the relative location of editing positions in 3′ UTRs. Moreover, the results remained essentially the same when different cutoff values were explored. To study the conservation of miRNA target sites interrupted by RNA editing, we further extracted the corresponding orthologous 3′-UTR regions (human [hg18], macaque [rheMac2], mouse [mm8], rat [rn4], dog [canFam2], and chicken [galGal2]) from the UCSC MultiZ17 alignments (Blanchette et al. 2004). To study the mRNA targets with coexpressed cognate miRNAs, we estimated the tissue specificity of the miRNA families using the data from Farh et al. (2005), where the signal (as probabilities) of the 73 miRNA families in 61 mouse tissues was inferred based on “target-site depletion.” For our analysis, the miRNA tissue specificity is assumed to be the same in the human homologous tissues. Given the probabilities and a false discovery rate of 10%, we chose 0.0015 as the threshold to label a family of miRNA genes as “expressed in a given tissue.” SUPPLEMENTAL DATA Supplemental Data files can be found at http://oxytricha.princeton.edu/liang/editing/editing.htm. ACKNOWLEDGMENTS We thank David Bartel (MIT) and two anonymous reviewers for their valuable suggestions. We also thank laboratory members Joey Zhou, Tom Doak, and Mariusz Nowacki for discussions. This work was supported by National Institute of General Medical Sciences grant GM59708 to L.F.L. Footnotes Article published online ahead of print. Article and publication date are at http://www.rnajournal.org/cgi/doi/10.1261/rna.296407. REFERENCES
|
PubMed related articles
Your browsing activity is empty. Activity recording is turned off. |
|||||||||||||||
J Biol Chem. 2003 Jan 17; 278(3):1391-4.
[J Biol Chem. 2003]Annu Rev Biochem. 2002; 71():817-46.
[Annu Rev Biochem. 2002]PLoS Biol. 2004 Dec; 2(12):e391.
[PLoS Biol. 2004]Genome Res. 2004 Sep; 14(9):1719-25.
[Genome Res. 2004]Nat Biotechnol. 2004 Aug; 22(8):1001-5.
[Nat Biotechnol. 2004]Nature. 2004 Sep 16; 431(7006):350-5.
[Nature. 2004]Cell. 2004 Jan 23; 116(2):281-97.
[Cell. 2004]Science. 2005 Sep 2; 309(5740):1519-24.
[Science. 2005]Science. 2003 Mar 7; 299(5612):1540.
[Science. 2003]Nature. 2005 Feb 17; 433(7027):769-73.
[Nature. 2005]RNA. 2004 Aug; 10(8):1174-7.
[RNA. 2004]Genome Biol. 2006; 7(4):R27.
[Genome Biol. 2006]PLoS Biol. 2004 Dec; 2(12):e391.
[PLoS Biol. 2004]Genome Res. 2004 Sep; 14(9):1719-25.
[Genome Res. 2004]Science. 2005 Dec 16; 310(5755):1817-21.
[Science. 2005]PLoS Biol. 2004 Dec; 2(12):e391.
[PLoS Biol. 2004]PLoS Biol. 2004 Dec; 2(12):e391.
[PLoS Biol. 2004]Genome Res. 2004 Sep; 14(9):1719-25.
[Genome Res. 2004]Cell. 2005 Jan 14; 120(1):15-20.
[Cell. 2005]Science. 2005 Dec 16; 310(5755):1817-21.
[Science. 2005]Nucleic Acids Res. 2004 Jan 1; 32(Database issue):D109-11.
[Nucleic Acids Res. 2004]Genome Res. 2004 Apr; 14(4):708-15.
[Genome Res. 2004]Science. 2005 Dec 16; 310(5755):1817-21.
[Science. 2005]