![]() | ![]() |
Formats:
|
||||||||||||||||||
Copyright © 2007, EMBO and Nature Publishing Group Regulation of gene expression by small non-coding RNAs: a quantitative view 1Racah Institute of Physics, The Hebrew University, Jerusalem, Israel 2Department of Molecular Genetics and Biotechnology, Faculty of Medicine, The Hebrew University, Jerusalem, Israel aRacah Institute of Physics, The Hebrew University, Jerusalem 91904, Israel. Tel.: +972 2 658 4363; Fax: +972 2 652 0089; Email: biham/at/phys.huji.ac.il bDepartment of Molecular Genetics and Biotechnology, Faculty of Medicine, The Hebrew University, PO Box 12272, Jerusalem 91120, Israel. Tel.: +972 2 675 8614; Fax: +972 2 675 7308; Email: hanahm/at/ekmd.huji.ac.il Received May 14, 2007; Accepted August 23, 2007. This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits distribution, and reproduction in any medium, provided the original author and source are credited. This license does not permit commercial exploitation or the creation of derivative works without specific permission. This article has been cited by other articles in PMC.Abstract The importance of post-transcriptional regulation by small non-coding RNAs has recently been recognized in both pro- and eukaryotes. Small RNAs (sRNAs) regulate gene expression post-transcriptionally by base pairing with the mRNA. Here we use dynamical simulations to characterize this regulation mode in comparison to transcriptional regulation mediated by protein–DNA interaction and to post-translational regulation achieved by protein–protein interaction. We show quantitatively that regulation by sRNA is advantageous when fast responses to external signals are needed, consistent with experimental data about its involvement in stress responses. Our analysis indicates that the half-life of the sRNA–mRNA complex and the ratio of their production rates determine the steady-state level of the target protein, suggesting that regulation by sRNA may provide fine-tuning of gene expression. We also describe the network of regulation by sRNA in Escherichia coli, and integrate it with the transcription regulation network, uncovering mixed regulatory circuits, such as mixed feed-forward loops. The integration of sRNAs in feed-forward loops provides tight repression, guaranteed by the combination of transcriptional and post-transcriptional regulations. Keywords: cellular networks, network motifs, small non-coding RNA, transcriptional and post-transcriptional regulation Introduction Living cells are self-regulated by interactions between different molecules. Until very recently, most research has focused on transcription regulation interactions and on protein–protein interactions, which in many cases are involved in post-translational regulation. During the last years it has become evident that another type of interaction plays a prominent role in the regulation of cellular processes, manifested by small RNA (sRNA) molecules that base pair with the mRNA and regulate gene expression post-transcriptionally. This mode of regulation was found in both pro- and eukaryotes (for review see Storz et al, 2005). Although there are differences in the characteristics of the eukaryotic and prokaryotic regulatory RNAs and in the fine-details of their mechanism of action, both exert their regulatory function mostly by base pairing with the mRNA and influencing translation or mRNA stability. It is intriguing to study the properties of this type of regulatory interactions in comparison to the other types of interactions, and to understand their integration in the cellular circuitry. In this paper we focus on bacterial sRNAs, and particularly on regulatory interactions found in Escherichia coli, for which most experimental data on sRNAs are available. At present there are about 80 known sRNAs in E. coli (for review see Gottesman, 2005; Storz et al, 2005). These molecules are 50–400 nucleotides long and many of them are evolutionary conserved (Hershberg et al, 2003), hinting to their important roles in the cellular mechanisms. Still, for many of the sRNAs, their cellular and molecular functions have not yet been determined. Many of those, for which some functional knowledge has been acquired, were often shown to act as inhibitors of translation by base pairing with the mRNA in the ribosome-binding site (for review see Gottesman, 2005). However, in E. coli there are also a couple of examples where the sRNAs play a role as translational activators, promoting ribosome binding to the mRNA by exposing its binding site (Majdalani et al, 1998, 2001; Prevost et al, 2007). In many cases the sRNA–mRNA interactions are assisted by the RNA chaperone Hfq (Valentin-Hansen et al, 2004). The acknowledgment that post-transcriptional regulation by sRNAs is a global phenomenon has raised many interesting questions and speculations regarding their roles in the cellular regulatory networks. It was suggested that it would be cost-effective for the cell to use this mode of regulation, because these molecules are small and are not translated, and therefore the energetic cost of their synthesis is smaller in comparison to synthesis of regulatory proteins (Altuvia and Wagner, 2000). The ease of synthesis led to the suggestion that it would be beneficial for the cell to use these molecules for quick responses to environmental stresses. In this paper we describe this regulatory mechanism by dynamical simulations, and analyze quantitatively these intuitive conjectures. Furthermore, we compare the properties of post-transcriptional regulation by sRNA–mRNA base pairing to those of transcriptional regulation by protein–DNA interaction and post-translational regulation by protein–protein interaction. We show that there are measurable differences between the three regulation modes and describe the situations when regulation by sRNA is advantageous. The interactions between molecules within the cell can be described as a network in which nodes represent genes (or their products) and edges represent the interactions between them. Recently, a considerable effort has been put in deducing the structure of these networks from experimental data, aiming at a systematic understanding of regulation mechanisms and cell function (Milo et al, 2002; Shen-Orr et al, 2002; Yeger-Lotem et al, 2004). Here we describe the network of post-transcriptional regulation by sRNAs in E. coli, where nodes represent either sRNA genes or their targets, and edges point from sRNA genes to their targets. By integrating this network with the transcription regulation network, we discover intriguing regulatory circuits involving both transcriptional regulation and post-transcriptional regulation. The different properties of transcription regulation and regulation by sRNAs have important implications in these mixed regulatory circuits. We demonstrate this by comparing analogous feed-forward loops that are either composed of transcription regulation per se or involve also regulation by sRNA. Results and discussion We analyze different types of regulation of gene expression mediated by three different interaction types, protein–DNA, protein–protein and sRNA–mRNA. To this end we described the regulatory mechanisms involving these interactions by mathematical models, followed by simulations, using average kinetic parameters based on experimental data (Altuvia et al, 1997; Altuvia and Wagner, 2000; Alon, 2006). We distinguished between two scenarios. In the first scenario, we assumed that the products of both the regulated gene (target) and the regulator are already present in the cell when an external signal turns on the regulation. In the second scenario, the target protein is already present when an external signal turns on the synthesis of the regulator. For both scenarios we compared the kinetics of regulation mediated by protein–DNA, protein–protein or sRNA–mRNA interaction. We describe in some detail the modeling of regulation by sRNA. Let the sRNA transcription rate be gs (molecules/second), and the target mRNA transcription rate be gm (molecules/second). The target mRNAs are translated into proteins at a rate gp. The degradation rates are ds, dm and dp, for the sRNAs, mRNAs and proteins, respectively. The sRNA base pairs with the target mRNA at a rate α. The base pairing blocks the binding of the ribosome to the mRNA, thus negatively regulating translation. This system is described by the following rate equations: ![]() where Ns, Nm and Np are the number of sRNA, mRNA and protein molecules per cell, respectively. In the analysis below, these equations are solved by direct numerical integration starting from suitable initial conditions, as specified. A similar model was recently used for the analysis of regulation by the sRNA RyhB (Levine et al, 2007). Analogous equations are used in the analysis of transcriptional regulation by protein–DNA interaction and post-translational regulation by protein–protein interaction. The parameters used in the simulations are based on experimental measurements in E. coli (Altuvia et al, 1997; Altuvia and Wagner, 2000; Alon, 2006). The transcription rate of mRNAs was taken to be gm=0.02 (molecules/second). Based on the high abundance of sRNAs, we assumed an average transcription rate of gs=1 (molecules/second), 50 times faster than that of mRNAs. The high abundance of sRNAs may be due to duplicated copies of their genes (Wilderman et al, 2004), strong promoters or high stability (Altuvia and Wagner, 2000). This difference in transcription rates is supported by experimental results obtained with oxyS (Altuvia et al, 1997). The translation rate was taken as gp=0.01 (s−1). The degradation rates for sRNAs, mRNAs and proteins were taken as ds=0.0025, dm=0.002 and dp=0.001 (s−1), respectively. The rate constants for binding of sRNA to mRNA, regulatory protein to promoter and protein to protein were all taken as α=1 (s−1/molecule). It should be noted that we ran the simulations for a range of biologically relevant parameters around these average values and obtained similar conclusions. In Figure 1
The two panels in Figure 1 When both the regulator and the target are present in the cell, protein–protein interaction provides the fastest response to the external stimulus (Figure 1A We now turn to analyze the second scenario, in which the regulator is produced in response to the external signal while the target protein is already present. In case of transcription regulation, the regulation process remains virtually the same as in Figure 1A Another difference between the various regulation mechanisms is considered below. In case of transcriptional regulation, a single bound repressor is sufficient to shutdown the expression of the target gene. In this case, the regulation effectiveness does not depend on the transcription rate of the target gene. It depends only on the production rate of the regulatory protein and on its binding/dissociation rates to the promoter of the target. Thus, with suitable binding/dissociation rates, transcriptional regulation enables using a protein of low concentration to regulate a protein of high concentration. In case of protein–protein interaction, the regulation effectiveness is determined by the relative production rates of the regulator and target proteins. If the production rate of the regulatory protein is faster than that of the target protein, the regulation will be very effective. On the other hand, when the production rates of these two proteins are comparable, it enables fine-tuning of the regulation strength, which is not possible in transcriptional regulation. A similar property characterizes regulation by sRNA. The regulation effectiveness strongly depends on the relative production rates of the sRNA and the target mRNA. Since the rate of production of sRNAs is up to two orders of magnitude faster than of typical mRNAs, it enables effective regulation. It also enables a single sRNA-encoding gene to regulate dozens of other genes. As long as the sRNA is produced at a faster rate than the combined production rate of all the target mRNAs, the regulation is strong. It gradually weakens when the combined production rate of the target mRNAs exceeds that of the sRNA. As an example, we consider an sRNA-encoding gene that regulates n other genes. In this case, the rate equations shown above are modified such that the second and third equations are copied into n equations, accounting for the number of sRNA molecules and the number of protein molecules of each of the n target genes. In addition, the first equation is modified such that Nm is replaced by the total number of mRNA molecules of all the target genes. For simplicity, the parameter values of all the target genes are taken to be identical. In Figure 2
Kinetic studies indicated that the sRNA–mRNA complexes might dissociate back into their original components (Argaman and Altuvia, 2000; Wagner et al, 2002), with dissociation rates γ in the range between 0.02 and 0.1 s−1, which is much faster than the degradation rate of the complex. To address this additional scenario, we added one more equation to the model, which accounts for the copy number Nx of the complex. This equation takes the form dNx/dt=αNsNm−(dx+γ)Nx, where dx is the degradation rate of the complex. For simplicity, we chose the degradation rate of the complex to be equal to that of the free mRNA molecule, namely dx=dm. In addition, we added the term +γNx to the equations that describe the time derivatives of Ns and Nm. As the dissociation rate increases, the regulation effectiveness is reduced. As a result, there are more mRNA molecules available for translation into proteins, and the protein level increases. In Figure 3
Another post-transcriptional regulation mechanism is manifested by mRNA-binding proteins (or metabolites). The rate equations describing this kind of regulation are similar to those describing regulation by sRNA. However, unlike sRNAs, the regulatory proteins do not degrade together with the mRNA. As a result, a smaller copy number of regulatory proteins are sufficient in order to provide strong negative regulation at steady state. However, the transient dynamics of this type of regulation is the same as shown in Figure 1 We now consider the recovery of the target gene after the transcription of the regulator is turned off. For concreteness, we focus on regulation by sRNAs, where a single target gene is regulated. We assume that the binding of the sRNA to mRNA is fast, and that the sRNA–mRNA complex does not dissociate. In this analysis, the initial copy number of sRNAs is given by the steady-state result of the rate equation, namely Ns=(gs−gm)/ds. It then decreases according to dNs/dt=−dsNs−gm, giving rise to Ns(t)=(gse−dst−gm)/ds. The translation of the target proteins will resume when all the sRNA molecules are removed at time t=ln(gs/gm)/ds, denoted as the recovery time. Our simulations show that for the same parameters as in Figure 1
Network view of sRNA–target interactions To establish the framework of our analysis, we described and analyzed in the previous section regulation of a gene as an isolated event. However, regulation of gene expression in response to external stimuli is often achieved by more complex regulatory patterns, involving various types of regulatory interactions. In recent years the transcription regulation networks and protein–protein interaction networks were analyzed in an attempt to identify and characterize such regulatory patterns (Milo et al, 2002; Shen-Orr et al, 2002; Mangan and Alon, 2003; Yeger-Lotem et al, 2004; Zhang et al, 2005). Likewise, it is interesting to examine the network of post-transcriptional regulation by sRNAs and study its structure. We compiled from the literature and from the NPInter database (Wu et al, 2006) regulatory interactions between sRNAs and targets based on experimental evidence (Figure 5
We next integrated the post-transcriptional regulatory network by sRNAs with the transcription regulatory network, in search of mixed regulatory patterns involving the two modes of regulation. For this analysis we used the transcription regulation network based on RegulonDB (Salgado et al, 2006) and on the literature, including 2861 regulatory interactions. Transcription regulation interactions between regulatory proteins and sRNA genes, either direct or indirect, were compiled from the literature (Supplementary information). Since at present the network of regulation by sRNAs is very limited, it is too early to examine the statistical significance of various mixed regulatory circuits in the integrated network, as done earlier for other integrated networks (Yeger-Lotem et al, 2004). Instead, we looked for mixed regulatory circuits of biological meaning (Figure 6
Mixed feed-forward loops As described above for isolated regulatory interactions, it is intriguing to understand the mechanistic differences between a feed-forward loop that contains both transcriptional regulation and post-transcriptional regulation by sRNA, and one that involves only transcriptional regulation. In the analysis below we focus on one type of feed-forward loop shown in Figure 7
A feed-forward loop consists of gene a, whose product A regulates gene c both directly and indirectly through a B regulator encoded by gene b. This module was shown to be superior to direct regulation alone in both regulation efficiency and tolerance to noise (Mangan and Alon, 2003; Mangan et al, 2003, 2006). Several versions of the feed-forward loop were described, including coherent circuits in which the two regulation paths are both positive or both negative, as well as incoherent circuits in which one of them is positive and the other is negative. The circuit analyzed here is a coherent circuit in which in both paths gene c is negatively regulated. The standard feed-forward loop of this type consists only of transcriptional regulations (circuit I in Figure 7 Regulation by sRNA provides further variation to this type of feed-forward loop. One possibility is that gene b encodes an sRNA that negatively regulates gene c (circuit II in Figure 7 A special property of both feed-forward loops that involve sRNAs is that the negative regulation of gene c is carried out simultaneously at two different levels, transcriptional and post-transcriptional. For example, in circuit II of Figure 7 Implicitly, the mixed feed-forward loop that includes sRNA (circuits II and III in Figure 7 Our analysis may be extended to other types of feed-forward loops and other types of regulatory modules, some of which have already been identified in both pro- and eukaryotes. One example of the Fur-RyhB negative mixed feedback loop in E. coli is demonstrated in Figure 6B Conclusions Previous studies speculated that non-coding RNAs would provide an efficient mode of regulation (Guillier et al, 2006), which is manifested by fast responses of the target gene to an external stimulus, and also by fast recovery after removal of the stimulus. Our mathematical modeling and simulations support these conjectures for a wide range of parameters, and provide additional insights. When considering only transcription regulation by regulatory proteins and post-transcriptional regulation by sRNAs, it is evident from Figure 1 The effectiveness of the regulation by sRNA depends on its production rate relative to the production rates of the target mRNAs. Appropriate relations between these two values may allow a single sRNA-encoding gene to regulate many genes, as has indeed been observed experimentally (e.g., Altuvia et al, 1997; Masse et al, 2005). By taking into account the valid range of these parameters in E. coli, we may conclude that such a simultaneous regulation will be effective for only a few dozens of genes (Figure 2 Despite the small number of known targets of sRNAs in E. coli, our integrative analysis of the transcriptional and post-transcriptional regulation networks has identified mixed regulatory circuits involving combinations of the two levels of regulation. Particularly interesting are the mixed feed-forward loops. These feed-forward loops comprise both a repressor and an sRNA (both regulating the same target), and thus provide a means to guarantee the shutdown of the target gene (Figure 7 While transcription regulation involves recognition between amino acids and bases, and protein interaction is determined by recognition between amino acids, regulation by sRNAs involves, in many of the studied cases, base pairing with the mRNA of the target gene. Hence, at least intuitively, it seems that evolutionary design of sRNAs that will regulate target genes by base pairing should be simpler than the evolution of the other regulatory molecules (Eddy, 2001). This evolutionary advantage of sRNAs along with their other properties, implied by the above simulations, may suggest why these molecules are so widespread in all kingdoms of life. Materials and methods The analyses were carried out using rate equation models. These equations account for the concentration (average number of molecules per cell) of each component in the circuit, namely mRNA and sRNA molecules, free proteins and proteins that are bound to the promoter site. The model consists of a set of coupled ordinary differential equations, each equation evaluates the time derivative of the concentration of one type of molecule. The model is based on several assumptions made in order to simplify the equations and their analysis. One assumption is that the binding rates of pairs of molecules are diffusion-limited. The transcription rate constants gm and gs incorporate all the molecular processes involved in the transcription of the mRNA and sRNA molecules, respectively. The simulation is Markovian, in the sense that it does not include any time delays. A similar assumption regards the translation rates. The sRNA network in Figure 5 Supplementary Information Click here to view.(214K, doc) Supplementary Information Click here to view.(98K, doc) Acknowledgments This work was supported by a grant from the Center for Complexity Science founded by the Horowitz Association (granted to HM, SA and OB), by the Israeli Ministry of Science Grant 3/2559 and by BACRNA, a Specific Targeted Research Project supported by European Union's FP6 Life Science, Genomics and Biotechnology for Health, LSHM-CT-2005-018618 (granted to SA and HM), and by grants from the Israeli Cancer Research Foundation and the Binational US-Israel Science Foundation (granted to HM). References
|
PubMed related articles
Your browsing activity is empty. Activity recording is turned off. |
|||||||||||||||||
Annu Rev Biochem. 2005; 74():199-217.
[Annu Rev Biochem. 2005]Trends Genet. 2005 Jul; 21(7):399-404.
[Trends Genet. 2005]Annu Rev Biochem. 2005; 74():199-217.
[Annu Rev Biochem. 2005]Nucleic Acids Res. 2003 Apr 1; 31(7):1813-20.
[Nucleic Acids Res. 2003]Proc Natl Acad Sci U S A. 1998 Oct 13; 95(21):12462-7.
[Proc Natl Acad Sci U S A. 1998]Mol Microbiol. 2001 Mar; 39(5):1382-94.
[Mol Microbiol. 2001]Proc Natl Acad Sci U S A. 2000 Aug 29; 97(18):9824-6.
[Proc Natl Acad Sci U S A. 2000]Science. 2002 Oct 25; 298(5594):824-7.
[Science. 2002]Nat Genet. 2002 May; 31(1):64-8.
[Nat Genet. 2002]Proc Natl Acad Sci U S A. 2004 Apr 20; 101(16):5934-9.
[Proc Natl Acad Sci U S A. 2004]Cell. 1997 Jul 11; 90(1):43-53.
[Cell. 1997]Proc Natl Acad Sci U S A. 2000 Aug 29; 97(18):9824-6.
[Proc Natl Acad Sci U S A. 2000]PLoS Biol. 2007 Sep; 5(9):e229.
[PLoS Biol. 2007]Cell. 1997 Jul 11; 90(1):43-53.
[Cell. 1997]Proc Natl Acad Sci U S A. 2000 Aug 29; 97(18):9824-6.
[Proc Natl Acad Sci U S A. 2000]Proc Natl Acad Sci U S A. 2004 Jun 29; 101(26):9792-7.
[Proc Natl Acad Sci U S A. 2004]Genes Dev. 1990 Dec; 4(12A):2202-9.
[Genes Dev. 1990]Genes Dev. 2003 Oct 1; 17(19):2374-83.
[Genes Dev. 2003]Mol Microbiol. 1996 Jun; 20(5):911-7.
[Mol Microbiol. 1996]Cell. 1997 Jul 11; 90(1):43-53.
[Cell. 1997]Proc Natl Acad Sci U S A. 2000 Aug 29; 97(18):9824-6.
[Proc Natl Acad Sci U S A. 2000]Cell. 1997 Jul 11; 90(1):43-53.
[Cell. 1997]Curr Opin Microbiol. 2007 Apr; 10(2):140-5.
[Curr Opin Microbiol. 2007]J Bacteriol. 2004 Oct; 186(20):6679-80.
[J Bacteriol. 2004]J Mol Biol. 2000 Jul 28; 300(5):1101-12.
[J Mol Biol. 2000]Adv Genet. 2002; 46():361-98.
[Adv Genet. 2002]Science. 2002 Oct 25; 298(5594):824-7.
[Science. 2002]Nat Genet. 2002 May; 31(1):64-8.
[Nat Genet. 2002]Proc Natl Acad Sci U S A. 2003 Oct 14; 100(21):11980-5.
[Proc Natl Acad Sci U S A. 2003]Proc Natl Acad Sci U S A. 2004 Apr 20; 101(16):5934-9.
[Proc Natl Acad Sci U S A. 2004]J Biol. 2005; 4(2):6.
[J Biol. 2005]Nucleic Acids Res. 2006 Jan 1; 34(Database issue):D394-7.
[Nucleic Acids Res. 2006]Proc Natl Acad Sci U S A. 2004 Apr 20; 101(16):5934-9.
[Proc Natl Acad Sci U S A. 2004]Genes Dev. 2006 Sep 1; 20(17):2338-48.
[Genes Dev. 2006]Proc Natl Acad Sci U S A. 2003 Oct 14; 100(21):11980-5.
[Proc Natl Acad Sci U S A. 2003]J Mol Biol. 2003 Nov 21; 334(2):197-204.
[J Mol Biol. 2003]J Mol Biol. 2006 Mar 10; 356(5):1073-81.
[J Mol Biol. 2006]Proc Natl Acad Sci U S A. 2003 Oct 14; 100(21):11980-5.
[Proc Natl Acad Sci U S A. 2003]Proc Natl Acad Sci U S A. 1998 Oct 13; 95(21):12462-7.
[Proc Natl Acad Sci U S A. 1998]Mol Microbiol. 2001 Mar; 39(5):1382-94.
[Mol Microbiol. 2001]Mol Microbiol. 2007 Jun; 64(5):1260-73.
[Mol Microbiol. 2007]Cell. 2005 Dec 2; 123(5):819-31.
[Cell. 2005]Phys Rev Lett. 2006 May 12; 96(18):188101.
[Phys Rev Lett. 2006]Phys Rev E Stat Nonlin Soft Matter Phys. 2005 Sep; 72(3 Pt 1):031908.
[Phys Rev E Stat Nonlin Soft Matter Phys. 2005]Nat Genet. 2004 Feb; 36(2):147-50.
[Nat Genet. 2004]Genes Dev. 2006 Sep 1; 20(17):2338-48.
[Genes Dev. 2006]Cell. 1997 Jul 11; 90(1):43-53.
[Cell. 1997]J Bacteriol. 2005 Oct; 187(20):6962-71.
[J Bacteriol. 2005]Nat Rev Genet. 2001 Dec; 2(12):919-29.
[Nat Rev Genet. 2001]Genome Res. 2003 Nov; 13(11):2498-504.
[Genome Res. 2003]