- Journal List
- Biophys J
- v.89(6); 2005 Dec
- PMC1366954

# Model-Driven Designs of an Oscillating Gene Network

## Abstract

The current rapid expansion of biological knowledge offers a great opportunity to rationally engineer biological systems that respond to signals such as light and chemical inducers by producing specific proteins. Turning on and off the production of proteins on demand holds great promise for creating significant biotechnological and biomedical applications. With successful stories already registered, the challenge still lies with rationally engineering gene regulatory networks which, like electronic circuits, sense inputs and generate desired outputs. From the literature, we have found kinetic and thermodynamic information describing the molecular components and interactions of the transcriptionally repressing *lac*, *tet*, and *ara* operons. Connecting these components in a model gene network, we determine how to change the kinetic parameters to make this normally nonperiodic system one which has well-defined oscillations. Simulating the designed *lac-tet-ara* gene network using a hybrid stochastic-discrete and stochastic-continuous algorithm, we seek to elucidate the relationship between the strength and type of specific connections in the gene network and the oscillatory nature of the protein product. Modeling the molecular components of the gene network allows the simulation to capture the dynamics of the real biological system. Analyzing the effect of modifications at this level provides the ability to predict how changes to experimental systems will alter the network behavior, while saving the time and expense of trial and error experimental modifications.

## INTRODUCTION

Gene regulatory networks are important for sensing a wealth of molecular signals in a biological system and for generating a specific output that has either naturally evolved to increase the fitness of the organism or has been designed by engineers to serve a specific function (1,2). However, it is still not well understood how interactions at the molecular level influence the system's dynamic behavior. To understand how these interactions affect the protein production of the system, it is essential that models are developed that account for all the known components at the molecular level. Specifically, rational engineering of such inducible gene networks requires intervention at two levels:

- The level of network connections, where DNA binding proteins regulate the expression of specific genes by either activation or repression. By combining simple regulatory interactions, such as negative and positive feedback and feed forward loops, one may create more intricate networks that precisely control the production of protein molecules, such as bistable switches, oscillators, and filters. In the laboratory, these networks can be created using existing libraries of regulatory proteins and their corresponding operator sites.
- The level of molecular components, which describes the kinetics and strengths of the protein-DNA, protein-RNA, and protein-protein interactions within the system. The dynamical behavior of the system is a complex function of the kinetic interactions of the components. By altering the characteristics of the components, such as DNA-binding proteins and their corresponding DNA sites, one can modify the system's dynamical behavior without modifying the network level connections. In the laboratory, the DNA sequences that yield the desired characteristics of each component can be engineered to achieve the desired protein-protein, protein-RNA, or protein-DNA binding constants and enzymatic activities.

The large number of components and interactions involved in dynamic gene regulation requires computational modeling, since the cost of experimentally changing these components and the kinetics of their interaction is large. Computer simulations enable exhaustive searches of different network connectivities and molecular thermodynamic/kinetic parameters, greatly advancing the development of simple rules, or design principles, that seek to simplify the complicated behavior of the network into a brief, usable framework. In this work, we present design rules for constructing a robust oscillating gene network, or repressillator. Our objective is to quantify the effects of the molecular level interactions on the period and variability of the oscillating protein concentrations. Oscillatory gene networks are prevalent in nature, especially ones that generate circadian rhythms (3,4). Given the naturally oscillatory levels of many chemicals in the body, it will be useful to have well-controlled oscillating protein levels for use in applications such as chronopharmaceutics (5).

Previous work has studied naturally oscillating systems using deterministic models and techniques from the study of nonlinear dynamics, including bifurcation analysis. The *Drosophila* circadian rhythm (6), a simplified model of circadian rhythm combining positive and negative regulation (7,8), and the entrainment of a synthetic oscillator to a bacterial cell cycle (9) have been mathematically modeled and analyzed. However, deterministic methods make multiple approximations on the continuity and differentiability of the reaction events that occur within a biological system. These approximations have been shown to be invalid for many biological processes, especially gene expression (10). Here, we describe the system's dynamics using a fully stochastic representation and use stochastic simulations to compute an ensemble of trajectories.

Following the work of Elowitz and Leibler (1), we connect three genes in a cycle of negative feedback loops. The protein products repress the expression of the gene next in sequence, creating the possibility of sustained oscillations. We quantitatively model the repressilator by extracting components from the well-characterized *lac*, *tet*, and *ara* operons, using kinetic parameters from the literature (11–17), and inserting them in the network configuration of the repressilator. Besides the natural components, a wide variety of *lac* and *tet* DNA sites and repressor proteins have been created through extensive mutatagenesis, each with altered kinetic characteristics (18–21).

There are two main differences between our approach and previously developed models. We use a detailed, mechanistic model of the components and interactions that constitute bacterial transcription and translation, including transcriptional and translational elongation and the binding of proteins to individual DNA sites. We include all protein-protein interactions, including dimerization and tetramerization reactions. We do not simplify the model by reducing multiple biomolecular interactions to a few functional groups (in an extreme example we could use a model in which DNA produces RNA which produces protein, using phenomenological transcription and translation kinetic constants). We describe the processes of transcriptional and translational elongation as a Γ-distributed event with the rates of elongation of 30 nucleotides per second and 33 amino acids per second, respectively. We also simulate the dynamics of the network using a hybrid stochastic-discrete and stochastic-continuous algorithm. The models capture the behavior of single cells more accurately than deterministic kinetics, which require that the system be at the thermodynamic limit. Since species such as promoter or operator sites may only be present in single molecule quantities, their concentrations may not be modeled as continuous. Instead, our hybrid stochastic algorithm correctly simulates a coupled jump and continuous Markov process describing, respectively, the dynamics of the stochastic-discrete reactions and the stochastic-continuous reactions. A purely Langevin approach would incorrectly approximate the discrete-stochastic reactions, such as repressor proteins binding to individual DNA sites. It is rather straightforward to develop a deterministic-continuous model of biomolecular interactions that will capture existing experimentally measured concentration profiles. It is more challenging to use a model with sufficient detail, in terms of specific molecular species and interactions, to generate design rules for new gene regulatory networks which can then be directly tested in the lab. The disadvantage of our approach is that the mathematical analysis of the dynamics of the system is numerically performed and requires a large number of simulations. Further, in contrast to deterministic nonlinear dynamics, the theory behind the stability and bifurcation analysis of discrete or continuous stochastic systems is far less developed. To offset this disadvantage, we use techniques from the design of electronic circuits, such as the cyclic covariance function, to compute the periods of stochastic limit cycles.

In this work, we propose multiple configurations of network connectivities and their kinetic parameters that result in a robust repressilator. We begin the design work with two configurations, each creatable from currently existing molecular parts, and show that these configurations do not result in sustained oscillations. We then pinpoint multiple mutations which will exhibit more robust oscillations. Finally, we perform a sensitivity analysis of each design parameter of the system, showing the effects of the number of operators, the operator-repressor affinities, the mRNA and protein half-lives, and the numbers of ribosome and RNA polymerase on the period of oscillation. The information should be useful to any synthetic biologist hoping to construct an oscillating gene network.

## MODELS AND METHODS

### Hybrid stochastic simulation algorithm

For a system of *N* species *S*_{i} (*i* = 1,…,*N*) reacting via *M* reaction pathways R_{j} (*j =* 1,…,*M*) in a volume *V* we define the following:

- The state vector of the system as the
*N*-dimensional vector(*X**t*)*=*{*X*_{1}(*t*),…,*X*_{N}(*t*)}, where*X*_{i}is the number of molecules of species*S*_{i}at time*t.* - The
*M*×*N*stoichiometric reaction matrix,, where*ν**ν*_{ij}is the change in the number of*S*_{i}molecules produced by a reaction R_{j}. *a*_{j}()*x**dt*= the probability that a reaction R_{j}will occur somewhere in the system volume in the time interval [*t*,*t*+*dt*]. We can write*a*_{j}=*h*_{j}*k*_{j}, where*h*_{j}is the number of possible combinations of the reacting molecules in R_{j}and*k*_{j}is the reaction rate constant.

The original stochastic simulation algorithm (SSA) of Gillespie (22) exactly simulates trajectories of a jump Markov process described by the Master equation. Improved variants have incrementally decreased the computational cost (23,24) while retaining the jump Markov description. However, because the computational cost scales with the number of reaction occurrences, systems with one or more “fast” reactions become costly to simulate as a jump Markov process. In the case of the repressilator, many biomolecular interactions, especially the protein dimerization ones, are considered “fast” and require extensive computational time to execute their individual events. By assuming these fast reactions occur continuously, we can convert their mathematical representation to a continuous Markov process and describe their dynamics with a system of chemical Langevin equations (CLEs) (25). The result is a system of Itô stochastic differential equations (SDEs) with multiple multiplicative noises, or

where *a*^{f} is the fast reaction propensities, *W* is an *M*^{fast} dimensional Wiener process, and *ν* is correspondingly altered to include only the fast reactions.

The challenge lies with integrating a discrete-stochastic model, like the SSA, with a continuous stochastic model, like the CLE. We have recently developed a hybrid stochastic algorithm that combines the two and significantly outperforms the SSA while retaining its accuracy (26). The algorithm partitions the system into subsets of fast/continuous and slow/discrete reactions, uses the CLE to describe the effects of the fast reactions, and solves for slow reaction times with a system of differential Jump equations, which are

where *R*_{j} is called the reaction residual and URN is a uniform random number between zero and one. The differential Jump equations are solved by randomly selecting their negative initial conditions, integrating them forward in time, and monitoring the zero crossings of the reaction residuals. The *j*th slow reaction occurs at time *τ*_{j} when *R*(*τ*_{j}) = 0. The system of Jump equations are also SDEs because they are coupled to the CLE via the state vector, ** X**(t). By using a stochastic numerical integrator, such as the Euler-Maruyama or Milstein methods, one can solve the coupled system of SDEs and determine the global error of both the CLEs and the slow reaction times. The method has been shown to be accurate and can be many orders of magnitude faster than the original SSA when one or more fast reactions exist. We use this hybrid stochastic simulation method to compute the stochastic dynamics of the repressilator gene network. For each set of kinetic parameters, at least 100 independent trajectories are computed.

### Quality of oscillations

Since gene expression is a stochastic process, single cells may exhibit widely different behaviors, even if they somehow begin with the same initial conditions. Because of internal noise, oscillations in protein or mRNA molecules will have fluctuating periods, amplitudes, and phases. To make these oscillations useful for some purpose, the gene network must be designed so as to minimize the fluctuations in the period and amplitude. To quantitatively characterize the stochasticity of oscillations, we use a method taken from the design of electronic circuits. We assume the oscillating protein signals are a cyclostationary signal and use the Fourier transform of their autocorrelation functions to compute the average and standard deviation of the periods of oscillation. The method works well even when an oscillating protein signal is partially masked by background stochasticity.

The oscillatory concentrations of the molecules of species *S*_{k} are described as a cyclostationary signal. With *X*_{k}(*t*) being a discrete-index random process, we can define the mean of this time series as *μ*_{x}(*t*): = *E*{*X*_{k}(*t*)} and the covariance *c*_{xx}(*t*; *τ*): = *E*{[*X*_{k}(*t*) − *μ*_{x}(*t*)][ *X*_{k}(*t* + *τ*) − *μ*_{x}(*t* +*τ*)]}. The signal *X*_{k}(*t*) is then called cyclostationary if there exists an integer *q* such that *μ*_{x}(*t*) = *μ*_{x}(*t* + *lq*) and *c*_{xx}(*t*; *τ*) = *c*_{xx}(*t* + *lq*; *τ*) . To best determine the period of oscillation, the cyclic correlation function, *C*_{xx}, can be computed as follows (27)

where *T* is the number of data points, *τ* is taken to be 0, and *j* is the square root of −1. The cycle parameter, *α*, is 2*πn*/*P*, where *P* is the period of oscillation and *n* is an integer number. A plot of *C*_{xx} versus *α* gives peaks corresponding to the most dominant periods of oscillation in the signal *X*_{k}(*t*). There will always be a dominant peak at *α* = 0 because *n* may be 0, representing the infinite period. For each simulation trial and each oscillating protein species, the dominant nonzero peak, corresponding to *n* = 1, is chosen and is used to determine the period. Additional peaks will appear in harmonic multiples, corresponding to larger values of *n*. The reported period is an average over all trials. The standard deviation of the period is based on the standard deviation of the *α*-values, such that *σ*_{P} = 2*πσ _{α}*/

*α*

^{2}, where the

*α*used is the average value over the trials. The cyclic correlation curves shown in this work are obtained by averaging over all the trials and are normalized so that the highest amplitude is one. A system exhibiting sustained oscillations with few fluctuations in its period will produce a well-defined

*α*-peak in the averaged cyclic correlation functions.

### The *lac-tet-ara* gene network

The *lac-tet-ara* system is an experimentally realizable gene network with many possible combinations of individual molecular components. The network connections are constructed so that sustained oscillations are possible, but not guaranteed. The production of LacI monomers is repressed by AraC_{2} proteins bound to promoter-overlapping I_{1}/I_{2} sites, the production of TetR monomers is repressed by LacI_{4} tetramers bound to one or more promoter-overlapping *lac* operators, and the production of AraC monomers is repressed by TetR_{2} dimers bound to one or more promoter-overlapping *tet* operators.

The *lac*, *tet*, and *ara* operators may be moved, replicated with one or more adjacent copies, or replaced with mutant variants. The 5′ untranslated region (5′ UTR) of the repressor mRNAs may be altered to increase or decrease their degradation rates. The repressor proteins may also be fused with ssrA peptides to increase their degradation rates. One configuration, consisting of one wild-type operator regulating each gene and using wild-type repressor proteins and mRNAs, is shown in Fig. 1. Its corresponding mechanistic system of reactions is detailed in Table 1. Although there are many possible configurations, we will first focus on ones that are synthesizable from currently available molecular components, including ones using the inducible promoters created by Bujard (28,29) and another using a simpler, single operator design.

*lac*-

*tet*-

*ara*oscillating gene network. Below, the sequences of the promoter regions, using a single, promoter-overlapping operator per gene.

When replacing wild-type DNA sites, proteins, or mRNA molecules with mutant variants, we modify the system of reactions with the altered kinetics. However, if we replicate one or more copies of an operator and place them adjacent to an existing one, we instead add more reactions to the system, including the new operators' interactions with repressors and RNA polymerases. A brief list of experimentally well-characterized mutant DNA sites and repressor proteins from the *lac* and *tet* operons is shown in Table 2. By using a detailed mechanistic model, we can accurately simulate the effects of adding new genetic elements without making possibly invalid approximations.

*lac*and

*tet*operons

To connect the simulation results to experimentally observable phenotypes, the coding sequence for a fluorescent protein, such as green fluorescent protein (GFP), yellow fluorescent protein (YFP), or cyan fluorescent protein (CFP), is bicistronically added after each repressor coding sequence. The production of each fluorescent protein will be under the same control as the corresponding repressor and their concentration may be quantitatively measured, at the single cell level, with optical microscopy. In the model, the half-lives of the fluorescent proteins, which are also fused with the ssrA peptides, are kept constant at 30 min. For future applications, the fluorescent proteins may be replaced with other functional proteins, such as enzymes. We will refer to the fluorescent proteins as Dlac, Dtet, and Dara, referring to the repressor protein with which they are coexpressed.

### Assumptions

There are a number of assumptions in the presented mathematical model. The reaction volume is considered to be a homogenous, well-stirred medium. The velocity distributions of all species must be a Maxwell-Boltzmann distribution after each reaction, requiring that many of the species' collisions are nonreacting and only aid in reaching the well-stirred approximation. Furthermore, it is assumed that other genes expressed in the reaction volume do not interfere with the studied gene networks, or that any of these effects are accounted for in the experimentally measured kinetic parameters. It is also assumed that there is no significant DNA binding of the monomer forms of LacI, TetR, and AraC and of the dimer form of LacI. Additionally, the small molecules lactose, tetracycline, and arabinose, often associated with these systems, are not included in the model. The repressor proteins are always capable of binding their cognate DNA sites and, once bound, sterically prevent the RNA polymerase from binding the promoter region. Although araC is an activator protein in the wild-type operon, one may convert it to a repressor protein by positioning its DNA binding sites so that the bound araC_{2} overlaps with the promoter region.

For all trials, the initial numbers of free and available molecules for RNA polymerases and ribosome are 270 and 900, respectively. All present promoter and operator sites are initialized at one molecule. All other present chemical species are not initially present. Cell division is a discrete event which occurs every 30 ± 4 min, generated according to a Gaussian distribution. The volume of the cell is initially 10^{−15} liters and is linearly increased. The system volume is then halved as the cell is assumed to split into two equal daughter cells. Except for those species involving DNA sites, RNA polymerase, and ribosome, the numbers of molecules in the system are halved. The reported concentrations are on a per cell basis.

## RESULTS AND DISCUSSION

### Designs using wild-type kinetics do not oscillate

We begin the design of an oscillating gene network by using only wild-type molecular components. One configuration, shown in Fig. 1, consists of only a single operator regulating each gene and uses the wild-type kinetics for the *tet*O_{2}, *lac*O_{1}, and *ara*I_{1}/I_{2} DNA sites as well as the repressor mRNAs and proteins. In Fig. 2, we show the dynamical behavior and the cyclic covariance functions of the fluorescent proteins Dlac, Dtet, and Dara. Clearly, there are no sustained oscillations. The Dlac and Dara indicator proteins are constitutively expressed, whereas the production of Dtet protein remains fully repressed. The repressing action of the lacI tetramers is too strong, whereas the repressing action of the araC dimers is too weak. The cyclic covariance functions for all three species exhibit the expected dominant peak at the center, representing the infinite period. For the Dlac and Dara species, the function quickly decreases in amplitude with an initial peak at 0.225 h^{−1} and harmonic peaks at integer intervals, corresponding to the simulation's end time of 27.7 h. The Dtet species has no dominant period but the infinite one, resulting in the bow shape arc. If the system produces sustained oscillations, the cyclic covariance function would exhibit a sharp peak with an amplitude less than the central one but larger than the peaks corresponding to the simulation's end time.

*Top*) Dynamic behavior of the Dlac, Dtet, and Dara proteins in the simple example over 27.7 h. (

*Bottom*) The cyclic covariance function of the same system.

In our next example configuration using only wild-type molecular components, we replace the regulatory regions from the simple example with a slightly modified version of the inducible promoter regions created by Lutz, Lozinski, Ellinger, and Bujard (28,29). The production of TetR monomers is regulated by P_{LlacO-1}, the production of AraC monomers is regulated by P_{LtetO-1}, and the production of LacI monomers is regulated by a modified P_{lar}. The promoter regions contain two *tet*O_{2} operators, two *lac*O_{1} operators, and a single *ara*I_{1}/I_{2} site. In the P_{lar} promoter region, the *ara*I_{1}/I_{2} sites are moved downstream so that the I_{2} site is between the −35 and −10 consensus sequences and the I_{1} site is adjacently upstream to the −35 region, causing the AraC_{2} protein to act as a repressor, and the downstream *lac* operator is removed. The resulting dynamical behavior and the cyclic covariance functions of the indicator proteins Dlac, Dtet, and Dara are similar to the results of the first example (data not shown). The production of Dlac and Dara proteins is constitutive, whereas Dtet expression is fully repressed. The system exhibits no sustained oscillations, due to the same imbalance of repression between the three genes. We choose these two example designs for a specific purpose: computational modeling is able to quickly and cheaply discard hypothetical designs that will not produce the desired behavior. These systems would often be the first choices in an attempt to build an oscillating gene network, and we have shown that they do not function as intended.

### An oscillating gene network with a 3-2-1 mutant operator configuration

We have created an asymmetric 3-2-1 operator design by modifying the Bujard promoter regions in two ways. The first is the creation of a promoter region containing three *lac* operators, combining the P_{LlacO-1} and P_{A1lacO-1} promoter regions, which regulate the production of TetR monomers. The *lac* and *tet* operators are also replaced with mutant variants. The three *lac* operators now have a decreased affinity to the LacI_{4} tetramer with a *K*_{eq} of 5.2e11 [M^{−1}] and a dissociation kinetic constant of 1.93e-3 [s^{−1}], which is similar to the *lac*O_{sym} variant. The two *tet* operators now have an increased affinity to the TetR_{2} repressor with a *K*_{eq} of 1.4e10 [M^{−1}] and a dissociation kinetic constant of 2.13e-2 [s^{−1}]. The half-life of the tetR protein is also decreased to 5 min. The *ara*I_{1}/I_{2} site is kept as the wild type. The resulting dynamical behavior over a time interval of 5.8 days is shown in Fig. 3. There are only sustained oscillations in the Dara species and, to a lesser extent, in the Dtet species. To improve the quality of oscillations, the affinity of the *ara*I_{1}/I_{2} site is increased to 5e10 [M^{−1}] with a dissociation kinetic constant of 4e-3 [s^{−1}] and the half-life of the TetR protein is reverted to its wild type. The resulting dynamical behavior and the corresponding cyclic covariance function are shown in Figs. 4 *A* and 5 *A*, respectively. Using the cyclic covariance functions, the average period of oscillations is computed as 16.2 h with a standard deviation of 4.1 h. The period of oscillations is much longer than the cell division time and is close to the period of a naturally occurring circadian rhythm. To further improve the quality of oscillations, the half-life of the TetR protein is again reduced to 10 min. The dynamical behavior of this last configuration and its corresponding cyclic covariance function is shown in Figs. 4 *B* and 5 *B*, respectively. The average period of oscillation decreases to 15.3 h with a standard deviation of 2.7 h. The amplitudes, or number of molecules at the peak of the oscillation, of all three fluorescent proteins are now roughly equal, and the variability in the period of the oscillation is decreased.

*red*) Dlac, (

*blue*) Dtet, and (

*green*) Dara proteins using the 3-2-1 operator configuration with mutant

*tet*and

*lac*operators and a TetR protein half-life of 5 min.

*A*) Dynamical behavior of the (

*red*) Dlac, (

*blue*) Dtet, and (

*green*) Dara proteins using a 3-2-1 operation design with mutant

*tet*,

*lac*, and

*ara*operators and a TetR protein half-life of 30 min. (

*B*) The same system as in

*A*, but with a TetR protein half-life

**...**

### A systematic analysis of the oscillation envelope

Although such a heuristic approach will give useful results, the power of in silico modeling lies in the ability to conduct more exhaustive searches. To determine the width of the envelope of oscillation, a more thorough exploration of the parameter space is necessary. The initial concentrations of available RNA polymerases and ribosomes and the degradation rates of all proteins and mRNAs are varied. The effect of the number of operators and the repressor-operator affinity on the period of oscillation is also investigated. To isolate the effect of the particular variable under investigation, the asymmetric wild-type *lac*, *tet*, and *ara* operons were balanced by constructing a symmetric “survey model” in the same form as described in Table 1. This survey model contained between one and three operator sites regulating the production of each repressor. The model's rate constants are symmetric and consistent with the range of available molecular components, including both wild-types and mutant variants.

The affinity of the repressor for the operator sites is investigated by varying the equilibrium binding constants in the survey model through modification of the half-lives of the bound repressor-operator complexes. Since many DNA-protein associations have forward rate constants near the diffusion limit of ~10^{8} M^{−1}s^{−1} (30), this rate constant is fixed. The degradation rate of the complex was varied so as to give affinities between 10^{13} and 10^{7} M^{−1}. This corresponds to half-lives between 19 h and 7 s. The results of the sensitivity analysis reveal the design rules for a three-gene repressilator.

The most basic of these rules is that total repression must be neither too strong, nor too weak. The symmetric survey model with three operator sites per regulated gene shows marked oscillations over a range of repressor-operator affinity of 10^{9}–10^{11} M^{−1}. The period of the oscillation also depends on this affinity—at 10^{9} M^{−1}, the period is 3.39 h, at 10^{11} M^{−1}, the period increases to 11.55 h. When one operator site is removed from each gene, the resulting two-operator model shows marked oscillations over a somewhat wider range, giving periods from 20.04 h at an affinity of 10^{12} M^{−1} to 2.94 h at an affinity of 10^{9} M^{−1}. With only a single operator per gene, the envelope of oscillation is very narrow near an affinity of 10^{11} M^{−1}, and the oscillations are irregular. Overall, higher repressor-operator affinity leads to a longer period of oscillation.

To investigate the effects of asymmetry, a similar series of models is constructed with the repressor-operator affinity fixed at 10^{10} M^{−1} for all of the operator sites of two genes, whereas the affinities of the sites on the third gene is varied. When all genes have three operator sites, the asymmetric model oscillates over a somewhat wider range of affinities than the symmetric case, giving a period of 4.93 h at 10^{9} M^{−1} (versus 3.39 h for the symmetric case) and 11.25 h at 10^{12} M^{−1} (versus no oscillations at this affinity in the symmetric case). With two active operator sites per gene, the model oscillates with periods from 3.85 to 7.78 h over this same range of affinities. With only a single operator site per gene, the system gives no regular oscillations. These results are summarized in Fig. 6.

*A*and

*B*) two operators per genes and (

*C*and

*D*) three operators per genes. (

*A*and

*C*) The affinities of only one set of operators are modified, whereas the other two sets

**...**

These models show that a system may oscillate with repressor-operator affinities that differ by up to two orders of magnitude between the regulatory regions of each gene, provided that all genes contain the same number of operator sites. Excessive asymmetry in the amount of repression quenches the oscillations. The effect of increasing period with increasing affinity is also observed, although the trend is not as strong when the operator sites of only a single gene are varied. We observe that the oscillator's period is the sum of the pulse widths of all three fluorescent protein concentrations, where the pulse width is the amount of time required for each gene to express its repressor, become repressed by another repressor, and have both the repressor's mRNA and protein degrade to zero molecules. Asymmetries in the number of operators and the repressor-operator affinity will cause asymmetric pulse widths. In Fig. 7, the model depicted at the right is a symmetric model: repressor-operator affinities at both operator sites on all three operons are 10^{10} M^{−1}. In the model at the left, the repressor-operator affinities of both operator sites on one operon were increased to 10^{12} M^{−1}. Only one marker protein is shown, as the other two were unaffected by the change.

*Left*) The dynamics of the Dlac protein from a model with two operators per gene, with one operator having a repressor-operator affinity of 10

^{12}M

^{−1}and the others with repressor-operator

**...**

Variations in the forward rate constant were also briefly investigated. In the preceding trials, forward rate had been held constant at 10^{8} M^{−1}s^{−1}. Although some repressors, including the *lac* one, are known to exhibit forward binding rates >10^{10} M^{−1}s^{−1} (30), which is greatly in excess of the diffusion limit, such systems are rare. Several models are constructed with the more conservative value of 10^{6} M^{−1}s^{−1} and are investigated over the same range of affinities and symmetries as the previous models. In general, these models do not oscillate well, even when the degradation rate of the repressor-operator complex is adjusted to give the same range of affinities. When oscillations do occur, they tend to have longer periods than systems with the same affinity but higher forward rate. For example, a three operator-per-gene symmetric model with an operator-repressor affinity of 10^{9} M^{−1} gives a period of 3.39 h and 5.89 h with forward rates of 10^{8} M^{−1}s^{−1} and 10^{6} M^{−1}s^{−1}, respectively. Therefore, the rate of forward binding of a repressor to its operator is shown to be as important as its affinity to the operator.

Other parameters of interest are also investigated. The concentrations of free, functional RNA polymerase and ribosome are difficult to experimentally control. Gene network designs should therefore be robust to variations in these values. The concentration of RNA polymerase in *Escherichia coli* is known to vary with bacterial doubling time. This quantity is not accessible to direct experimentation; however, models indicate values on the order of 100 to 1000 molecules per cell (31). Within this range, five models are evaluated. An initial concentration of RNA polymerases of 100 molecules per cell yields a period of oscillation of 5.66 h, whereas 1000 molecules per cell decreases the period to 5.33 h, which is a difference of only 6%. Variation in initial available ribosome concentration has a somewhat greater effect. With 300 ribosomes per cell, the model oscillates with a period of 4.95 h. When this number is increased to 1000 molecules per cell, the period increases by 19% to 5.89 h. Neither of these quantities leads to a loss of oscillation within the range of values tested. Interestingly, whereas increasing the number of RNA polymerase molecules in a cell leads to a decrease in period, increasing the number of ribosomes has the opposite effect, which is shown in Fig. 8.

The effect of protein degradation rate may also play a role in oscillator design. When the protein products of all three genes are assigned half-lives of 10 min, the period of oscillation is 3.57 h. When the half-lives are increased to 60 min, the period increases to 9.08 h. The same trend is observed when the half-lives of the products of two operators are held constant at 20 min and the half-lives of all of the protein products of the third operator are varied. In this asymmetric case, a half-life of 10 min leads to a period of 4.88 h (versus 3.57 h in the symmetric case) and a half-life of 60 min leads to a period of 6.39 h (versus 9.08 h in the symmetric case). Variation of protein half-life does not lead to a loss of oscillation within the range of values tested.

Finally, the effect of mRNA half-life on system oscillation is investigated. As mRNA half-life is varied from 5 min to 15 min for all three mRNA species, the period of oscillations increases from 5.30 to 6.56 h. Fixing the half-lives of two species at 5 min while varying the half-life of the third over the same range yields the same trend, but to a lesser degree. As the third half-life is varied from 5 min to 15 min, the period increases from 5.36 to 5.85 h. Fig. 9 summarizes the data.

## CONCLUSIONS

The modified *lac-tet-ara* system described here demonstrates that a gene network created from molecular parts not normally associated with each other nor naturally oscillatory can lead to the expression of a robustly periodic protein product. More interestingly, with the help of inexpensive simulations, a simple set of design rules is created that can guide a first cycle of experiments. The design rules compact the results of these simulations into a few concise statements, which may then be used to more easily construct robust oscillating gene networks.

All of the investigated design parameters affect the period of oscillation. First, increasing the repressor-operator affinity results in a longer period of oscillation. Increasing the number of operators regulating each gene also increases the period of oscillation and makes the repression of each gene more sensitive to the number of repressors, which is abstractly referred to as cooperativity. However, if the operators of the three genes differ in their affinities by larger than two orders of magnitude, then sustained oscillations are not possible. To obtain a desired period of oscillation, one may increase the number of operators and decrease the repressor-operator affinity or vice versa. However, using only one operator generally results in unsustained oscillations, and inserting more than three promoter-overlapping operators is difficult. Second, increasing the half-life of the mRNA or protein of any repressor will increase the period of oscillation. Finally, increasing the number of available RNA polymerases or ribosomes respectively decreases and increases the period of oscillation. Because the RNA polymerases directly compete with repressors for binding to promoter sites, the increase in the number of RNA polymerases has a similar effect as decreasing the repressor-operator affinity. However, because only transcriptional regulation is utilized, increasing the number of ribosomes has a similar effect as increasing the half-lives of the repressors themselves. Although experimentally altering the expression of RNA polymerase and ribosome is not practical, it is important to know how the period of oscillations will change as a result of global shifts in the metabolism of the cell. For example, the period of oscillation is predicted to change when switching from the exponential to stationary growth phases, due only to changes in RNA polymerase and ribosome numbers.

Constructing and testing the variant models described here would be enormously costly. Instead, we use stochastic simulations of a detailed mechanistic model. Although the simulations are subject to multiple assumptions, stochastic modeling of the known interactions should provide a set of verifiable and falsifiable rules. Using the results of a first cycle of targeted experiments, the model may be directly refined to correct invalid assumptions or kinetic parameters. Because the model uses a detailed, mechanistic system of reactions, it is much easier to modify the kinetic characteristics of a particular molecular interaction, which may be directly measured from experiments. Alternative models, which include the extensive use of course-grained or lumped interactions, are more difficult to modify according to new experimental data because they group together multiple biological processes whose independent actions are not fully accounted for.

In the near future, toolboxes will be created of known DNA sequences and protein molecules that exhibit a wide spectrum of well-characterized kinetic parameters. The first successful attempts are already being reported (32). The combination of these molecular components into a synthetic gene network will create novel and useful functions. As these networks become more complex, our ability to intuitively predict their behavior will fail. By using detailed mechanistic models and stochastic simulation techniques, one can more quickly determine the necessary molecular components and network connectivities that produce a desired dynamical behavior.

## Acknowledgments

This work was supported by grants from the National Science Foundation (BES-0425882 and EEC-0234112). Computational support from the Minnesota Supercomputing Institute is gratefully acknowledged. This work was also supported by the National Computational Science Alliance under TG-MCA04N033.

## References

*In*Digital Signal Processing Handbook. V. K. Madisetti and D. Williams, editors. CRC Press, Boca Raton FL. 17-1–17.31.

**The Biophysical Society**

## Formats:

- Article |
- PubReader |
- ePub (beta) |
- PDF (308K) |
- Citation

- Noise-limited frequency signal transmission in gene circuits.[Biophys J. 2007]
*Tan C, Reza F, You L.**Biophys J. 2007 Dec 1; 93(11):3753-61. Epub 2007 Aug 17.* - Hybrid stochastic simplifications for multiscale gene networks.[BMC Syst Biol. 2009]
*Crudu A, Debussche A, Radulescu O.**BMC Syst Biol. 2009 Sep 7; 3:89. Epub 2009 Sep 7.* - Transient analysis of stochastic switches and trajectories with applications to gene regulatory networks.[IET Syst Biol. 2008]
*Munsky B, Khammash M.**IET Syst Biol. 2008 Sep; 2(5):323-33.* - Complex networks and simple models in biology.[J R Soc Interface. 2005]
*de Silva E, Stumpf MP.**J R Soc Interface. 2005 Dec 22; 2(5):419-30.* - Gene regulatory network inference: data integration in dynamic models-a review.[Biosystems. 2009]
*Hecker M, Lambeck S, Toepfer S, van Someren E, Guthke R.**Biosystems. 2009 Apr; 96(1):86-103. Epub 2008 Dec 27.*

- Systematic design methodology for robust genetic transistors based on I/O specifications via promoter-RBS libraries[BMC Systems Biology. ]
*Lee YY, Hsu CY, Lin LJ, Chang CC, Cheng HC, Yeh TH, Hu RH, Lin C, Xie Z, Chen BS.**BMC Systems Biology. 7109* - Electronic Implementation of a Repressilator with Quorum Sensing Feedback[PLoS ONE. ]
*Hellen EH, Dana SK, Zhurov B, Volkov E.**PLoS ONE. 8(5)e62997* - Stochastic model reduction using a modified Hill-type kinetic rate law[The Journal of Chemical Physics. 2012]
*Smadbeck P, Kaznessis Y.**The Journal of Chemical Physics. 2012 Dec 21; 137(23)234109* - Efficient Moment Matrix Generation for Arbitrary Chemical Networks[Chemical engineering science. 2012]
*Smadbeck P, Kaznessis YN.**Chemical engineering science. 2012 Dec 24; 84612-618* - Stochastic simulations of a synthetic bacteria-yeast ecosystem[BMC Systems Biology. ]
*Biliouris K, Babson D, Schmidt-Dannert C, Kaznessis YN.**BMC Systems Biology. 658*

- PubMedPubMedPubMed citations for these articles
- SubstanceSubstancePubChem Substance links
- TaxonomyTaxonomyRelated taxonomy entry
- Taxonomy TreeTaxonomy Tree

- Model-Driven Designs of an Oscillating Gene NetworkModel-Driven Designs of an Oscillating Gene NetworkBiophysical Journal. 2005 Dec; 89(6)3873

Your browsing activity is empty.

Activity recording is turned off.

See more...