PLoS One. 2010; 5(5): e10743.
Published online May 27, 2010. doi:  10.1371/journal.pone.0010743
PMCID: PMC2877713

A Software Tool to Model Genetic Regulatory Networks. Applications to the Modeling of Threshold Phenomena and of Spatial Patterning in Drosophila

Johannes Jaeger, Editor

Abstract

We present a general methodology for building mathematical models of genetic regulatory networks. This approach is based on the mass action law and on the Jacob and Monod operon model. The mathematical models are built symbolically by the Mathematica software package GeneticNetworks. This package accepts as input the interaction graphs of the transcriptional activators and repressors of a biological process and, as output, gives the mathematical model in the form of a system of ordinary differential equations. All the relevant biological parameters are chosen automatically by the software. Within this framework, we show that concentration-dependent threshold effects in biology emerge from the catalytic properties of genes and their associated conservation laws. We apply this methodology to segment patterning in Drosophila early development, and we calibrate the genetic transcriptional network responsible for the patterning of the gap gene proteins Hunchback and Knirps along the antero-posterior axis of the Drosophila embryo. In this approach, the zygotically produced proteins Hunchback and Knirps do not diffuse along the antero-posterior axis of the embryo, and they develop a spatial pattern due to concentration-dependent thresholds. This shows that patterning at the gap gene stage can be explained by the concentration gradients of the transcriptional regulators along the embryo.

Introduction

A genetic regulatory network is an ensemble of interactions in a biological process involving proteins, genes and mRNAs. The interactions between different proteins and genes can occur through transcriptional activation and repression at the level of the genes, through protein-protein interactions, and through protein-mRNA interactions.

A genetic regulatory network is described by a graph where vertices represent genes, proteins, enzymes or other chemical substances. The edges represent transformations, e.g., phosphorylation and dephosphorylation, or activation and inhibitory actions through transcription regulators.

More precisely, a genetic regulatory network is described by a double graph, which consists of a set of vertices (or nodes) and two sets of ordered pairs of vertices, one for activations and one for repressions. Each ordered pair of vertices defines the activation or the repression mechanism of the first node over the second. In classical graph theory, the vertex set taken with each of the two edge sets defines two graphs with a common set of vertices. For example, in Figure 1, we show the graph of a genetic network associated with the production of the proteins Bicoid (BCD), Hunchback (HB), Knirps (KNI) and Tailless (TLL) in Drosophila early development, [1], [2]. In this example, the vertex set and the two edge sets are the ones drawn in Figure 1.

Figure 1
Graph describing the genetic network associated with the production of proteins Bicoid (BCD), Hunchback (HB), Knirps (KNI) and Tailless (TLL) in Drosophila early development.

The graph of Figure 1 has a clear biological meaning. It expresses the fact that BCD is a transcriptional activator of both HB and KNI, HB and KNI proteins both repress each other, and TLL is a repressor of KNI. The vertices of the graph of Figure 1 can represent mRNAs, as in the case of hb and bcd, or proteins, as in the case of BCD and TLL, or genes and proteins simultaneously, as in the case of HB and KNI.
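The double graph just described can be written down as plain data. The following Python sketch is only an illustration: the names `VERTICES`, `ACTIVATION`, `REPRESSION` and the helper `regulators_of` are assumptions made here, not identifiers of the GeneticNetworks package, and only the protein-level interactions stated in the paragraph above are encoded.

```python
# Double graph of Figure 1, restricted to the interactions stated in the text.
# All names below are illustrative and are not part of GeneticNetworks.
VERTICES = {"BCD", "HB", "KNI", "TLL"}
ACTIVATION = {("BCD", "HB"), ("BCD", "KNI")}                 # (source, target)
REPRESSION = {("HB", "KNI"), ("KNI", "HB"), ("TLL", "KNI")}  # (source, target)


def regulators_of(target, edges):
    """Return the sorted source vertices of all edges that end at `target`."""
    return sorted(src for src, dst in edges if dst == target)


for vertex in sorted(VERTICES):
    print(vertex,
          "activated by", regulators_of(vertex, ACTIVATION),
          "repressed by", regulators_of(vertex, REPRESSION))
```

Keeping the two edge sets separate mirrors the idea of two graphs sharing one vertex set, so activation and repression can be queried independently.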

Here we propose a set of rules for constructing the model equations associated with a genetic regulatory network described by a double graph. This paper is an attempt to delineate a methodological approach for the construction of mathematical models of gene expression regulation from the principles of chemical kinetics and chemical bonding. In the literature, one often finds mathematical models of biological systems described by different sets of equations and characterized by different sets of parameters that are difficult to interpret and to measure experimentally. Qualitative predictions made with these different models have limited predictive value. For a review of the different approaches see, for example, [3], [4].

In the construction of models for genetic regulatory networks, we assume that models can be built with rate equations reflecting a mean field view of the stochastic random motion occurring at the molecular scale. This mean field approach, also called the mass action law, is derived from the probabilistic collision laws occurring at the molecular scale. The models originating from this view are described by ordinary differential equations with polynomial vector fields, [5]–[7].

One of the advantages of the mass action law approach is that the mean field rate equations have a direct microscopic interpretation, being associated with the collision mechanisms that are at the origin of every reactive process. For model refinement, fluctuations can also be studied through the corresponding master equation. From the experimental point of view, microbiology techniques are strongly anchored in the mass action law or mean field approach, [5], [8].

For genetic regulatory networks described by graphs with a large number of vertices and a complex structure of edges, the rate equations describing the evolution in time of the concentrations are in general difficult to build, and are critically dependent on the assumptions made about the biological and chemical interactions involved. During the development of these complex models, it is often necessary to test different graph configurations, and to change parameters and initial conditions. Writing all this information by hand is both time-consuming and error-prone.

In order to perform these tasks automatically, we have developed two Mathematica software packages, Kinetics and GeneticNetworks, that execute the symbolic computations associated with the construction of the model equations for a genetic regulatory network. The result of the analysis is in symbolic form, and can be used in Mathematica, C or any other simulation software for further numerical integration and graphical analysis. The software packages GeneticNetworks and Kinetics are freely distributed, [9].

The Kinetics package implements the mass action law in its polynomial exact form, computing symbolically the associated rate equations and conservation laws. The parameters generated within Kinetics are chemical rate constants.

The package GeneticNetworks implements a particular model for protein-gene regulation. This model is based on the operon model of Jacob and Monod [10] in prokaryotes, and its basic properties have been previously introduced in [11]. The tools in the GeneticNetworks software package implement a simplified view of the Molecular Biology Dogma, [12], for protein encoding, translation and transcription, and are consistent with the mass action law. For eukaryotes, transcriptional regulation is a much more complex issue, involving many redundant binding sites dispersed along genomic sequences. Therefore, in this case, the modeling approach proposed here should be understood as a descriptive approximation to the poorly understood eukaryotic regulation mechanisms.

In order to obtain a dimensional reduction on the number of variables in the equations obtained by Kinetics and GeneticNetworks, it is possible to construct, by a steady state approximation, Hill's function models, [4], [8], [13].

The advantages of using the Mathematica computing environment are the possibility of (i) obtaining an exact form for the model equations; (ii) performing, if necessary, further symbolic simplifications on the models; (iii) modifying the initial theoretical assumptions of the model without having to re-introduce or choose new parameter values; (iv) doing the numerical and graphical analysis of models within the same computing environment; (v) using a natural language without a deep knowledge of programming; (vi) using an easy interface to other programming environments.

To build a model of a given genetic regulatory network, the only necessary input to GeneticNetworks is the activation and inhibition relationships between genes, mRNAs and proteins. This input is given in the form of the two sets of ordered pairs of vertices (activations and repressions) of the network graph. Then, the generation of the model equations is done with the GeneticNetworks commands. The model equations can be analysed within the Mathematica environment or introduced into other programming environments, such as COPASI, [14], and PottersWheel, [15], for simulation and parameter estimation. These programs are powerful general-purpose tools for numerically simulating solutions of ordinary differential equations and for simulating stochastic models in systems biology. At the time of writing this paper, the site of the Systems Biology Markup Language, http://sbml.org, listed more than 180 registered systems biology simulation programs.

This paper is organized as follows. In the Methods section, we briefly review the mass action law of chemical kinetics and we introduce the collision graphs associated with the mass action law. We derive the basic mass action rate equations. Special emphasis is placed on mass action conservation laws, an important feature that lies at the very foundation of threshold effects in biology. In other approaches, thresholds do not arise as emergent phenomena, but must be imposed through ad hoc regulatory functions (see for example [3] or [4, pp. 237]). We describe the mechanism of genetic regulation based on the Jacob and Monod operon model, [10], and we introduce the modeling assumptions for the construction of the mathematical models of genetic networks described by double graphs. Finally, we give an overview of the GeneticNetworks software package.

In the Results section, we show three different applications of the quantitative approach developed here. In the first application, we show, with a very simple example of auto-regulation, that the conservation law constant is a bifurcation parameter for the regulation model, inducing a concentration-dependent threshold effect in the model for the production of proteins. This solves the problem of the introduction of ad hoc threshold effects in biological simulations, [16]. In the second application, we give a genetic regulatory network inducing a localized spiky pattern along a spatial domain. In this case, the spatial spiky pattern appears without the need for other transport mechanisms, such as diffusion or advection, and is a consequence of the concentration-dependent threshold effect. In the third example, we analyze the experimental data associated with the KNI and HB inhibitory cross regulation in Drosophila early development, described by the double graph of Figure 1, and we calibrate this model with the experimental data, without the need for a diffusion hypothesis for the zygotically produced proteins HB and KNI. In the final section, we summarize and discuss the main biological conclusions of the paper.

Methods

The mass action law framework of chemical kinetics

In general, an ensemble of chemical reactions is represented by the following collision diagram,

$$\nu_{i1} A_1 + \cdots + \nu_{im} A_m \xrightarrow{\; r_i \;} \mu_{i1} A_1 + \cdots + \mu_{im} A_m$$
(1)

where $i = 1, \ldots, n$. The $A_j$, for $j = 1, \ldots, m$, represent chemical substances. The constants $\nu_{ij}$ and $\mu_{ij}$ are the stoichiometric coefficients, in general non-negative integers, and the constants $r_i$ are the rate constants. If $\nu_{ij} = \mu_{ij} > 0$, the corresponding substance $A_j$ is a catalyst and, if $\mu_{ij} > \nu_{ij} > 0$, $A_j$ is an autocatalyst. In the diagram (1), there are $m$ chemical substances and $n$ rate constants or chemical reactions.

Under the hypothesis of homogeneity of the solution where reactions occur, the mass action law asserts that the time evolution of the concentrations of the chemical substances is described by the system of ordinary differential equations,

$$\frac{dA_j}{dt} = \sum_{i=1}^{n} r_i \, (\mu_{ij} - \nu_{ij}) \, A_1^{\nu_{i1}} \cdots A_m^{\nu_{im}}$$
(2)

where $j = 1, \ldots, m$, the $\nu_{ij}$ and $\mu_{ij}$ are the stoichiometric coefficients and the $r_i$ are the rate constants of the collision diagram (1), and we use the same symbol $A_j$ to represent both the chemical substance and its concentration. The rate equations (2) are derived under the following assumptions: (i) chemical reactions, when they occur, are due to elastic collisions between the reactants, (ii) homogeneity of the reacting substances in the solution, and (iii) thermal equilibrium of the solution. All the kinetic aspects related to the dependence of the velocity of the reactions on the temperature or pressure are contained in the rate constants $r_i$. For details see [5].
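As an illustration of how rate equations follow mechanically from a list of reactions, the following Python sketch evaluates the right-hand side of mass action rate equations. It is a minimal sketch, not the Kinetics package: the function name `mass_action_rhs` and the data layout (each reaction as a rate constant plus two dictionaries of stoichiometric coefficients) are assumptions chosen here for readability.

```python
def mass_action_rhs(reactions, conc):
    """Evaluate mass action rate equations.

    reactions: list of (rate, reactants, products), where reactants and
               products map a species name to its stoichiometric coefficient.
    conc:      dict mapping every species name to its concentration.
    Returns a dict mapping each species to its time derivative.
    """
    rhs = {species: 0.0 for species in conc}
    for rate, reactants, products in reactions:
        # Reaction flux: rate constant times the product of reactant
        # concentrations raised to their stoichiometric coefficients.
        flux = rate
        for species, coeff in reactants.items():
            flux *= conc[species] ** coeff
        # Each species changes by (produced - consumed) times the flux.
        for species in set(reactants) | set(products):
            net = products.get(species, 0) - reactants.get(species, 0)
            rhs[species] += net * flux
    return rhs


# Hypothetical example: a single binary collision A + B -> C with rate 2.0.
reactions = [(2.0, {"A": 1, "B": 1}, {"C": 1})]
conc = {"A": 3.0, "B": 0.5, "C": 0.0}
rates = mass_action_rhs(reactions, conc)
```

A species with equal coefficients on both sides of a reaction gets a zero net contribution from that reaction, which is how a catalyst appears in this bookkeeping.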

At the atomic and molecular scale, chemical reactions between molecules can occur only if molecules collide or approach each other to small distances where bonding forces become meaningful. These chemical bonding forces are of electrical or quantum origin, and at distances larger than the mean free path they become less important when compared with the kinetics associated with the molecular motion. As chemical reactions only occur if the chemical substances involved collide, the vector fields associated with the right-hand side of (2) are in general quadratic, representing binary collisions. Higher order polynomial vector fields are possible but, at the microscopic level, they are associated with triple or higher order collisions, a situation that occurs with a very low probability. Therefore, we will restrict our examples to models with two-body collisions.

The equations (2) can also be written in the matrix form,

$$\frac{dA}{dt} = \Gamma \, \omega$$
(3)

where $\Gamma$ is the $m \times n$ matrix with entries $\Gamma_{ji} = \mu_{ij} - \nu_{ij}$ built from the stoichiometric coefficients of the collision diagram (1), $A = (A_1, \ldots, A_m)^T$, and,

$$\omega = \left( r_1 A_1^{\nu_{11}} \cdots A_m^{\nu_{1m}}, \; \ldots, \; r_n A_1^{\nu_{n1}} \cdots A_m^{\nu_{nm}} \right)^T$$

In general, the rows of $\Gamma$ are linearly dependent, and the equations in system (2) are not all independent. Let us denote by $k$ the rank of the matrix $\Gamma$. The dimension of the left null space of $\Gamma$ is related to its rank by $\dim \mathrm{Ker}(\Gamma^T) + k = m$ (the number of rows of $\Gamma$). Let $w_1, \ldots, w_{m-k}$ be a basis of the left null space of $\Gamma$; then $w_l^T \Gamma = 0$, for $l = 1, \ldots, m-k$. So, by (3), we have,

$$w_l^T \frac{dA}{dt} = w_l^T \, \Gamma \, \omega = 0$$
(4)

Hence, associated with the differential equations (2), we have the conservation laws,

$$w_l^T A = c_l$$
(5)

where the $c_l$ are constants fixed by the initial concentrations, and $l = 1, \ldots, m-k$.
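The derivation of conservation laws from the stoichiometry can be sketched in a few lines of Python. This is an illustrative reimplementation, not the algorithm used by Kinetics: the function `null_space`, the species ordering and the example mechanism (a reversible binding G + A ⇌ GA) are assumptions made here. Exact rational arithmetic is used so that the null vectors are not affected by rounding.

```python
from fractions import Fraction


def null_space(matrix):
    """Basis of {x : matrix x = 0}, by exact Gauss-Jordan elimination."""
    rows = [[Fraction(v) for v in row] for row in matrix]
    n_cols = len(rows[0])
    pivots = []  # pivot column of each reduced row, in order
    r = 0
    for c in range(n_cols):
        if r == len(rows):
            break
        pivot = next((i for i in range(r, len(rows)) if rows[i][c] != 0), None)
        if pivot is None:
            continue
        rows[r], rows[pivot] = rows[pivot], rows[r]
        pv = rows[r][c]
        rows[r] = [v / pv for v in rows[r]]
        for i in range(len(rows)):
            if i != r and rows[i][c] != 0:
                f = rows[i][c]
                rows[i] = [a - f * b for a, b in zip(rows[i], rows[r])]
        pivots.append(c)
        r += 1
    basis = []
    for free in (c for c in range(n_cols) if c not in pivots):
        vec = [Fraction(0)] * n_cols
        vec[free] = Fraction(1)
        for row_idx, pc in enumerate(pivots):
            vec[pc] = -rows[row_idx][free]
        basis.append(vec)
    return basis


# Species order (G, A, GA); columns are the reactions G + A -> GA and GA -> G + A.
gamma = [[-1, 1],
         [-1, 1],
         [1, -1]]
gamma_t = [list(col) for col in zip(*gamma)]
# Conservation vectors w satisfy w^T gamma = 0, i.e. gamma^T w = 0.
laws = null_space(gamma_t)
```

For this mechanism the stoichiometric matrix has rank one and three rows, so two independent conservation laws are returned; one of them is the total amount of G in its free and bound forms.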

The Mathematica software package Kinetics calculates the rate equations (2) describing the time evolution of the concentrations of the substances involved in the reactions described by the collision diagram (1). The package also calculates the corresponding conservation laws (5).

The input of the package is the ensemble of chemical reactions, and the output of the package is the set of differential equations derived from the mass action law. The output can then be analyzed and studied with the analytical and numerical tools of Mathematica. In order to avoid long development times, the names of the rate constants are chosen automatically by the program.

The package Kinetics has the usual help commands, and we provide the Mathematica notebook KineticsTest.nb with several self-explanatory examples and computations, [9].

For example, let us describe now a simple protein production model with Kinetics. The Molecular Biology Dogma asserts that genes are the templates for protein production, and the standard mechanism for protein production can be represented by the collision diagrams,

equation image
(6)

Using one symbol each to represent the gene, polymerase, mRNA and protein concentrations, the collision equations (6) are the input for Kinetics, with the syntax,

equation image

where the two extra symbols denote waste products.

For the collision mechanism (6), the rate equations for the protein concentration, calculated by the package Kinetics, are,

equation image
(7)

and the rate constants have been chosen automatically by the software package. In this model, genes, polymerase and mRNAs are catalysts, and these equations have the exact solutions,

equation image
(8)

To simplify the model equations of protein production while maintaining the catalytic properties of genes, in the following we use, instead of the collision mechanism (6), the simplified or reduced kinetic mechanism,

equation image
(9)

To the reactions (9) correspond the rate equations,

equation image
(10)

The rate equations (10) have the solutions,

equation image
(11)

Comparing the protein solutions in (11) and (8), we conclude that the steady state of the protein in both models is unique and is proportional to the gene concentration. The proportionality constant differs between the two models, depending on the rates of the reactions involved. The steady state of model (9) has a direct biological meaning: it is determined by the rate of protein production, the rate of protein degradation and the initial gene concentration.
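For concreteness, a reduced mechanism of this kind can be worked out explicitly. The following is a sketch under stated assumptions: the symbols $G$ (gene), $P$ (protein), $k$ (production rate) and $d$ (degradation rate) are chosen here for illustration and need not coincide with the names generated automatically by Kinetics, and the mechanism is assumed to consist of catalytic production by the gene followed by first-order degradation of the protein.

```latex
\begin{align}
  & G \xrightarrow{\;k\;} G + P, \qquad P \xrightarrow{\;d\;} \emptyset, \\
  & \frac{dG}{dt} = 0, \qquad \frac{dP}{dt} = k\,G - d\,P, \\
  & G(t) = G(0), \qquad
    P(t) = \frac{k\,G(0)}{d} + \left( P(0) - \frac{k\,G(0)}{d} \right) e^{-d\,t}, \\
  & P^{*} = \lim_{t \to \infty} P(t) = \frac{k\,G(0)}{d}.
\end{align}
```

Under these assumptions the steady state $P^{*}$ is unique, is reached from any initial protein concentration, and is proportional to the conserved gene concentration $G(0)$.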

In the following, and in order to simplify the description of the transcriptional regulation of genes, we will adopt the mechanism (9) to describe the associated protein production.

In both rate equations (10) and (7), the concentration of gene is constant along time, and therefore the gene concentration is a conservation law. In the following, we will show that the linear conservation laws of the form (5), will have an important role in the determination of steady states and in bifurcations associated with threshold effects.

A mass action framework to describe genetic regulatory networks based on the protein-gene interaction

In the previous section, we have described a basic model for the production of proteins, in the framework of the mass action law. Based on this framework, we generalize this view in order to include the case of transcriptional regulation of genes by proteins.

In order to keep general the approach presented here and to maintain the biological reality of model parameters, we make the following basic modeling assumptions:

  1. In order to describe quantitatively the protein production (concentration) within the Molecular Biology Dogma, we only consider genes and proteins. Intermediate substances in the regulatory mechanism like catalysts, polymerases and mRNAs are not considered. A model of protein production has been presented and analyzed at the end of the previous section.
  2. The regulation of protein production by the template gene is based on the Jacob and Monod operon model, [10]. Namely, every gene has an associated number of binding sites where transcription factors (activators or repressors) can bind, Figure 2. The regulation by activation and repression occurs only through the binding sites. For a given double graph of interactions, the number of binding sites of a gene is determined by the number of edges that end at the corresponding graph node.
    Figure 2
    Jacob and Monod operon model for the regulation of protein production.
  3. Transcription factors are the proteins associated with the vertices that activate or inhibit the production of other proteins. A vertex of a graph represents a transcription factor only if it is the initial point of an edge of activation or inhibition. If a vertex has incoming and outgoing edges of any type, this vertex represents symbolically a protein and a gene with several binding states.
  4. Each transcription factor has its own binding site in the gene strand, or each gene has only one binding site for all the regulators. Both cases are treated separately in the model. We assume that when at least one activator is bound to a gene, the transcription is activated with a particular production rate for each combinatorial possibility of all the remaining binding sites.
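Assumptions 2 and 3 above translate directly into simple operations on the edge lists of a double graph. The Python sketch below applies them to the protein-level interactions of Figure 1 as described in the text; the function names `binding_sites` and `transcription_factors` are hypothetical and do not belong to GeneticNetworks.

```python
from collections import Counter

# Protein-level edges of Figure 1 as (source, target) pairs.
ACTIVATION = [("BCD", "HB"), ("BCD", "KNI")]
REPRESSION = [("HB", "KNI"), ("KNI", "HB"), ("TLL", "KNI")]


def binding_sites(activation, repression):
    """Binding sites per gene: number of edges ending at its vertex."""
    return Counter(dst for _, dst in activation + repression)


def transcription_factors(activation, repression):
    """Vertices that are the initial point of at least one edge."""
    return {src for src, _ in activation + repression}


sites = binding_sites(ACTIVATION, REPRESSION)
factors = transcription_factors(ACTIVATION, REPRESSION)
print(dict(sites))
print(sorted(factors))
```

With these edges, HB has two incoming edges and KNI has three, while BCD and TLL have none and therefore act only as regulators.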

For example, in the double graph of the biological mechanism of Figure 1, we have the following chemical substances,

equation image
(12)

The description of the time evolution of protein concentrations of the mechanisms of Figure 1 involves one rate equation for each substance in (12), except possibly for bcd and hb. As proteins are produced from a gene template, the symbol associated with each vertex of the graph represents a protein. The operon states are represented by the same symbol with superscripts and subscripts. The superscript positions indicate the binding or unbinding of transcriptional activators. The subscript positions indicate the binding or unbinding of transcriptional repressors. In the GeneticNetworks software package, bcd, hb, BCD, HB, KNI and TLL are the names of the vertices of the regulation graphs, but the operon variables in (12) are generated by the software.

The model associated with a given double graph contains the rate equations for the proteins and the operons in its different states. We also assume by default that proteins always degrade and genes are autocatalytic substances that never degrade. The first assumption implies that protein concentrations remain bounded in time, and the second assumption implies the existence of a conservation law for the concentration of the operon states. Using the symbolic tools of Mathematica, other assumptions can be introduced at this stage of model construction.

The GeneticNetworks software package

GeneticNetworks is a software package that generates the rate equations for the concentrations of genes and proteins in a regulatory network, [9]. The starting point is the double graph of activations and repressions. The inputs for GeneticNetworks are two strings, activation and repression, that describe the transcriptional activations and repressions of proteins on genes. In the graph, the same symbol is used to denote both a gene and the corresponding produced protein. As we have seen in (12), the set of variables for the regulation model is constructed with the vertex symbols.

For example, using as input for GeneticNetworks the interaction strings,

equation image
(13)

the double graph of the genetic network (13) is shown in Figure 3. In this case, the double graph is characterized by the vertex set {A, B, R}, the activation set {(A, B)} and the repression set {(R, B)}.

Figure 3
Double graph associated with the input strings (13) for the GeneticNetworks software package.

In the interaction mechanism (13), protein A activates gene B, and protein R represses gene B. Therefore, the variables of the mechanism (13) are,

equation image
(14)

The following functions are defined in the GeneticNetworks package:

  • NetworkGraph, ManipulateGraph
  • Reactions, ReactionsOneSite, ReactionGraph
  • SubstanceNames, SubstanceVariables, SubstanceInitialConditions
  • ParameterNames, ParameterInput
  • Equations
  • ConservationLaws

With these functions, we calculate the model equations associated with the input strings (13), calculate automatically the number of variables of the model, define all the relevant parameters and calculate the rate equations. For example, for the genetic regulatory network (13), the GeneticNetworks package builds the mass action law type collision diagrams,

equation image
(15)

These collision diagrams lead to the mass action law rate equations,

equation image
(16)

and the conservation law,

equation image
(17)

From the conservation law (17), we can eliminate one of the equations in (16). In this genetic network, we have assumed that the protein concentrations of A and R are constant in time.

The rate equations (16) define a mass action law based model for the genetic regulatory network of Figure 3.

In the implementation of GeneticNetworks, we have two possible modeling choices. In one choice, each different regulator has its own binding site, and the model diagrams (15) have been constructed with this assumption. For the second choice, we consider that there is only one binding site in the operon where all the regulators bind. In this case, the collision diagrams associated with the genetic network (13) and calculated in the GeneticNetworks package are,

equation image
(18)

By the mass action law, the ReactionsOneSite command leads to the rate equations,

equation image
(19)

The equations (19) have the conservation law,

equation image
(20)

The two models (15) and (18) for the genetic network (13) are different and these two choices are implemented in the GeneticNetworks package. For the dynamical analysis of a particular case of the distinction between the two models (15) and (18), see [11]. Below, we will show with a specific example that these two different regulation choices lead to qualitatively and quantitatively similar results.
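The difference between the two binding-site choices can be made concrete by counting operon states. The Python sketch below is an interpretation of the modeling assumptions, not output of GeneticNetworks: the bracket labels such as `B[A,R]` are an ad hoc notation introduced here, and it is assumed that with separate sites every site is independently free or bound, while with a single shared site at most one regulator is bound at a time.

```python
from itertools import product


def operon_states_separate_sites(gene, activators, repressors):
    """One site per regulator: each site is independently free or bound."""
    regulators = list(activators) + list(repressors)
    states = []
    for occupancy in product([False, True], repeat=len(regulators)):
        bound = [r for r, occ in zip(regulators, occupancy) if occ]
        states.append(gene if not bound else f"{gene}[{','.join(bound)}]")
    return states


def operon_states_one_site(gene, activators, repressors):
    """One shared site: the operon is free or holds exactly one regulator."""
    regulators = list(activators) + list(repressors)
    return [gene] + [f"{gene}[{r}]" for r in regulators]


# Network (13): protein A activates gene B, and protein R represses gene B.
separate = operon_states_separate_sites("B", ["A"], ["R"])
shared = operon_states_one_site("B", ["A"], ["R"])
print(separate)
print(shared)
```

Under these assumptions the separate-site choice gives four operon states for B and the one-site choice gives three, so the one-site model has one variable fewer.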

Results

An emerging concentration threshold in the dynamics of a self-activating protein

As an application of the rules describing a genetic regulatory network just introduced, we discuss now the basic role of the conservation laws in the occurrence of threshold effects in regulation mechanisms. We study the case of a self-activating protein, where the produced protein activates its own production, Figure 4.

Figure 4
Regulation graph describing a self-activating protein.

The simplest self-activating genetic network is described by the input tables,

equation image

The reactions and the parameters involved in this activation can be obtained by the GeneticNetworks command Reactions, followed by the command ReactionGraph,

equation image
(21)

where two of the variables are the operon states and the third is the corresponding protein.

With the command Equations, we get the rate equations,

equation image
(22)

Finally, with the command ConservationLaws, we find,

equation image
(23)

where the conserved quantity takes a constant value, fixed by the initial concentrations of the operon states.

Introducing (23) into (22), the independent set of rate equations describing the process (21) is,

equation image
(24)

We analyze now the steady state and the phase space structure of the solutions of equations (24). Equations (24) have two steady states with coordinates,

equation image

and,

equation image

As the coordinates of the two steady states depend on the conservation law constant, by (23), the steady state coordinates depend on the initial concentrations of the operon.

Consider the Jacobian matrices of equations (24) evaluated at the two fixed points. As,

equation image

and,

equation image

then, we have,

equation image
(25)
equation image
(26)

As, for [formula e101], [formula e102] is negative, the protein concentration at the steady state of the rate equations (24) is zero ([formula e103]). For [formula e104], the protein concentration at equilibrium is [formula e105]. Therefore, the conservation law (23) tunes a bifurcation for [formula e106] (transcritical bifurcation), implying the existence of a threshold effect tuned by the conservation law parameter [formula e107].
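The threshold mechanism can be illustrated numerically with a minimal sketch. The reaction scheme below is an assumption made for illustration (one free operon state G, one protein-bound state GP, mass-action binding, production from the bound state, and linear degradation); it is not necessarily the scheme (21) generated by GeneticNetworks, and the rate names `k1`, `km1`, `k2`, `d` and the function names are hypothetical. Under these assumptions, the conservation law G + GP = alpha gives a zero protein steady state for alpha below `d*km1/(k1*k2)` and the value `k2*alpha/d - km1/k1` above it.

```python
# Toy mass-action model of a self-activating protein (assumed reactions, not
# necessarily those produced by the GeneticNetworks package):
#   G + P <-> GP   (rates k1, km1)   protein binds the free operon state
#   GP -> GP + P   (rate k2)         bound operon state produces protein
#   P -> 0         (rate d)          protein degradation
# Conservation law: G + GP = alpha (total operon concentration).

def threshold(k1, km1, k2, d):
    """Critical total operon concentration of the toy model."""
    return d * km1 / (k1 * k2)


def protein_steady_state(alpha, k1=1.0, km1=1.0, k2=1.0, d=1.0):
    """Non-negative steady-state protein concentration of the toy model."""
    return max(0.0, k2 * alpha / d - km1 / k1)


def simulate(alpha, p0=0.1, k1=1.0, km1=1.0, k2=1.0, d=1.0,
             dt=0.01, t_end=200.0):
    """Integrate the reduced rate equations with the explicit Euler method."""
    p, gp = p0, 0.0
    for _ in range(int(t_end / dt)):
        bind = k1 * (alpha - gp) * p   # G + P -> GP, using G = alpha - GP
        unbind = km1 * gp              # GP -> G + P
        dp = -bind + unbind + k2 * gp - d * p
        dgp = bind - unbind
        p, gp = p + dt * dp, gp + dt * dgp
    return p, gp


if __name__ == "__main__":
    print("threshold:", threshold(1.0, 1.0, 1.0, 1.0))
    for alpha in (0.25, 0.5, 1.5, 2.0, 3.0):
        p_num, _ = simulate(alpha)
        p_formula = protein_steady_state(alpha)
        print(f"alpha={alpha:4.2f}  simulated={p_num:.4f}  formula={p_formula:.4f}")
```

Sweeping `alpha` across the critical value shows the exchange of stability characteristic of a transcritical bifurcation: the quiescent state attracts below the threshold, and the non-zero state attracts above it. Convergence is slow close to the threshold, so the sketch avoids sampling `alpha` exactly there.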

In Figure 5, we show the dependence of the protein steady state on the total concentration of the gene. In this simple regulation model, where both protein and gene concentrations are modeled, the steady state of the protein depends on the initial concentration of the corresponding operon states. Moreover, the initial concentration of the operon states induces a bifurcation from a quiescent state to a non-zero steady state. This is a threshold effect that emerges from the dynamics (24). As the steady state protein concentration depends on [formula e108], the “after threshold” concentration values depend continuously on the operon initial concentration [formula e109]. In the following, we will see that in networks with more than one node, the steady state depends also on the concentration of other transcriptional regulators, and these concentration dependent thresholds can be at the origin of spatial patterning.

Figure 5
Dependence of the protein steady state on the total concentration of the gene.

Spatial distribution and steady states

So far, we have focused on genetic regulatory models without specifying a spatial localization. In genetic networks describing a biological process, the initial concentration of proteins can vary significantly across tissues. For example, in some developmental processes, proteins show a non-uniform concentration along embryos, with very sharp slopes, playing a basic role in the establishment of the body plans of organisms. A well-known case is Drosophila segmentation, where variations in protein concentrations across the embryo induce protein patterning, [17]–[19]. One such genetic regulatory network is represented in Figure 1, [2].

To show that patterning can be explained by the non-homogeneity of initial conditions of regulators across tissues, we analyze a genetic regulatory network for the production of a protein [formula e117], regulated by one activator [formula e118] and two repressor proteins [formula e119] and [formula e120] (Figure 6). To simplify our analysis, we take the competitive case, where the activator and the repressors bind to the same binding site of the operon of protein [formula e121].

Figure 6
Genetic regulatory network for the production of protein [formula e122] with one activator [formula e123] and two repressors [formula e124] and [formula e125].

To simplify further, we assume that the spatial distributions of the proteins [formula e126], [formula e127] and [formula e128] are constant in time. Under these conditions and with the regulation model developed in the package GeneticNetworks, we obtain the system of linear rate equations,

equation image
(27)

where the derivative is taken with respect to time, [formula e130], [formula e131], [formula e132], [formula e133], [formula e134], [formula e135], [formula e136] and [formula e137]. These concentration variables are defined in a spatial one-dimensional bounded region of the real line ([formula e138]). The following conservation law holds,

equation image

where [formula e140] is a constant, possibly depending on the spatial coordinate [formula e141]. The system of rate equations (27) has one steady state with coordinates,

equation image

where,

equation image

Choosing [formula e144], and [formula e145], the steady state concentration of the protein [formula e146] is,

equation image
(28)

In Figure 7, we show the steady state concentration (28) of protein [formula e148], as a function of a spatial coordinate, [formula e149]. We have considered the initial distributions [formula e150], [formula e151], [formula e152], [formula e153], and the parameter value [formula e154]. In this case, due to the inhibitory regulation of the repressor proteins [formula e155] and [formula e156], the steady distribution of protein [formula e157] is spiky. We have also analyzed the genetic network of Figure 6 with a model with different binding sites for each regulator. The final result is similar to the one shown in Figure 7. This shows that the two model approaches in GeneticNetworks, with one binding site and with several binding sites in the operon, give qualitatively similar results. When the calibration and validation of models is not a problem, we can use the simplest one binding site regulation model to describe a given genetic regulatory network. The one-regulator site framework leads to simpler mathematical models.

Figure 7
Steady states of proteins [formula e158], [formula e159] and [formula e160] for the genetic regulatory network of Figure 6.

The solution (28) shows that steady states can depend on the initial conditions of the regulators ([formula e167], [formula e168] and [formula e169]) and on the initial conditions of the catalytic agents ([formula e170]), showing that spatial patterning can be a direct consequence of the non-homogeneity of initial conditions.
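As an illustration of how non-homogeneous regulator concentrations alone can produce a localized protein domain, the sketch below evaluates a steady-state expression for competitive binding at a single operon site. Everything in it is an assumption made for illustration: the binding constants `K_a`, `K_r1`, `K_r2`, the production and degradation rates, and the exponential repressor profiles are hypothetical and are not the values or distributions used for Figure 7. The expression follows from mass-action binding of one activator and two repressors to the same site, with the total operon concentration `alpha` conserved.

```python
import math


def steady_state_protein(a, r1, r2, alpha=1.0, K_a=1.0, K_r1=1.0, K_r2=1.0,
                         production=1.0, degradation=1.0):
    """Steady-state protein level for competitive binding (toy model).

    The activator-bound operon fraction is K_a*a / (1 + K_a*a + K_r1*r1 + K_r2*r2);
    protein is produced from that state and degraded linearly.
    """
    bound_activator = alpha * K_a * a / (1.0 + K_a * a + K_r1 * r1 + K_r2 * r2)
    return production * bound_activator / degradation


def profile(n=101, decay_length=0.2):
    """Protein steady state along x in [0, 1] for assumed regulator profiles."""
    xs = [i / (n - 1) for i in range(n)]
    ps = []
    for x in xs:
        a = 1.0                                            # uniform activator
        r1 = 10.0 * math.exp(-x / decay_length)            # anterior repressor
        r2 = 10.0 * math.exp(-(1.0 - x) / decay_length)    # posterior repressor
        ps.append(steady_state_protein(a, r1, r2))
    return xs, ps


if __name__ == "__main__":
    xs, ps = profile()
    peak = ps.index(max(ps))
    print(f"peak at x = {xs[peak]:.2f}, P = {ps[peak]:.4f}")
    print(f"P at the boundaries: {ps[0]:.4f}, {ps[-1]:.4f}")
```

With these assumed profiles the protein is confined to the middle of the domain, where both repressors are low, without any transport term in the model.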

In this model, we have considered that the concentrations of [formula e171], [formula e172] and [formula e173] are constants, implying that the model equations (27) describe a system where [formula e174], [formula e175] and [formula e176] have a fast recovery time. This situation only occurs in thermodynamically open systems, as is the case of biological systems.

Cross-regulation in Drosophila

In Drosophila early development, some proteins, such as Bicoid (BCD) and Hunchback (HB), are translated from mRNA of maternal origin. Early in the first developmental stages of Drosophila, at cleavage stage 13, BCD and HB proteins form a stable concentration gradient along the antero-posterior axis of the Drosophila embryo. In Figure 8, we show the data for these protein gradients, taken from the FlyEx database, [20], [21] (http://flyex.ams.sunysb.edu/flyex/). These stable gradients are established by diffusion processes occurring in the syncytial blastoderm of the embryo, [17]–[19], [22]. At a later stage, in the 14th cleavage stage, other proteins, such as Knirps (KNI), show segments characterized by spiky concentration patterns along the antero-posterior axis of the embryo, [1], [2], [23]–[26].

Figure 8
Concentration of protein Hunchback (HB) at the end of cleavage cycle 13, and of Bicoid (BCD) and Tailless (TLL) proteins at the cleavage cycle 14, along the antero-posterior axis of the embryo of Drosophila.

We now show that the patterning of HB and KNI proteins, as observed at late cleavage stage 14 of the embryo of Drosophila, is due to the concentration gradients of proteins at an early developmental stage. This result follows from the concentration dependent threshold effect described above, without assuming diffusion for KNI and for zygotically produced HB. For that, we have taken the genetic regulatory network of Figure 1, describing the genetic regulation of HB and KNI, and we have used the package GeneticNetworks to describe the production of proteins Hunchback and Knirps during cleavage cycle 14 of the developing embryo of Drosophila.

Hunchback and Knirps proteins are both activated by the maternally produced protein Bicoid, Figure 1, and they mutually repress each other. The protein Knirps is also repressed by the protein Tailless. Therefore, the genetic regulatory network model obtained with the package GeneticNetworks should lead to the experimental profiles of HB and KNI, as observed at cleavage cycle 14. As the model obtained with the GeneticNetworks package is a system of ordinary differential equations that depend on initial data, we have assumed that, at the end of cleavage cycle 13 and beginning of the 14th, the proteins BCD, TLL and HB of maternal origin have a non homogeneous distribution along the embryo, as shown in Figure 8.

In Figure 9, we show the experimental profiles of HB and KNI proteins at cleavage cycle 14, as well as a fit of the experimental data obtained with the model built with the software package GeneticNetworks. The model equations form a system of 14 ordinary differential equations for the proteins and corresponding operon states, with [formula e180] free parameters. In the ordinary differential equations of the model, we have treated the integration time as an additional free parameter. The protein profiles shown in Figure 9 are out of equilibrium patterns, obtained with the integration time [formula e181] s. These equations and the fitted parameter values are presented in Text S1.

Figure 9
The dots with error bars are the mean experimental profiles of the proteins HB and KNI at cleavage cycle 14, taken along the antero-posterior axis of the embryo, for several Drosophila embryos.

To fit the experimental data with the mathematical model, we have assumed that the initial protein concentrations of BCD and TLL are constant over time, and we have also assumed that each regulator has its own binding site. To find the numerical values of the model parameters in order to calibrate the model, we have used an optimization technique based on a genetic evolutionary algorithm, minimizing a sum of chi square functions, [23], [27].
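The calibration idea, minimizing a chi square cost with an evolutionary search, can be sketched on a toy problem. This is not the algorithm of [23], [27]: the elitist (1+λ) evolution strategy, the two-parameter exponential model, the synthetic data and all names below are assumptions chosen only to keep the example short and self-contained.

```python
import math
import random


def model(x, params):
    """Toy two-parameter profile (hypothetical stand-in for the ODE model)."""
    amplitude, rate = params
    return amplitude * math.exp(-rate * x)


def chi_square(params, xs, ys, sigmas):
    """Sum of squared residuals weighted by the measurement errors."""
    return sum(((y - model(x, params)) / s) ** 2
               for x, y, s in zip(xs, ys, sigmas))


def reduced_chi_square(params, xs, ys, sigmas):
    """Chi square divided by the number of degrees of freedom."""
    return chi_square(params, xs, ys, sigmas) / (len(xs) - len(params))


def synthetic_data(true_params=(2.0, 3.0), n=11, sigma=0.1):
    """Noise-free synthetic profile with a constant assumed error bar."""
    xs = [i / (n - 1) for i in range(n)]
    ys = [model(x, true_params) for x in xs]
    return xs, ys, [sigma] * n


def evolve(xs, ys, sigmas, start, generations=200, offspring=20,
           step=0.5, seed=0):
    """Elitist (1+lambda) evolution strategy with Gaussian mutations."""
    rng = random.Random(seed)
    best = list(start)
    best_cost = chi_square(best, xs, ys, sigmas)
    for _ in range(generations):
        children = [[p + rng.gauss(0.0, step) for p in best]
                    for _ in range(offspring)]
        candidate = min(children, key=lambda c: chi_square(c, xs, ys, sigmas))
        cost = chi_square(candidate, xs, ys, sigmas)
        if cost < best_cost:
            best, best_cost = candidate, cost   # keep the improvement
        else:
            step *= 0.8                         # shrink the mutation size
    return best, best_cost


if __name__ == "__main__":
    xs, ys, sigmas = synthetic_data()
    start = [1.0, 1.0]
    best, cost = evolve(xs, ys, sigmas, start)
    print("start chi2:", chi_square(start, xs, ys, sigmas))
    print("fitted parameters:", best)
    print("reduced chi2:", reduced_chi_square(best, xs, ys, sigmas))
```

Because the parent is replaced only when a child has a lower cost, the chi square never increases from one generation to the next. For a model defined by differential equations, `model` would be replaced by a numerical integration up to the chosen integration time.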

The fitted curves in Figure 9 show a very good agreement with the experimental data. HB fits well in the embryo length range [formula e185], and KNI fits well in the embryo length range [formula e186]. The quality of the fits has been measured by the penalized chi square test. For the two fits in Figure 9, we have obtained the reduced chi square values [formula e187] and [formula e188]. This calibration of the genetic regulatory network of Figure 1 suggests that the anterior and posterior regions of the embryo are under the control of additional regulators. This same conclusion has been obtained in [25], but within a different regulation model. On the other hand, this mass action law model without diffusion at the level of gap genes is simpler than other full diffusion models, and is consistent with the reverse engineering methodology described in [26].

Discussion

We have presented a software tool to build mathematical models of genetic regulatory networks. The input of the package is the graph containing the list of transcriptional activators and repressors of the network. The software implements an approach based on the mass action law and on the operon regulation model in prokaryotes. We have also assumed in general that genes are catalytic substances present in any genetically controlled biological process. For eukaryotic organisms, the modeling approach proposed here should be understood as a descriptive approximation to eukaryotic regulation mechanisms, which are not yet well understood.

Within this approach, the usual threshold concept in biology emerges as a bifurcation phenomenon of the model equations. These bifurcations are tuned by the conservation law constants of the equations, resulting from the catalytic role of genes. This corroborates the view that threshold effects should be anchored on bifurcation phenomena, [16].

Another consequence of the modeling approach presented here is that positional information in developmental processes can be described by the non-homogeneity in the spatial distribution of the concentration of regulators, and is not necessarily associated with other physical processes of transport or diffusion. Other models for Drosophila development include a balance between protein diffusion and degradation, [2], [28]–[30]. Recently, some criticism of the diffusion-degradation hypothesis for proteins, [31], suggests that it is important to search for other mechanisms of pattern formation, [22], [32], [33]. The results presented here show that other mechanisms of spatial patterning are possible.

In conclusion, we have calibrated a genetic regulatory network for the production of HB and KNI during the 14th cleavage stage of the embryo of Drosophila, without assuming the hypothesis of protein diffusion for KNI and zygotically produced HB, and we have presented evidence that gap gene protein segments are out of equilibrium patterns. The genetic regulatory network of Figure 1 describes well the gap gene protein concentration of HB in the embryo length region [formula e189], as well as the gap gene protein concentration of KNI in the embryo length region [formula e190]. The out of equilibrium pattern hypothesis has been suggested in [28] in the framework of a model assuming that the HB and KNI proteins diffuse along the antero-posterior axis of the embryo of Drosophila. In [30], patterning at the gap gene stage is associated with the existence of attractors in a high-dimensional phase space, implying that gap gene patterns are obtained as steady state patterns. The necessity of concentration dependent thresholds in the gap-gene Drosophila patterning has been discussed in [24], and modeled through a Hill type response function with diffusion. Here, with the mass action law approach, gap-gene patterns emerge from the concentration dependent thresholds that result from the catalytic role of genes in organisms.

Supporting Information

Text S1

Ordinary differential equation model describing the genetic regulatory network of Figure 1.

(0.06 MB PDF)

Acknowledgments

We are grateful to two anonymous referees for constructive comments on the manuscript.

Footnotes

Competing Interests: The authors have declared that no competing interests exist.

Funding: This work has been supported by European project GENNETEC, FP6 STREP IST 034952. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.

References

1. Sánchez L, Thieffry D. A logical analysis of the Drosophila gap-gene system. J Theo Bio. 2001;211:115–141. [PubMed]
2. Alves F, Dilão R. Modeling segmental patterning in Drosophila: Maternal and gap genes. J Theo Bio. 2006;241:342–359. [PubMed]
3. Jong HD. Modelling and simulations of genetic regulatory systems: a literature review. J Comput Biol. 2002;9:67–103. [PubMed]
4. Klipp E, Herwig R, Kowald A, Wierling C, Lehrach H. Systems Biology in Practice. Weinheim: Wiley-VCH; 2005. pp. 282–286.
5. van Kampen NG. Stochastic Processes in Physics and Chemistry. Amsterdam: North-Holland; 1992. pp. 166–173.
6. Volpert AI, Volpert VA, Volpert VA. Traveling Wave Solutions of Parabolic Systems. Providence, Rhode Island: American Mathematical Society; 1994. pp. 299–300.
7. Horn F, Jackson R. General Mass Action Kinetics. Arc Rat Mec Anal. 1972;47:187–194.
8. Nicolis G, Prigogine I. Self-Organization in Nonequilibrium Systems. New York: Wiley & Sons; 1977. pp. 395–396.
9. Dilão R, Muraro D. 2009. Software packages can be downloaded from the site https://sd.ist.utl.pt/Download/download.html.
10. Jacob F, Monod J. Genetic regulatory mechanisms in the synthesis of proteins. J Mol Biol. 1961;3:318–356. [PubMed]
11. Alves F, Dilão R. A simple framework to describe the regulation of gene expression in prokaryotes. Com Rend Biologies. 2005;328:429–444. [PubMed]
12. Crick F. Central Dogma of Molecular Biology. Nature. 1970;227:561–563. [PubMed]
13. Ellner SP, Guckenheimer J. Dynamic Models in Biology. Princeton: Princeton Uni. Press; 2006. pp. 8–12.
14. Hoops S, Sahle S, Gauges R, Lee C, Pahle J, et al. COPASI: a COmplex PAthway SImulator. Bioinformatics. 2006;22:3067–74. [PubMed]
15. Maiwald T, Timmer J. Dynamical modeling and multi-experiment fitting with PottersWheel. Bioinformatics. 2008;24:2037–2043. [PMC free article] [PubMed]
16. Tyson JJ. Bringing cartoons to life. Nature. 2007;445:823. [PubMed]
17. Driever W, Nüsslein-Volhard C. A gradient of bicoid protein in Drosophila embryos. Cell. 1988;54:83–93. [PubMed]
18. Gilbert SF. Developmental Biology. Sunderland: Fifth Edition, Sinauer; 1997. pp. 547–548.
19. Nüsslein-Volhard C. Gradients that organize embryo development. Scientific American. 1996;275(2):54–61. [PubMed]
20. Pisarev A, Poustelnikova E, Samsonova M, Reinitz J. FlyEx, the quantitative atlas on segmentation gene expression at cellular resolution. Nucleic Acids Research. 2009;37:D560–D566. [PMC free article] [PubMed]
21. Poustelnikova E, Pisarev A, Blagov M, Samsonova M, Reinitz J. A database for management of gene expression data in situ. Bioinformatics. 2004;20:2212–2221. [PubMed]
22. Dilão R, Muraro D. mRNA diffusion explains protein gradients in Drosophila early development. J Theo Bio. 2010 doi: 10.1016/j.jtbi.2010.03.012. [PubMed]
23. Dilão R, Muraro D. Calibration of a genetic network model describing the production of gap gene proteins in Drosophila. 2010. Pre-print: arXiv:0912.4391v1. [PubMed]
24. Jaeger J, Surkova S, Blagov M, Janssens H, Kosman D, et al. Dynamic control of positional information in the early Drosophila blastoderm. Nature. 2004;430:368–371. [PubMed]
25. Jaeger J, Sharp DH, Reinitz J. Known maternal gradients are not sufficient for the establishment of gap domains in Drosophila melanogaster. Mech Dev. 2007;124:108–28. [PMC free article] [PubMed]
26. Perkins TJ, Jaeger J, Reinitz J, Glass L. Reverse Engineering the Gap Gene Network of Drosophila melanogaster. PLoS Comp Biol. 2006;2:e51, 417–428. [PMC free article] [PubMed]
27. Dilão R, Muraro D, Nicolau M, Schoenauer M. Pizzuti C, Ritchie MD, Giacobini M, editors. Validation of a morphogenesis model of Drosophila early development by a multi-objective evolutionary optimization algorithm. 2009. pp. 176–190. EvoBIO 2009, Lecture Notes in Computer Science 5483.
28. Fomekong-Nanfack Y, Kaandorp JA, Blom JG. Efficient parameter estimation for spatio-temporal models of pattern formation: Case study of Drosophila melanogaster. Bioinformatics. 2007;23:3356–3363. [PubMed]
29. Houchmandzadeh B, Wieschaus E, Leibler S. Precise domain specification in the developing Drosophila embryo. Phys Rev E. 2005;72:061920. [PubMed]
30. Manu, Surkova S, Spirov A, Gursky V, Janssens H, et al. Canalization of Gene Expression and Domain Shifts in the Drosophila Blastoderm by Dynamical Attractors. PLoS Comp Biol. 2009;5:e1000303. [PMC free article] [PubMed]
31. Kerszberg M, Wolpert L. Specifying Positional information in the embryo: Looking Beyond Morphogens. Cell. 2007;130:205–209. [PubMed]
32. Reinitz J. A ten per cent solution. Nature. 2007;448:420–421. [PubMed]
33. Coppey M, Berezhkovskii AM, Kim Y, Boettiger AN, Shvartsman SY. Modeling the bicoid gradient: Diffusion and reversible nuclear trapping of a stable protein. Dev Biol. 2007;312:623–630. [PMC free article] [PubMed]

Articles from PLoS ONE are provided here courtesy of Public Library of Science