- Journal List
- Biophys J
- v.89(2); 2005 Aug
- PMC1366644

# Signaling through Receptors and Scaffolds: Independent Interactions Reduce Combinatorial Complexity

## Abstract

After activation, many receptors and their adaptor proteins act as scaffolds displaying numerous docking sites and engaging multiple targets. The consequent assemblage of a variety of protein complexes results in a combinatorial increase in the number of feasible molecular species presenting different states of a receptor-scaffold signaling module. Tens of thousands of such microstates emerge even for the initial signal propagation events, greatly impeding a quantitative analysis of networks. Here, we demonstrate that the assumption of independence of molecular events occurring at distinct sites enables us to approximate a mechanistic picture of all possible microstates by a macrodescription of states of separate domains, i.e., macrostates that correspond to experimentally verifiable variables. This analysis dissects a highly branched network into interacting pathways originated by protein complexes assembled on different sites of receptors and scaffolds. We specify when the temporal dynamics of any given microstate can be expressed using the product of the relative concentrations of individual sites. The methods presented here are equally applicable to deterministic and stochastic calculations of the temporal dynamics. Our domain-oriented approach drastically reduces the number of states, processes, and kinetic parameters to be considered for quantification of complex signaling networks that propagate distinct physiological responses.

## INTRODUCTION

Extracellular signals received by plasma membrane receptors are processed and transduced through covalent modifications of amino acid residues on receptors and their cytoplasmic substrates (1). Interaction domains of numerous adaptor proteins and signaling enzymes recognize these modified residues as binding partners. For instance, receptors that belong to the large family of receptor tyrosine kinases (RTK) modify tyrosine residues by attaching a phosphate group. Phosphotyrosine residues efficiently bind proteins with Src homology 2 (SH2) and phosphotyrosine-binding domains (2,3). Subsequent assembly of signaling complexes leads to activation of downstream enzymes and protein kinase cascades, which propagate signals down to the nucleus.

Receptors and adaptor proteins often display multiple docking sites and engage several downstream signaling proteins, thereby serving as scaffolding proteins, or scaffolds. For instance, after activation, insulin receptor (IR), insulinlike growth factor receptor-1 (IGF-1R), epidermal growth factor receptor (EGFR), and immunoreceptor tyrosine kinases, such as B-cell receptors, T-cell receptors, and Fc receptors, all bind various combinations of downstream targets generating a large number of different signaling species in highly branched networks (4–6). IR and IGF-1R, exist as dimers containing two *α*- and two *β*-subunits, whereas other RTKs are monomers that dimerize after ligand binding (7–11). Oligomerization of cell-surface receptors can generate multiple docking sites even if each receptor monomer has only one such site.

The binding partners of IR or IGF-1R include insulin receptor substrates, IRS1–IRS4, which are scaffolding proteins (4,12). Phosphorylation of various tyrosine residues on IRS-1 by IR or IGF-1R creates docking sites for several SH2 domain-containing proteins, such as growth factor receptor binding protein-2 (Grb2), the p85 subunit of phosphatidylinositol 3-kinase (PI3K), and the protein tyrosine phosphatase SHP-2 (13,14). Likewise, the Grb2-associated binders (GAB-1 and GAB-2) are scaffolding adaptors, which are phosphorylated by various RTKs (15–17) and soluble tyrosine kinases of the Src family (18,19). Phosphorylated docking sites on GABs bind numerous targets, including Shc, Grb2, p85, phospholipase C*γ*, SHP-2, and the Crk adaptor protein (20,21). Different docking sites on activated receptors and scaffold proteins initiate separate signaling pathways that propagate distinct cellular responses. For instance, Grb2 binding to tyrosine phosphorylated GAB and the recruitment of the GDP/GTP exchange factor SOS to the plasma membrane enables activation of the small GTPase Ras. This leads to activation of the mitogen-activated protein kinase (MAPK) cascade that promotes mitogenesis and differentiation, whereas p85 binding to another docking site on GAB initiates the PI3K/AKT pathway implicated in glucose and lipid metabolism and cell survival (22,23).

Quantitative analysis and mathematical modeling of signaling through receptors and scaffolds is hampered by a combinatorial increase in complexity with the number of docking domains (sites) and interaction partners. A scaffolding adaptor protein can either be associated with or dissociated from a receptor. Each docking site on a scaffold can be phosphorylated or unphosphorylated, and the phosphorylated site can either be free or occupied by its binding partner. Proteins bound to a scaffold can be phosphorylated and dephosphorylated, and may associate with other signaling proteins, assembling multicomponent complexes. All these different possibilities multiply, generating tens of thousands of molecular species even for a few initial steps in signal transduction involving receptors, scaffolds, and adaptors (24,25).

An entire collection of potential molecular species corresponds to different forms of a receptor and scaffold protein and is referred to as a set of microstates. A standard mechanistic description takes into account all possible microstates and transitions between them. Such a detailed description might be necessary when binding of a protein to one docking site changes the kinetic properties of other docking sites (allosteric interactions, such as cooperative association), and/or promotes (de)phosphorylation of proteins bound to these sites (24–28). An elegant algorithm for deterministic calculations of all potential species and reactions has recently been developed (29). However, because of the enormous number of distinct microstates of regulatory complexes and a lack of knowledge of the kinetics for each possible transition, such a detailed microdescription may rapidly become impractical. Unlike deterministic algorithms, stochastic algorithms use probabilistic rules to simulate the evolution of species populations. MOLECULIZER is a stochastic simulator that employs a form of Gillespie's first reaction algorithm (30) and generates only populated states of a network, thus reducing the number of all possible states to be considered (31). An appealing stochastic approach to modeling multistate biomolecular systems was developed by Bray and colleagues (32,33). In the computer program StochSim, individual multistate molecules and complexes are represented as distinct software objects. Consequently, a combinatorial explosion of the number of microstates is circumvented by merely following stochastic changes in the states of individual, distinguishable molecules, the number of which does not increase in the course of simulations (34). The use of StochSim is especially practical for networks in small (subcellular) volumes with low numbers of molecules, where a stochastic algorithm more closely describes the physical reality than a deterministic algorithm. However, for large networks with hundreds of different proteins, the StochSim calculation time would be too slow, increasing proportionally to the number of molecules squared.

The present article demonstrates that a mechanistic description of a highly combinatorial network generated by various phosphorylation and binding forms of receptors and scaffolds may be drastically reduced using a domain-oriented approach. Provided there is a set of docking sites where molecular events are independent (i.e., allosteric interactions are absent), a signaling system can be modeled in terms of a macrodescription that follows the states of each docking domain separately, including subsequent downstream signal transduction. Compared to the combinatorial explosion of microstates and equations in a mechanistic model, for a macrodescription, the number of macrostates increases linearly, as the sum of distinct domains and binding partners. Starting with simple examples that are subsequently generalized, we demonstrate when and how the temporal behavior of a particular microstate can be expressed explicitly or approximated in terms of macrostates of distinct sites, which are determined using a significantly reduced model. Such a reconstruction of microscopic behavior is required when different microstates within the same macrostate present biologically different activities. The results presented in this article suggest a novel approach to the reduction of combinatorial complexity of multicomponent signal transduction networks.

## METHODS

### Kinetic description

We will analyze two signaling subnetworks, presenting a common theme in signal transduction: 1), a scaffolding adaptor protein activated by a receptor; and 2), a receptor that also acts as a scaffold. A key property that will allow us to reduce combinatorial complexity of these networks is the assumption of the absence of allosteric interactions for a subset of domains/docking sites on a receptor and/or scaffold. A quantitative description of these modules will serve as a template for constructing models of signal transduction.

#### Scaffold with multiple independent docking sites

We will first consider a scaffolding adaptor protein (*S*) that binds to an activated receptor (*R*) through a specific interaction domain (*h*) and is subsequently phosphorylated by the receptor kinase on distinct sites *i* (*i* = 1, …, *n*), located outside of domain *h*. When phosphorylated, sites *i* can engage several downstream signaling proteins (and their complexes) or can be dephosphorylated by phosphatases. To describe the system quantitatively, it is convenient to assign digital flags (numbers) to possible states of domain *h* and docking sites *i* (Fig. 1). For unbound *S*, *h* = 0, and for *S* bound to the receptor, *h* = 1. Each site *i* can be in one of (*m*_{i} + 1) states denoted by *a*_{i} = 0, 1, …, *m*_{i}. The number *a*_{i} = 0 indicates an unphosphorylated free site, *1* denotes a phosphorylated free site, *2* represents a site occupied by a binding partner (*A*_{s}), *3* can stand for phosphorylation of this partner (*A*_{s}*P*) or binding a new partner (*A*_{j}), *4* can denote the binding of the complex *A*_{s}*PA*_{j}, and so on. Consequently, at the microlevel the state of protein *S* is described by a digital flag (*a*_{1}, …, *a*_{n}; *h*), and the meaning of these numbers is specific for a network. The concentration of *S* in each state is denoted by the time-dependent function *s*(*a*_{1}, …, *a*_{n}; *h*; *t*) (to simplify designations, we will omit the variable *t* if it does not result in misunderstanding). For any *h*-state, there are (*m*_{1} + 1) · (*m*_{2} + 1) · … · (*m*_{n} + 1) states of *S*. The transitions between states are described by a graph with 2 · (*m*_{1} + 1) · (*m*_{2} + 1) · … · (*m*_{n} + 1) vertices (states) presenting all feasible species. Fig. 2 illustrates such a transition graph for a scaffold with two docking sites and their partners, *A*_{1} and *A*_{2}. Phosphorylation of docking sites occurs only when the scaffold is bound to the receptor (*h* = 1), whereas dephosphorylation may take place for both bound and unbound (*h* = 0) states, if docking sites are not occupied by their binding proteins (phosphatase activities are assumed to be constant parameters).

*S*binds to the receptor

*R*via a specific domain (

*h*). For the unbound

*S*,

*h*= 0, and when

*S*is bound to

*R*,

*h*= 1. Two docking sites on

**...**

*s*(0,0;0),

*s*(1,0;0), and

*s*(2,0;0) correspond to the scaffold that is not bound to the receptor

*R*(

*h*= 0), unphosphorylated on docking site 2 and has docking

**...**

Elementary transitions between states of the scaffolding protein *S* are assumed to follow mass-action kinetics. Two additional assumptions are crucial. First, we assume that transitions between different states *a*_{i} of each site *i* do not depend on states of the other sites (*a*_{j}, *j* ≠ *i*), whereas these transitions may depend on the state of the *h* domain, which can formally be called a *controlling* recruitment site. For instance, no phosphorylation transitions can occur when the scaffold has dissociated from the receptor (Fig. 2). Second, transitions between different states of the recruitment site *h* are assumed to be independent of states *a*_{i} for any site *i*. These kinetic properties imply a hierarchical relation between two kinds of binding sites on *S*.

In the notation assumed, any chemical transformation is simply a change in one of the numbers *a*_{i} or *h*. We designate by *k*_{i}() the pseudo-first-order rate constant for the transition from the state (*a*_{1}, …, *a*_{i−1}, *a*_{i}, *a*_{i+1} …, *a*_{n}; *h*) to the state (). Upon this transition, the number *a*_{i} characterizing site *i* changes to whereas states *a*_{j} of other sites *j* do not change (as indicated by the subscript *i* in *k*_{i}). If this is a binding reaction, the *k*_{i} includes the concentration of a binding protein (*A*_{i}) as a multiplier. Similarly, the reverse transition is described by the pseudo-first order rate constant *k*_{i}(). Association and dissociation of the scaffold *S* and the receptor *R* change only the *h* number and are characterized by the rate constants *k*_{r} · *R* and *k _{−}*

_{r}, respectively. For example, in Fig. 2, the rate constant for the transitions

*s*(0,

*a*

_{2}; 1) →

*s*(1,

*a*

_{2}; 1) is

*k*

_{1}(0→1; 1) for

*a*

_{2}= 0, 1 or 2, and for the transitions

*s*(

*a*

_{1},

*a*

_{2}; 0)→

*s*(

*a*

_{1},

*a*

_{2}; 1) is

*k*

_{r}·

*R*for any

*a*

_{1},

*a*

_{2}.

The time evolution of the scaffold is determined by the following system of 2 · (*m*_{1}+1) · (*m*_{2}+1) · … · (*m*_{n}+1) ordinary differential equations, which describe the chemical transformations of all feasible species as transitions between microstates of *S*,

This equation has a simple structure: on the right-hand side, the first and second terms describe the consumption and production of species *s*(*a*_{1}, …, *a*_{n}; *h*; *t*) in reactions that occur at sites *i*, whereas the third term presents supply-and-demand processes arising from association (*h* = 0) and dissociation (*h* = 1) from the receptor. We will refer to this mechanistic presentation ( Eq. 1), which involves all possible microstates and transitions between them, as a microdescription of a network.

#### The simplest case of complete independence: sites without hierarchical control

To highlight the general principles of a domain-oriented approach to reduction in combinatorial complexity, we will also analyze an alternative, simpler model of signaling by a scaffold. In this model, chemical transformations of binding sites on the scaffold *S* are completely independent of states of other sites, including phosphorylation and dephosphorylation. The formal difference from the previous model is the lack of controlling recruitment site *h*. In this case, the state of *S* is characterized by the function *s*(*a*_{1}, …, *a*_{n}; *t*). The dynamics of signaling by this scaffolding protein is described by a reduced system of (*m*_{1}+1) · (*m*_{2}+1) · … · (*m*_{n}+1) ordinary differential equations, which is obtained from Eq. 1 by equating *k*_{r} and *k _{−}*

_{r}to zero.

### Receptor possessing scaffolding properties

The second signaling module analyzed here is a receptor, also acting as a scaffold, of the RTK superfamily. After binding of a ligand or/and dimerization, the receptor is autophosphorylated on several docking sites, which makes them capable of binding signaling proteins with specific binding domains. We will consider two possible molecular mechanisms. The first assumes that the binding affinity of a ligand (*L*) for the receptor (*R*) does not depend on the receptor phosphorylation and dimerization states and on the states of its cytoplasmic docking sites (10). In this case, a mathematical description of the system becomes the same as the description of an activated scaffolding protein considered in Scaffold with Multiple Independent Docking Sites. States *a*_{i} of docking sites are described identically for the scaffold and the receptor, and states, *h* = 1 and *h* = 0, correspond to the receptor that has or has not bound *L*. In fact, the transition graph is readily obtained from Fig. 2 by the replacement of *S* and *R* by *R* and *L*, respectively, and the signaling kinetics is determined by Eq. 1 after the same replacement.

In an alternative mechanism, conformational changes after receptor dimerization and/or phosphorylation preclude ligand dissociation, until the receptor is endocytosed and undergoes degradation or recycling, as for the EGF receptor (35–39). For the sake of simplicity, we will assume initially that receptors have a dimeric structure before stimulation (such as IR and IGF-1R) and undergo conformational changes and phosphorylation after the ligand binding. We designate by *r*_{0} and *r*(*a*_{1},…, *a*_{n}) the concentrations of free receptor dimer and the dimer with bound ligand(s) and docking sites (*i*) in states (*a*_{1},…, *a*_{n}), respectively. As in the previous section, *a*_{i} = 0 stands for the unphosphorylated site *i*, *a*_{i} = 1 for the free phosphorylated site, *a*_{i} = 2, and so on for different combinations of adaptor proteins and their phosphorylation states bound to site *i*. For this signaling network, a fragment of a transition graph is given in Fig. 3. In comparison with Fig. 2, this graph is simpler, because the dissociation of ligand *L* is possible only when none of the receptor docking sites are phosphorylated, i.e., the flux of *L* occurs only for the state *r*(0,0).

*L*, ligand;

*r*

_{0}, free receptor; and

*r*(0,0), ligand-receptor complex with two unphosphorylated docking sites. For all species in the network,

*r*(

*i*,

*j*

**...**

A transition from state *a*_{i} to state is characterized by the pseudo-first-order rate constant *k*_{i}(), and the on- and off-constants for ligand and receptor association are designated by *k*_{0} and *k*_{−0}, respectively. The temporal behavior of signaling species in this network is determined by

where *δ*_{ij} is the Kronecker symbol (*δ*_{ij} = 1, if *i* = *j*, otherwise *δ*_{ij} = 0).

## RESULTS

The multiplicity of domains and binding sites on signaling molecules, such as receptors and large scaffolding adaptor proteins, is a hallmark of signal transduction networks. In this section, we apply a domain-oriented approach to combine numerous microstates into macrostates of separate domains/sites.

### Dissecting signal transduction by a scaffold into signaling by separate docking sites

We assumed above that molecular processes involving a particular site of a scaffolding protein are independent of states of other docking sites, and that rate constants of receptor-scaffold association (dissociation) are the same for all states of these sites. Although there are well-characterized examples where this assumption is not valid (e.g., negative cooperativity of binding events because of steric hindrance), in other cases docking sites are located in different, distant domains of a protein, where they interact independently with their binding partners. This independence suggests that the time course of reactions involving a docking site, such as phosphorylation, dephosphorylation, and binding and release of downstream proteins, may be analyzed separately from the analysis of the entire scaffold. In fact, this approach will allow us to follow the fate of a particular docking site on a scaffold independently of reactions occurring at other sites. We will see now how this dissection of scaffold dynamics becomes feasible.

Adding up the concentrations of all forms of the adaptor protein *S* displaying a particular state (*a*_{i}) of docking site *i*, we introduce the macrovariables *S*_{i}(*a*_{i}, *h*),

Each of these macrovariables follows states of only one docking site on the scaffold and can be interpreted as the concentration of this site in a particular state. For instance, considering signaling by the adaptor protein GAB, a macrovariable may be the sum of all microstates, in which the GAB docking site for PI3K is phosphorylated and bound by PI3K, whereas sites for SHP-2, Shc (and so on) can be unphosphorylated, phosphorylated, free, or occupied by the unphosphorylated or phosphorylated binding partners.

Summing up Eq. 1 for all states of docking sites *j* different from site *i*, we obtain

This equation shows that when docking sites are independent, the time evolution of the signaling system can be analyzed in terms of macrovariables *S*_{i}(*a*_{i}, *h*), and does not require the analysis of the dynamics of microstates of the adaptor protein *S*. The description in terms of *S*_{i}(*a*_{i}, *h*) is referred to as a macropresentation of the signaling network that involves *R* and *S*.

A formal mathematical proof of Eq. 4 is given in Supplemental Note 1 in Supplementary Materials. In intuitive terms, recall that each microstate *s*(*a*_{1}, …, *a*_{n}; *h*) can be presented by a vertex in an (*n*+1)-dimensional transition graph. Edges coming in or going out of the vertex *s*(*a*_{1}, …, *a*_{n}; *h*) form *n*+1 directions that correspond to chemical transformations of any of *n* docking sites *i* and binding to/dissociation from *R*, which changes *h* (see Fig. 2 with *n* = 2). The macrostate *S*_{i}(*a*_{i}, *h*) is the sum of microstates over *n*−1 of those directions, which involve transitions that change states of *n*−1 docking sites other than site *i* and state *h*. In the course of summation of the rate equations (Eq. 1), terms corresponding to transitions along these *n*−1 directions cancel each other (each term enters twice as consumption and production with opposite signs). All transitions where *a*_{i} changes have identical rate constants for each of composing microstates, and the same is true for transitions changing *h*. Consequently, the remaining rate terms are the products of the sums of microstates and the corresponding common rate constants. The resulting rate equations contain only macrovariables multiplied by the rate constants common for all microstates composing those macrovariables.

The separation of variables that describe different sites *i*'s considerably reduces the number of states and equations used for a quantitative analysis of the system behavior. In a macrodescription framework, there are only 2 · (*m*_{1} + *m*_{2} + … + *m*_{n} + *n*) states of the scaffold instead of 2 · (*m*_{1} + 1) · (*m*_{2} + 1) · … · (*m*_{n} + 1) microstates (species), and the number of transitions in a transition graph decreases even more drastically. Fig. 4 illustrates this reduction for the transition graph shown in Fig. 2. Even for the simplest case of two docking sites, instead of 18 microstates, there appear to be 12 states, and the number of transitions decreases from 33 to 14. Moreover, the graph of Fig. 4 splits into two disconnected subgraphs corresponding to site 1 and site 2 transitions.

Importantly, an entire signaling network can be analyzed in terms of macrostates only, using a macrodescription of a receptor-scaffold module. In a mechanistic framework, kinetic equations that describe the dynamics of free and bound receptor forms take into account receptor interactions with each of possible forms (microstates) of the scaffold *S* (38,40). By contrast, if the kinetics of the scaffold protein *S* is described in terms of macrostates, these equations can be simplified, and the time evolution of the free receptor concentration can be determined using any of macrovariables *S*_{i} (see Supplemental Note 2 in Supplementary Materials). Likewise, when there is no cross-talk between downstream signaling pathways initiated by the association of binding partners with docking sites on the scaffold, the dynamic behavior of each of these pathways can be calculated separately in terms of single macrovariables and the corresponding interacting partners, without determining the dynamics of a whole network.

The EGFR signaling network provides an instructive illustration of how a macrodescription can reduce the number of states that need to be considered in a realistic model of a signaling network. The EGFR network contains at least two major scaffolding proteins, EGFR itself and the adaptor protein GAB-1. Description of this network up to and including ERK activation requires at least 163,000 microvariables, but only ~350 macrovariables to monitor independent and allosterically interacting domains (unpublished data). Likewise, for the IR and IGF-1R signaling networks, which involve IRS, GAB, and Grb10 adaptor proteins, the assumption of independence of some docking sites on these scaffolds results in a reduction in the number of equations from hundreds of thousands to a few hundreds, yet allows for accurate predictions of the time evolution of signaling patterns.

### Retrieving a mechanistic (micro)description from the dynamics of independent docking sites

Analyzing a model of a scaffolding protein (*S*) with independent docking sites and receptor-binding site (Eq. 1), we found that the time evolution of each docking site (*i*) can be described by the dynamics of a single macrovariable *S*_{i}(*a*_{i},*h*) without requiring monitoring of the remaining sites on *S* (Eq. 4). However, the concentration dynamics of a particular microstate may also be of interest—as in, for instance, when the assembly of two or more interacting proteins on *S* is required for activation of a downstream target. We will now show how such a microdescription can be retrieved from a macrodescription at arbitrary time *t*.

#### A scaffolding protein without controlling site h

We start with a simple model of a scaffold, formulated in The Simplest Case of Complete Independence: Sites without Hierarchical Control, above, in Methods, where there is no hierarchical control over the states of docking sites. Formally, this model does not incorporate a controlling *h*-site on *S* and assumes that phosphorylation, dephosphorylation, binding a partner, and other molecular events involving a particular docking site proceed independently of the states of any other site. In this case, the concentration of *S* in any microstate is characterized by the time-varying function *s*(*a*_{1}, …, *a*_{n}; *t*), which lacks any dependence on *h*. This scaffold can be activated by stimulation of soluble kinases, which independently phosphorylate various docking sites. Phosphorylated motifs deliver a message to cytoplasmic binding partners of the scaffold. Alternatively, inhibition of phosphatases (e.g., as a result of oxidative stress) may shift the balance between phosphorylation and dephosphorylation of docking sites. Such perturbations will cause signaling by the scaffold.

The concentration *S*_{i}(*a*_{i},*t*) of *S* in a particular macrostate (*a*_{i}) is defined in Eq. 3 as the sum of all forms of the scaffold, in which docking site *i* is in state *a*_{i}. Because of the assumed independence of docking sites, one may conjecture that the probability of finding these sites in states *a*_{1},…, *a*_{n} simultaneously, i.e., finding the scaffolding protein in a certain microstate *s*(*a*_{1},…, *a*_{n}), equals the product of the probabilities of the corresponding states for each docking site. The probabilities are equal to the fractional concentrations, which are obtained by dividing the concentrations *s*(*a*_{1},…, *a*_{n}) and *S*_{i}(*a*_{i}) by the total *S* concentration (*S*_{tot}),

Suppose that at some initial moment *t*_{0} any microprobability *s*(*a*_{1},…,*a*_{n}; *t*_{0})/*S*_{tot} can be expressed as the product of the macroprobabilities *S*_{i}(*a*_{i}, *t*_{0})/*S*_{tot},

Using Eqs. 1 and 4 (where both *k*_{r} and *k _{−}*

_{r}equal zero), it can be shown readily that Eq. 6 continues to apply for any

*t*>

*t*

_{0}(Supplemental Note 3 in Supplementary Materials). Consequently, the time evolution of the concentrations

*s*(

*a*

_{1}, …,

*a*

_{n};

*t*) of the scaffold in any of its microstates can be expressed in terms of the product of the fractional concentrations

*S*

_{i}(

*a*

_{i},

*t*)/

*S*

_{tot}of macrostates for any

*t*>

*t*

_{0},

It is instructive to note that in enzyme kinetics and models of ion channels, the equilibrium (or pseudo-equilibrium) concentrations of multimolecular complexes are expressed as the product of the saturation functions for distinct independent subunits or binding sites (41). However, in contrast to this earlier result of the steady-state enzyme kinetics, Eq. 7 tells us that any transient microstate of a scaffolding protein can be expressed as the product of the time-dependent probabilities (fractional concentrations) of macrostates of distinct docking sites.

#### Recruitment of a scaffold into a receptor-scaffold complex

In a more general case (see Eq. 1), distinct docking sites on a scaffolding protein cannot be considered completely independent. In fact, after the formation of a receptor-scaffold complex (Fig. 2), two or more docking sites can be phosphorylated while the scaffold is bound to the receptor (an alternative mechanism of entirely independent sites implies that after each phosphorylation, the scaffold dissociates from the receptor). Clearly, the fact that docking sites can be phosphorylated only if *h* equals 1 and not 0, imposes common constraints on otherwise independent docking sites.

Eq. 4 demonstrates that a macrostate model still applies to a receptor-scaffold system, and the time evolution of distinct domains and the initiated downstream signaling pathways can be resolved into separate dynamics of the corresponding macrovariables *S*_{i}(*a*_{i}, *h*). However, in the general case, the temporal patterns of microstates *s*(*a*_{1}, …, *a*_{n}; *h*) cannot be *exactly* obtained from *S*_{i}(*a*_{i}, *h*), due to the correlation between phosphorylation of docking sites during the stage of association of *S* with *R*. Yet, as we will see, the time evolution of microstates can be accurately approximated in terms of macrostates.

Supplemental Note 4 in Supplementary Materials shows that although the products of the fractional concentrations (probabilities) of macrostates may coincide with the microstate concentrations at the initial time point (e.g., before receptor activation; compare to Eq. 6), they start to deviate from each other, as time progresses. This deviation is brought about by the terms that involve the rate constants of the association and dissociation of the scaffold and receptor, *k*_{r} and *k _{−}*

_{r}(see Eqs. S4.3 and S4.4, Supplemental Note 4 in Supplementary Materials). This finding suggests that it may be helpful to analyze cases, where these rates are much faster or much slower than the rates that change the states of docking sites

*i*. Under the condition where the association/dissociation steps changing

*h*are much faster than reactions changing states

*a*

_{i}of docking sites, the ratio of the concentrations of the receptor-bound forms of the scaffold (

*h =*1) and the free forms (

*h =*0) with the same states of docking sites are fixed by the following rapid-equilibrium relations (42,43):

Using Eq. 8, we express Eqs. 1 and 4 in terms of so-called slow variables, which equal the overall concentrations of bound and unbound scaffold forms with the same states *a*_{i} and, thus, do not change in the fast association/dissociation reactions of the scaffold and receptor (see Supplemental Note 5 in Supplementary Materials). The resulting equations for slow variables are parallel to the equations that describe a scaffold with completely independent sites, and similarly to Eq. 7, the concentrations of microstates for both the receptor-bound (*h =* 1) and unbound (*h =* 0) scaffold forms can be found as follows (Supplemental Note 5 in Supplementary Materials):

Importantly, this equation suggests the use of an alternative scaling of the fractional concentrations of micro- and macrostates exploiting the normalizing factor, *S*_{tot}(*h*), which is the total concentration of all scaffold forms with a certain *h*. With this normalization, the fractional concentrations *s*(*a*_{1}, …, *a*_{n}; *h*)/*S*_{tot}(*h*) and *S*_{i}(*a*_{i}, *h*)/*S*_{tot}(*h*) present the conditional probabilities of finding the scaffold in that particular micro or macrostate, under the condition when it is either associated with (*h* = 1) or dissociated from the receptor (*h* = 0). Equation 9 shows that when the association and dissociation reactions are fast, all microstates (i.e., all network species) can be approximated by the product of the conditional probabilities (alternatively normalized fractional concentrations) of macrostates. However, even for this alternative scaling of concentrations, Eq. 9 cannot be considered an exact relationship for all feasible values of the kinetic constants (see Supplemental Note 6 in Supplementary Materials).

In the other extreme case, when binding/association of the receptor and scaffold is much slower than processes that change states *a*_{i} of docking sites, Eq. 9 continues to apply. In fact, because changes in *h* are very slow, one may consider the pseudo-equilibrium for the microstates of the scaffold bound to the receptor, *h* = 1 (in Fig. 2, these microstates are at the *front facet* of the *graph*), separately from the pseudo-equilibrium for the free scaffold forms, *h* = 0 (the *rear facet* of the *graph* in Fig. 2). Since at constant *h*, transitions between states of any docking site do not depend on other sites, the equilibrium values of *s*(*a*_{1}, …, *a*_{n}; *h*)/*S*_{tot}(*h*) equal the product of *S*_{i}(*a*_{i}, *h*)/*S*_{tot}(*h*) for the same *h*. Because of the rapid-equilibrium condition, the exact values of microstate concentrations will not differ significantly from the equilibrium concentrations at all times.

Numerical experiments demonstrate that over a wide range of parameters the concentrations of network species (microstates of the scaffold) are well approximated by Eq. 9. Fig. 5 illustrates this by covering three cases, where the apparent forward (*k*_{r}*R*) and reverse (*k _{−}*

_{r}) rate constants of binding of the scaffold to the receptor are much greater, in the same range, or much smaller than the rate constants of reactions involving scaffold docking sites. The active receptor concentration at

*t*= 0 was set to be 100 nM (a typical range for EGFR activated with saturating [EGF] in liver cells; see Ref. 37), and the total abundance of the scaffold, and the scaffold's binding partners

*A*

_{1}and

*A*

_{2}(see Fig. 2), was set to be 50 nM. To simulate a transient temporal pattern of receptor activation, the concentration of active free receptor was assumed to decrease with the first-order rate constant of 0.01 s

^{−1}. The precise concentrations of all micro- and macrostates were calculated according to Eqs. 1 and 4. The approximate values of microstate concentrations were obtained as the product of macrostate concentrations according to Eq. 9 (Fig. 5 illustrates the exact and approximate solutions for microstates

*s*(2,2;1) and

*s*(2,2;0) that correspond to the receptor-bound and unbound scaffold with docking sites occupied by their partners

*A*

_{1}and

*A*

_{2}).

*A*

_{1}and

*A*

_{2}.

*A*–

*C*illustrate three cases, where the scaffold-receptor association/dissociation reactions are much faster, comparable, or slower

**...**

The time courses for precise and approximate concentrations of scaffold microstates shows that the best approximation corresponds to the fast receptor-scaffold association/dissociation reactions (Fig. 5, *A1* and *A2*). If *k*_{r}*R* and *k _{−}*

_{r}have the same magnitude as the other rate constants in Eq. 1, the deviation of the exact solution from Eq. 9 increases, but remains reasonably small (Fig. 5,

*B1*and

*B2*). When the association/dissociation of the receptor and scaffold is much slower, microstates that correspond to phosphorylated forms can be approximated well only in the case where the scaffold is bound to the receptor (Fig. 5,

*C1*and

*C2*). Indeed, for the free scaffold the pseudo-equilibrium solution for phosphorylated forms is equal to zero, since only the phosphatase reactions carry on when the scaffold has dissociated from the receptor.

### Macrodescription of receptor possessing scaffolding properties

If ligand binding and dissociation are independent of the conformation and phosphorylation states of the cytoplasmic receptor tail including its independent docking sites, the macrodescription developed here for scaffold signaling is applicable to this receptor. As shown in Methods, all that is needed is the replacement of the word *receptor* with *ligand*, and *scaffold* with *receptor* in that description (for simplicity, it is assumed that the receptor is a dimer before stimulation with the ligand, such as IR or IGF-1R).

In an alternative model, the dissociation of the ligand from the receptor is possible only when all docking sites of the receptor are unphosphorylated (i.e., in microstate *r*(0,…,0), Fig. 3). Adding up the concentrations of all receptor forms that display a particular state (*a*_{i}) of docking site *i*, we introduce the macrovariables *R*_{i}(*a*_{i}),

These macrovariables, also referred to as the receptor macrostates, are merely the concentrations of receptor docking sites in particular states for the receptor that has bound the ligand.

Unfortunately, Eq. 2 cannot be transformed into a system of differential equations that include the macrostates *R*_{i} only. In fact, the rate of dissociation of the ligand from the receptor equals *k _{−}*

_{0}·

*r*(0,…,0), whereas any of the variables

*R*

_{i}(0) is the sum of microstate

*r*(0,…,0) and various other microstates

*r*(

*a*

_{1},…,

*a*

_{i−1},0,

*a*

_{i+1},…,

*a*

_{n}). Summing up Eq. 2 for all states of docking sites

*j*≠

*i*and docking site

*i*in state

*a*

_{i}, we obtain the equation system that explicitly involves the microstate

*r*(0,…,0):

The above analysis of signaling by a scaffold suggests that microstate concentrations can often be well approximated in terms of scaled macrostate concentrations. To scale the concentrations, we introduce the total concentration (*R*_{L}) of the receptor-ligand complexes:

Normalizing by *R*_{L}, we obtain the fractional concentrations (conditional probabilities) of micro- and macrostates of the receptor that have bound the ligand *L*, *r*(*a*_{1},…,*a*_{n})/*R*_{L} and *R*_{i}(*a*_{i})/*R*_{L}. Suppose the microstate concentration *r*(0,…,0) can be approximated by the product of the scaled macrostate concentrations *R*_{i}(0), as

Substituting this estimate in Eq. 11, we arrive at the following approximate description of receptor kinetics in terms of macrostates (the superscripted asterisk indicates the approximate concentrations),

Solving Eq. 14, we can approximate the microstate concentrations by the product of scaled macrostate concentrations (Supplemental Note 7 in Supplementary Materials shows that this relationship is not exact),

The remaining central question is what the difference (*ϕ*) between exact and approximate microstate concentrations amounts to:

Before the stimulation, at the initial moment *t* = 0, both *r*(*a*_{1},…,*a*_{n}) and *R*_{i}^{*} (*a*_{i}) equal zero, and therefore *ϕ* = 0. Remarkably, it appears that at *t*→∞, when all variables are approaching their steady-state values, the difference *ϕ* again tends to zero. The steady-state concentrations of micro- and macrostates of the receptor are obtained from Eqs. 2 and 14, respectively, by equating the time derivatives to zero. At steady state, the terms multiplied by the Kronecker symbols become zero, and equations for *r*(*a*_{1},…,*a*_{n}) and *R*_{i}^{*} (*a*_{i}) become identical to the steady-state equations for micro- and macrostates *s*(*a*_{1},…,*a*_{n}) and *S*_{i}(*a*_{i}) of a scaffold without controlling site *h* (these equations are derived from Eqs. 1 and 4, respectively, by equating *k*_{r}, *k _{−}*

_{r}, and the time derivatives to zero). Since we proved that for a scaffold with totally independent sites, the steady-state microstate concentrations are equal to the products of the scaled macrostate concentrations (see Eq. 7), the same is true for the steady-state concentrations of micro- and macrostates of the receptor, and, therefore,

*ϕ*→0 when

*t*→∞. To understand this result in intuitive terms, note that the graph, which describes transitions between different receptor forms (Fig. 3), is almost identical to the graph for a scaffold lacking controlling site

*h*(see the

*front facet*of Fig. 2). The only difference between the two graphs is the dead-end edge attached to the vertex (0,0) and describing receptor-ligand association/dissociation in Fig. 3. At the steady state, the flux along this edge equals zero, and the relationships between the concentrations of micro- and macrostates become identical for both graphs. Note that results similar to Eqs. 10–16 continue to apply for a receptor model that includes the dimerization of monomers upon ligand binding (Supplemental Note 8 in Supplementary Materials).

Numerical experiments demonstrate that, within a wide parameter range, the difference *ϕ* between the exact microstate concentrations and their approximation in terms of the products of scaled macrostate concentrations remains small for receptor signaling. Fig. 6 illustrates this for a variety of cases where receptor-ligand interactions are fast and have either low or high affinity (Fig. 6, *A* and *B*), slower (Fig. 6 *C*), or comparable affinity (Fig. 6 *D*) with the rates of the chemical reactions at receptor docking sites. In addition, the total ligand concentration is assumed to be either conserved (Fig. 6, *A*–*C*) or free ligand concentration is considered as an external variable that exponentially decreases with time after the initial burst (Fig. 6 *D*). The precise values of microstate concentrations were calculated according to Eq. 2 (marked by *open circles* in Fig. 6), and the approximate values were calculated according to Eqs. 14 and 15 (marked by *plus symbols*). Note that Fig. 6 *D* shows the temporal dynamics on the timescale of hours, because the ligand degradation is a slow process, whereas all other panels depict the time courses on the scale of seconds. In all cases considered, the approximations for the microstate *r*(2,2) that corresponds to the receptor form bound to both adaptor proteins *A*_{1} and *A*_{2} (Fig. 3), were very close to the exact solutions, although the precision of the approximation could depend on the affinity of the ligand for the receptor (compare Fig. 6, *A* and *B*).

*A*

_{1}and

*A*

_{2}.

*A*and

*B*illustrate cases of fast receptor-ligand binding/dissociation with low and high affinity, respectively.

*C*corresponds to slow receptor-ligand binding/dissociation

**...**

Interestingly, for the general case of scaffold signaling, the difference between the exact solution to Eq. 1 and its approximation (obtained from Eqs. 4 and 9) does not tend to zero at *t*→∞. In fact, there are uncompensated fluxes between vertices at the front and rear facets, since phosphorylation reactions occur only at the front facet, when a scaffold is bound to a receptor (Fig. 2). This observation explains why, in the case of signaling by a receptor described by Eq. 2, the quality of the approximation of microstates is generally better than in the case of a scaffold described by Eq. 1 (compare Figs. 5 and and66).

### General principles of reducing combinatorial complexity: which kinetic properties are critical?

Table 1 summarizes the feasibility of applying a macrodescription to different classes of signaling modules considered in this article. These results indicate which conditions need to be satisfied for the reduction of a combinatorial, mechanistic description to a macromodel of a signaling system. A key prerequisite is that a signaling protein contains domains/sites that do not influence other sites, allosterically or through interactions with bound partners. We refer to these sites as *a*-type docking sites; their states are characterized by a set of numbers, ** a** = (

*a*

_{1}, …,

*a*

_{n}). If, in addition,

*a*-type sites do not depend on the state of any other domain on a receptor or scaffold, the dynamics of these sites can be modeled separately without requiring any information on the states of other sites. Obviously, this is the simplest protein system, which we refer to as a scaffolding protein with kinetically independent docking sites and with no hierarchy in the control of these sites (Table 1). In the general case, the chemical transformations of docking sites may depend on the states of a second group of sites, called controlling sites. We refer to these controlling domains (regions) as

*h*-type sites that may influence the transitions between the states of all other sites on a signaling molecule. Two examples of such signaling systems include: 1), a receptor-scaffold module, where the transformation of scaffold docking sites depends on whether the scaffold is bound to, or dissociated from the receptor (characterized by the digital flag of the

*h*-site); and 2), a receptor that acts as a scaffold having independent docking sites, the transformations of which depend on whether the ligand is bound to the receptor (Table 1). Importantly, the controlling hierarchy does not allow for the reverse interaction, in which

*a*-sites would influence the transformations of

*h*-sites (for instance, ligand-receptor interactions are assumed to be independent of the states of receptor docking sites in the example above; see Table 1,

*fifth row*).

Although so far we explicitly considered proteins with a single controlling domain, the same principles apply to systems with multiple controlling sites. General rules for developing macromodels of systems containing proteins with these two types of domains, *a*-type and *h*-type, are given by Eqs. 3 and 4, where the variable *h* is considered the vector-variable, ** h** = (

*h*

_{1},…,

*h*

_{r}). An example of a system with two controlling sites is a scaffold that displays a receptor-binding site

*h*

_{1}, a different phosphatase-binding site

*h*

_{2}, and a number of docking sites

*i*that do not interact with each other. One or more of these docking sites can be phosphorylated or dephosphorylated while the scaffold is associated with the receptor (

*h*

_{1}= 1) and/or with the phosphatase (

*h*

_{2}= 1), respectively. The microstates of a scaffold protein are determined as

*s*(

*a*

_{1},…,

*a*

_{n};

*h*

_{1},

*h*

_{2}). The rate constants for transitions between states (

*a*

_{i}) of docking sites

*i*may depend on both

*h*

_{1}and

*h*

_{2}; e.g., dephosphorylation occurs only if

*h*

_{2}= 1. The differential equations for microstate concentrations are similar to Eq. 1, but include an extra term that describes the association/dissociation of the scaffold and phosphatase. A macromodel of this receptor-scaffold-phosphatase signaling module is constructed as follows. The macrovariables

*S*

_{i}(

*a*

_{i},

*h*

_{1},

*h*

_{2}) depend on the states of controlling sites

*h*

_{1}and

*h*

_{2}and are defined by Eq. 3, where

**= (**

*h**h*

_{1},

*h*

_{2}). The differential equations for macrostate concentrations are given by Eq. 4 with an extra term that accounts for the scaffold-phosphatase interaction. Remarkably, not only the macromodel, but also the approximation of the microstate concentrations by the product of the scaled macrostate concentrations continues to apply. The latter is given by Eq. 9, where the normalizing factor

*S*

_{tot}(

**) should be determined for each of four combinations of the states of controlling sites (**

*h**h*

_{1},

*h*

_{2}). If the association/dissociation of the scaffold with the receptor and the phosphatase is much faster than the changes in the states of docking sites, the normalizing factors and scaffold forms that differ only by states

*h*

_{1},

*h*

_{2}(i.e., bound to or unbound from the receptor and the phosphatase) are related by rapid-equilibrium relationships, and the accuracy of the approximation is very high. For arbitrary rate constants, numerical experiments suggest that the accuracy is generally good (the approximation errors are similar to those illustrated in Fig. 5 for similar values of kinetic parameters; data not shown).

Importantly, model reduction is still possible even if some of docking sites on a scaffold interact allosterically or through their bound proteins. We refer to these sites as *b*-type docking sites. We assume that the states (*b*_{j}) of *b*-type site *j* can influence the transformations between the states (*b*_{l}) of any *b*-type sites, but do not influence the transformations of *a*-type docking sites and controlling *h*-sites. For instance, the tyrosine phosphatase SHP2 that binds to GAB-1 was shown to dephosphorylate specific p85 binding sites, negatively regulating EGF-dependent PI3K activation without significant effects on binding of other partners to GAB-1 (44). Likewise, SHP2 that binds to the EGFR-GAB1 complex specifically dephosphorylates Tyr^{992} on EGFR, whereas the dephosphorylation of other docking sites is not significantly changed (45). Tyr^{992} is a specific binding site for the GTPase activating protein RasGAP; the dephosphorylation of this site increases Ras-GTP levels and thereby positively influences the MAPK activation. A general approach to circumvent the combinatorial explosion of the number of microstates, *s*(*a*_{1},…, *a*_{n}; *b*_{1},…, *b*_{q}; *h*_{1}, …, *h*_{r}), for signaling proteins with these three types of sites (independent *a*-type and allosterically interacting *b*-type docking sites and controlling *h*-domains) is the following. We introduce a mesoscopic description with two sets of macrovariables. One set, *S*(*a*_{i}, ** h**), is analogous to the macrovariables considered above (Eq. 3), and is determined by summing up the microconcentrations of all the states (

**) of allosterically interacting sites and all**

*b**a*-type docking sites except site

*i*. These variables merely follow the states of each independent docking site at certain states of controlling

*h*-sites. The second set of variables

*S*(

**,**

*b***) is determined by the sum of microconcentrations of all the states (**

*h***) of independent docking sites. These variables characterize the states of all allosterically interacting sites simultaneously at certain states of controlling**

*a**h*-sites. The differential equations obtained by the summation of the corresponding equations for a mechanistic description constitute a macromodel of a signaling module. The conditions necessary for the applicability of a macromodel, including the possible influences and dependences between different types of sites and the requirements for the rate constants of the corresponding transformations are shown in Table 2.

### Experimental verification of a macrodescription

The importance of a macrodescription goes beyond the reduction of a mechanistic model of signal transduction, providing a direct connection to experimentally observable variables. In fact, macrovariables are quantified in experimental studies by Western blot analysis using site-specific antibodies, whereas microstates of a scaffold protein cannot be assessed readily at the present state-of-the-art. When the structural and kinetic information suggests that allosteric interactions are absent (at least for some protein domains), a macrostate model can be developed. Such a macro (or mesoscopic) description of a signaling system has the advantage of providing a direct test of the model against the experiment using current techniques to quantify posttranslational protein modifications (46,47).

It is instructive to ask how we can proceed from standard cell biological experiments determining the functional states of signaling molecules using Western blots to assess if a macropresentation of a system is applicable. Answering this question requires a modification to the commonly used experimental designs. An analysis is outlined here for a simple example of a scaffold (*S*) that has two docking sites. Suppose proteins *A* and *B* were detected by Western blot in immunoprecipitates of total cell lysates obtained with an antibody against *S*. These data do not show 1), whether proteins *A* and *B* can bind simultaneously to the same molecule of *S*; and 2), if they do, whether docking sites for *A* and *B* can be considered independent, or whether allosteric interactions occur that would impede a macrodescription of this system. These issues can be addressed as follows. First, separate immunoprecipitates of *S*, *B*, and *A* are prepared at several time-points after cell stimulation. Using an antibody against *A*, we quantify the intensity of the *A*-band in the immunoprecipitates of *S* and in the total cell lysates on the same membrane, thereby determining the relative amount, *AS*/*A*_{tot} = *r*_{1}. Assuming that *A* binds *B* only through the ternary complex *ASB* and quantifying the *A*-band in the immunoprecipitates of *B* and in the total cell lysates on the same membrane, we determine the relative amount, *ASB*/*A*_{tot} = *r*_{2}. Likewise, using an antibody against *B* for a detection and the immunoprecipitates of *S* and *A*, we determine *SB*/*B*_{tot} = *r*_{3} and *ASB*/*B*_{tot} = *r*_{4}. Finally, using an antibody against *S* for detection and the immunoprecipitates of *A* and *B*, we find *AS*/*S*_{tot} = *r*_{5} and *SB*/*S*_{tot} = *r*_{6}. Using these data, we can determine the relative abundance of proteins *A*_{tot}/*B*_{tot}/*S*_{tot}, and relative amount of *ASB*/*S*_{tot} = *r*_{2} · *r*_{5}/*r*_{1} = *r*_{4} · *r*_{6}/*r*_{3}. This equation shows that there is the relationship between these data (*r*_{2} · *r*_{3} · *r*_{5} = *r*_{1} · *r*_{4} · *r*_{6}), and only five measurements are needed to determine any unknown fraction at any given time. However, the additional measurement is useful for the statistical estimates and tests for the consistency of data (including the possible presence of other complexes containing *A*, *B*, and *S*).

Now recall that the relative amounts quantified in these experiments are the macrovariables, *S*(*A*)/*S*_{tot} = *r*_{5} and *S*(*B*)/*S*_{tot} = *r*_{6}, and the microvariable *S*(*A*,*B*)/*S*_{tot} = *r*_{2} · *r*_{5}/*r*_{1} (for the states of the scaffold *S*, mnemonic notations are used, which are readily comparable to the notation utilized in Eq. 7). If the data show that *S*(*A*,*B*)/*S*_{tot} is not significantly different from the product of *S*(*A*)/*S*_{tot} and *S*(*B*)/*S*_{tot} (i.e., *r*_{5} = *r*_{4}/*r*_{3}, or equivalently, *r*_{6} = *r*_{2}/*r*_{1}) there is no allosteric interaction between docking sites for proteins *A* and *B* (see Eq. 7). Hence, a macromodel of this system can be constructed where the temporal dynamics of each docking site is determined separately. A substantial deviation of the microvariable from the product of the macrovariables tells us that there is a positive or negative cooperativity between two docking sites. If *r*_{5} *r*_{4}/*r*_{3}, there is positive cooperativity (binding a partner to one docking site facilitates binding to the other site), and if the opposite inequality holds, negative cooperativity is predicted. In such cases, it is necessary to monitor the states of two docking sites simultaneously in the framework of a mechanistic model.

## DISCUSSION

Many cellular proteins participate in the transfer and processing of information, rather than catalyzing the chemical transformation of metabolic intermediates or building cellular structures. Information is received in the form of growth factors, hormones, and other environmental signals that stimulate plasma membrane receptors. After stimulation, activated receptors and large adaptor proteins often act as scaffolds that assemble multiprotein complexes on their docking sites (Fig. 1). For instance, activated RTKs, such as growth factor receptors and IR, and scaffolding/adaptor proteins, such as GAB 1/2, IRS 1–4, and Grb10, gather a variety of complexes with downstream partners, initiating diverse signaling branches and pathways (2,3,7,48,49). The multiplicity of docking sites, phosphorylation, dephosphorylation, subsequent binding and dissociation, and the resulting random assemblage of a battery of protein complexes leads to a combinatorial increase in the number of feasible molecular species that present different states of a receptor/scaffold signaling system.

In this article, we show a way to circumvent this combinatorial complexity so as to facilitate modeling of signaling networks. Using a domain-oriented analysis, we dissect the dynamics of a highly multidimensional system into separate trajectories of individual docking sites on receptors and scaffolds. Our analysis effectively separates a highly branched signaling network into interacting pathways originated by protein complexes assembled on different docking sites, dramatically reducing the number of states and differential equations to be considered. Closely related approaches were successfully applied to model the kinetics of bivalent ligand-receptor interactions that lead to the formation of multireceptor aggregates (50,51).

We analyzed molecular scenarios that are commonly present in RTK signaling networks: a scaffold activated by a receptor or soluble kinases, and a receptor that can bind several target proteins simultaneously, thus displaying scaffolding properties. In the simplest model of a scaffold, the chemical transformations of any docking site, including its phosphorylation and dephosphorylation, are assumed to be completely independent of the states of other sites. In a more complex case, the scaffold-receptor association keeps a tight rein on otherwise independent docking sites, since their phosphorylation occurs only when the scaffold is bound to the receptor. To account for this effect, we set the notation of a controlling site (*h*) on the scaffold. When two or more different docking sites are phosphorylated while the scaffold is bound to the receptor (*h* = 1), the reaction mechanism is called processive. A model of completely independent sites assumes that docking sites are phosphorylated and dephosphorylated in a distributive mechanism, where phosphorylation (dephosphorylation) of more than one site requires a new binding interaction between soluble kinases (phosphatases) and their target proteins (52,53). Fig. 2 shows that in the model of a receptor-scaffold signaling module analyzed here, scaffold phosphorylation includes elements of both processive and distributive mechanisms. We demonstrated that regardless of the mechanism of phosphorylation, signaling by a scaffold and its binding partners can be analyzed in terms of macrostates of the system (Eqs. 3 and 4). The macrostates *S*_{i}(*a*_{i}, *h*) represent separate states (*a*_{i}) of distinct docking sites *i* on a scaffold protein (*S*) and may depend on *h* for a receptor-scaffold signaling module. Even for two docking sites, a macrodescription brings about a considerable reduction in the number of states and transitions, as illustrated by Figs. 4 and and22.

Using a macrodescription, the dynamics of microstates can be retrieved for a scaffold protein with totally independent sites. In this case, the microstate concentrations are exactly expressed as the product of fractional concentrations of macrostates (Eq. 7). For a receptor-scaffold signaling module, i.e., for a scaffold with a controlling site, a similar relationship (Eq. 9) that expresses the microstate dynamics is merely an approximation. Furthermore, the macrostate concentrations are no longer normalized by the total concentration of the scaffold protein. Depending on whether a particular macrostate corresponds to the form associated with or dissociated from the receptor, the normalizing factor is the total concentration of receptor-bound or receptor-unbound scaffold forms (Eq. 9). Both these total concentrations can be measured in co-immunoprecipitation experiments (37,46). We showed that the approximation of the dynamics of microstates in terms of the product of the scaled macrostate concentrations holds well within a wide range of kinetic parameters, as illustrated in Fig. 5. These results imply that by using antibodies that specifically recognize phosphorylation of individual residues/sites on a scaffold protein, we can predict the temporal dynamics of any given microstate, e.g., the state where all docking sites are phosphorylated and occupied by their binding partners, in terms of the product of scaled concentrations of macrostates.

We analyzed two models of a receptor that possesses scaffolding proteins: 1), a model where the ligand affinity does not depend on receptor dimerization and phosphorylation state; and 2), a model where the affinity of the ligand drastically increases after the receptor undergoes dimerization, activating conformational change, or phosphorylation. In the extreme case, the dissociation of ligand is possible only from the unphosphorylated receptor or receptor monomer (see kinetic schemes in Fig. 3 and see Supplemental Note 8, Fig. S8.1, in Supplementary Materials). Whereas the first model is mathematically equivalent to a scaffold-receptor signaling module, where both mechanistic and macrodescription can be applied, the second model does not allow a precise quantitation in terms of only macrovariables. In fact, in contrast to signaling by a scaffold protein, the differential equations for the macrovariables (Eq. 11) include a separate microstate that corresponds to the active, but unphosphorylated receptor. This unique microstate serves as the input into a transition graph (Fig. 3) for receptor docking sites, and cannot be eliminated from Eq. 11 by any linear transformation of variables. We derived a nonlinear approximation for the concentrations of macrostates (Eq. 14) and showed how the microstate concentrations are estimated (Eq. 15). Interestingly, the quality of the approximation of a mechanistic description by macrovariables appears to be even better for a receptor with independent docking sites than for a scaffold-receptor signaling system (Figs. 5 and and6).6). In fact, only for signaling by a receptor that possesses scaffolding properties does the difference *ϕ* between the exact microstate concentrations and their approximations in terms of the concentrations of macrostates tend to zero when the system approaches the steady state, whereas *ϕ* does not tend to zero for a scaffold-receptor signaling module.

The notion of multiple controlling sites is also useful for the analysis of more complex models of receptor activation. For instance, it is known that following insulin binding, IR undergoes autophosphorylation at three neighboring tyrosine residues in the activation loop of the intracellular kinase domain (54). Phosphorylation of tyrosine residues in the activation loop causes a 36-fold increase in the kinase activity of IR (54). Assuming that there are several independent docking sites on the receptor with the phosphorylation state of the activation loop determining the kinase activity, we can describe all microstates of this receptor by a set of numbers (*a*_{1}, …, *a*_{n}; *h*_{1}, *h*_{2}), i.e., by a model with two controlling sites, where *h*_{1} = 1 or 0 indicates that the ligand is associated with or dissociated from the receptor, and *h*_{2} = 1 or 0 stands for the phosphorylated or unphosphorylated activation loop. Using our approach, the kinetic behavior of this system can be analyzed at the macrostate level and, in addition, the microstate concentrations can be approximated by the product of the scaled concentrations of macrostates, with generally good quality.

In this article, we applied our analysis to signaling networks described by deterministic differential equations. Signaling events that occur in small subcellular volumes involve small numbers of molecules. These signaling systems are intrinsically noisy and should be analyzed by stochastic methods (30,55). Importantly, our results appear to be equally applicable to exact stochastic calculations of the temporal dynamics of noisy systems. Indeed, the stochastic kinetic equation, referred to as the master equation, uniquely corresponds to ordinary differential equations, which are derived for the probabilities of numbers of molecules in certain molecular states. For instance, the reaction parameters of the master equation are equal to the first-order rate constants of ordinary differential equations for unimolecular reactions and, for bimolecular reactions, to the second-order rate constants divided by the reaction volume (30). The absence of allosteric interactions implies that the kinetic parameters of reactions that occur at one docking site are independent of the states of the other docking sites on a signaling protein. Under this condition, the probabilities of transformations of macrostates are the sums of the probabilities of transformations of microstates and are calculated in a Gillespie-type exact stochastic algorithm that is related to the master equation (30). Therefore, the implications of the independence of the reaction parameters for the summation of the microstates appear to be similar for deterministic and stochastic systems. Employing a macroscopic, domain-oriented approach significantly reduces the number of distinct states and reactions and the required calculation time even when the pathway kinetics is simulated according to exact stochastic methods. In conclusion, the methods proposed here make it possible to exploit reduced, tractable models for a quantitative description of complex multiprotein signaling networks.

## SUPPLEMENTARY MATERIAL

An online supplement to this article can be found by visiting BJ Online at http://www.biophysj.org.

## Acknowledgments

This work was supported by grants No. GM59570 and No. AA08714 from the National Institutes of Health.

## References

^{771}in the PDGF

*β*-receptor. Oncogene. 21:1870–1875. [PubMed]

*γ*1 signaling pathway. J. Biol. Chem. 274:8958–8965. [PubMed]

**The Biophysical Society**

## Formats:

- Article |
- PubReader |
- ePub (beta) |
- PDF (281K) |
- Citation

- A domain-oriented approach to the reduction of combinatorial complexity in signal transduction networks.[BMC Bioinformatics. 2006]
*Conzelmann H, Saez-Rodriguez J, Sauter T, Kholodenko BN, Gilles ED.**BMC Bioinformatics. 2006 Jan 23; 7:34. Epub 2006 Jan 23.* - Trading the micro-world of combinatorial complexity for the macro-world of protein interaction domains.[Biosystems. 2006]
*Borisov NM, Markevich NI, Hoek JB, Kholodenko BN.**Biosystems. 2006 Feb-Mar; 83(2-3):152-66. Epub 2005 Oct 19.* - Exact model reduction of combinatorial reaction networks.[BMC Syst Biol. 2008]
*Conzelmann H, Fey D, Gilles ED.**BMC Syst Biol. 2008 Aug 28; 2:78. Epub 2008 Aug 28.* - The complexity of complexes in signal transduction.[Biotechnol Bioeng. 2003]
*Hlavacek WS, Faeder JR, Blinov ML, Perelson AS, Goldstein B.**Biotechnol Bioeng. 2003 Dec 30; 84(7):783-94.* - Depicting combinatorial complexity with the molecular interaction map notation.[Mol Syst Biol. 2006]
*Kohn KW, Aladjem MI, Kim S, Weinstein JN, Pommier Y.**Mol Syst Biol. 2006; 2:51. Epub 2006 Oct 3.*

- Exact Hybrid Particle/Population Simulation of Rule-Based Models of Biochemical Systems[PLoS Computational Biology. ]
*Hogg JS, Harris LA, Stover LJ, Nair NS, Faeder JR.**PLoS Computational Biology. 10(4)e1003544* - Navigating the Multilayered Organization of Eukaryotic Signaling: A New Trend in Data Integration[PLoS Computational Biology. ]
*Santra T, Kolch W, Kholodenko BN.**PLoS Computational Biology. 10(2)e1003385* - Reconstruction and analysis of nutrient-induced phosphorylation networks in Arabidopsis thaliana[Frontiers in Plant Science. ]
*Duan G, Walther D, Schulze WX.**Frontiers in Plant Science. 4540* - Hierarchic Stochastic Modelling Applied to Intracellular Ca2+ Signals[PLoS ONE. ]
*Moenke G, Falcke M, Thurley K.**PLoS ONE. 7(12)e51178* - Crosstalk and Signaling Switches in Mitogen-Activated Protein Kinase Cascades[Frontiers in Physiology. ]
*Fey D, Croucher DR, Kolch W, Kholodenko BN.**Frontiers in Physiology. 3355*

- Signaling through Receptors and Scaffolds: Independent Interactions Reduce Combi...Signaling through Receptors and Scaffolds: Independent Interactions Reduce Combinatorial ComplexityBiophysical Journal. Aug 2005; 89(2)951

Your browsing activity is empty.

Activity recording is turned off.

See more...