![]() | ![]() |
Formats:
|
||||||||||||||||||||||||||||||||||||||
Copyright © 2009 Lenaerts et al; licensee BioMed Central Ltd. Information theoretical quantification of cooperativity in signalling complexes 1SWITCH, VIB, Brussels, Belgium 2Vrije Universiteit Brussel, Brussels, Belgium 3Ørsted.DTU, Technical Uiversity of Denmark, Kgs. Lyngby, Denmark Corresponding author.Tom Lenaerts: tlenaert/at/vub.ac.be; Jesper Ferkinghoff-Borg: jfb/at/oersted.dtu.dk; Joost Schymkowitz: jschymko/at/vub.ac.be; Frederic Rousseau: froussea/at/vub.ac.be Received May 21, 2008; Accepted January 16, 2009. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. Abstract Background Intra-cellular information exchange, propelled by cascades of interacting signalling proteins, is essential for the proper functioning and survival of cells. Now that the interactome of several organisms is being mapped and several structural mechanisms of cooperativity at the molecular level in proteins have been elucidated, the formalization of this fundamental quantity, i.e. information, in these very diverse biological contexts becomes feasible. Results We show here that Shannon's mutual information quantifies information in biological system and more specifically the cooperativity inherent to the assembly of macromolecular complexes. We show how protein complexes can be considered as particular instances of noisy communication channels. Further we show, using a portion of the p27 regulatory pathway, how classical equilibrium thermodynamic quantities such as binding affinities and chemical potentials can be used to quantify information exchange but also to determine engineering properties such as channel noise and channel capacity. As such, this information measure identifies and quantifies those protein concentrations that render the biochemical system most effective in switching between the active and inactive state of the intracellular process. Conclusion The proposed framework provides a new and original approach to analyse the effects of cooperativity in the assembly of macromolecular complexes. It shows the conditions, provided by the protein concentrations, for which a particular system acts most effectively, i.e. exchanges the most information. As such this framework opens the possibility of grasping biological qualities such as system sensitivity, robustness or plasticity directly in terms of their effect on information exchange. Although these parameters might also be derived using classical thermodynamic parameters, a recasting of biological signalling in terms of information exchange offers an alternative framework for visualising network cooperativity that might in some cases be more intuitive. Background A cellular pathway, whether enzymatic or signal transducing, can in a simplistic manner be described as a causal relationship between an environmental signal (such as nutrients, osmolytes or hormones) and a cellular response (generally through gene regulation). Cellular signals are mediated through a series of successive protein-protein interactions that bridge spatial and topological boundaries (between the plasma membrane and cell nucleus for example) and that allow for crosstalk between different pathways [1,2]. This protein-based modular strategy achieves integrated cellular responses that are both specific and at the same time tuned to global environmental and cellular requirements. This specificity is organized through the cooperativity between the members of the complex and the introduction of temporal and spatial constraints on the expression levels of the different members of the signalling pathways. Over- or under-expression, for instance, of the signalling components may have disastrous effects on the cellular phenotype, e.g. the development of cancer. Cooperativity is a thermodynamic concept that is used in different biochemical contexts [3-6]. Here this notion refers to the formation of multi-protein complexes with non-additive free-energies of assembly, i.e. complexes for which the stability of the final assembly is higher than the sum of all individual binary association [6]. A classic way to study cooperativity is by the analysis of a thermodynamic cycle [7]. Consider an assembly process that involves three proteins A, B and C that together form a ternary complex ABC, where B acts as an adaptor protein providing a separate binding surface for each of the two other molecules (see left panel Figure Figure1).1
Even though this scheme identifies the presence and type of cooperation in the assembly process, it does not shed light on the molecular concentrations, possibly reflecting the intracellular conditions, required for efficient regulation or coordination between a pathway's active (ABC) and inactive (B) state. Here we provide an information theoretical method that, in the same spirit as the Hill and Scatchard plots [8], identifies and quantifies cooperativity in macromolecular assemblies and visualizes for a spectrum of concentrations when optimal coordination is obtained for the given experimental data. Different from those established methods, our approach goes beyond multiple bindings of the same ligand to a homogeneous oligomer (as in the binding of oxygen to haemoglobin [9]): We consider here the construction of heterogeneous protein assemblies mediated by multiple binding surfaces on adaptor proteins. As such, and as far as we are aware, this method provides an original and novel approach for the analysis of the cooperativity in macromolecular complexes that are part of some signalling cascade. Results and discussion General description of the approach In analogy with cellular pathways, each protein in a cellular network can be considered as an element receiving an input signal (from upstream ligands) and generating an output signal (towards downstream effectors). Hence, we can reinterpret the ternary protein complex ABC as an instance of communication over a noisy channel [10,11], where protein B provides the communication channel through which information is exchanged between upstream ligand A and downstream effector C (See left panel in Figure Figure1).1 We quantify the degree of cooperativity of the system by the amount of information that is exchanged between the elements of the complex. In terms of the protein concentrations, this mutual information expresses how well the ratio of the steady-state concentrations of the ternary complex ABC and the free adaptor protein B are balanced while at the same time requiring the concentration of both binary complexes (AB and BC) to be as low as possible. So, on the one hand, low information exchange corresponds to an equilibrium situation where the protein (B) and complex concentrations (ABC) are out of balance or where to many intermediate complexes are present making it hard for the biological system to perform its function. On the other hand, high amounts of information exchange correspond to an optimized system where all members achieve the required coordination to switch efficiently between active and inactive states of the cellular process. Note, that the approach described here for a ternary protein complex can be further generalised to describe communication channels having multiple inputs or outputs (i.e. to study signal integrators or differentiators). In that case the mutual information between the different components needs to be deduced by a multivariate approach (see Methods) [11,12]. It is also important to note that the mutual information does not change from swapping the input with the output components, i.e. I(A;C) = I(C;A). Biophysical model system To clarify the biophysical meaning and illustrate our method we here describe the information exchange over a part of the p27 regulatory pathway. The p27 pathway controls the degradation of the cyclin-dependent kinase 2 (Cdk2) inhibitor p27 [13-16] thereby playing an important role in cell cycle progression [17,18]. In particular, phosphorylation of p27 triggers Cks1-mediated binding of p27 to Skp2. As Skp2 is part of the SCFSkp2 ubiquitin ligase this results in p27 degradation and cell cycle progression. In a recent study [19], the assembling mechanism for part of the SCFSkp2 multiprotein complex has been analyzed in order to understand 1) how and in which order the different units assemble and 2) how the specific order of this process influences the mutual affinities between the components and intermediately formed complexes. Seeliger et al. [19] showed that the Skp2-Cks1 complex increases the affinity of Cks1 for the Cdk2 inhibitor p27 a 100-fold. Additional inclusion of Cdk2 increases the affinity for p27 even more. Through mutational analysis the authors also showed long-range coupling between distant functional sites in Cks1, making it a principal example how adaptor proteins can play a central role in tightly controlling the assembly of a critical complex. As a consequence, it forms a biophysically meaningful case to investigate the communication between the different binding sites of the Cks1 structure in terms of Shannon's information theory (see Methods). Note here that Shannon's information theory can also be used to derive the communication pathway in Cks1. We recently demonstrated this lower-level analysis for the SH2 domain of Fyn [20]. Given the appropriate structural data, the same analysis could occur which should reveal the communication between the three binding sites [21]. The biophysical data obtained in [19], i.e. the dissociation constants, is used to perform the current analysis, (see also Table 1 for the data). The thermodynamic cycle including the adaptor protein Cks1 (acting as component B) [21], the proteins Skp2 (acting as component C) and p27 (acting as component A) produced from this data shows that both paths around the cycle are cooperative: Having Skp2 bound to Cks1 makes it easier for p27 to bind and vice versa. In a first step, we focus on the thermodynamic cycle for the formation of this ternary complex p27-Cks1-Skp2 (see Methods). Since in vivo p27 is bound to Cdk2, we will in a second step consider the quaternary complex Cdk2-p27-Cks1-Skp2. In that case two signals (Cdk2 and p27) are integrated and conveyed over the communication channel Cks1. As the mutual binding affinities of this system, i.e. KdSkp2-Cks1, KdCks1-p27, KdSkp2-Cks1p27 and KdSkp2Cks1-p27, have been determined experimentally (see [19] and Table 1), we can quantify the information exchange between the input and output components of the system and study the transmission efficiency, meaning under which conditions we observe the highest degree of cooperativity, of the adaptor protein Cks1 under a wide range of chemical potentials (see Methods). Note that only one of the dissociation constants, KdSkp2-Cks1p27 or KdSkp2Cks1-p27, is required for the derivation of the different steady-state concentrations (see Methods).
How much information is exchanged in the p27-Cks1-Skp2 complex? Figure Figure22
As can be seen in Figure Figure2,2 How does channel concentration affect robustness of the system? Interestingly, although the area of maximum cooperativity of p27-Cks1-Skp2 represents only a minor part of the phase space, it displays a relatively slow decline for increasing p27 concentration (see also Figure Figure3.3
The robustness towards the Skp2 concentration increases as the concentration of Cks1 increases, as is shown in Figure Figure4.4
How much information is exchanged in the Cdk2-p27-Cks1-Skp2 complex? The right panel of Figure Figure22
Channel capacity and noise of the p27-Cks1-Skp2 complex The maxima of these contour diagrams represent the capacity of the system, i.e. the maximum amount of information that can be transmitted over the channel with an arbitrary small probability of error [22]. As can be seen in Figure Figure22 Figure Figure33
We added in each plot a blue line that marks the concentration of p27 where the maximum information transmission is found ([p27]* = 5.79 μM and [Skp2]* = 0.0512 μM) for all combinations of [Skp2] and [p27] (see also Figure Figure22 By adding Cdk2 in the system, the capacity becomes ~0.75 bits (see Figure Figure22 Conclusion All these results show that, given the binding affinities at equilibrium and the overall concentrations of the different components, mutual information quantifies for which protein concentrations the systems' cooperativity, or more specific its coordination, is optimal. Our analysis clearly shows (see Figure Figure2)2 Methods Defining Cks1 as an asymmetric noisy channel In information theory communication occurs through noisy channels[10], where the noise is the result of an error in the transmission. Different kinds of channels exist, but here the focus is on discrete and memoryless channels. Concretely, a noisy channel is defined by an input alphabet AX, output alphabet AY, a set of conditional probability distributions P(y|x). The conditional probability distributions provide for every input signal (x AX) the probability that a particular output signal (y AY) is produced. When the alphabets contain only two symbols and the probability of having a miscommunication is the same for both input symbols, the channel is also referred to as a binary symmetric noisy channel. Given this description, the cooperative pathway within Cks1 can now be defined as a noisy channel where the input and output alphabets both consist of the symbols 0 and 1, referring respectively to the unbound and bound states of both binding sites of Cks1. To keep things simple, the channel description of Cks1 uses only two of the three proteins that bind to Cks1 in the SCFSkp2 model system: Skp2 and phosphorylated p27. Since the symbols refer to bound and unbound state of either Skp2 or p27 to Cks1, there are four probabilities relevant here: The probability that both Skp2 and p27 are bound, that both are unbound and that Skp2 is bound (unbound) to Cks1 and p27 is unbound (bound). These probabilities are visualized in Figure Figure11
where f and g are defined as This relation between concentrations implies that the errors, and later also the information exchange, depends on the concentration of the proteins that may be produced by the system. Determining the equilibrium concentrations We determine the concentrations of the different proteins and protein complexes using the dissociation constants determined by Seeliger et al. [19]. Using these binding affinities a system of equations is derived, which is numerically solved by determining the roots of these equations. For the simplified model system, which only incorporates Cks1, Skp2 and phosphorylated p27, this system of six equations is the following:
The system contains six parameters, namely [Cks1]0, [Skp2]0, [p27]0, Kd [Skp2-Cks1], Kd [Cks1-p27] and Kd [Skp2Cks1-p27]. The first three parameters correspond to the total concentrations of the proteins in the model system both in isolation and in complexes. The latter three parameters are the three dissociation constants specific to the SCFSkp2 model system. Note that the dissociation constant Kd [Skp2Cks1-p27] refers here to the dissociation of p27 from the complex Skp2-Cks1-p27. The results remain the same if the alternative dissociation constant, dissociating Skp2 from Skp2-Cks1-p27, is used. When the values for these parameters are inserted from ref. 19, a root finding algorithm is applied to determine the equilibrium concentrations of all the members of this system: [Cks1], [Skp2], [p27], [Skp2-Cks1], [Cks1-p27] and [Skp2-Cks1-p27]. Once these concentrations are obtained, the probabilities in the matrix Q can be determined. Calculating mutual information Mutual information expresses the amount of information that the output conveys about the input (and vice versa). It is formally expressed in terms of entropy:
where the entropies are calculated as: The base of the logarithm determines the units in which mutual information is expressed. Usually it is either a natural (ln x) or a binary logarithm (log2 x), making the units either nats (natural digits) or bits (binary digits). Here, a binary logarithm is used. So mutual information (see Equation 3) expresses how much we learn about the output (or input) of a channel when we receive information about the input (or output). This is calculated by subtracting the entropy (uncertainty) on the state of the output (or input) from the entropy (uncertainty) of the output (or input) when we know the input (or output). So all entropy scores are related to the state of the channel (here Cks1) and not the state of the input and output proteins, respectively Skp2 and p27. Concretely, all entropy values can be easily derived from the probabilities related to the input and output state of Cks1. For instance, if X corresponds to Skp2, then P(Skp2 = 0) and P(Skp2 = 1) correspond to the probabilities that Skp2 is bound or not bound to Cks1. This leads to the following formulation of the entropy for Skp2:
where The entropy H(p27) is derived in the same way. The joint entropy is
where Multivariate mutual information To derive the information exchange between three or more proteins a multivariate approach needs to be followed[11,12]. This approach allows the analysis of the signal between two input proteins and an output protein. As in the previous formulation, the mutual information is determined using entropy:
where X represents the output signal and Y and Z represent two input signals or visa versa. In addition, the effect of either one of the components on the two other ones can be analysed by eliminating this component. For instance if one wants to determine the effect of Cdk2 on the communication between Skp2 and phosphorylated p27, the mutual information I(Skp2;p27) and the averaged transmitted information ICdk2(Skp2;p27) need to be determined (see ref. 9 for the details). If I(Skp2;p27) is not equal to ICdk2(Skp2;p27) then Cdk2 has an effect on the transmission between the two other proteins. This difference, called the interaction information A(Skp2, Cdk2, p27), is the gain (or loss) in the sample information transmitted between any two of the proteins, caused by the additional knowledge of the third one. Combining the interaction information with I(Skp2;p27) and I(Skp2;Cdk2) produces again the multivariate mutual information I(Skp2; Cdk2, p27). As a consequence, A(Skp2, Cdk2, p27) expresses how the two signals are modulated, which can be either in a negative of positive way. Authors' contributions TL carried out the mathematical modelling and contributed in the conception and development of the principles of this work. JFB, JS and FR conceived the principles of the work and assisted in the development of the mathematical model. TL, JS and FR drafted the manuscript. All authors read and approved the manuscript. Acknowledgements Discussions with Dr. Sebastian Maurer-Stroh are gratefully acknowledged. We also acknowledge valuable input from Prof. James R. Williamson and Dr. Mark Isalan to improve the clarity of the manuscript. References
|
PubMed related articles
Your browsing activity is empty. Activity recording is turned off. |
|||||||||||||||||||||||||||||||||||||
Proc Natl Acad Sci U S A. 2003 Oct 14; 100(21):12123-8.
[Proc Natl Acad Sci U S A. 2003]Curr Opin Struct Biol. 2005 Feb; 15(1):23-30.
[Curr Opin Struct Biol. 2005]Methods. 1998 Sep; 16(1):3-20.
[Methods. 1998]Nat Chem Biol. 2008 Aug; 4(8):458-65.
[Nat Chem Biol. 2008]J Mol Biol. 1990 Aug 5; 214(3):613-7.
[J Mol Biol. 1990]Methods Enzymol. 1978; 48():270-99.
[Methods Enzymol. 1978]J Biol Chem. 1974 Dec 10; 249(23):7607-12.
[J Biol Chem. 1974]Nat Cell Biol. 2001 Apr; 3(4):E95-8.
[Nat Cell Biol. 2001]Mol Cell. 2001 Mar; 7(3):639-50.
[Mol Cell. 2001]J Cell Physiol. 2000 Apr; 183(1):10-7.
[J Cell Physiol. 2000]Cancer Cell. 2003 Oct; 4(4):251-6.
[Cancer Cell. 2003]Nat Struct Biol. 2003 Sep; 10(9):718-24.
[Nat Struct Biol. 2003]Nat Struct Biol. 2003 Sep; 10(9):718-24.
[Nat Struct Biol. 2003]J Biol Chem. 2002 Nov 1; 277(44):42233-40.
[J Biol Chem. 2002]Nat Struct Biol. 2003 Sep; 10(9):718-24.
[Nat Struct Biol. 2003]J Biol Chem. 2002 Nov 1; 277(44):42233-40.
[J Biol Chem. 2002]Proc Natl Acad Sci U S A. 2001 Apr 24; 98(9):5043-8.
[Proc Natl Acad Sci U S A. 2001]Clin Cancer Res. 2006 Jan 15; 12(2):487-98.
[Clin Cancer Res. 2006]Nat Struct Biol. 2003 Sep; 10(9):718-24.
[Nat Struct Biol. 2003]