![]() | ![]() |
Formats:
|
||||||||||||||||||||||||||||||||||||||
Copyright © 2002, The National Academy of Sciences Cell Biology Untangling the wires: A strategy to trace functional interactions in signaling and gene networks *Department of Pathology, Anatomy and Cell Biology, Thomas Jefferson University, 1020 Locust Street, Philadelphia, PA 19107; ‡Department of Microbial Physiology, Free University, Biocentrum, 1081 HV, Amsterdam, The Netherlands; and §Department of Mathematics, Rutgers University, Piscataway, NJ 08854 †To whom reprint requests should be addressed. E-mail: Boris.Kholodenko/at/mail.tju.edu. Communicated by Rudolf E. Kalman, Swiss Federal Institute of Technology, Gainesville, FL Received March 30, 2002; Accepted July 25, 2002. This article has been corrected. See Proc Natl Acad Sci U S A. 2002 November 12; 99(23): 15244. This article has been cited by other articles in PMC.Abstract Emerging technologies have enabled the acquisition of large genomics and proteomics data sets. However, current methodologies for analysis do not permit interpretation of the data in ways that unravel cellular networking. We propose a quantitative method for determining functional interactions in cellular signaling and gene networks. It can be used to explore cell systems at a mechanistic level or applied within a “modular” framework, which dramatically decreases the number of variables to be assayed. This method is based on a mathematical derivation that demonstrates how the topology and strength of network connections can be retrieved from experimentally measured network responses to successive perturbations of all modules. Importantly, our analysis can reveal functional interactions even when the components of the system are not all known. Under these circumstances, some connections retrieved by the analysis will not be direct but correspond to the interaction routes through unidentified elements. The method is tested and illustrated by using computer-generated responses of a modeled mitogen-activated protein kinase cascade and gene network. Advances in high-throughput genomics and proteomics analysis facilitate the monitoring of the expression levels of large gene sets and the activity states of signaling proteins in living cells. The explosive growth in the amount of data calls for the development of novel quantitative approaches for analysis. Thus far, our understanding of cellular signaling and gene networks has been almost exclusively qualitative. Recently, both qualitative and mechanistic mathematical modeling have been applied to better understand network molecular organization and kinetics in quantitative terms (1–8). The mechanistic “bottom-up” approach has the advantage of being readily testable against experiments as a computer replica of cellular networks. However, a major disadvantage of a mechanistic modeling is the large number of molecular processes to be considered, complicated by the fact that values for multiple kinetic parameters are unknown. Moreover, a bottom-up approach inevitably misses the interactions and regulatory feedbacks still awaiting discovery. As an approach to studying cellular networks, we developed a form of “top-down” sensitivity analysis to quantify the input–output relations and molecular interactions in regulatory networks (9). The control of the input signal over the output target was quantified as the ratio of the input-to-output changes at steady state (called the response coefficient in the limit of infinitesimal changes). The signal may be a hormone, growth factor, neurotransmitter, or experimental intervention (e.g., an inhibitor), and the target process may be the phosphorylation state or activity of a protein, mRNA level, or transcription rate. If a full set of molecular interactions is known, the (global) network response to a signal or experimental perturbation can be predicted and expressed in terms of the individual (local) responses by using a “map” of network connections (9). However, when detailed information about molecular mechanisms is lacking, a top-down analysis has advantages, because it can be applied experimentally to any cellular network regardless of its degree of complexity (10, 11). The daunting challenge of understanding the coordinated behavior of numerous molecular interactions can be facilitated by analyzing them within a “modular” framework (12, 13). A complex cellular network can be divided conceptually into reaction groups referred to as functional units or modules. Each module consists of several signaling or gene interactions and performs one or more identifiable tasks. For instance, each of the three tiers of the mitogen-activated protein kinase (MAPK) cascade can be considered as a functional module that involves unphosphorylated, monophosphorylated, and bisphosphorylated forms of a protein kinase and the reactions converting these forms. Modules need not be rigid, and entire MAPK cascades can serve as functional modules in a signaling network that involves growth factor and stress-activated pathways. For gene networks, modules can involve mRNAs of a particular gene or gene cluster with regulatory interaction loops running through metabolic and signaling pathways (14). Modules can be interconnected in multiple ways, many of which may be unknown, even when the network components are identified in genetic and biochemical studies. Fig. Fig.11
One of the fundamental problems in cell biology is to infer and quantify interconnections in complex regulatory networks. The present paper proposes a powerful method for attacking such questions, assuming that knowledge of at least some network components is on hand. Specifically, we develop a methodology capable of unraveling and quantifying unknown molecular or modular connections in signaling and gene networks. We demonstrate how, by making systematic perturbations (using inhibitors, activators, changes in external signals, etc.) and measuring global responses only, one can discover a network “interaction map” that can be expressed in terms of module-to-module connection strengths. Importantly, we select experimental interventions that directly perturb single modules, and we apply as many perturbations as there are modules. We illustrate this approach by applying perturbations to model networks and comparing quantitative reconstructions to known interaction maps. Methods Fundamentals of Top-Down Regulatory Analysis of Modular Cellular Networks. Quantitation of a network interaction map. We conceptually divide a signaling or gene network into modules (m). The degree of complexity of each module is not restricted, and generally a module involves many cellular components (intermediates) connected by chemical reactions (intramodular interactions). We assume that only a single intermediate, referred to as “communicating,” serves as the module output (this simplifying assumption is relaxed in Appendix 1, which is published as supporting information on the PNAS web site, www.pnas.org). A communicating molecule may be the active form of a kinase, a second messenger, mRNA, or transcription factor influencing other modules. Thus, communicating intermediates form molecular connections between modules, referred to as intermodular interactions. A top-down regulatory analysis “black-boxes” the molecular organization of network modules, considering only communicating intermediates (the module outputs). We designate by xi, i = 1, … , m, the activities (concentrations) of communicating intermediates. Following our previous work (9), we quantify intermodular interactions in terms of the fractional changes (Δxi/xi) in the activity of communicating intermediate (xi) of a particular module (i) brought about by a change in the (output) activity (xj) of another module (j). Output activities of all other modules (xk, k ≠ i,j) are assumed to remain fixed, whereas the affected module (i) is allowed to relax to its steady state. A mathematical definition requires the changes (Δx/x) to be infinitesimally small, resulting in log-to-log derivatives,
Because each module is assumed to have a single communicating intermediate, all interactions between network modules are quantified by m (m − 1) intermodular response coefficients, rij. These “connection” coefficients indicate how the network is “wired” and compose the m × m matrix, r, hereafter referred to as the network interaction map. The ith row of the matrix r quantifies how module i is affected by each network module through immediate interaction, whereas the jth column of r measures how module j directly influences each network module. We assign values of −1 to the diagonal elements (rii) of the matrix r, rii = −1, i = 1, … , m.Local and global network responses to perturbations. Conceptually considering module i “in isolation” from the network, we determine the local response coefficient (ripi) of xi to a perturbation of parameter pi, intrinsic to module i as follows: ripi = ( lnxi/ pi)module i steady state. When module i is isolated from the network, changes in parameters pj, influencing other modules (j), have no effect on module i, and therefore the local response of xi to a perturbation in pj equals zero. Local responses to perturbations, affecting single modules only, form the diagonal m × m matrix, dgrp, with diagonal elements ripi and all off-diagonal elements equal to zero.If following a parameter (pi) perturbation intrinsic to module i an entire network is allowed to relax, this perturbation not only causes changes in those modules directly affected by module i but also propagates further into the network through interactions between other modules. The resulting time-dependent or stationary responses are called “global” responses of the network. We designate by Rjpi the global response coefficient of module j to a perturbation in pi and by Rp the m × m matrix composed of these coefficients:
Models of Signaling and Gene Networks Used to Test and Illustrate the Proposed Approach. Computer simulation of MAPK cascade responses to specific perturbations. MAPK cascades consist of several levels, where the activated kinase at each level phosphorylates the kinase at the next level down the cascade. The three-tiered MAPK cascade comprises MAPK (the terminal level), MAPK kinase (MKK) and MKK kinase (MKKK) (Fig. (Fig.2).2
Computer simulation of responses of a gene network to specific perturbations. A kinetic scheme of a four-gene network is depicted in Fig. Fig.3.3
Results Relation Between the Local and Global Network Responses. Global responses to perturbations can be measured in experiments with intact cellular systems. However, local responses governed by the network interaction map cannot be captured by using intact cells. To measure the kinetics of local interactions between two modules (proteins) directly, they should be isolated from the network. Sometimes the interaction of interest can be reconstituted “in vitro,” but often only an entire system is accessible experimentally. We are left with the question of how to determine quantitatively the network interaction map if only global responses can be assessed. We demonstrate that by making parameter perturbations to all modules and measuring the global network responses, we can retrieve the unknown interaction map (see Appendix 2, which is published as supporting information on the PNAS web site, for the abstract mathematical derivation). An experimental intervention to perturb a parameter (pi) intrinsic to module i can employ a specific inhibitor or activator of a reaction within module i, an antisense mRNA affecting the expression level of a protein, or a plasmid changing the rate of transcription. A parameter change, Δpi, first causes a local perturbation in xi, which subsequently propagates through intermodular interactions described by the local response coefficients (rij). After the network has relaxed to a new steady state, the resulting global changes in communicating intermediates (xk) brought about by a perturbation (Δpi) are related through
Rp−1. Because all the diagonal elements of the matrix r are equal to −1, the elements of the diagonal matrix, dgrp, are expressed readily in terms of the diagonal elements of the matrix, Rp−1. Designating by dg(Rp−1) the diagonal matrix with diagonal elements (Rp−1)ii and all off-diagonal elements equal to zero, we have I = dgrp dg(Rp−1), where I is the identity matrix. By expressing dgrp from this equation, we obtain
Importantly, our method does not require the parameter changes (Δpi) to be measured or estimated. Instead of response coefficients, one can simply consider the global (Δilnxj) fractional changes in communicating intermediates (xj) caused by a parameter change Δpi. Accordingly, we redefine the global response matrix, Rp, with coefficients Rjpi to be determined by the global fractional changes brought about by a perturbation Δ pi,
Practical Application of the Proposed Methodology. We now outline three steps of experimental applications for the proposed method. ![]() i. Conceptually divide the network under consideration into interacting modules and identify communicating intermediates. ii. Use an inhibitor or other perturbation that affects a single network module only, e.g., module 1, and measure the difference in the steady-state levels of communicating intermediates before [x
… , m) by using a perturbation directly affecting that module only, and calculate the remaining columns of the matrix Rp (Δilnx1, … , Δilnxm)T. The presentation in terms of the relative values given in Eq. 7 may help where quantitation of the absolute activities is difficult, e.g., when Western blotting is used to quantify the relative amount of a protein or determining the ratio of the fluorescence intensities from gene arrays (14). iii. Apply Eq. 5 to reveal and quantify the network interaction map in terms of the matrix r of intermodular (local) response coefficients.Unraveling the MAPK Cascade Interaction Map: An Illustration. MAPK cascades are widely involved in eukaryotic signal transduction, and these pathways are conserved from yeast to mammals (20). Mammalian cells express at least four different MAPK families including the ERK cascade (which is our primary example) and the c-Jun N-terminal kinase (JNK) and p38 MAPK cascades. In many cell types, MAPK cascades are regulated by multiple feedbacks. For instance, in mammalian cells inhibitory phosphorylation of the GDP/GTP exchange factor, SOS, by ERK provides a mechanism for switching off Ras and, thereby, Raf-1 signaling, creating a negative feedback as shown schematically in Fig. Fig.22 Our method may provide a universal tool to analyze the interaction map of MAPK pathways in various cells. To test and illustrate the method, we retrieve the interaction map from computer-generated responses of a kinetic model of the MAPK cascade to perturbations, which simulate experimental interventions. The first step is to identify modules and communicating intermediates based on biological information. We define three MAPK cascade modules that involve different phosphorylation forms of MKKK, MKK, and MAPK, respectively, and the reactions converting these forms (e.g., module 2 includes MKK, MKK-P, and MKK-PP and reactions 5–8, Fig. Fig.2).2 In the second step, we apply three different perturbations, each affecting a single module. As a perturbation to the first module, we inhibited the input signal by decreasing the Ras-GTP concentration. As relevant interventions to module 2, either the maximal activities of the phosphatase, which dephosphorylates MKK-PP and MKK-P, or the kinase that acts on MKK were inhibited. The different perturbations were applied to illustrate that network interactions to be detected with the method would not depend on what particular molecular processes within a module are affected. Module 3 was perturbed by inhibiting either the maximal activity of the MAPK phosphatase or the kinase. After each perturbation, the MAPK cascade was allowed to reach a new steady state, and the global responses of communicating intermediates were calculated according to Eq. 7. Fig. Fig.44
Four different matrices (Rp), displayed in Fig. Fig.4,4
Unraveling the Wiring of a Gene Network. Our approach can be applied to untangle gene network interactions (wiring) by carrying out specially designed gene microarray experiments. Gene networks are high-level conceptual representations of interactions between genes (14). These interactions proceed through multiple protein products (e.g., transcription factors) and metabolic intermediates, which are not considered explicitly in the analysis, such that the mRNAs themselves act as communicating intermediates. Fig. Fig.33
The network interaction map was determined by taking the inverse of Rp and substituting it into Eq. 5. Both simulated perturbations, i.e., inhibition or activation, resulted in nearly identical experimental interaction matrices (rounded to the nearest tenth; Fig. Fig.88
Discussion Recently, high-throughput technologies have enabled the acquisition of data on the expression of thousands of genes and the functional state of hundreds of proteins. However, there are no methods capable of providing quantitative interpretations of genomics and proteomics data sets in a manner that unravels the wiring of cellular machinery. This paper proposed a powerful quantitative method to unravel interactions in signaling and gene networks. A dynamic connection between two network components is quantified by the extent to which a small change in one component affects the level or activity of the other, provided all remaining interactions are kept unchanged. The resulting quantifier is known as a response coefficient, which is a convenient and unambiguous measure of the sensitivity of a particular component to a local, direct effect by another component. A network component may be a protein, a gene, or a module involving a number of interacting proteins and genes when considered within a modular framework (12, 13). The present paper demonstrates that monitoring of signaling and gene-expression responses to systematic perturbations is sufficient to infer and quantify signal transduction maps and gene connections. A series of studies was concerned with the determination of complex reaction mechanisms by experimental evaluation of the Jacobian matrix elements from time-series analysis (23, 24). Methods for the deduction of chemical reaction pathways from measurements of species concentrations were pioneered by Ross and coworkers (25, 26). These studies used ranked time-lagged correlation functions among pairs of chemical species coupled with a multidimensional scaling analysis and heuristic algorithms to deduce a diagram describing the interactions between chemical species. The method that we propose here exploits the modular organization of signaling and gene networks and the absence of mass flow between modules (assuming proteins are not significantly sequestered in intermodular interactions). We demonstrated that steady-state measurements of only communicating intermediates (the number of which is much smaller than the number of all independent protein forms) are sufficient to quantify interactions within a modular framework. Our technique involves a matrix inversion (Eq. 5). This inversion may give rise to numerical errors if the experimentally measured matrix Rp is ill-conditioned. Various preconditioning methods might be used to rescale data, but a singular-value decomposition of Rp can avoid these potential errors by dropping the least meaningful modes. In general, a matrix of lower rank will result, which will constrain the estimates of the local response matrix r (which is the normalized Jacobian matrix, see Appendix 2, which is published as supporting information on the PNAS web site) to a lower dimensional subspace. A similar approach is based on the observation that a vector that quantifies dynamic connections leading to a particular module (i.e., a row of the matrix r) is orthogonal to the linear subspace (H) spanned by vectors composed of measured network responses to perturbations influencing other modules (columns of Rp) (27). If the rank of H decreases because of ill conditioning, additional experiments should be performed by applying different perturbations. Any perturbation directly affecting a single module is appropriate. For instance, in signaling networks one can inhibit or mutate enzymatic activities or change the abundance of a protein operating within a single module. For gene networks, suitable experimental interventions involve the inhibition of a transcription rate or transfection with a plasmid expressing a gene that results in an increase in the mRNA synthesis rate. Applying such perturbations to a model gene network, all gene interactions were unraveled and quantified (Fig. (Fig.3).3 Other biological applications of the method include systems in which some components are unknown or uncharacterized. To illustrate these applications, we revisit the example of a four-gene network, assuming that only three genes (1, 2, and 3) are known (cf. Figs. Figs.11 r43 and r24 r43, respectively, determined for a four-gene network (cf. Figs. Figs.88
Supporting Information
Acknowledgments We thank J. Pastorino, B. Ingalls, and H. Sauro for stimulating discussions. This work was supported by National Institutes of Health Grants GM59570, AA08714, and P20-GM6437. E.S. also acknowledges support from the BioMaPS Institute at Rutgers University. Abbreviations References 1. Smolen P, Baxter D A, Byrne J H. Am J Physiol. 1998;274:C531–C542. [PubMed] 2. Kholodenko B N, Demin O V, Moehren G, Hoek J B. J Biol Chem. 1999;274:30169–30181. [PubMed] 3. von Dassow G, Meir E, Munro E M, Odell G M. Nature (London). 2000;406:188–192. [PubMed] 4. Haugh J M, Wells A, Lauffenburger D A. Biotechnol Bioeng. 2000;70:225–238. [PubMed] 5. Hasty J, McMillen D, Isaacs F, Collins J J. Nat Rev Genet. 2001;2:268–279. [PubMed] 6. Tyson J J, Chen K, Novak B. Nat Rev Mol Cell Biol. 2001;2:908–916. [PubMed] 7. Moehren G, Markevich N, Demin O, Kiyatkin A, Goryanin I, Hoek J B, Kholodenko B N. Biochemistry. 2002;41:306–320. [PubMed] 8. Shvartsman S Y, Hagan M P, Yacoub A, Dent P, Wiley H S, Lauffenburger D A. Am J Physiol. 2002;282:C545–C559. 9. Kholodenko B N, Hoek J B, Westerhoff H V, Brown G C. FEBS Lett. 1997;414:430–434. [PubMed] 10. Brand M D. J Theor Biol. 1996;182:351–360. [PubMed] 11. Krauss S, Brand M D. FASEB J. 2000;14:2581–2588. [PubMed] 12. Hartwell L H, Hopfield J J, Leibler S, Murray A W. Nature (London). 1999;402:C47–C52. [PubMed] 13. Lauffenburger D A. Proc Natl Acad Sci USA. 2000;97:5031–5033. [PubMed] 14. De la Fuente A, Brazhnik P, Mendes P. Trends Genet. 2002;18:395–398. [PubMed] 15. Marais R, Marshall C J. Cancer Surv. 1996;27:101–125. [PubMed] 16. Huang C Y, Ferrell J E., Jr Proc Natl Acad Sci USA. 1996;93:10078–10083. [PubMed] 17. Kholodenko B N. Eur J Biochem. 2000;267:1583–1588. [PubMed] 18. Hu Y, Bowtell D D. Oncogene. 1996;12:1865–1872. [PubMed] 19. Kolch W. Biochem J. 2000;351:289–305. [PubMed] 20. Chang L, Karin M. Nature (London). 2001;410:37–40. [PubMed] 21. Ferrell J E, Jr, Machleder E M. Science. 1998;280:895–898. [PubMed] 22. Bagowski C P, Ferrell J E., Jr Curr Biol. 2001;11:1176–1182. [PubMed] 23. Tyson J J. J Chem Phys. 1975;62:1010–1015. 24. Chevalier T, Schreiber T, Ross J. J Phys Chem. 1993;97:6776–6787. 25. Arkin A, Ross J. J Phys Chem. 1995;99:970–979. 26. Samoilov M, Arkin A, Ross J. Chaos. 2001;11:108–114. [PubMed] 27. Kholodenko, B. N. & Sontag, E. D. (2002) arXiv: physics/0205003. |
PubMed related articles
Your browsing activity is empty. Activity recording is turned off. |
|||||||||||||||||||||||||||||||||||||
Am J Physiol. 1998 Feb; 274(2 Pt 1):C531-42.
[Am J Physiol. 1998]FEBS Lett. 1997 Sep 8; 414(2):430-4.
[FEBS Lett. 1997]J Theor Biol. 1996 Oct 7; 182(3):351-60.
[J Theor Biol. 1996]FASEB J. 2000 Dec; 14(15):2581-8.
[FASEB J. 2000]Nature. 1999 Dec 2; 402(6761 Suppl):C47-52.
[Nature. 1999]Proc Natl Acad Sci U S A. 2000 May 9; 97(10):5031-3.
[Proc Natl Acad Sci U S A. 2000]Trends Genet. 2002 Aug; 18(8):395-8.
[Trends Genet. 2002]FEBS Lett. 1997 Sep 8; 414(2):430-4.
[FEBS Lett. 1997]Cancer Surv. 1996; 27():101-25.
[Cancer Surv. 1996]Proc Natl Acad Sci U S A. 1996 Sep 17; 93(19):10078-83.
[Proc Natl Acad Sci U S A. 1996]Eur J Biochem. 2000 Mar; 267(6):1583-8.
[Eur J Biochem. 2000]Oncogene. 1996 May 2; 12(9):1865-72.
[Oncogene. 1996]Biochem J. 2000 Oct 15; 351 Pt 2():289-305.
[Biochem J. 2000]Am J Physiol. 1998 Feb; 274(2 Pt 1):C531-42.
[Am J Physiol. 1998]Nature. 2000 Jul 13; 406(6792):188-92.
[Nature. 2000]Trends Genet. 2002 Aug; 18(8):395-8.
[Trends Genet. 2002]Nature. 2001 Mar 1; 410(6824):37-40.
[Nature. 2001]Oncogene. 1996 May 2; 12(9):1865-72.
[Oncogene. 1996]Science. 1998 May 8; 280(5365):895-8.
[Science. 1998]Curr Biol. 2001 Aug 7; 11(15):1176-82.
[Curr Biol. 2001]Eur J Biochem. 2000 Mar; 267(6):1583-8.
[Eur J Biochem. 2000]Trends Genet. 2002 Aug; 18(8):395-8.
[Trends Genet. 2002]Nature. 1999 Dec 2; 402(6761 Suppl):C47-52.
[Nature. 1999]Proc Natl Acad Sci U S A. 2000 May 9; 97(10):5031-3.
[Proc Natl Acad Sci U S A. 2000]Chaos. 2001 Mar; 11(1):108-114.
[Chaos. 2001]FEBS Lett. 1997 Sep 8; 414(2):430-4.
[FEBS Lett. 1997]