![]() | ![]() |
Formats: |
||||||
Copyright © 2008 HFSP Publishing Toward integration of in vivo molecular computing devices: successes and challenges 1Lehrstuhl für Computational Biology, Universität des Saarlandes, Gebäude C7.1, Saarbreucken, Saarland 66123, Germany 2Bio Systems Analysis Groups, Jena Centre for Bioinformatics and Friedrich-Schiller-Universität Jena, Ernst-Abbe-Platz 1–4, D-07743 Jena, Germany CORRESPONDENCE: Sikander Hayat: s.hayat@bioinformatik.uni-saarland.de Received July 17, 2008. Abstract The computing power unleashed by biomolecule based massively parallel computational units has been the focus of many interdisciplinary studies that couple state of the art ideas from mathematical logic, theoretical computer science, bioengineering, and nanotechnology to fulfill some computational task. The output can influence, for instance, release of a drug at a specific target, gene expression, cell population, or be a purely mathematical entity. Analysis of the results of several studies has led to the emergence of a general set of rules concerning the implementation and optimization of in vivo computational units. Taking two recent studies on in vivo computing as examples, we discuss the impact of mathematical modeling and simulation in the field of synthetic biology and on in vivo computing. The impact of the emergence of gene regulatory networks and the potential of proteins acting as “circuit wires” on the problem of interconnecting molecular computing device subunits is also highlighted. Should in vivo computing devices be envisioned as a replacement for the current state of the art silicon based computers? Since the inception of the first DNA based computing device by Leonard Adleman (Adleman, 1994) in 1994, many scientific investigations have been carried out, and it seems that when it comes to in vivo computing, “problem-specific” molecular computing devices (MCDs) take precedence over all purpose computing devices. Based on the environment in which the computation takes place, MCDs can be broadly classified into in vitro computers (mainly based on DNA, RNA, proteins, hybrid structures, or artificial chemistries) and in vivo computing devices. As described by Adleman, the MCDs that belong to the first category make use of in vitro replication of the DNA subunits while computational units that aim at harnessing the whole protein translational machinery of a living cell and employ gene regulation by proteins comprise the second category (Bogunia-Kubik and Sugisaka, 2002). Studies in the realm of MCDs have successfully demonstrated individual subunits that can compute both basic and moderately complex mathematical problems; however, the realization of the truly massively parallel MCD can only be possible when these individual subunits can be efficiently circuited together (Sprinzak and Elowitz, 2005; Simpson, 2004). Making proteins act as the information carrying “wire” in a circuit, recent studies (Benenson et al., 2004; Yaakov et al., 2001; Hinze et al., 2008) have brought forth the notion of implementing MCDs as a massively parallel and fully autonomous problem-specific automaton. For example, the autonomous system as described by Yaakov and co-workers (Yaakov et al., 2001) uses ATP, restriction nuclease, and ligase as the “hardware.” Double stranded DNA molecules act as the input, and the automaton processes the input molecule via a cascade of restriction, hybridization, and ligation cycles, producing a detectable output molecule that encodes the automaton’s final state and thus the computational result. The computing performance for an output resulting from five transitions was reported to be on the order of 109 transitions per second. Related work (Benenson et al., 2004) described a modular, robust, and flexible MCD capable of logical analysis of mRNA disease indicators in vitro and controlled administration of biologically active ssDNA molecules. The MCD was reported to operate at concentrations close to 1012 molecules per microliter. These and other studies in literature exemplify the emerging use of intelligent diagnostic computing devices in drug delivery, genetic engineering, and biochemical sensing (Rinaudo et al., 2007; Bogunia-Kubik and Sugisaka, 2002; McDaniel and Weiss, 2005). COMPUTING BOOLEAN AND ARITHMETIC FUNCTIONS In contrast to the in vitro computing devices discussed above, in vivo MCDs make use of the naturally occurring translational regulation mechanism of the host organism. Sincein vivo systems have to go through an additional step of protein translation, they are suggested to be implicitly slower than DNA based in vitro MCDs. However, the use of error correction mechanisms naturally implemented in the evolutionarily optimized transcription regulation machinery of living cells makes the overall computation more robust and hence justifies the trade-off with speed (Baker et al., 2006). A recent theoretical study conducted by Cory and Perkins (Cory and Perkins, 2008) has laid the focus on the use of a transcriptional regulatory mechanism to solve basic arithmetic operations. The study shows how different parametrizations of a simple chemical kinetic model of transcription regulation can give rise to these different operations. The accuracy of such theoretical arithmetic calculations based on the transcription regulatory mechanism is dependent on the kinetic parameters as well as the transcription factor concentrations. From today’s perspective, two implementation approaches for MCDs using gene regulation mechanisms have emerged: discrete and continuous devices. They reflect the encoding of processed information. In discrete devices, the domain of possible species concentration is divided into several layers separated by intermediate forbidden ranges. Each layer represents a digit, see Fig. Fig.1,1
Continuous MCDs as discussed in Cory and Perkins (Cory and Perkins, 2008) might be a clue to overcome insufficiencies of discrete devices. In electronic engineering, analogous computers exploit the continuous principle of operation for computing purposes. Historically, they became established for some niche applications such as functional differentiation or integration. Magnasco (Magnasco, 1997) and Deckart and Sauro (Deckart and Sauro, 2004) showed that the dynamical behavior of chemical reaction networks based on a kinetic model can perform computations in a continuous manner. Here, initial species concentrations represent the input data, and dedicated species concentrations reached within the steady state form the output. Particularly, the computation of arithmetic functions seems to be effectively feasible this way. The non-mass-conserving reaction [A] [B]−k2 [C] while catalyst concentrations [A] and [B] remain constant. In the steady state, the concentration [C] does not change anymore, such that d[C]/dt=0. Thus, [C]=[A] [B] holds for parametrization of kinetic reaction rates with k1=k2=1. Assuming a more complex reaction network incorporating several motifs, predetermined parts of the network can be deactivated by setting corresponding reaction rates ki=0 and activated (ki>0), respectively. This way, a reaction network becomes capable of performing multiple arithmetic operations. Cory and Perkins seized this idea to propose analogous computation of arithmetic functions by a continuous MCD. The selection of activated reactions controls the executed operation whose result (output species concentration) evolves asymptotically up to the steady state (see Fig. Fig.1,1TACKLING COMBINATORIAL SEARCH In a recently conducted experimental study, Haynes and co-workers (Haynes et al., 2008) used site-specific DNA recombination to solve a burnt-pancake problem (BPP) in vivo. The advantage that an in vivo system offers in such a case, where an exponential expansion in search space is known to occur, is that DNA replication and bacterial cell growth are inexpensive, feasible, and scalable. Haynes et al. draw parallels between the actual BPP and their in vivo computational units by suggesting the pancakes be represented by flippable DNA segments. A flip move is mediated by Hin DNA recombinase that regulates gene expression by switching the orientation of the promoter and the coding region while the two palindromic 26bp hix sequences flanking the invertible DNA segment act as recognition sites for cleavage and strand exchange. Cells with sorted DNA segments represent the final state. As a proof of concept, Haynes et al. designed a two-pancake BPP comprised of a lac promoter (pLac) and a tetracycline resistance coding region, each flanked by hixC sites. The reaction proceeds by cotransforming cells with a BPP plasmid containing a hixC-flanked RBS-tetA(C)rev coding region, a hixC-flanked pLac promoter [permutation (−2,1)], and a HinLVA expression plasmid. As described, the system reaches equilibrium state after more than 11 h, with most cells in the starting pancake arrangement while about one-third of the cells were detected to have undergone simultaneous inversion of both DNA segments. However, it was also found that due to general leakiness of the pLac promoter, HinLVA-mediated inversion did not seem to require induction of the pLac promoter on the HinLVA plasmid. Furthermore, cells with a strong pLac repressor (lacIQ) were found to be tetracycline resistant even without pLac activation. Cells were still found to be tetracycline resistant even when the pLac promoter was removed from the plasmid construct, leading to the conclusion that pLac is not necessary for expression in the pBR322-derived cloning vector. Tests with a pLac sensitive vector showed that the pLac promoter demonstrates both forward and backward transcription activity, hence rendering phenotypic characterization of the final state ambiguous. In addition, the experimental setup does not allow for quick reversal of states. However, by revealing the unexpected behavior of the pLac promoter, these results illustrate the interdependence of synthetic biology and in vivo computing. These results were confirmed by the accompanying mathematical simulations in which flipping was modeled as a Markov chain such that each of the possible permutations was considered as a state. The probability of a plasmid to be in the sorted order after k flips was determined by dividing the number of paths from the initial to the desired final state by the total number of paths of length k from the start to any state. The results from the mathematical simulations showed that 25% of the total plasmids attained the equilibrium between an intermediate state and the final state after five flips. The overall slow speed (time scale of hours to days rather than minutes) and the lack of robustness could be the bottleneck of such an approach that involves a combinatorial search space of the possible states and involves extensive site-specific DNA recombination. For these reasons, it seems that like most other state of the art MCDs, a more complicated network comprised of the basic subunits represented in the study would be difficult to achieve. However, the study provides experimental proof of employing DNA for computing in vivo. Furthermore, this approach gives a new perspective to MCDs, where each DNA segment itself can be considered as a state comprising a finite state automaton (FSA), and transition in states could be conceived as a flip in a DNA segment. The final state, defined by a phenotypic expression, is achieved when the automaton has flipped and oriented all the DNA segments in the desired order. So far biomolecular computational units have been demonstrated in vivo and in vitro (Kobayashi et al., 2004; Gardner et al., 2000; Hayat et al., 2006; Kari, 2001). Some of the key issues that still need to be handled are: noise propagation, interfacing output and input, and integrating basic subunits. Sustenance of a large population of cells, removal of intermediate and final products aiding a fast reversal of states are also problems that need to be looked into. Furthermore, the use of MCDs in drug targeting, where they can be programmed to sense the disease indicators and output the drug based on the computation module, makes it imperative to implement these biomolecular computational units in mammalian cells. Recently, Rinaudo et al. (Rinaudo et al., 2007) successfully implemented an RNAi based approach to Boolean logic evaluation based on molecular computing, wherein they transfected human embryonic kidney cells with evaluator circuits. An evaluator circuit as described by them consisted of mRNA with fused siRNA targets at the noncoding sites. Based on the presence or absence of complimentary siRNA and the linkage of the mRNAs, various Boolean operations can be performed. Such and other attempts at in vivo computing in mammalian cells (Kramer et al., 2004; An et al., 2006) could benefit from developments in the understanding of genetic circuits (Sprinzak and Elowitz, 2005; Simpson, 2004; McDaniel and Weiss, 2005; Kramer et al., 2004) and vice versa. CONCLUDING REMARKS In the last decade, independent in vivo biocomputational modules that can carry out basic arithmetic and logic operations have been successfully implemented, but the lack of standard protocols at fabricating these subunits is forcing their reimplementation and hence hampering research that needs to be focused on integrating the available MCD subunits. Generating libraries of compatible modules could be helpful in the fabrication of more complex MCDs (Baker et al., 2006). With more than 1000 modules (in 2006) in the BioBrick library, work is already underway toward curating a universal library of biocomputational modules. Furthermore, with emerging rules (Baker et al., 2006; Andrianantoandro et al., 2006), a bottom up approach of implementing a problem-specific rather than an “all purpose” in vivo MCD at the forefront of the recent studies in the field of in vivo computing and with the recent developments in the use of mammalian cells as precise and robust computational units, the next crucial task of interconnecting (Simpson, 2004) and optimizing these subunits will be instrumental in applying MCDs as intelligent drug delivery systems and biochemical sensors. ACKNOWLEDGMENTS We thank Volkhard Helms from Saarland University and Peter Dittrich from Jena University for administrative support. References
|
PubMed related articles
Your browsing activity is empty. Activity recording is turned off. |
|||||
Science. 1994 Nov 11; 266(5187):1021-4.
[Science. 1994]Biosystems. 2002 Mar-May; 65(2-3):123-38.
[Biosystems. 2002]Nature. 2005 Nov 24; 438(7067):443-8.
[Nature. 2005]Trends Biotechnol. 2004 Nov; 22(11):555-7.
[Trends Biotechnol. 2004]Nature. 2004 May 27; 429(6990):423-9.
[Nature. 2004]Nat Biotechnol. 2007 Jul; 25(7):795-801.
[Nat Biotechnol. 2007]Biosystems. 2002 Mar-May; 65(2-3):123-38.
[Biosystems. 2002]Sci Am. 2006 Jun; 294(6):44-51.
[Sci Am. 2006]Nature. 2000 Jan 20; 403(6767):339-42.
[Nature. 2000]Science. 2002 May 24; 296(5572):1466-70.
[Science. 2002]Nature. 2000 Jan 20; 403(6767):335-8.
[Nature. 2000]Chembiochem. 2004 Oct 4; 5(10):1423-31.
[Chembiochem. 2004]J Biol Eng. 2008 May 20; 2():8.
[J Biol Eng. 2008]Proc Natl Acad Sci U S A. 2004 Jun 1; 101(22):8414-9.
[Proc Natl Acad Sci U S A. 2004]Nature. 2000 Jan 20; 403(6767):339-42.
[Nature. 2000]Nat Biotechnol. 2007 Jul; 25(7):795-801.
[Nat Biotechnol. 2007]Nat Biotechnol. 2004 Jul; 22(7):867-70.
[Nat Biotechnol. 2004]RNA. 2006 May; 12(5):710-6.
[RNA. 2006]Science. 2002 May 24; 296(5572):1466-70.
[Science. 2002]Sci Am. 2006 Jun; 294(6):44-51.
[Sci Am. 2006]Trends Biotechnol. 2004 Nov; 22(11):555-7.
[Trends Biotechnol. 2004]