![]() | ![]() |
Formats:
|
||||||||||||||||
Copyright © 2006, Cold Spring Harbor Laboratory Press Integrated analysis of regulatory and metabolic networks reveals novel regulatory mechanisms in Saccharomyces cerevisiae Department of Bioengineering, University of California, San Diego, La Jolla, California 92093-0412, USA 1Present address: Institute for Advanced Biosciences, Keio University, Tsuruoka, Yamagata 997-0017, Japan. 2Corresponding author. E-mail palsson/at/ucsd.edu; fax (858) 822-3120. Received April 30, 2005; Accepted February 23, 2006. This article has been cited by other articles in PMC.Abstract We describe the use of model-driven analysis of multiple data types relevant to transcriptional regulation of metabolism to discover novel regulatory mechanisms in Saccharomyces cerevisiae. We have reconstructed the nutrient-controlled transcriptional regulatory network controlling metabolism in S. cerevisiae consisting of 55 transcription factors regulating 750 metabolic genes, based on information in the primary literature. This reconstructed regulatory network coupled with an existing genome-scale metabolic network model allows in silico prediction of growth phenotypes of regulatory gene deletions as well as gene expression profiles. We compared model predictions of gene expression changes in response to genetic and environmental perturbations to experimental data to identify potential novel targets for transcription factors. We then identified regulatory cascades connecting transcription factors to the potential targets through a systematic model expansion strategy using published genome-wide chromatin immunoprecipitation and binding-site-motif data sets. Finally, we show the ability of an integrated metabolic and regulatory network model to predict growth phenotypes of transcription factor knockout strains. These studies illustrate the potential of model-driven data integration to systematically discover novel components and interactions in regulatory and metabolic networks in eukaryotic cells. Current biological research produces increasing volumes of data describing cellular components, their interactions, and states of biological networks for model organisms including the budding yeast, Saccharomyces cerevisiae. Component and interaction data including genome sequences (Bussey et al. 1997), protein complexes (Gavin et al. 2006), and protein–DNA interactions (Harbison et al. 2004) can be used to establish the connectivity of the biochemical networks inside the cell. System-state data types including gene expression (DeRisi et al. 1997), metabolite level (Villas-Boas et al. 2005), metabolic flux (Sauer 2004), and high-throughput deletion strain phenotyping (Giaever et al. 2002) data represent the states and outputs of these networks. Connecting large-scale component and interaction information to data on system states in order to facilitate the interpretation of both data types is a major challenge in systems biology. The data integration and interpretation task is made challenging by the incompleteness and noisiness of large-scale data sets (Grunenfelder and Winzeler 2002). Given these issues with large-scale data sets, systematic inclusion of literature-derived information on network structures into the analysis represents an appealing alternative to purely data-driven approaches. The widespread availability of component and biochemical interaction information in the primary literature has enabled the reconstruction of chemically and biologically consistent mathematical descriptions of biochemical networks in well-studied model organisms (Herrgård et al. 2004; Price et al. 2004). These network models can then be used to predict changes in system states in response to genetic and environmental perturbations. Furthermore, model predictions can be directly compared with experimental data obtained, for example, by metabolic flux or gene expression profiling (Covert et al. 2004; Price et al. 2004). As a result of these comparisons, modifications to the biochemical network model that would improve its ability to predict system states can be identified to iteratively improve the model. In the case of metabolic networks, the network reconstruction step can now be routinely done and has been accomplished for a number of key model organisms including Escherichia coli (Reed et al. 2003) and Saccharomyces cerevisiae (Famili et al. 2003; Duarte et al. 2004). Using a reconstructed genome-scale stoichiometric matrix as a starting point, the constraint-based modeling framework (Price et al. 2004) can then be used to make phenotypic predictions that can be compared to experimental data. Frequently used constraint-based approaches include flux-balance analysis (FBA) that allows the prediction of flux distributions and growth rates based on known metabolic network stoichiometry, growth media composition, and biomass composition (Kauffman et al. 2003). The constraint-based framework has been extended to account for transcriptional regulation of metabolism in the form of the regulated flux-balance analysis (rFBA) approach (Covert et al. 2001). This approach requires first reconstructing the transcriptional regulatory network structure in the form of Boolean rules based on the various available data sources and then integrating this network with the metabolic network model. So far the rFBA approach has been applied to make predictions of growth phenotypes and qualitative gene expression changes in E. coli (Covert and Palsson 2002; Covert et al. 2004) Here we present the first large-scale integrated model of metabolism and its transcriptional regulation in a eukaryotic organism, S. cerevisiae iMH805/775.3 The current model includes the majority of known nutrient-controlled transcriptional regulation (55 transcription factors) of metabolic pathways (750 genes in total) represented in a Boolean rule-based formalism (Covert et al. 2001). The regulatory network model is combined with an existing genomescale metabolic model, iND750 (Duarte et al. 2004). We demonstrate three major applications of the iMH805/775 model using the rFBA approach to predict gene expression changes and growth phenotypes of knockout (KO) strains. First, we analyze previously published genome-scale gene expression and protein–DNA interaction data sets in a model-driven fashion. Second, we demonstrate the ability to iteratively expand an existing model based on comparisons of model predictions with experimental data. This iterative model building strategy allows identification of new interactions supported by multiple experimental data types and inclusion of these interactions into the regulatory network model. Third, we show how an integrated metabolic/regulatory network model can be used to analyze diverse data types such as phenotyping and gene expression data simultaneously within the same modeling framework. The overall approach used for model-based integration of diverse data sets is schematically represented in Figure Figure11
Results Reconstructed transcriptional regulatory network Using the previously described genome-scale compartmentalized metabolic model of S. cerevisiae, iND750 (Duarte et al. 2004), as a starting point, we reconstructed the transcriptional regulatory network regulating all of the metabolic subsystems included in iND750. The resulting integrated metabolic/regulatory network, iMH805/775, accounts for 805 genes and 775 regulatory interactions. The network consists of the 750 metabolic genes in iND750 and 55 specific nutrient-regulated transcription factors (TFs). The model allows 82 distinct intra- and extracellular metabolites to act as input signals to the regulatory network. The direct and indirect regulatory interactions for each gene were collected from primary literature, and they are based on detailed studies of transcriptional regulation of individual genes. In addition to representing these regulatory interactions, iMH805/775 also includes rules describing the mode of combinatorial control by different TFs at each promoter. This logic-based representation allows in silico prediction of gene expression changes in response to environmental and genetic perturbations and integration of the regulatory network to the metabolic network model as described previously (Covert and Palsson 2002; Covert et al. 2004). The Supplemental material contains all the interactions included in the model, the Boolean rules describing the expression of each target gene, and literature references. Prediction of gene expression changes In order to assess the completeness of the iMH805/775 network described above, we compared in silico gene expression change predictions to experimentally measured expression profiles as well as experimentally determined protein–DNA interactions (chromatin immunoprecipitation followed by microarray analysis, or ChIP-chip) and predicted TF-binding motifs (Harbison et al. 2004). We used gene expression data for eight transcription factor knockout strains (rgt1, rox1, gat1, hap1, adr1, gal4, gln3, cat8) and two overexpression (HAP4, GCN4) strains from previously published reports (Ideker et al. 2001; Natarajan et al. 2001; Ter Linde and Steensma 2002; Lascaris et al. 2003; Young et al. 2003; Kaniak et al. 2004). Each of the 750 metabolic genes in iMH805/775 was classified as significantly up-regulated, significantly down-regulated, or unchanged in each of the 10 experimental data sets. The in silico simulations for the gene expression predictions were done using in silico environments that matched the experimental conditions for each individual expression data set. The ChIP-chip and motif data were merged to obtain a set of potential direct target genes for each of the 10 transcription factors listed above. The results for the comparison between iMH805/775 predictions, experimentally observed gene expression changes, and experimentally determined protein–DNA interactions are summarized in Figure Figure2.2
As Figure Figure2A2A Systematic expansion of the regulatory network Based on the results presented above, it is clear that while the model makes relatively few gene expression change predictions that were not supported by either gene expression or ChIP-chip data (28 cases overall), for many metabolic genes, iMH805/775 predicts no expression change even in cases in which such changes are observed in vivo (a total of 498 cases across all 10 strains). We next sought to improve the predictive ability of the in silico model by using ChIP-chip and TF-binding motif data (Harbison et al. 2004) to systematically expand the regulatory network part of the model. First, the gene expression comparison presented above was used to identify potential novel candidate target genes for each of the 10 transcription factors (498 TF-target pairs total). Next, we traced paths through an expanded regulatory network that consisted of the iMH805/775 network and a provisional regulatory network that can be established based on combining ChIP-chip data and TF-binding motif data (Harbison et al. 2004). We used a systematic search algorithm (see Methods for details) to find the shortest possible path from a particular TF to a particular target gene through this combined network. We investigated three different network expansion scenarios using the search algorithm. In the first scenario, we only used ChIP-chip and motif data for the 55 TFs already included in the model. We also required that the direction of the expression change for each target gene would be correctly explained as a combination of the regulatory interactions along the path when each TF was considered to be either a repressor or activator depending on its known activity. In the second scenario, we used the same set of ChIP-chip and motif data sets, but we relaxed the assumption that specific transcription factors can only act as activators or repressors. In the third scenario, we included ChIP-chip and motif data for all 203 TFs studied in Harbison et al. (2004), again allowing each TF to act either as a repressor or activator. The three panels in Figure Figure33
The regulatory interactions that were identified by the first expansion scenario described above were used as a basis for building an improved version of the regulatory network. We analyzed manually each of the suggested regulatory interactions involving the 55 TFs in the original iMH805/775 model and included the interactions that did not cause conflicts with existing regulatory rules to form an expanded model, iMH805/837. In addition, minor changes to existing rules in the model were made to remove mispredictions of gene expression changes due to incorrectly represented combinatorial effects, especially for the rgt1 strain. In order to provide an independent test set to assess the relative performance of iMH805/775 and iMH805/837, we turned to prediction of gene expression changes in wild-type yeast strains in response to five different environmental perturbations (Gasch et al. 2000; Kwast et al. 2002). As Table 2 shows, overall the overlap between the observed and predicted expression changes for genes that change in expression either in silico or in vivo was significant in each of the five conditions (P < 0.01; Fisher’s exact test). The iMH805/837 model showed improved predictive ability over the iMH805/775 model in four out of the five conditions studied.
Prediction of growth phenotypes In order to provide data for an evaluation of the predictive power of the integrated regulatory and metabolic network model iMH805/837, we performed quantitative growth phenotyping experiments on 12 different carbon sources with 10 TF deletion strains. All these TFs were included in the model and were known to be involved in carbon-source-dependent transcriptional regulation or otherwise play a role as key regulators in carbon source metabolism. The 12 carbon sources were chosen to represent both fermentable and nonfermentable carbon sources. We used an optimization-based approach (see Methods) to compare growth phenotype predictions to experimental data. This approach allowed us to systematically identify which of the growth phenotypes are unexpected considering the current understanding of the regulatory and metabolic networks in S. cerevisiae as represented by iMH805/837. The predicted growth rates are shown in Figure Figure4A4A
The 13 growth rate overpredictions were assumed to correspond to missing regulatory effects in the model, and hence we sought to identify potential novel regulatory mechanisms that would improve model predictions. We identified metabolic genes whose deletion in iMH805/837 would result in a reduced in silico growth rate prediction compared to the wild-type strain specifically on each carbon source. These genes were then considered to be direct or indirect candidate target genes for the transcription factors whose knockout growth phenotype on the carbon source was incorrectly predicted. Next we used gene expression data sets for TF deletion strains for which such data were available (cat8 and rgt1 strains) to further narrow down the set of potential targets by only including genes that were down-regulated in the TF KO strain in vivo. Specific new direct or indirect target genes that resulted in improved growth rate predictions were identified in five of the 13 cases using this approach. After including the suggested new regulatory effects in iMH805/837, only eight significant mispredictions remained as is shown in Figure Figure4B,4B Discussion We presented a literature-based reconstruction of the known nutrient-controlled transcriptional regulatory network regulating metabolism in S. cerevisiae and the integration of this regulatory network with the genome-scale metabolic network model. We applied previously developed in silico methods for combined simulation of regulatory and metabolic network models to predict expression profiles and growth phenotypes. Comparison of these predictions with large-scale experimental data allowed identifying regulatory mechanisms missing from the model and expansion of the model by using existing ChIP-chip and promoter motif data. A similar approach for integrated modeling was first applied to genome-scale regulatory and metabolic networks in E. coli (Covert and Palsson 2002; Covert et al. 2004), but in yeast, the availability of ChIP-chip data allows integrating this key data type into the network expansion approach. Similarly to the case of the initial E. coli regulatory network reconstruction (Covert et al. 2004), the overall agreement between in silico gene expression predictions by iMH805/775 and experimental data was found to be relatively low. In particular, a large number of the experimentally observed expression changes were not predicted by iMH805/775. However, the agreement between model predictions and the experimental data sets was significantly better in cases in which the gene expression and ChIP-chip data sets were in good agreement with each other (Fig. (Fig.2).2 Using ChIP-chip and motif data sets (Harbison et al. 2004) to systematically expand the iMH805/775 model allowed explaining one-third of the gene expression prediction discrepancies through hypothetical regulatory cascades involving the 55 TFs in the model (Fig. (Fig.3A).3A The set of new regulatory interactions suggested by the network expansion approach was used to develop an improved metabolic/regulatory network model, iMH805/837. The predictions of growth rates of TF deletion strains made by the iMH805/837 model were in good agreement with experimentally measured growth rates for most TF/carbon source combinations. The discrepancies in growth phenotype predictions were analyzed using a model-based approach that allowed the identification of potential new regulatory mechanisms mediating carbon source utilization that are discussed in detail in the Supplemental material. The changes to the model identified through this process were modifications of regulatory mechanisms for specific genes involved in utilization of particular carbon sources such as PGM2 (galactose) and HXT1 (mannose). Conclusions We have described the reconstruction of the nutrient-controlled transcriptional regulatory network regulating metabolism in S. cerevisiae and evaluation of the predictive power of an integrated metabolic/regulatory network. We compared model predictions to experimental gene expression data and identified a significant number of expression changes that were not predicted by the model. A combination of ChIP-chip and binding-site motif data was used to expand the regulatory network model so that its ability to predict expression changes was improved in independent test cases. This network expansion also suggested several novel connections between nutrient regulation of metabolism and other regulatory subsystems that require further validation through detailed experiments. It was found that the integrated metabolic/regulatory network model could be used to predict growth phenotypes, and the discrepancies in these predictions could be used to direct the search for novel regulatory mechanisms. The present work shows how a systematic approach can be used to fill in missing regulatory mechanisms through the combined use of an integrated model of regulation and metabolism and existing large-scale experimental data sets. In the future, the combination of more targeted expression profiling, ChIP-chip, growth phenotyping, and metabolic flux profiling (Blank et al. 2005) under many different environmental conditions will allow systematic iterative model building. With increasing knowledge of specific regulatory mechanisms in higher eukaryotes (Picard et al. 2004; Wolfrum et al. 2004), the approaches used in this work can be readily extended to studying the network-level interplay between regulation and metabolism in these organisms. Methods Model reconstruction The relevant literature for each metabolic (Duarte et al. 2004) and transcription factor gene was collected through information in the SGD, YPD, and MIPS databases (Costanzo et al. 2001; Christie et al. 2004; Mewes et al. 2004) and direct PubMed searches. The regulatory network model part of iMH805/775 consists of three layers describing the effect of extracellular and intracellular signals on the expression of metabolic genes and the maximum allowed flux through particular reactions in the network. The first layer describes the activities of 55 TFs in response to 67 extracellular and 15 intracellular metabolite concentrations. The second layer consists of the rules describing the expression of 348 metabolic genes as a function of the transcription factor states and metabolite concentrations in cases in which the direct regulatory mechanisms were unknown. For the remaining metabolic genes, no information on regulation could be found in the literature, and they were assumed to be constitutively expressed in all environmental conditions. The third layer contains the gene–protein–reaction associations that encode the relationship between gene expression and presence/absence of a particular reaction in the network (Duarte et al. 2004). All the three layers were implemented as Boolean rules derived from primary literature, and these rules are listed in the Supplemental material. Regulated flux balance analysis The constraint-based modeling approach for genome-scale metabolic networks has been extensively reviewed elsewhere (Price et al. 2004) and will not be described in full detail here. The metabolic network model consists of the stoichiometric matrix describing all the interconversions between metabolites in the network, maximum flux capacity constraints on all reactions, and a cellular objective function to be maximized (based on the estimated biomass composition). The integration of the metabolic network analysis and the transcriptional regulatory network model described above is accomplished using the regulated flux balance analysis approach (Covert et al. 2001). In this approach, the three layers of rules detailed above are evaluated given an initial condition (extracellular concentrations and intracellular fluxes), and the state of each reaction in the metabolic model is calculated. The maximum fluxes through the reactions that are determined to be down-regulated are then set to a predetermined fraction of the unregulated maximum flux. Given the stoichiometric matrix, an objective function, and the new maximum flux capacities, a particular flux distribution can be calculated using linear optimization. The resulting flux distribution is then, in turn, used to calculate the changes in external concentrations, and these concentrations as well as the internal fluxes are used to evaluate the rules at the next time step to obtain the gene expression state (see Supplemental material for details). Model parameter estimation There are several parameters such as maximum substrate and oxygen uptake rates that affect growth rate predictions by the metabolic/regulatory network mode. For the growth phenotyping study, these parameters were not experimentally measured so that they had to be estimated computationally. The particular estimation strategy used in this work is based on genetic algorithms that allow optimization of a nonlinear objective function with nonlinear constraints (Goldberg 1989). In our application, the objective was to minimize the summed absolute discrepancies between predicted and experimentally measured growth rates weighted by the experimental standard deviations of the growth rates under the 12 growth conditions for the 11 strains. The Supplemental material contains details of the parameter estimation approach and sensitivity analysis of the growth rate predictions. Gene expression data comparisons The individual gene expression data sets for the transcription factor knockout or overexpression strains and wild-type condition shifts were obtained directly from the Supplemental Web sites for individual papers (Gasch et al. 2000; Ideker et al. 2001; Natarajan et al. 2001; Kwast et al. 2002; Lin et al. 2002; Ter Linde and Steensma 2002; Young et al. 2003; Kaniak et al. 2004). All the data sets were first converted into log ratios between gene expression levels in the knockout or overexpression strain and in the wild-type strain or between two conditions. The criteria that were used to find significantly up/down-regulated genes in each gene expression data set, and the conditions used for the in silico simulations of the TF knockout effects on gene expression are listed in the Supplemental material. Network expansion The network expansion approach used in this work is purely graph-based, that is, the regulatory network was represented as a directed graph with labeled edges indicating either repression or activation of a target gene by a TF or a regulatory signal. The provisional network derived from ChIP-chip and motif data (Harbison et al. 2004) as described in the Supplemental material was combined with the regulatory network in iMH805/775, and each of the edges was labeled as either repressing or activating depending on the known primary function of the TF involved in the interaction. Next, a breadth first search was used to find all directed noncyclic paths up to a certain number of steps in the combined network that connect a transcription factor to a particular potential target gene. In the first expansion scenario discussed in the Results section, only paths that would result in the correct sign for the expression change of the target gene in the TF KO or overexpression strain were considered further. In the other two expansion scenarios, this path coherence criterion was not used. In order to establish the final expanded model, the interactions in the shortest coherent path for each TF–target pair were included in a provisional regulatory network model. The suggested new interactions were then integrated with existing rules by manual inspection. The predictions of expression changes in all modified strains were then re-run with the new model, and interactions that reduced the overall predictive power of the model were removed from the model. All the computations presented in this paper were done using Matlab 6.5 (Mathworks Inc.), and the FBA problems were solved using the Lindo linear programming solver (Lindo Systems Inc.). Experimental protocols The wild-type S. cerevisiae haploid strain used in this study, BY4741 (MATa his3Δ1 leu2Δ0 met15Δ0 ura3Δ0), was obtained from ATCC. All the single transcription factor knockout strains were from the S. cerevisiae gene deletion collection (Winzeler et al. 1999) and were generously provided by T. Ideker. Growth on different carbon sources was evaluated by using the Bioscreen C system (Thermo Labsystems). Minimal medium (Van Hoek et al. 1998) with the appropriate auxotrophic supplements added was used to test growth on 12 different carbon sources (glucose, galactose, fructose, sucrose, mannose, ribose, lactate, pyruvate, ethanol, raffinose, glycerol) for each of the knockout strains as well as the wild-type strain. Maximum specific growth rates were determined for each strain–carbon source combination based on five independent measurements as described in the Supplemental material. Acknowledgments We thank Trey Ideker, Bing Ren, Shankar Subramaniam, George Santangelo, and Timothy Hughes for valuable discussions. We also thank Iman Famili, Natalie Duarte, and Monica Mo for help with the yeast metabolic model and Thuy Vo for critical comments on the manuscript. Support for this work was provided by the National Institutes of Health (RO1 GM071808) and National Science Foundation (BES-0331342). Footnotes [Supplemental material is available online at www.genome.org.] Article published online before print. Article and publication date are at http://www.genome.org/cgi/doi/10.1101/gr.4083206 3See Reed et al. (2003) for conventions used for naming in silico strains. MH refers to the principal person responsible for reconstructing the model, 805 is the number of genes accounted for by the model, and 775 is the number of regulatory interactions in the model. References
|
PubMed related articles
Your browsing activity is empty. Activity recording is turned off. |
|||||||||||||||
Nature. 1997 May 29; 387(6632 Suppl):103-5.
[Nature. 1997]Nature. 2004 Sep 2; 431(7004):99-104.
[Nature. 2004]Science. 1997 Oct 24; 278(5338):680-6.
[Science. 1997]Biochem J. 2005 Jun 1; 388(Pt 2):669-77.
[Biochem J. 2005]Curr Opin Biotechnol. 2004 Feb; 15(1):58-63.
[Curr Opin Biotechnol. 2004]Curr Opin Biotechnol. 2004 Feb; 15(1):70-7.
[Curr Opin Biotechnol. 2004]Nat Rev Microbiol. 2004 Nov; 2(11):886-97.
[Nat Rev Microbiol. 2004]Nature. 2004 May 6; 429(6987):92-6.
[Nature. 2004]Genome Biol. 2003; 4(9):R54.
[Genome Biol. 2003]Proc Natl Acad Sci U S A. 2003 Nov 11; 100(23):13134-9.
[Proc Natl Acad Sci U S A. 2003]Genome Res. 2004 Jul; 14(7):1298-309.
[Genome Res. 2004]Nat Rev Microbiol. 2004 Nov; 2(11):886-97.
[Nat Rev Microbiol. 2004]Curr Opin Biotechnol. 2003 Oct; 14(5):491-6.
[Curr Opin Biotechnol. 2003]J Theor Biol. 2001 Nov 7; 213(1):73-88.
[J Theor Biol. 2001]Genome Res. 2004 Jul; 14(7):1298-309.
[Genome Res. 2004]Genome Res. 2004 Jul; 14(7):1298-309.
[Genome Res. 2004]J Biol Chem. 2002 Aug 2; 277(31):28058-64.
[J Biol Chem. 2002]Nature. 2004 May 6; 429(6987):92-6.
[Nature. 2004]Nature. 2004 Sep 2; 431(7004):99-104.
[Nature. 2004]Science. 2001 May 4; 292(5518):929-34.
[Science. 2001]Mol Cell Biol. 2001 Jul; 21(13):4347-68.
[Mol Cell Biol. 2001]Genome Biol. 2003; 4(1):R3.
[Genome Biol. 2003]J Biol Chem. 2003 Jul 11; 278(28):26146-58.
[J Biol Chem. 2003]Nature. 2004 Sep 2; 431(7004):99-104.
[Nature. 2004]Nature. 2004 Sep 2; 431(7004):99-104.
[Nature. 2004]Mol Biol Cell. 2000 Dec; 11(12):4241-57.
[Mol Biol Cell. 2000]J Bacteriol. 2002 Jan; 184(1):250-65.
[J Bacteriol. 2002]J Biol Chem. 2002 Aug 2; 277(31):28058-64.
[J Biol Chem. 2002]Nature. 2004 May 6; 429(6987):92-6.
[Nature. 2004]Nature. 2004 May 6; 429(6987):92-6.
[Nature. 2004]FEMS Microbiol Rev. 2002 Aug; 26(3):223-38.
[FEMS Microbiol Rev. 2002]Gene. 2002 May 15; 290(1-2):1-18.
[Gene. 2002]Nature. 2004 Sep 2; 431(7004):99-104.
[Nature. 2004]Nat Genet. 2001 Aug; 28(4):327-34.
[Nat Genet. 2001]Cell. 2002 Feb 22; 108(4):475-87.
[Cell. 2002]Genome Biol. 2005; 6(6):R49.
[Genome Biol. 2005]Nature. 2004 Jun 17; 429(6993):771-6.
[Nature. 2004]Nature. 2004 Dec 23; 432(7020):1027-32.
[Nature. 2004]Genome Res. 2004 Jul; 14(7):1298-309.
[Genome Res. 2004]Nucleic Acids Res. 2001 Jan 1; 29(1):75-9.
[Nucleic Acids Res. 2001]Nucleic Acids Res. 2004 Jan 1; 32(Database issue):D311-4.
[Nucleic Acids Res. 2004]Nucleic Acids Res. 2004 Jan 1; 32(Database issue):D41-4.
[Nucleic Acids Res. 2004]Nat Rev Microbiol. 2004 Nov; 2(11):886-97.
[Nat Rev Microbiol. 2004]J Theor Biol. 2001 Nov 7; 213(1):73-88.
[J Theor Biol. 2001]Mol Biol Cell. 2000 Dec; 11(12):4241-57.
[Mol Biol Cell. 2000]Science. 2001 May 4; 292(5518):929-34.
[Science. 2001]Mol Cell Biol. 2001 Jul; 21(13):4347-68.
[Mol Cell Biol. 2001]J Bacteriol. 2002 Jan; 184(1):250-65.
[J Bacteriol. 2002]Nature. 2002 Jul 18; 418(6895):344-8.
[Nature. 2002]Nature. 2004 Sep 2; 431(7004):99-104.
[Nature. 2004]Science. 1999 Aug 6; 285(5429):901-6.
[Science. 1999]Appl Environ Microbiol. 1998 Nov; 64(11):4226-33.
[Appl Environ Microbiol. 1998]Genome Biol. 2003; 4(9):R54.
[Genome Biol. 2003]Nature. 2004 Sep 2; 431(7004):99-104.
[Nature. 2004]Nature. 2004 Sep 2; 431(7004):99-104.
[Nature. 2004]Nature. 2004 Sep 2; 431(7004):99-104.
[Nature. 2004]