![]() | ![]() |
Formats:
|
||||||||||||||||||||||||
Copyright © 2007, Cold Spring Harbor Laboratory Press The anatomy of microbial cell state transitions in response to oxygen 1 Institute for Systems Biology, Seattle, Washington 98103, USA; 2 Divisions of Human Biology and Clinical Research, Fred Hutchinson Cancer Research Center, Seattle, Washington 98109-1024, USA 3Corresponding author.E-mail nbaliga/at/systemsbiology.org; fax (206) 732-1299. Received May 23, 2007; Accepted July 30, 2007. This article has been cited by other articles in PMC.Abstract Adjustment of physiology in response to changes in oxygen availability is critical for the survival of all organisms. However, the chronology of events and the regulatory processes that determine how and when changes in environmental oxygen tension result in an appropriate cellular response is not well understood at a systems level. Therefore, transcriptome, proteome, ATP, and growth changes were analyzed in a halophilic archaeon to generate a temporal model that describes the cellular events that drive the transition between the organism’s two opposing cell states of anoxic quiescence and aerobic growth. According to this model, upon oxygen influx, an initial burst of protein synthesis precedes ATP and transcription induction, rapidly driving the cell out of anoxic quiescence, culminating in the resumption of growth. This model also suggests that quiescent cells appear to remain actively poised for energy production from a variety of different sources. Dynamic temporal analysis of relationships between transcription and translation of key genes suggests several important mechanisms for cellular sustenance under anoxia as well as specific instances of post-transcriptional regulation. Adaptation to varying levels of oxygen is critical for the survival of all organisms since this element is required for energy production in aerobic organisms, but is a dangerous poison for obligate anaerobes. Thus, diverse strategies have evolved for optimizing fitness under conditions of fluctuating oxygen availability. For example, anaerobic microbes have evolved specialized anoxic physiologies, including mechanisms to exclude and scavenge traces of oxygen (Imlay 2002). In contrast, facultative anaerobes such as Escherichia coli flexibly transition between oxidative metabolism and anaerobic growth, using alternate respiratory enzymes when oxygen becomes limiting (Nakano and Zuber 1998). Anoxia-tolerant eukaryotes such as Caenorhabditis elegans enter a state of suspended animation in which energy supply and demand are drastically reduced in a regulated manner during oxygen starvation (Hochachka et al. 1996). Understanding cellular responses to oxygen at the molecular systems level requires comprehensive and quantitative measurements of changes in parameters such as transcription, translation, and metabolism. Transcriptome measurements are quite comprehensive (Lander 1999), whereas current technology limits the detection of the complete microbial proteome and metabolome; e.g., the highest reported coverage for microbial shotgun proteomics is 60% (Lipton et al. 2002; Brauer et al. 2006). Furthermore, in addition to this disparity in technical tractability, the dynamic nature of information processing at all of these levels further complicates the collective comparative analysis of global changes in transcriptome, proteome, and metabolome (Gygi et al. 1999; Ideker et al. 2001; Beyer et al. 2004). Consequently, the global dynamic relationships across these distinct but interconnected processes remain to be characterized to build a physiological model of systems behavior. We chose the haloarchaeon Halobacterium salinarum NRC-1 as a model organism to investigate the systems-level oxygen response. This organism, found in the Great Salt Lake, the Dead Sea, and other waters with high salt concentration, requires an environment with a high concentration of salt for survival (~4.0 M) (Robb et al. 1995). Our choice of this organism was guided by (1) the relative simplicity afforded by the small genome size (2.6 Mb) and lack of compartmentalization of prokaryotes, and (2) H. salinarum’s capability to effect metabolic changes within a remarkably narrow range of oxygen availability. Rapid shifts to low environmental oxygen tension is a frequent challenge to H. salinarum, existing in an environment nearly saturated with salt, where consumption resulting from high cell density and evaporation can rapidly lower oxygen tension below 5 μM (Robb et al. 1995) (Supplemental Methods). By comparison, most microbes (e.g., E. coli) exist in environments with oxygen saturation of ~250 μM, thus the niche of the halophile would lead to rapid and severe cellular hypoxia for these organisms (Supplemental Methods). H. salinarum NRC-1 utilizes metabolic strategies similar to other facultative anaerobic microbes such as E. coli to alternate between four modes within a narrow range (0–5 μM) of oxygen concentration: (1) aerobic respiration via the tricarboxylic acid (TCA) cycle (Ng et al. 2000); (2) anaerobic fermentation via the arginine deiminase (ADI) pathway (Hartmann et al. 1980; Ruepp and Soppa 1996; Baliga et al. 2002); (3) anaerobic dimethyl sulfoxide (DMSO) and trimethylamine N-oxide (TMAO) reduction (Muller and DasSarma 2005); and (4) anaerobic energy production via bacteriorhodopsin-mediated phototrophy (Oesterhelt and Krippahl 1983; Gropp and Betlach 1994; Baliga et al. 2002). Despite good descriptive knowledge of these four modes, the response and regulation of other aspects of halobacterial physiology during fluctuations in oxygen tension remains largely uncharacterized. Here we report a temporal model of key cellular events from environmental perturbation (input) to cellular phenotype (output) by simultaneously measuring changes in the transcriptome (61 time points), proteome (28 time points), physiological growth (61 time points), and metabolic outputs (18 time points) during controlled cellular transitions between oxic and anoxic environments. Our findings suggest that H. salinarum NRC-1 shifts from a state of anoxic quiescence to active growth when the oxygen supply is replenished. During quiescence, the organism appears to remain poised for a rapid transition to alternative metabolic states. We were able to significantly improve the concordance between changes in transcription and translation when a time lag was considered during data analysis. In addition, this analysis suggested several possible post-transcriptional strategies enabling adaptation to changes in oxygen. From this standpoint, the dynamic temporal model of H. salinarum NRC-1 has shed new insights into general principles of the oxygen response. Results and Discussion Experimental design and rationale Cellular responses to changes in the environment require coordinated signal processing and other physiological adjustments at the transcriptional, translational, and metabolic levels. Therefore, to capture a systems perspective of cellular responses to oxygen, global changes in relative abundance of transcripts, proteins, ATP, and growth were measured in continuous chemostat cultures. In the chemostat, pH, cell density, light, and temperature were kept constant, whereas oxygen was perturbed in a controlled manner (Fig. 1
Physiological characterization of the oxygen response Characterization of transitions in cell states: Anoxic quiescence to aerobic growth Previous studies have shown that ATP production and growth of H. salinarum NRC-1 under anoxic conditions requires the addition of an anaerobic growth substrate; e.g., orange light at 459 μmol/m2sec, 1% arginine, or DMSO (Hartmann et al. 1980; Oesterhelt and Krippahl 1983; Muller and DasSarma 2005). However, in the natural environment, the organism will likely also encounter conditions with such defined substrates absent during frequent fluctuations in oxygen tension. We therefore sought to characterize cell physiology under these conditions. During a rapid transition from an oxic to anoxic environment (5–20 min), intracellular ATP levels dropped 1.8-fold in 6 h and fivefold within 24 h to plateau at a minimum of ~0.6 ± 0.1 μM (Table 1, third experiment), approximately half that observed for flask-grown stationary phase H. salinarum NRC-1 cells (0.96 ± 0.08 μM) (Hartmann et al. 1980). Growth ceased 30–40 min after anoxia was induced, as evidenced by the fact that flow rate was no longer required to maintain a constant cell density in the chemostat (Table 1). Also, once the anoxic ATP concentration dropped to ~0.6 μM, it remained relatively constant for as long as 6 d (Table 1, bottom). Despite the lack of proliferation, cells withdrawn for plating during the 24 h anoxic incubation showed normal viability (data not shown). Together, these latter two pieces of evidence suggest under anoxic conditions in the absence of secondary growth substrate, H. salinarum NRC-1 cells enter a state of quiescence. In contrast to the oxic to anoxic shift, during the transition from anoxic to oxic conditions, ATP concentrations increased within 5 min, reaching a plateau at 90 min (Fig. 2
The transcriptional response of H. salinarum NRC-1 to changes in oxygen tension To characterize the transcriptional response to transitions between the oxic and anoxic cell states of H. salinarum NRC-1, we analyzed global changes in mRNA levels (Baliga et al. 2002). K-means clustering followed by principal component analysis (PCA) were used to group genes and assign an “oxic score” to each cluster (Fig. 3A
During anoxic quiescence, the H. salinarum NRC-1 transcriptome is poised for a transition to three metabolic modes of energy transduction Both expected and unexpected results were found within the oxygen-correlated and oxygen-anticorrelated gene group transcriptome data. For example, we were surprised to find that two paralogous copies of the gas vesicle biogenesis proteins were anticorrelated with each other, a result that is discussed in detail in the Supplemental material (Supplemental Fig. 3). However, as expected, our analysis identified two clusters (10 and 19) with high “oxic scores” (8.9 and 9.2, respectively) (Fig. 3B In the oxygen-anticorrelated gene group, transcripts encoding DMSO reductase (Muller and DasSarma 2005), phototrophy, and their respective transcription regulators were significantly induced when oxygen was removed, and likewise, were repressed as oxygen increased; oxic score: −4.4 (Figs. 3C In contrast, the arcRABC gene cluster, which encodes the enzymes and regulators for the arginine deiminase (ADI) fermentation pathway, did not transcriptionally respond to oxygen (oxic score = 1.7) (Fig. 4D Large-scale changes in two oppositely expressed, temporally coordinated transcriptomes are associated with the transition from the anoxic to oxic cellular state Transcriptional responses to stress can often be classified into early, middle, or late events based on their temporal separation. Although oxygen-correlated and oxygen-anticorrelated transcripts were both regulated to new steady-state levels within ~90 min of an oxygen perturbation (Fig. 3B,C The translational response of H. salinarum NRC-1 to changes in oxygen tension To determine the extent to which the transcript level changes described above are dynamically reflected at the protein level, and in turn, at the phenotype level, whole-cell quantitative proteomics analysis was conducted using the mass spectrometry-based iTRAQ method (Ross et al. 2004; Whitehead et al. 2006; Stensjo et al. 2007). Over all 28 time points in the proteomics data set (Table 2) 1294 of the 2400 predicted proteins in the H. salinarum NRC-1 proteome (54%) were detected under conditions of fluctuating oxygen (Supplemental Table 3). For these detected proteins, a stringent probability cutoff of ≥0.9 was chosen, giving a low average false discovery rate of 0.6% ± 0.002% at the peptide and protein levels (Nesvizhskii et al. 2003; Keller et al. 2005; Nesvizhskii and Aebersold 2005) (Methods). Using these criteria, up to 40% of the total predicted proteome was detected in each of the 10 multiplexed samples (Table 3). The overall coverage for this study (54%) is the best recorded to date for quantitative microbial shotgun proteomics experiments (Supplemental material; Baliga et al. 2002; Whitehead et al. 2006). In addition, analysis of replicates showed that the coefficient of variation for iTRAQ was comparable to other established quantitative methods such as ICAT (CV ~18% vs. 20%, respectively; Supplemental Fig. 5; Molloy et al. 2005).
Despite evenly distributed sampling from the oxic and anoxic growth regimes, peptides from proteins encoded by oxygencorrelated transcripts were significantly over-represented (79% detection, P < 10−20), and conversely, the proteins encoded by oxygen-anticorrelated transcripts were significantly under-represented (27% detection, P < 10−25) in the proteomics data set (Supplemental Fig. 6). To determine the potential reasons for this bias, we calculated the codon adaptation indices (CAI) for all of the proteins that were detected (Methods) (Wu et al. 2005). CAI measures the bias at the wobble position in each codon, and a bias toward preferred codons (high CAI) has been shown to be directly proportional to the absolute levels of gene expression (Sharp and Li 1987). For the 97 oxygen-responsive proteins that were detected in our proteomics data set (Table 3), the average CAI was higher for oxygen-correlated than that for oxygen-anticorrelated proteins (0.78 vs. 0.65). In addition, the average number of peptides detected for oxygen-correlated proteins was 6.5, whereas that for anticorrelated proteins was two (Supplemental Table 5). Combined, these data suggest that the proteins associated with oxic physiology (oxygen-correlated) of H. salinarum NRC-1 are in higher abundance relative to anoxic proteins (oxygen-anticorrelated) irrespective of the oxygen tension in the environment. This bias toward oxic proteins was taken into consideration in all subsequent analyses (see below; Fig. 5
Temporal dynamics of the oxygen response Comparison of transcriptome and proteome data suggests instances of post-transcriptional regulation in response to oxygen Transcription and translation are rapid, dynamic events that occur sequentially on the order of minutes. Due to the technological limitations of proteomics technology, previous studies comparing transcriptome and proteome measurements have either measured only one time point (Gygi et al. 1999; Baliga et al. 2002; Griffin et al. 2002; Kolkman et al. 2006; Newman et al. 2006) or measured a few time points on the order of hours (Whitehead et al. 2006) or days (Kislinger et al. 2005; Cox et al. 2007). Therefore, to calculate a more realistic relationship between transcript and protein-expression dynamics during a cell-state transition, we computed the correlation between the interpolated transcript profile for each gene in our data set with its corresponding time-shifted cognate protein profile (28 time points) in 1-min intervals from −5 to +40 min (Whitehead et al. 2006; Fig. 5A,B For genes with less coverage (i.e., np < 9), we were able to repeat the PTLC analysis for combined mRNA and protein profiles of genes in the same operon (Methods) if we assumed that genes in an operon are cotranslated as well as cotranscribed. This operon analysis enabled detailed time-lag calculation for some genes with relatively poor individual signal in the protein data (Supplemental Fig. 8). To our knowledge, this is the first system-wide study of its kind to enable calculation of time lags between dynamic changes in transcription and translation at the level of individual genes (Yildirim and Mackey 2003). Previously, low correlation from single time-point studies has often been interpreted as evidence of post-transcriptional regulation (Gygi et al. 1999; Ideker et al. 2001; Beyer et al. 2004; Brockmann et al. 2007). However, we find that as more time points are included in the analysis, it becomes possible to identify more genes with significant time-lagged correlations (Supplemental Figs. 7, 8). For example, ~95% of the genes for which we were able to calculate a significant TLC (P < 0.05) had np ≥ 18 (i.e., covered >50% of the time course). Thus, prior studies that invoked post-transcriptional regulation to explain a lack of mRNA/protein correlation in data for one or a few time-point measurements may be doing so prematurely, particularly using current high-throughput proteomics technologies, since those studies may not have sufficient data to investigate dynamics as we have done here. Detailed descriptions of mRNA/protein dynamics therefore requires high-resolution time-course sampling, and points to a critical need for improved proteomics technology, which will increase the quality and comprehensiveness of proteomics data while decreasing cost. Among all genes and operons that were analyzed, we estimated an average peak time lag of Δt ~ 16 min. At this PTL, we find that the percentage of genes with correlations greater than random increases from ~60% at Δt = 0 min, to ~75% at Δt = 16 min (Fig. 5D Stability of mRNA is an additional factor that has significant impact on the time gap between transcription and translation. The median mRNA half-lives measured globally E. coli, Bacillus subtilis, and Lactococcus lactis are on the order of 3–8 min for aerobic cells growing at optimum rates (Bernstein et al. 2002; Hambraeus et al. 2003; Redon et al. 2005). This is somewhat paradoxical because a 3–8 min mRNA half-life in anoxic cells hampered by reduced translational capacity would imply that transcripts are degraded before the 16-min lag when translational precursors become available (Fig. 5 A transient burst of protein synthesis is an early event during the switch to aerobic metabolism Notably, we observed a transient spike in protein levels occurring in the first ~20 min after oxygen upshift. This phenomenon was observed only for certain aerobic operons associated with ribosome biogenesis (Fig. 6
A dynamic temporal map of the physiological response to oxygen Integration of all transcript, protein, physiological, and phenotypic level data generated in this study enabled the temporal mapping of cellular events following a cell-state shift from anoxic quiescence to aerobic growth (Fig. 7
This descriptive model therefore surprisingly suggests that, immediately following the anoxic to oxic transition, an increase in ATP production and the transient induction of certain key proteins precedes transcriptional induction of all genes associated with the aerobic physiological state. We hypothesize that the higher abundance of proteins associated with active growth (Supplemental Fig. 6) and perhaps the preferential stabilization of certain transcripts (e.g., those genes encoding ribosomes and cobalamin biosynthesis enzymes; Fig. 5 Methods Culturing conditions and experimental design H. salinarum NRC-1 (ATCC700922) was routinely grown in complex medium (CM; 250 NaCl, 20 g/L MgSO4·7H2O, 3 g/L sodium citrate, 2 g/L KCl, 10 g/L peptone) at 37°C. For chemostat experiments, starter cultures of NRC-1 were inoculated into 2 L of CM in a 3.0 L vessel (5%–10% inoculum) and grown to mid-logarithmic phase (OD600 ~ 0.5) in batch mode in a BioFlo100 modular bench top fermentor (New Brunswick Scientific) at 300 rpm (pH 7.0). Prior to each experiment, an oxygen sensor (model InPro 6000, Mettler Toledo) was calibrated to 100% oxygen at 1100 rpm and sparging with 3.2 VVM of air. These conditions were approximately equivalent to oxygen saturation in CM medium, which is 1.6 mg/L (~5 uM) according to our salinity compensation calculations (Supplemental Methods). Three replicate chemostat experiments were conducted. Complete data regarding experiment design, sample preparation schedule, growth rates, pH, turbidity, and oxygen concentrations for all time points for the three experiments are listed in Table 1 and depicted in Figure 1. ATP concentration was measured using the ATP Bioluminescent Assay Kit (Sigma) according to the manufacturer’s instructions. To prepare culture samples for the ATP assay, 1 OD unit of culture was pelleted, washed once in basal salts buffer (CM medium without peptone), and lysed in 1 mL of sterile milliQ water (H. salinarum NRC-1 cells lyse readily at osmolarities lower than 2 M NaCl due to their obligate halophilicity). Tenfold serial dilutions of lysates were measured against serially diluted ATP standards using 1:625 diluted bioluminescence assay mix on an EG&G Berthold LB96V microplate luminometer set to inject every 10 sec. RNA preparation and microarray protocol H. salinarum NRC-1 sample cultures (5 mL) were harvested by room temperature centrifugation at 16,000g for 30 sec and snap-frozen on a dry-ice ethanol bath. Sample pellets were stored overnight at −80°C, followed by RNA preparation using the Absolutely-RNA kit (Stratagene) according to the manufacturer’s instructions. A total of 5 μg of each experimental RNA sample was hybridized against the H. salinarum NRC-1 reference RNA prepared under standard conditions (mid-logarithmic phase batch cultures grown at 37°C under full-spectrum light in CM). This common H. salinarum NRC-1 reference RNA has been used as the reference across all 950 microarray experiments in the H. salinarum NRC-1 microarray data repository (Baliga et al. 2004; Bonneau et al. 2006; Kaur et al. 2006; Reiss et al. 2006; Whitehead et al. 2006). Samples were hybridized to a 70-mer oligonucleotide array containing the 2400 nonredundant ORFs of the H. salinarum NRC-1 genome as described in Baliga et al. (2004). Each ORF was spotted on each chip in quadruplicate and dye-flipping was conducted (to rule out bias in dye incorporation) for all time course samples, yielding eight technical replicates per gene per time point. Two biological replicates exist for all time points for a total of 16 replicates per gene (Table 1). Direct RNA labeling, slide hybridization, and washing protocols were performed as described previously (Baliga et al. 2002, 2004), except that Dy547 and Dy647 dyes (Kreatech) were used to directly label RNA. Raw intensity signals from each slide were processed by the SBEAMS-microarray pipeline (Marzolf et al. 2006) (www.SBEAMS.org/microarray), where resultant data was median normalized and subjected to significance of microarray (SAM) and variability and error estimates (VERA) analysis. Each data point was assigned a significance statistic, λ, using maximum likelihood (Ideker et al. 2000). Microarray data analysis Using the Gaggle integrated data analysis software package (Shannon et al. 2006), all mean and variance normalized transcriptome data from the three replicate oxygen time-series experiments (61 total conditions for 2400 genes, analyzed as a concatenated data matrix) were filtered according to the following two protocols to compile the final oxygen-responsive gene list. Protocol 1 Data from each of the three experiments was separately filtered for a VERA-SAM (Marzolf et al. 2006) λ likelihood significance cutoff of 7 in at least five consecutive time points (previously determined to be a significant cutoff above the signal/noise ratio by self-self hybridizations; N. Baliga, unpubl.; Ideker et al. 2000) and subsequently subjected to principal component analysis (PCA). This analysis resulted in ~600 genes for each experiment that were either correlated (~300) or anticorrelated (~300) with oxygen. These gene groups were expanded to include genes putatively associated in operons. These operon associations were calculated using a two-step nonhomology model in which (1) operon boundaries were set by intergenic distance constraints (Moreno-Hagelsieb and Collado-Vides 2002), and (2) operon membership required a significantly correlated mRNA coexpression over ~950 microarray conditions (Baliga et al. 2004; Kaur et al. 2006; Reiss et al. 2006; Whitehead et al. 2006). The intersection of the three separately computed gene groups and the list derived from protocol 2 (described below) comprises the final list of 215 genes. Protocol 2 K-means clustering was used to group the profiles of the entire transcriptome data set (61 time points). To choose an appropriate value for k in our data set, we investigated the mean residuals of the genes from the cluster means for a wide range of k’s (10–200) and k of 30 enabled the best solution for maintaining simplicity (small number of clusters) and optimizing cluster coherence. Principal component analysis (PCA) was then used to assign a numerical measure for each cluster that correlated with its response to oxygen (Fig. 3A Gene annotations were assigned (Fig. 4 Protein preparation and mass spectrometry analysis A total of 5 mL of mid-logarithmic phase H. salinarum NRC-1 (~5 × 109 cells) culture was pelleted at room temperature by centrifugation for 2 min at 9000g. Culture supernatant was discarded and pellets were immediately snap-frozen on a dry-ice ethanol bath and stored overnight at −80°C. Lysis of cell pellets was achieved by resuspending in 1 mM phenylmethylsulfonyl fluoride in water (PMSF, a protease inhibitor; H. salinarum NRC-1 cells lyse readily at osmolarities lower than 2 M NaCl due to their obligate halophilicity); soluble and insoluble protein fractions were separated by centrifugation at 25°C, 16,000g for 5 min. Insoluble protein pellets were dissolved in 3 μL of 10% SDS, mixed with the soluble fraction, and stored at −80°C. Nucleic acid was subsequently removed from protein extracts by incubating at 37°C for 45 min with 37.5 U of Benzonase nuclease (Novagen). Complete digestion of nucleic acid was verified on a 4% agarose gel (data not shown). Proteins were then precipitated with six volumes of cold acetone (as instructed by the iTRAQ reagents kit, Applied Biosystems) to remove interfering SDS and PMSF, followed by resuspension in water. Total protein concentration was determined using the bicinchoninic acid method (Pierce), and 100 μg of protein from untreated reference and oxygen-treated cells were digested with trypsin at 37°C for 12–16 h. Resultant peptides were labeled at primary amines using the iTRAQ reagents multiplex kit (Applied Biosystems) according to the manufacturer’s instructions. Reference samples were labeled with a 114 Da reagent, whereas oxygen-treated time point samples were labeled with each of 115-, 116-, or 117-Da reagents. Detailed multiplex labeling parameters for each of the 10 four-plex time-course sample sets are listed in Table 2. iTRAQ-labeled peptide samples were analyzed via LC-MS/MS using an Applied Biosystems API QSTAR Pulsar i, equipped with an in-house nanospray device. Samples were eluted onto a 10 cm × 75 μm-fused silica microcapillary reversed phase column (packed with 5 μm, 100 Å pore Magic C18AQTM beads; Michrom Bioresources) over a 60-min gradient, ranging from 10:90 (acetonitrile: 0.1% Formic acidaq) to 35:65 with a flow rate of 200 nL/min. Eluting peptides were analyzed using the IDA (Information Dependent Acquisition) function of the Analyst QS software with the two most abundant ions selected for MS/MS. The MS mass range scanned was from 350 to 1300 m/z, and the MS/MS mass range scanned was from 60 to 1800 m/z. Proteomics data analysis Proteomics data analysis was performed essentially as described in Whitehead et al. (2006). Briefly, MS/MS spectra peptide and protein assignment was achieved using SEQUEST and software within the Trans Proteomic Pipeline (TPP) package (Keller et al. 2005) to match spectra against the H. salinarum NRC-1 protein database digested in silico with trypsin (Ng et al. 2000). The static modifications given to SEQUEST for the iTRAQ reagents were an addition of 144.23 to the N terminus, an addition of 45.86 to cysteine, and an addition of 144.102 to lysine. In the assignment process, SEQUEST was allowed to use one missed cleavage. The script running SEQUEST constrained the MS/MS identifications to those spectra with at least five peaks and having a peptide mass between 600 and 4200 Da. Additionally, the region of the spectrum containing the iTRAQ peaks was not passed to SEQUEST for assignment. Subsequent peptide and protein-relative quantitation and error estimation was conducted using the Libra algorithm within the TPP as previously described (Nesvizhskii et al. 2003; Keller et al. 2005; Nesvizhskii and Aebersold 2005; Whitehead et al. 2006) (Supplemental Methods). The data for the 10 iTRAQ sets were then merged and loaded into the Gaggle software package to facilitate data integration and visualization (Shannon et al. 2006; Whitehead et al. 2006). Codon adaptation indices were calculated with a web-based CAI calculator (http://www.evolvingcode.net/codon/cai/cais.php) using the halobacterial codon usage table based on ribosomal protein codons. Similar results were obtained using alternative codon usage tables for H. salinarum NRC-1 (Puigbo et al. 2007; Supplemental material). Integrated statistical systems analysis of proteomic and microarray data The time-lagged correlation profiles (TLCPs) between mRNA and protein time-series responses for each gene or operon are a series of Pearson correlations computed between time-shifted protein measurements and the corresponding interpolated time points from the mRNA measurements. The mRNA profiles, rather than protein profiles, were chosen for interpolation because they were, in general, less noisy and had no missing values. We used cross-validated cubic spline interpolation (Berloff et al. 2002) for the final analysis, although we found that the significant results described in this report were independent of the interpolation method chosen. TLCPs were computed only for genes that had nine or more protein-level observations. Resultant TLCPs (correlation as a function of protein time lag) were computed for time lags between −5 and +40 min. Peak time-lagged correlations (PTLCs) for each gene were assessed, and the corresponding time lag, Δt, was identified. To validate and assess the significance of the observed PTLC for each gene, we performed two permutation tests, in which the TLC analysis was performed on data that were randomized by: (1) permuting the rows of the mRNA data matrix such that each gene's protein profile is compared with the mRNA profile for a different, randomly selected gene, but where the mRNA profile is correctly time-ordered (hereafter, the “shuffle” test); and (2) randomizing all values in the mRNA data matrix so that the protein profile is compared against randomly ordered mRNA data that contains the same variance structure as the original mRNA data matrix (hereafter, the “scramble” test). Each permutation test was repeated 100 times for each gene (Fig. 5C We computed an aggregate, global time-lag profile by counting the total fraction of occurrences, whereby each gene obtained a TLC better than each of its 100 permutation-test-based TLCs (Fig. 5D Acknowledgments We thank Marc T. Facciotti and Kenia Whitehead for their ideas, support, and critical reading of the manuscript at all stages. We thank Vesteinn Thorsson for enlightening discussions on the time-lagging model and Paul Shannon and Christopher Bare for their expert Gaggle and Cytoscape software support throughout this project. This work was funded by grants from DOE (DE-FG02-04ER63807 and DE-AC02-05CH11231), NSF (EF-0313754), NASA (NNG05GN58G), and NIH (P50 GM076547) to N.S.B., and a postdoctoral fellowship to A.K.S. from NIH (5F32GM078980-02). Footnotes [Supplemental material is available online at www.genome.org. The microarray data from this study have been submitted to GEO under accession nos. GSE7559 and GSE5929.] Article published online before print. Article and publication date are at http://www.genome.org/cgi/doi/10.1101/gr.6728007 References
|
PubMed related articles
Your browsing activity is empty. Activity recording is turned off. |
|||||||||||||||||||||||
Adv Microb Physiol. 2002; 46():111-53.
[Adv Microb Physiol. 2002]Annu Rev Microbiol. 1998; 52():165-90.
[Annu Rev Microbiol. 1998]Proc Natl Acad Sci U S A. 1996 Sep 3; 93(18):9493-8.
[Proc Natl Acad Sci U S A. 1996]Nat Genet. 1999 Jan; 21(1 Suppl):3-4.
[Nat Genet. 1999]Proc Natl Acad Sci U S A. 2002 Aug 20; 99(17):11049-54.
[Proc Natl Acad Sci U S A. 2002]Proc Natl Acad Sci U S A. 2006 Dec 19; 103(51):19302-7.
[Proc Natl Acad Sci U S A. 2006]Mol Cell Biol. 1999 Mar; 19(3):1720-30.
[Mol Cell Biol. 1999]Science. 2001 May 4; 292(5518):929-34.
[Science. 2001]Proc Natl Acad Sci U S A. 2000 Oct 24; 97(22):12176-81.
[Proc Natl Acad Sci U S A. 2000]Proc Natl Acad Sci U S A. 1980 Jul; 77(7):3821-5.
[Proc Natl Acad Sci U S A. 1980]J Bacteriol. 1996 Aug; 178(16):4942-7.
[J Bacteriol. 1996]Proc Natl Acad Sci U S A. 2002 Nov 12; 99(23):14913-8.
[Proc Natl Acad Sci U S A. 2002]J Bacteriol. 2005 Mar; 187(5):1659-67.
[J Bacteriol. 2005]Proc Natl Acad Sci U S A. 1980 Jul; 77(7):3821-5.
[Proc Natl Acad Sci U S A. 1980]Ann Microbiol (Paris). 1983 Jul-Aug; 134B(1):137-50.
[Ann Microbiol (Paris). 1983]J Bacteriol. 2005 Mar; 187(5):1659-67.
[J Bacteriol. 2005]Proc Natl Acad Sci U S A. 2002 Nov 12; 99(23):14913-8.
[Proc Natl Acad Sci U S A. 2002]Genome Biol. 2006; 7(5):R36.
[Genome Biol. 2006]Nat Genet. 2000 May; 25(1):25-9.
[Nat Genet. 2000]Trends Genet. 1997 Sep; 13(9):375-6.
[Trends Genet. 1997]Science. 1997 Oct 24; 278(5338):631-7.
[Science. 1997]J Bacteriol. 2005 Mar; 187(5):1659-67.
[J Bacteriol. 2005]J Bacteriol. 1996 Aug; 178(16):4942-7.
[J Bacteriol. 1996]Proc Natl Acad Sci U S A. 2002 Nov 12; 99(23):14913-8.
[Proc Natl Acad Sci U S A. 2002]Curr Opin Microbiol. 1999 Apr; 2(2):153-8.
[Curr Opin Microbiol. 1999]Genome Res. 2006 Jul; 16(7):841-54.
[Genome Res. 2006]J Biol Chem. 2006 Sep 22; 281(38):27806-15.
[J Biol Chem. 2006]J Biol Chem. 2007 Apr 13; 282(15):11230-7.
[J Biol Chem. 2007]Mol Cell Proteomics. 2004 Dec; 3(12):1154-69.
[Mol Cell Proteomics. 2004]Mol Syst Biol. 2006; 2():47.
[Mol Syst Biol. 2006]J Proteome Res. 2007 Feb; 6(2):621-35.
[J Proteome Res. 2007]Anal Chem. 2003 Sep 1; 75(17):4646-58.
[Anal Chem. 2003]Mol Cell Proteomics. 2005 Oct; 4(10):1419-40.
[Mol Cell Proteomics. 2005]Proc Natl Acad Sci U S A. 2002 Nov 12; 99(23):14913-8.
[Proc Natl Acad Sci U S A. 2002]Mol Syst Biol. 2006; 2():47.
[Mol Syst Biol. 2006]Proteomics. 2005 Apr; 5(5):1204-8.
[Proteomics. 2005]Microbiology. 2005 Jul; 151(Pt 7):2175-87.
[Microbiology. 2005]Nucleic Acids Res. 1987 Feb 11; 15(3):1281-95.
[Nucleic Acids Res. 1987]Mol Cell Biol. 1999 Mar; 19(3):1720-30.
[Mol Cell Biol. 1999]Science. 2001 May 4; 292(5518):929-34.
[Science. 2001]Mol Cell Biol. 1999 Mar; 19(3):1720-30.
[Mol Cell Biol. 1999]Proc Natl Acad Sci U S A. 2002 Nov 12; 99(23):14913-8.
[Proc Natl Acad Sci U S A. 2002]Mol Cell Proteomics. 2002 Apr; 1(4):323-33.
[Mol Cell Proteomics. 2002]Nature. 2006 Jun 15; 441(7095):840-6.
[Nature. 2006]Mol Syst Biol. 2006; 2():47.
[Mol Syst Biol. 2006]Biophys J. 2003 May; 84(5):2841-51.
[Biophys J. 2003]Mol Cell Biol. 1999 Mar; 19(3):1720-30.
[Mol Cell Biol. 1999]Science. 2001 May 4; 292(5518):929-34.
[Science. 2001]Mol Cell Proteomics. 2004 Nov; 3(11):1083-92.
[Mol Cell Proteomics. 2004]PLoS Comput Biol. 2007 Mar 23; 3(3):e57.
[PLoS Comput Biol. 2007]Biophys J. 2003 May; 84(5):2841-51.
[Biophys J. 2003]EMBO J. 1984 Jan; 3(1):91-4.
[EMBO J. 1984]Microbiol Rev. 1989 Jun; 53(2):256-71.
[Microbiol Rev. 1989]Antimicrob Agents Chemother. 2004 Jul; 48(7):2659-64.
[Antimicrob Agents Chemother. 2004]Cancer Biol Ther. 2004 Jun; 3(6):492-7.
[Cancer Biol Ther. 2004]Proc Natl Acad Sci U S A. 2002 Jul 23; 99(15):9697-702.
[Proc Natl Acad Sci U S A. 2002]Mol Genet Genomics. 2003 Aug; 269(5):706-14.
[Mol Genet Genomics. 2003]J Bacteriol. 2005 May; 187(10):3589-92.
[J Bacteriol. 2005]Mol Microbiol. 1993 Jul; 9(2):375-81.
[Mol Microbiol. 1993]Microbiology. 1998 Mar; 144 ( Pt 3)():739-50.
[Microbiology. 1998]Annu Rev Biochem. 2005; 74():199-217.
[Annu Rev Biochem. 2005]Microbiology. 1998 Mar; 144 ( Pt 3)():739-50.
[Microbiology. 1998]Environ Microbiol. 2000 Aug; 2(4):355-65.
[Environ Microbiol. 2000]Proc Natl Acad Sci U S A. 2002 Jul 23; 99(15):9697-702.
[Proc Natl Acad Sci U S A. 2002]Mol Genet Genomics. 2003 Aug; 269(5):706-14.
[Mol Genet Genomics. 2003]Ann Microbiol (Paris). 1983 Jul-Aug; 134B(1):137-50.
[Ann Microbiol (Paris). 1983]J Bacteriol. 1996 Aug; 178(16):4942-7.
[J Bacteriol. 1996]J Bacteriol. 2005 Mar; 187(5):1659-67.
[J Bacteriol. 2005]Genome Res. 2004 Jun; 14(6):1025-35.
[Genome Res. 2004]Genome Biol. 2006; 7(5):R36.
[Genome Biol. 2006]Genome Res. 2006 Jul; 16(7):841-54.
[Genome Res. 2006]BMC Bioinformatics. 2006 Jun 2; 7():280.
[BMC Bioinformatics. 2006]Mol Syst Biol. 2006; 2():47.
[Mol Syst Biol. 2006]BMC Bioinformatics. 2006 Mar 28; 7():176.
[BMC Bioinformatics. 2006]BMC Bioinformatics. 2006 Jun 6; 7():286.
[BMC Bioinformatics. 2006]J Comput Biol. 2000; 7(6):805-17.
[J Comput Biol. 2000]Bioinformatics. 2002; 18 Suppl 1():S329-36.
[Bioinformatics. 2002]Genome Res. 2004 Jun; 14(6):1025-35.
[Genome Res. 2004]Genome Res. 2006 Jul; 16(7):841-54.
[Genome Res. 2006]Genome Biol. 2006; 7(5):R36.
[Genome Biol. 2006]Trends Genet. 1997 Sep; 13(9):375-6.
[Trends Genet. 1997]Science. 1997 Oct 24; 278(5338):631-7.
[Science. 1997]BMC Bioinformatics. 2003 Sep 11; 4():41.
[BMC Bioinformatics. 2003]Nucleic Acids Res. 2000 Jan 1; 28(1):27-30.
[Nucleic Acids Res. 2000]Nucleic Acids Res. 2006 Jan 1; 34(Database issue):D247-51.
[Nucleic Acids Res. 2006]Mol Syst Biol. 2006; 2():47.
[Mol Syst Biol. 2006]Proc Natl Acad Sci U S A. 2000 Oct 24; 97(22):12176-81.
[Proc Natl Acad Sci U S A. 2000]Anal Chem. 2003 Sep 1; 75(17):4646-58.
[Anal Chem. 2003]Mol Cell Proteomics. 2005 Oct; 4(10):1419-40.
[Mol Cell Proteomics. 2005]BMC Bioinformatics. 2006 Mar 28; 7():176.
[BMC Bioinformatics. 2006]J Comput Biol. 2002; 9(3):465-75.
[J Comput Biol. 2002]