![]() | ![]() |
Formats:
|
||||||||||||||||||||||||
Halobacterium salinarum NRC-1 PeptideAtlas: strategies for targeted proteomics † Institute for Systems Biology, 1441 N 34th Street, Seattle, WA 98103, USA ‡ Department of Biology, University of Washington, Seattle, WA 98195, USA ‡‡ Department of Microbiology, University of Washington, Seattle, WA 98195, USA ↕ Department of Medicinal Chemistry, University of Washington, Seattle, WA 98195, USA § Spielberg Family Center for Applied Proteomics, Cedars-Sinai Medical Center, 8750 W Beverly Blvd, Los Angeles, CA 90048, USA. *Author to whom correspondence should be addressed: Nitin S. Baliga Telephone: 206−732−1266 Fax: 206−732−1299 Email: nbaliga/at/systemsbiology.org The publisher's final edited version of this article is available at J Proteome Res. See other articles in PMC that cite the published article.Abstract The relatively small numbers of proteins and fewer possible posttranslational modifications in microbes provides a unique opportunity to comprehensively characterize their dynamic proteomes. We have constructed a Peptide Atlas (PA) for 62.7% of the predicted proteome of the extremely halophilic archaeon Halobacterium salinarum NRC-1 by compiling approximately 636,000 tandem mass spectra from 497 mass spectrometry runs in 88 experiments. Analysis of the PA with respect to biophysical properties of constituent peptides, functional properties of parent proteins of detected peptides, and performance of different mass spectrometry approaches has helped highlight plausible strategies for improving proteome coverage and selecting signature peptides for targeted proteomics. Notably, discovery of a significant correlation between absolute abundances of mRNAs and proteins has helped identify low abundance of proteins as the major limitation in peptide detection. Furthermore we have discovered that iTRAQ labeling for quantitative proteomic analysis introduces a significant bias in peptide detection by mass spectrometry. Therefore, despite identifying at least one proteotypic peptide for almost all proteins in the PA, a context-dependent selection of proteotypic peptides appears to be the most effective approach for targeted proteomics. Keywords: Peptide Atlas, Halobacterium, iTRAQ, bioinformatics, archaea, proteomics INTRODUCTION A complete genome sequence presents a one-dimensional perspective of the physiological potential of an organism. It is the temporally and spatially coordinated expression of genes into functional protein networks that yields emergent behavior that is unique to each species. Therefore, to fully understand how cells function at a systems level it is imperative to measure, assimilate and simultaneously analyze changes that occur at all levels of genetic information processing1. The transcriptome is dynamic and relatively easy to monitor comprehensively using whole genome microarrays, providing insight into which genes respond and assist in adaptation of the organism to a particular environment2. However, much more information on regulatory processes remains locked within the proteome. There exist important differences between the transcriptome and the proteome that stem from a variety of post-transcriptional processes, such as regulated degradation and posttranslational modifications, thus elevating the importance of comprehensive analysis of dynamic changes in the proteome in response to various environmental perturbations3, 4. However, comprehensive detection of the proteome is fraught with technical challenges, especially with regard to proteins that are present in low abundance, integral to the membrane, or uniquely expressed in an environment-specific manner. Even within the same protein some peptides are more tractable than others using mass spectrometry-based approaches. While there are several existing hypotheses regarding the underlying reasons that make some peptides more tractable than others (i.e. biophysical properties such as isoelectric point (pI), hydrophobicity, and length)5, certain properties such as protease accessibility, protein structure, and protein modifications complicate any attempt to make accurate predictions of peptide tractability by mass spectrometry (MS) using purely theoretical approaches. The PeptideAtlas (PA) project was initiated to map the proteome of a given organism, cell type or tissue as experimentally detected by the mass spectrometer6. The PA is technology agnostic and can make use of data from a variety of MS proteomic approaches such as qualitative proteomics surveys, tandem MS of immunoprecipitated complexes, and quantitative proteomics (e.g. ICAT and iTRAQ). Once constructed, the PA can be used as a reference for designing targeted proteomic strategies such as multiple reaction monitoring (MRM) as well as absolute protein quantification7. PeptideAtlas databases have been constructed for the human8, human plasma9, fly10, and yeast11 proteomes. Here we report a PA for Halobacterium salinarum NRC-1, an obligate halophilic archaeon that evolved unique adaptations, such as increased surface negative charges on folded proteins for survival in its extreme environment of 4.5 M salt12. H. salinarum NRC-1 has a completely sequenced and easily manipulable genome and as such has been used as a model system for constructing a predictive model of cellular responses13 to a diverse array of routine and stressful environmental changes2, 4, 14-17. The PA represents the product of integration and re-processing of data from a wide array of proteomics experiments (surveys of fractionated proteomes, enrichment of complexes by immunoprecipitation, and ICAT- and iTRAQ-based quantitative analysis of proteomic changes) in these environmental response studies. This exercise has verified the expression of 63% of the predicted proteome of H. salinarum NRC-1 including previously undetected and potentially new members of a diverse array of physiological processes. Through extensive analysis of peptides in the PA in context of function, biophysical properties, and abundance, we have identified several factors that might have contributed to our inability to detect 37% of the proteome. Notably by demonstrating a significant correlation between absolute abundance of proteins and transcripts we have identified low abundance of proteins as the main limiting factor in peptide detection by mass spectrometry. We have also conducted a comparative analysis of all the various proteomics approaches that contributed data to the construction of the PA to craft strategies for improving coverage and using proteotypic reference peptides for targeted proteomics. MATERIALS AND METHODS Cell culture conditions, protein preparation and mass spectrometric conditions All details regarding cell culturing, protein preparation, and mass spectrometry conditions are discussed in the corresponding publications on H. salinarum NRC-1 for each of the mass spectrometry proteomics methods included in the PA4, 17-20. These methods include iTRAQ, ICAT, cell fractionation, enrichment by immunoprecipitation, and gel band extracted proteins. However, to aid clarity in the present study, we have delineated pertinent details regarding these procedures in Table 1.
PeptideAtlas build summary A PA is created by identifying the peptides in MS/MS spectra, calculating the genomic coordinates of the peptides, and storing the datasets and derived information in a database for subsequent data mining10. The H. salinarum NRC-1 PA was constructed from 88 experiments [immunoprecipitation (IP), quantitative proteomic analysis using isotopic reagents (ICAT and iTRAQ), and proteome surveys via fractionation into soluble and membrane fractions] comprised of a total of 636,000 MS/MS from multiple spectrometer vendors [Sciex QStar (Applied Biosystems, Foster City, CA), Micromass QTOF (Waters, Milford, MA) and LCQ (ThermoFinnigan, Waltham, MA)] (Table 2). For each experiment, the vendor format MS/MS spectra were converted to mzXML format21 and assigned to peptides using SEQUEST22 and the complete set of H. salinarum NRC-1 protein sequences derived from the original genome annotation12 and the National Center for Biotechnology Information (NCBI) and SwissProt sequence databases. The peptide identifications were scored using PeptideProphet23 and filtered to retain only those with P ≥ 0.9, which corresponds to a spectrum identification false discovery rate of 1.1%. After all experiments were processed, the peptides were aligned to the reference proteome. The chromosomal coordinates of peptides from this analysis were verified against NCBI's Generic Features File (GFF) files and manually-curated data maintained at ISB (http://baliga.systemsbiology.net/halobacterium) in Systems Biology Experimental Analysis Management System (SBEAMS), a Relational Database Management System (RDBMS) (http://www.sbeams.org), which was also used to archive all of the PA results. We generated a complete library of tryptic peptides by performing an in silico digest of the entire H. salinarum NRC-1 predicted proteome, allowing for one missed cleavage. A measure of peptide observability, the Empirical Observability Score (EOS) (E. Deutsch, personal communication) was calculated for each peptide using the following equation: Nsamples (peptide) / Nsamples (protein). For example, if a protein was seen in 10 different samples, and one of its constituent peptides was seen in 5 of those samples, EOS of that peptide would be 0.5.
Calculation of mRNA/protein abundance correlation To calculate transcript abundance for each of the 1,646 genes whose cognate proteins were detected in the PA (Table 2), we computed the arithmetic mean intensity for that gene across 215 microarray conditions (Supplementary Table ST-1). These intensities were then log10 transformed. Cultures prepared for these microarray experiments were treated identically to those used for the proteomics experiments included in the PA (Table 1, Supplementary Table ST-1; conditions included gamma radiation stress4, UV radiation24, oxygen transitions20, and genetic knockouts17, 18, 24). To calculate sequence coverage per protein (Fig. 3A
RESULTS AND DISCUSSION Construction of the H. salinarum NRC-1 PeptideAtlas A total of 636,000 tandem mass (MS/MS) spectra from 88 proteomic experiments in 497 individual runs representing at least three types of approaches and three types of mass spectrometers (Materials and Methods) were converted to a common file format (mzXML) (Table 2). Using SEQUEST and PeptideProphet, 76,212 MS/MS spectra or ~12% of all MS/MS spectra had significant matches (P ≥ 0.9) to peptides from 1,646 predicted proteins in H. salinarum NRC-1 (Table 2), resulting in a false discovery rate of 1.1%. This represents 1,461 non-redundant proteins or 63% of the predicted proteome, thus improving coverage by 1.7-fold over a previous report that made use of a two-dimensional separation approach for protein cataloguing25. To facilitate further analysis, the PA module has been integrated with the H. salinarum NRC-1 protein annotation module in SBEAMS –a relational database system for managing systems biology data (http://baliga.systemsbiology.net/halobacterium/). Physiological functions represented in the H. salinarum NRC-1 PA Although gene finding algorithms such as GLIMMER can identify protein-coding genes with relatively low error rates26, until verified experimentally these genes are considered putative. This is an especially important concept considering that over a third of all genes predicted from almost all completely sequenced genomes do not match experimentally characterized orthologs. The identification of a peptide verifies the expression of the parent protein predicted from the genome sequence. As such we have verified the expression of 1,461 non-redundant proteins predicted in the H. salinarum genome. Of these 1,029 proteins (9,330 peptides) had significant matches to PFAM signatures (e–value < 0.001)27; 1,157 proteins (10,490 peptides) had significant matches to Clusters of Orthologous Groups (COGs) (e-value < 0.001)28; 902 proteins (9,012 peptides) matched manually-curated functional annotations12, 29, and 838 proteins (12,410 peptides) mapped to distinct enzymatic steps within 77 metabolic pathways in Kyoto Encyclopedia of Genes and Genomes (KEGG)30. In summary, we have experimentally verified the expression of at least 989 proteins (37.6% of the H. salinarum NRC-1 proteome) with some putative functional annotation, which represents a 2.3-fold improvement over the 16.2% verification in a previous proteomic survey25. More importantly, we have verified the expression of at least 300 proteins with no significant matches to experimentally characterized proteins. Below we provide some highlights from this analysis. Detection of essential cellular functions Consistent with previous H. salinarum NRC-1 high-throughput proteomics studies, we observed a high degree of coverage for proteins involved in essential cellular functions (Table 3, Supplemental Table ST-2; Supplemental Figure SF-1). For example, with regard to genetic information processing, unique peptides were detected from five of the six predicted DNA polymerase proteins or subunits (PolA1 was not detected). We have detected unique peptides from 10 of the 12 predicted putative RNA polymerase (RNAP) subunits. In addition, we detected a putative 7 KDa RNAP subunit, Rpc10 (COG1996), which is not co-transcribed with any of the other known RNA polymerase subunits and was not detected in any previous H. salinarum NRC-1 proteomic surveys. With regard to protein synthesis, secretion and degradation, unique peptides from all 55 ribosomal proteins, all 20 amino acid-tRNA synthetases, elongation factors EF-1α and β, and EF-2, 11 translation initiation factors, 6 putative sec-dependent secretion proteins, 5 putative twin-arginine translocation proteins, and five proteases were detected. Also as expected, proteins involved with cellular motility and relocation including 9 chemotaxis proteins, 8 flagellar proteins, and 11 gas vesicle biogenesis proteins were detected. At least 53 out of the 75 predicted membrane ABC transport system subunits have been detected.
Identification of specialized components of H. salinarum physiology We have detected 20 of the 26 components of the four unique modes of energy production in H. salinarum NRC-1, including oxidative phosphorylation, arginine fermentation, phototrophy and dimethyl sulfoxide (DMSO) respiration (Table 3, Supplemental Table ST-2). Although all proteins from the arginine deiminase pathway, including ArcRABC, were previously detected25, here we have newly detected previously elusive components of the DMSO and phototrophic respiratory pathways DmsC and the regulator Bat (Table 3). Specifically, we were able to detect 183 unique peptides from 13 proteins involved in energy production via bacteriorhodopsin-mediated phototrophy by enriching for the purple membrane (Table 1: fractionation, ICAT, and bacteriorhodopsin IP gel band extraction experiments)31. Interestingly, in cells which overexpress the purple membrane (Table 1: ICAT experiments), we also detected VNG1459H, a protein of unknown function. VNG1459H co-localizes in the genome and is significantly co-expressed under relevant environmental conditions with other known phototrophy genes13, 32, 33. While the exact function remains to be tested, these data support the prediction that this protein may be involved in phototrophy, an extension of a process that was considered well understood. Fractionation and subsequent solubilization with detergents also improved the detection of other membrane-associated proteins, allowing detection of 188 out of 550 proteins with predicted transmembrane domains19, 34 (Fig. 2D
Strategies for improving proteome coverage Despite the unprecedented proteome coverage of the H. salinarum NRC-1 PA, it is significant that nearly 37% of all predicted proteins were not detected. In fact we observe from a cumulative plot of numbers of distinct peptides detected as a function of individual experiment type that we have reached an apparent threshold that was previously predicted6, 8, 11 but not observed until now (Fig. 1 Influence of protein molecular weight on detection As expected, we observed better detection of peptides with an increase in molecular weight up to 1,500 Da (Fig. 2A Isoelectric point The isoelectric point of a peptide influences its enrichment depending on the type of fractionation columns used for enriching peptides (or proteins) during sample preparation. Most proteins in H. salinarum NRC-1 have a relatively higher number of acidic residues and the resulting surface negative charge is believed to help circumvent protein aggregation and precipitation in a hypersaline cytoplasm37. Consequently most peptides in H. salinarum NRC-1 PA are also acidic with a median isoelectric point of 4.4 (Fig. 2B Hydrophobicity Peptides of very low hydrophobicity were poorly detected. This is expected because of the property of the LC column used in most of our experiments34. Low hydrophobicity peptides are washed off from these columns before the mass spectrometer has a chance to analyze them. Also, as expected, peptides with hydrophobicity greater than ~30 were detected at a relatively lower frequency, perhaps due to their low solubility (Fig. 2C Solubility Despite enrichment of membrane proteins in some experiments, this fraction of the proteome is poorly represented in the PA (Fig. 1 Influence of protein abundance on peptide detection Abundance of proteins in a population can significantly influence the time a mass spectrometer spends analyzing each unique protein species38. Since there is no independent approach to measure absolute abundance of proteins on a systems scale we evaluated the use of mRNA signal intensity from microarray-based transcription profiling experiments as a proxy for the same. First we investigated whether mRNA and protein abundance were indeed proportional. The dynamic quantitative relationships between transcription and translation can be assessed at the level of absolute abundance or relative changes in the outputs of mRNA and protein. Although comparisons of relative changes across protein and mRNA concentrations have yielded variable relationships in some studies3, 39-43, we have previously demonstrated that given sufficient numbers of temporal measurements for both RNA and protein level changes over time scales of minutes, for most genes there exists significant time-lagged correlation between relative changes in transcript and protein abundance2, 16. However, a significant correlation value between absolute mRNA and protein across the entire genome has not yet been reported40, 44. To assess this relationship we compared mRNA signal intensities from 215 microarray experiments4, 20, 24 (Supplementary Table ST-1) to average spectral counts (over all peptides) per protein from 497 mass spectrometric runs. This comparison yielded significant correlation across the two datasets (Spearman correlation ~0.5; P < 10−6) indicating that the abundance of most proteins is proportional to the abundance of their corresponding transcripts (Fig. 3C Strategies for targeted proteomics As the PA becomes increasingly comprehensive we can use it for designing strategies for high throughput approaches to rapidly characterize the proteome –both qualitatively and quantitatively. A tangible approach to accomplish this is via the use of “proteotypic peptides”5, 45, which are peptides that map uniquely to one protein and are likely to be observed with LCMS/MS if the protein is present. We selected proteotypic peptides as those that (i) receive a PeptideProphet score of P > 0.9, (ii) were detected in more than one experiment, and (iii) have an Empirical Observability Score (EOS) > 0.3 (Materials and Methods). These peptides can be used as beacons for tracking specific proteins in high-throughput experiments for the targeted analysis of the proteome, greatly reducing mass-spectrometer time and improving proteome coverage at the same time. Proteotypic peptides can also aid in the QCAT approach7, in which known quantities of labeled synthetic peptides are spiked in and used as reference for absolute quantification of proteins. Using the criteria listed above, we have identified proteotypic peptides for 1,505 proteins or 57.3% of the proteome (Table 4). In other words, we can now, in principle, tune the mass spectrometer to specifically search for 1,505 mass spectra instead of a possible ~76,212 (Table 4), which represents a 50-fold reduction in mass spectrometer time to get information on the same number of proteins. However, the selection of proteotypic peptides for practical applications is more complicated since the PA represents a diverse array of experimental techniques (ICAT, iTRAQ, immunoprecipitation, etc.) designed to address different scientific problems. Each of these approaches could have an inherent bias in peptide detection as suggested by the observation that a majority of proteins were observed in a small number of runs; for example, 633 (43%) of the 1,461 observed non-redundant proteins were observed in 5 or fewer of the 497 runs (Fig. 4
Despite the potential for these inherent biases, we were able to make a fair and statistically significant comparison of performance across two of the most information rich datasets that shared significant numbers of detected proteins: iTRAQ vs. all other shotgun proteomics methods (Fig. 5
Supp. data Click here to view.(840K, pdf) ACKNOWLEDGEMENTS We thank Christopher Bare and David Campbell for help with programming, database construction and false discovery rate calculation, and Ning Zhang for help with generating proteotypic peptide scores. This work was supported by grants from NIH (P50GM076547 and 1R01GM077398-01A2), DoE (MAGGIE: DE-FG02-07ER64327), NSF (EF-0313754, EIA-0220153, MCB-0425825, DBI-0640950) and NASA (NNG05GN58G) and Institute for Systems Biology institutional support to NSB, postdoctoral fellowships are acknowledged from NSF to MTF (DBI 0400598) and KW (0443746), and from NIH (5F32GM078980-02) to AKS. Footnotes SUPPORTING INFORMATION AVAILABLE A comparative analysis revealed possible similarities between spectral counts and genome organization, a finding discussed in the Supporting Information. Supporting figures (Figure SF-1, Figure SF-2) and Supplementary Tables (ST1 and ST2) are freely available online at http://pubs.acs.org. REFERENCES 1. Facciotti MT, Bonneau R, Hood L, Baliga NS. Systems biology experimental design--considerations for building predictive gene regulatory network models for prokaryotic systems. Current Genomics. 2004;5:527–544. 2. Kaur A, Pan M, Meislin M, Facciotti MT, El-Geweley R, Baliga NS. A systems view of haloarchaeal strategies to withstand stress from transition metals. Genome Res. 2006;16:841–854. [PubMed] 3. Gygi SP, Rochon Y, Franza BR, Aebersold R. Correlation between protein and mRNA abundance in yeast. Mol Cell Biol. 1999;19:1720–1730. [PubMed] 4. Whitehead K, Kish A, Pan M, Kaur A, Reiss DJ, King N, Hohmann L, Diruggiero J, Baliga NS. An integrated systems approach for understanding cellular responses to gamma radiation. Mol Syst Biol. 2006;2:47. [PubMed] 5. Mallick P, Schirle M, Chen SS, Flory MR, Lee H, Martin D, Ranish J, Raught B, Schmitt R, Werner T, Kuster B, Aebersold R. Computational prediction of proteotypic peptides for quantitative proteomics. Nat Biotechnol. 2007;25:125–131. [PubMed] 6. Desiere F, Deutsch EW, King NL, Nesvizhskii AI, Mallick P, Eng J, Chen S, Eddes J, Loevenich SN, Aebersold R. The PeptideAtlas project. Nucleic Acids Res. 2006;34:D655–658. [PubMed] 7. Beynon RJ, Doherty MK, Pratt JM, Gaskell SJ. Multiplexed absolute quantification in proteomics using artificial QCAT proteins of concatenated signature peptides. Nat Methods. 2005;2:587–589. [PubMed] 8. Desiere F, Deutsch EW, Nesvizhskii AI, Mallick P, King NL, Eng JK, Aderem A, Boyle R, Brunner E, Donohoe S, Fausto N, Hafen E, Hood L, Katze MG, Kennedy KA, Kregenow F, Lee H, Lin B, Martin D, Ranish JA, Rawlings DJ, Samelson LE, Shiio Y, Watts JD, Wollscheid B, Wright ME, Yan W, Yang L, Yi EC, Zhang H, Aebersold R. Integration with the human genome of peptide sequences obtained by high-throughput mass spectrometry. Genome Biol. 2005;6:R9. [PubMed] 9. Deutsch EW, Eng JK, Zhang H, King NL, Nesvizhskii AI, Lin B, Lee H, Yi EC, Ossola R, Aebersold R. Human Plasma PeptideAtlas. Proteomics. 2005;5:3497–3500. [PubMed] 10. Desiere F, Deutsch EW, Nesvizhskii AI, Mallick P, Eng JK, Aderem A, Boyle R, Brunner E, Donohoe S, Fausto N, Hafen E, Hood L, Katze MG, Kennedy K, Kregenow F, Lee H, Lin B, Martin D, Ranish J, Rawlings DJ, Samelson LE, Shiio Y, Watts J, Wollscheid B, Wright ME, Yan W, Yang L, Yi E, Zhang H, Aebersold R. Integration of Peptide Sequences Obtained by High-Throughput Mass Spectrometry with the Human Genome. Genome Biology. 2004;5:R9. [PubMed] 11. King NL, Deutsch EW, Ranish JA, Nesvizhskii AI, Eddes JS, Mallick P, Eng J, Desiere F, Flory M, Martin DB, Kim B, Lee H, Raught B, Aebersold R. Analysis of the Saccharomyces cerevisiae proteome with PeptideAtlas. Genome Biol. 2006;7:R106. [PubMed] 12. Ng WV, Kennedy SP, Mahairas GG, Berquist B, Pan M, Shukla HD, Lasky SR, Baliga NS, Thorsson V, Sbrogna J, Swartzell S, Weir D, Hall J, Dahl TA, Welti R, Goo YA, Leithauser B, Keller K, Cruz R, Danson MJ, Hough DW, Maddocks DG, Jablonski PE, Krebs MP, Angevine CM, Dale H, Isenbarger TA, Peck RF, Pohlschroder M, Spudich JL, Jung KW, Alam M, Freitas T, Hou S, Daniels CJ, Dennis PP, Omer AD, Ebhardt H, Lowe TM, Liang P, Riley M, Hood L, DasSarma S. Genome sequence of Halobacterium species NRC-1. Proc Natl Acad Sci U S A. 2000;97:12176–12181. [PubMed] 13. Bonneau R, Facciotti MT, Reiss DJ, Schmid AK, Pan M, Kaur A, Thorsson V, Shannon P, Johnson MH, Bare JC, Longabaugh W, Vuthoori M, Whitehead K, Madar A, Suzuki L, Mori T, Chang DE, Diruggiero J, Johnson CH, Hood L, Baliga NS. A predictive model for transcriptional control of physiology in a free living cell. Cell. 2007;131:1354–1365. [PubMed] 14. Baliga NS, Pan M, Goo YA, Yi EC, Goodlett DR, Dimitrov K, Shannon P, Aebersold R, Ng WV, Hood L. Coordinate regulation of energy transduction modules in Halobacterium sp. analyzed by a global systems approach. Proc Natl Acad Sci U S A. 2002;99:14913–14918. [PubMed] 15. Baliga NS, Bjork SJ, Bonneau R, Pan M, Iloanusi C, Kottemann MCH, Hood L, DiRuggiero J. Systems Level Insights Into the Stress Response to UV Radiation in the Halophilic Archaeon Halobacterium NRC-1. Genome Res. 2004;14:1025–1035. [PubMed] 16. Schmid AK, Reiss DJ, Kaur A, Pan M, King N, Van PT, Hohmann L, Martin DB, Baliga NS. The anatomy of microbial cell state transitions in response to oxygen. Genome Res. 2007;17:1399–1413. [PubMed] 17. Facciotti MT, Reiss DJ, Pan M, Kaur A, Vuthoori M, Bonneau R, Shannon P, Srivastava A, Donohoe SM, Hood LE, Baliga NS. General transcription factor specified global gene regulation in archaea. Proc Natl Acad Sci U S A. 2007;104:4630–4635. [PubMed] 18. Baliga NS, Pan M, Goo YA, Yi EC, Goodlett DR, Dimitrov K, Shannon P, Aebersold R, Ng WV, Hood L. Coordinate regulation of energy transduction modules in Halobacterium sp. analyzed by a global systems approach. Proc Natl Acad Sci U S A. 2002;99:14913–14918. [PubMed] 19. Goo YA, Yi EC, Baliga NS, Tao WA, Pan M, Aebersold R, Goodlett DR, Hood L, Ng WV. Proteomic Analysis of an Extreme Halophilic Archaeon, Halobacterium sp. NRC-1. Mol Cell Proteomics. 2003;2:506–524. [PubMed] 20. Schmid AK, Reiss DJ, Kaur A, Pan M, King N, Van PT, Hohmann L, Martin DB, Baliga NS. The anatomy of microbial cell state transitions in response to oxygen. Genome Res. 2007;17:1399–1413. [PubMed] 21. Pedrioli PG, Eng JK, Hubley R, Vogelzang M, Deutsch EW, Raught B, Pratt B, Nilsson E, Angeletti RH, Apweiler R, Cheung K, Costello CE, Hermjakob H, Huang S, Julian RK, Kapp E, McComb ME, Oliver SG, Omenn G, Paton NW, Simpson R, Smith R, Taylor CF, Zhu W, Aebersold R. A common open representation of mass spectrometry data and its application to proteomics research. Nat Biotechnol. 2004;22:1459–1466. [PubMed] 22. Eng JK, McCormack AL, Yates JRI. An approach to correlate tandem mass spectral data of pepties with amino acid sequences in a protein database. J Am Soc Mass Spectrom. 1994;5:976–989. 23. Keller A, Eng J, Zhang N, Li XJ, Aebersold R. A uniform proteomics MS/MS analysis platform utilizing open XML file formats. Mol Syst Biol. 2005;1:2005, 0017. [PubMed] 24. Baliga NS, Bjork SJ, Bonneau R, Pan M, Iloanusi C, Kottemann MC, Hood L, DiRuggiero J. Systems level insights into the stress response to UV radiation in the halophilic archaeon Halobacterium NRC-1. Genome Res. 2004;14:1025–1035. [PubMed] 25. Gan RR, Yi EC, Chiu Y, Lee H, Kao YC, Wu TH, Aebersold R, Goodlett DR, Ng WV. Proteome analysis of Halobacterium sp. NRC-1 facilitated by the biomodule analysis tool BMSorter. Mol Cell Proteomics. 2006;5:987–997. [PubMed] 26. Delcher AL, Bratke KA, Powers EC, Salzberg SL. Identifying bacterial genes and endosymbiont DNA with Glimmer. Bioinformatics. 2007;23:673–679. [PubMed] 27. Finn RD, Mistry J, Schuster-Bockler B, Griffiths-Jones S, Hollich V, Lassmann T, Moxon S, Marshall M, Khanna A, Durbin R, Eddy SR, Sonnhammer EL, Bateman A. Pfam: clans, web tools and services. Nucleic Acids Res. 2006;34:D247–251. [PubMed] 28. Tatusov RL, Galperin MY, Natale DA, Koonin EV. The COG database: a tool for genome-scale analysis of protein functions and evolution. Nucleic Acids Res. 2000;28:33–36. [PubMed] 29. Bonneau R, Baliga NS, Deutsch EW, Shannon P, Hood L. Comprehensive de novo structure prediction in a systems-biology context for the archaea Halobacterium sp. NRC-1. Genome Biol. 2004;5:R52. [PubMed] 30. Kanehisa M. The KEGG database. Novartis Found Symp. 2002;247:91–101. discussion 101−103, 119−128, 244−152. [PubMed] 31. Hartmann R, Sickinger HD, Oesterhelt D. Anaerobic growth of halobacteria. Proc Natl Acad Sci U S A. 1980;77:3821–3825. [PubMed] 32. Bonneau R, Reiss DJ, Shannon P, Facciotti M, Hood L, Baliga NS, Thorsson V. The Inferelator: an algorithm for learning parsimonious regulatory networks from systems-biology data sets de novo. Genome Biol. 2006;7:R36. [PubMed] 33. Reiss DJ, Baliga NS, Bonneau R. Integrated biclustering of heterogeneous genome-wide datasets for the inference of global regulatory networks. BMC Bioinformatics. 2006;7:280. [PubMed] 34. Schindler PA, Van Dorsselaer A, Falick AM. Analysis of hydrophobic proteins and peptides by electrospray ionization mass spectrometry. Anal Biochem. 1993;213:256–263. [PubMed] 35. Lipton MS, Pasa-Tolic L, Anderson GA, Anderson DJ, Auberry DL, Battista JR, Daly MJ, Fredrickson J, Hixson KK, Kostandarithes H, Masselon C, Markillie LM, Moore RJ, Romine MF, Shen Y, Stritmatter E, Tolic N, Udseth HR, Venkateswaran A, Wong KK, Zhao R, Smith RD. Global analysis of the Deinococcus radiodurans proteome by using accurate mass tags. Proc Natl Acad Sci U S A. 2002;99:11049–11054. [PubMed] 36. Klein C, Aivaliotis M, Olsen JV, Falb M, Besir H, Scheffer B, Bisle B, Tebbe A, Konstantinidis K, Siedler F, Pfeiffer F, Mann M, Oesterhelt D. The Low Molecular Weight Proteome of Halobacterium salinarum. J Proteome Res. 2007;6:1510–1518. [PubMed] 37. Kennedy SP, Ng WV, Salzberg SL, Hood L, DasSarma S. Understanding the Adaptation of Halobacterium Species NRC-1 to Its Extreme Environment through Computational Analysis of Its Genome Sequence. Genome Res. 2001;11:1641–1650. [PubMed] 38. Zhang H, Li XJ, Martin DB, Aebersold R. Identification and quantification of N-linked glycoproteins using hydrazide chemistry, stable isotope labeling and mass spectrometry. Nat Biotechnol. 2003;21:660–666. [PubMed] 39. Bitton DA, Okoniewski MJ, Connolly Y, Miller CJ. Exon level integration of proteomics and microarray data. BMC Bioinformatics. 2008;9:118. [PubMed] 40. Cox B, Kislinger T, Wigle DA, Kannan A, Brown K, Okubo T, Hogan B, Jurisica I, Frey B, Rossant J, Emili A. Integrated proteomic and transcriptomic profiling of mouse lung development and Nmyc target genes. Mol Syst Biol. 2007;3:109. [PubMed] 41. Flory MR, Lee H, Bonneau R, Mallick P, Serikawa K, Morris DR, Aebersold R. Quantitative proteomic analysis of the budding yeast cell cycle using acid-cleavable isotope-coded affinity tag reagents. Proteomics. 2006;6:6146–6157. [PubMed] 42. Washburn MP, Koller A, Oshiro G, Ulaszek RR, Plouffe D, Deciu C, Winzeler E, Yates JR., 3rd Protein pathway and complex clustering of correlated mRNA and protein expression analyses in Saccharomyces cerevisiae. Proc Natl Acad Sci U S A. 2003;100:3107–3112. [PubMed] 43. Schmidt MW, Houseman A, Ivanov AR, Wolf DA. Comparative proteomic and transcriptomic profiling of the fission yeast Schizosaccharomyces pombe. Mol Syst Biol. 2007;3:79. [PubMed] 44. Corbin RW, Paliy O, Yang F, Shabanowitz J, Platt M, Lyons CE, Jr., Root K, McAuliffe J, Jordan MI, Kustu S, Soupene E, Hunt DF. Toward a protein profile of Escherichia coli: comparison to its transcription profile. Proc Natl Acad Sci U S A. 2003;100:9232–9237. [PubMed] 45. Craig R, Cortens JP, Beavis RC. The use of proteotypic peptide libraries for protein identification. Rapid Commun Mass Spectrom. 2005;19:1844–1850. [PubMed] 46. Krokhin OV. Sequence-specific retention calculator. Algorithm for peptide retention prediction in ion-pair RP-HPLC: application to 300- and 100-A pore size C18 sorbents. Anal Chem. 2006;78:7785–7795. [PubMed] |
PubMed related articles
Your browsing activity is empty. Activity recording is turned off. |
|||||||||||||||||||||||
Genome Res. 2006 Jul; 16(7):841-54.
[Genome Res. 2006]Mol Cell Biol. 1999 Mar; 19(3):1720-30.
[Mol Cell Biol. 1999]Mol Syst Biol. 2006; 2():47.
[Mol Syst Biol. 2006]Nat Biotechnol. 2007 Jan; 25(1):125-31.
[Nat Biotechnol. 2007]Nucleic Acids Res. 2006 Jan 1; 34(Database issue):D655-8.
[Nucleic Acids Res. 2006]Nat Methods. 2005 Aug; 2(8):587-9.
[Nat Methods. 2005]Genome Biol. 2005; 6(1):R9.
[Genome Biol. 2005]Proteomics. 2005 Aug; 5(13):3497-500.
[Proteomics. 2005]Genome Biol. 2004; 5(2):R9.
[Genome Biol. 2004]Proc Natl Acad Sci U S A. 2000 Oct 24; 97(22):12176-81.
[Proc Natl Acad Sci U S A. 2000]Cell. 2007 Dec 28; 131(7):1354-65.
[Cell. 2007]Genome Res. 2006 Jul; 16(7):841-54.
[Genome Res. 2006]Mol Syst Biol. 2006; 2():47.
[Mol Syst Biol. 2006]Proc Natl Acad Sci U S A. 2002 Nov 12; 99(23):14913-8.
[Proc Natl Acad Sci U S A. 2002]Mol Syst Biol. 2006; 2():47.
[Mol Syst Biol. 2006]Proc Natl Acad Sci U S A. 2007 Mar 13; 104(11):4630-5.
[Proc Natl Acad Sci U S A. 2007]Genome Res. 2007 Oct; 17(10):1399-413.
[Genome Res. 2007]Genome Biol. 2004; 5(2):R9.
[Genome Biol. 2004]Nat Biotechnol. 2004 Nov; 22(11):1459-66.
[Nat Biotechnol. 2004]Proc Natl Acad Sci U S A. 2000 Oct 24; 97(22):12176-81.
[Proc Natl Acad Sci U S A. 2000]Mol Syst Biol. 2005; 1():2005.0017.
[Mol Syst Biol. 2005]Mol Syst Biol. 2006; 2():47.
[Mol Syst Biol. 2006]Genome Res. 2004 Jun; 14(6):1025-35.
[Genome Res. 2004]Genome Res. 2007 Oct; 17(10):1399-413.
[Genome Res. 2007]Proc Natl Acad Sci U S A. 2007 Mar 13; 104(11):4630-5.
[Proc Natl Acad Sci U S A. 2007]Proc Natl Acad Sci U S A. 2002 Nov 12; 99(23):14913-8.
[Proc Natl Acad Sci U S A. 2002]Mol Cell Proteomics. 2006 Jun; 5(6):987-97.
[Mol Cell Proteomics. 2006]Bioinformatics. 2007 Mar 15; 23(6):673-9.
[Bioinformatics. 2007]Nucleic Acids Res. 2006 Jan 1; 34(Database issue):D247-51.
[Nucleic Acids Res. 2006]Nucleic Acids Res. 2000 Jan 1; 28(1):33-6.
[Nucleic Acids Res. 2000]Proc Natl Acad Sci U S A. 2000 Oct 24; 97(22):12176-81.
[Proc Natl Acad Sci U S A. 2000]Genome Biol. 2004; 5(8):R52.
[Genome Biol. 2004]Mol Cell Proteomics. 2006 Jun; 5(6):987-97.
[Mol Cell Proteomics. 2006]Proc Natl Acad Sci U S A. 1980 Jul; 77(7):3821-5.
[Proc Natl Acad Sci U S A. 1980]Cell. 2007 Dec 28; 131(7):1354-65.
[Cell. 2007]Genome Biol. 2006; 7(5):R36.
[Genome Biol. 2006]BMC Bioinformatics. 2006 Jun 2; 7():280.
[BMC Bioinformatics. 2006]Mol Cell Proteomics. 2003 Aug; 2(8):506-24.
[Mol Cell Proteomics. 2003]Anal Biochem. 1993 Sep; 213(2):256-63.
[Anal Biochem. 1993]Proc Natl Acad Sci U S A. 2002 Aug 20; 99(17):11049-54.
[Proc Natl Acad Sci U S A. 2002]Nucleic Acids Res. 2006 Jan 1; 34(Database issue):D655-8.
[Nucleic Acids Res. 2006]Genome Biol. 2005; 6(1):R9.
[Genome Biol. 2005]Genome Biol. 2006; 7(11):R106.
[Genome Biol. 2006]J Proteome Res. 2007 Apr; 6(4):1510-8.
[J Proteome Res. 2007]Genome Res. 2001 Oct; 11(10):1641-50.
[Genome Res. 2001]Anal Biochem. 1993 Sep; 213(2):256-63.
[Anal Biochem. 1993]Mol Syst Biol. 2006; 2():47.
[Mol Syst Biol. 2006]Nat Biotechnol. 2003 Jun; 21(6):660-6.
[Nat Biotechnol. 2003]Mol Cell Biol. 1999 Mar; 19(3):1720-30.
[Mol Cell Biol. 1999]BMC Bioinformatics. 2008 Feb 25; 9():118.
[BMC Bioinformatics. 2008]Mol Syst Biol. 2007; 3():79.
[Mol Syst Biol. 2007]Genome Res. 2006 Jul; 16(7):841-54.
[Genome Res. 2006]Mol Syst Biol. 2007; 3():109.
[Mol Syst Biol. 2007]Proc Natl Acad Sci U S A. 2003 Aug 5; 100(16):9232-7.
[Proc Natl Acad Sci U S A. 2003]Mol Syst Biol. 2006; 2():47.
[Mol Syst Biol. 2006]Genome Res. 2007 Oct; 17(10):1399-413.
[Genome Res. 2007]Genome Res. 2004 Jun; 14(6):1025-35.
[Genome Res. 2004]Nat Biotechnol. 2007 Jan; 25(1):125-31.
[Nat Biotechnol. 2007]Rapid Commun Mass Spectrom. 2005; 19(13):1844-50.
[Rapid Commun Mass Spectrom. 2005]Nat Methods. 2005 Aug; 2(8):587-9.
[Nat Methods. 2005]Proc Natl Acad Sci U S A. 2002 Nov 12; 99(23):14913-8.
[Proc Natl Acad Sci U S A. 2002]Anal Chem. 2006 Nov 15; 78(22):7785-95.
[Anal Chem. 2006]