![]() | ![]() |
Formats:
|
||||||||||||||||||||
Copyright Titz et al. This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited. The Binary Protein Interactome of Treponema pallidum – The Syphilis Spirochete 1Institute of Genetics, Forschungszentrum Karlsruhe, Karlsruhe, Germany 2The Institute of Genomic Research (TIGR) and J Craig Venter Institute (JCVI), Rockville, Maryland, United States of America 3Department of Molecular Virology and Microbiology, Baylor College of Medicine, Houston, Texas, Houston, United States of America Neil Hall, Editor University of Liverpool, United Kingdom #Contributed equally. * E-mail: uetz/at/jcvi.org Conceived and designed the experiments: PU BT. Performed the experiments: SR BT RH. Analyzed the data: PU SR JG BT. Contributed reagents/materials/analysis tools: TP MM. Wrote the paper: PU BT. ¤Current address: Crump Institute for Molecular Imaging, University of California, Los Angeles, California, United States of America Received December 10, 2007; Accepted April 14, 2008. This article has been cited by other articles in PMC.Abstract Protein interaction networks shed light on the global organization of proteomes but can also place individual proteins into a functional context. If we know the function of bacterial proteins we will be able to understand how these species have adapted to diverse environments including many extreme habitats. Here we present the protein interaction network for the syphilis spirochete Treponema pallidum which encodes 1,039 proteins, 726 (or 70%) of which interact via 3,649 interactions as revealed by systematic yeast two-hybrid screens. A high-confidence subset of 991 interactions links 576 proteins. To derive further biological insights from our data, we constructed an integrated network of proteins involved in DNA metabolism. Combining our data with additional evidences, we provide improved annotations for at least 18 proteins (including TP0004, TP0050, and TP0183 which are suggested to be involved in DNA metabolism). We estimate that this “minimal” bacterium contains on the order of 3,000 protein interactions. Profiles of functional interconnections indicate that bacterial proteins interact more promiscuously than eukaryotic proteins, reflecting the non-compartmentalized structure of the bacterial cell. Using our high-confidence interactions, we also predict 417,329 homologous interactions (“interologs”) for 372 completely sequenced genomes and provide evidence that at least one third of them can be experimentally confirmed. Introduction Most bacterial genomes encode hundreds or even thousands of proteins of unknown function [1]. If we want to understand the biology of these organisms, we need to understand the role of their proteins. One way to unravel the molecular function of a protein is to identify interacting proteins [2]. Up to now, the protein networks of only three organisms have been comprehensively investigated. Systematic purification of protein complexes and their identification by mass spectrometry has recently been completed in both budding yeast and Escherichia coli [3]–[5]. However, it became clear that these studies recovered only a fraction of all complexes and interactions [6] and it is still unclear how many interactions take place in a cell since no organism has been sampled exhaustively. More important, for the majority of interactions it remains unclear what their biological significance is. Only recently, the first comprehensive bacterial yeast-two-hybrid (Y2H) interaction map was presented for C. jejuni [7]. Partial Y2H interaction maps have been published for human, fly, and worm [8] and for several bacteria including Helicobacter pylori [9], Synechocystis sp. [10] and Mesorhizobium loti [11]. Similar to purified complexes though, yeast two-hybrid data reveal only a fraction of all interactions with false negative rates estimated to be in the range of 50–90% [12]. Low coverage can only be overcome by applying multiple methods to the same organism [13] or studying homologous proteins in multiple organisms [14]. We have tested nearly all binary combinations among the proteins of Treponema pallidum, the causative agent of syphilis, using the yeast two-hybrid system. With 1.14 Mbp and 1,039 ORFs [15], T. pallidum has one of the smallest genomes of any bacterium with an extracellular life-style. Although syphilis is usually not a life-threatening disease, it still caused 12 million new infections as recently as 1999, mostly in developing countries [16]. Progress in understanding the Syphilis disease and the biology of T. pallidum is severely hampered because T. pallidum cannot be cultured continuously in vitro and is not susceptible to genetic manipulation. However, our functional genomics studies demonstrate that insights into the function of individual proteins and larger functional complexes can be gained even for a bacterium which is not approachable by direct experiments. T. pallidum is only remotely related to other bacteria but still shares a significant fraction of conserved genes with other species [15]. Hence, we expect a substantial number of interactions to predict homologous counterparts in more tractable experimental systems as well as in other pathogens. Given the significant false-positive rate in many Y2H screens it is necessary to verify these interactions by independent methods. In this study we have confirmed only 8 Y2H interactions for one simple reason: Treponema pallidum is not an experimentally tractable organism and thus it will remain difficult to investigate the biological relevance of these interactions. We suggest that interactions found in species such as T. pallidum be verified in more mainstream model organisms such as E. coli. We have previously shown the efficiency of such an approach for interactions among T. pallidum motility proteins by analyzing their homologous proteins and interactions in E. coli and Bacillus subtilis [14]. The aim of this study was to unravel the protein network of a single cell by means of the yeast two-hybrid system, evaluate its utility when compared to other experimental approaches and compare the resulting data to other genome-wide datasets. We conclude that the Y2H as used here may recover one quarter of all interactions and may require other methodologies or multi-species approaches to achieve a more complete coverage. Our dataset indicates for the first time that some operons can interact via their contained proteins and that bacterial cells exhibit more promiscuous interaction patterns than eukaryotic proteomes. We support the latter observation with data from yeast and speculate that this property is a consequence of the much less compartmentalized organization of prokaryotic cells when compared to eukaryotes. Results and Discussion Generation of a comprehensive binary protein-interaction map and quality control Yeast-two-hybrid screening for the T. pallidum proteome was conducted in a systematic array-based format as described previously [14], [17]. In particular, the array format ensures reproducibility and control for unspecific background activation. Of nearly 1,000,000 examined protein pairs, 3,684 tested positive in our yeast two-hybrid assays resulting in 3,649 distinct interactions (Figure 1A
We used two independent approaches to derive more reliable, “high-confidence” datasets from our raw two-hybrid data: first, a simple approach based on the number of times a certain protein is found as prey: preys which are found more than 50 times (which is an arbitrary threshold) are likely to be unspecific interactors and thus have been excluded from the “TPA 50” dataset. Second, we applied a more comprehensive logistic regression approach, similar to that used in the STRING database [18](high-confidence dataset, see methods for details). In the latter high-confidence network (“TPA HCI”), 576 proteins of T. pallidum are connected by 991 distinct interactions with an average of 3.4 interactions per protein. Based on our training dataset, the false positive rate of this set can be estimated to be 28% (see Supporting online information [SI] file [Discussion S1] for details). However, since there are no objective computational ways to unambiguously identify false positives or negatives in any interaction dataset further experimental verifications are required for better assessments. Table 1 and Figure 1B Comparison of datasets Up to now, only two comprehensive studies of protein interactions in bacteria have been published [5], [7]. In addition, a number of partial prokaryotic interaction studies have been presented, including Y2H maps [9]–[11] and another coAP/MS study for E. coli [19]. Surprisingly, only 26 T. pallidum interactions were shared with C. jejuni, only 23 interactions with E. coli, and only 5 with H. pylori (Table S1). While the small overlap seems to be surprisingly low, small overlaps between interaction datasets are commonly observed, and may be explained by the large phylogenetic distance between these species, the different methodologies applied, the considerable false negative rate, and the incomplete sampling of each interactome. Total number of interactions of a minimal bacterial proteome To estimate the overall false-negative rate of our Y2H screen, we made use of a comprehensive set of flagellar protein interactions, which we collected for a comprehensive study on bacterial motility [14]. In this study, a “gold standard” dataset of 59 motility interactions was used, of which 39 had homologous pairs in T. pallidum. Of these 39 pairs, only 9 (or 23%) were found in our dataset which would imply a false-negative rate of 77% (but see below). To estimate the false positive rate, we looked for ‘high-confidence’ interactions which were maximally separated in a network of protein families (StringDB experimental COG network - exp. score>0.15). Based on the overlap, we estimate the false positive rate of our high-confidence set to be 28%. Based on our high-confidence set with 991 interactions, we can predict a total number of approx. 3,100 interactions (total interactions = found interactions−false positives+false negatives) for T. pallidum with an average of ~6 interactions per protein.Large-scale interaction studies cover functional complexes only to a limited extend. Integration of several datasets is the first choice to increase the coverage as has been recently demonstrated by our group for bacterial motility where a combination of two-hybrid data from T. pallidum and C. jejuni reduced the false-negative rate from 77% and 87%, respectively, to a combined 67% [14]. We expect that further technical improvements and the addition of even more genomes may be able to reduce the false negative rate to below 50%. Mapping of the interactome onto the genome On the genome level, bacterial genes have long been known to be organized in functional groups such as operons or as co-conserved genomic islands [20]. Many structural features of interactomes have been revealed including the tight connection of functional protein complexes (e.g., [21]). We wondered, whether an interdependence of the genome and the interactome structure could be identified. To this end, we overlaid Y2H interactions and predicted gene associations [18] onto the circular T. pallidum chromosome (Figure 2
Functional class organisation The main difference between pro- and eukaryotes is their subcellular organization. We wondered whether this functional specialization is reflected in protein interaction networks. To investigate this, we grouped all proteins belonging to the same functional category (as defined by the STRING database [18]) and counted the links within these groups and between groups. Figure 3
The number of self-links, i.e. functional links on the diagonal of the matrix, can be assumed to give an indication of the functional organization in a dataset or a species. We noticed that the number of self-links is larger in eukaryotes than in prokaryotes: T. pallidum (4 links, 1,039 genes), C. jejuni (6 links, 1,654 genes), E. coli (7 links [the average between [5] and [19]], 4,289 genes), yeast (16 links [Y2H], 20 links [Gavin], 6,200 genes). One explanation for these differences could be the source of the data: coAP/MS approaches tend to favor stable complexes and proteins within the same complex are usually assigned to the same functional class. On the other hand, Y2H favors transient interactions [13] among proteins which may be more promiscuous and thus less-well defined functionally. An alternative explanation for the differences in functional linkage is an increase in functional complexity from bacteria to eukaryotes, with a fundamental difference in the functional organization between these cell types. A slightly higher number of self-links in the yeast coAP/MS dataset compared to the combined yeast Y2H dataset argues for the former explanation. Comparing the three datasets of bacterial origin with the yeast datasets (coAP/MS and Y2H), however, supports an alternative explanation: a higher level of functional organization is observed in the eukaryotic datasets. Thus, the well-known difference in structural organization of pro- and eukaryotes is also reflected on the protein-interaction level. Functional processes are well separated in eukaryotes, e.g., through differential compartments such as organelles, whereas most functional processes in prokaryotes co-exist in space and partly in time as exemplified by the synchronous execution of transcription and DNA replication. It remains to be seen whether these results can be generalized for more species when additional datasets for other prokaryotes and eukaryotes become available. An integrated view of DNA-metabolism related processes In addition to the protein network of bacterial motility [14], we here present an additional network of DNA metabolism for T. pallidum, which is solely based on high-throughput data and bioinformatical predictions (Figure 4A
Functions of unknown proteins In total, 433 proteins of T. pallidum (42% of the proteome) are still uncharacterized [33]. Thus, we expanded our interaction-based annotation from DNA metabolism to the whole dataset. Indeed, 649 out of the 991 interactions in the high-confidence set involved at least one uncharacterized protein. 493 of these interactions link an uncharacterized protein to a protein of known function. These protein-pairs can be used to derive improved annotations, e.g. by integrating datasets for specific functional groups such as DNA metabolism (Figure 4 Patterns of conserved interactions Out of 1,039 T. pallidum genes, 302 are Spirochete-specific and an additional set of 147 genes shows a “narrow” distribution and is conserved in less than 50% of the sequenced bacterial species. Interestingly, a majority of 758 (76%) T. pallidum interactions (HCI) involve at least one of the 449 “narrowly” distributed proteins. Based on this observation, we asked how the overall distribution for interacting proteins looks like. For this, we constructed a phylogenetic profile for interacting protein families (“iCOGs”, Figure 5 = 1.1×10−20), which explains the observed pattern by the distribution among motile bacteria. Cluster #6 shows the highest conservation and is enriched in translation-related functions in archaea and eukaryotes (cluster #6, 5 fold enrichment, p = 3.9×10−7). On the contrary, the large cluster #2 contains mainly Treponema or Spirochete specific proteins, which interact with broadly conserved proteins, and is enriched for proteins of unknown or general function (3 fold, p = 0.003). For Spirochete-specific proteins, we also find a general tendency to interact with well-conserved proteins, which are conserved in 60%–80% (z-score vs. random of 2.0) or in 80%–100% (z-score of 1.1) of the sequenced species. Despite the large number of Spirochete-specific proteins, their overall tendency to interact with well-conserved proteins supports the notion that specific properties of spirochetes (e.g., their endoflagella) have not been invented independently in evolution but rather derived by modification of existing structures or by recruiting spirochete-specific proteins.
Prediction of protein interactions in other species Interactions in Treponema are likely to be conserved in other species. In fact, we have tested 174 motility-related interactions among Campylobacter jejuni proteins predicted from our dataset [14]. Using the criteria of Parrish et al. [7], 49 of those were tested positive with high confidence. Interestingly, most of them were not found in the study by Parrish et al. because their screens used pooled clones while our retests used individual clones. Pooling often results in lost interactions for poorly understood reasons. In any case, the comparison of Treponema and Campylobacter data confirms other studies where interactions predicted from yeast were also found in worm [35] or where metazoan interactions successfully predicted homologous interactions in yeast [36]. As a basis for further functional analysis and comparative interactomics, we predict 417,329 interactions in 372 other genomes (Table S4, Figure 6
Conclusions Here we presented a genome-wide protein interaction map for Treponema pallidum, the causative agent of Syphilis. The genome of T. pallidum is one of the smallest of all bacteria not living within host cells, and most importantly, T. pallidum is not approachable by many experimental methods, since it cannot be cultured continuously in vitro. From its interaction map, we obtain insights into the connection between genomes and interactomes, we see that the different structural organization of pro- and eukaryotes is already reflected on the interaction level, and demonstrate the usefulness of our interaction data to reveal biological insights into biological processes (DNA metabolism) as well as into the function of individual proteins (e.g., HrpA). We learned that Spirochete and Treponema-specific proteins interact with ubiquitously conserved proteins and potentially modulate their functions to achieve Spirochete-specific properties. Finally, based on our high-confidence interaction data 417,329 interactions for 372 species can be predicted. The biological relevance of the interactions found in this study remains to be shown in model organisms that are more tractable experimentally. Nevertheless, we believe in the utility of data obtained in organisms such as T. pallidum as they can show us which proteins and interactions are conserved in other species and thus help us to define minimal or essential sets of protein activities. Outlook Protein interaction mapping is where genome sequencing was about 10 years ago. Many more interaction datasets are required to distinguish between conserved and non-conserved (but biologically relevant) interactions and separate them from false positives and false negatives. Such a classification will make it much easier to evaluate the biological significance of individual interactions, either by suggesting additional experiments or by facilitating computational analysis such as protein docking. Materials and Methods Description of datasets and a more extensive description of the applied methods can be found as supporting information (Discussion S1). The interactions of this study have been submitted to the IntAct database (http://www.ebi.ac.uk/intact/, accession number EBI-1581350) and to the IMEx consortium (http://imex.sourceforge.net) through the MPIDB database (http://www.jcvi.org/mpidb, identifier IM-9152). Cloning of baits and preys, Y2H screening Selection of high-confidence datasets and logistic regression model for quality scoring For the “TPA 50” dataset, preys that were found in more than 50 screens were removed as large numbers indicate unspecific interactions [14]. Based on a binary logistic regression model [18], we assigned probability scores to all interactions using a training set of positive (interologs in DIP and IntAct) and negative Treponema interactions (see Discussion S1 for more details on the training data and scoring procedure). Next, we generated a set of ‘highly reliable interactions’ (TPA HCI) retaining only those with a probability > = 0.5. At this probability cutoff, 80% of interactions in the positive training set are classified correctly (true positives), while 28% of negative interactions were misclassified (false positives).Links between genomic locations (Figure 2 The number of interactions or bioinformatical associations between clusters of five neighboring genes was counted for the real network and for randomized versions of this network. Overrepresentation of a link compared to 1000 randomized networks was assessed by calculating a Z-score, , with the number of linking interactions n, its average in 1,000 randomized networks nrand , and its standard deviation σrand. Links between gene clusters with at least three connecting interactions/associations and a z-score (compared to random networks) of at least 2 are shown in Figure 2Associations of functional classes (Figure 3 Association values were calculated for the functional classification scheme of the String database [18]. First, the functional class association index (fCAI) was computed for each dataset and each functional class pair. The fCAI represents a log-odds-ratio, which compares the odds to find the number of linking interactions in the experimental set to the odds in a random model (see discussion S1). Based on a z-statistic, a raw p-value was derived for each functional class link and used for the visualization of functional links in the association matrix. Extended view of T. pallidum's DNA metabolism (Figure 4 A set of T. pallidum proteins involved in DNA metabolism was extracted from several databases (Table S3). Several interaction sets were integrated: high-confidence T. pallidum Y2H set (TPA HCI), high-confidence C. jejuni Y2H set [7], two socio-affinity-index (SAI) filtered E. coli coAP/MS sets ([19] and [5]), a B. subtilis Y2H set [38], a H. pylori Y2H set [9], and bioinformatically predicted interactions [18]. E. coli proteins localized to the bacterial nucleoid were taken from the GenoBase database (http://ecoli.naist.jp/GB6/search.jsp). The transfer of interactions between species (interologs) was based on orthology relationships from the MBGD database [39]. All T. pallidum interactions and interologs linking two DNA metabolism related proteins were selected. In addition, interactions or interologs of DNA metabolism related proteins, which were supported by bioinformatical predictions [18] (combined score>400) or by at least two experimental datasets, were chosen. Finally, associated proteins, which were predicted to be involved in DNA metabolism [18] (combined score>800 for DNA metabolism related proteins), localized to the nucleoid in E. coli, or had an additional evidence associated with it (Table 2) were included. Network visualization was done with the Cytoscape software [40]. A number of these selected interactions were re-tested by co-immunoprecipitation as described in [14] (Figure 4B Conservation Classes and iCOGs (Figure 5 A matrix showing the conservation of iCOGs (interacting clusters of orthologous groups) in the “TPA HCI” data set was created. For each interaction in the interaction data set, an iCOG was defined, if both interacting proteins were part of a COG (cluster of orthologous group–meaning that they could be grouped with proteins from other species into an orthologous protein family). Each element of the matrix, contains a conservation value for a specific iCOG in a specific genome (species). The conservation value (cv) indicates whether both COGs of the iCOG are conserved (cv = 1) or absent (cv = 0) in the given species or whether only one or the other COG is conserved (cv = 0.5). Average linkage clustering of the matrix in iCOG direction was done with the R-package using Euclidean distances.Significant enrichment of functional classes (taken from the STRING database) in the conservation clusters were identified employing Fisher's exact test in conjunction with a Bonferroni correction for multiple testing (p<0.01) using the R-package. Table S1 All protein-protein interactions of Treponema pallidum found in this study. (1.82 MB XLS) Click here for additional data file.(1.7M, xls) Table S2 Additional genomic links as shown in Figure 2 (0.11 MB XLS) Click here for additional data file.(103K, xls) Table S3 All proteins involved in DNA metabolism as well as their interactions as shown in Figure 4 (0.07 MB XLS) Click here for additional data file.(73K, xls) Table S4 Summary table for the predicted interactions showing all species, their phylogenetic relationships, and the number of predicted interactions for each species. (2.00 MB XLS) Click here for additional data file.(1.9M, xls) Discussion S1 More detailed discussion of results and additional details on the methodology used in this study. (0.42 MB DOC) Click here for additional data file.(410K, doc) Data S1 Predicted protein-protein interactions based on our Treponema pallidum data; zip archive containing 372 files with one file per species. (6.09 MB ZIP) Click here for additional data file.(5.8M, zip) Figure S1 Interacting clusters of orthologous groups (“iCOG”) show phylogenetically conserved interaction patterns. Each row of the shown profile corresponds to a species and each column corresponds to a pair of interacting protein families (i.e. iCOG), for which an interaction was found in the high-confidence T. pallidum data set. The protein families were defined based on the “cluster of orthologous genes” approach (COG) (see methods). With this, the profile shows for each interaction of the T. pallidum data set whether both interacting proteins, only one interacting protein or none of the interacting proteins are conserved in a given species (given row). For each species from the shown taxonomy (y-axis) and each iCOG, a conservation value is shown in the matrix. This conservation value indicates whether both COGs are conserved/absent in a given species or whether only one or the other COG is conserved (see left upper corner for color key). Overall, three distinct conservation regions are visible in the clustered matrix: #1, #2, and region #3-#6, which we subdivided somewhat arbitrarily into individual clusters #3-#6 with increasing conservation from left to right (note branches on tree above). (0.14 MB PDF) Click here for additional data file.(136K, pdf) Acknowledgments We thank multiple anonymous reviewers for comments on the manuscript. Tanja Kuhn, Cathrin Klumpp, Katrin Wohlbold, and Sindhu Thomas are acknowledged for technical assistance with the interaction screens. Footnotes Competing Interests: The authors have declared that no competing interests exist. Funding: PU was funded by the DFG (grant Ue 50/4-1) and by the J Craig Venter Institute. Tim Palzkill was funded by NIH grant NIH AI45842. Although the project has been proposed to the funders, they have taken no subsequent role in the design, conduct, or publication of the study. References 1. Galperin MY, Koonin EV. ‘Conserved hypothetical’ proteins: prioritization of targets for experimental study. Nucleic Acids Res. 2004;32:5452–5463. [PubMed] 2. Schwikowski B, Uetz P, Fields S. A network of protein-protein interactions in yeast. Nat Biotechnol. 2000;18:1257–1261. [PubMed] 3. Gavin AC, Aloy P, Grandi P, Krause R, Boesche M, et al. Proteome survey reveals modularity of the yeast cell machinery. Nature. 2006;440:631–636. [PubMed] 4. Krogan NJ, Cagney G, Yu H, Zhong G, Guo X, et al. Global landscape of protein complexes in the yeast Saccharomyces cerevisiae. Nature. 2006;440:637–643. [PubMed] 5. Arifuzzaman M, Maeda M, Itoh A, Nishikata K, Takita C, et al. Large-scale identification of protein-protein interaction of Escherichia coli K-12. Genome Res. 2006;16:686–691. [PubMed] 6. Goll J, Uetz P. The elusive yeast interactome. Genome Biol. 2006;7:223. [PubMed] 7. Parrish JR, Yu J, Liu G, Hines JA, Chan JE, et al. A proteome-wide protein interaction map for Campylobacter jejuni. Genome Biol. 2007;8:R130. [PubMed] 8. Gandhi TKB. Analysis of the human protein interactome and comparison with yeast, worm and fly interaction datasets. Nature Genetics. 2006;38:285–293. [PubMed] 9. Rain JC, Selig L, De Reuse H, Battaglia V, Reverdy C, et al. The protein-protein interaction map of Helicobacter pylori. Nature. 2001;409:211–215. [PubMed] 10. Sato S, Shimoda Y, Muraki A, Kohara M, Nakamura Y, et al. A Large-scale Protein–protein Interaction Analysis in Synechocystis sp. PCC6803. DNA Research. 2007;14:207–216. [PubMed] 11. Shimoda Y, Shinpo S, Kohara M, Nakamura Y, Tabata S, et al. A Large Scale Analysis of Protein–Protein Interactions in the Nitrogen-fixing Bacterium Mesorhizobium loti. DNA Research. 2008;15:13–23. [PubMed] 12. Edwards AM, Kus B, Jansen R, Greenbaum D, Greenblatt J, et al. Bridging structural biology and genomics: assessing protein interaction data with known complexes. Trends Genet. 2002;18:529–536. [PubMed] 13. von Mering C, Krause R, Snel B, Cornell M, Oliver SG, et al. Comparative assessment of large-scale data sets of protein-protein interactions. Nature. 2002;417:399–403. [PubMed] 14. Rajagopala SV, Titz B, Goll J, Parrish JR, Wohlbold K, et al. The protein network of bacterial motility. Molecular Systems Biology. 2007;3:128. [PubMed] 15. Fraser CM, Norris SJ, Weinstock GM, White O, Sutton GG, et al. Complete genome sequence of Treponema pallidum, the syphilis spirochete. Science. 1998;281:375–388. [PubMed] 16. Peeling RW, Mabey DC. Syphilis. Nat Rev Microbiol. 2004;2:448–449. [PubMed] 17. Uetz P, Dong YA, Zeretzke C, Atzler C, Baiker A, et al. Herpesviral protein networks and their interaction with the human proteome. Science. 2006;311:239–242. [PubMed] 18. von Mering C, Jensen LJ, Kuhn M, Chaffron S, Doerks T, et al. STRING 7–recent developments in the integration and prediction of protein interactions. Nucleic Acids Res. 2007;35:D358–362. [PubMed] 19. Butland G, Peregrin-Alvarez JM, Li J, Yang W, Yang X, et al. Interaction network containing conserved and essential protein complexes in Escherichia coli. Nature. 2005;433:531–537. [PubMed] 20. Gaillard M, Vallaeys T, Vorholter FJ, Minoia M, Werlen C, et al. The clc element of Pseudomonas sp. strain B13, a genomic island with various catabolic properties. J Bacteriol. 2006;188:1999–2013. [PubMed] 21. Spirin V, Mirny LA. Protein complexes and functional modules in molecular networks. Proc Natl Acad Sci U S A. 2003;100:12123–12128. [PubMed] 22. Deaconescu AM, Savery N, Darst SA. The bacterial transcription repair coupling factor. Curr Opin Struct Biol. 2007;17:96–102. [PubMed] 23. Trautinger BW, Jaktaji RP, Rusakova E, Lloyd RG. RNA polymerase modulators and DNA repair activities resolve conflicts between DNA replication and transcription. Mol Cell. 2005;19:247–258. [PubMed] 24. Boubrik F, Rouviere-Yaniv J. Increased sensitivity to gamma irradiation in bacteria lacking protein HU. Proc Natl Acad Sci U S A. 1995;92:3958–3962. [PubMed] 25. Harmon FG, Kowalczykowski SC. RecQ helicase, in concert with RecA and SSB proteins, initiates and disrupts DNA recombination. Genes Dev. 1998;12:1134–1144. [PubMed] 26. Harmon FG, Kowalczykowski SC. Biochemical characterization of the DNA helicase activity of the escherichia coli RecQ helicase. J Biol Chem. 2001;276:232–243. [PubMed] 27. Huisman O, Faelen M, Girard D, Jaffe A, Toussaint A, et al. Multiple defects in Escherichia coli mutants lacking HU protein. J Bacteriol. 1989;171:3704–3712. [PubMed] 28. Sikder D, Unniraman S, Bhaduri T, Nagaraja V. Functional cooperation between topoisomerase I and single strand DNA-binding protein. J Mol Biol. 2001;306:669–679. [PubMed] 29. Umezu K, Nakayama H. RecQ DNA helicase of Escherichia coli. Characterization of the helix-unwinding activity with emphasis on the effect of single-stranded DNA-binding protein. J Mol Biol. 1993;230:1145–1150. [PubMed] 30. Takahashi S, Hours C, Chu A, Denhardt DT. The rep mutation. VI. Purification and properties of the Escherichia coli rep protein, DNA helicase III. Can J Biochem. 1979;57:855–866. [PubMed] 31. Heller RC, Marians KJ. The disposition of nascent strands at stalled replication forks dictates the pathway of replisome loading during restart. Mol Cell. 2005;17:733–743. [PubMed] 32. Connelly JC, Kirkham LA, Leach DR. The SbcCD nuclease of Escherichia coli is a structural maintenance of chromosomes (SMC) family protein that cleaves hairpin DNA. Proc Natl Acad Sci U S A. 1998;95:7969–7974. [PubMed] 33. Kanehisa M, Goto S, Hattori M, Aoki-Kinoshita KF, Itoh M, et al. From genomics to chemical genomics: new developments in KEGG. Nucleic Acids Res. 2006;34:D354–357. [PubMed] 34. Titz B, Rajagopala SV, Ester C, Hauser R, Uetz P. A novel conserved assembly factor of the bacterial flagellum. J Bacteriol. 2006;188:7700–7706. [PubMed] 35. Matthews LR, Vaglio P, Reboul J, Ge H, Davis BP, et al. Identification of potential interaction networks using sequence-based searches for conserved protein-protein interactions or “interologs”. Genome Res. 2001;11:2120–2126. [PubMed] 36. Sharan R, Suthram S, Kelley RM, Kuhn T, McCuine S, et al. Conserved patterns of protein interaction in multiple species. Proc Natl Acad Sci U S A. 2005;102:1974–1979. [PubMed] 37. McKevitt M, Patel K, Smajs D, Marsh M, McLoughlin M, et al. Systematic cloning of Treponema pallidum open reading frames for protein expression and antigen discovery. Genome Res. 2003;13:1665–1674. [PubMed] 38. Noirot-Gros MF, Dervyn E, Wu LJ, Mervelet P, Errington J, et al. An expanded view of bacterial DNA replication. Proc Natl Acad Sci U S A. 2002;99:8342–8347. [PubMed] 39. Uchiyama I. MBGD: a platform for microbial comparative genomics based on the automated construction of orthologous groups. Nucleic Acids Res. 2007;35:D343–346. [PubMed] 40. Shannon P, Markiel A, Ozier O, Baliga NS, Wang JT, et al. Cytoscape: a software environment for integrated models of biomolecular interaction networks. Genome Res. 2003;13:2498–2504. [PubMed] 41. Deka RK, Brautigam CA, Yang XF, Blevins JS, Machius M, et al. The PnrA (Tp0319; TmpC) lipoprotein represents a new family of bacterial purine nucleoside receptor encoded within an ATP-binding cassette (ABC)-like operon in Treponema pallidum. J Biol Chem. 2006;281:8072–8081. [PubMed] 42. Noens EE, Mersinias V, Willemse J, Traag BA, Laing E, et al. Loss of the controlled localization of growth stage-specific cell-wall synthesis pleiotropically affects developmental gene expression in an ssgA mutant of Streptomyces coelicolor. Mol Microbiol. 2007;64:1244–1259. [PubMed] 43. Yoshida Y, Nakano Y, Nezu T, Yamashita Y, Koga T. A novel NDP-6-deoxyhexosyl-4-ulose reductase in the pathway for the synthesis of thymidine diphosphate-D-fucose. J Biol Chem. 1999;274:16933–16939. [PubMed] |
PubMed related articles
Your browsing activity is empty. Activity recording is turned off. |
|||||||||||||||||||
Nucleic Acids Res. 2004; 32(18):5452-63.
[Nucleic Acids Res. 2004]Nat Biotechnol. 2000 Dec; 18(12):1257-61.
[Nat Biotechnol. 2000]Nature. 2006 Mar 30; 440(7084):631-6.
[Nature. 2006]Genome Res. 2006 May; 16(5):686-91.
[Genome Res. 2006]Genome Biol. 2006; 7(6):223.
[Genome Biol. 2006]Genome Biol. 2007; 8(7):R130.
[Genome Biol. 2007]Nat Genet. 2006 Mar; 38(3):285-93.
[Nat Genet. 2006]Nature. 2001 Jan 11; 409(6817):211-5.
[Nature. 2001]DNA Res. 2007 Oct 31; 14(5):207-16.
[DNA Res. 2007]DNA Res. 2008 Feb 29; 15(1):13-23.
[DNA Res. 2008]Science. 1998 Jul 17; 281(5375):375-88.
[Science. 1998]Nat Rev Microbiol. 2004 Jun; 2(6):448-9.
[Nat Rev Microbiol. 2004]Mol Syst Biol. 2007; 3():128.
[Mol Syst Biol. 2007]Mol Syst Biol. 2007; 3():128.
[Mol Syst Biol. 2007]Science. 2006 Jan 13; 311(5758):239-42.
[Science. 2006]Nucleic Acids Res. 2007 Jan; 35(Database issue):D358-62.
[Nucleic Acids Res. 2007]Genome Res. 2006 May; 16(5):686-91.
[Genome Res. 2006]Genome Biol. 2007; 8(7):R130.
[Genome Biol. 2007]Nature. 2001 Jan 11; 409(6817):211-5.
[Nature. 2001]DNA Res. 2008 Feb 29; 15(1):13-23.
[DNA Res. 2008]Nature. 2005 Feb 3; 433(7025):531-7.
[Nature. 2005]Mol Syst Biol. 2007; 3():128.
[Mol Syst Biol. 2007]Mol Syst Biol. 2007; 3():128.
[Mol Syst Biol. 2007]J Bacteriol. 2006 Mar; 188(5):1999-2013.
[J Bacteriol. 2006]Proc Natl Acad Sci U S A. 2003 Oct 14; 100(21):12123-8.
[Proc Natl Acad Sci U S A. 2003]Nucleic Acids Res. 2007 Jan; 35(Database issue):D358-62.
[Nucleic Acids Res. 2007]Mol Syst Biol. 2007; 3():128.
[Mol Syst Biol. 2007]Nucleic Acids Res. 2007 Jan; 35(Database issue):D358-62.
[Nucleic Acids Res. 2007]Mol Syst Biol. 2007; 3():128.
[Mol Syst Biol. 2007]Nucleic Acids Res. 2007 Jan; 35(Database issue):D358-62.
[Nucleic Acids Res. 2007]Genome Res. 2006 May; 16(5):686-91.
[Genome Res. 2006]Nature. 2006 Mar 30; 440(7084):631-6.
[Nature. 2006]Nature. 2006 Mar 30; 440(7084):637-43.
[Nature. 2006]Genome Res. 2006 May; 16(5):686-91.
[Genome Res. 2006]Nature. 2006 Mar 30; 440(7084):631-6.
[Nature. 2006]Genome Res. 2006 May; 16(5):686-91.
[Genome Res. 2006]Nature. 2005 Feb 3; 433(7025):531-7.
[Nature. 2005]Nature. 2002 May 23; 417(6887):399-403.
[Nature. 2002]Mol Syst Biol. 2007; 3():128.
[Mol Syst Biol. 2007]Curr Opin Struct Biol. 2007 Feb; 17(1):96-102.
[Curr Opin Struct Biol. 2007]Mol Cell. 2005 Jul 22; 19(2):247-58.
[Mol Cell. 2005]Proc Natl Acad Sci U S A. 1995 Apr 25; 92(9):3958-62.
[Proc Natl Acad Sci U S A. 1995]J Mol Biol. 1993 Apr 20; 230(4):1145-50.
[J Mol Biol. 1993]Nucleic Acids Res. 2006 Jan 1; 34(Database issue):D354-7.
[Nucleic Acids Res. 2006]J Biol Chem. 2006 Mar 24; 281(12):8072-81.
[J Biol Chem. 2006]Mol Microbiol. 2007 Jun; 64(5):1244-59.
[Mol Microbiol. 2007]J Biol Chem. 1999 Jun 11; 274(24):16933-9.
[J Biol Chem. 1999]J Bacteriol. 2006 Nov; 188(21):7700-6.
[J Bacteriol. 2006]Nucleic Acids Res. 2006 Jan 1; 34(Database issue):D354-7.
[Nucleic Acids Res. 2006]Mol Syst Biol. 2007; 3():128.
[Mol Syst Biol. 2007]J Bacteriol. 2006 Nov; 188(21):7700-6.
[J Bacteriol. 2006]Mol Syst Biol. 2007; 3():128.
[Mol Syst Biol. 2007]Genome Biol. 2007; 8(7):R130.
[Genome Biol. 2007]Genome Res. 2001 Dec; 11(12):2120-6.
[Genome Res. 2001]Proc Natl Acad Sci U S A. 2005 Feb 8; 102(6):1974-9.
[Proc Natl Acad Sci U S A. 2005]Genome Res. 2003 Jul; 13(7):1665-74.
[Genome Res. 2003]Mol Syst Biol. 2007; 3():128.
[Mol Syst Biol. 2007]Mol Syst Biol. 2007; 3():128.
[Mol Syst Biol. 2007]Nucleic Acids Res. 2007 Jan; 35(Database issue):D358-62.
[Nucleic Acids Res. 2007]Nucleic Acids Res. 2007 Jan; 35(Database issue):D358-62.
[Nucleic Acids Res. 2007]Genome Biol. 2007; 8(7):R130.
[Genome Biol. 2007]Nature. 2005 Feb 3; 433(7025):531-7.
[Nature. 2005]Genome Res. 2006 May; 16(5):686-91.
[Genome Res. 2006]Proc Natl Acad Sci U S A. 2002 Jun 11; 99(12):8342-7.
[Proc Natl Acad Sci U S A. 2002]Nature. 2001 Jan 11; 409(6817):211-5.
[Nature. 2001]