![]() | ![]() |
Formats:
|
||||||||||||||
Copyright © 2006 by The National Academy of Sciences of the USA Biochemistry Colloquium Papers Characterization and prediction of protein–protein interactions within and between complexes Department of Molecular Genetics and Biotechnology, Faculty of Medicine, Hebrew University, Jerusalem 91120, Israel †To whom correspondence should be addressed. E-mail: hanah/at/md.huji.ac.il Edited by Samuel Karlin, Stanford University, Stanford, CA, and accepted August 3, 2006 *Present address: UCLA–DOE Institute for Genomics and Proteomics, University of California, Los Angeles, CA 90095. Author contributions: E.S. and H.M. designed research; E.S. performed research; Y.A. contributed analytic tools; E.S., Y.A., and H.M. analyzed data; and E.S. and H.M. wrote the paper. Received August 4, 2005. This article has been cited by other articles in PMC.Abstract Databases of experimentally determined protein interactions provide information on binary interactions and on involvement in multiprotein complexes. These data are valuable for understanding the general properties of the interaction between proteins as well as for the development of prediction schemes for unknown interactions. Here we analyze experimentally determined protein interactions by measuring various sequence, genomic, transcriptomic, and proteomic attributes of each interacting pair in the yeast Saccharomyces cerevisiae. We find that dividing the data into two groups, one that includes binary interactions within protein complexes (stable) and another that includes binary interactions that are not within complexes (transient), enables better characterization of the interactions by the different attributes and improves the prediction of new interactions. This analysis revealed that most attributes were more indicative in the set of intracomplex interactions. Using this data set for training, we integrated the different attributes by logistic regression and developed a predictive scheme that distinguishes between interacting and noninteracting protein pairs. Analysis of the logistic-regression model showed that one of the strongest contributors to the discrimination between interacting and noninteracting pairs is the presence of distinct pairs of domain signatures that were suggested previously to characterize interacting proteins. The predictive algorithm succeeds in identifying both intracomplex and other interactions (possibly the more stable ones), and its correct identification rate is 2-fold higher than that of large-scale yeast two-hybrid experiments. Keywords: domain signature, genomewide analysis, stable interaction, transient interaction, logistic regression Protein interactions are central to almost all biological processes. Large-scale screens of protein–protein interactions (PPIs) in several organisms (1–4), together with PPI data from small-scale studies, have generated a large volume of experimental data that provides a partial picture of the cellular PPI networks. Previous studies that analyzed PPIs characterized their sequence domains and cellular properties (5–12) and provided insight into their evolution and regulation (13–23). At present, the richest information on PPIs is available for the yeast Saccharomyces cerevisiae, including documentation on experimentally determined binary interactions (1, 2, 24–26) as well as participation of proteins in the same complex (24, 27, 28). Intersection of these two data sources divides the binary interactions into those that occur within larger protein complexes [intracomplex interactions (ICIs)] and those that were not documented as belonging to complexes [non-intracomplex interactions (NICIs)]. The latter include interactions between proteins in different complexes, interactions between a noncomplexed protein and a protein in a complex, and interactions between two noncomplexed proteins (Fig. 1
Results Attribute Assignment. We selected nine attributes that may characterize pairs of physically interacting proteins (Table 1). Values for those nine attributes were assigned for each of all possible (≈6,0002/2) ≈1.8 × 107 pairs in the yeast proteome. Note that the attributes were defined at the pair level and not at the single-protein level. As Table 1 shows, the information about most attributes is incomplete, including both the computationally derived attributes (attributes 1–5) and the experimentally based attributes (attributes 6–9). Also, assignment of the computationally derived attributes depends on the stringency of the criteria used in the analysis. For example, following our criteria for fusion events (Tables 2 and 3, which are published as supporting information on the PNAS web site), this attribute was assigned to a very small fraction of all possible protein pairs (consistent with ref. 19). It is possible that with less stringent criteria, more pairs could be assigned. Using the stringent criteria, however, guarantees a higher quality of the data.
Comparison of Attributes in the Different Data Sets. As described in Materials and Methods and in Fig. 2
Prediction of Interacting Pairs. Although each attribute distinguishes to some extent between interacting and noninteracting pairs, the integration of all attributes, appropriately weighted, is expected to provide better discrimination, and thus it could potentially be used for the development of a predictive algorithm. To incorporate the nine attributes into a predictive scheme, we used logistic regression (LR). Similar to linear regression, LR provides the best fitting function between a dependent variable and a set of independent variables. LR provides a function that incorporates the relative contributions of the independent variables (here, attributes) to compute the probability of an event (here, interaction between two proteins) (ref. 29; see also Supporting Methods, which is published as supporting information on the PNAS web site). It is possible then to choose a probability threshold above which a pair of proteins is determined as putatively interacting. Our study shows that there are differences in the attribute distributions between ICIs and NICIs, suggesting that it would be more appropriate to treat them separately rather than unifying them as one data set. Therefore, we turned to developing two separate LR models for each type of interaction. However, the low specificity values of the NICI attributes (fractions of interacting pairs having an attribute among all pairs with that attribute), caused by the huge size of the data set of noninteracting pairs following our 1:600 rate estimation (Fig. 2 The LR was carried out on the ICI and RICP data sets with 5-fold cross-validation (choosing 80% of the pairs for training and 20% for testing), showing consistent results and consistent attribute coefficients in all five tests (see Table 4, which is published as supporting information on the PNAS web site). Based on these results, the LR analysis was applied to the whole data sets of ICIs and RICPs, resulting in one set of attribute coefficients to be used in further predictions (see Table 5, which is published as supporting information on the PNAS web site). The overall model fit as estimated by the likelihood ratio was highly statistically significant (P < 0.0001). The coefficients of the domain–domain signatures, colocalization, and shared cellular process deviated significantly from zero (P < 0.0001). Interestingly, recently Lu et al. (30) also found that four strong features are sufficient for satisfactory predictions of co-complexed proteins, two of which regarded participation in a common cellular process. Fig. 4
Our ability to identify interactions between free proteins and complexes by using the LR-based scheme that was trained on ICIs suggests that these interactions may have been missed from the complexes. Indeed, for some of these NICI interactions we managed to find support in the literature for their involvement in complexes. For space limitations, only one such example is described here. This example regards the heme activator protein HapI and the proteins that regulate its activity. It is known that the proteins Hsp82/Hsc82, Ydj1, and Sro9 are associated with HapI in the absence of heme (31, 32), and it was also shown that the proteins Ssa1 and Ssa2 play a major role in HapI repression (32). In our data, Ssa1 and Ssa2 are not documented as included in the complex with the other proteins, but our predictive scheme still succeeds in identifying the (known) interactions Ssa1–Hsp82 and Ssa2–Hsp82, suggesting that Ssa1 and Ssa2 participate in the complex. Likewise, we identify interactions between complexes, such as the interactions between the coat protein complex COPII with the soluble N-ethylmaleimide-sensitive fusion attachment receptor complexes t-SNARE and v-SNARE, which play a role in cellular vesicle transport (33). When the algorithm was applied to all possible ≈1.8 × 107 pairs in the yeast proteome (excluding the known interacting pairs) we predicted 29,181 putative interactions (with a cutoff of 0.49, which yields ≈50% specificity in prediction as in Y2H). Of these putative interactions, 1,946 are between subunits of the same complex, i.e., predicted ICIs, and the rest are predicted NICIs. Among these predicted NICIs, 87% involved at least one complexed protein, and 13% involved two free proteins, probably regarding relatively stable interactions. Among the predicted interactions, there are interactions that were revealed by large-scale screening methods that are considered as less reliable, and our study supports them (see Table 6, which is published as supporting information on the PNAS web site). One such example regards the proteins Lst8 and Sec13, which belong to different complexes and play a role in protein transport. It was suggested that these two proteins may function as components of a post-Golgi secretory vesicle coat (34). The relationship between Sec13 and Lst8, which was determined previously only as a genetic interaction (34), was strongly supported by our analysis, and it was predicted as a PPI with a probability of 0.85. The pair Sec13–Lst8 showed six of the nine attributes: domain–domain signature, fold combination, colocalization, shared cellular process, fusion event, and consistent phylogenetic profiles. Remarkably, both Sec13 and Lst8 contain the WD-40 repeat that consists of five to eight tandem repeats, each containing a central Trp–Asp motif. Because the WD-40 repeat is known to be involved in PPI (35), it is possible that it also mediates the interaction between Sec13 and Lst8. Discussion The intersection of the information on pairwise interactions in the yeast S. cerevisiae with that on the involvement in a protein complex revealed an intriguing picture. In a data set of reliable interactions, 42% of the known pairwise interactions resided within multiprotein complexes. Among the rest of the interactions, 96% involved at least one pair-mate that participates in a protein complex (Fig. 1 Characterization of Interactions. To characterize the various types of interactions, we examined their association with nine attributes of protein pairs (Table 1 and Supporting Methods). These attributes can be divided into two major classes: one class includes two attributes that relate directly to the physical interactions, the domain–domain signatures and fold combinations. Both attributes were learned from the database of experimentally determined PPIs, and they seem relevant to the actual physical interactions between proteins (it is widely acknowledged that there are special domains that participate in the physical interaction, and it is conceivable that there are preferred folds that are more suitable for interaction with each other). The other class includes attributes that may be indicative of functional relationships between proteins but not necessarily of their physical interactions. These attributes were derived from sources of information that are independent of the data sets of interacting proteins. These attributes can be also divided into different types. Colocalization and participation in a shared cellular process are implicit for PPIs (both ICIs and NICIs), and their presence or absence mostly testifies to the current state of annotation. The attributes of coexpression and coregulation are relevant to the coordinated transcription of the interacting proteins, and as such they are expected to provide insights into ICIs and NICIs. The attributes of fusion events, consistent phylogenetic profiles, and conserved gene neighborhoods have evolutionary implications, and they are expected to provide valuable insight into possible differences between ICIs and NICIs in this regard. Here we discuss the main findings regarding the various attributes. Domain–domain signatures. This attribute was more prominent in ICIs than in NICIs, and in general it was more abundant in PPIs than in noninteracting pairs. It was also assigned a high coefficient by the LR, suggesting that it is a good discriminator between interacting and noninteracting pairs. It should be emphasized that the domain–domain signatures were not derived directly from all interacting pairs, but rather they were learned from one part of the data and assigned to the other part (see Supporting Methods). Several other studies demonstrated the value of this attribute for PPI prediction by using different data sets of PPIs (6–9). These findings substantiate the biological meaning of domain–domain signatures and support the suggestion that they are characteristic of physically interacting proteins (5–9). Indeed, a literature and database inspection of the overrepresented pairs of domain signatures in interacting proteins (5) showed that 56% of the domain signature pairs for which information is available are involved in physical interactions (E.S. and H.M., unpublished results). Interestingly, pairs of identical domain signatures were found in 15% of the ICIs and in 7.5% of the NICIs. Among the interactions predicted on the full proteome, 13.5% had the same domain signatures in the two pair-mates, in general agreement with the suggestion that identical interfaces are often used for interaction (for review, see ref. 36). Colocalization and shared cellular process. Intuitively, involvement of two proteins in a complex implies that they should be found at the same cellular compartment in localization experiments (37). We found, however, that within complexes, interacting pairs are significantly more often documented as colocalized than other pairs, which suggests that noninteracting subunits within the complex may be either spurious or present transiently (depending on the cellular condition). The latter might complicate their detection as colocalized by independent studies. This possibility implies that the protein composition of multiprotein complexes changes dynamically, as indeed was found in the large-scale studies in which these complexes were discovered (27, 28). Thus, there are complexes that have identical core proteins, and different proteins may join them for specific tasks. Similar arguments hold for the shared cellular process attribute. Because colocalization and shared cellular process are implicit for interacting proteins, their higher association with ICIs compared with NICIs indicates only that the annotation is incomplete. With complete annotation, we will be able to learn whether there are different cellular compartments that are preferred for the different interactions. mRNA coexpression. mRNA coexpression of pair-mates was more prominent among the interacting pairs in complexes than in other pairs in complexes. Dezso et al. (38) showed that there are protein pairs in complexes that exhibit these three properties (colocalization, coexpression, and shared process), and they suggested that these pairs constitute the cores of the complexes. Our results support this conjecture and emphasize that the protein pairs making up these cores were identified experimentally as physically interacting. mRNA coexpression was also relatively more pronounced in ICIs compared with NICIs, perhaps to ensure the right stoichiometry of subunits in complexes. Similar findings regarding the association of coexpression and coregulation with ICIs were also reported in other studies (39, 40). Phylogenetic profile. This attribute, which reflects the consistent presence/absence of the two pair-mates in different organisms, was more frequent among the ICIs than among the NICIs or RICPs. This observation may suggest that the stable interactions within complexes make up the cores that are responsible for basic, evolutionary conserved mechanisms, whereas the other protein pairs in the complexes or the transient pairs comprise interactions that are organism-specific and play a role in more specialized processes. Prediction of Interacting Pairs. Attempts to predict physical interactions have been reported with varying degrees of success (6–9, 41–43). Some of the attributes, such as coexpression or fusion events, were used previously for predictions of co-complexed proteins (44–46) or of functional relations (10–13). These predictions identify associations between protein pairs which are not necessarily physical interactions, although for the identified fusion events, many of the predicted relations correspond to actual interactions (as seen also in Fig. 4 An attempt to develop a predictive scheme based on the whole data set (ICIs and NICIs together) was not successful, nor did an attempt to develop a predictive scheme for NICIs based on their attributes succeed. In both cases, one major reason for the failure probably involves the very large differences in size between the interacting and noninteracting pairs. We believe that the sizes of these data sets reflect the actual situation in the cell, where, among millions of possible PPIs, only a very small fraction of pairs actually interact [≈30,000 based on several estimations (47)]. The well characterized ICIs and the smaller size of the RICP set allow the development of a reasonable predictive scheme based on the current attributes, which shows relatively good performance for this set. It is likely that for identifying NICIs among so many possible pairs, the studied attributes are not sufficient, and additional attributes of different types are required. Some additional attributes may be measures of essentiality (30) and coordinated protein levels (48, 49), which were shown recently to correlate with PPI and with functionally related proteins. Still, the predictive scheme that was trained on the ICIs succeeded in identifying 11% of the NICIs. This success rate is ≈60-fold higher than the chance probability for detecting these interactions (0.17%), and it is 1.4-fold higher than their identification rate by the Y2H method (using the same specificity rate as in the Y2H in the predictions). Most of the NICIs that were correctly identified by the LR model involve at least one protein in a complex. These interactions may have been missed from the complexes because they occur only under certain conditions, and now they are identified by us as putative components of the complexes. The LR model provides the probability of interaction for each protein pair. In our analysis, we determined 0.49 as the threshold probability above which a pair is determined to be interacting (resulting in a specificity of ≈50%). We examined the attribute combinations that lead to probabilities above this threshold, and we found in the data set of interacting proteins as well as when the analysis was applied to the whole yeast proteome that only pairs that showed the three attributes of domain–domain signatures, colocalization, and shared cellular process passed the probability threshold. As discussed above, the first two characteristics are indeed prerequisite for an interaction to occur, which implies that with the current state of annotation, only proteins that are documented as colocalized, participating in a common cellular process, and containing appropriate domain signatures can be predicted to interact. However, what if such information is unavailable or if we are interested in more reliable prediction (requiring higher specificity)? Theoretically, there are additional attribute combinations that pass the threshold (for details, see Table 7, which is published as supporting information on the PNAS web site), and when the data annotation will be more complete, protein pairs with such combinations may be revealed as well. Our predictive model provides a sensitivity that is >2-fold higher than the widely used Y2H method for the same specificity level. It is clear, however, that the sensitivity and specificity provided by the LR-based predictive scheme should be further improved. One possible improvement involves the attribute assignment. Our predictive scheme uses attributes that are all based on documentation available in the various biological databases. For many proteins in our data sets there is no information for some of these attributes (Table 1). For example, many of the yeast proteins are not yet characterized by domain signatures, and therefore they cannot be classified by this attribute. Thus, it is conceivable that when the documentation becomes complete, the associations learned by the LR model will be more precise and will lead to improved predictions. Also, additional attributes that are characteristic of physical interactions in general and of less stable interactions in particular should lead to improvement. Furthermore, using this scheme for initial screening and then applying a structure-based approach as proposed by Aloy et al. (43) may also be a promising direction, leading to a more accurate picture of both intra- and intercomplex interactions. Materials and Methods Data Sets of Interacting Protein Pairs. The data of S. cerevisiae interacting protein pairs were collected from three public databases, MIPS (24), DIP (25), and BIND (26), and from compilations of genomewide Y2H assays (1, 2). After exclusion of redundancies and homodimers, the database included a total of 8,695 binary interactions, involving 4,136 proteins. From this database, two data sets were derived (Fig. 2 Data Sets of Noninteracting Protein Pairs. The proteome size of S. cerevisiae, which includes ≈6,000 proteins, implies that there are ≈1.8 × 107 possible protein pairs. Current estimations of the yeast interactome (the number of PPIs) range from 12,000 (50) to >30,000 (47). Taking the more frequently used estimation of 30,000 interactions implies that the ratio between interacting and noninteracting pairs in yeast is ≈1:600 (30,000:≈1.8 × 107). In our analyses, we follow this ratio of 1:600 between the known interacting pairs and the random pairs (representing the noninteracting pairs). For the 8,695 known binary interactions in our database, we generated 5,217,000 random pairs from all ≈6,000 yeast proteins, and from those pairs, we extracted the RICPs and RNICPs (see Fig. 2 Data Sources of Attributes and Representation. Table 1 provides a brief summary of the attributes and their data sources (for more detail, see Supporting Methods). Each yeast protein pair is represented as a vector of nine entries, where each entry represents an attribute, and the value indicates whether the attribute was found to characterize this particular interaction. An entry is assigned a value of 1 if the protein pair shows the attribute and a value of 0 if it does not show the attribute or if there is no information about the attribute. Supporting Information
Acknowledgments We thank Joel Sussman for hosting E.S. in Rehovot and Ophry Pines, Mario Baras, Norman Grover, Shmuel Sattath, Ariel Jaimovich, and David Sprinzak for helpful discussions. We thank Amit Fliess for help in data mining and our group members for useful comments on the manuscript. This study was supported by Israeli Science Foundation (founded by the Israel Academy of Sciences and Humanities) Grant 558/01 (to H.M.) and by European Union Grant QLRI-CT-2001-00015. E.S. was supported by the Horowitz Foundation. Abbreviations Footnotes This paper results from the Arthur M. Sackler Colloquium of the National Academy of Sciences, “Frontiers in Bioinformatics: Unsolved Problems and Challenges,” held October 15–17, 2004, at the Arnold and Mabel Beckman Center of the National Academies of Science and Engineering in Irvine, CA. Papers from this Colloquium will be available as a collection on the PNAS web site. See the introduction to this Colloquium on page 13355 in issue 38 of volume 102. The complete program is available on the NAS web site at www.nasonline.org/bioinformatics. The authors declare no conflict of interest. This paper was submitted directly (Track II) to the PNAS office. References 1. Uetz P, Giot L, Cagney G, Mansfield TA, Judson RS, Knight JR, Lockshon D, Narayan V, Srinivasan M, Pochart P, et al. Nature. 2000;403:623–627. [PubMed] 2. Ito T, Chiba T, Ozawa R, Yoshida M, Hattori M, Sakaki Y. Proc Natl Acad Sci USA. 2001;98:4569–4574. [PubMed] 3. Giot L, Bader JS, Brouwer C, Chaudhuri A, Kuang B, Li Y, Hao YL, Ooi CE, Godwin B, Vitols E, et al. Science. 2003;302:1727–1736. [PubMed] 4. Li S, Armstrong CM, Bertin N, Ge H, Milstein S, Boxem M, Vidalain PO, Han JD, Chesneau A, Hao T, et al. Science. 2004;303:540–543. [PubMed] 5. Sprinzak E, Margalit H. J Mol Biol. 2001;311:681–692. [PubMed] 6. Gomez SM, Rzhetsky A. Pac Symp Biocomput. 2002;7:413–424. [PubMed] 7. Deng M, Mehta S, Sun F, Chen T. Genome Res. 2002;12:1540–1548. [PubMed] 8. Ng SK, Zhang Z, Tan SH. Bioinformatics. 2003;19:923–929. [PubMed] 9. Gomez SM, Noble WS, Rzhetsky A. Bioinformatics. 2003;19:1875–1881. [PubMed] 10. von Mering C, Huynen M, Jaeggi D, Schmidt S, Bork P, Snel B. Nucleic Acids Res. 2003;31:258–261. [PubMed] 11. Marcotte EM, Pellegrini M, Ng HL, Rice DW, Yeates TO, Eisenberg D. Science. 1999;285:751–753. [PubMed] 12. Enright AJ, Iliopoulos I, Kyrpides NC, Ouzounis CA. Nature. 1999;402:86–90. [PubMed] 13. Pellegrini M, Marcotte EM, Thompson MJ, Eisenberg D, Yeates TO. Proc Natl Acad Sci USA. 1999;96:4285–4288. [PubMed] 14. Dandekar T, Snel B, Huynen M, Bork P. Trends Biochem Sci. 1998;23:324–328. [PubMed] 15. Ge H, Liu Z, Church GM, Vidal M. Nat Genet. 2001;29:482–486. [PubMed] 16. Grigoriev A. Nucleic Acids Res. 2001;29:3513–3519. [PubMed] 17. Deane CM, Salwinski L, Xenarios I, Eisenberg D. Mol Cell Proteomics. 2002;1:349–356. [PubMed] 18. Kemmeren P, van Berkum NL, Vilo J, Bijma T, Donders R, Brazma A, Holstege FC. Mol Cell. 2002;9:1133–1143. [PubMed] 19. von Mering C, Krause R, Snel B, Cornell M, Oliver SG, Fields S, Bork P. Nature. 2002;417:399–403. [PubMed] 20. Yeger-Lotem E, Margalit H. Nucleic Acids Res. 2003;31:6053–6061. [PubMed] 21. Yeger-Lotem E, Sattath S, Kashtan N, Itzkovitz S, Milo R, Pinter RY, Alon U, Margalit H. Proc Natl Acad Sci USA. 2004;101:5934–5939. [PubMed] 22. Bader JS, Chaudhuri A, Rothberg JM, Chant J. Nat Biotechnol. 2004;22:78–85. [PubMed] 23. Sharan R, Suthram S, Kelley RM, Kuhn T, McCuine S, Uetz P, Sittler T, Karp RM, Ideker T. Proc Natl Acad Sci USA. 2005;102:1974–1979. [PubMed] 24. Mewes HW, Frishman D, Guldener U, Mannhaupt G, Mayer K, Mokrejs M, Morgenstern B, Munsterkotter M, Rudd S, Weil B. Nucleic Acids Res. 2002;30:31–34. [PubMed] 25. Xenarios I, Salwinski L, Duan XJ, Higney P, Kim SM, Eisenberg D. Nucleic Acids Res. 2002;30:303–305. [PubMed] 26. Bader GD, Betel D, Hogue CW. Nucleic Acids Res. 2003;31:248–250. [PubMed] 27. Gavin AC, Bosche M, Krause R, Grandi P, Marzioch M, Bauer A, Schultz J, Rick JM, Michon AM, Cruciat CM, et al. Nature. 2002;415:141–147. [PubMed] 28. Ho Y, Gruhler A, Heilbut A, Bader GD, Moore L, Adams SL, Millar A, Taylor P, Bennett K, Boutilier K, et al. Nature. 2002;415:180–183. [PubMed] 29. Hosmer D, Lemeshow S. Applied Logistic Regression. New York: Wiley; 2000. 30. Lu LJ, Xia Y, Paccanaro A, Yu H, Gerstein M. Genome Res. 2005;15:945–953. [PubMed] 31. Zhang L, Hach A, Wang C. Mol Cell Biol. 1998;18:3819–3828. [PubMed] 32. Hon T, Lee HC, Hach A, Johnson JL, Craig EA, Erdjument-Bromage H, Tempst P, Zhang L. Mol Cell Biol. 2001;21:7923–7932. [PubMed] 33. Sato K, Nakano A. Nat Struct Mol Biol. 2005;12:167–174. [PubMed] 34. Roberg KJ, Bickel S, Rowley N, Kaiser CA. Genetics. 1997;147:1569–1584. [PubMed] 35. Enninga J, Levay A, Fontoura BM. Mol Cell Biol. 2003;23:7271–7284. [PubMed] 36. Nooren IM, Thornton JM. EMBO J. 2003;22:3486–3492. [PubMed] 37. Huh WK, Falvo JV, Gerke LC, Carroll AS, Howson RW, Weissman JS, O'Shea EK. Nature. 2003;425:686–691. [PubMed] 38. Dezso Z, Oltvai ZN, Barabasi AL. Genome Res. 2003;13:2450–2454. [PubMed] 39. Jansen R, Greenbaum D, Gerstein M. Genome Res. 2002;12:37–46. [PubMed] 40. Simonis N, Van Helden J, Cohen GN, Wodak SJ. Genome Biol. 2004;5:R33. [PubMed] 41. Lu L, Arakaki AK, Lu H, Skolnick J. Genome Res. 2003;13:1146–1154. [PubMed] 42. Valencia A, Pazos F. Methods Biochem Anal. 2003;44:411–426. [PubMed] 43. Aloy P, Bottcher B, Ceulemans H, Leutwein C, Mellwig C, Fischer S, Gavin AC, Bork P, Superti-Furga G, Serrano L, et al. Science. 2004;303:2026–2029. [PubMed] 44. Jansen R, Lan N, Qian J, Gerstein M. J Struct Funct Genomics. 2002;2:71–81. [PubMed] 45. Jansen R, Yu H, Greenbaum D, Kluger Y, Krogan NJ, Chung S, Emili A, Snyder M, Greenblatt JF, Gerstein M. Science. 2003;302:449–453. [PubMed] 46. Zhang LV, Wong SL, King OD, Roth FP. BMC Bioinformatics. 2004;5:38. [PubMed] 47. Grigoriev A. Nucleic Acids Res. 2003;31:4157–4161. [PubMed] 48. Fraser HB, Hirsh AE, Wall DP, Eisen MB. Proc Natl Acad Sci USA. 2004;101:9033–9038. [PubMed] 49. Lithwick G, Margalit H. Nucleic Acids Res. 2005;33:1051–1057. [PubMed] 50. Sprinzak E, Sattath S, Margalit H. J Mol Biol. 2003;327:919–923. [PubMed] 51. Mulder NJ, Apweiler R, Attwood TK, Bairoch A, Barrell D, Bateman A, Binns D, Biswas M, Bradley P, Bork P, et al. Nucleic Acids Res. 2003;31:315–318. [PubMed] 52. Hegyi H, Lin J, Greenbaum D, Gerstein M. Proteins. 2002;47:126–141. [PubMed] 53. Csank C, Costanzo MC, Hirschman J, Hodges P, Kranz JE, Mangan M, O'Neill K, Robertson LS, Skrzypek MS, Brooks J, et al. Methods Enzymol. 2002;350:347–373. [PubMed] 54. Ihmels J, Friedlander G, Bergmann S, Sarig O, Ziv Y, Barkai N. Nat Genet. 2002;31:370–377. [PubMed] 55. Lee TI, Rinaldi NJ, Robert F, Odom DT, Bar-Joseph Z, Gerber GK, Hannett NM, Harbison CT, Thompson CM, Simon I, et al. Science. 2002;298:799–804. [PubMed] |
PubMed related articles
Your browsing activity is empty. Activity recording is turned off. |
|||||||||||||
Nature. 2000 Feb 10; 403(6770):623-7.
[Nature. 2000]Proc Natl Acad Sci U S A. 2001 Apr 10; 98(8):4569-74.
[Proc Natl Acad Sci U S A. 2001]Science. 2003 Dec 5; 302(5651):1727-36.
[Science. 2003]Science. 2004 Jan 23; 303(5657):540-3.
[Science. 2004]J Mol Biol. 2001 Aug 24; 311(4):681-92.
[J Mol Biol. 2001]Nature. 2002 May 23; 417(6887):399-403.
[Nature. 2002]Nature. 2000 Feb 10; 403(6770):623-7.
[Nature. 2000]Proc Natl Acad Sci U S A. 2001 Apr 10; 98(8):4569-74.
[Proc Natl Acad Sci U S A. 2001]Genome Res. 2005 Jul; 15(7):945-53.
[Genome Res. 2005]Mol Cell Biol. 1998 Jul; 18(7):3819-28.
[Mol Cell Biol. 1998]Mol Cell Biol. 2001 Dec; 21(23):7923-32.
[Mol Cell Biol. 2001]Nat Struct Mol Biol. 2005 Feb; 12(2):167-74.
[Nat Struct Mol Biol. 2005]Genetics. 1997 Dec; 147(4):1569-84.
[Genetics. 1997]Mol Cell Biol. 2003 Oct; 23(20):7271-84.
[Mol Cell Biol. 2003]Pac Symp Biocomput. 2002; ():413-24.
[Pac Symp Biocomput. 2002]Genome Res. 2002 Oct; 12(10):1540-8.
[Genome Res. 2002]Bioinformatics. 2003 May 22; 19(8):923-9.
[Bioinformatics. 2003]Bioinformatics. 2003 Oct 12; 19(15):1875-81.
[Bioinformatics. 2003]J Mol Biol. 2001 Aug 24; 311(4):681-92.
[J Mol Biol. 2001]Nature. 2003 Oct 16; 425(6959):686-91.
[Nature. 2003]Nature. 2002 Jan 10; 415(6868):141-7.
[Nature. 2002]Nature. 2002 Jan 10; 415(6868):180-3.
[Nature. 2002]Genome Res. 2003 Nov; 13(11):2450-4.
[Genome Res. 2003]Genome Res. 2002 Jan; 12(1):37-46.
[Genome Res. 2002]Genome Biol. 2004; 5(5):R33.
[Genome Biol. 2004]Pac Symp Biocomput. 2002; ():413-24.
[Pac Symp Biocomput. 2002]Genome Res. 2002 Oct; 12(10):1540-8.
[Genome Res. 2002]Bioinformatics. 2003 May 22; 19(8):923-9.
[Bioinformatics. 2003]Bioinformatics. 2003 Oct 12; 19(15):1875-81.
[Bioinformatics. 2003]Genome Res. 2003 Jun; 13(6A):1146-54.
[Genome Res. 2003]Nucleic Acids Res. 2003 Jul 15; 31(14):4157-61.
[Nucleic Acids Res. 2003]Genome Res. 2005 Jul; 15(7):945-53.
[Genome Res. 2005]Proc Natl Acad Sci U S A. 2004 Jun 15; 101(24):9033-8.
[Proc Natl Acad Sci U S A. 2004]Nucleic Acids Res. 2005; 33(3):1051-7.
[Nucleic Acids Res. 2005]Science. 2004 Mar 26; 303(5666):2026-9.
[Science. 2004]Nucleic Acids Res. 2002 Jan 1; 30(1):31-4.
[Nucleic Acids Res. 2002]Nucleic Acids Res. 2002 Jan 1; 30(1):303-5.
[Nucleic Acids Res. 2002]Nucleic Acids Res. 2003 Jan 1; 31(1):248-50.
[Nucleic Acids Res. 2003]Nature. 2000 Feb 10; 403(6770):623-7.
[Nature. 2000]Proc Natl Acad Sci U S A. 2001 Apr 10; 98(8):4569-74.
[Proc Natl Acad Sci U S A. 2001]J Mol Biol. 2003 Apr 11; 327(5):919-23.
[J Mol Biol. 2003]Nucleic Acids Res. 2003 Jul 15; 31(14):4157-61.
[Nucleic Acids Res. 2003]