![]() | ![]() |
Formats:
|
|||||||||||||||||||||||||||||||||||||||||||||||||||||||
Copyright : © 2006 Mika and Rost. This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited. Protein–Protein Interactions More Conserved within Species than across Species 1 Department of Biochemistry and Molecular Biophysics, Columbia University, New York, New York, United States of America 2 Columbia University Center for Computational Biology and Bioinformatics, Irvine Cancer Center, New York, New York, United States of America 3 Institute of Physical Biochemistry, University Witten/Herdecke, Witten, Germany 4 NorthEast Structural Genomics Consortium, New York, New York, United States of America Andrey Rzhetsky, Editor Columbia University, United States of America * To whom correspondence should be addressed. E-mail: mika/at/rostlab.org Received November 18, 2005; Revised May 18, 2006. This article has been cited by other articles in PMC.Abstract Experimental high-throughput studies of protein–protein interactions are beginning to provide enough data for comprehensive computational studies. Today, about ten large data sets, each with thousands of interacting pairs, coarsely sample the interactions in fly, human, worm, and yeast. Another about 55,000 pairs of interacting proteins have been identified by more careful, detailed biochemical experiments. Most interactions are experimentally observed in prokaryotes and simple eukaryotes; very few interactions are observed in higher eukaryotes such as mammals. It is commonly assumed that pathways in mammals can be inferred through homology to model organisms, e.g. the experimental observation that two yeast proteins interact is transferred to infer that the two corresponding proteins in human also interact. Two pairs for which the interaction is conserved are often described as interologs. The goal of this investigation was a large-scale comprehensive analysis of such inferences, i.e. of the evolutionary conservation of interologs. Here, we introduced a novel score for measuring the overlap between protein–protein interaction data sets. This measure appeared to reflect the overall quality of the data and was the basis for our two surprising results from our large-scale analysis. Firstly, homology-based inferences of physical protein–protein interactions appeared far less successful than expected. In fact, such inferences were accurate only for extremely high levels of sequence similarity. Secondly, and most surprisingly, the identification of interacting partners through sequence similarity was significantly more reliable for protein pairs within the same organism than for pairs between species. Our analysis underlined that the discrepancies between different datasets are large, even when using the same type of experiment on the same organism. This reality considerably constrains the power of homology-based transfer of interactions. In particular, the experimental probing of interactions in distant model organisms has to be undertaken with some caution. More comprehensive images of protein–protein networks will require the combination of many high-throughput methods, including in silico inferences and predictions. http://www.rostlab.org/results/2006/ppi_homology/ Synopsis The IntAct database contains about ten large-scale data sets of protein–protein interactions. Each set contains thousands of experimentally observed pair interactions. Most pairs were observed in yeast (Saccharomyces cerevisiae), fly (Drosophila melanogaster), and worm (Caenorhabditis elegans). These interactions are often perceived as model organisms in the sense that one can infer that two mouse proteins interact if one experimentally observes the two corresponding proteins in worm to interact. Here, the authors analyzed in detail how the sequence signals of physical protein–protein interactions are conserved. It is a common assumption that protein–protein interactions can easily be inferred through homology transfer from one model organism to another organism of interest. Here, the authors demonstrated that such homology transfers are only accurate at unexpectedly high levels of sequence identity. Even more surprisingly, homology transfers of protein–protein interactions are significantly more reliable for protein pairs from the same species than for two protein pairs from different organisms. The observation that interactions were much more conserved within than across species was valid for all levels of sequence similarity, i.e. for very similar as well as for more diverged interologs. Introduction Experiments Peek at Complete Protein–Protein Networks The faster large-scale sequencing projects determine the alphabet of life, the higher the pressure to determine some of the actual processes that make life what it is. The understanding of functional relations among all proteins is essential to understanding how cells work. Recent breakthroughs in experimental high-throughput techniques have begun to peek at complete protein–protein interaction networks of entire organisms (Table S1). One central method is to use yeast two-hybrid (Y2H) assays [1] that are based on a genially simple idea: first, separate two domains (activation and DNA-binding) of a transcription factor that activates a reporter gene, then merge each of the two domains to a different protein (A and B) [2,3]. If A and B interact, the two transcription domains will merge, and thereby activate the reporter gene that will be detected. The difficulty of using Y2H is in mastering the details of the experimental setup. Other high-throughput methods to detect protein–protein interactions, such as phage-display assays [4], tandem affinity purifications (TAP) [5,6], co-immunoprecipitation, and affinity chromatography [2,7–9], are also commonly used. An important advantage of using Y2H over these other high-throughput techniques is the ability to measure physical interactions between proteins as opposed to pure functional associations. Also, Y2H experiments work with physiological conditions, i.e., conditions that resemble those in eukaryotic cells [2,3,10,11]. Ito et al. [12] and Uetz et al. [13] first scanned large fractions of the yeast proteome for protein–protein interactions. Others added further interactions: Ho et al. [14] used mass spectrometry and Gavin et al. [15] used TAP. Protein networks in the fly (Drosophelia melanogaster) have been targeted through three different Y2H studies [11,16,17], in the worm (Caenorhabditis elegans) through one [18], and a large subset of about 1,500 human protein network relations were detected through TAP [19]. These data bear deeper insights into cellular processes. Today's Data Are Incomplete and Not Fully Reliable Y2H systems are not 100% accurate; they, for instance, identify many putative interactions that cannot be confirmed by other studies. One reason for false positives (interactions incorrectly postulated) is that the two proteins A and B may activate the reporter gene directly without having to interact [3]. The Margalit group has estimated the false positive rate in high-throughput Y2H assays to be about 50% [20]; the Eisenberg group has arrived at the same estimate through measuring the reliability of interactions in the Database of Interacting Proteins [21]. Y2H experiments also do not achieve complete coverage, i.e., they miss many interactions. Conversely, false negatives (missed interactions) might result from the particular experimental setup (which may prevent the interaction between A and B) or from problems in the assembly of the two transcriptional domains (activation and DNA-binding) needed for Y2H. These problems do not prevent Y2H from evolving as one of the major experimental probes for interactions; they do, however, imply that today's data sets are neither complete nor fully accurate [20,22]. One of the strong arguments in favor of large-scale Y2H experiments is that they are more systematic and much less driven by happenstance than hypothesis-driven, detailed experiments. Known Interactions Are Expanded through Homology-Based Inference Evolutionary connections help explain the rapid success of molecular biology: we can study a particular protein in a simple bacterium and learn about the function of the same protein in multicellular eukaryotes. This idea enables us to use model organisms to predict protein structure [23–25], subcellular localization [26], enzymatic activity [27–29], and other aspects of protein function [30–34]. The same principle is frequently applied to the extension of interactions (Figure 1
Focus on Transient Physical Interactions (PPIs) One important difference between Y2H and TAP is that while Y2H aims at the detection of physically interacting proteins, TAP identifies large groups of proteins that are associated, for instance, through a common pathway [43]. Most high-throughput techniques resemble TAP in the sense that they reveal association rather than physical interaction. To illustrate this difference, assume we hypothesized that co-expressed proteins interact physically, and we wanted to use this hypothesis to predict physical interactions directly from co-expression data. Assume further that six proteins are strung together in a linear pathway (1 binds 2, 2 binds 3, etc.), and that all six are co-expressed. Of the 15 [N*(N − 1)/2] possible interactions, only 5 (N − 1) are physical, i.e., only 33% of the co-expressed proteins interact. Since most pathways involve many more than six interactions this example is likely to significantly underestimate the actual problem. In other words, even if all physically interacting proteins were co-expressed, predictions of interactions based on such association alone would still be more often wrong than right. This significantly constrains the way in which we can use association-type data to analyze physical interactions. In order to emphasize our focus on physical interactions, we used the abbreviation PPI for transient physical protein–protein interactions (as opposed to functional associations as measured by TAP-like data, and as opposed to permanent physical interactions between, e.g., two different domains or two different chains of the same protein [44]). Coping with the Dilemma of Incomplete Data Sets How can we evaluate accuracy and coverage of homology transfer (Figure 1
Here, we presented the analysis of PPI in, to our knowledge, the largest data set investigated thus far. We defined and measured the overlap between different data sets, and analyzed the expected levels of accuracy and coverage for homology-based inference of PPIs depending on the level of sequence similarity. The most surprising finding originated from differentiating between intraspecies and interspecies inferences (o ≠ p in Figure 1 Results/Discussion Different Experiments Overlap Very Little If we want to homology infer PPIs between organisms, we first have to measure the overlap within organisms and then between organisms. We introduced such a measure (Equation 2 and Equation 3, see Materials and Methods) and applied it to assessing the overlap between datasets in IntAct [45]. A large overlap value implies high agreement between two experimental sets of interactions. Our definition of overlap takes into account that two data sets may not have used the same proteins thereby rendering a score that is, in principle, independent of the size of common subsets (see Materials and Methods section). The scores are straightforward when comparing different datasets within the same organism (Equation 2) because we only have to identify identical pairs of proteins. As noted before [22,46–49], the data sets overlap maximally for about 30% of all PPIs in yeast (Saccharomyces Cerevisiae) and much less for PPIs in fly (Drosophila Melanogaster, Table 1). Interspecies comparisons are trickier because we now have to identify the corresponding homologous pairs in the other organism. Equation 3 solves this problem by counting homologous instead of identical pairs of proteins; it is applicable to intraspecies and interspecies comparisons. A consequence of counting homologous rather than identical protein pairs is that the same data set no longer overlaps 100% with itself (Table 2), because the interaction between A and B may be detected while that between the homologs A′ and B′ may not be. The application of Equation 3 to the intraspecies comparison for yeast and fly datasets yielded similar results as the application of Equation 2 to the same datasets (Table 1). The overlap between different yeast datasets seems to be generally higher than that between different fly datasets. Finally, we merged datasets of different large-scale experiments for each organism and compared these pseudo-complete PPIs between organisms by using Equation 3 (Table 3). As expected the overlap between organisms was increased with increasing thresholds in what was considered homologous (Table 3; HSSP-value (HVAL)>40 highest, HVAL>0 lowest, Equation 1; note that the HSSP value (homology derived secondary structure of proteins) is an empirical measure for sequence similarity that empirically embeds the simple fact that high levels of sequence similarity are less meaningful for short than they are for long alignments). This increase in overlap was achieved by finding fewer matches (Table 3, empty cells). Conversely, the overlap was very low at levels of sequence similarity that mark the twilight zone of sequence-structure inference [25], i.e., the line above which most pairs of proteins have largely similar structure (HVAL>0, Table 3). In other words, overall fold similarity does not suffice to infer similarity in interactions.
Automatic Homology Transfer of PPIs Is Very Limited We generated a homology performance plot (see Materials and Methods section) by comparing an unbiased, nonredundant data set (no two pairs of proteins in the set had significant sequence similarity (see Materials and Methods section) against the redundant set with all PPIs (note that we removed identical pairs even in this set, Table 4, Experiment 1). When using the observed PPI between two proteins (A-B), we applied the same sequence similarity threshold to identify both homologs (A/A′, B/B′) to infer the PPI between A′-B′. Pairs such as A-B′ or A′-B were not counted because those pairs could only be detected within the same organism and not across two species. Not surprisingly, the accuracy of homology transfer was proportional to sequence similarity (Figure 2
Homology Transfer Is Better within than between Organisms Arguably [40–42], homology transfer is expected to be slightly better between organisms than within organisms. Instead, we observed the extreme opposite (Figure 3
Table 4 and Figures 2 Results Were Stable with Respect to Details in Filtering Data (1) Different sampling of intraspecies vs. interspecies: We allowed transfers of the type A-B to A′-B or A-B to A-B′ (see Materials and Methods section). The performance became significantly better for intraspecies PPI transfers, thus further widening the gap between intraspecies and interspecies transfers (Figure S2A). (2) Inclusion of transfers within the same data set: we included homology transfers within the same experimental dataset (see Materials and Methods section). The effect was very similar to those observed for different sampling (see #1), i.e., the gap was widened between intraspecies and interspecies inferences (Figure S2B). (3) We used TAP-like data (Table S1) as a constraint for the negatives. To illustrate this, assume that TAP pulled down a complex of six proteins. While we cannot infer that all 15 possible interactions are physical, all could be. Therefore, we ignored a false positive prediction (i.e., we did not count it) if we could find the interaction in those 15 TAP protein–protein pairs. The accuracy slightly increased for both yeast versus yeast (intraspecies) comparisons as well as for nonyeast versus yeast (interspecies) comparisons (Figure S2C). Note that yeast is the only organism with available TAP-like data. (4) We used a redundant dataset (instead of a nonredundant, bias-reduced set) from organism o (Figure 7
Examples In the following, we presented a few representative examples that illustrate these points with more details than it is possible through averages over large data sets. Both show how homology transfer fails across species while it succeeds within an organism (Ao-Bo observed, A′o-B′o observed, A″m-B″m not observed). Example 1: same family, different ancestors, different PPI. The two peroxins PEX1 and PEX6 are known to functionally and physically interact in both human [50] and yeast [51–53] (Figure 4
This particular example illustrated how yeast may generally be a rather poor model organism for more complex species such as fly, worm or vertebrates. Proteins from these higher eukaryotes have to perform many different tasks in often highly specialized cell types (e.g., nerve cells). This might have lead to an evolutionary pressure to build new protein-interaction networks from the available protein building blocks (e.g., ATPase function). Thus, by only slightly altering the existing sequences, new binding properties were added to these proteins, while others were lost. A similar argument could be used to explain a likely poor homology transfer between fly and human or worm and human. Example 2: same pathway, different functions, different binding properties. The drosophila Ser/Thr protein phosphatase 4 (Pp4) and the cyclin dependent kinase 4 (Cdk4) were found in our small-scale dataset for drosophila PPIs. At HVAL>20, we found two sequence-similar proteins in fly, namely Ser/Thr protein phosphatase alpha 2 (Pp1) similar to Pp4, and chk2 similar to Cdk4; both these fly proteins (Pp1 and chk2) have been shown to interact [16]. Fly chk2 as well as its sequence relatives in yeast (Mek1p and Rad53p) and human are involved in cell-cycle checkpoints, which are signal transduction pathways that control the cell cycle and prevent the cell from further replication if the DNA double strand breaks, the DNA is incompletely replicated, or in case of other DNA damages [58–60]. A checkpoint can halt an ongoing mitosis or meiosis or even terminate it and induce apoptosis. A phylogenetic analysis of the chk2 family members found that fly chk2 and its yeast and human homologs stem from the same ancestor (Figure 4 Sequence-Based Homology Transfer Is Limited Although Binding Sites Are Partially Conserved in Three-Dimensional (3-D) Structure Recently, the Sali group analyzed the conservation of protein–protein binding sites on homologous and structurally aligned protein surfaces. They found that the differences in the localization of binding sites between homologous proteins are significantly smaller than the differences expected at random [62]. On the one hand, this result is similar to what we found for higher levels of similarity (Figure 3 Conclusions As demonstrated again by our overlap measure, today's datasets of PPIs are still rather inconsistent (Tables 1–3). The discrepancies were significantly smaller between yeast than between fly datasets (Tables 1 and 2). This finding also explains the much higher accuracy for intrayeast as opposed to intrafly or intraworm transfer. Why datasets of yeast appear more consistent than those of fly datasets remains speculation. One reason might be that measurements of protein–protein interactions are performed within yeast (Y2H) and are thus more precise for yeast proteins than for other species′ proteins, since those might behave differently in the unfamiliar yeast cell. Although incomplete and not fully consistent, PPI datasets are finally large enough to validate quantitative analyses. In particular, this enables a large-scale assessment of the performance of automated homology transfer for PPIs. Assuming that today's errors are largely nonsystematic, estimates for the performance of homology transfer will provide correct qualitative pictures, albeit the actual numbers will be overpessimistic. In the extreme regimen of comparing very similar pairs of proteins, we could establish that data sets appeared very consistent (Figure 2 Materials and Methods Data sets. Several publicly available databases such as GRID [68], BIND [69], MINT [70], and DIP [71,72] gather information about interacting proteins in different organisms. For our analysis, we used the IntAct database [45], a protein–protein interaction resource maintained at the European Bioinformaics Institute (EBI) in Cambridge (http://www.ebi.ac.uk/intact/). IntAct uses the PSI format (extended markup language (XML)-tagged) to store data [73], fly [12–15], fly [11,16,17], worm [18] and human [19] as well as about 30 so called small-scale datasets, which are collections of results from many detailed experiments for different organisms. The largest small-scale dataset is that of human with about 38,000 interactions. Concerning the high-throughput datasets, IntAct carries detailed information about which proteins were used as baits and which proteins were used as preys, so that a complete interaction matrix can easily be reconstructed from these sets. Table S1 contains all protein–protein interaction datasets deposited in IntAct at the moment along with links to these datasets (small-scale and large-scale). The Giot [17], Ito [35], and Li [18] datasets contain some information about the level of confidence that was assigned to each interaction. For these three sets, we excluded everything from our analysis that either had a confidence-value of less than 0.4 (Giot: values range from 0 to 1) or those that were not in a so called “core” dataset of trusted interactions (Ito and Li divide their sets into core and full or core and noncore subsets, where core means a higher confidence in the measured interaction). Note that for the initial submission of this manuscript we had compiled all results for unfiltered data sets, i.e., we had included all experimental interactions; the results were qualitatively identical to those given here (data not shown). True positives and false negatives: focus on Physical Interactions = PPIs. Technically, we realized our goal of exclusively focusing on PPIs through the particular way of labeling positives and negatives. We labeled as positives (true PPIs) only those pairs that were identified by experiments that target the detection of physical interactions (only Y2H experiments). We then also assumed that these data for each organism was complete, i.e., we labeled all pairs as negatives that were not detected by Y2H. Measuring sequence similarity/homology. The term homology usually implies an evolutionary relation in the sense of having a common ancestor. Strictly speaking, we cannot measure homology. Instead, alignment methods measure sequence similarity in some way or other. In our work the ranges of similarity were so high that the pairs of proteins were most likely homologous. We used BLAST and PSI-BLAST [74] to align all protein sequences in IntAct against each other (standard procedure [75]: 3 iterations at E<10-10 against filtered database of all proteins to build clean profiles, then one run with frozen profile against unfiltered database at E < 10−3, freeze profile again and run against all IntAct proteins). Then we extracted the PSI-BLAST E-values for each alignment, as well as the percentage of sequence identity (PIDE) and the distance to the HSSP curve, i.e. the HSSP-value [25,76,77] (HVAL). The HVAL is defined as:
Nonredundant data sets. We removed bias from PPI datasets by the following procedure (Figure 5
Identity- and homology-based overlap between datasets. We defined two procedures resembling the Jaccard correlation to measure the overlap between two different datasets of PPIs in IntAct. Equation 2 defines the first measure; for clarity we refer to this measure as the identity-based overlap. This measure can only be applied to two PPI sets from the same organism.
The second measure capturing an overlap between two interaction datasets was applicable to any two datasets, even if they were from different organisms. We referred to this measure as the homology-based overlap. It was defined as follows (Figure 6
Homology performance curves. For given levels of sequence similarity, we monitored and plotted the accuracy of inferring PPIs through homology from one dataset to another. The procedure is described in Figure 7 The resulting curves can be interpreted as the degree to which PPIs are evolutionarily conserved. In a more technical sense, the curves reflect the performance of homology transfer of PPIs (Figure 1 Accuracy and coverage. We measured the accuracy (Acc) and coverage (Cov) for the inference (prediction) of interacting protein pairs by the standard formulas:
Error estimate. The error in the estimates of accuracy and coverage were determined by bootstrapping [78] over the protein–protein interactions in the source datasets. In particular, we picked n interactions at random from the non-redundant source dataset and compiled the averages over a larger set with possibly many replicas of the same incidence. The levels of accuracy/coverage for different thresholds in sequence similarity were then calculated according to the procedure described above (Figure 7 Table S1: Large-Scale Protein–Protein Interaction Datasets from IntAct (74 KB DOC) Click here for additional data file.(74K, doc) Figure S1: Number of true positive counts versus HVAL Each curve shows the accuracy (red) as shown in Figure 3 (72 KB DOC) Click here for additional data file.(72K, doc) Figure S2: Results Are Stable with Respect to Variations in the Experimental Setup (A) Different sampling of intra- versus inter-species: we allowed transfers of the type A-B to A'-B or A-B to A-B' (see Materials and Methods section). The performance became significantly better for intra-species PPI-transfers, thus further widening the gap between intra- and inter-species transfers. (B) Inclusion of transfers within the same data set: we included homology transfers within the same experimental dataset (see Materials and Methods section). The effect was very similar to those observed for different sampling (#1), i.e. widening the gap between intra- and inter-species inferences. (C) Using TAP-like data (Table S1) as a constraint for the negatives. To illustrate this, assume that TAP pulled down a complex of six proteins. While we cannot infer that all 15 possible interactions are physical, all could be. Therefore, we ignored a false positive prediction (did not count it) if we could find the interaction in those 15 TAP protein-protein pairs. The accuracy slightly increased for both yeast versus yeast (intra-species) comparisons as well as for non-yeast versus yeast (inter-species) comparisons. Note that yeast is the only organism with available TAP-like data. (D) We used a redundant dataset (instead of a non-redundant, bias-reduced set) from organism o (Figure 7 (153 KB DOC) Click here for additional data file.(153K, doc) Acknowledgments Thanks to Jinfeng Liu, Hans-Erik Aronson, Kristen McFadden, and Paul Glick (all from Columbia University) for computer assistance. Thanks to the anonymous reviewers for their helpful criticism. Furthermore, thanks in particular to Amos Bairoch (Swiss Institute of Bioinformatics, Geneva, Switzerland), Rolf Apweiler (European Bioinformatics Institute, Hinxton, United Kingdom), Phil Bourne (San Diego University, San Diego, California, United States), David Eisenberg (University of California—Los Angeles, Los Angeles, California, United States), and their crews for maintaining excellent databases and to all experimentalists who enabled this work by publishing their PPI results in PubMed/MedLine.
Author contributions. SM conceived and designed the experiments. SM performed the experiments. SM analyzed the data. BR contributed reagents/materials/analysis tools. SM and BR wrote the paper. Abbreviations
Footnotes Author contributions. SM conceived and designed the experiments. SM performed the experiments. SM analyzed the data. BR contributed reagents/materials/analysis tools. SM and BR wrote the paper. Competing interests. The authors have declared that no competing interests exist. A previous version of this article appeared as an Early Online Release on May 18, 2006 (DOI: 10.1371/journal.pcbi.0020079.eor). Funding. This work was supported by the grants R01-GM63029-01 from the National Institute of Health. References
|
PubMed related articles
Your browsing activity is empty. Activity recording is turned off. |
||||||||||||||||||||||||||||||||||||||||||||||||||||||
Nature. 1989 Jul 20; 340(6230):245-6.
[Nature. 1989]Mass Spectrom Rev. 2004 Sep-Oct; 23(5):350-67.
[Mass Spectrom Rev. 2004]Trends Genet. 2001 Jun; 17(6):346-52.
[Trends Genet. 2001]Plant Mol Biol. 2002 Dec; 50(6):837-54.
[Plant Mol Biol. 2002]Methods. 2001 Jul; 24(3):218-29.
[Methods. 2001]Trends Genet. 2001 Jun; 17(6):346-52.
[Trends Genet. 2001]J Mol Biol. 2003 Apr 11; 327(5):919-23.
[J Mol Biol. 2003]Mol Cell Proteomics. 2002 May; 1(5):349-56.
[Mol Cell Proteomics. 2002]J Mol Biol. 2003 Apr 11; 327(5):919-23.
[J Mol Biol. 2003]J Mol Biol. 1997 Oct 17; 273(1):355-68.
[J Mol Biol. 1997]Protein Eng. 1999 Feb; 12(2):85-94.
[Protein Eng. 1999]Protein Sci. 2002 Dec; 11(12):2836-47.
[Protein Sci. 2002]J Mol Biol. 2001 Apr 6; 307(4):1113-43.
[J Mol Biol. 2001]J Mol Biol. 2002 Apr 26; 318(2):595-608.
[J Mol Biol. 2002]Science. 2004 Dec 24; 306(5705):2246-9.
[Science. 2004]J Mol Biol. 2003 Jan 10; 325(2):377-87.
[J Mol Biol. 2003]Nucleic Acids Res. 2004 Jan 1; 32(Database issue):D452-5.
[Nucleic Acids Res. 2004]J Mol Biol. 2003 Apr 11; 327(5):919-23.
[J Mol Biol. 2003]Protein Sci. 2001 Oct; 10(10):1970-9.
[Protein Sci. 2001]Curr Opin Struct Biol. 2003 Jun; 13(3):377-82.
[Curr Opin Struct Biol. 2003]Protein Eng. 1999 Feb; 12(2):85-94.
[Protein Eng. 1999]BMC Bioinformatics. 2005 Mar 2; 6():40.
[BMC Bioinformatics. 2005]Bioinformatics. 2005 Jun 1; 21(11):2730-8.
[Bioinformatics. 2005]Biochem J. 2001 Jul 15; 357(Pt 2):417-26.
[Biochem J. 2001]Yeast. 1999 Aug; 15(11):1059-78.
[Yeast. 1999]Nat Cell Biol. 2003 May; 5(5):454-60.
[Nat Cell Biol. 2003]Nature. 2000 Feb 10; 403(6770):623-7.
[Nature. 2000]Nature. 2002 Jan 10; 415(6868):141-7.
[Nature. 2002]Genome Biol. 2004; 5(12):R96.
[Genome Biol. 2004]FEBS Lett. 2003 Jun 19; 545(2-3):209-12.
[FEBS Lett. 2003]Genetics. 2003 Mar; 163(3):973-82.
[Genetics. 2003]Curr Biol. 2001 Oct 30; 11(21):R865-8.
[Curr Biol. 2001]Protein Sci. 2005 Sep; 14(9):2350-60.
[Protein Sci. 2005]FEBS Lett. 2003 Jun 5; 544(1-3):236-9.
[FEBS Lett. 2003]Nucleic Acids Res. 2004 Jan 1; 32(Database issue):D115-9.
[Nucleic Acids Res. 2004]Genome Biol. 2003; 4(3):R23.
[Genome Biol. 2003]Nucleic Acids Res. 2005 Jan 1; 33(Database issue):D418-24.
[Nucleic Acids Res. 2005]FEBS Lett. 2002 Feb 20; 513(1):135-40.
[FEBS Lett. 2002]Nucleic Acids Res. 2004 Jan 1; 32(Database issue):D449-51.
[Nucleic Acids Res. 2004]Nucleic Acids Res. 2002 Jan 1; 30(1):303-5.
[Nucleic Acids Res. 2002]Nucleic Acids Res. 1997 Sep 1; 25(17):3389-402.
[Nucleic Acids Res. 1997]Proteins. 2002 Feb 1; 46(2):197-205.
[Proteins. 2002]Protein Eng. 1999 Feb; 12(2):85-94.
[Protein Eng. 1999]Nucleic Acids Res. 2003 Jul 1; 31(13):3789-91.
[Nucleic Acids Res. 2003]Proteins. 1991; 9(1):56-68.
[Proteins. 1991]Proc Natl Acad Sci U S A. 1996 Nov 12; 93(23):13429-34.
[Proc Natl Acad Sci U S A. 1996]Genome Res. 2004 Jun; 14(6):1107-18.
[Genome Res. 2004]Genome Biol. 2005; 6(12):R106.
[Genome Biol. 2005]