![]() | ![]() |
Formats:
|
||||||||||||||||||||||||||
Copyright Järvinen et al. This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited. Predicting Quantitative Genetic Interactions by Means of Sequential Matrix Approximation 1Biomathematics Research Group, Department of Mathematics, University of Turku, Turku, Finland 2Data Mining Research Group, Turku Centre for Biotechnology, Turku, Finland Shin-Han Shiu, Editor Michigan State University, United States of America * E-mail: tero.aittokallio/at/utu.fi Conceived and designed the experiments: APJ LLE TA. Performed the experiments: APJ TA. Analyzed the data: APJ JH. Wrote the paper: APJ JH LLE TA. Received June 20, 2008; Accepted September 5, 2008. Abstract Despite the emerging experimental techniques for perturbing multiple genes and measuring their quantitative phenotypic effects, genetic interactions have remained extremely difficult to predict on a large scale. Using a recent high-resolution screen of genetic interactions in yeast as a case study, we investigated whether the extraction of pertinent information encoded in the quantitative phenotypic measurements could be improved by computational means. By taking advantage of the observation that most gene pairs in the genetic interaction screens have no significant interactions with each other, we developed a sequential approximation procedure which ranks the mutation pairs in order of evidence for a genetic interaction. The sequential approximations can efficiently remove background variation in the double-mutation screens and give increasingly accurate estimates of the single-mutant fitness measurements. Interestingly, these estimates not only provide predictions for genetic interactions which are consistent with those obtained using the measured fitness, but they can even significantly improve the accuracy with which one can distinguish functionally-related gene pairs from the non-interacting pairs. The computational approach, in general, enables an efficient exploration and classification of genetic interactions in other studies and systems as well. Introduction The systematic mapping of genetic interactions in biological systems has the potential to provide a better understanding of how genes function as networks to carry out and regulate cellular processes. In particular, recent advances in the experimental technologies which allow for the large-scale screening of the effects of combinatorial gene deletions are providing an exciting glimpse into the organization of complex genetic networks in terms of revealing novel interacting cellular components and compensatory pathways involved in many cell functions. Comprehensive maps of genetic interactions in model organisms, such as yeast, may also provide a valuable template for understanding the basic principles underlying the relationships between genotype and phenotype in other populations [1]. In humans, genetic interactions are involved in many complex phenotypes and they contribute to most genetic disorders, but the organization of the underlying networks is largely unknown [2], [3]. Due to their combinatorial nature, the mapping of genetic interactions is highly labor-intensive even in genetically amenable organisms. Efficient computational frameworks are therefore required to underpin the full potential of these experiments. Several large-scale studies, especially in yeast Saccharomyces cerevisiae, have already identified a number of synthetic lethal interactions, in which a combination of two individually non-lethal mutations results in lethality [1], [4]. Genome-wide screening strategies for synthetic sick or lethal interactions, such as those based on synthetic genetic arrays (SGA) or the diploid synthetic lethality analysis by microarray (dSLAM), have successfully been used for providing insights into the nature of genetic robustness [5], and for identifying functional relationships among the genes and pathways [6]. In addition to this rather limited spectrum of observed phenotypes (synthetic sick/lethal vs. non-interacting pairs), quantitative phenotypes, such as the relative growth rate of yeast colonies, have recently been explored systematically using high-throughput screening approaches like epistatic miniarray profiling (E-MAP) and genetic interaction mapping (GIM) [7], [8]. The importance of measuring a broader spectrum of genetic interactions when identifying functionally-related genes and pathway organizations has been demonstrated in theoretical and experimental studies [9]–[11]. To provide reliable information on genetic interactions, customized data handling and pre-processing pipelines have been developed for the different screening approaches [12]–[13]. Regardless of the experimental technology used, the screening strategies aim to quantify the extent to which a mutation in one gene modulates the phenotype (or fitness) associated with altering the second gene, either by explicitly measuring or analytically comparing the observed fitness of double-mutants to those of single-mutants. More formally, a genetic interaction between mutants i and j can be defined by the deviation (εij) of an observed double-mutant phenotype (Pij) from the expected neutral phenotype of an organism's fitness (Eij) under the hypothesis that it carries two non-interacting mutations (the null hypothesis). If the fitness Pij is evaluated in terms of the growth rate of double-mutant wij, relative to the wild-type growth rate, and Eij is a function g(wi, wj) of the relative single-mutant fitness values wi and wj, this definition can be formulated as:
= 0). Alleviating interactions, in which the double-mutant phenotype is less severe than expected, can occur, for example, when the first mutation already impairs the function of a whole pathway and thereby masks the effect of the second mutation in the same pathway.Recently, Mani et al. [14] demonstrated that the product function g(wi, wj) = wiwj provides a convenient null model in the sense that it yields a distribution with location close to zero and low dispersion over all of the measured deviations. The comparison was based on the principle that, as the vast majority of gene pairs should be non-interacting, the rare gene pairs sharing a specific function should be distinguishable from this background distribution as outlying cases with extreme absolute deviations. Accordingly, it was shown that the observed deviations based on the multiplicative null model were indeed most accurate at identifying functional relationships between the genes [14]. In the present study, we asked two follow-up questions: (1) whether the observed deviations could be estimated directly from the double-mutant phenotypes under the multiplicative model; and (2) whether the prediction of specific functional links could be made at a similar accuracy without utilizing the measurements of single-mutant phenotypes. Under the assumptions that significant genetic interactions are rare and that the multiplicative model is a reasonable approximation in the case of no interaction, we developed a sequential approach that enables a multi-resolution approximation of the double-mutant fitness matrix to address these particular questions, and more generally, to provide a computational framework for exploring genetic interaction datasets.Results Estimating single-mutant fitness values As an initial study objective, we sought to assess the accuracy to which the single-mutant fitness vector could be estimated directly from the double-mutant fitness matrix . In the quantitative genetic interaction dataset of St Onge et al. [10], which was used in the following results, w is a 26-dimensional column vector and W is a 26×26-dimensional symmetric matrix. Under the multiplicative null model, Eq. 1 takes the form:
is the tensor product (or outer-product) of the vector w with itself, and E is the n×n-matrix comprising the εij values as its elements for each gene pair i, j = 1,2,…,n. In the ideal case, when there are no measurement inaccuracies, the approximation problem of Eq. 2 could easily be solved using the well-established machinery of linear algebra. More precisely, using the spectral decomposition theorem, one can represent any symmetric real matrix as , where λ is the largest eigenvalue of W and e is the corresponding eigenvector [15]. Under the unrealistic assumption that there are no genetic interactions among any of the gene pairs, the approximation would in fact be exact, that is, the residual error E equals zero. However, as the genetic interaction screens are bound to present with experimental variation, missing data, and hopefully also with significant genetic interactions, the estimation problem must in practice be solved by numerical means.In the present work, we developed a sequential matrix approximation procedure, which uses increasingly larger subsets of mutation pairs in W to provide a series of estimates for w as solutions of the weighted least squares problem [16]. During the first rounds, the procedure solves the approximation problem of Eq. 2 using only those mutation pairs that best fit the multiplicative model, and then gradually extends the subset to also include pairs with larger residual errors (see Methods for details). Already when using all but the diagonal and missing entries of the double-mutant fitness matrix in the dataset of St Onge et al. [10], we obtained an estimate relatively close to the actual measured single-mutant fitness vector, as compared to the conventional median estimate (Figure 1A
Predicting specific functional relationships Beyond the dynamic variability in the estimates of the single-mutant fitness values, the behavior of the approximation procedure with different subsets of mutation pairs revealed another interesting observation: the order in which the mutation pairs were added into the approximation process reflects on average the relative order of their actual measured deviations, even if the measurements of single-mutant fitness were not employed (Figures 1C and D obtained from the approximation procedure were highly correlated (Pearson correlation equals 0.964, see Figure S2). This led us to investigate whether such procedure-ranked deviations could be used instead of the measured deviations when predicting functional links between the mutations. To this end, we took the same set of gene pairs which were found to have a highly specific shared function in the previous study by Mani et al. [14] (see Methods for their definition). Interestingly, the majority of the mutation pairs selected towards the end of the approximation procedure shared a specific function (Figure 2 values was significantly higher than expected (p<10−11), whereas the functional enrichment was exceptionally low among the 50 pairs with the lowest values (p = 0.998). These results show that the sequential matrix approximation procedure gives as its by-product a ranking of the mutation pairs that is in good agreement with their likelihood of sharing a specific function.
To test more systematically whether the prediction of the functional relationships could be made to an accuracy similar to that obtained when using the measured single-mutant fitness values, we assessed how well the ranking based on values can discriminate between gene pairs with or without a specific functional link. Similarly as in the original study of St Onge et al. [10], the prediction capability was evaluated using the receiver operating characteristic (ROC) curves that show the relative trade-off between the sensitivity and specificity of the predictions at multiple decision thresholds (Figure 3 = 0.780 vs. AUC = 0.662, p = 0.003). The prediction capability was improved systematically at each specificity-level, demonstrating that the procedure could distinguish with high accuracy the functionally related gene pairs over the whole spectrum of the exceptional deviations (Figure 3 = 0.789, p = 0.002). The prediction accuracy of the double-mutant fitness values alone was similar to that of a random classifier (AUC = 0.506), demonstrating that the normalization by the measured or estimated single-mutant fitness values can, in any case, improve the prediction of the functional links. The estimates achieved using different subsets of mutation pairs can also lead to different degrees of prediction accuracy (Table 1).
Distribution of the estimated deviations Finally, we investigated how the estimated deviations obtained from the approximation procedure at different subset sizes, k, are distributed relative to the true deviations εij obtained when using the measured single-mutant fitness values. When comparing the different distributions, we used the same interpretation rule as in the earlier comparison by Mani et al. [14]: an ideal definition of genetic interaction should result in a tight distribution (indicating low variability) that is centered at zero (indicating low bias) for the bulk of the measured interactions (reflecting the background distribution of non-interacting genes). The subset of mutation pairs being used in the approximation process had a marked effect on the distribution of the estimated deviations (Figure 4 = 317, which gave the most accurate estimates for the single-mutant fitness values resulted in a relatively weak prediction accuracy for the functional links (AUC = 0.679). Although the different measures of location and dispersion gave relatively similar results, the trimmed mean and interquartile range identified the distribution with least background bias and variability at k = 136 (Figure 4 = 0.810). These results indicate that the distribution characteristics of the estimated deviations could serve as a guide to choosing the optimal subset of gene pairs for defining genetic interactions in a given dataset. Both the optimal subset size and the overall performance of the method is likely to depend on the properties of the dataset being analysed, including the number of gene pairs and and the degree of their functional homogeneity.
To investigate how the differences in the distributions are visible in the conventional heat map visualizations, we displayed the color-coded deviations on a two-dimensional grid spanned by the individual mutations. Here, a special emphasis was placed on analyzing the estimated deviations , which provided the most ideal definition of genetic interactions in terms of both distribution characteristics and predictive power, relative to the measured deviations εij (Figure 5
Discussion The growing availability of large-scale genetic interaction datasets is enabling computational methods to systematically explore how genes interact to produce phenotypes on a global network-level. While these datasets yield an unprecedented insight into the organization and function of complex genetic networks, their analysis also poses many challenging computational problems. Using a high-resolution screen of genetic interactions in yeast as an example dataset of St Onge et al. [10], we have demonstrated that the computational approach based on sequential matrix approximations facilitates extraction of pertinent information from the background variation. A key finding of the present work is that the double-mutant fitness matrix alone carries enough information for accurate estimation of the single-mutant fitness values and for prediction of functional relationships among the genes. This makes it possible to avoid performing the single-mutant growth experiments, without compromising - if not even improving – the functional prediction power encoded in the double-mutant measurements. Surprisingly, the subset of mutation pairs which gave the most accurate estimates for the single-mutant fitness values did not lead to the most accurate predictions of the functional links. This may be due to experimental variability, such as differences in growth or screening conditions when measuring the strains carrying either single or double mutations, which may be beyond the capacity of the standard data pre-processing but can be normalized by the sequential approximation procedure. Other possible explanations for this surprising observation include biases in the definition of the gene pairs sharing a specific function or in the targeted set of genes pairs chosen for the particular interaction screen. Further study is therefore needed to confirm whether similar results can be obtained also in larger genetic interaction datasets, in which genes with much wider variety of functional categories are studied. Limitations and extensions of the procedure Perhaps the biggest technical limitation of the present work concerns the heuristic way in which the subsets of mutation pairs were selected for the approximation of the double-mutant fitness matrix. The greedy subset selection scheme was motivated by a similar approach successfully being used in many feature selection problems [17]. An adaptive version of such a forward floating selection method was applied here because of its low computational complexity and because it was capable of excluding the most prominent outliers during the sequential approximation process (Fig. 1 Future applications and research directions In spite of the above mentioned technical limitations, the present results support the feasibility of the approximation framework for systematic exploration of genetic interaction data, and warrant its applications to larger-scale datasets, such as those generated with the E-MAP or GIM screens, to confirm whether similar findings can be extrapolated to the genetic interactions data derived from high-throughput technologies. Other quantitative phenotypes or experimental techniques for defining or measuring genetic interactions could, in principle, also be used, although certain modifications will be needed to adapt the procedure to the specific characteristics of each genetic interaction screen. In the larger-scale screens, the gene pairs under analysis can be selected more randomly among a wider range of functions, thus increasing the expected proportion of non-interacting pairs. Accordingly, the more interactions there are being measured, the better the assumptions behind the multiplicative model will be justified, provided that significant genetic interactions remain relatively rare. The many missing values typically occurring in the large-scale screens should not pose a major problem for our approach either, due to its sequential nature being able to adapt to those subsets of double-mutant measurements with the best approximation power. Hence, the approximation approach is likely to yield even better results with larger and unbiased datasets. Similarly, even if the assumption of low frequency of significant interactions may become compromised in more targeted studies, such as the one of St Onge et al, this should not have a major effect on the results as the strongest genetic interaction pairs are effectively filtered out in the sequential estimation process. For such smaller-scale and more targeted genetic interaction studies, a further increase in the performance could be obtained by modifying the null model for non-interacting pairs to take into account the multitude of single-mutants affecting the particular double-mutant fitness value. This is one of the modeling challenges which we aim to tackle in the future. Integrative analysis together with other data sources More generally, the computational approach based on the sequential matrix approximation can provide a principled framework for exploring and classifying genetic networks and interactions using a wide spectrum of global data sources, including the localization, mRNA or protein expression, physical interaction and functional annotation of the proteins encoded by the genes [19]. It has previously been demonstrated that physical protein-protein interactions, in particular, provide useful information that is, by and large, complementary to that obtained from the functional genetic interactions [6], [20]. To reveal the modular structure of the underlying networks and functional organization the multitude of pathways reflected in such large-scale data types, various network partitioning methods have been used to detect either densely- or similarly-connected clusters as well as significantly-repeated motifs in the individual or integrated interaction networks [21]–[24]. However, many open questions still remain about the integrative analysis strategy of these datasets and the most meaningful interpretation of their results. For instance, the extent to which the genetic interaction could be explained by the other information sources, such as protein-protein, protein-DNA, metabolic network and protein structure data [25]–[28], and how these should be efficiently employed when scoring genetic interactions using measures such as the pairwise deviations, the S- and COP-scores, or the correlation and congruence between the interaction patterns [9]–[12], [27]–[29]. Finally, the success of any computational approach for constructing genetic interaction networks is likely to be driven by parallel improvements in the experimental technologies, such as enabling measurement of phenotypic effects in response to the mutation of more than two genes in combination. Materials and Methods Approximating the double-mutant fitness matrix For a given subset of mutation pairs, we calculated the matrix approximation of Eq. 2 using the decomposition algorithm by De Leeuw [16]. Formally, we solved the following weighted least-squares optimization problem at each step l of the procedure
which were involved in the approximation process at the lth step of the procedure. Although the general formulation in Eq. 3 allows for an element of C(l) to be any non-negative number, we used binary weights only; in particular, we set the diagonal entries for all i, and for the missing data points, at each step l. Throughout the operation of the approximation procedure, we ensured that the weight matrix remained symmetric.In the dataset of St Onge et al. [10], there were two missing values because of genetic linkage between the gene pairs involved in the double-deletion strains rad57Δrad61Δ and rad55Δshu2Δ. Selecting subsets of mutation pairs for estimation The subset of mutation pairs used at a particular step of the approximation process was encoded in the binary weight matrix C(l), that is, if and only if the mutation pair (i, j) is used at step l. To select increasingly large subsets of non-missing and non-diagonal mutation pairs from the double-mutant fitness matrix W, we adapted the floating search method of Pudil et al. [17]. The sequential subset search method is characterized by a dynamically changing number of features included or eliminated at each step. In our implementation, the residual errors were used as the criterion function for adding or deleting mutation pairs. The operation of the forward-type subset selection scheme was organized through the following steps:
We modified the general subset search method by making the deletion of pairs an adaptive process over the evolution of the subset search. More specifically, the threshold t(l) used in the conditional exclusion step 3c is multiplied by 1.5 each time the pair selected for deletion is the same as that added in step 3a. This modification enabled the forward-type algorithm to recover from poor starting tolerance values (we used t(0) = 2×10−5 in the present work) or from a poor initialization of the weight matrix (see the next subsection for details), and it also made it possible to increase the size k of the subsets up to the maximal size K. Figure 1 = 323 mutation pairs in the dataset of St Onge et al. [10].Initialization of the weight matrix from the data The approximation algorithm requires the weight matrix C(l) to be non-singular at each stage of the procedure. Therefore, we cannot start the sequential approximation procedure from the empty subset of mutation pairs, but instead must adjust the initialization C(0) = 0 for the dataset under analysis. After calculating the initial residuals from the whole double-mutant matrix, three types of adjustments were made in the present study: (i) for each i, we set for the pair (i, j) with the smallest residual to make all of the rows in C(0) non-zero; (ii) we made an additional setting of for each identical row pair to reduce the linear dependence among the rows of C(0); and finally (iii) we set pairs (i, j) with to 1 in the increasing order of their residuals until the determinant of C(0) became non-zero. Every time a pair (i, j) was added in the initialization steps (i)–(iii), we also set its transpose entry to one, that is, , to keep the weight matrix symmetric. In the dataset of St Onge et al. [10], this resulted in an initial weight matrix C(0) with 28 entries of ones.Sequential estimation of the pairwise deviations The measured deviations εij = wij−wiwj were estimated in two different ways. The sequential approximation procedure gives as its by-product a surrogate for the deviations in the form of the residual errors of Eq. 4. More specifically, we defined the ranked deviation of a mutation pair (i, j) as its residual error at the last step l during which the pair was included into the approximation subset. In this way, even when there are multiple inclusions and deletions of a particular pair during the procedure, we obtained an unambiguous ranking of the pairs according to their values. This ranking and the corresponding ranked deviations are shown in Figure 1The second set of estimates was obtained by stopping the sequential approximation procedure at a given subset size, k, and by using the estimate of Eq. 3 in place of the measured single-mutant fitness vector w in the definition of the deviation in Eq. 1. For a mutation pair (i, j), this resulted in a sequence of estimated deviations for step l at which the size of the approximation subset equals k. Note that since the residual errors in Eq. 4 are updated each time that a new pair is added, the estimated deviations can vary considerably as a function of k. These estimates are not generally congruent with the ranked deviations. The distributions of the estimated deviations with different subset sizes k are illustrated in Figure 4Quantitative genetic interaction measurements To evaluate the performance of the sequential matrix approximation procedure in practice, we applied it to a recent high-resolution genetic interaction study of St Onge et al. [10]. This particular study was chosen because it contains quantitative growth-rate measurements of both single- and double-mutant cell populations for a targeted set of 26 genes related to DNA repair in yeast S. cerevisiae. The detailed time course fitness measurements were performed in the presence and absence of the DNA-damaging agent methyl methaneusulfonate (MMS). The results of the present study were based on the growth measurements in the absence of MMS. The prediction of functionally related gene pairs was in fact more challenging in this case than in the data measured in the presence of MMS. The measured and estimated single-mutant fitness values and the double-deletion deviations in the dataset are shown in Figure S1 and Figure 5 Defining gene pairs sharing a specific function Functional links among the 26 genes were defined using the same approach as in many previous genetic interaction studies [6], [10], [14]. Briefly, a term in the Biological Process branch of the Gene Ontology was considered specific if it was associated with fewer than 30 yeast genes, and two genes were considered to have a specific functional relationship if they shared any of those specific terms [14]. This resulted in the set of 35 specific functional links in the dataset of St Onge et al. [10]. Statistical evaluation of the predictive power Statistical enrichment of the specific functional links among a set of mutation pairs selected by the sequential approximation procedure was assessed using the standard hypergeometric test. Briefly, if t is the number of top mutation pairs selected according to their residual errors, and M is the total number of the functionally-related links, then the probability of obtaining at least m functionally-related pairs when selecting pairs at random from the set of K mutation pairs can be calculated using the cumulative distribution function:
The enrichment for the M = 35 functionally-linked pairs among the t = 50 mutation pairs selected on the basis of either small (m = 1) or large (m = 22) residual errors is shown in Figure 2 = 35/323 = 0.108.The predictive power of the measured and estimated deviations was assessed using the receiver operating characteristic (ROC) curves that characterize the relative trade-off between the true positive rate (sensitivity) and the false positive rate (1 - specificity). The overall predictive performance was summarized using the area under the ROC curve (AUC). The statistical significance of the difference in the AUC values between two genetic interaction measures was assessed using a custom written algorithm based on the method of DeLong et al. [30]. This nonparametric method uses the theory of generalized U-statistics to calculate an estimated covariance matrix and hence it can also take into account the correlated nature of the data. The ROC curves and the corresponding AUC values for the prediction of the 35 functional links in the dataset of St Onge et al. [10] are shown in Figure 3 Figure S1 Estimated vs. measured single-mutant fitness values. The comparison is shown both as histogram and scatter-plot. The two fitness values were highly correlated (Pearson correlation equals 0.952 and the offset and slope of the best fit line are 0.0429 and 0.960, respectively). The estimated values were calculated at the cut-off point k = 317, in which the approximation procedure used all but the diagonal and missing entries of the double-mutant fitness matrix and also omitted those six pairs with the most extreme residual errors (the five synthetic lethal mutations and one plausible synergistic mutation pairs). This point can approximately be identified from the sharp increase in the trace of approximation error (Figure 1B(0.01 MB PDF) Click here for additional data file.(14K, pdf) Figure S2 Scatter-plot between the measured and ranked deviations. The ranked deviations were highly correlated with the true measured deviations over all of the mutation pairs (Pearson correlation equals 0.964). The inset shows the five synthetic lethal mutation pairs. The dotted diagonal line corresponds to the one-to-one correspondence between the two deviations. (0.01 MB PDF) Click here for additional data file.(13K, pdf) Figure S3 Distributions of the measured and estimated deviations. The non-scaled version of the Figure 4 (0.02 MB PDF) Click here for additional data file.(21K, pdf) Acknowledgments The authors thank Dr. Arho Virkki for modifying the R-code of the decomposition method for the purposes of the present study and Dr. Milla Kibble for the language review of the manuscript. Footnotes Competing Interests: The authors have declared that no competing interests exist. Funding: The Academy of Finland (grant 120569) and the Graduate School in Computational Biology, Bioinformatics and Biometry (ComBi). The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript. References 1. Boone C, Bussey H, Andrews BJ. Exploring genetic interactions and networks with yeast. Nat Rev Genet. 2007;8:437–449. [PubMed] 2. Badano JL, Katsanis N. Beyond Mendel: an evolving view of human genetic disease transmission. Nat Rev Genet. 2002;3:779–789. [PubMed] 3. Lehner B. Modelling genotype-phenotype relationships and human disease with genetic interaction networks. J Exp Biol. 2007;210:1559–1566. [PubMed] 4. Ooi SL, Pan X, Peyser BD, Ye P, Meluh PB, Yuan DS, Irizarry RA, Bader JS, Spencer FA, Boeke JD. Global synthetic-lethality analysis and yeast functional profiling. Trends Genet. 2006;22:56–63. [PubMed] 5. Hartman JL, Garvik B, Hartwell L. Principles for the buffering of genetic variation. Science. 2001;291:1001–1004. [PubMed] 6. Tong AH, Lesage G, Bader GD, Ding H, Xu H, et al. Global mapping of the yeast genetic interaction network. Science. 2004;303:808–813. [PubMed] 7. Schuldiner M, Collins SR, Thompson NJ, Denic V, Bhamidipati A, et al. Exploration of the function and organization of the yeast early secretory pathway through an epistatic miniarray profile. Cell. 2005;123:507–519. [PubMed] 8. Decourty L, Saveanu C, Zemam K, Hantraye F, Frachon E, et al. Linking functionally related genes by sensitive and quantitative characterization of genetic interaction profiles. Proc Natl Acad Sci U S A. 2008;105:5821–5826. [PubMed] 9. Segrè D, Deluna A, Church GM, Kishony R. Modular epistasis in yeast metabolism. Nat Genet. 2005;37:77–83. [PubMed] 10. St Onge RP, Mani R, Oh J, Proctor M, Fung E, et al. Systematic pathway analysis using high-resolution fitness profiling of combinatorial gene deletions. Nat Genet. 2007;39:199–206. [PubMed] 11. Collins SR, Miller KM, Maas NL, Roguev A, Fillingham J, et al. Functional dissection of protein complexes involved in yeast chromosome biology using a genetic interaction map. Nature. 2007;446:806–810. [PubMed] 12. Collins SR, Schuldiner M, Krogan NJ, Weissman JS. A strategy for extracting and analyzing large-scale quantitative epistatic interaction data. Genome Biol. 2006;7:R63. [PubMed] 13. Pan X, Yuan DS, Ooi SL, Wang X, Sookhai-Mahadeo S, et al. dSLAM analysis of genome-wide genetic interactions in Saccharomyces cerevisiae. Methods. 2006;41:206–221. [PubMed] 14. Mani R, St Onge RP, Hartman JL, Giaever G, Roth FP. Defining genetic interaction. Proc Natl Acad Sci U S A. 2008;105:3461–3466. [PubMed] 15. Axler S. Linear Algebra Done Right. Second Edition. New York: Springer-Verlag; 1997. 16. De Leeuw J. A Decomposition Method for Weighted Least Squares Low-rank Approximation of Symmetric Matrices. Department of Statistics Papers, Paper 2006041602. 2006. Available: http://repositories.cdlib.org/uclastat/papers/2006041602. Accessed: December 2007. 17. Pudil P, Novovičová J, Kittler J. Floating search methods in feature selection. Pattern Recognition Lett. 1994;15:1119–1125. 18. Drees BL, Thorsson V, Carter GW, Rives AW, Raymond MZ, et al. Derivation of genetic interaction networks from quantitative phenotype data. Genome Biol. 2005;6:R38. [PubMed] 19. Wong SL, Zhang LV, Tong AH, Li Z, Goldberg DS, et al. Combining biological networks to predict genetic interactions. Proc Natl Acad Sci USA. 2004;101:15682–15687. [PubMed] 20. Beyer A, Bandyopadhyay S, Ideker T. Integrating physical and genetic maps: from genomes to interaction networks. Nat Rev Genet. 2007;8:699–710. [PubMed] 21. Bader GD, Hogue CW. An automated method for finding molecular complexes in large protein interaction networks. BMC Bioinformatics. 2003;4:2. [PubMed] 22. Taylor RJ, Siegel AF, Galitski T. Network motif analysis of a multi-mode genetic-interaction network. Genome Biol. 2007;8:R160. [PubMed] 23. Hintze A, Adami C. Evolution of complex modular biological networks. PLoS Comput Biol. 2008;4:e23. [PubMed] 24. Bandyopadhyay S, Kelley R, Krogan NJ, Ideker T. Functional maps of protein complexes from quantitative genetic interaction data. PLoS Comput Biol. 2008;4:e1000065. [PubMed] 25. Kelley R, Ideker T. Systematic interpretation of genetic interactions using protein networks. Nat Biotechnol. 2005;23:561–566. [PubMed] 26. Ulitsky I, Shamir R. Pathway redundancy and protein essentiality revealed in the Saccharomyces cerevisiae interaction networks. Mol Syst Biol. 2007;3:104. [PubMed] 27. Ulitsky I, Shlomi T, Kupiec M, Shamir R. From E-MAPs to module maps: dissecting quantitative genetic interactions using physical interactions. Mol Syst Biol. 2008;4:209. [PubMed] 28. Dror O, Schneidman-Duhovny D, Shulman-Peleg A, Nussinov R, Wolfson HJ, Sharan R. Structural similarity of genetically interacting proteins. BMC Syst Biol. 2008;2:69. [PubMed] 29. Ye P, Peyser BD, Pan X, Boeke JD, Spencer FA, Bader JS. Gene function prediction from congruent synthetic lethal interactions in yeast. Mol Syst Biol. 2005;1:26. 30. DeLong ER, DeLong DM, Clarke-Pearson DL. Comparing the areas under two or more correlated receiver operating characteristic curves: a nonparametric approach. Biometrics. 1988;44:837–845. [PubMed] |
PubMed related articles
Your browsing activity is empty. Activity recording is turned off. |
|||||||||||||||||||||||||
Nat Rev Genet. 2007 Jun; 8(6):437-49.
[Nat Rev Genet. 2007]Nat Rev Genet. 2002 Oct; 3(10):779-89.
[Nat Rev Genet. 2002]J Exp Biol. 2007 May; 210(Pt 9):1559-66.
[J Exp Biol. 2007]Nat Rev Genet. 2007 Jun; 8(6):437-49.
[Nat Rev Genet. 2007]Trends Genet. 2006 Jan; 22(1):56-63.
[Trends Genet. 2006]Science. 2001 Feb 9; 291(5506):1001-4.
[Science. 2001]Science. 2004 Feb 6; 303(5659):808-13.
[Science. 2004]Cell. 2005 Nov 4; 123(3):507-19.
[Cell. 2005]Proc Natl Acad Sci U S A. 2008 Mar 4; 105(9):3461-6.
[Proc Natl Acad Sci U S A. 2008]Nat Genet. 2007 Feb; 39(2):199-206.
[Nat Genet. 2007]Nat Genet. 2007 Feb; 39(2):199-206.
[Nat Genet. 2007]Genome Biol. 2006; 7(7):R63.
[Genome Biol. 2006]Proc Natl Acad Sci U S A. 2008 Mar 4; 105(9):3461-6.
[Proc Natl Acad Sci U S A. 2008]Nat Genet. 2007 Feb; 39(2):199-206.
[Nat Genet. 2007]Proc Natl Acad Sci U S A. 2008 Mar 4; 105(9):3461-6.
[Proc Natl Acad Sci U S A. 2008]Proc Natl Acad Sci U S A. 2008 Mar 4; 105(9):3461-6.
[Proc Natl Acad Sci U S A. 2008]Nat Genet. 2007 Feb; 39(2):199-206.
[Nat Genet. 2007]Genome Biol. 2005; 6(4):R38.
[Genome Biol. 2005]Nat Genet. 2005 Jan; 37(1):77-83.
[Nat Genet. 2005]Proc Natl Acad Sci U S A. 2008 Mar 4; 105(9):3461-6.
[Proc Natl Acad Sci U S A. 2008]Proc Natl Acad Sci U S A. 2004 Nov 2; 101(44):15682-7.
[Proc Natl Acad Sci U S A. 2004]Science. 2004 Feb 6; 303(5659):808-13.
[Science. 2004]Nat Rev Genet. 2007 Sep; 8(9):699-710.
[Nat Rev Genet. 2007]BMC Bioinformatics. 2003 Jan 13; 4():2.
[BMC Bioinformatics. 2003]PLoS Comput Biol. 2008 Apr 18; 4(4):e1000065.
[PLoS Comput Biol. 2008]Nat Genet. 2007 Feb; 39(2):199-206.
[Nat Genet. 2007]Nat Genet. 2007 Feb; 39(2):199-206.
[Nat Genet. 2007]Nat Genet. 2007 Feb; 39(2):199-206.
[Nat Genet. 2007]Nat Genet. 2007 Feb; 39(2):199-206.
[Nat Genet. 2007]Nat Genet. 2007 Feb; 39(2):199-206.
[Nat Genet. 2007]Nat Genet. 2007 Feb; 39(2):199-206.
[Nat Genet. 2007]Science. 2004 Feb 6; 303(5659):808-13.
[Science. 2004]Nat Genet. 2007 Feb; 39(2):199-206.
[Nat Genet. 2007]Proc Natl Acad Sci U S A. 2008 Mar 4; 105(9):3461-6.
[Proc Natl Acad Sci U S A. 2008]Biometrics. 1988 Sep; 44(3):837-45.
[Biometrics. 1988]Nat Genet. 2007 Feb; 39(2):199-206.
[Nat Genet. 2007]