![]() | ![]() |
Formats:
|
|||||||||||||||||||||||||||||||||||||||||||||||||||
Copyright : © 2008 Levy and Siegal This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited. Network Hubs Buffer Environmental Variation in Saccharomyces cerevisiae Center for Genomics and Systems Biology, Department of Biology, New York University, New York, New York, United States of America Andre Levchenko, Academic Editor Johns Hopkins University, United States of America * To whom correspondence should be addressed. E-mail: sasha.levy/at/nyu.edu (SFL); Email: mark.siegal/at/nyu.edu (MLS) Received March 12, 2008; Accepted September 16, 2008. This article has been cited by other articles in PMC.Abstract Regulatory and developmental systems produce phenotypes that are robust to environmental and genetic variation. A gene product that normally contributes to this robustness is termed a phenotypic capacitor. When a phenotypic capacitor fails, for example when challenged by a harsh environment or mutation, the system becomes less robust and thus produces greater phenotypic variation. A functional phenotypic capacitor provides a mechanism by which hidden polymorphism can accumulate, whereas its failure provides a mechanism by which evolutionary change might be promoted. The primary example to date of a phenotypic capacitor is Hsp90, a molecular chaperone that targets a large set of signal transduction proteins. In both Drosophila and Arabidopsis, compromised Hsp90 function results in pleiotropic phenotypic effects dependent on the underlying genotype. For some traits, Hsp90 also appears to buffer stochastic variation, yet the relationship between environmental and genetic buffering remains an important unresolved question. We previously used simulations of knockout mutations in transcriptional networks to predict that many gene products would act as phenotypic capacitors. To test this prediction, we use high-throughput morphological phenotyping of individual yeast cells from single-gene deletion strains to identify gene products that buffer environmental variation in Saccharomyces cerevisiae. We find more than 300 gene products that, when absent, increase morphological variation. Overrepresented among these capacitors are gene products that control chromosome organization and DNA integrity, RNA elongation, protein modification, cell cycle, and response to stimuli such as stress. Capacitors have a high number of synthetic-lethal interactions but knockouts of these genes do not tend to cause severe decreases in growth rate. Each capacitor can be classified based on whether or not it is encoded by a gene with a paralog in the genome. Capacitors with a duplicate are highly connected in the protein–protein interaction network and show considerable divergence in expression from their paralogs. In contrast, capacitors encoded by singleton genes are part of highly interconnected protein clusters whose other members also tend to affect phenotypic variability or fitness. These results suggest that buffering and release of variation is a widespread phenomenon that is caused by incomplete functional redundancy at multiple levels in the genetic architecture. Author Summary Most species maintain abundant genetic variation and experience a wide range of environmental conditions, yet phenotypic differences between individuals are usually small. This phenomenon, known as phenotypic robustness, presents an apparent contradiction: if biological systems are so resistant to variation, how do they diverge and adapt through evolutionary time? Here, we address this question by investigating the molecular mechanisms that underlie phenotypic robustness and how these mechanisms can be broken to produce phenotypic heterogeneity. We identify genes that contribute to phenotypic robustness in yeast by analyzing the variance of morphological phenotypes in a comprehensive collection of single-gene knockout strains. We find that ~5% of yeast genes break phenotypic robustness when knocked out. The products of these genes tend to be involved in critical cellular processes, including maintaining DNA stability, processing RNA, modifying proteins, and responding to stressful environments. These genes tend to interact genetically with a large number of other genes, and their products tend to interact physically with a large number of other gene products. Our results suggest that loss of phenotypic robustness might be a common phenomenon during evolution that occurs when cellular networks are disrupted. Introduction The relationship between genotype and phenotype is a central concern of many fields, from developmental biology to human genetics to evolutionary biology. Although in most cases this relationship is poorly understood, some general properties do seem to be shared across diverse systems. Chief among these is robustness to genetic and environmental variation [1]. That is, most species maintain abundant genetic variation and experience a wide range of environmental conditions, yet phenotypic variation is relatively low [2]. Because of its ubiquity, phenotypic robustness, also termed canalization or buffering, is worthy of study in its own right [3]. It also presents an apparent contradiction: if biological systems are so robust, how do they diverge and adapt through evolutionary time? The contradiction might be resolved if the robustness itself were to be modulated by particular mutations or environmental conditions. The robust system would accumulate conditionally neutral, or “cryptic,” genetic variation. A genetic or environmental perturbation that impaired the system's robustness would then reveal the cryptic variation in the form of greater phenotypic diversity. The modulation of robustness would not only allow evolutionary divergence, but it might also accelerate it relative to the slow, step-wise fixation of fitness-increasing alleles that is normally considered within the Neodarwinian paradigm [3–5]. It is therefore essential to investigate mechanisms that contribute to the robustness of biological systems, and to understand how such mechanisms determine the phenotypic effects of different sources of variation. A model for the buffering and release of variation is provided by the molecular chaperone Hsp90, which targets a large set of signal transduction proteins. In both Drosophila and Arabidopsis, compromised Hsp90 function results in diverse morphological changes that exhibit strong dependence on the genetic background [6,7]. This implies that Hsp90 normally contributes to phenotypic robustness to genetic variation. Because Hsp90 function allows stores of genetic variation to build up, and Hsp90 impairment releases this variation to have phenotypic effects, it is termed a “phenotypic capacitor” [7]. Controversy surrounds the evolutionary relevance of Hsp90-mediated capacitance and any similar mechanisms that might exist. One issue is whether any fraction of the phenotypic variation revealed by an impaired capacitor is adaptive, or instead whether the variants consist entirely of hopeful, but ultimately unfit, monsters [8,9]. The major morphological defects seen originally in flies [7] support the latter conclusion, yet selection for one such defect did not cause correlated fitness costs, suggesting that the pleiotropic effects of Hsp90 impairment are modular and not unconditionally deleterious [10]. The variation seen in Arabidopsis also supports the potential adaptive value of capacitance, in that this variation is considerably less monstrous than that seen in flies [6]. A more systematic analysis of the phenotypic and fitness effects of capacitor impairment is needed to resolve this issue. A second debate concerns the ultimate evolutionary reason that capacitance exists. One view is that the ability to modulate evolvability is itself an adaptive trait, and that natural selection has therefore favored capacitor function [11]. This view generally meets with great skepticism, as do similar views on the evolutionary benefits of mechanisms that alter mutation or recombination rates [9]. Nonetheless, a population-genetic model has shown that an allele that modifies the rate of revelation of cryptic genetic variation can invade a population under a realistic range of parameter values [12,13]. Although adaptive evolution of capacitance therefore remains a formal possibility, many favor an alternative view that considers capacitance a side effect of other selected properties. One possibility is that natural selection favors mechanisms that buffer against environmental variation, with environmental variation taken to mean both large macro-environmental differences and stochastic fluctuations in the external micro-environment or internal cellular environment. The buffering of genetic variation then results from a hypothesized mechanistic congruence between the impacts of allelic variation and environmental variation on regulatory networks [14]. Another possibility is that regulatory networks inherently attenuate variation because they contain thresholds and other nonlinearities that allow them to respond properly to internal or external cues [8,15]. Indeed, our own simulations of evolving regulatory networks predicted that many gene products should act as phenotypic capacitors, contributing to phenotypic robustness when present and producing greater phenotypic diversity when absent [16]. The above considerations motivate the development of an experimental system in which many phenotypes can be precisely measured in many individuals, multiple gene products can be screened for capacitor function, and sources of variation can be precisely controlled and partitioned. Here we present such a system, using single-cell morphological phenotypes in the yeast S. cerevisiae. We focus here on the robustness of these phenotypes to environmental variation caused by stochastic fluctuations in a constant macro-environment. While the study of robustness to environmental variation is critical to understanding the development and physiology of organisms, it also lays the foundation for future work that will rigorously test the congruence between mechanisms of environmental and genetic buffering and that will investigate the impact of capacitors on evolutionary trajectories. To identify gene products that contribute to robustness to environmental variation, we take advantage of data from high-dimensional quantitative morphological phenotyping of 4,718 haploid yeast single-gene knockout (YKO) strains [17]. Phenotyping was performed by growing cells in rich media to logarithmic growth phase and triply staining them for the cell surface, actin cytoskeleton, and nuclear DNA. Digital micrographs of ~200 cells per strain were processed using automated image analysis [17], yielding means and variances for 220 diverse quantitative phenotypes for each YKO (Figure 1
Using these data we identify more than 300 gene products required for robustness to environmental variation. We find that these capacitors are involved in a number of critical cellular processes and that they are highly connected, in terms of both physical and genetic interaction networks. Despite this centrality, capacitor deletions result on average in decreases in growth rate that could allow these mutants to persist for many generations in the presence of wild-type cells, suggesting that capacitor impairment need not produce unfit monsters. Capacitors encoded by a member of a duplicate gene pair differ in their functional and network properties from those encoded by singletons, suggesting that these two classes of capacitors are likely to buffer environmental variation by different mechanisms. Results We used reported means and variances from quantitative morphological phenotyping of 4,718 haploid YKO strains [17] to identify phenotypic capacitors. Our working definition of a capacitor is a gene product that causes high variance in multiple nonredundant phenotypes when deleted. To identify gene products that meet this criterion, three obstacles must be overcome: (1) a measure of variance that is not dependent on the mean must be generated so as not to confound changes in variance with changes in mean phenotype [18]; (2) biologically or physically redundant phenotypes must be eliminated (dimensional reduction); and (3) a robust score for the overall variance in multiple phenotypes must be generated. We addressed the first obstacle by plotting, for each phenotype, the standard deviation versus the mean of each YKO, fitting a lowess (locally weighted) regression to each plot, and calculating the residual distance of each point from the regression (Figures 1 For dimensional reduction, we used partitioning around medoids (PAM), a robust variant of k-means clustering. By analyzing the average silhouette width, a measure of separation of clusters, we estimated that the 220 original phenotypes can be reduced to 70 representative phenotypes (“medoids”) (Figure S2). To generate a single measure of the overall phenotypic variance when a gene is deleted, we averaged the top 35 (of 70) residuals of standard deviation for each YKO (Table S1). This score, which we term phenotypic potential, was extremely robust to even large changes in the clustering and averaging procedure and did not appear to be biased by potential edge effects in the lowess procedure (Figures S3–S6). To identify genes with significantly higher phenotypic potential scores than expected by chance, we compared the distribution of scores of all YKOs to distributions generated when values within each phenotype are first permuted. Based on this permutation analysis, the 502 genes with the highest phenotypic potential were identified as putative phenotypic capacitors with an estimated false discovery rate (FDR) of 0.34 (Figure 1 Validation of Phenotypic Capacitors We validated our identification of capacitors by repeating the phenotyping of 50 C1 strains and 50 control strains in our lab. Haploid mutants in the YKO library used for the original phenotyping were passaged for an unknown number of generations. Because some knockouts might increase mutation rates and thereby cause phenotypic variability that is not associated with loss of environmental buffering, we used instead a haploid-convertible heterozygous diploid YKO library [19] and kept the number of generations between sporulation and fixation to a maximum of ~50. C1 strains exhibited more phenotypic variability than control strains (Figures 2
As additional validation, we looked for evidence in the literature of capacitors causing increased cell-to-cell variability. Indeed, knockouts of the capacitors CCR4, CLN3, and SWI6 have each been shown to result in increased variability in diploid cell size in liquid media [21]. Knockout of CCR4 also causes irregular colony morphology on solid medium, a finding consistent with increased cell-to-cell variation [22]. Two other members of the CCR4-NOT core complex, NOT5 and POP2, are identified as capacitors in our screen suggesting that disruption of this transcriptional regulatory complex is likely to result in cellular heterogeneity. Additionally, two of three knockouts that increase intrinsic expression noise of the PHO5 promoter are the capacitors ARP8 and SNF6 [23]. Lastly, knockout of the capacitor FUS3 has been shown to increase cell-to-cell variation in response to a pheromone signal [24]. Phenotypic Capacitors Are Enriched in Several Gene Ontology Categories We next investigated the entire set of 502 capacitors for enrichment in Gene Ontology (GO; http://www.geneontology.org/) process terms (Table S2). Phenotypic capacitors are highly enriched in numerous processes, most of which can be broadly categorized into DNA maintenance and organization, cell cycle and cell organization, response to stimuli such as stress, RNA elongation, or protein modification. This diverse set of enriched terms is likely to represent processes that can result in broad cellular changes when disrupted. Specifically, capacitors contain 123 of 565 ORFs annotated with “chromosome organization and biogenesis” and 80 of 275 ORFs annotated with its daughter term, “telomere organization and biogenesis” (p < 2.9 × 10−27 and p < 2.1 × 10−25, respectively, hypergeometric distribution with Bonferroni correction). These include all genes annotated with recombinase activity, the entire telomerase holoenzyme complex, the entire RecQ helicase-Topo III complex, all but one gene from the homologous recombination module, all genes involved in postreplication repair, the entire CTF18m complex involved in sister chromatid cohesion and DNA-replication check-point signaling, the entire MMS22m module thought to be involved in double strand break repair, and both genes of the HEX3m module [25]. Notably, however, other modules that are likely to cause DNA instability are absent, including the nucleotide excision repair module, the DNA damage checkpoint module RAD9m, the MUS81m module involved in cleaving branched DNA, and the TOF1m module involved in promoting sister chromatid cohesion to repair DNA damage [25]. Capacitors also include 28 transcriptional regulators; numerous gene products involved in global mRNA production, including all members of the carboxy-terminal domain protein kinase complex; most members of the THO complex, which is thought to couple transcriptional elongation with mRNA metabolism and export; numerous nuclear pore-associated proteins, including both members of the mRNA export SAC3/THP1 complex; at least seven gene products involved in mRNA splicing; approximately half of the gene products annotated to the cytoplasmic mRNA processing (P) body; genes involved in protein transport and degradation, including three members of the Golgi-localized alpha-1,6-mannosyltransferase complex and eight gene products involved in vacuolar acidification; and genes involved in the control of actin (nine genes) and microtubule (12 genes) organization and in bud emergence or selection. Phenotypic Capacitors Are Likely to Be Network Hubs Protein–protein interaction (PPI) and synthetic lethal interaction (SLI) networks have a small number of highly connected nodes (hubs) and many more poorly connected nodes [26,27]. In PPI networks, deletion of a hub is more likely to be lethal than deletion of other nodes [26]. This finding suggests that genetic properties are, at least in part, traceable to global network architecture. Our numerical simulations of transcriptional networks implied that robustness is likely to be an emergent property of complex networks, and that in some cases network architecture constrains functional and evolutionary properties [15,16,28]. Others have predicted that SLIs are crucial to understanding buffering [29] or that phenotypic capacitors are likely to be hubs [30,31]. Thus, we asked here if a gene's phenotypic potential is traceable to network position. First, we used networks derived both from curated literature citations [32] and from the nearly complete set of affinity-capture mass spectrometry interactions [33,34] to determine if the average PPI degree (number of interactions) of capacitors is different than that of other genes. We found that capacitors have more physical interactions than other nonessential gene products but fewer than essential gene products (Figure 3
A plot of phenotypic potential versus binned PPI degree shows that gene products with a higher connectivity have on average a higher phenotypic potential (Figure 3 Next, we used all SLIs from curated literature citations [32] and, again, find that capacitors have a higher degree than other nonessential genes (Figure 3 Effects of Phenotypic Capacitor Knockouts on Growth Rate We have shown that capacitors are likely to be PPI and/or SLI hubs, that they might be less likely to be completely functionally redundant, and that they are involved in a number of central processes in the cell. At this point, one might ask if disruption of capacitor function has too drastic of an effect on fitness to play a role in adaptation. To investigate this possibility, we asked if capacitors are less dispensable than other nonessential genes by comparing the growth rates of haploid [37] or diploid [38] mutants. Whereas capacitors appear to have no effect on growth rate in the heterozygous diploid, they are less dispensable in haploid and homozygous diploid knockouts, with a larger rate decrease seen in the haploid (Figure 3 Duplicate and Singleton Capacitors Have Different Modes of Functional Redundancy The finding that many of the most highly connected PPI hubs are duplicates but not capacitors (Figure 3
The above finding presents an apparent contradiction: high PPI degree is strongly associated with capacitor identity in duplicates (Figure 4 A more striking difference between capacitor duplicate pairs and other duplicate pairs was evident in the correlations in expression between paralogs across many experimental conditions [39]. Duplicate pairs that contain at least one capacitor have a lower expression similarity on average than duplicate pairs that contain an essential gene or noncapacitor nonessential duplicate pairs (Figure 4 Perhaps because of these expression profile differences, the capacitor gene has on average approximately three times as many SLIs as its paralog (Figure 4 If one assumes that incomplete functional redundancy is also causing high phenotypic variability in the case of deleted singleton capacitors, understanding the mechanism of this process poses a greater challenge because there are no obvious candidate genes that overlap with singleton function. One hypothesis is that redundancy is achieved not at the level of the single gene as is the case for duplicates, but rather at the level of the protein module. Indeed, many singleton capacitors are part of functionally overlapping modules in the DNA integrity network [25]. To test the hypothesis that this is a more general property of singleton capacitors, we examined further their network characteristics. The clustering coefficient is a measure of local network interconnectivity. Singleton capacitors have on average higher clustering coefficients in the PPI network than all nonessential singletons, suggesting that they are interacting with more tightly knit groups of proteins that might represent functional modules (Figure 5
The clustering of singleton capacitors into modules suggests that their other immediate PPI partners, or first-degree neighbors (FDN), might have special network and dispensability characteristics. Because neighbor parameters are not likely to be independent of PPI degree, we first generated FDN scores for each gene product that describe the properties of a gene product's neighbors while controlling for the PPI degree of that gene product (see Materials and Methods). Using these measures, we found that the FDNs of singleton capacitors are more likely to be capacitors, essential genes, or genes with a high SLI degree when compared to all nonessential singletons (Figures 5 Capacitors and Chromosome Stability Approximately one-fourth of identified capacitors are annotated to be involved in maintaining chromosome stability. While this class of genes still meets all of the criteria for singleton capacitors discussed above, one alternative mechanism to explain the increased phenotypic variability in these YKOs is that they are causing mutations or chromosomal aberrations [47]. Because YKOs that cause drastic increases in the spontaneous mutation rate are relatively well defined and rare (see Protocol S1), we concerned ourselves with YKOs that may be causing chromosomal aberrations, most easily measured by rates of gross chromosomal rearrangements (GCRs) [48]. We found that a high rate of GCRs is neither necessary nor sufficient for high phenotypic potential. For example, focusing on the eight genes in the homologous recombination module [25], we found YKOs that cause both high phenotypic potentials and high GCR rates (RAD50, RAD52, RAD57, XRS2), high phenotypic potentials but relatively low GCR rates (RAD51, RAD54), and low phenotypic potentials but high GCR rates (MRE11) [48]. Additionally, we estimated that measured GCR rates are likely to be too low to explain the phenotypic heterogeneity observed in the haploid-converted strains (see Protocol S1). Thus it is unlikely that GCR events alone explain high phenotypic variance. Another source of genetic variation in these lines could be caused by increased insertion rates of transposable elements such as Ty1 [49,50]. However, we again found that increased Ty1 insertion rates are neither necessary nor sufficient for high phenotypic potential (see Protocol S1). It is also possible that these same disrupted processes are changing the mutational spectrum at microsatellites [51,52]. Although not related to environmental buffering, this last possible mechanism is of great evolutionary interest, given recent findings in yeast [53] and dogs [54] of phenotypic variation driven by tandem-repeat length changes. Another possibility is that DNA stability knockouts are causing changes at the telomere [55,56] or elsewhere that heterogeneously activate cell cycle checkpoints [51,57]. Thus, the mechanism by which this subset of capacitors produces phenotypic variability warrants further study. If, however, mutational mechanisms can be excluded, our findings might provide new insight into human malignancies. Many cancers are associated with mutations in genes involved in DNA stability, including orthologs of the capacitors RAD50 [58], RAD54 [59,60], and YAF9 [61], or with genes with a large number of interactions, such as the transcription factor p53 [62]. Our results suggest that the first advantage mutations in these genes might provide to developing malignancies is phenotypic heterogeneity by way of network-associated loss of robustness. Discussion We show that: (1) there are in excess of 300 phenotypic capacitors of environmental variation in S. cerevisiae; (2) capacitors are highly enriched in GO processes that are likely to result in broad cellular changes when disrupted; (3) capacitors tend to be SLI hubs; (4) most capacitor knockouts result in decreases in growth rate that are not severe; (5) capacitor duplicates tend to be PPI hubs that have undergone a relatively ancient duplication event and diverged in expression from their paralogs; and (6) capacitor singletons tend to be part of highly interconnected protein clusters whose members are likely to disrupt robustness or be deleterious when knocked out. Taken together, these findings strongly suggest that loss of phenotypic robustness is a widespread phenomenon that is a consequence of disrupted physical or genetic-interaction networks. The mechanism of this disruption appears to be different in cases of duplicate and singleton capacitors, although in both cases incomplete functional redundancy might be causing phenotypic variability when the capacitor is absent (Figure 6
While previous studies have suggested that phenotypic capacitor function must be responsive to changes in the environment to influence evolutionary trajectories [6,7], the finding that many capacitors cause decreases in growth rate that are not severe offers another plausible mechanism in yeast: A loss-of-function mutation in a capacitor could be maintained indefinitely in a heterozygous diploid with little or no impact on fitness. Sporulation would be promoted under harsh environmental conditions [64] resulting in haploids with phenotypic variability that is dependent on the (previously cryptic) underlying genotype. Because capacitor loss-of-function usually causes decreases in haploid growth rate that are not severe, these cells could persist for many generations without being out-competed by wild-type counterparts. Some genotypes might provide a selective advantage, which, upon successive rounds of sporulation and mating, could become fixed in the population even in the absence of the capacitor loss-of-function mutation [6,7,16,65]. Alternatively, phenotypic heterogeneity in the absence of genotypic variation might produce epigenetic “persistent” phenotypes with increased fitness in some environments, analogous to those described in models of stochastic phenotype switching [66]. The estimate that approximately 30% of wild S. cerevisiae strains persist for some time as haploids (i.e., are heterothallic) [67,68] suggests that these are plausible mechanisms. Several challenges remain, however, to understanding mechanistically how phenotypic robustness impacts evolutionary trajectories. One major challenge is characterizing the relationship between environmental and genetic buffering. Phenotypic capacitors identified here buffer environmental variation. Some have predicted that the same mechanisms that buffer environmental sources of variation will also act to buffer genotypic variation [14]. Evidence in support of this hypothesis is mixed and mostly stems from studies of the molecular chaperone Hsp90, a known capacitor of genetic variation in flies and plants [6,7]. Using fluctuating asymmetry in isogenic Drosophila lines as a measure of robustness to environmental perturbation, Hsp90 was found to buffer environmental variation in some traits [69] but not others [70]. In Arabidopsis, Hsp90 appeared to buffer environmental variation for every trait tested [71]. Results described here provide an experimental system with which to formally test the congruence between the mechanisms of genetic and environmental buffering. Because variability of many quantitative phenotypes can be determined in a high-throughput manner and because we have identified many capacitors of environmental variation, experiments that precisely control and partition different sources of variation can be performed to test if these same gene products contribute to genetic robustness. Another major challenge is determining if the phenotypic heterogeneity that results from disrupted capacitor function in yeast could be advantageous under natural or artificial selection. One indication that this is likely comes from the finding that disrupted capacitor function might provide an advantage in environments that require invasive growth in yeast. A genetic screen found that mutations in 35 genes can promote haploid invasive growth in the ∑1278 genetic background, 27 of which were considered in our study and 12 of which are phenotypic capacitors [72]. Additionally, we found at least one capacitor YKO that promotes robust haploid invasive growth in the S288C background (SWI6, unpublished data). Stronger evidence comes from a study of stress-sensitive deletion mutants grown in varying concentrations of heavy metals or pro-oxidants [73]. Six deletions, four of which we identify as capacitors (CTR1, CUP5, VMA6, VMA7), resulted in a fitness disadvantage at moderate toxin concentrations but a clear heterogeneity-dependent fitness advantage over the wild-type at high toxin concentrations. Also of note, genome-wide examination of genes under positive selection in S. cerevisiae [74] finds nine capacitors out of 72 genes. Because of the relatively short generation time of yeast, it is now possible to formally test if disrupted capacitor function can provide a fitness advantage in some environments. Although our study has focused on the effects of deletions of nonessential genes, it might be more relevant to common evolutionary trajectories to ask if subtler allelic changes, such as those that result in altered transcription or protein sequence, could cause loss of robustness. Our prediction is that the allelic changes that affect phenotypic heterogeneity are most likely to alter network architecture or dynamics. One possibility is that such mutations would occur in essential genes because, like capacitor genes, they tend to encode highly connected network hubs. Encouraging results come from a recent study that used the progeny from a cross between a yeast lab strain and a wild isolate to map the variances of 35 quantitative phenotypes to 14 quantitative trait loci, one of which is a single nucleotide polymorphism in the essential G-protein alpha subunit GPA1 [75]. The recent availability of a yeast library with decreased expression of essential genes through mRNA perturbation [76] now makes this possible to test on a genome-wide scale. Materials and Methods Yeast strains. Haploid convertible diploid BY4743 YKO magic marker deletion strains (MATa/α ura3Δ0/ ura3Δ0 leu2Δ0/leu2Δ0 his3Δ1/ his3Δ1 lys2Δ0/LYS+ met15Δ0/MET15+ can1Δ::LEU2+-MFA1pr-HIS3/CAN1+ xxx::kanMX/XXX+) were purchased from Open Biosystems. General statistics and programming. All data analysis was performed using the open-source R statistical computing package (http://www.r-project.org/). Lowess regression was performed using the “lowess” function with a smoother span of 0.2 (944 YKOs) and three iterations for the genome-wide analysis (Figure 1 Dimensional reduction and calculation of phenotypic potential. A major problem we faced in dealing with a dataset of 220 phenotypes is removing those phenotypes that might be biologically or physically redundant. Principal components analysis, a common means by which to reduce the dimensionality in a matrix (remove redundant phenotypes), transforms the data to a new coordinate system such that the greatest variance of any projection of the data lies on the first coordinate or principal component, the second greatest variance on the second coordinate, etc. This transformation, however, does not preserve the directionality or syntax of the initial dataset because the coordinates are drawn to maximize the overall variance ignoring whether these values are positive or negative. In other words, loadings of the initial data into a principal component may be negative. Thus, a high value in a principal component may represent a high or low variance in the underlying phenotypes. Because we are interested in identifying genes that only result in high variance, principal components analysis was not appropriate, and so we used a strategy based on clustering instead. PAM [77] was selected for dimensional reduction because it has several advantages to k-means clustering with respect to our question: (1) resultant medoid cluster centers are real phenotypes from our residuals of standard deviation matrix (rather than difficult-to-interpret linear combinations of residuals of many phenotypes as in k-means); (2) using medoids as cluster centers rather than average cluster centers makes the procedure more robust to outliers; and (3) an “average silhouette” strategy [78] utilized here allows for estimation of the appropriate number of distinct clusters. To estimate the number of nonredundant clusters, the average silhouette strategy [78] was used: we performed PAM over a range of 20 to 100 clusters and generated silhouette plots for each. We then plotted the average silhouette width versus the number of clusters chosen (Figure S2). Although the average silhouette width peaks around 80 clusters, significant gaps (silhouette widths close to 0) are more likely to appear in silhouette plots when greater than 70 clusters were chosen, suggesting that noninformative clusters are added beyond 70. Thus, we estimated 70 nonredundant clusters. However, using a range of 50 to 80 clusters for PAM to identify phenotypic capacitors results in an extremely similar set of genes (Figure S3). Using the reduced matrix of 4,718 YKOs by 70 phenotype medoids, we next calculated a single measure of the overall phenotypic variance resulting from a gene's deletion, which we term the phenotypic potential. We generate this score for each YKO by averaging the top 35 (of 70) residuals of standard deviation. The rationale for using 35 of 70 medoids is as follows: (1) A gene deletion that results in a high variance in only one or a few phenotypes does not meet the pleiotropy requirement of our definition of a phenotypic capacitor. Thus, we sought YKOs that have both a large number of high variance phenotypic medoids and high magnitudes in those medoids (i.e., as many medoids should be averaged as possible to capture the overall phenotypic potential). (2) Only 24 YKOs result in high variance (>1 SD) in greater than 35 phenotype medoids, all of which were subsequently identified as phenotypic capacitors. Thus, phenotypic potential scores that rely on greater than 35 medoids are likely to increase noise in the scoring procedure. Although we chose to score 35 phenotype medoids to identify phenotypic capacitors, alternative procedures result in an extremely similar set of genes over a broad range of number of medoids scored (Figure S4). Estimation of the number of phenotypic capacitors using the FDR. First, we generated 100 randomized phenotypic medoid by YKO matrices by permuting elements within each phenotype column of the original matrix. These 100 matrices are then used to calculate phenotypic potentials. We then generated a reference distribution by averaging the top ranking phenotypic potential for each of the 100 trials, the second top ranking phenotypic potential, etc. This reference distribution was used to estimate the expected false positive rate under the null hypothesis. At a given expected false positive rate, the number of true positives was estimated by subtracting the number of false positives (i.e., expected false positive rate × 4,718) from the number of genes in the actual distribution that have a higher phenotypic potential than the reference distribution (all positives). The maximum number of true positives occurs at an expected false positive rate of 0.036 with 333 true positives estimated for 502 positives (a FDR of 34%). Cell staining and visualization. YKO magic marker deletion haploid convertible diploid strains (Open Biosystems) were grown on YPD agar for ~48 h, spread on GNA (5% D-glucose, 3% Difco nutrient broth, 1% Difco yeast extract, 2% bacto agar) plates and grown for 24 h. A single medium-sized colony was added to 5 ml of sporulation medium (10% potassium acetate, 0.005% zinc acetate + Ura + His + Leu) and incubated at 30 °C for 5–7 d. One to 10 μl of the sporulated cells were spread onto magic media plates (SC-Leu-His-Arg + canavanine + G418) and grown until medium sized colonies appeared (~20 generations and for no more than 72 h when possible). Single colonies were frozen at this point for later processing. Cell stocks were streaked out on YEPD agar and grown at 30 °C for a maximum of ~48 h when possible (~20 additional generations). Cells were subsequently grown overnight in 3 ml YEPD at 30 °C with shaking. Because many of the haploid YKO strains were expected to include heterogeneous morphologies, direct cell counts using a hemocytometer were relied upon rather than optical density readings. Cells were counted, then 1 × 108 cells were added to 20 ml YEPD and grown for 3–3.5 hrs at 30 °C (early logarithmic phase). Fixation and straining was performed as described in the CalMorph manual with modifications (http://scmd.gi.k.u-tokyo.ac.jp/datamine/calmorph/). Briefly, cells were fixed in 3.7% formaldehyde, 100 mM potassium phosphate, and triply stained for cell-surface manno-protein, actin cytoskeleton, and nuclear DNA using fluorescein isothiocyanate-Con A (Sigma), rhodamine-phalloidin (Molecular Probes), and 4′, 6-diamidino-2-phenylindole (Sigma), respectively. Cells were mounted in vectashield (Vector Laboratories), and visualized by epifluorescent microscopy on a Nikon Eclipse 90i automated microscope using a 100× objective and a Roper 1K CCD camera. For each YKO, a minimum of 40 micrographs was captured to yield a minimum of 200 (and usually in excess of 300) phenotyped cells that are not classified as “complex.” Captured micrographs were analyzed for quantitative morphological traits using the CalMorph software package. For all YKOs processed, ~50 generations is estimated to have passed from sporulation to fixation and phenotyping: ~40 generations on agar plates and ~10 generations in liquid media. Control strains are defined as those knockouts with a phenotypic potential that ranked below 1,000 in the original analysis of the haploid knockout library. GO enrichment. The full GO term hierarchy was downloaded from the GO website on May 11, 2007 (http://www.geneontology.org/). To determine GO process term enrichment, the hierarchy was first trimmed by removing GO terms that did not annotate three or more yeast ORFs. Additionally, the highly annotated process GO terms “physiological process,” “cellular process,” “biological process,” and “cellular physiological process” were removed because they are too general to be meaningful. Trimmed hierarchies resulted in 775 process terms. Significance of GO term enrichment of the 502 putative phenotypic capacitors was calculated using the hypergeometric distribution and Bonferroni corrected using the trimmed hierarchy. Network parameters. The May 1, 2007 release of interaction data (BIOGRID-ORGANISM-Saccharomyces_cerevisiae-2.0.27.tab.txt) was downloaded from the BioGRID (http://www.thebiogrid.org/) [32]. For analysis of physical interactions, two networks were constructed from the BioGRID: (1) every physical interaction from the hand-curated literature citation interaction database was included (including affinity capture-mass spectrometry, affinity capture-western, affinity capture-RNA, cofractionation, colocalization, copurification, fluorescence resonance energy transfer (FRET), two-hybrid, biochemical activity, cocrystal structure, far western, protein–peptide, protein–RNA, reconstituted complex; 5,192 nodes, 70,900 edges); or (2) only interactions from the higher confidence affinity capture-mass spectrometry (3,686 nodes, 47,774 edges). Because nearly every gene has been used as bait for the affinity capture-mass spectrometry network [33,34] and because of the higher consistency and lower rate of false positives compared to other methods, this network might represent the most unbiased view of the PPI network. Thus, this smaller network was used to calculate network properties of genes such as betweenness and clustering coefficient, to control for PPIs when estimating other gene properties such as synthetic lethality and dispensibility, and to estimate the properties of immediate neighbors in the physical interaction network. All interactions in the BioGRID annotated with “Synthetic Lethality” were used to generate the SLI network (2330 nodes, 18,550 edges). Because the SLI network is incomplete, we hand curated synthetic-lethal entries to create a subset of this network that only includes interactions derived from SGAs where both genes are completely functionally compromised and where the experiment contains ten or more total interactions (1,326 nodes, 11,934 edges). The subset of genes involved in SLIs discovered in SGA experiments was further separated into those genes that have been used as “bait” and those that have not, “prey.” Double knockouts for the 267 bait genes and all other nonessential genes have been performed and thus the complete SLIs for bait genes have likely been discovered; however, double knockouts for the 1,060 prey genes have only been performed with the 267 genes used as bait and thus represent an incomplete, although likely representative, interaction set. Genes without any interactions were excluded from the interaction networks. Betweenness [79] was calculated using the “betweenness” function from the “sna” library in R. The clustering coefficient for each gene was calculated, as described [80]. Essential genes are those genes that were determined to be essential in the systematic deletion project [81] except for the following genes which were deemed nonessential based on subsequent synthetic-lethal analysis: YJL174W, YJR057W, YLR103C, YOR326W, YPL153C, YBR234C, YDL102W, YDL029W, YDL017W, YDL003W, YDR052C [19,82]. Unless otherwise noted, nonessential genes refers to the 4,718 genes that were knocked-out and phenotyped in the high-dimensional quantitative morphological phenotyping experiment including any identified phenotypic capacitors [17]. Significance calculation of PPI degree comparison between capacitors and essential genes. All putative phenotypic capacitors (those with the top 502 phenotypic potential scores) were compared to all essential genes by Wilcoxon-Mann-Whitney test using PPI degrees derived from both the literature citation and mass spectrometry networks (p < 2.2 × 10−16 and p < 5.2 × 10−13, respectively). A similar comparison was made using only capacitors in the C1 and C2 classes (p < 2.0 × 10−6 for the literature citation network, and p < 7.8 × 10−7 for the mass spectrometry network). Genome-wide datasets. Genome-wide datasets were acquired from the following sources: PPIs and SLIs [32], haploid growth rates [37], heterozygous and homozygous diploid growth rates [38], duplicate and singleton identity [39], ohnolog identity [45], Ks of duplicate pairs [39], mRNA length [43], mRNA abundance [83], protein abundance [84], noise in protein expression in permissive (YEPD) and restrictive (SD) media [46], Ka/Ks [43,85]. The expression similarity between duplicate pairs was acquired from Kafri et al. [39] and was determined using correlations of expression over several reported expression array experiments (Ran Kafri, personal communication). The Ka for duplicates resulting from the whole genome duplication (ohnologs) was calculated as follows: alignments of the S. cerevisiae ohnologs and an ortholog from Kluyveromyces waltii, a related species whose divergence precedes the whole genome duplication event, have been previously performed [86]. Ka was calculated from these alignments in PAML [87] using the Yang and Nielsen method [88] with all settings set to default except for “icode,” which was set to 2 to reflect the yeast genome. Analysis of covariance to control for the effect of expression level on Ks and Ka. Because Ks and Ka are correlated with the expression level [43], we performed analyses of covariance (ANCOVAs) to estimate evolution rate differences. ANCOVAs were performed as described with modifications [43]. In one case we used Ks instead of dN, and we used the average mRNA expression level of the two paralogs as the continuous variable. Only duplicate pairs for which the expression level of each paralog has been measured were used for the analysis (Figures S16 and S18). FDN score. Parameters (such as phenotypic potential, dispensability, physical, or synthetic-lethal degree) for the immediate PPI neighbors of a given gene were calculated as follows: first, each gene is given a neighbor score in the parameter by averaging the scores of all of its neighbors. For example, a gene with three neighbors with haploid growth rate scores of 0.5, 0.9, and 1.0 will get a neighbor haploid growth rate score of 0.8. Second, the effect of PPI degree on the neighbor parameter score is removed by plotting the neighbor parameter score for each gene versus its PPI degree, fitting a lowess regression to this plot, and taking residuals of the curve fit as a measure of the neighbor parameter controlled for the number of PPIs. Thus, the gene described above with three neighbors and a neighbor haploid growth rate score of 0.8 will only have a low neighbor growth rate (and a negative residual) if it is low relative to other genes with a similar number of PPIs. Figure S1: Variance Depends on Mean in a Phenotype-Specific Manner Scatter plots of means and standard deviations for 4,718 YKO strains for nine representative phenotypes. Black circles represent individual YKOs and red circles represent the lowess curve fit. Phenotype descriptions and their respective units can be found in the CalMorph manual (http://scmd.gi.k.u-tokyo.ac.jp/datamine/calmorph/). (11.37 MB PDF) Click here for additional data file.(11M, pdf) Figure S2: The Average Silhouette Width Versus the Number of Clusters Chosen for PAM (438 KB PDF) Click here for additional data file.(438K, pdf) Figure S3: Phenotypic Capacitor Identification Is Robust to Changes in the Number of Clusters Heatmap of the percent match between the identities of genes with the top 502 phenotypic potentials using a different number of clusters for PAM while holding the number of clusters scored constant at 35. (268 KB PDF) Click here for additional data file.(268K, pdf) Figure S4: Phenotypic Capacitor Identification Is Robust to Changes in the Number of Clusters Scored Heatmap of the percent match between the identities of the top 502 phenotypic potentials using 70 clusters for PAM while changing the number of clusters scored (i.e., the number of top ranked residuals of standard deviation averaged per YKO). Low percent match is only seen when comparing to procedures with a low number of clusters scored. This suggests that genes that cause a high variance in only a few phenotypes are different from those that cause high variance globally. (618 KB PDF) Click here for additional data file.(618K, pdf) Figure S5: Phenotypic Capacitors Are Not Biased toward the Mean Extremes Where the Lowess Curve Fit Might Be Less Accurate Scatter plots of means and standard deviations for 4,718 YKO strains for nine representative phenotypes are shown. YKOs with the top 100 phenotypic potentials are plotted in blue and the lowess curve fit is plotted in red. All other YKOs are plotted in grey. Phenotype descriptions and their respective units can be found in the CalMorph manual (http://scmd.gi.k.u-tokyo.ac.jp/datamine/calmorph/). (11.41 MB PDF) Click here for additional data file.(11M, pdf) Figure S6: Identification of Phenotypic Capacitors Is Robust to General Changes in Procedure As an alternate procedure to identify phenotypic capacitors that does not rely on lowess regression or PAM, we calculated phenotypic potentials using the coefficient of variation (CV) matrix, controlling for confounding effects of differences in mean phenotype by eliminating CVs corresponding to mean outliers (described fully in the Protocol S1). Top ranking genes are similar to those using procedures described in the main text. Plotted is the percent of genes identified as phenotypic capacitors (the top 502 scoring genes using procedures described in the main text) for the top 100, 200, 300, 400, and 500 scoring genes using the alternative procedure. (169 KB PDF) Click here for additional data file.(169K, pdf) Figure S7: The Estimated Number of True Positive Identifications of a Phenotypic Capacitor Are Plotted Versus the Expected False Positive Rate under the Null Hypothesis Described fully in the Materials and Methods. (565 KB PDF) Click here for additional data file.(565K, pdf) Figure S8: Phenotypic Capacitor YKOs Have Highly Variable and Distinct Phenotypes Representative micrographs of control (A and B) and phenotypic capacitor (C and D) strains are shown. Cells are stained with FITC-concanavalin A (green), rhodamine phalloidin (red), and DAPI (blue). (3.63 MB PDF) Click here for additional data file.(3.5M, pdf) Figure S9: Histograms of the Phenotypic Potentials of Control (Black) and Phenotypic Capacitor (Red) YKOs of Haploid-Converted Heterozygous Diploid Strains Using Sampling Analysis (119 KB PDF) Click here for additional data file.(119K, pdf) Figure S10: Phenotypic Capacitors Have More PPIs than GO Matched Genes The average number of physical interactions of phenotypic capacitors in the C1 or C2 classes (blue) versus all nonessential (light grey) or essential (dark grey) genes annotated within a given GO process category using all physical interactions in the literature-curated network of the BioGRID (solid bars) or only affinity-capture mass spectrometry interactions (diagonal stripes). All p-values are a comparison to nonessential genes, Wilcoxon-Mann-Whitney test: *, p < 0.05; **, p < 0.001; ***, p < 1 × 10−5; ****, p < 1 × 10−10. (409 KB PDF) Click here for additional data file.(409K, pdf) Figure S11: The Percent of Phenotypic Capacitors in the Set of All Nonessential Genes as a Function of Binned Affinity-Capture Mass Spectroscopy PPI Degree (A), SLI Degree (B), or Haploid Growth Rate (C) (469 KB PDF) Click here for additional data file.(469K, pdf) Figure S12: Barplots of Synthetic-Lethal Degree and Growth Rate, Controlled for the Number PPIs The lowess regression residual values for each parameter were calculated and compared (see Protocol S1). Genes that fall into the C1 (dark blue), C2 (blue), or C3 (light blue) classes represent high, mid, and low confidence phenotypic capacitors, respectively. The abbreviations LC, AMS, and SGA refer to networks derived from all literature-citation interactions, affinity-capture mass spectrometry interactions, and SGA interactions, respectively. All p-values are a comparison to nonessential genes, Wilcoxon-Mann-Whitney test: *, p < 0.05; **, p < 0.001; ***, p < 1 × 10−5; ****, p < 1 × 10−10. (263 KB PDF) Click here for additional data file.(263K, pdf) Figure S13: Phenotypic Capacitors Have More Synthetic-Lethal Interactions Than GO Matched Genes The average number of SLIs of phenotypic capacitors in the C1 or C2 classes (blue) versus all nonessential (light grey) genes annotated within a given GO process category using all interactions annotated in the literature curated BioGRID network (solid bars), interactions only derived from SGA experiments considering only genes used as bait (diagonal stripes), and interactions only derived from SGA experiments considering only genes not used as bait (horizontal stripes). All p-values are a comparison to nonessential genes, Wilcoxon-Mann-Whitney test: *, p < 0.05; **, p < 0.001; ***, p < 1 × 10−5; ****, p < 1 × 10−10. (564 KB PDF) Click here for additional data file.(564K, pdf) Figure S14: Phenotypic Capacitors Are Less Dispensable Than GO Matched Genes The average growth rates of haploid KO (solid bars), heterozygous diploid KO (diagonal stripes), and homozygous diploid KO (horizontal stripes) strains of phenotypic capacitors in the C1 or C2 classes (blue) versus all nonessential (light grey) genes annotated within a given GO process category. All p-values are a comparison to nonessential genes, Wilcoxon-Mann-Whitney test: *, p < 0.05; **, p < 0.001; ***, p < 1 × 10−5; ****, p < 1 ×10−10. (466 KB PDF) Click here for additional data file.(466K, pdf) Figure S15: Barplots of All Nonessential Genes (Grey), All Nonessential Singletons (Light Green), Capacitor Singletons (Light Blue), All Nonessential Duplicates (Dark Green), Capacitor Duplicates (Dark Blue), and All Essential Genes (Dark Grey) for Several Genome-Wide Datasets The abbreviations LC, AMS, and SGA refer to networks derived from all literature-citation interactions, affinity-capture mass spectrometry interactions, and SGA interactions, respectively. All other parameters are described in the Materials and Methods. Wilcoxon-Mann-Whitney test: *, p < 0.05; **, p < 0.001; ***, p < 1 × 10−5; ****, p < 1 × 10−10. (623 KB PDF) Click here for additional data file.(623K, pdf) Figure S16: Binary Analysis of the Effect of Duplicate Identity on the Ks between Paralogs (A) Duplicate pairs that contain at least one capacitor (blue) are more ancient than nonessential noncapacitor duplicate pairs (light grey, p < 0.01, ANCOVA). (B) Duplicate pairs that contain at least one essential gene (dark grey) are more ancient than nonessential noncapacitor duplicate pairs (light grey, p < 9.05 × 10−9, ANCOVA). (C) Essential duplicate pairs (dark grey) are more ancient than capacitor duplicate pairs (blue, p < 1.03 × 10−8, ANCOVA). (728 KB PDF) Click here for additional data file.(728K, pdf) Figure S17: Histograms of the Ks between Paralogs of Duplicate Pairs That Contain at Least One Capacitor (Blue) or only Nonessential, Noncapacitor Genes (Grey) (134 KB PDF) Click here for additional data file.(134K, pdf) Figure S18: Binary Analysis of the Effect of Duplicate Identity on the Ka between Ohnologs (A) Ohnologs that contain at least one capacitor (blue) are plotted against nonessential noncapacitor ohnologs (light grey, p = 0.99, ANCOVA). (B) Ohnologs that contain at least one essential gene (dark grey) are plotted against nonessential noncapacitor ohnologs (light grey, p < 0.02, ANCOVA). (C) Essential ohnologs (dark grey) are plotted against capacitor ohnologs (blue, p < 0.07, ANCOVA). (620 KB PDF) Click here for additional data file.(620K, pdf) Figure S19: Barplots of the First-Degree PPI Neighbors of a Gene Controlled for the Degree of That Gene for Several Genome-Wide Datasets (FDN Score, see Materials and Methods) Colors are nonessential genes (grey), all nonessential singletons (light green), capacitor singletons (light blue), all nonessential duplicates (dark green), capacitor duplicates (dark blue), and all essential genes (dark grey). The abbreviations LC and AMS refer to networks derived from all literature citation interactions and affinity-capture mass spectrometry interactions, respectively. All other parameters are described in the methods. Wilcoxon-Mann-Whitney test: *, p <0.05. (827 KB PDF) Click here for additional data file.(827K, pdf) Table S1: S. cerevisiae Genes Ranked by Phenotypic Potential (893 KB TXT) Click here for additional data file.(893K, txt) Table S2: GO Process Terms with Significant Enrichment of Phenotypic Capacitors (47 KB TXT) Click here for additional data file.(47K, txt) Table S3: GO Process Terms with Significant Enrichment of Singleton or Duplicate Phenotypic Capacitors (18 KB TXT) Click here for additional data file.(18K, txt) Acknowledgments We thank F. Chiaromonte, R. Bonneau, K. Gunsalus, M. Purugganan, M. Dunham, D. Hartl, J. Masel, V. Prabhakar, and P. Sood for helpful discussions. We thank K. Drew, L. Uppendahl, S. Schnakenberg, J. Richardson, H. Hsiao, K. Birnbaum, and P. Eichenberger for technical assistance. Abbreviations
Footnotes Author contributions. MLS and SFL conceived and designed the experiments. SFL performed the experiments. MLS and SFL analyzed the data. MLS and SFL contributed reagents/materials/analysis tools. MLS and SFL wrote the paper. Funding. This work was supported by National Science Foundation (NSF) grant IOS-0642999 (MLS) and a National Institutes of Health (NIH) Ruth L. Kirchstein individual postdoctoral fellowship GM 079909 (SFL). MLS is an Alfred P. Sloan research fellow. Competing interests. The authors have declared that no competing interests exist. References
|
PubMed related articles
Your browsing activity is empty. Activity recording is turned off. |
||||||||||||||||||||||||||||||||||||||||||||||||||
FEBS Lett. 2005 Mar 21; 579(8):1772-8.
[FEBS Lett. 2005]Nature. 2002 Jun 6; 417(6889):618-24.
[Nature. 2002]Nature. 1998 Nov 26; 396(6709):336-42.
[Nature. 1998]PLoS One. 2006 Dec 20; 1():e75.
[PLoS One. 2006]J Exp Zool. 1999 Aug 15; 285(2):116-8.
[J Exp Zool. 1999]Nature. 1998 Nov 26; 396(6709):336-42.
[Nature. 1998]PLoS One. 2006 Dec 20; 1():e76.
[PLoS One. 2006]Nature. 2002 Jun 6; 417(6889):618-24.
[Nature. 2002]Proc Natl Acad Sci U S A. 2004 Aug 10; 101(32):11531-6.
[Proc Natl Acad Sci U S A. 2004]J Exp Zool. 1999 Aug 15; 285(2):116-8.
[J Exp Zool. 1999]Genetics. 2005 Jul; 170(3):1359-71.
[Genetics. 2005]Theor Popul Biol. 2007 Dec; 72(4):560-75.
[Theor Popul Biol. 2007]PLoS One. 2006 Dec 20; 1():e75.
[PLoS One. 2006]Proc Natl Acad Sci U S A. 2005 Dec 27; 102(52):19015-20.
[Proc Natl Acad Sci U S A. 2005]Proc Natl Acad Sci U S A. 2005 Dec 27; 102(52):19015-20.
[Proc Natl Acad Sci U S A. 2005]Proc Natl Acad Sci U S A. 2005 Dec 27; 102(52):19015-20.
[Proc Natl Acad Sci U S A. 2005]Mol Cell. 2004 Nov 5; 16(3):487-96.
[Mol Cell. 2004]Curr Opin Genet Dev. 1999 Feb; 9(1):89-96.
[Curr Opin Genet Dev. 1999]Genetics. 2008 May; 179(1):345-57.
[Genetics. 2008]Exp Cell Res. 2001 Dec 10; 271(2):296-304.
[Exp Cell Res. 2001]Science. 2004 Jun 18; 304(5678):1811-4.
[Science. 2004]Nature. 2005 Sep 29; 437(7059):699-706.
[Nature. 2005]Cell. 2006 Mar 10; 124(5):1069-81.
[Cell. 2006]Nature. 2001 May 3; 411(6833):41-2.
[Nature. 2001]Science. 2001 Dec 14; 294(5550):2364-8.
[Science. 2001]Proc Natl Acad Sci U S A. 2002 Aug 6; 99(16):10528-32.
[Proc Natl Acad Sci U S A. 2002]Nature. 2003 Jul 31; 424(6948):549-52.
[Nature. 2003]Genetica. 2007 Jan; 129(1):83-103.
[Genetica. 2007]Nucleic Acids Res. 2006 Jan 1; 34(Database issue):D535-9.
[Nucleic Acids Res. 2006]Nature. 2006 Mar 30; 440(7084):631-6.
[Nature. 2006]Nature. 2006 Mar 30; 440(7084):637-43.
[Nature. 2006]Science. 2005 Sep 30; 309(5744):2185-9.
[Science. 2005]Genetics. 2006 Oct; 174(2):937-44.
[Genetics. 2006]Nucleic Acids Res. 2006 Jan 1; 34(Database issue):D535-9.
[Nucleic Acids Res. 2006]Proc Natl Acad Sci U S A. 2003 Dec 23; 100(26):15724-9.
[Proc Natl Acad Sci U S A. 2003]Genetics. 2005 Apr; 169(4):1915-25.
[Genetics. 2005]Proc Natl Acad Sci U S A. 2008 Jan 29; 105(4):1243-8.
[Proc Natl Acad Sci U S A. 2008]Nature. 2003 Jan 2; 421(6918):63-6.
[Nature. 2003]Proc Natl Acad Sci U S A. 2008 Jan 29; 105(4):1243-8.
[Proc Natl Acad Sci U S A. 2008]Science. 2000 Nov 10; 290(5494):1151-5.
[Science. 2000]Mol Biol Evol. 2006 Feb; 23(2):327-37.
[Mol Biol Evol. 2006]Genetics. 2001 Jun; 158(2):927-31.
[Genetics. 2001]Genome Res. 2005 Oct; 15(10):1456-61.
[Genome Res. 2005]Proc Natl Acad Sci U S A. 2008 Jan 29; 105(4):1243-8.
[Proc Natl Acad Sci U S A. 2008]Nature. 2006 Jun 15; 441(7095):840-6.
[Nature. 2006]Cell. 2006 Mar 10; 124(5):1069-81.
[Cell. 2006]Genome Biol. 2007; 8(12):R258.
[Genome Biol. 2007]Nature. 2001 Jun 28; 411(6841):1073-6.
[Nature. 2001]Cell. 2006 Mar 10; 124(5):1069-81.
[Cell. 2006]Genetics. 2001 Dec; 159(4):1449-65.
[Genetics. 2001]Genetics. 2008 Jan; 178(1):197-214.
[Genetics. 2008]Proc Natl Acad Sci U S A. 2008 Jan 29; 105(4):1243-8.
[Proc Natl Acad Sci U S A. 2008]Science. 2001 Feb 9; 291(5506):1001-4.
[Science. 2001]Nat Genet. 2006 Aug; 38(8):896-903.
[Nat Genet. 2006]Proc Natl Acad Sci U S A. 2006 Feb 21; 103(8):2707-12.
[Proc Natl Acad Sci U S A. 2006]Nature. 2002 Jun 6; 417(6889):618-24.
[Nature. 2002]Nature. 1998 Nov 26; 396(6709):336-42.
[Nature. 1998]Cell. 1990 May 4; 61(3):375-8.
[Cell. 1990]Nature. 2003 Jul 31; 424(6948):549-52.
[Nature. 2003]Adv Genet. 1961; 10():257-93.
[Adv Genet. 1961]Nature. 2002 Jun 6; 417(6889):618-24.
[Nature. 2002]Nature. 1998 Nov 26; 396(6709):336-42.
[Nature. 1998]Proc Natl Acad Sci U S A. 2003 Nov 11; 100(23):13396-401.
[Proc Natl Acad Sci U S A. 2003]J Exp Bot. 2007; 58(13):3513-24.
[J Exp Bot. 2007]Genetics. 2000 Nov; 156(3):1005-23.
[Genetics. 2000]Mol Microbiol. 2007 Jan; 63(2):507-20.
[Mol Microbiol. 2007]Proc Natl Acad Sci U S A. 2006 Nov 21; 103(47):17614-9.
[Proc Natl Acad Sci U S A. 2006]PLoS Genet. 2007 Feb 23; 3(2):e31.
[PLoS Genet. 2007]Nat Methods. 2008 Aug; 5(8):711-8.
[Nat Methods. 2008]Nucleic Acids Res. 2006 Jan 1; 34(Database issue):D535-9.
[Nucleic Acids Res. 2006]Nature. 2006 Mar 30; 440(7084):631-6.
[Nature. 2006]Nature. 2006 Mar 30; 440(7084):637-43.
[Nature. 2006]Nature. 1998 Jun 4; 393(6684):440-2.
[Nature. 1998]Nature. 2002 Jul 25; 418(6896):387-91.
[Nature. 2002]Mol Cell. 2004 Nov 5; 16(3):487-96.
[Mol Cell. 2004]Science. 2004 Feb 6; 303(5659):808-13.
[Science. 2004]Proc Natl Acad Sci U S A. 2005 Dec 27; 102(52):19015-20.
[Proc Natl Acad Sci U S A. 2005]Nucleic Acids Res. 2006 Jan 1; 34(Database issue):D535-9.
[Nucleic Acids Res. 2006]Proc Natl Acad Sci U S A. 2003 Dec 23; 100(26):15724-9.
[Proc Natl Acad Sci U S A. 2003]Genetics. 2005 Apr; 169(4):1915-25.
[Genetics. 2005]Proc Natl Acad Sci U S A. 2008 Jan 29; 105(4):1243-8.
[Proc Natl Acad Sci U S A. 2008]Genome Res. 2005 Oct; 15(10):1456-61.
[Genome Res. 2005]Mol Biol Evol. 2006 Feb; 23(2):327-37.
[Mol Biol Evol. 2006]