![]() | ![]() |
Formats:
|
||||||||||||||
Transcriptome Profiling of the C. elegans Rb Ortholog Reveals Diverse Developmental Roles University of Wyoming, College of Agriculture, Department of Molecular Biology, Dept 3944, 1000 E. University Avenue, Laramie, WY 82071 *To whom correspondence should be addressed. David S. Fay, Telephone: (307) 766-4961, Fax: (307) 766-5098, email: davidfay/at/uwyo.edu Abstract LIN-35 is the single C. elegans ortholog of the mammalian pocket protein family members, pRb, p107, and p130. To gain insight into the roles of pocket proteins during development, a microarray analysis was performed with lin-35 mutants. Stage-specific regulation patterns were revealed, indicating that LIN-35 plays diverse roles at distinct developmental stages. LIN-35 was found to repress the expression of many genes involved in cell proliferation in larvae, an activity that is carried out in conjunction with E2F. In addition, LIN-35 was found to regulate neuronal genes during embryogenesis and targets of the intestinal-specific GATA transcription factor, ELT-2, at multiple developmental stages. Additional findings suggest that LIN-35 functions in cell cycle regulation in embryos in a manner that is independent of E2F. A comparison of LIN-35-regulated genes with known fly and mammalian pocket-protein targets revealed a high degree of overlap, indicating strong conservation of pocket protein functions in diverse phyla. Based on microarray results and our refinement of the C. elegans E2F consensus sequence, we were able to generate a comprehensive list of putative E2F-regulated genes in C. elegans. These results implicate a large number of genes previously unconnected to cell cycle control as having potential roles in this process. Keywords: lin-35, retinoblastoma, pRb, p107, p130, pocket proteins, C. elegans, development, microarray INTRODUCTION The mammalian retinoblastoma protein (pRb) was amongst the first tumor suppressor proteins to be identified (Dryja et al., 1986; Friend et al., 1986; Lee et al., 1987), and loss or inactivation of pRb is thought to play a causative role in the majority of human cancers (Bindra and Glazer, 2006; Knudsen and Knudsen, 2006; Sherr, 2004; Sherr and McCormick, 2002; Yamasaki, 2003). Many studies on pRb, p107, and p130 have demonstrated their importance in the regulation of cell cycle progression, largely by binding E2F transcription factor family members using a cleft or “pocket” domain. This leads to the repression of E2F target genes and the consequent suppression of S-phase entry by inhibiting E2Fs promoting target gene expression (E2F1–3) or by augmenting E2Fs acting as transcriptional repressors (E2F4–6) (Dannenberg et al., 2000; Du and Dyson, 1999; Reed, 1997; Sage et al., 2000). Although the interaction of pocket proteins with E2F family members is the most thoroughly characterized function of pRb-family members, they are far from the only transcription factors known to bind pocket proteins (Lipinski and Jacks, 1999; Morris and Dyson, 2001). For example, pRb has been reported to interact with C/EBPε, to promote the terminal differentiation of granulocytes and adipocytes (Chen et al., 1996; Gery et al., 2004), with GATA-1 to induce erythrocyte differentiation (Rekhtman et al., 2003), and with MyoD, to promote myocyte differentiation (Gu et al., 1993). pRb also antagonizes the cell-differentation inhibitor Id2, allowing for the terminal differentiation of several cell types (Iavarone et al., 1994; Lasorella et al., 2000). In addition, both pRb and p107 have been shown to act through the basic helix-loop-helix transcription factor, NeuroD1, which is critical for early development of the neural system in Xenopus (Batsche et al., 2005). Notably, the majority of reports to date describe activities that have been observed in tissue culture systems and significantly less is known about pocket protein functions in the context of intact developing organisms. This lack of supporting in vivo data is in part due to the lethality of many pRb family knockouts in mice and flies (Clarke et al., 1992; Jacks et al., 1992; Lee et al., 1992; Dimova et al., 2003; Du and Dyson, 1999; Wikenheiser-Brokamp, 2006). C. elegans has only a single pocket protein family member, lin-35, which is not essential for viability. In addition, C. elegans has only one DP homolog and three E2F homologs, further simplifying the analysis of the role of pocket proteins in this organism. Studies have demonstrated that lin-35 also functions in G1–S-phase regulation during development (Boxem and van den Heuvel, 2001; Fay, 2005; Fay et al., 2002), and plays a number of diverse roles outside of cell cycle control. For example, lin-35, has been shown to repress vulval developmental fates (Ceol and Horvitz, 2001; Lu and Horvitz, 1998), and to play roles in pharynx and vulva morphogenesis (Bender et al., 2006; Fay et al., 2003; Fay et al., 2004), asymmetric cell division (Cui et al., 2004), somatic gonad cell fate (Bender et al., 2004), larval growth (Cardoso et al., 2005; Chesney et al., 2006; Cui et al., 2004), germline gene repression (Wang et al., 2005), and even RNA interference (Lehner et al., 2006; Wang et al., 2005). In this study, a microarray analysis was performed using lin-35 null mutants to determine the role of this protein at different developmental stages in the organism and to identify which proteins and pathways it may be acting through. A number of differentially-regulated genes identified in this study were homologous to mammalian genes also regulated by pocket proteins, suggesting a strong conservation of function of pRb-family members between nematodes and mammals. We have also refined the nematode E2F consensus binding site and have generated a compelling list of putative E2F targets in this organism. This, along with the discovery of several novel motifs that could represent binding sites for additional LIN-35 binding partners, may substantially expand the network of pRb-family regulated genes and help to elucidate the routes by which they function. MATERIALS AND METHODS Strains and maintenance All C. elegans strains were maintained according to established protocols (Stiernagle). All experiments were carried out at 20°C. Strains used in these studies included the following: N2 (wild type) and of lin- 35(n745) (Lu and Horvitz, 1998), which had been additionally backcrossed by our laboratory 5X. Staging of organisms Wild-type and mutant worms were staged by bleaching gravid adults to collect eggs, which were hatched onto NGM plates in the absence of food. After hatching, worms were transferred to NGM plates with bacteria (OP50). RNA harvested for embryonic stage worms was obtained from a total preparation of embryos. As a result of extracting embryos from young adult stage worms, predominantly early stage embryos were obtained. The differences in embryos between strains were similar to the differences between replicates in the same strain. For RNA from L1 stage larvae, collection occurred ~1 hour prior to the apoptosis of T.ppp, which occurred ~10 or 11 hours after hatching in N2 and lin-35 mutants, respectively (Sulston and Horvitz, 1977). For L4 stage larvae, a distinct Christmas tree vulval morphology was used as a biomarker. RNA extraction and microarrays Embryos or larvae were washed off of NGM plates using M9. RNA was extracted using phenol followed by phenol:chloroform:isoamyl alcohol. RNA was precipitated overnight at 4° C with 4M lithium acetate and ethanol. RNA was DNase treated, purified on RNEasy minicolumns (QIAGEN), and verified to be free of genomic DNA contamination by quantitative reverse transcription-mediated real time PCR (qRT-PCR). cDNA synthesis, cRNA synthesis, labeling, fragmentation, GeneChip hybridization, and scanning were performed according to specifications from Affymetrix. Triplicate chips were run for each strain at each stage. RNA and cDNA qualities were verified at each step using capillary electrophoresis (Bioanalyzer; Agilent Technologies). Gene expression values were determined using Robust Multiarray Analysis software (Irizarry et al., 2003). Relevant expression differences were identified using a modified Wilcoxon-Rank test (for details see Results). qRT-PCR analysis cDNA synthesis reactions were performed with SuperScriptII (Invitrogen), and reactions contained 5 mg RNA were purified as above. Approximately 20 ng of cDNA was used for each qRT-PCR reaction. qRT-PCR reactions were performed in an iCycler iQ (Bio-Rad), using a SYBR Green fluorophore. Control reactions were performed using actin-specific primers, and threshold cycles for all reactions were normalized using a D/DCt method. Regulatory motif analysis To identify candidate regulatory sequences in the 5′ noncoding regions of putative C. elegans homologs of p107/p130-target genes, 1 kilobase of sequence data upstream of the start codon of each gene was retrieved from C. elegans genomic information using Regulatory Sequence Analysis Tools software (http://embnet.ccg.unam.mx/rsa-tools/). Regions of DNA with low sequence complexity were masked using RepeatMasker (http://repeatmasker.org), and then searched for overrepresented motifs using the program MEME (Multiple Em for Motif Elicitation) (http://meme.sdsc.edu/meme/intro.html). This analysis was performed locally, using background models of the fourth order. Motif sequence logos were created using an online program (http://weblogo.berkeley.edu/logo.cgi). To verify the relevance of motifs returned by the software, a PWM was generated for each motif by MEME. For the E2F consensus site, this matrix was applied to all 1949 responsive genes, and RSAT software was used to examine the relative representation of the motif in genes within the list of responsive genes as compared with the entire genome. This analysis was performed with two different types of cutoff ranges, using scores obtained by applying the PWM to the relevant genes. First, successive ranges of cutoff scores were used. Score ranges were obtained using cutoff scores between the highest score obtained and sequentially lower scores (yielding sequentially larger ranges with lower cutoffs). Second, interval score cutoffs were examined. These were obtained using PWM score intervals (with a range of two between selected points). For the other reported motifs, sequences were derived and PWM were generated as above. Motifs were reported if they exhibited significant (p < 0.01) overrepresentation in genes that were in the cluster (see below) within which the motif was identified compared to the remaining non-E2F genes, overrepresentation within the non-E2F regulated genes as compared with E2F regulated genes, and overrepresentation within the non-E2F responsive genes as compared with the entire genome. The cutoff scores for these calculations were set by the lowest score obtained by applying the PWM to the genes that the motif was identified within. Clustering Gene expression levels were normalized to gene-specific relative maxima across both strains and all stages (six points per gene). Genes were then divided into ten clusters using profiles that were specific for strains and stages. Clustering was performed using MultiExperiment Viewer software (http://www.tm4.org/mev.html), with a number of different clustering algorithms tested, including Hierarchical Clustering, K-Means/K-Medians Support, Self-Organizing Maps, K-Means/K-Medians Clustering, and Quantitative Threshold Clustering. K-Means/K-Medians clustering yielded groupings of genes that were the closest to the average expression line for each cluster, so this method was used to further divide the differentially expressed genes in each cluster into subclusters according to their fold-change profile. This division was based on relative expression level of the genes in the cluster. RESULTS Time points and primary data analysis To identify genes that are responsive to LIN-35 during development, RNA was harvested from synchronized lin-35(n745) mutant and wild type (N2) worms at embryonic, L1, and L4 stages. Our specific time points were chosen in order to provide information about LIN-35 activities at several distinct developmental stages and because previous studies have implicated biological roles for LIN-35 during these stages (also see Introduction). To ensure the precise developmental synchronization of animal populations for these studies, we used several stage-specific developmental markers as opposed to chronological timing, which can be variable between strains and individual replicates (see Materials and methods). RNA was purified from each stage and used to generate cRNA, which was fragmented and hybridized to C. elegans Affymetrix GeneChips®. Three independent biological replicates were performed for each stage and expression values for each gene were generated using Robust Multiarray Analysis software (Irizarry et al., 2003). To determine relevant changes in gene expression, a modified Wilcoxon-Rank test was performed using a minimal quotient of >1.15, which has been shown to efficiently eliminate false positives and to yield results with high biological relevance (Braatsch et al., 2004; Harpster et al., 2006; Moskvin et al., 2005; Pappas et al., 2004; Zeller et al., 2005). In addition, a minimal mean- and median-fold change limit for individual genes was set at 1.5, which was stringent enough to return an analytically manageable number of genes while still retaining sufficient sensitivity. Using these criteria, a false discovery rate of <1.5% was calculated, suggesting that these applied conditions were highly effective in identifying statistically significant, differentially-regulated genes (data not shown). Overview of stage-specific transcriptional responses In this array study, a total of 1,949 genes showed altered expression in the lin-35 mutant background as compared with wild type. There were 642 differentially regulated genes at embryonic stages, 945 at L1, and 1108 at the L4 stage; a full list of these genes can be located in Fig. S1. Importantly, a number of genes previously demonstrated to be regulated by pRb–E2F complexes in multiple species, including human, mouse, and Drosophila, were amongst the targets identified (e.g. cyclin A, cyclin E, thymidylate synthase, BRCA, mcm-2, mcm-3, mcm-4, and mcm-5). Moreover, consistent with the notion that LIN-35/pRB functions primarily as a transcriptional co-repressor, the large majority of identified responsive genes were upregulated in the lin-35 null background; 81% in embryos, 70% at L1, and 64% at L4. To further validate the array findings, eleven genes with different developmental expression patterns and different biological functions were selected as targets for RT-PCR (Fig. 1
Of the total 1,306 upregulated, responsive genes, only 123 (9%) were found to be differentially regulated at all three developmental stages (Fig. 2A
Considerably fewer genes were observed to be downregulated in lin-35 mutants: 19% in embryos, 30% in L1, and 36% in L4. The increase in the number of genes specifically downregulated across developmental stages, from 79 in embryos to 301 in L4 stage worms, argues that some of this downregulation may be a result of non-specific causes. Furthermore, the degree of overlap observed in downregulated genes compared to upregulated genes (Fig. 2B With these analyses in hand, the responsive genes were divided into functional categories based on gene ontologies (GOs) and literature searches. Consistent with the general lack of overlap between certain developmental stages (Fig. 2 In embryos, a significantly larger number of genes associated with neurogenesis and neurotransmission were present as compared with larval stages (Fig. S3). This finding suggests a previously unknown role for LIN-35 in the nervous system during embryonic development. In stark contrast with larval stages, genes involved in cell cycle, DNA metabolism, and cell maintenance were not statistically overrepresented at this stage (Fig. S3). These latter results indicate that LIN-35 may not play a major role in cell cycle regulation during embryogenesis. Categorical analysis was further corroborated by evaluating stage-specific responses within the context of gene mountains (Kim et al., 2001). Mountain analysis first groups co-regulated genes, using a meta-array approach, and then attempts to ascribe functions to the mounts based on representative genes with established functions or known expression patterns. L1- and L4-stage mountain distributions showed expression patterns consistent with enrichment of meiosis and mitosis, and retinoblastoma complex members (Fig. S4B, C). Mountain analysis of embryos showed a dramatic overrepresentation of responsive genes in a mount associated with neuronal genes (Fig. S4A). These results corroborate those from those of the GO analysis and strengthen the hypothesis that LIN-35 may play an unrecognized role in the nervous system of C. elegans embryos. Two mountains (7 and 11) described to possess germline genes were also overrepresented. Genes in both mountains were strongly enriched in E2F sites (Fig. S4), consistent with the involvement of LIN-35 in the direct regulation of these genes (also see below) and with previous reports demonstrating LIN-35 repression of germline-associated genes in somatic tissues (Wang et al., 2005). Comparison of pocket protein targets from different species Genes previously demonstrated to be responsive to mammalian pRb (Finocchiaro et al., 2005) or to p107/p130 (Balciunaite et al., 2005) were used to identify best genome sequence matches from C. elegans. Genes were considered to be putative orthologs if a reciprocal search returned the original pocket-protein target in H. sapiens. Based on this approach, >50% of the mammalian pocket-protein responsive genes were found to contain likely C. elegans orthologs. Cross-referencing these genes with the list of LIN-35-responsive genes revealed that ~30% of the putative mammalian orthologs were also targets for regulation by LIN-35, a percentage that is highly statistically significant (Fig. S5B). Furthermore, similar levels of overlap were observed for both pRb- and p107/p130-target genes with the identified LIN-35-responsive genes. This indicates that LIN-35 may carry out ancestral-type functions that are common to all three mammalian family members. This finding is also consistent with the placement of LIN-35 equally distant from all three mammalian pocket proteins and both Drosophila pocket proteins and as being located in an ancestral position to all five (Fig. S5A). The majority of overlap between LIN-35 and the mammalian pocket proteins came from targets that were differentially regulated at either the L1 stage or at both the L1 and L4 stages. Polymerase subunits, replication factors, cyclins, and histones were among the overlapping genes that are targets of the human and C. elegans pocket proteins (see Fig. S6 for a full list). As has previously been noted, many of these targets are known to act in cell cycle control and DNA replication, further demonstrating a conservation of function between LIN-35 and the mammalian pocket proteins. In contrast, only nine of the 94 genes overlapping between LIN-35 and the mammalian pocket proteins were differentially regulated in embryos. This is consistent with previous data, such as the lack of conserved targets between embryonic and larval stages, further indicating distinct functions for LIN-35 in embryos that are unconnected to the cell cycle. Similar to embryos, LIN-35-responsive genes in the germline (Chi and Reinke, 2006), also failed to show high levels of overlap with the mammalian targets, indicating that LIN-35 plays a unique role in this tissue as well (Fig. S5). In both that study and in this one, responsive genes in embryos and the germline appeared to play a more prominent role in differentiation than in the extended cell cycle functions. We also note that the overlap of the mammalian pocket-protein targets with genes responsive to the Drosophila pocket proteins, though statistically significant, was much weaker than that observed for LIN-35. This may reflect more divergent functions for the Drosophila proteins or may be a reflection of the different experimental approaches used in these studies. Search for evolutionarily-conserved Rb family binding partners In order to find conserved Rb-family binding partners, the LIN-35-responsive orthologs of the mammalian p107/p130-responsive genes were searched for overrepresented motifs. Genes overlapping with p107/p130 targets were chosen instead of pRb-responsive genes because this data was derived from ChIP experiments rather than from a meta-analysis of multiple microarray experiments, thereby increasing the probability that the derived targets would be directly regulated by LIN-35. One kilobase of upstream sequence for each gene was retrieved from a C. elegans genomic database, regions of low sequence complexity were masked, and an unbiased search for overrepresented motifs was performed using MEME (see Materials and methods). This analysis returned a single motif (TTTSSCGCGC) that was enriched in both the genes that overlapped with mammals and in the entire list of lin-35-responsive genes (Fig. 3A
E2F-regulated genes in C. elegans A position-weight matrix (PWM) was derived for the putative C. elegans E2F consensus binding site (described above; see Materials and Methods). A PWM determines a weighted probability of particular nucleotides occurring at individual motif positions based on the degree of conservation. This PWM was used to search the promoter regions of the 1949 LIN-35-responsive genes. Scores were generated for each gene containing a putative E2F motif, with higher scores indicating a better match. Several genes known to function in DNA replication and mitosis (including mcm-3 and -4, pcn-1, and pri-2) obtained high scores using this matrix. The motif was found to be significantly (p<0.01) overrepresented in our list of LIN-35-responsive genes at all three developmental stages (see Materials and Methods for criteria), suggesting that this motif has biological significance. Furthermore, a strong correlation was observed between the PWM score and the probability that a given gene would be upregulated in the lin-35 null mutant background. For example, amongst the 50 best-scoring LIN-35-responsive genes from the L1 stage, all genes were observed to be upregulated in lin-35 mutants (p 0.01) compared to 70% of genes among all L1-stage responsive genes. In contrast, the presence of a high-scoring E2F motif was not found to be a reliable predictor of gene behavior in embryos, further indicating that the binding factor(s) acting through this motif are functioning differently at this stage.To generate a list of putative E2F-target genes from our dataset, several criteria were applied. First, candidate genes had to have a minimum score of 4 (based on several selective criteria, data not shown) from PWM application. Second, the nucleotides at positions 3, 5, 7, 8, and 9 had to conform to the refined motif (Fig. 3A The 452 genes identified shared several important characteristics. First, they demonstrated enrichment in cell cycle and DNA maintenance functions. Genes with this consensus site also often contained multiple copies (Fig. 3B As a means for further analysis, the putative E2F target genes were divided into 25 clusters based on their expression profiles using a K-means/K-medians clustering (KMC) algorithm (see Materials and Methods) (Fig. S8). Of these, 197 genes fell into nine clusters strongly enriched for cell cycle genes (p 0.01). The gross expression patterns exhibited by these genes in wild-type animals showed a striking similarity between the clusters (Fig. 4
It is noteworthy that many genes not previously annotated to play a role in cell cycle and replication functions were also located in these nine clusters. For example, gpr-2, a protein regulating chromosomal and spindle movements, possesses a number of high-scoring E2F sites and was upregulated at all three observed stages. TIM-1 (timeless), another protein upregulated at both larval stages, participates in the regulation of developmental timing in other systems (Gotter, 2006; Hardin, 2005; Myers et al., 1995). A number of transcription factors, most of which have not been previously connected to cell cycle control (e.g., cbp-1, mdl-1, lin-59, C18G1.2), were also found. The existence of nearly two hundred proteins in these clusters potentially expands the network of Rb/E2F cell cycle functions quite considerably. Identification of four potential non-E2F LIN-35 regulatory sites The remaining 1,497 responsive genes that did not contain consensus E2F binding sites were divided into clusters and subclusters using a KMC algorithm (see Materials and Methods). Upstream sequence data from each gene in these subclusters was retrieved, masked, and searched for overrepresented motifs (see Materials and Methods). In order to identify biologically relevant motifs amongst those returned by the software, a number of significance thresholds (p < 0.01) were applied (also see Materials and Methods). Among these, the motif had to be significantly overrepresented in the cluster compared to the list of non-E2F-responsive genes and also had to be overrepresented in the non-E2F responsive genes compared to the 452 E2F-responsive genes and to the entire C. elegans genome. This algorithm was used to identify two very precise motifs present exclusively in responsive genes that were downregulated at the L1 stage. The first motif (TWTAGATCTRGGACAGWGATAA, Fig. 5A
A more general motif (AYTGATAAVA) with a GATA core was identified in a large number of the responsive genes (Fig. 5C In addition to the motifs already mentioned, a fourth, very specific, motif was located by clustering genes that were responsive at the embryonic stage (Fig. 5D DISCUSSION In this study, a total of 1949 genes were found to be responsive to a null mutation in lin-35, the sole pocket protein family member in C. elegans. While it is somewhat surprising that these organisms can function with misregulation of upwards of five hundred genes at each developmental stage, they appear to suffer little more than a developmental lag. Their ability to survive this misregulation is possibly due to compensatory changes in the regulated expression of other genes, alterations in protein activities, and differences in polypeptide clearance. This is in marked contrast with other developmental models, such as mouse and fruit fly, where null alleles of pRb and its fly homolog, dRbF1, are lethal (Clarke et al., 1992; Du and Dyson, 1999; Jacks et al., 1992; Lee et al., 1992). Despite the lack of an overt phenotype, our analysis demonstrates a remarkable conservation of function between LIN-35/pRb and pocket proteins from other organisms. LIN-35/pRb regulates a large number of genes involved in cell cycle progression, replication, and mitosis, most of which are upregulated in the lin-35 null mutant background (Fig. 2 Our results also strongly indicate that LIN-35/Rb carries out cell cycle functions largely by acting through E2F family transcription factors, consistent with findings in other organisms (Du and Pogoriler, 2006; Korenjak and Brehm, 2005; Stevaux et al., 2005). Interestingly, the observed misregulation of genes with cell cycle functions in lin-35 mutants is much more prevalent during larval stages than in embryos, where expression levels of genes containing E2F binding motifs remain relatively unchanged, suggesting that LIN-35/Rb-E2F complexes are largely superfluous for this role in embryos. In accordance with these observations, CyclinD, which is known to act with cyclin-dependent kinases to inhibit pocket protein activity, is dispensable in worm and mouse embryos (Lukas et al., 1995; Park and Krause, 1999). Most interestingly, our results also provide compelling evidence that LIN-35/Rb may control the expression of certain cell cycle genes, specifically those encoding cyclin A family members, during embryogenesis through a mechanism that is independent of E2F binding (Fig 5D Since the E2F family of transcription factors is one of the most thoroughly studied groups of pRb binding factors, we undertook an in-depth analysis of the LIN-35/Rb responsive genes in order to further delineate the C. elegans E2F motif and to generate a high-confidence list of over 450 putative targets of the LIN-35/Rb-E2F complex in C. elegans. Of these genes, nearly 200 were clustered groups that were strongly enriched in genes with cell cycle functions. Many of these genes are directly involved in cell cycle control and replication and division processes, such as CDKs, cyclins, and helicases, and regulators of the mitosis. Interestingly, a substantial number of the differentially-regulated genes containing this site have no previous connection to these processes, and their identification may serve as a valuable resource for finding genes with previously unappreciated roles in cell cycle control and progression. It is also important to note that some known pRb/E2F-regulated genes did not make our list E2F targets. For example, ribonucleotide reductase (rnr-1), failed to make the list of responsive genes because its fold-change was only 1.4, which was below our cutoff. Furthermore, bub-1, another known pRb/E2F target that was identified as being responsive to lin-35 failed to make the list because its E2F site did not match sufficiently close to the derived consensus. We contend that the existence of false negatives among our data set was a necessary tradeoff in order to minimize the occurrence of false positives. In addition to a role in cell cycle regulation, our findings indicate that LIN-35/Rb may play a role in neurological development and neurotransmission. This hypothesized function seems to be important predominantly in embryos, since the proportion of genes annotated with these functions drops precipitously by the L1 stage. While this function was somewhat unexpected, pocket proteins have been previously implicated in the differentiation of neural cells in mammals and amphibians (Batsche et al., 2005; Ferguson and Slack, 2001; Slack and Miller, 1996), suggesting that a role in the nervous system may be conserved. Our analysis also yielded four additional motifs that may constitute target sites for LIN-35 regulation. One of these motifs was located in the upstream regions of major sperm proteins. Another motif is found in the promoter regions of heat shock proteins, which is consistent with reports of pRb playing a role in stress erythropoiesis in mammals (Spike et al., 2004). As discussed above, a third motif was located in genes differentially regulated in embryos that were annotated to be cyclin-A-related. A fourth motif, containing a GATA core, has recently been identified as the binding site for ELT-2 (McGhee et al., 2006), the major regulator of intestinal development in worms. This finding is unlikely to be the result of LIN-35/pRb–E2F transcriptional regulation of the elt-2, however, as the upstream regulatory region of this gene is devoid of consensus E2F binding sites. In agreement with this finding, an E2F-independent role of pRb in intestinal differentiation in mice has been observed (Haigis et al., 2006). Further, a region of pRb overlapping the E2F binding site (Dick and Dyson, 2003) has been shown to be tethered to another GATA transcription factor, GATA-1, resulting in repression of its activity (Rekhtman et al., 2003). Thus, a role in the regulation GATA targets by pocket proteins may also be conserved. While we do not currently know whether the identified role of LIN-35/pRb in regulating genes involved in intestinal and neuronal differentiation is direct, future experiments will address this question, as well as verifying the roles of some of the other genes predicted to function in cell cycle regulation. Regardless of whether this regulation is direct, similar findings in mammals and amphibians suggest that this role of LIN-35/pRb is biologically relevant and is highly conserved across widely disparate species. This fact, combined with the recapitulation of the regulation of homologous genes by pocket proteins in both worms and in mammals, demonstrates the utility of C. elegans as a model for determining pocket protein functions with whole organisms in vivo. 01: Fig. S1. Complete List of lin-35 Null Responsive Genes Colors codes are as follows: yellow – embryonic stage only, light green – L1 stage only, blue – L4 stage only, dark green – L1 and L4 stages, magenta – embryonic and L1 stages, purple – embryonic and L4 stages, red – all stages, pink – genes containing an E2F consensus site. Column headed “mountain” refers to mountains described in Kim et al., 2001. Click here to view.(491K, XLS) 02: Fig. S2. Overview of Transcriptome Response Expression patterns of genes responsive at (A) embryonic, (B) L1, and (C) L4 stages and their behavior at the other two stages. Each line represents the fold-change for a single gene (normalized to N2, logarithmic scale). Lines are colored red if the gene is upregulated and blue if it is downregulated at the representative stage for each panel (embryonic in panel A, L1 in panel B, and L4 in panel C). Each gene retained this color in the other two stages (e.g., L1 and L4 for panel A) for each panel, independently of the observed regulation at these stages. Click here to view.(116K, PDF) 03: Fig. S3. Categorical Distribution of lin-35 Responsive Genes at Different Stages Responsive genes from (A, E) embryonic stage, (B, F) L1 stage, (C, G) L4 stage, and (D, H) genes containing E2F sites were divided into categories according to known or putative functions. Panels A–D represent upregulated responsive genes and panels E–H represent downregulated responsive genes. Individual subcategories of genes involved in cell cycle regulation that were statistically overrepresented (p 0.01) at L1 and L4 stages include chromosome organization and biogenesis (GO: 0051276), DNA metabolism (GO:0006259), DNA repair (GO: 0006281), and DNA replication (GO: 0006260). The cell cycle category included genes involved in cell cycle regulation, DNA replication, repair, and metabolism, mitosis, meiosis, and cell division.Click here to view.(24K, PDF) 04: Fig. S4. Mountain Distribution of lin-35 Responsive Genes at Different Stages Responsive genes from (A) embryos, (B) L1 stage, (C) L4 stage, and (D) genes with E2F sites were divided into mountains (Kim et al., 2001). Percentages of responsive genes were compared to corresponding percentages of total genes from the genome within each mountain. Click here to view.(19K, PDF) 05: Fig. S5. Comparison of Pocket Protein “Regulons” from Different Species (A) A phylogenetic tree including known human, Drosophila melanogaster, and C. elegans pocket proteins was built using distance methods (topological clustering) from multiple sequence alignment. The numbers represent relative distance between the proteins. Overlap of (B) pRb-responsive genes, (C) p107/p130-responsive genes, and (D) all human pocket proteins with LIN-35 responsive genes from this study, LIN-35- and E2F/DP-responsive genes in the C. elegans germline (Chi and Reinke, 2006), and dRbF1/2-responsive genes (Dimova et al., 2003). Click here to view.(20K, PDF) 06: Fig. S6. Complete List of lin-35 Null Responsive Genes Orthologous to Human Pocket Protein-Responsive Genes Colors codes are as described in Fig. S1. Click here to view.(47K, XLS) 07: Fig. S7. Complete List of lin-35 Null Responsive Genes with E2F Binding Sites Colors codes are as described in Fig. S1. Genes are sorted by their PWM score. Genes in bold are those most likely to have cell cycle function according to cluster analysis (See Results and Fig. S8). Click here to view.(183K, XLS) 08: Fig. S8. Clusters of lin-35 Null Responsive Genes with E2F Binding Sites Clusters obtained by KMC algorithm, ranked (top to bottom) by PWM score from E2F consensus site. Clusters 23, 17, 20, 25, 7, 4, and 12 were strongly enriched in cell cycle genes. Lines are intended to facilitate visualization of cluster patterns and are not intended to represent proximal chronological steps. Click here to view.(148K, PDF) Acknowledgments We thank Oleg Moskovin for help with many aspects of our analysis, Mark Gomelsky for reagents, and James DeGregori for advice. We also thank Jim McGhee for sharing unpublished data, and Bifeng Gao and Tom Evans for assistance with microarray experiments. This work was supported by GM066868-01A1 from the National Institutes of Health. Footnotes Publisher's Disclaimer: This is a PDF file of an unedited manuscript that has been accepted for publication. As a service to our customers we are providing this early version of the manuscript. The manuscript will undergo copyediting, typesetting, and review of the resulting proof before it is published in its final citable form. Please note that during the production process errors may be discovered which could affect the content, and all legal disclaimers that apply to the journal pertain. References
|
PubMed related articles
Your browsing activity is empty. Activity recording is turned off. |
|||||||||||||
Proc Natl Acad Sci U S A. 1986 Oct; 83(19):7391-4.
[Proc Natl Acad Sci U S A. 1986]Nature. 1986 Oct 16-22; 323(6089):643-6.
[Nature. 1986]Science. 1987 Mar 13; 235(4794):1394-9.
[Science. 1987]Exp Biol Med (Maywood). 2006 Jul; 231(7):1271-81.
[Exp Biol Med (Maywood). 2006]Cell. 2004 Jan 23; 116(2):235-46.
[Cell. 2004]Oncogene. 1999 Dec 20; 18(55):7873-82.
[Oncogene. 1999]Adv Cancer Res. 2001; 82():1-54.
[Adv Cancer Res. 2001]Genes Dev. 1996 Nov 1; 10(21):2794-804.
[Genes Dev. 1996]Blood. 2004 Feb 1; 103(3):828-35.
[Blood. 2004]Mol Cell Biol. 2003 Nov; 23(21):7460-74.
[Mol Cell Biol. 2003]Nature. 1992 Sep 24; 359(6393):328-30.
[Nature. 1992]Nature. 1992 Sep 24; 359(6393):295-300.
[Nature. 1992]Nature. 1992 Sep 24; 359(6393):288-94.
[Nature. 1992]Genes Dev. 2003 Sep 15; 17(18):2308-20.
[Genes Dev. 2003]EMBO J. 1999 Feb 15; 18(4):916-25.
[EMBO J. 1999]Cell. 1998 Dec 23; 95(7):981-91.
[Cell. 1998]Dev Biol. 1977 Mar; 56(1):110-56.
[Dev Biol. 1977]Biostatistics. 2003 Apr; 4(2):249-64.
[Biostatistics. 2003]Biostatistics. 2003 Apr; 4(2):249-64.
[Biostatistics. 2003]J Bacteriol. 2004 Nov; 186(22):7726-35.
[J Bacteriol. 2004]Mamm Genome. 2006 Jul; 17(7):701-15.
[Mamm Genome. 2006]J Bacteriol. 2005 Mar; 187(6):2148-56.
[J Bacteriol. 2005]J Bacteriol. 2004 Jul; 186(14):4748-58.
[J Bacteriol. 2004]J Bacteriol. 2005 Nov; 187(21):7232-42.
[J Bacteriol. 2005]Science. 2001 Sep 14; 293(5537):2087-92.
[Science. 2001]Nature. 2005 Jul 28; 436(7050):593-7.
[Nature. 2005]BMC Bioinformatics. 2005 Dec 1; 6 Suppl 4():S14.
[BMC Bioinformatics. 2005]Mol Cell Biol. 2005 Sep; 25(18):8166-78.
[Mol Cell Biol. 2005]Development. 2006 Aug; 133(16):3147-57.
[Development. 2006]Development. 2006 Aug; 133(16):3147-57.
[Development. 2006]Development. 2006 Aug; 133(16):3147-57.
[Development. 2006]Neuroreport. 2006 Aug 21; 17(12):1229-33.
[Neuroreport. 2006]Curr Biol. 2005 Sep 6; 15(17):R714-22.
[Curr Biol. 2005]Science. 1995 Nov 3; 270(5237):805-8.
[Science. 1995]Proc Natl Acad Sci U S A. 1999 Oct 12; 96(21):11883-8.
[Proc Natl Acad Sci U S A. 1999]Dev Biol. 2004 Jul 1; 271(1):11-25.
[Dev Biol. 2004]PLoS Biol. 2004 Nov; 2(11):e352.
[PLoS Biol. 2004]Nature. 1992 Sep 24; 359(6393):328-30.
[Nature. 1992]EMBO J. 1999 Feb 15; 18(4):916-25.
[EMBO J. 1999]Nature. 1992 Sep 24; 359(6393):295-300.
[Nature. 1992]Nature. 1992 Sep 24; 359(6393):288-94.
[Nature. 1992]Mol Cell Biol. 2005 Sep; 25(18):8166-78.
[Mol Cell Biol. 2005]Cancer Res. 2003 Jul 1; 63(13):3716-23.
[Cancer Res. 2003]Genes Dev. 2003 Sep 15; 17(18):2308-20.
[Genes Dev. 2003]BMC Bioinformatics. 2005 Dec 1; 6 Suppl 4():S14.
[BMC Bioinformatics. 2005]Cancer Res. 2002 Nov 15; 62(22):6587-97.
[Cancer Res. 2002]Oncogene. 2006 Aug 28; 25(38):5190-200.
[Oncogene. 2006]Curr Opin Genet Dev. 2005 Oct; 15(5):520-7.
[Curr Opin Genet Dev. 2005]Cell Cycle. 2005 Sep; 4(9):1272-80.
[Cell Cycle. 2005]Mol Cell Biol. 1995 May; 15(5):2600-11.
[Mol Cell Biol. 1995]Development. 1999 Nov; 126(21):4849-60.
[Development. 1999]J Biol Chem. 2005 Apr 22; 280(16):16088-95.
[J Biol Chem. 2005]Neuroreport. 2001 Jul 3; 12(9):A55-62.
[Neuroreport. 2001]Dev Genet. 1996; 18(1):81-91.
[Dev Genet. 1996]EMBO J. 2004 Oct 27; 23(21):4319-29.
[EMBO J. 2004]J Biol Chem. 2006 Jan 6; 281(1):638-47.
[J Biol Chem. 2006]Mol Cell. 2003 Sep; 12(3):639-49.
[Mol Cell. 2003]Mol Cell Biol. 2003 Nov; 23(21):7460-74.
[Mol Cell Biol. 2003]Science. 2001 Sep 14; 293(5537):2087-92.
[Science. 2001]Science. 2001 Sep 14; 293(5537):2087-92.
[Science. 2001]Development. 2006 Aug; 133(16):3147-57.
[Development. 2006]Genes Dev. 2003 Sep 15; 17(18):2308-20.
[Genes Dev. 2003]