![]() | ![]() |
Formats:
|
||||||||||||||||
Copyright © 2005, Cold Spring Harbor Laboratory Press New genes with roles in the C. elegans embryo revealed using RNAi of ovary-enriched ORFeome clones 1 Department of Biology, New York University, New York, New York 10003, USA 2 Center for Cancer Systems Biology and Department of Cancer Biology, Dana-Farber Cancer Institute and Department of Genetics, Harvard Medical School, Boston, Massachusetts 02115, USA 3 Department of Genetics, Yale University School of Medicine, New Haven, Connecticut 06520, USA 4These authors contributed equally to this work. 5Corresponding author.E-mail fp1/at/nyu.edu; fax (212) 995-4015. Received August 27, 2004; Accepted December 9, 2004. This article has been cited by other articles in PMC.Abstract Several RNA interference (RNAi)-based functional genomic projects have been performed in Caenorhabditis elegans to identify genes required during embryogenesis. These studies have demonstrated that the ovary is enriched for transcripts essential for the first cell divisions. However, comparing RNAi results suggests that many genes involved in embryogenesis have yet to be identified, especially those eliciting partially penetrant phenotypes. To discover additional genes required for C. elegans embryonic development, we tested by RNAi 1123 ORFeome clones selected to represent ovary-enriched genes not associated with an embryonic phenotype. We discovered 155 new ovary-enriched genes with roles during embryogenesis, of which 69% show partial penetrance lethality. Time-lapse microscopy revealed specific phenotypes during early embryogenesis for genes giving rise to high penetrance lethality. Together with previous studies, we now have evidence that 1843 C. elegans genes have roles in embryogenesis, and that many more remain to be found. Using all available RNAi phenotypic data for the ovary-enriched genes, we re-examined the distribution of genes by chromosomal location, functional class, ovary enrichment, and conservation and found that trends are driven almost exclusively by genes eliciting high-penetrance phenotypes. Furthermore, we discovered a striking direct relationship between phylogenetic distribution and the penetrance level of embryonic lethality elicited by RNAi. A continuing challenge in contemporary biology is to extract meaningful functional data from the genome sequence. High-throughput functional genomics projects are currently ongoing in several model systems to characterize gene function on multiple levels, including gene expression patterns (Spellman et al. 1998; Reinke et al. 2000, 2004; Jiang et al. 2001; Arbeitman et al. 2002; Baugh et al. 2003), protein-protein interactions (Uetz et al. 2000; Walhout et al. 2000; Ito et al. 2001; Giot et al. 2003; Li et al. 2004), and protein structures (Chance et al. 2002; Goh et al. 2003). Phenotypic data complement these studies and are necessary to make the connection between predicted biochemical function and biological role in living cells. The development of RNA interference (RNAi) (Guo and Kemphues 1995; Fire et al. 1998) has made possible large-scale in vivo gene analyses in a functional genomic context in Caenorhabditis elegans (Gönczy et al. 2000; Piano et al. 2000, 2002; Hanazawa et al. 2001; Maeda et al. 2001; Kamath et al. 2003; Simmer et al. 2003; Rual et al. 2004) and in cell culture systems from Drosophila melanogaster and mammals (Somma et al. 2002; Kiger et al. 2003; Berns et al. 2004; Boutros et al. 2004; Paddison et al. 2004). RNAi studies enable loss-of-function analyses to be performed on genes that had not been uncovered previously by mutagenesis, thus potentially allowing phenotypic characterization of every gene in the genome. Data from different types of functional genomics projects can be combined to make inferences about gene function and protein networks on a large scale (Marcotte et al. 1999; Ge et al. 2001; Boulton et al. 2002; von Mering et al. 2002; Walhout et al. 2002). C. elegans embryogenesis is a powerful in vivo model to study gene function in the context of animal development. Its germline is highly sensitive to RNAi, facilitating the detection of genes with essential functions in the embryo. The set of genes known to be required during embryogenesis is enriched for components of pathways that drive cell proliferation and other fundamental processes, and these genes tend to be highly conserved across species (Gönczy et al. 2000; Piano et al. 2000, 2002; Kamath et al. 2003). Thus defining a comprehensive set of genes needed for C. elegans embryogenesis is an important step toward gaining general insights into mechanisms underlying animal development. Having in hand a complete list of genes required for fertility and embryogenesis is important both to systems biologists who study large networks and to classical biologists who study in detail the genes that control a single process. Despite the unprecedented scale of in vivo functional data obtained using RNAi, there are technical challenges to RNAi-based phenotypic analysis. In C. elegans, variables that can affect RNAi include the double-stranded (ds)RNA template source, the delivery method, and scoring criteria (for review, see Piano and Gunsalus 2002). Different studies have varied in their use of dsRNAs derived from genome amplicons that include intronic sequences (Gönczy et al. 2000; Piano et al. 2002; Kamath et al. 2003; Simmer et al. 2003) or from cDNA clones composed exclusively of exonic sequences (Piano et al. 2000; Hanazawa et al. 2001; Maeda et al. 2001; Rual et al. 2004). Delivery of dsRNA has been accomplished by injection (Guo and Kemphues 1995; Fire et al. 1998), by feeding animals on bacterial strains expressing dsRNA (Timmons and Fire 1998), and by soaking them in a solution containing dsRNA (Tabara et al. 1998). Despite these various approaches and the inherent complexities of phenotypic analyses, independent large-scale RNAi studies have shown a relatively high reproducibility rate (Piano and Gunsalus 2002; Simmer et al. 2003). However, just as a genome must be sequenced at several-fold depth to achieve sufficient coverage and accuracy, comparisons between different RNAi studies also suggest that multiple passes of genome-wide RNAi will be necessary to obtain a comprehensive view of the in vivo functions of each gene in C. elegans. Previous RNAi studies in C. elegans have shown that the set of genes expressed at higher levels in the ovary, relative to the soma or male germline, is enriched for genes required for embryogenesis (Piano et al. 2002). Using a partial C. elegans microarray, Reinke et al. (2000) identified 766 genes enriched in the ovary, and RNAi analysis revealed that at least 43% of these genes are required for embryogenesis (Piano et al. 2002). Recently, using a C. elegans genome-wide microarray containing about 90% of predicted genes, a total of 3015 ovary-enriched genes were identified (Reinke et al. 2004). However, only ~28% of genes tested from this set have been found to be required for embryogenesis in any published RNAi study to date, suggesting that hundreds more remain to be identified (Reinke et al. 2004). To identify new genes required for embryogenesis, we analyzed 1011 ovary-enriched genes represented in the ORFeome. Over 85% of these had been assayed by RNAi in prior studies and not shown embryonic lethality. This group of genes may thus be enriched for genes whose embryonic requirement, if any, is more difficult to detect by RNAi. We used a combination of repeated RNAi by soaking and injection, and here report the identification of 192 genes involved in egg production or embryogenesis, 155 of which have no previously reported requirement in these stages. A majority of the genes in the ovary-enriched set have now been studied by RNAi by at least three independent studies, making it the most deeply studied set of genes available in C. elegans. Using the combined data from all previous studies of this set, we now include the dimension of penetrance in our analyses of phenotypic trends among the ovary-enriched genes. We observe that trends with regard to sequence similarity, expression level, and chromosomal location are predominantly due to genes with highly penetrant embryonic lethality or sterility. We also observe a relationship between the pattern of conservation across phyla and the penetrance level of embryonic lethality elicited by RNAi. Results and Discussion Evaluation of RNAi results To explore the large-scale RNAi data, we evaluated results from five RNAi studies in C. elegans (Gönczy et al. 2000; Piano et al. 2002; Maeda et al. 2001; Kamath et al. 2003; Simmer et al. 2003) with respect to their ability to detect sterile or embryonic lethal phenotypes. Comparisons of RNAi-induced phenotypes with phenotypes arising from genetic mutation suggest that large-scale RNAi studies performed in the wild-type genetic background have a detection rate of ~75%, or a false-negative rate of ~25%, for maternal sterility or embryonic lethality (Fig. 1
An alternative way to examine the efficacy of various RNAi approaches is to compare results from different large-scale RNAi studies directly with each other (Fig. 2
Since the comparisons suggest a difference in the strength of the RNAi effect elicited by the different methods, we might expect any such differences to be reflected in the penetrance levels of phenotypes reported. Indeed, when the same gene is analyzed by different studies, we observe a consistent trend in which soaking and injection elicit equivalent or stronger phenotypes than feeding, as judged by the relative penetrance of embryonic lethal phenotypes. For example, 67% of the 46 genes reported as Class II by feeding are classified in either the same or higher penetrance category based on soaking, whereas 22% of the 168 Class II genes reported by soaking are in an equal or higher class based on feeding (Fig. 2 A stronger RNAi effect might also be expected to elicit an earlier defect relative to the time of dsRNA delivery. In all studies analyzed, the RNAi treatment was initiated at the young adult stage, so the earliest observable defect in all studies was sterility of the treated animal, followed by embryonic defects among the progeny, and finally postembryonic phenotypes in the progeny. We therefore asked whether any genes displayed sterility or embryonic lethality in one study but only postembryonic defects that were not also accompanied by earlier defects in another study (Supplemental Fig. S1). Comparing genes studied by both feeding and soaking, the total number of genes showing only post-embryonic defects is similar: 128 by feeding and 125 by soaking. However, of the 128 genes that showed only post-embryonic phenotypes in the feeding study, the soaking study found 38 (30%) to be embryonic lethal or sterile. In contrast, of the 125 genes found to have post-embryonic defects alone from the soaking study, only 10 (8%) were found to show embryonic lethality or sterility when tested by feeding. The trend is even more pronounced when comparing feeding and injection (Supplemental Fig. S1). In addition to revealing technical aspects of RNAi analyses, these comparisons indicate that pleiotropy, or the reuse of the same cellular components in different developmental contexts, is a prevalent theme in the biology of this organism. Since the largest studies in C. elegans have been carried out using feeding, the combined data indicate that many genes required for embryogenesis remain to be found, and using RNAi by either soaking or injection could enhance the ability to find them. In addition, the data suggest that comparing phenotypes arising from genetic mutation with those elicited by RNAi does not realistically assess RNAi hit rates among genes with partially penetrant embryonic lethality. We conclude from this analysis that all RNAi delivery methods can detect high-penetrance embryonic lethal or maternal sterile genes with similar efficiency, and that RNAi by soaking or injection appear to be more efficacious in discovering genes that give rise to partial-penetrance embryonic lethality. Selection of the target gene set and RNAi results The above analysis suggests that additional genes required for embryogenesis in C. elegans remain to be found by RNAi. To test this prediction we focused on ovary-enriched genes (Reinke et al. 2004) found in the ORFeome collection (Reboul et al. 2003). Since our goal was to find new genes required for embryogenesis, we specifically excluded clones corresponding to known embryonic lethal genes at the time we initiated this study and those we had previously tested by injection (Fig. 3
We optimized the RNAi by soaking protocol and validated that we could obtain results comparable to those from singlegene injections using a control gene set of 83 genes (data not shown). In addition we tested our RNAi protocol using GFP-specific dsRNA and found negligible levels of embryonic lethality (1.2%, n = 1572). In a first round of RNAi assays applied to the test set, we found a total of 569 clones that gave rise to sterility of the RNAi-treated animal (Ste) and/or embryonic lethality (Emb); 241 of these were retained as Emb or Ste upon retesting. Sequencing verified the expected identity of 201 of the 241 clones; the remaining 40 were discarded from further analysis. By mapping these clones to genes we identified 195 independent genes that gave rise to high-confidence Emb and/or Ste phenotypes, of which 158 are identified here for the first time (Fig. 3 We estimated the likelihood of false positive results by comparing our phenotypes to those derived from genetic analyses (Fig. 3B To gauge our false-negative rate we also performed comparisons with published reports from genetic analyses. Of the 25 total genes we studied that also give rise to embryonic lethal or maternal sterile phenotypes upon genetic mutation, we identified 18 (72%) and failed to report seven (hus-1, mom-1, inx-3, cup-5, puf-8, sqv-1, and lin-41). Notably, four of the genes we missed showed lethality in one round of assays only, and were thus classified as low-confidence lethals (Supplemental Table S2). Thus, although our selection criteria might be expected to preferentially include genes that are harder to identify using RNAi, our false-negative rate based on genetic comparisons was comparable to those of previous studies (Fig. 1 Independent RNAi studies found that 25 of the genes we tested elicit embryonic lethal or maternal sterile phenotypes upon RNAi. We detected 17 of these, of which 14 were detected reproducibly. In order to minimize false positives, we chose to be conservative in our assignment of phenotypes; thus we did not report embryonic lethality in our final analysis for any of the low-confidence lethal results. Our comparisons suggest that this practice elevated our false-negative rate and also indicate that we have not yet identified all the genes required for embryogenesis in our sample set. Based on our comparisons of published RNAi studies and the gene selection strategy we used in this study, we would expect to recover a nonrandom distribution of Emb or Ste genes among the different subsets of genes we assayed by RNAi. The distribution of phenotypes among the different subsets is consistent with this expectation (Fig. 4
The 158 new genes we have identified with roles in embryogenesis or egg production augment by ~9% our knowledge of the genes required for these stages of development in C. elegans. The early embryo is particularly well suited to study cell biological defects generated by the RNAi treatments and can help reveal specific roles in cell division. To further characterize the effects on embryogenesis, we repeated RNAi by injection or soaking for the 39 genes whose RNAi analysis elicited the highest-penetrance embryonic lethality. We studied the early embryonic defects by time-lapse digital Nomarski microscopy from the one-cell stage to the eight-cell stage (we obtained, on average, eight time-lapse recordings per gene, available as Quicktime movies though www.RNAi.org). Among these are five genes (csc-1, mom-5, mex-5, air-2, spn-4) whose role in early embryogenesis was already known, and we recovered the known phenotypes in our assays. For the 34 genes that were newly identified here or were not previously studied in the early embryo, we analyzed detailed phenotypes during the first 50 min of development and scored them as previously described (Piano et al. 2002; Supplemental Table S4). The range of defects observed is similar to that of previous reports (Gönczy et al. 2000, Piano et al. 2000, 2002; Zipperlen et al. 2001), and results in the identification of new genes required for specific early embryonic events such as completion of meiosis, nuclear movements, and mitosis. Among the genes identified as being required for embryogenesis, we identified four genes (C47D12.8, F14B4.2, F54F2.8, and F35F11.1) whose human homologs have roles in disease (ERCC4, HK2, PXF, and HRPT2, respectively). We recorded early embryonic phenotypes for C47D12.8 and F35F11.1 (the others showed partial-penetrance lethality). Both of these genes are homologs of human genes whose mutations are associated with tumors. Interestingly, RNAi of F35F11.1 (the HPRT2 homolog) causes defects in the proper formation of polar bodies, pronuclei, and mitotic nuclei, indicating that this gene is important for the proper completion of both meiosis and mitosis (see Supplemental Table S4 and associated supplementary recordings). Although HRPT2 was demonstrated to function as a tumor suppressor in parathyroid carcinoma (Carpten et al. 2002), HRPT2'sroleinthe cell remains essentially unstudied in mammals. Our phenotypic analysis suggests that this gene product is important for microtubule-based functions in the early embryo, providing a functional link between its cellular activity and its role in tumorigenesis. How many genes are required for embryogenesis in C. elegans? These data allow us to refine our predictions of how many genes are required for embryogenesis or egg production in C. elegans. Based on the results of this study, and considering our estimated false-negative rate, we can predict that about 45% of ovary-enriched genes are required for embryogenesis. From these data we project that a total of ~1300 genes in the ovary-enriched fraction are required for embryogenesis or oogenesis. Currently, the ovary-enriched genes contain about half the total number of genes that are known to be required for embryogenesis or oogenesis. Therefore, by extrapolation, we predict that a lower estimate for the total number of genes required for these processes in C. elegans is ~2600 genome-wide. So far, considering no false positives, a little over two-thirds (1843) have been identified through the multiple large-scale RNAi studies. The ovary-enriched genes are the most extensively tested set of genes in the genome, yet only about three-quarters (956) of the predicted total have been found so far. We anticipate that further in-depth analyses across the entire genome will reveal many more genes that play a role in embryonic development. Our data also suggest that the majority of these as yet undiscovered genes will give rise to a partially penetrant embryonic lethal phenotype. Functional trends for the ovary-enriched genes The ovary-enriched genes have been heavily studied by RNAi: of the 3015 genes in WormBase (WS110) that fit the criteria for oogenic germline-enrichment, 2944 (98%) have been tested by RNAi at least once, 2793 (93%) have been tested at least twice, and 2201 (73%) have been tested at least three times. Using the strongest reported phenotype for each gene, we characterized patterns of gene function among ovary-enriched genes. When we examined the combined data with respect to penetrance, we found that functional trends are driven mostly by genes giving rise to the highest-penetrance phenotypes. The ovary-enriched gene set displays a positive and statistically significant correlation between essential gene function in the embryo and both sequence conservation (Fig. 5A
We also examined patterns of functional annotations with respect to penetrance (Fig. 5D Penetrance level and conservation across phyla To explore these observations further, we asked whether there was a correlation between the pattern of sequence conservation across phyla and the penetrance of the embryonic lethal phenotype elicited by RNAi, and we found a striking trend (Fig. 6
A priori, we suspected that penetrance designations derived from RNAi assays may not be particularly informative, as RNAi does not always deplete gene function completely and thus may not necessarily phenocopy null phenotypes. However, our analyses revealed strong trends among the different penetrance classes. This suggests that penetrance reported by RNAi can genuinely reflect differences in gene function when animals are observed under laboratory conditions, particularly when a gene has been assayed multiple times by different groups. The group of genes showing the strongest penetrance not only contains a higher proportion of genes involved in a few critical cellular processes, but is also significantly enriched for genes that are highly conserved in other species—a trend that is not observed among other, less penetrant phenotypic categories. Together these observations suggest that highly conserved genes carrying out indispensable functions occupy critical positions in gene networks that are essential to cell survival at any stage of development, whereas less-conserved genes may contribute more peripheral, or modulatory, functions that serve to adapt these core modules to specific cellular contexts and provide plasticity to genetic networks. The observation that the proportion of eukaryotic genes (the most ancient phylogenetic class) increases with increasing penetrance and that the proportion of worm-specific genes (the most recently evolved phylogenetic class) decreases with increasing penetrance suggests that the importance of a given gene's function can change over time. We speculate that as new genes arose over history, some have gradually become fixed in more crucial roles in the networks underlying biological processes. Another way to express this is to say that those genes that are retained over very long evolutionary times are those that have taken on indispensable roles. This suggests the possibility of a general architectural motif in genetic networks, in which genes entrenched in critical positions consist largely of older components, whereas newer functions added later provide additional layers of regulation that lend robustness or adaptive functions to these core networks. Combined analysis of phenotypic, gene expression, and protein interaction data from C. elegans will help resolve questions about how gene networks are adapted to different cellular contexts, and will reveal underlying architectural principles of metazoan development. Methods Generation of dsRNA ORFeome clones were amplified by PCR using T7 primers. Amplified products were obtained for 1123 of the 1245 ORFeome clones successfully cherry-picked for this study. PCR products were used as templates for in vitro transcription by T7 RNA polymerase in a 50-μL reaction. All RNA products were verified on 1% agarose gels, then precipitated by adding 5 μL sodium acetate (pH 5.2), 2 μL Pellet Paint (Novagen), and 100 μL ethanol. The dsRNA was stored in this manner at -20°C for a maximum of 1 wk. All molecular biology procedures and soaking were performed in 96-well plates. RNAi by soaking N2 Bristol worms were synchronized using 0.8% hypochlorite solution, and L1 larvae were hatched without food in M9 medium before being plated to NGM media seeded with OP50. Animals were grown at 25°C for 40-44 h until the L4 stage. To prepare the soak, dsRNAs were spun down, washed with 70% ethanol, dried completely, and resuspended in 4 μL soaking buffer (0.25X Mg++-free M9, 3mM spermidine, 0.05% gelatin). L4 larvae were washed from plates using M9 and allowed to crawl on unseeded plates 15 min; this step was repeated. After the last incubation on unseeded plates, worms were washed three times in M9 and twice in soaking buffer. Then, 10-30 worms in 1 μL of soaking buffer were added to each tube of resuspended dsRNA. Worms were incubated for 24 h at 20°C in dsRNA and then plated to NGM seeded with OP50 to recover for 24 h at 20°C. For each gene tested, five worms were placed onto individual egg laying plates and incubated for 24 h at 20°C. Treated animals were removed from egg-laying wells, and plates were placed at 25°C for 15-18 h to allow eggs to hatch. We then counted the number of eggs and the number of larvae present in each of the five replicate broods for each gene tested. Penetrance designations Genes that reproducibly (i.e., observed in at least two broods) gave rise to 20%-50%, 50%-80%, or 80%-100% embryonic lethality were designated as LPEL (“low partial embryonic lethal”), HPEL (“high partial embryonic lethal”), or EL (“embryonic lethal”), respectively. Genes were termed VLL (“very low lethal”)if at least two broods showed ≥5% embryonic lethality or if at least one brood showed ≥10% embryonic lethality. Genes were termed MS (“maternal sterile”) if at least three broods yielded ≤10 progeny, whether accompanied by embryonic lethality or not. We called a gene NELD (“no embryonic lethality detected”) if the minimum criteria for VLL were not satisfied and a minimum of four broods was scored. Genes that did not show phenotypes upon retesting were termed LCL (low-confidence lethals) and grouped with the NELD class for all bioinformatic analyses. Those that showed higher penetrance phenotypes upon retesting were moved to the higher penetrance class. Selection of ORFeome clones and mapping to C. elegans gene products Ovary-enriched ORFeome clones were selected by matching WormPep95 (Harris et al. 2003) with C. elegans ORFeome Version 1.1 (Reboul et al. 2003) and a list of microarray assays designated as “oogenic germline-enriched” according to the selection criteria defined in Reinke et al. (2004). The intersection of these two sets identified 1582 ovary-enriched ORFeome clones, which were then filtered using all RNAi results available in WormBase (WS95) and classified into four categories: (1) clones representing genes not previously assayed by RNAi; or (2-4) previously assayed and falling into one of three phenotypic classes: (2) WT (“wild-type”), (3) postembryonic only, or (4) embryonic lethal. We selected all genes from sets 1-3 that we had not previously assayed by injection (Piano et al. 2002) for characterization in this study (see Fig. 3 After completing the RNAi analysis, we remapped this set of ORFeome clones and the PCR products used to define the ovary-enriched gene set to the latest WormBase freeze (WS110). The set of 1083 ORFeome clones for which we reported RNAi results currently maps to 1011 ovary-enriched and 16 non-ovary-enriched genes (not including alternative splice forms). Only the ovary-enriched fraction was used in global analyses of ovary-enriched gene function. A total of 53 genes were independently assayed by two or more different ORFeome clones; for 14 of these, replicate assays yielded results in different penetrance classes. RNAi results for all clones assayed are reported in Supplemental Table S1. Some ORFeome clones overlap two adjacent genes. This indicates a discrepancy between the current gene models in WormBase and sequence evidence from these ORFeome clones; each of these cases was reviewed at WormBase, and the gene models were left unchanged due to discrepancies between EST data from cDNA clones and evidence based on the ORFeome clones. For clones that extensively overlapped two open reading frames, when the RNAi assay was negative (LCL or NELD), we reported this result for each gene. In the few rare cases where a clone overlapped one ORF extensively and another by <200 base pairs, we considered only the former gene to have been reliably assayed and conservatively reported no RNAi result for that clone in association with the second gene. Wherever we had independent confirmation of such assays from clones that spanned only one of the two adjacent transcription units, the results were always consistent with these policies. Time-lapse imaging Time-lapse digital recordings were made as described (Piano et al. 2002). At least three independent RNAi-affected worms from clones giving rise to ≥80% lethality were dissected to collect the early embryos ~36-48 h after dsRNA delivery, and were mounted on a 2% agarose pad. Movies were made with a Leica microscope equipped with a 40× lens and differential interference contrast (DIC) optics. We used digital acquisition through a Cohu CCD camera driven by macros written for Openlab (Improvision) to record 300 frames at 10-sec intervals. QuickTime Pro 4.0 (Apple Computers) was used to compress all movies (medium, JPEG B compression) and prepare them for internet streaming. All results from this study, including digital movies, are accessible through http://www.RNAi.org and http://www.wormbase.org. Bioinformatic analysis To facilitate the bioinformatic analyses, we built a MySQL database containing information about genes, RNAi experiments, expression studies, sequence similarities, and other functional genomics data. Data processing and informatics analyses were performed using Perl scripts and/or SQL queries to this database. Data sources included C. elegans “Predicted_gene” and “RNAi” objects from WormBase (WS110), as well as supplementary material from publications (e.g. Simmer et al. 2003), prepublication expression data (Reinke et al. 2004), multispecies sequence data from genome centers, results of in-house BLAST analyses (described below), and RNAi data collected in this study. All RNAi results contained in this database are searchable through www.RNAi.org. RNAi data collected in this study were reported to WormBase. Functional categories were assigned using the classification system from Kamath et al. (2003). Reciprocal whole-proteome BLASTP analysis (Altschul et al. 1997) was performed using an in-house BLAST pipeline on complete predicted protein sequence sets from Saccharomyces cerevisiae (SGD; Cherry et al. 1998; updated Aug. 8, 2003; 5889 sequences), C. elegans (WormPep100; C. elegans Sequencing Consortium 1998; Harris et al. 2003; 21,383 unique sequences)], D. melanogaster (Release 3.1; Celniker et al. 2002; Flybase Consortium 2003; 18,498 sequences), Homo sapiens (IPI v2.22; http://www.ebi.ac.uk/IPI/, 29,539 sequences), and Arabidopsis thaliana (TAIR Release 4.0; Garcia-Hernandez et al. 2002; 28,786 sequences). Homologous genes between species were identified using a simple reciprocal best-hit (e-value < 1e-6) criterion. Summarized results of this analysis were transferred to the above MySQL database. Off-target genes were considered potentially inhibited by RNAi if they shared contiguous blocks of 20 nucleotides totaling at least 60 nucleotides of sequence with the primary gene target. For comparative analyses and characterization of phenotypic trends among ovary-enriched genes, all RNAi results were mapped to genes in WormBase (WS110) using the chromosome coordinates of the reagents used for each assay. Genes were placed into exclusive categories according to the most severe phenotype observed by any group. Our comparative analyses included only those phenotypic results from Simmer et al. (2003) that were reproducible [listed in Simmer et al. 2003, Supplemental Tables 1A, 1B; nonreproducible results (Supplemental Table 1C) were excluded from this comparison]. In addition, because assays from that study with a wild-type result were not reported in WormBase, we inferred them for the remainder of genes assayed by Kamath et al. (2003), since both studies used the same set of reagents. In all, we included results for 15,353 genes (excluding alternative splice forms) assayed by Simmer et al. (2003). Acknowledgments We thank Casey Roehrig, Danny Shapiro, Tony Tao, and Krishna Vijayendran for technical assistance; Wanchen Yueh, Tong Hao, and Philippe Lamesch for help with data processing; WormBase curators for reviewing gene models; Dr. Julie Ahringer for critically reviewing our manuscript; Dr. Jane Hubbard for useful insights; and anonymous reviewers who provided useful comments that improved our manuscript. This work was supported by grants from NSF (DBI-0137617, to K.C.G.) and NIH(HG003019-01, to F.P.). A.J.S. is currently funded by the NCI CPFP. The ORFeome project is supported by a grant from NCI (7R33CA81658-02, to M.V.). Footnotes [Supplemental material is available online at www.genome.org and RNAiDB (www.RNAi.org). Results from this study are also available through WormBase (www.wormbase.org).] Article and publication are at http://www.genome.org/cgi/doi/10.1101/gr.3194805. References
WEB SITE REFERENCES
|
PubMed related articles
Your browsing activity is empty. Activity recording is turned off. |
|||||||||||||||
Mol Biol Cell. 1998 Dec; 9(12):3273-97.
[Mol Biol Cell. 1998]Mol Cell. 2000 Sep; 6(3):605-16.
[Mol Cell. 2000]Development. 2004 Jan; 131(2):311-23.
[Development. 2004]Proc Natl Acad Sci U S A. 2001 Jan 2; 98(1):218-23.
[Proc Natl Acad Sci U S A. 2001]Science. 2002 Sep 27; 297(5590):2270-5.
[Science. 2002]Nature. 2000 Nov 16; 408(6810):331-6.
[Nature. 2000]Curr Biol. 2000 Dec 14-28; 10(24):1619-22.
[Curr Biol. 2000]Curr Biol. 2002 Nov 19; 12(22):1959-64.
[Curr Biol. 2002]Nature. 2003 Jan 16; 421(6920):231-7.
[Nature. 2003]Nature. 2000 Nov 16; 408(6810):331-6.
[Nature. 2000]Curr Biol. 2002 Nov 19; 12(22):1959-64.
[Curr Biol. 2002]Nature. 2003 Jan 16; 421(6920):231-7.
[Nature. 2003]Curr Biol. 2000 Dec 14-28; 10(24):1619-22.
[Curr Biol. 2000]Proc Natl Acad Sci U S A. 2001 Jul 17; 98(15):8686-91.
[Proc Natl Acad Sci U S A. 2001]Curr Biol. 2002 Nov 19; 12(22):1959-64.
[Curr Biol. 2002]Mol Cell. 2000 Sep; 6(3):605-16.
[Mol Cell. 2000]Development. 2004 Jan; 131(2):311-23.
[Development. 2004]Nature. 2000 Nov 16; 408(6810):331-6.
[Nature. 2000]Curr Biol. 2002 Nov 19; 12(22):1959-64.
[Curr Biol. 2002]Curr Biol. 2001 Feb 6; 11(3):171-6.
[Curr Biol. 2001]Nature. 2003 Jan 16; 421(6920):231-7.
[Nature. 2003]Differentiation. 2004 Mar; 72(2-3):81-91.
[Differentiation. 2004]Curr Biol. 2002 Nov 19; 12(22):1959-64.
[Curr Biol. 2002]Curr Biol. 2001 Feb 6; 11(3):171-6.
[Curr Biol. 2001]Nature. 2003 Jan 16; 421(6920):231-7.
[Nature. 2003]Development. 2004 Jan; 131(2):311-23.
[Development. 2004]Nat Genet. 2003 May; 34(1):35-41.
[Nat Genet. 2003]Curr Biol. 2002 Nov 19; 12(22):1959-64.
[Curr Biol. 2002]Nature. 2000 Nov 16; 408(6810):331-6.
[Nature. 2000]Curr Biol. 2000 Dec 14-28; 10(24):1619-22.
[Curr Biol. 2000]EMBO J. 2001 Aug 1; 20(15):3984-92.
[EMBO J. 2001]Curr Biol. 2000 Dec 14-28; 10(24):1619-22.
[Curr Biol. 2000]Curr Biol. 2002 Nov 19; 12(22):1959-64.
[Curr Biol. 2002]Curr Biol. 2001 Feb 6; 11(3):171-6.
[Curr Biol. 2001]Nature. 2003 Jan 16; 421(6920):231-7.
[Nature. 2003]Development. 2004 Jan; 131(2):311-23.
[Development. 2004]Nature. 2003 Jan 16; 421(6920):231-7.
[Nature. 2003]Nucleic Acids Res. 2003 Jan 1; 31(1):133-7.
[Nucleic Acids Res. 2003]Nat Genet. 2003 May; 34(1):35-41.
[Nat Genet. 2003]Development. 2004 Jan; 131(2):311-23.
[Development. 2004]Curr Biol. 2002 Nov 19; 12(22):1959-64.
[Curr Biol. 2002]Curr Biol. 2002 Nov 19; 12(22):1959-64.
[Curr Biol. 2002]Development. 2004 Jan; 131(2):311-23.
[Development. 2004]Nature. 2003 Jan 16; 421(6920):231-7.
[Nature. 2003]Nucleic Acids Res. 1997 Sep 1; 25(17):3389-402.
[Nucleic Acids Res. 1997]Nucleic Acids Res. 1998 Jan 1; 26(1):73-9.
[Nucleic Acids Res. 1998]Science. 1998 Dec 11; 282(5396):2012-8.
[Science. 1998]Nucleic Acids Res. 2003 Jan 1; 31(1):133-7.
[Nucleic Acids Res. 2003]Nucleic Acids Res. 2003 Jan 1; 31(1):172-5.
[Nucleic Acids Res. 2003]Nature. 2003 Jan 16; 421(6920):231-7.
[Nature. 2003]Differentiation. 2004 Mar; 72(2-3):81-91.
[Differentiation. 2004]Nature. 2000 Nov 16; 408(6810):331-6.
[Nature. 2000]Curr Biol. 2002 Nov 19; 12(22):1959-64.
[Curr Biol. 2002]Nature. 2003 Jan 16; 421(6920):231-7.
[Nature. 2003]