![]() | ![]() |
Formats:
|
|||||||||||||||||||||||||||||||||||
Copyright : © 2005 Dybbs et al. This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. Using Microarrays to Facilitate Positional Cloning: Identification of Tomosyn as an Inhibitor of Neurosecretion 1 Department of Molecular Biology, Massachusetts General Hospital, Boston, Massachusetts, United States of America 2 Department of Molecular and Cell Biology, Functional Genomics Laboratory, Helen Wills Neuroscience Institute, University of California, Berkeley, California, United States of America Gregory S Barsh, Editor Stanford University School of Medicine, United States of America *To whom correspondence should be addressed. E-mail: kaplan/at/molbio.mgh.harvard.edu Received January 13, 2005; Accepted February 1, 2005. This article has been cited by other articles in PMC.Abstract Forward genetic screens have been used as a powerful strategy to dissect complex biological pathways in many model systems. A significant limitation of this approach has been the time-consuming and costly process of positional cloning and molecular characterization of the mutations isolated in these screens. Here, the authors describe a strategy using microarray hybridizations to facilitate positional cloning. This method relies on the fact that premature stop codons (i.e., nonsense mutations) constitute a frequent class of mutations isolated in screens and that nonsense mutant messenger RNAs are efficiently degraded by the conserved nonsense-mediated decay pathway. They validate this strategy by identifying two previously uncharacterized mutations: (1) tom-1, a mutation found in a forward genetic screen for enhanced acetylcholine secretion in Caenorhabditis elegans, and (2) an apparently spontaneous mutation in the hif-1 transcription factor gene. They further demonstrate the broad applicability of this strategy using other known mutants in C. elegans, Arabidopsis, and mouse. Characterization of tom-1 mutants suggests that TOM-1, the C. elegans ortholog of mammalian tomosyn, functions as an endogenous inhibitor of neurotransmitter secretion. These results also suggest that microarray hybridizations have the potential to significantly reduce the time and effort required for positional cloning. Synopsis Genetic screens are commonly used to figure out which genes are involved in a biological process. The first step in a genetic screen is to isolate mutant animals that are defective in the process being studied. The next step is to find which of the thousands of genes has the mutation that causes the observed defect. Positional cloning, the tried-and-true method for locating mutations, is slow and expensive. The authors propose using microarray hybridizations to speed the process. Their approach relies on the fact that a large fraction of the mutations found in screens are the results of premature stop codons, a particularly severe type of mutation. In cells, messages containing premature stop codons are rapidly destroyed by a protective pathway, called nonsense-mediated decay, thus making them directly detectable by microarray hybridization. The authors apply this strategy retrospectively to known mutants in Caenorhabditis elegans, Arabidopsis, and mouse. They identify two uncharacterized mutations in C. elegans, including one, tom-1, found in a forward genetic screen for enhancers of neurotransmission. Interestingly, their characterization of tom-1 mutants suggests that the highly conserved protein tomosyn inhibits neurotransmission in neurons. This study shows that microarray hybridizations will help reduce the time and effort required for positional cloning. Introduction Forward genetic screens have been traditionally utilized in model systems (e.g., Caenorhabditis elegans, Drosophila, yeast, and Arabidopsis). More recently, large-scale screens have been undertaken in vertebrate systems such as zebrafish [1,2] and mouse [3–5]. Mutations isolated in genetic screens are typically identified by positional cloning. The difficulty posed by positional cloning is determined by the size of the genome, the recombination rate, and the difficulty of assessing the mutant phenotype. For example, the mouse genome comprises 3,600 centimorgans (cM) and 3 × 109 base pairs. The ultimate goal of a typical positional cloning project is to analyze a sufficient number of recombinants to map the mutation to a small genetic interval (typically approximately 0.1 cM). Once a mutation has been precisely mapped, gene identification is typically achieved by a variety of strategies: direct sequencing of the region (100 kb in the mouse), candidate gene testing, or screening for informative alleles (e.g., microdeletions). The difficulty of a particular positional cloning can be compounded by the nature of the mutant phenotype. This problem is particularly acute for behavioral mutants, which often have phenotypes that must be scored in multiple trials, or in populations of animals. Together, these issues conspire to make traditional positional cloning a significant and costly bottleneck. To circumvent these difficulties, several new technologies have been developed to isolate mutations by reverse genetics. Reverse genetic strategies include use of insertional mutagens [6−10], PCR screens for randomly induced deletions [11], homologous gene targeting [12,13], and physical or genetic detection of point mutations in sequenced genes [14,15]. While reverse genetic strategies circumvent the positional cloning bottleneck, these approaches also have limitations. Mutations isolated by reverse genetics often lack obvious phenotypic defects (e.g., because they are in functionally redundant genes). Phenotypic differences observed in mutants isolated by reverse genetics can be confounded by other mutations in the genetic background, particularly since animals are typically heavily mutagenized in these strategies. For these reasons, it would be useful to develop methods that would allow more rapid characterization of mutations isolated in forward genetic screens. We wondered whether microarray expression data could facilitate the identification of mutations responsible for behavioral defects isolated in forward genetic screens. It is well established that nonsense mutations result in the degradation of the mutant messenger RNA (mRNA) via the nonsense-mediated decay (NMD) pathway. A surveillance mechanism common to all eukaryotes, NMD serves as a quality control system to destroy faulty mRNAs whose translation would lead to an inappropriately truncated protein [16−18]. NMD protects cells by eliminating inactive or potentially deleterious dominant negative proteins that are the result of somatic mutation, transcriptional mistakes, or splicing errors. It has been proposed that NMD could be used as a basis to identify nonsense mutations in cell lines [19,20]. In principle, a nonsense mutation in mutant animals could be identified using microarray hybridizations to find transcripts with decreased abundance. In practice, microarray data alone are unlikely to be sufficient to identify nonsense mutations. In addition to the expected statistical noise associated with microarray experiments, there are likely to be transcriptional changes in other genes that are caused by the mutation being studied. The most powerful cloning approach would thus be one that uses microarray data together with traditional mapping information. Here, we present evidence supporting the feasibility and general utility of this strategy. Results To test the feasibility of using microarrays to facilitate positional cloning, we will address four questions. (1) How frequently are nonsense alleles recovered in forward genetic screens? (2) Are microarray hybridizations sensitive enough to detect the decreased abundance of a nonsense mutant transcript? (3) Can microarray hybridizations be used to identify an uncloned behavioral mutant in C. elegans? (4) Is this microarray-based strategy applicable to other model organisms? Nonsense Alleles Represent a Large Fraction of C. elegans Mutations The utility of microarrays in cloning depends on the frequency with which nonsense alleles are recovered in phenotypic screens. Since 15 of the 61 amino acid–encoding codons are mutable to stop codons by a single base-pair substitution, nonsense alleles are likely to represent a large fraction of all alleles recovered after random mutagenesis with agents that increase the rate of nucleotide misincorporation. To assess the prevalence of nonsense alleles isolated following random mutagenesis, we compiled a list of sequenced C. elegans mutant alleles by downloading information from WormBase and conducting targeted literature searches (Figure 1
We calculated the percentage of nonsense alleles recovered for each of the 117 genes in our dataset with three or more characterized alleles (Figure 1 Proof of Principle: mec-3 and unc-43 CaMKII Mutations Are Detectable by Microarray Are microarray hybridizations sensitive enough to detect changes in mutant transcript abundance due to a nonsense lesion above the global variation in gene expression between mutant and control strains? Some potential sources of variance in gene expression include random fluctuations in gene expression [23,24], uncontrolled differences between the mutant and control populations (e.g., differences in developmental stage or physiological status), and differences in genetic backgrounds [25,26]. Perhaps the most important potential limitation is changes in gene expression that are a secondary consequence of a mutation. This could be particularly problematic for mutations in genes encoding transcription factors or other components of signal transduction cascades, the loss of which would be expected to alter the expression of many downstream genes. To address some of these concerns, we examined the large collection of microarray experiments used to build a whole-genome expression profile for C. elegans [27]. Most of these experiments, which were done with printed microarrays, were designed to identify gene-expression profiles associated with various developmental programs or specific tissues. However, one set of experiments analyzed changes in gene expression in mutants lacking the MEC-3 transcription factor (see Materials and Methods) [28]. The mec-3(e1338) allele corresponds to a W69Stop mutation, and homozygous animals carrying this mutation are touch-insensitive [29,30]. Using this dataset, we classified genes as differentially expressed in mec-3(e1338) based on two criteria: average fold-change in expression level and statistical significance using a Student's t-test. We constructed a volcano plot with the log2(fold-change) on the x-axis and negative log10(p-value) on the y-axis [31]. This provides a useful way to visualize differentially expressed genes—those whose expression level is down (negative on the x-axis) and that show high statistical significance (large on the y-axis). Seventy genes were identified as having significantly reduced expression in mec-3(e1338), using fold-change greater than −1.0 (log2 scale) and p < 0.01 as thresholds for decreased expression (Figure 2
To further address the sensitivity of microarray-assisted cloning, we analyzed changes in gene expression observed in KP3365 unc-43(n1186) mutants. The unc-43 gene encodes type II calcium- and calmodulin-dependent protein kinase (CaMKII), which is broadly expressed in the worm nervous system as well as in muscles and in the intestine [33]. This provides another demanding test case because CaMKII plays a pivotal role in calcium-mediated signaling in neurons, and unc-43 mutations are known to cause changes in the expression of other genes [34]. The n1186 allele corresponds to a Q67Stop mutation, and homozygous animals carrying this mutation have relatively subtle behavioral defects [33]. We hybridized total RNA isolated from wild-type and KP3365 unc-43(n1186) CaMKII mutant animals to the Affymetrix C. elegans GeneChip (Dataset S1). Using fold-changes greater than 0.5 (log2 scale) and p < 0.01 as thresholds, we found 20 probesets with decreased expression in KP3365 unc-43(n1186) CaMKII mutants as compared to wild-type controls (Figure 3
Identification of a hif-1 Polymorphism in the KP3365 Strain One potential limitation of our strategy is that mutant strains may contain multiple mutations, some of which do not contribute to the mutant phenotype. This will be particularly true in heavily mutagenized strains, and in cases where the mutants have not been extensively backcrossed with wild-type strains. Therefore, we examined the KP3365 unc-43(n1186) CaMKII hybridization data for other genes with significantly reduced expression. Interestingly, the gene with the largest decrease in expression in KP3365 unc-43(n1186) animals was not unc-43; rather, it was hif-1 (Figure 3
In aerobic conditions, the HIF-1 protein is constitutively degraded by the von Hippel–Lindau ubiquitin ligase [36–38]; consequently, the hif-1(nu469) mutation would presumably be phenotypically silent in normal growth conditions. The hif-1(nu469) mutation was not present in several other strains containing the unc-43(n1186) allele, suggesting that this mutation occurred spontaneously during culturing in our laboratory (data not shown). In summary, KP3365 animals carry a previously uncharacterized polymorphism in hif-1, which we identified based solely on our microarray hybridization results. Identifying such polymorphisms may allow researchers to explain unexpected aspects of mutant phenotypes of particular strains. Using Microarrays to Identify a Mutation in Tomosyn, an Inhibitor of Neurotransmitter Secretion To further address whether microarray hybridizations can be used to identify uncharacterized mutations, we analyzed a behavioral mutant that was isolated in a forward genetic screen for inhibitors of neurotransmitter secretion. Neurotransmission serves as the primary mode of communication between cells in the nervous system. Neurotransmitters such as acetylcholine (ACh) are secreted by presynaptic nerve cells, and activate receptors on postsynaptic cells. Behavioral and pharmacological screens in C. elegans have proven to be a powerful approach to identifying molecules involved in synaptic transmission and nervous system function [39–42]. The cholinesterase inhibitor aldicarb is widely used as a means to monitor ACh secretion at the C. elegans neuromuscular junction [41,43–46]. In the presence of aldicarb, ACh accumulates in the synaptic cleft, causing the body wall muscles to become hypercontracted and animals to become paralyzed. Mutations that increase ACh secretion cause hypersensitivity to the paralytic effects of aldicarb [46–48]. To identify negative regulators of ACh secretion, we used hypersensitivity to aldicarb as the basis for a forward genetic screen. One of the strongest mutations recovered in our screen was nu468 (filled squares in Figure 5
We meiotically mapped nu468 to Chromosome 1, which contains approximately 2,700 genes. We then hybridized RNA from KP3293 nu468 animals to the C. elegans GeneChip, comparing the hybridizations to wild-type hybridizations as previously described (Dataset S1). Six probesets showed significantly decreased expression in KP3293 animals (fold-change < −0.5, p < 0.01) (Figure 6
We performed several experiments to confirm that the tom-1(nu468) mutation caused the aldicarb hypersensitivity observed in the KP3293 strain. First, we tested a second tom-1 allele, ok285, which was generated by the C. elegans Gene Knockout Consortium (http://celeganskoconsortium.omrf.org). This allele, tom-1(ok285), encodes a mutant protein lacking 202 residues in a highly conserved region, and homozygous tom-1(ok285) mutants exhibited aldicarb hypersensitivity similar to that observed in KP3293 tom-1(nu468) mutants (triangles in Figure 5 Tomosyn is an approximately 1,100–amino acid protein with two functional domains: (1) the C-terminal coiled-coil domain, which shares homology with synaptobrevin and has been shown to bind to syntaxin and SNAP-25, and (2) the approximately 600-residue WD40-rich N-terminal region, which shows strong homology to the fly tumor suppressor protein Lethal giant larvae (Figure 7 Generalizability of Microarray-Assisted Cloning Since NMD functions in all eukaryotes [16,18], we wondered whether our strategy could be applied to other model systems. To address this, we conducted a retrospective analysis of microarray data from mutants in other organisms. We searched the public microarray databases for experiments in which researchers had analyzed mutants in other organisms. Specifically, we looked for hybridizations where mutant RNA had been compared to wild-type RNA and where the mutation was the result of a premature stop codon (and thus a predicted NMD target). For practical reasons, we also required that the mutant gene be represented and detectable on the microarray. Surprisingly, we found that only two experiments met these criteria. The first was a study of pmr4 (powdery mildew resistant 4), a cell-wall biosynthesis gene in Arabidopsis that confers pathogen resistance when mutated. The lesion used in the microarray studies was a premature stop codon in the second exon (PMR4 dataset) [53]. The second was a study of the mdx mouse, an animal model of Duchenne muscular dystrophy, with a premature stop codon in exon 23 of dystrophin (MDX dataset) [54,55]. In both of these studies, the authors knew the nature of the mutation and were attempting to find genes whose expression changed in the mutant background. For these two examples, we asked retrospectively whether hybridization data would have aided identification of the mutant genes. To do this, we reanalyzed the PMR4 and MDX data as described for the C. elegans mutants and constructed volcano plots (Figure 8
Discussion We present evidence demonstrating the utility of microarray hybridizations in facilitating the rapid identification of mutations isolated in forward genetic screens. Several results suggest that this technique will be widely applicable. This strategy was successful in identification of C. elegans, mouse, and Arabidopsis mutations. Mutations were successfully identified in both transcription factors and signal transduction components, which are likely to be the most challenging cases. Mutant genes were successfully detected using data obtained with both printed arrays and Affymetrix chips. And finally, we were able to identify two previously uncharacterized C. elegans mutations with this approach. Will this strategy work for genes that regulate the expression of many other genes? We provide examples for successful identification of three genes that directly affect transcription—two transcription factors (mec-3 and hif-1) and a protein kinase that regulates neuronal gene expression (unc-43). Although 70 genes were differentially expressed in mec-3 mutants, only three differentially expressed genes mapped within a 2-cM interval containing mec-3 and 100 other genes. Therefore, microarray hybridizations would have facilitated identification of mec-3. The success rate for this strategy depends on three factors: (1) the fraction of genes that are detectable by microarray, (2) the frequency of nonsense alleles recovered in screens, and (3) the efficiency with which nonsense mutated mRNAs are degraded by NMD. In our hybridizations using mRNA prepared from whole worms, 80% of the genes on the array showed detectable expression. In cases where a mutation affects a particular cell type or tissue, the likelihood of detecting a particular transcript can be increased using RNA isolated from that tissue or cell type [28,56]. What fraction of newly isolated mutations will be nonsense alleles (see Figure 1 What fraction of nonsense alleles are efficiently targeted by the NMD machinery? In each of the six examples we present above, this was the case, but how often do nonsense transcripts evade degradation by the NMD machinery? Rules governing NMD recognition of mutant mRNAs have been described in yeast, C. elegans, and mammals [16–18,59–63]. The NMD machinery distinguishes premature stop codons from natural stops using the exon-junction complexes that are deposited at exon–exon boundaries by the spliceosome. Stops that are greater than 50–55 base pairs upstream of the last exon-junction complex are recognized by the NMD machinery as premature and are efficiently targeted for destruction [61,64]. Prior studies have shown that 100% (n = 23) of C. elegans nonsense mutations were susceptible to NMD surveillance (measured either by mRNA abundance or by suppression of mutant phenotypes by NMD pathway mutations) [17]. Of these, six mutations (26%) were judged to be only partially targeted by NMD. Based on these examples and those we describe here, we estimate that 75%–100% of nonsense alleles in C. elegans would show a detectable decrease in mRNA levels. Considering all three of these factors (gene detection by microarray, nonsense allele frequency, and NMD efficiency), we expect microarray-assisted cloning to be successful in 25%–30% of positional clonings (assuming only one allele is hybridized per gene). The principal costs of positional cloning are those incurred in isolating, phenotyping, and genotyping a sufficient number of recombinants (i.e., informative meioses) to map a mutation to a small genetic interval. A typical positional cloning requires 2,000–10,000 informative meioses. Our results suggest that microarray hybridizations can significantly reduce the number of meioses required for positional clonings. In five of six cases, microarray data in conjunction with chromosomal linkage data were sufficient for gene identification. Therefore, while we expect that this strategy will be useful in many genetic systems, microarray-assisted cloning promises to provide the greatest value in organisms such as mouse and zebrafish, where long generation times and large genomes make meiotic mapping more time consuming and costly. Furthermore, microarray-assisted cloning may be particularly useful in cases where mutant phenotypes are more difficult to assess, such as behavioral mutants, or incompletely penetrant or complex (i.e., multigenic) traits [65,66]. Given the effort and challenges involved in meiotic mapping and the relative ease and speed of microarray hybridizations, we believe that this microarray-based strategy provides significant benefit, even though it will be successful in only a subset of cases. Can microarrays be used to aid the cloning of human disease genes? One-third of human disease genes are predicted to be caused by nonsense lesions or mutations that decrease transcript abundance [21,22]. Furthermore, nonsense mutant transcripts encoded by disease genes such as BRCA1 and hepatocyte nuclear factor 1α have been shown to be effectively degraded by NMD [67,68]. Given the enormous time and expense involved in mapping genes in humans, the strategy described here could provide a valuable addition to the toolbox of human geneticists. Materials and Methods Allele analysis. Information about 930 recessive single base-pair substitution alleles was downloaded from WormBase (http://www.wormbase.com), Release WS123 (see Table S1). Information about 82 additional alleles was obtained through literature searches. Based on their molecular description, 943 alleles were classified as either NMD targets (nonsense) or non-NMD targets (missense). Excluded from the analysis were 69 alleles that could not be definitively classified. These alleles included those with incomplete molecular descriptions and those with lesions such as splice site mutations that could not be classified without further characterization. RNA sample preparation. Animals analyzed in microarray experiments were first synchronized by hypochlorite treatment and arrested at the first larval stage by incubation for 22 h in M9 [27,69]. Animals were then grown at 20 °C on 15-cm NG HB101 plates until the fourth larval stage (approximately 46 h). Animals were washed, harvested in M9, and then flash-frozen in liquid nitrogen and stored at −80 °C. Total RNA was prepared by Trizol extraction (Invitrogen, Carlsbad, California, United States). Microarray target preparation and hybridization. Targets were prepared and hybridized at the Harvard Medical School Biopolymer Facility. Starting with 10 μg of total RNA, first-strand cDNA was synthesized as described in the Affymetrix (Santa Clara, California, United States) expression technical manual. Briefly, 10 μg of RNA was added to 1 μl of 50 μM T7 primer (HPLC purified) (Integrated DNA Technologies, Coralville, Iowa, United States) in a volume of 9 μl. Then 1 μl of each Poly A spike control (5 nM) was added to the RNA, and T7 was added as an internal control. Poly A spikes were created from Poly A–tailed genes from Bacillus subtilis cloned into Stratagene (La Jolla, California, United States) pBluescript as an XhoI-to-NotI insert 5′–3′, respectively, and commercially available through ATCC (Manassas, Virginia, United States) (see Affymetrix technical expression manual). The RNA, T7, and Poly A spike controls were heated to 70 °C for 10 min and then placed on ice for 5 min. The RNA, T7, and Poly A mix was then heated to 42 °C. Then 4 μl of 5× first-strand buffer (Invitrogen), 2 μl of 0.1 M DTT (Invitrogen), 1 μl of 10 mM dNTP (Invitrogen), and 1 μl of Superscript II, RNase H− was added to the RNA and incubated at 42 °C for 1 h. Double-strand DNA was created via a replacement reaction under the following conditions. To the 20-μl first-strand reaction was added 91 μl of water, 30 μl of second-strand buffer (Invitrogen), 3 μl of 10 mM dNTP (Invitrogen), 1μl of Escherichia coli DNA ligase (Invitrogen), 1 μl of RNase H (Invitrogen), and 4 μl of E. coli DNA polymerase (Invitrogen). This 130-μl second-strand mix was added to the first-strand reaction and incubated at 16 °C for 2 h, then 2 μl of T4 DNA polymerase was added for 5 min at 16 °C, then the reaction was phenol-chloroform-extracted using 150 μl of phenol chloroform isoamyl alcohol (pH 7) (Ambion, Austin, Texas, United States), and the organic and aqueous phases were separated using a 1.5-ml phase lock heavy gel (Brinkmann Eppendorf, Westbury, New York, United States). The 150-μl aqueous layer was removed and precipitated in 375 μl of 100% ethanol and 15 μl of 3 M sodium acetate (Sigma, St. Louis, Missouri, United States). The cDNA pellet was isolated using an Eppendorf (Hamburg, Germany) 5415C centrifuge at room temperature for 20 min. Ethanol was aspirated and the pellet washed in 75% ethanol, centrifuged for 10 min, and aspirated. The cDNA pellet was rehydrated using 22 μl of nuclease-free water (Ambion) and used with the BioArray HighYield RNA Transcript Labeling Kit T7 (Enzo Life Sciences, Farmingdale, New York, United States). The resulting biotinylated cRNA probes were purified using RNeasy columns (Qiagen, Valencia, California, United States) and quantitated using A260 with an Agilent (Palo Alto, California, United States) 8453 spectrophotometer. Then 15 μg of labeled probe was fragmented with 5× fragmentation buffer (see Affymetrix technical manual) and combined with hybridization controls (Affymetrix), herring sperm DNA (Promega, Madison, Wisconsin, United States), and BSA (Invitrogen) to create 300 μl of hybridization mix. Of this, 200 μl was added to the Affymetrix C. elegans GeneChip. Hybridization was done in a GeneChip Hybridization Oven 320 for 16 h at 45 °C, processed on an Affymetrix Fluidics Station 400 using double amplification staining (see Affymetrix technical manual), and washed using fluidics protocol EukGE-WS2v4. The GeneChips were then scanned on a Hewlett-Packard (Palo Alto, California, United States) GeneArray Scanner. Public datasets. Descriptions of all available hybridizations in the C. elegans whole-genome expression profiles were downloaded from the Stanford Microarray Database (http://genome-www5.stanford.edu). These were then searched to find direct mutant versus wild-type comparisons. The 368 hybridizations that were publicly accessible included a large number of developmental time courses, aging experiments, and heat-shock and tissue-specific expression profiles (see Figure 2 Microarray data analysis. For Affymetrix data, probesets were first filtered to eliminate those that showed no detectable signal. A threshold of 32 was used for the C. elegans and Arabidopsis data. A threshold of 256 was used for the mdx data because these data showed significantly higher signals than the other datasets. This is most likely because the RNA for these experiments was prepared from a single tissue (mouse skeletal muscle), as opposed to the C. elegans and Arabidopsis RNA, which was derived from the whole organism. For printed arrays, only spots that showed detectable signal (mean signals greater than 1.5 standard deviations above background) were included in the analysis. Probes were classified as differentially expressed based on two criteria: fold-change and statistical significance. For Affymetrix data, fold-change was calculated as the average expression in the mutant divided by average expression in wild-type. For printed arrays (mec-3), fold-change was calculated by averaging the expression ratio in each of the mutant–versus–wild-type replicate hybridizations. This ratio provides a measure of the magnitude of expression difference between mutant and wild-type samples. To assess the statistical significance of expression differences, we compared the replicate expression values in the mutant hybridizations to replicate expression values in the wild-type hybridizations using a Student's t-test, and calculated a p-value. We then constructed a volcano plot with the log2(fold-change) on the x-axis and negative log10(p-value) on the y-axis [31]. Cutoffs for differential expression were based on shape and distribution of individual volcano plots. Raw image files were converted to probeset data (.cel files) in Microarray Suite (MAS 5.0). The nine probeset data files were normalized together and expression values were determined using the Robust Multi-chip Average (RMA) method as implemented in RMA Express (http://stat-www.berkeley.edu/~bolstad/RMAExpress/RMAExpress.html). Subsequent analysis was done using the R statistical computing package (http://www.r-project.org) and the Bioconductor libraries (http://www.bioconductor.org). Graphs were produced in Igor Pro 4.0 (WaveMetrics, Lake Oswego, Oregon, United States). Probeset annotations were downloaded from the Affymetrix Web site (http://www.affymetrix.com). Molecular characterization of hif-1(nu469). Using RNA prepared from KP3365 and wild-type animals, first-strand cDNA was synthesized using a primer specific for the 3′ UTR of hif-1 (Invitrogen). The hif-1 gene was then amplified by PCR from this cDNA and sequenced. Isolation and mapping of the tom-1 mutation. The tom-1(nu468) allele was isolated in an ethyl methane sulfonate screen for mutants that displayed hypersensitivity to aldicarb. F2 progeny of mutagenized animal were transferred to agar plates containing 0.5 mM aldicarb (Chem Service, West Chester, Pennsylvania, United States). After 1 h, a time point at which all wild-type worms were still moving, paralyzed animals were transferred to separate plates and rescreened for aldicarb sensitivity in subsequent generations. nu468 was determined to be recessive and was mapped to Chromosome 1 using conventional meiotic mapping. Analysis of aldicarb sensitivity. Aldicarb sensitivity was assessed essentially as described [45,46]. Briefly, for each experiment, 20 to 25 animals were transferred to agar plates containing 1 mM aldicarb (Chem Service). Paralysis was assessed every 10 min by prodding each animal with a platinum wire. Data from independent trials were averaged and used to calculate standard error. All experiments were conducted blind with respect to the genotype of the animals. Molecular characterization of tom-1(ok285). Genomic sequence from VC223 animals, available on WormBase, shows a deletion of 1,580 nucleotides that removes all of exons 11–13 and part of exon 10. To characterize the effect of this lesion on the tom-1 gene product, we purified RNA from VC223 animals and amplified the mutated tom-1 mRNA by RT-PCR (Invitrogen). Sequencing revealed that the genomic deletion results in an in-frame lesion in the mRNA, removing 606 nucleotides of coding sequence, and adding 23 nucleotides of intronic sequence and a 490-base-pair alternative exon from isoform C of tom-1 that is located just downstream of the deletion. Rescue of tom-1(nu468). tom-1(nu468) was rescued using the full-length cDNA of the major splice form of tom-1 (M01A10.2a) under the promoter of unc-17 synaptic vesicle ACh transporter. A transgenic strain was isolated by microinjecting the rescuing plasmid at 100 ng/μl using pttx-3::dsRed as a marker into nu468. During characterization and rescue of tom-1, we discovered that the start site and first exon were incorrectly predicted and described in WormBase. We identified the correct start site and initial two exons of tom-1 by performing RT-PCR using a primer complementary to the trans-splice acceptor (SL1). This corrected version of tom-1 shows much better alignment to the N-terminus of mammalian homologs (see Figure S2). Dataset S1: Microarray Expression Data from Wild-Type, unc-43(n1186), and tom-1(nu468) Animals (2 MB TDS) Click here for additional data file.(1.9M, tds) Figure S1: Probeset Alignments to unc-43 CaMKII Isoform H (K11E8.1h) Target sequences were downloaded (http://www.affymetrix.com) and aligned to unc-43 CaMKII isoform H. Since a majority of genes on the C. elegans GeneChip are represented by one probeset, unc-43 represents an atypica1 case. To explain this, it is useful to consider the history and design of the C. elegans GeneChip (see http://www.affymetrix.com/support/technical/datasheets/celegans_drosophila_datasheet.pdf). The targets for this GeneChip were designed by Affymetrix based on more than 18,800 Sanger Center predicted transcripts from the December 2000 genome sequence, as well as 2,300 3' EST clusters and 300 GenBank mRNAs (Release 121). Despite efforts to eliminate redundancy, there is not a strict one-to-one correspondence between the current set of genes and the probesets on the GeneChip. Our analysis indicates that 13% of the genes on the GeneChip are represented by more than one probe. Even so, having unc-43 CaMKII represented by seven probesets is an unusual situation. However, only two of these probesets (193459_s_at and 193463_s_at) are annotated as corresponding to the unc-43 CaMKII mRNA transcript according to annotations of the C. elegans GeneChip provided by Affymetrix in the March 28, 2003, update (downloadable at http://www.affymetrix.com). During our examination of the eight candidate probesets that showed decreased expression and were on Chromosome 4, we discovered five additional probesets that corresponded to unc-43 CaMKII. Four of these probesets (172058_x_at, 175820_s_at, 175821_s_at, and 175824_s_at) were based on GenBank sequences, and one (187759_s_at) was based on a predicted open reading frame, Y43C5B.1, that was part of the genome as of December 2000 but has since been shown to correspond to the 5′ UTR of unc-43 CaMKII. These additional probesets provided a serendipitous blind control, since we were not aware of their existence until they appeared on our candidate list from the hybridization. There is also an additional (eighth) probeset (173423_at) described in the Affymetrix annotation as corresponding to unc-43 CaMKII. However, based on the current gene model, 173423_at aligns to the intron between exons 11 and 12. As would be expected, this probeset shows no detectable expression and thus was not considered in our analysis of the unc-43 CaMKII mutant. (33 KB PDF) Click here for additional data file.(33K, pdf) Figure S2: Alignment of TOM-1 to Human and Rat Tomosyn Sequences of human and mouse tomosyn were downloaded from Ensembl (http://www.ensembl.org). The multiple sequence alignment was performed using T-Coffee (http://www.ch.embnet.org/software/TCoffee.html). Alignment output was produced using GeneDoc (http://www.psc.edu/biomed/genedoc). (104 KB PDF) Click here for additional data file.(104K, pdf) Table S1: List of Missense and Nonsense Alleles (51 KB PDF) Click here for additional data file.(52K, pdf) Accession Numbers The National Center for Biotechnology Information Gene Expression Omnibus (GEO, http://www.ncbi.nlm.nih.gov/geo) accession number for the data generated by the authors and discussed in this publication is GSE2210. The GEO accession numbers for datasets downloaded by the authors and discussed in this paper are mdx mouse (GDS236), mdx mutant (GDS236), PMR4 Arabidopsis (GDS417), and PMR4 mutant (GDS417). The GenBank (http://www.ncbi.nlm.nih.gov/Genbank) accession numbers for genes and gene products discussed in this paper are tom-1(ok285) (AY912103) and tomosyn (AY912102). The Ensembl (http://www.ensembl.org/) accession numbers for genes and gene products discussed in this paper are human tomosyn (ENSP00000179882) and mouse tomosyn (ENSRNOP00000018806). Acknowledgments We thank K. Vranizan for statistical and programming advice; T. Rector for Affymetrix sample preparation and hybridizations; J. Rine, D. Altshuler, V. Mootha, S. McCarroll, and C. Murphy for helpful discussions; T. Speed for statistical input; P. Juo, L. Drier, J. Dittman, and I. Ruvinsky for reading this manuscript; other members of the Kaplan and Ruvkun labs for suggestions; K. Scott for providing a fantastic foster lab for MD in Berkeley; and Gregoire for sustenance. Some of the strains described here were provided by the C. elegans Genetic Stock Center. This work was supported by a grant from the National Institutes of Health to JK (GM54728). MD was supported by a Howard Hughes Medical Institute predoctoral fellowship. Abbreviations
Footnotes Competing interests. The authors have declared that no competing interests exist. Author contributions. MD, JN, and JMK conceived and designed the experiments. MD performed the experiments and analyzed the data. MD and JMK wrote the paper. Citation: Dybbs M, Ngai J, Kaplan JM (2005) Using microarrays to facilitate positional cloning: Identification of tomosyn as an inhibitor of neurosecretion. PLoS Genet 1(1): e2. References
|
PubMed related articles
Your browsing activity is empty. Activity recording is turned off. |
||||||||||||||||||||||||||||||||||
Development. 1996 Dec; 123():1-36.
[Development. 1996]Development. 1996 Dec; 123():37-46.
[Development. 1996]Science. 2001 Feb 16; 291(5507):1251-5.
[Science. 2001]Nat Genet. 1997 Sep; 17(1):119-21.
[Nat Genet. 1997]Science. 2000 Jun 16; 288(5473):2013-8.
[Science. 2000]Int J Dev Biol. 1998; 42(7):943-50.
[Int J Dev Biol. 1998]Nat Rev Genet. 2004 Feb; 5(2):145-50.
[Nat Rev Genet. 2004]Plant Physiol. 2000 Jun; 123(2):439-42.
[Plant Physiol. 2000]Nat Genet. 2004 Sep; 36(9):979-83.
[Nat Genet. 2004]Nat Biotechnol. 2001 May; 19(5):434-9.
[Nat Biotechnol. 2001]Hum Mol Genet. 1999; 8(10):1893-900.
[Hum Mol Genet. 1999]Cell. 2001 Nov 16; 107(4):411-4.
[Cell. 2001]Science. 2004 Jun 18; 304(5678):1811-4.
[Science. 2004]Science. 2002 Apr 26; 296(5568):752-5.
[Science. 2002]Nature. 2003 Mar 20; 422(6929):297-302.
[Nature. 2003]Science. 2001 Sep 14; 293(5537):2087-92.
[Science. 2001]Nature. 2002 Jul 18; 418(6895):331-5.
[Nature. 2002]Cell. 1988 Jul 1; 54(1):5-16.
[Cell. 1988]Science. 1993 Sep 3; 261(5126):1324-8.
[Science. 1993]Genome Biol. 2003; 4(4):210.
[Genome Biol. 2003]Genes Dev. 1989 Dec; 3(12A):1823-33.
[Genes Dev. 1989]Nature. 1999 Nov 11; 402(6758):199-203.
[Nature. 1999]Cell. 2001 Apr 20; 105(2):221-32.
[Cell. 2001]Nature. 1999 May 20; 399(6733):271-5.
[Nature. 1999]Cell. 2001 Oct 5; 107(1):43-54.
[Cell. 2001]Science. 2001 Apr 20; 292(5516):464-8.
[Science. 2001]Genetics. 1983 Aug; 104(4):619-47.
[Genetics. 1983]Proc Natl Acad Sci U S A. 1996 Oct 29; 93(22):12593-8.
[Proc Natl Acad Sci U S A. 1996]Genetics. 1995 Jun; 140(2):527-35.
[Genetics. 1995]Neuron. 1999 Sep; 24(1):231-42.
[Neuron. 1999]Neuron. 2001 Dec 6; 32(5):867-81.
[Neuron. 2001]Neuron. 1998 Sep; 21(3):479-80.
[Neuron. 1998]Neuron. 1998 May; 20(5):905-15.
[Neuron. 1998]J Biol Chem. 2003 Aug 15; 278(33):31159-66.
[J Biol Chem. 2003]Proc Natl Acad Sci U S A. 2004 Feb 24; 101(8):2578-83.
[Proc Natl Acad Sci U S A. 2004]Cell. 1999 Feb 5; 96(3):307-10.
[Cell. 1999]Genes Dev. 2001 Nov 1; 15(21):2781-5.
[Genes Dev. 2001]Science. 2003 Aug 15; 301(5635):969-72.
[Science. 2003]Science. 1989 Jun 30; 244(4912):1578-80.
[Science. 1989]J Appl Physiol. 2002 Aug; 93(2):537-45.
[J Appl Physiol. 2002]Nature. 2002 Jul 18; 418(6895):331-5.
[Nature. 2002]Nature. 2002 Aug 29; 418(6901):975-9.
[Nature. 2002]Science. 2002 Mar 22; 295(5563):2258-61.
[Science. 2002]Curr Opin Genet Dev. 1997 Apr; 7(2):220-32.
[Curr Opin Genet Dev. 1997]Cell. 1999 Feb 5; 96(3):307-10.
[Cell. 1999]Genes Dev. 2001 Nov 1; 15(21):2781-5.
[Genes Dev. 2001]Curr Opin Genet Dev. 1997 Apr; 7(2):220-32.
[Curr Opin Genet Dev. 1997]Trends Genet. 1999 Feb; 15(2):74-80.
[Trends Genet. 1999]Trends Biochem Sci. 2003 Sep; 28(9):464-6.
[Trends Biochem Sci. 2003]Science. 2002 Dec 20; 298(5602):2345-9.
[Science. 2002]Nat Genet. 2002 Jul; 31(3):235-6.
[Nat Genet. 2002]Hum Mol Genet. 1999; 8(10):1893-900.
[Hum Mol Genet. 1999]Cell. 2001 Nov 16; 107(4):411-4.
[Cell. 2001]Diabetes. 2004 Feb; 53(2):500-4.
[Diabetes. 2004]Hum Mol Genet. 2002 Nov 1; 11(23):2805-14.
[Hum Mol Genet. 2002]Science. 2001 Sep 14; 293(5537):2087-92.
[Science. 2001]Nature. 2003 Jul 17; 424(6946):277-83.
[Nature. 2003]Science. 2001 Sep 14; 293(5537):2087-92.
[Science. 2001]Genome Biol. 2003; 4(4):210.
[Genome Biol. 2003]Neuron. 1999 Oct; 24(2):335-46.
[Neuron. 1999]Neuron. 1999 Sep; 24(1):231-42.
[Neuron. 1999]