![]() | ![]() |
Formats:
|
||||||||||||||||||||||||||||||||||||||||||
Copyright : © 2005 Mathavan et al. This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited. Transcriptome Analysis of Zebrafish Embryogenesis Using Microarrays 1 Genome Institute of Singapore, Singapore 2 Department of Biological Sciences, National University of Singapore, Singapore 3 Institute of Molecular and Cell Biology, Singapore 4 Bioinformatics Institute, Singapore Mary Mullins, Editor University of Pennsylvania School of Medicine, United States of America * To whom correspondence should be addressed. E-mail: lufkin/at/gis.a-star.edu.sg Received January 10, 2005; Accepted July 14, 2005. This article has been cited by other articles in PMC.Abstract Zebrafish (Danio rerio) is a well-recognized model for the study of vertebrate developmental genetics, yet at the same time little is known about the transcriptional events that underlie zebrafish embryogenesis. Here we have employed microarray analysis to study the temporal activity of developmentally regulated genes during zebrafish embryogenesis. Transcriptome analysis at 12 different embryonic time points covering five different developmental stages (maternal, blastula, gastrula, segmentation, and pharyngula) revealed a highly dynamic transcriptional profile. Hierarchical clustering, stage-specific clustering, and algorithms to detect onset and peak of gene expression revealed clearly demarcated transcript clusters with maximum gene activity at distinct developmental stages as well as co-regulated expression of gene groups involved in dedicated functions such as organogenesis. Our study also revealed a previously unidentified cohort of genes that are transcribed prior to the mid-blastula transition, a time point earlier than when the zygotic genome was traditionally thought to become active. Here we provide, for the first time to our knowledge, a comprehensive list of developmentally regulated zebrafish genes and their expression profiles during embryogenesis, including novel information on the temporal expression of several thousand previously uncharacterized genes. The expression data generated from this study are accessible to all interested scientists from our institute resource database (http://giscompute.gis.a-star.edu.sg/~govind/zebrafish/data_download.html). Abstract Synopsis In an effort to shed light on the mysterious molecular process of embryogenesis, scientists have employed the powerful technological approach of high-density microarrays to analyze for the first time (to our knowledge) the expression patterns of over 16,000 zebrafish genes during the major stages of embryonic development. They determined that developmentally regulated genes in zebrafish could be grouped based upon their time of transcript accumulation onset (activation) and their peak of activity (peak accumulation). For example, genes encoding transcriptional regulators are activated en masse earlier than the target genes that they control. Thus, the developmental gene expression profile in zebrafish can be grouped into successive waves of gene activation followed by peak activity. One very novel group of genes that was discovered in this study is a set of about 100 genes that are activated even earlier than what was previously thought to be the earliest time of embryonic transcriptional activation. These genes are likely to be important for preparing the embryonic cells for subsequent high-level genome activation and normal development. Introduction Zebrafish is an important vertebrate model for the analysis of developmentally regulated genes. Its advantages are rapidly developing transparent embryos, a short generation time, and amenability to genetic manipulation. In recent years, large-scale mutagenesis has been undertaken by both chemical mutagens [1–3] and proviral insertions [4–6]. Genetic tools such as transgenesis and morpholino knock-down assays have also been established for zebrafish [7,8]. In effect, a unique combination of these features makes zebrafish a strong model to study vertebrate developmental disorders and human hereditary disease [1,9–11]. It is believed that the zebrafish genome may contain about 30% more genes than the human genome, owing to an additional round of genome duplication about 450 million years ago, followed by extensive gene loss [12]. Several expressed sequence tag (EST) projects were launched to characterize genes expressed during different developmental stages [13–15] and in the various tissues of zebrafish [16]. The zebrafish database contains more than 400,000 ESTs grouped into about 18,000 clusters, but only a few thousand ESTs have been analyzed by whole embryo in-situ hybridization (WISH) (http://zfin.org). Systematic analysis of the temporal expression of the ESTs by WISH may take several more years to complete. A high-throughput expression genomics approach would provide complementary information to the single-gene approaches currently under way. Microarrays are currently the strongest technology platform for large-scale analysis of gene expression profiles during embryogenesis. They provide an opportunity to simultaneously monitor the expression of thousands of genes at various developmental stages, thus providing an opportunity to analyze temporal and spatial patterns of gene expression [17–23]. Preliminary analysis of the expression of a subset of zebrafish genes has been undertaken using cDNA microarrays [14,24,25]. Global evaluation of gene expression during the life cycle and organogenesis of Drosophila and mouse has been reported [20,23,26–29]. In contrast, a high-density microarray for genome-wide gene expression (transcriptome) analysis has not previously been attempted for zebrafish. In the present study, we used high-density oligonucleotide microarrays containing 16,416 genes to map the transcriptome profiles at 12 different developmental time points (from unfertilized eggs to the hatching stage). RNA was collected at closely spaced time points during blastula and gastrula stages—the dynamic development stages—and thus provides a dense coverage and a comprehensive analysis of the expression patterns of developmentally regulated genes during embryogenesis. Traditionally, it is believed that the zygotic genome becomes active at the mid-blastula transition (MBT) [30]. However, recent evidence in Xenopus has shown the commencement of zygotic genome transcription to occur prior to MBT [31]. We speculated that there might be a subset of genes activated with regards to transcript accumulation prior to MBT that in turn could play a subsequent role in genome-wide transcriptional activation but had escaped detection owing to their low numbers relative to the entire genome. To test our speculation, we made a systematic analysis of gene expression profiles preceding MBT, which revealed for the first time to our knowledge a striking cohort of genes that actively increase transcript levels prior to MBT. These new data form the starting point for determining the role of these genes in the control of zebrafish genome activation and embryogenesis. To make the data we have generated here easily available to the public, we have created a GIS-sponsored Web site (http://giscompute.gis.a-star.edu.sg/~govind/zebrafish/index.html) that includes annotation data and all expression data for all the genes in the zebrafish microarray. Results/Discussion Validation of Zebrafish Microarray Data The microarray data obtained in this work were validated using the following criteria: Firstly, 172 copies of the beta-actin gene were printed on the oligonucleotide array as calibration spots. Biological and experimental replicates of array hybridizations showed that, at the selected time points, the signals of all the beta-actin calibration spots were identical (Figure S1), indicating the reproducibility and reliability of the array data. Secondly, candidate genes with known expression patterns were compared with the array-derived expression patterns, and the patterns were found to be similar to one another (Table S1). In addition, real-time PCR analysis was performed for two genes (beta-actin and myosin light chain, mlc2f) at all selected time points. The quantity of transcripts detected could not be compared because of different methods of estimations. However, the pattern of transcript abundance detected for these genes in the array and in real-time PCR showed nearly identical expression profiles (Figure S2). Further, expression data of ribosomal protein (RP) genes of zebrafish obtained from the arrays showed temporally coordinated expression (see results section of RP gene expression). In order to confirm the array data, a Northern blot analysis was performed for selected genes (rpl7a-, rps3a-, rps12-, and rps10-) using the RNA samples collected at 14 different time points during embryogenesis. The pattern of expression of RP genes obtained in the array and Northern blot were essentially identical, validating the array data (Figure S3A). Intensity of hybridization of RP genes in the Northern blot were digitized and measured. Using the digitized data, the pattern of transcript accumulation was compared with array data, and the pattern of expression was similar in both the experiments (Figure S3B). These analyses demonstrated the validity and reliability of the array data presented in this work. Annotation of the Zebrafish Oligoarray The annotation given by Compugen (oligonucleotide supplier) is based on Entrez Nucleotide, BLAST, and Gene Ontology (GO) (http://www.labonweb.com/oligo) and is outdated. For example, the company database gives a description based on GO terms for about 1,152 probes, and the remaining are indicated as “GO unknown.” From our recent annotation, we have found GO terms for more than twice those reported by the company. We made critical analysis of the clones and identified the duplicates to determine the actual number of non-redundant clones in the array. The array represented 16,416 GenBank entries. Of this, 16,177 probes had non-redundant GenBank entries and the rest of the 239 GenBank entries were duplicated (including 171 copies of beta-actin) in the array. We mapped the non-redundant GenBank entries to the zebrafish UniGene database (UniGene build 79) and found that 2,751 GenBank entries did not have UniGene IDs and the rest of the 13,426 entries matched to UniGenes. Of these clones with UniGene entries, 12,153 clones were non-redundant, while the rest (1,273 UniGenes) were duplicates. On the whole, the probes in the array represented 14,904 non-redundant genes (12,153 non-redundant UniGene IDs plus 2,751 non-redundant GenBank entries). We have included here the most recent putative annotations for the clones in the array by obtaining the annotations from various database sources. The annotations can be obtained from our institute resource Web site (http://giscompute.gis.a-star.edu.sg/~govind/zebrafish/index.html). We have also included examples of annotations in the Datasets. The resources used to build the zebrafish annotation database, strategies followed, and the query methods for annotation retrieval are described in the Materials and Methods section. Transcriptome Analysis Identified Developmentally Regulated Genes The gene expression pattern of the zebrafish genome during embryogenesis was investigated using high-density oligonucleotide arrays representing 16,416 zebrafish genes. High-resolution time-course expression data were generated during embryogenesis. Considering the sensitivity of the array data and the number of replicates of each sample, it is very likely that most of the polyadenylated transcripts were detected for the genes represented in the array for each time point. Total RNA was extracted at 12 different developmental time points (see Materials and Methods for details), and for each time point analysis two to three independent biological replicates were made. Reference RNA was prepared by pooling equal concentration of total RNA from various embryonic stages and also adult male and female fish. The intensity of value of each spot and each side was normalized using the intensity-based log ratio median method [32]. Differentially expressed genes were selected using modified t-statistic (SAM) [33] with 15% of the standard deviation percentile as the fudge constant and log2 ratios. Stage-specific genes were selected following paired t-test (see Materials and Methods for details). By this procedure, we have identified 3,657 genes that showed significant levels of differential expression with a single peak during the course of development. This subset of developmentally regulated genes was used for further analysis of gene activity during embryogenesis. For the benefit of other scientists, the entire dataset generated from this study has been placed in our institute resource database, which can be accessed at http://giscompute.gis.a-star.edu.sg/~govind/zebrafish/data_download.html. Hierarchical clustering of 3,657 genes demonstrated diverse temporal expression profiles during zebrafish embryogenesis (Figure 1
Differential Degradation of Maternally Loaded Gene Transcripts The dataset of 622 gene transcripts displaying maximum abundance at the unfertilized egg stage (maternally loaded transcripts; (Figure 2 The second group of transcripts (259 genes) persisted to a greater extent during maternal and blastula stages and subsequently degraded (Figure 2 These observations revealed that the maternally deposited gene transcripts are not degraded at the same rate and there is a definite gene-specific differential degradation. The differential degradation and the extended stability of specific maternal genes implicated a potential relevance of these genes in later zygotic stages. It is known that after fertilization the earliest developmental processes in the egg are programmed by maternally deposited gene transcripts [38]. Subsequently, embryos initiate zygotic genome activation for the continuation of embryonic development. The maternal control of zebrafish development before and after MBT has recently been reported following extensive analysis of maternal mutants [2,3]. These authors showed that mutations in some of the maternal genes have generated embryos with defects in zygotic gene functions and developmental alterations during pre-MBT, MBT, and also beyond these stages. In this study, we have identified maternal stage-specific expression for a number of previously uncharacterized ESTs, which may play a vital role in modulating normal development. It has been reported in Drosophila that maternal RNA is degraded during the course of early development either by maternally derived factors (to degrade transient transcripts) or zygotically derived factors (to degrade moderately stable transcripts), and that the combined action of maternal and zygotic factors is required to degrade the highly stable maternal transcripts [39–42]. Zygotic regulation of maternal cyclin A1 and cyclin B2 mRNAs has been demonstrated in Xenopus laevis [43,44]. We presume that similar mechanisms of maternal RNA degradation reported for Drosophila and Xenopus may be acting in zebrafish to differentially degrade the maternally loaded messages. Zygotic Transcriptome Analysis and an Overview of Stage-Specific Clusters The time point at which the gene transcripts commenced accumulation from their basal expression level is considered as the time of onset, and it is determined using an algorithm (see Materials and Methods and Protocol S1). In the present study, stage-wise developmental onset of gene transcription was determined for 3,052 zygotic genes expressed during MBT and beyond (Figure 2 Embryonic development in zebrafish (from fertilization to hatching) extends for 48 h. Within the first 5 h post-fertilization (hpf), cleavage and blastula are completed, and the process of gastrulation takes another 5 h. Thus, within 10 hpf (which is equal to about 21% of the total embryonic period), the early developmental processes are completed. Segmentation and pharyngula stages are completed in about 14 and 24 h, respectively (Figure 2 To obtain an overview of embryonic gene expression, we sought to specifically address the temporal expression patterns in the following developmental stages: (1) maternal (unfertilized eggs); (2) blastula; (3) gastrula; (4) segmentation; and (5) pharyngula/hatching stages (Figure 3
The following observations can be made from this analysis. Of the total number of 3,657 genes analyzed, 622 genes (17%; Dataset S13) showed maximum activity at the maternal stage. Of the remaining 3,035 (zygotic) genes, 609 (16.6%), 1,006 (27.5%), 688 (18.9%), and 732 (20.0%) genes showed maximum transcript levels at blastula (Dataset S14), gastrula (Dataset S15), segmentation (Dataset S16), and pharyngula (Dataset S17) stages, respectively (Figure 3 Genes displaying a peak of expression during the pharyngula stage continued their expression beyond the pharyngula stage, suggesting a functional requirement for these genes during embryonic and post-embryonic stages of development. For example, ceruloplasmin (cp), which has been implicated in liver development [50], displayed maximum expression during the pharyngula stage. Expression of this gene did not terminate at the end of the pharyngula stage, suggesting that its continued expression during post-embryonic stages could be playing a role in liver function. The genes that displayed similar expression are involved in parallel embryonic and post-embryonic developmental processes. For example, muscle-specific genes and collagens also show similar patterns of expression, as they are involved in the progressive formation of muscles and skeletal development in zebrafish. It is known that almost all the primordial processes of organogenesis commence at gastrulation and continue during segmentation and post-segmentation stages. Thus, an overview of genome-wide expression analysis demonstrated a temporally demarked expression pattern of developmentally regulated genes and showed that the gene activities were well correlated with the developmental events at the respective stages. Prior to our analysis, only 7% of the differentially expressed genes were fully annotated (with GO), and temporal expression patterns for the majority of the clones in the array were not established. Our analysis, for the first time to our knowledge, identified the temporal patterns of expression for about 80%–90% of the clones, and for most of them the patterns of expression are presented here for the first time, to our knowledge. Transcription Analysis of Selected Clusters of Functionally Related Genes From the array data, expression patterns of a number of functionally related genes could be identified based on their GO and/or putative functions. However, we selected three groups of genes involved in (1) cell cycle, (2) ubiquitin functions (ubiquitins and proteasomes), and (3) somitogenesis as examples and discuss here the expression patterns of these genes in relation to their biological significance. Cell cycle. Expression of cell cycle related genes encoding cyclins and cyclin-dependent kinases were analyzed (Figure 4
Ubiquitins and proteasomes. Genes involved in protein degradation by ubiquitin-mediated proteolytic function displayed low levels of transcripts during early development (up to the blastula stage). From the beginning of the early gastrula stage, these genes increased their transcript levels (Figure 4 Analysis of Gene Expression during Somitogenesis Initiation of organogenesis in the embryo is accomplished by a set of structural genes and the transcription factors that modulate their expression. We have taken somitogenesis as an example and analyzed a subset of genes involved in this process. Broadly, somitogenesis can be divided into an early phase where the paraxial mesoderm is subdivided in a rostrocaudal pattern into blocks of cells called somites and a subsequent phase of cell differentiation within the somites where somatic cells acquire a variety of distinct dorsoventral and mediolateral fates. In zebrafish, the dorsolateral somitic mesodermal cells contribute to muscle development. Consistent with this, most of the muscle-specific protein (MSP) genes commenced expression during early somitogenesis, and they steadily increased their transcript abundance in positive correlation with somite formation. From the array expression data, the transcripts for MSP genes were first detected around 11 hpf (Figure 4 Distinct classes of genes encoding transcriptional regulators are expressed dynamically during somitogenesis in all vertebrates and precede the onset of MSP gene expression. The activities of members of the bHLH transcription factors, namely hairy (h), Enhancer of split (E spl), and hairy-related, have been shown to play critical roles during somitogenesis in vertebrates. Analysis of the zebrafish bHLH family of h/E (spl)-related genes has shown the involvement of hairy related 9 (her9-) [57], her1-, her4-, her6-, and her7- [58,59] in somitogenesis. Our array data clearly revealed that most of these transcription factors (mespa-, mespb-, mef2a-, mef2c-, mef2d-, foxc1a, and a number of her genes) commenced their expression prior to the onset of muscle-specific gene expression (Figure 4 RP Genes Are Coordinately Expressed during Zebrafish Embryogenesis From the array expression data, 35 RP genes of zebrafish (similar to known RP genes) showed temporally coordinated expression (Figure 4 Accumulation of Gene Transcripts Do Occur prior to the MBT in Zebrafish Gene expression patterns during early development (pre-MBT and post-MBT stages) were studied by looking at the profiles of expression at selected time points during early cleavage and post-cleavage stages. The expression data obtained during these stages were selected following the statistical method described in the Materials and Methods section. Groups of genes showing distinct expression patterns were clustered based on the onset of transcript accumulation (Figure 5
Since these novel gene transcripts were the earliest activated in zebrafish development, they are likely to be involved in specific early embryonic functions and possibly in subsequent genome-wide activation. We analyzed several of the genes in this cluster based on the predicted putative functions, which are as follows. Several proteasome subunits in this cluster are known to be involved in protein degradation by the ubiquitin-mediated proteolytic pathway [54,55]. The proteasomes may be required at this stage for the proteolytic degradation of maternal gene products. A number of proteasome subunits have been shown to be functional in the proliferating ARPE19 retinal pigment epithelium cells. Since the cleavage stage is similar to the proliferating cells stage, it is reasonable to find the proteasome transcripts at this stage. This cluster also contains the putative gene for ubiquitin conjugating enzyme, and it is known that ubiquitin conjugating enzymes mediate ubiquitination and degradation of specific substrates by the ubiquitin-dependent degradation pathway [65]. This enzyme has also been shown to promote cyclin proteolysis during mitosis [66]. We have reported in the present study that transcript levels of cyclins involved in cell cycle regulation (cyclin A2, cyclin B, cyclin B1, cyclin E) are highly abundant in unfertilized eggs and during cleavage stages [37,53]. For the degradation of these gene products, proteasomes and ubiquitin conjugating enzyme may be required at the synchronous cell cycles stage (pre-MBT) of zebrafish development. Another gene in this cluster is a putative RNA helicase. RNA helicases are implicated in a number of cellular processes involving RNA secondary structures such as translation initiation and ribosome and spliceosome assembly. Some of the members of this family have been active in embryogenesis and in embryonic stem cells. RNA helicase has been shown to be essential for normal gastrulation in the mouse [67]. It has been shown that RNA helicase II is necessary for cell growth and cell cycle progression [68], and a role for RNA helicases in ribosomal RNA biogenesis in Xenopus oocytes has been observed [69]. In addition, it has been shown that down-regulation of RNA helicase II results in depletion of ribosomal RNA production in Xenopus oocytes and cell culture [69,70]. Thus, the expression of putative RNA helicase at the pre-MBT stage may be essential for cell growth and/or cell cycling during the cleavage stages of embryogenesis and also early commitment to normal gastrulation. Furthermore, it appears that the presence of RNA helicase II during early embryogenesis may facilitate ribosomal RNA production and the positioning of the translational machinery so as to meet the demand at MBT. Other interesting genes in this cluster are alanyl-tRNA synthetase and nascent-polypeptide associated complex alpha (DNA primase small subunit). Transcripts for these genes have also been found in ES cells. DNA primase is involved in nascent polypeptide production resulting in protein biosynthesis, and tRNA synthetase is involved in protein synthesis related processes such as tRNA binding and tRNA ligase activity. Thus, all the genes described above are related to cell growth, cell proliferation, cell adhesion, nascent RNA synthesis and stability, and other cellular functions. It may be that these genes need to commence their function during pre-MBT stages in order to facilitate normal progression through MBT and subsequent embryonic development. GO analysis of the genes in this group reveals that more than 50% of the genes show putative functions such as DNA/RNA binding, development, protein folding, and protein biosynthesis (Figure S4). Human and mouse orthologs of some of the known genes in this group are given in Table S2. Clusters 3 and 4 displayed the cohort of zygotic genes displaying maximum transcript accumulation at blastula (4 hpf) and gastrula (6 hpf) stages, respectively. The genes that have been shown to be expressed in blastula- (for example, cyclin D1) and gastrula- (for example, cathepsin L) stage embryos were identified with Clusters 3 and 4, respectively, as expected. Thus, the microarray detected the expression of known genes at the expected time points, once again confirming the reliability of the array detection. It has been shown that transcription can occur prior to MBT in Xenopus embryos [31]; namely, these authors showed that beta-catenin and tcf regulated pre-MBT transcription of nodal genes xnr5 and xnr6 does take place. However, in this study, they restricted the investigation to single gene expression analyses. Most of the earlier studies on the functional analysis of zebrafish genes were restricted to a handful of genes using WISH or Northern blot analysis with embryos at MBT or beyond. Parallel analysis of the expression of thousands of genes at selected closely spaced time points and early cleavage stages has not been previously attempted, to our knowledge. Thus, we are the first to our knowledge to undertake parallel analysis of expression of early genes using high-density zebrafish arrays. This approach revealed a cohort of genes that actively accumulate zygotic transcripts prior to the MBT stage, and the data presented here will serve as a starting point for the functional investigations of the roles of these genes during zebrafish embryogenesis. To validate the expression profiles, real-time PCR was done for selected genes from each cluster. Real-time PCR was done for 16 samples from Cluster 2 (Figures 5
In summary, we described here the first genome-wide microarray analysis of the embryonic zebrafish transcriptome. This study revealed a remarkable undulating developmental expression program and temporal clustering of gene groups never before characterized (to our knowledge) during embryogenesis. Furthermore, a previously unknown cohort of very early genes transcribed prior to MBT was discovered in this study, which now represents the earliest known (to our knowledge) transcribed and accumulating RNAs in zebrafish and revised our earlier thinking that MBT marked the onset of all zygotic gene transcription. We are the first to create a comprehensive integrated database by incorporating resources from different databases for the annotation of the clones in the zebrafish oligoarray (Compugen) that will serve as a significant resource for the zebrafish community. Materials and Methods Embryo collection. Wild-type zebrafish (Singapore local stock) embryos were obtained from the zebrafish facility of the Institute of Molecular and Cell Biology. Embryos were collected immediately after fertilization, maintained at 28.5 °C, and staged by hpf using standard morphological criteria [71]. Unfertilized eggs were collected by squeezing the abdomen of spawning females. Embryos were collected at 12 time points (unfertilized egg, 3, 4.5, 6, 7.7, 9, 10.7, 12, 15, 24, 30, and 48 hpf) (Table S3), snap-frozen in liquid nitrogen, and stored at −80 °C. The 12 time points were assigned to the following developmental stages: (1) maternal (unfertilized egg), (2) blastula (3.0 and 4.5 hpf), (3) gastrula (6.0, 7.7, and 9.0 hpf), (4) segmentation (10.7, 12.0, and 15.0 hpf), and (5) pharyngula (24.0, 30.0, and 48.0 hpf). In order to maintain a uniform genetic background, all embryos were collected from the same batches of fish stock. RNA isolation and reference RNA. Total RNA was extracted from all the frozen embryos using Trizol reagent (Gibco BRL, Gaithersburg, Maryland, United States). RNA quality was evaluated by gel electrophoresis, and the concentration was measured with a UV spectrophotometer. To prepare reference RNA, total RNA was collected from the following stages (embryos/adults) and mixed in equal concentration: 1 hpf, 4 hpf, 24 hpf, 48 hpf, 3-wk-old fry, adult of male and female. Sufficient amount of reference RNA required for the entire project was prepared at one time, and the aliquots were stored at −80 °C. Zebrafish oligonucleotide probe design and microarray construction. Zebrafish oligonucleotide probes for this array were designed by Compugen and synthesized by Sigma-Genosys (The Woodlands, Texas, United States). For each gene, one 65-mer oligonucleotide probe was designed from the 3′ region of the sequence. Each probe was selected from a sequence segment that is common to a maximum number of splice variants predicted for each gene. The arrays contained 16,416 probes representing oligonucleotides of selected genes. Oligonucleotide probes were re-suspended in 3XSSC at 20 μM concentration and spotted onto poly-L-lysine-coated microscope slides using a custom-built DNA microarrayer. Printed arrays were post-processed following the standard procedure described for cDNA arrays [72]. Target labeling and hybridization strategy. For fluorescence labeling of target cDNAs, 20 μg of total RNA from the reference and experimental samples was reverse transcribed in the presence of Cy3-dUTP and Cy5-dUTP (Amersham Biosciences, Little Chalfont, United Kingdom), respectively. Labeled target cDNAs were pooled, concentrated, and resuspended in DIG EasyHyb (Roche, Basel, Switzerland) buffer for hybridization. Hybridization strategies were as described by [72] with the following modifications: (1) usage of DIG EasyHyb buffer (Roche) and (2) hybridization at 42 °C. For each developmental stage, two to four independent replicate hybridizations were performed using a minimum of two biological replicates. For dye bias control, we employed a “dye-swapping” strategy for microarray hybridization whereby the Cy dye labeling scheme was reversed for alternating replicate hybridizations [73]. Data acquisition and statistical analysis. The arrays were scanned using the GenePix 4000B microarray scanner (Axon Instruments, USA) to generate 16-bit TIFF images of Cy3 and Cy5 signal intensities. GenePix Pro 4.0 image analysis software (Axon Instruments, Union City, California, United States) was employed to measure the fluorescence signal intensities of the array features and local background. Normalization of the two channels (sample and reference) was done for each slide using the intensity-based log ratio median method [32]. For selection of differentially expressed genes, which are supposed to be specific for a developmental stage, a two-step filtering process was used. First, only genes that differentially expressed at least at one time point (in all the arrays tested) were retained. Modified t-statistic (SAM) [33] with 15% of the standard deviation percentile as the fudge constant was used for identification of differentially expressed genes, and log2 ratios were used in the modified t-statistic. Predicted false discovery rate of 0.05 was used as the threshold for differential expression. At the next step, those differentially expressed genes were further filtered to determine whether they were specific for a particular stage. Paired t-test was used to check whether the means of log2 ratios of their expression levels (over the replicates) in a stage was significantly (at 95% confidence level) larger than the means at other stage(s). The above methods of analysis have been used for all the experimental data that are presented in this paper. The log ratios from replicate arrays were averaged, and the average expression values were used in subsequent analyses. The extracted datasets were hierarchically clustered and visualized (Cluster and Tree View; [74]). Onset and offset finding algorithm. The objective of the algorithm was to detect the onset (beginning) and peak time of transcript accumulation for a given gene using its time-course expression profile. Peak of the expression is defined as the time at which the mRNA abundance attains its highest level. The onset of expression is defined as the earliest time of the peak in the profile. This algorithm was used for the single peak genes selected as differentially expressed for this study. Prior to applying the algorithm, the dataset was selected based on statistical methods (SAM and FDR) described above, which removed the noisy profiles. Apart from that, the algorithm contains a Gaussian smoothing step that also removes the noisy profiles. Both onset and peak of expression are the onset and peak of relative abundance rather than absolute abundance. Manual verification of the profiles derived by using the algorithm showed that less than 0.1% of the data deviated from onset time by just one time point. The prediction of peak of expression did not vary from the observed data. Details of the algorithm are given in Protocol S1. Annotation of the zebrafish oligoarray. In order to obtain the most recent annotation, we tried to obtain the full-length sequences or assembled sequences for the clones in the array from different sources: Zebrafish gene collection (ZGC: http://zgc.nci.nih.gov/ZGC; ftp://ftp1.nci.nih.gov/pub/MGC/fasta); Zebrafish UniGene (UniGene build 79: ftp://ftp.ncbi.nih.gov/repository/UniGene/) and Zebrafish Gene Index ([ZGI] release 16: http://www.tigr.org/tigr-scripts/tgi/tc_ann.pl?db=zest ftp://ftp.tigr.org/pub/data/tgi/Danio_rerio/). If the full-length sequence was not available, we obtained the assembled sequences from the zebrafish UniGene assembly (unique, longest sequences in the UniGene assembly) and from the ZGI (release 16), which provides current tentative consensus sequences generating assemblies of ESTs of particular gene. However, some of the clones in the array are represented as ESTs. Thus, we downloaded the sequences for the full-length clones, assembled sequences, and ESTs for all the probes in the array. Putative annotations available for the clones in the array were downloaded from the following databases: (1) Compugen description (provided by oligo supplier: http://www.labonweb.com/oligo (2) Entrez Gene (for gene ID, gene description, and gene symbol: ftp://ftp.ncbi.nlm.nih.gov/gene/; http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=gene); (3) Zebrafish UniGene cluster ( for UniGene ID, UniGene description: ftp://ftp.ncbi.nlm.nih.gov/repository/UniGene/; http://www.ncbi.nlm.nih.gov/UniGene/clust); (4) GO http://www.geneontology.org release 21 November 2004; ftp://ftp.geneontology.org/pub/go/); (5) Locus link (ftp://ftp.ncbi.nih.gov/refseq/LocusLink/; release 21 November 2004); (6) UniGene protein similarity and description (for mouse and human protein similarity ID, percentage of similarity and description: ftp://ftp.ncbi.nlm.nih.gov/repository/UniGene/; http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?cmd=Retrieve&db=protein&dopt=GenPept&list); (7) ZGI (build 16) (for gene description and tentative consensus sequence number; ftp://ftp.tigr.org/pub/data/tgi/Danio_rerio/; http://www.tigr.org/tigr-scripts/tgi/tc_ann.pl?db=zest); (8) HomoloGene (mouse and human homolog gene ID and description; http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?DB=homologene; ftp://ftp.ncbi.nih.gov/pub/HomoloGene/); (9) Chromosomal location (ftp://ftp.ncbi.nih.gov/repository/UniGene/). All the sequence data and annotation information are stored in a MySqL database and can be accessed at http://giscompute.gis.a-star.edu.sg/~govind/zebrafish/index.html details of retrieval are given in the help file. The database can be queried using the GenBank accession number of the probes in Compugen Zebrafish array and the following information can be obtained: (1) Compugen description, (2) Zebrafish UniGene ID (build 79), (3) Zebrafish UniGene description (build 79), (4) Entrez Gene description, (5) Entrez Gene ID and Gene symbol, (6) GO term, (7) Locus link, (8) UniGene protein similarity and description (mouse and human), (9) full-length or assembled sequences, (10) HomoloGene (human and mouse, build 38.1), and (11) chromosomal location of the gene. The annotation for all the GenBank entries in the Compugen array is available from the GIS-sponsored Web site designed for zebrafish microarray annotation (http://giscompute.gis.a-star.edu.sg/~govind/zebrafish/index.html). To our knowledge, we are the first to create such an integrated database and search facility for the zebrafish genes present in the array. This GIS zebrafish database will be updated periodically by integrating the latest release from the resource database. From our expression analysis during embryogenesis, 3,657 differentially expressed genes were identified and annotated. There were no duplicates in the differentially expressed genes selected for the analysis. To obtain the annotation of these genes, the following procedures were adopted. Annotations/putative gene descriptions were obtained from the following databases by using GenBank accession number as query information: Entrez Gene, UniGene descriptions, protein similarity to human/mouse, and ZGI (build 16). From the above sources, we obtained annotation for about 75% of the clones. (Entrez Gene: 31%; UniGene, protein similarity, and ZGI: 44%). To annotate the rest of the clones, we BLASTed the assembled DNA sequence of the clones against the non-redundant database (ftp://ftp.ncbi.nlm.nih.gov/blast/db/ protein; ftp://www.expasy.ch/databases/sp_tr_nrdb/). Putative annotations were obtained for about 20% of clones from the non-redundant database. For the rest of the clones (5%), we got the description obtained from BLAST search (ftp://ftp.ncbi.nlm.nih.gov/blast/db/). By this process, we have obtained putative annotation for most of the differentially expressed genes. All of the resource data used in the annotations are available via our database (http://giscompute.gis.a-star.edu.sg/~govind/zebrafish/index.html http://giscompute.gis.a-star.edu.sg/~govind/zebrafish/data_download.html) (Datasets S26 and S27). Based on the gene description and GO, the genes involved in selected functions were identified using appropriate keywords, such as “cell cycle,” “proteasomes,” “RPs,” and so on. Northern blot hybridization and WISH analysis. Total RNA isolated from embryos at various stages of development were fractionated on 1.2% formaldehyde-agarose gels and transferred to GeneScreen membranes following standard procedures. The blotted membranes were prehybridized and hybridized following standard protocols. Selected RP cDNA clones (l7a, s3a, s12, and s10) were used as templates for probe preparation and labeled with 32P using Random Primer DNA labeling system (Gibco BRL). Whole mount in situ hybridization using RNA probes labeled with digoxigenin (Roche) was carried out as previously reported [75]. Quantification by real-time PCR. In order to verify the microarray results, two randomly chosen differentially expressed genes were tested in real-time RT-PCR analysis for all the time points. The SYBR Green I (Roche) RNA amplification kit was used on the LightCycler according to the manufacturer's instructions. The beta-actin and myosin light chain 2 (mlc2f) transcripts were amplified using the following primers (beta-actin: LC1–5′ CCGTGACATCAAGGAGAAGCT-3′; LC2 5′-TCGTGGATACCGCAAGATTCC-3′; mlc2f: LC1 5′-TCTCACTCATTACCCACAA-3′; LC2 5′-ACTCCATCGTGCTTCTTTC-3′). Prior to quantification, the optimal concentrations of template, primers, and magnesium were determined. Serially diluted plasmid DNA samples were used to construct a standard curve to quantify the test samples as well as the amplification efficiency. The products from real-time RT-PCR were also analyzed by agarose gel electrophoresis for single bands of the predicted size (unpublished data). The same protocol was followed for all real-time PCR analysis. Analysis of early gene expression. For the analysis of gene expression during early developmental time points, embryos were collected at the following cleavage and post-cleavage stages: one-cell, four-/eight-cell (mostly four-cell), 64-/128-cell (mostly 64-cell), 4 hpf, and 6 hpf. Total RNA extracted from unfertilized eggs was used as reference RNA in this analysis of gene expression. To perform the real-time PCR using the new set of RNA, total RNA was extracted from one-cell, two-/four-cell, 64-/128-cell, 256-/512-cell, and 4-hpf embryos and used for the experiment. To calculate ratio of expression for the real-time PCR data, real-time data of one-cell stage was used as reference. Normalization statistical analyses were done as described in the Materials and Methods. Dataset S1: Developmentally Regulated Genes with Expression Data This dataset contains all the genes selected for subsequent analysis. (688 KB DOC) Click here for additional data file.(688K, doc) Dataset S2: Maternal Genes Genes showing maximum transcript levels in unfertilized eggs and different patterns of degradation. (137 KB DOC) Click here for additional data file.(138K, doc) Dataset S3: List of Genes with Onset of Transcript Accumulation at the Blastula Stage and Peak of Expression at the Blastula Stage (130 KB DOC) Click here for additional data file.(131K, doc) Dataset S4: List of Genes with Onset of Transcript Accumulation at the Blastula Stage and Peak of Expression at the Gastrula Stage (191 KB DOC) Click here for additional data file.(191K, doc) Dataset S5: List of Genes with Onset of Transcript Accumulation at the Blastula Stage and Peak of Expression at the Segmentation Stage (66 KB DOC) Click here for additional data file.(66K, doc) Dataset S6: List of Genes with Onset of Transcript Accumulation at the Blastula Stage and Peak of Expression at the Pharyngula Stage (44 KB DOC) Click here for additional data file.(45K, doc) Dataset S7: List of Genes with Onset of Transcript Accumulation at Gastrula and Peak of Expression at Gastrula Stages (27 KB DOC) Click here for additional data file.(27K, doc) Dataset S8: List of Genes with Onset of Transcript Accumulation at Gastrula and Peak of Expression at Segmentation Stages (51 KB DOC) Click here for additional data file.(52K, doc) Dataset S9: List of Genes with Onset of Transcript Accumulation at Gastrula and Peak of Expression at Pharyngula Stages (29 KB DOC) Click here for additional data file.(29K, doc) Dataset S10: List of Genes with Onset of Transcript and Peak of Expression at the Segmentation Stage (36 KB DOC) Click here for additional data file.(36K, doc) Dataset S11: List of Genes with Onset of Transcript Accumulation at the Segmentation Stage and Peak of Expression at the Pharyngula Stage (88 KB DOC) Click here for additional data file.(88K, doc) Dataset S12: List of Genes with Onset of Transcript Accumulation and Peak of Expression at the Pharyngula Stage (43 KB DOC) Click here for additional data file.(44K, doc) Dataset S13: Genes Exhibiting Peak of Expression in the Unfertilized Egg (Maternally Loaded RNA) (137 KB DOC) Click here for additional data file.(137K, doc) Dataset S14: Genes Exhibiting Peak of Expression at the Blastula Stage (130 KB DOC) Click here for additional data file.(130K, doc) Dataset S15: Genes Exhibiting Peak of Expression at the Gastrula Stage (199 KB DOC) Click here for additional data file.(199K, doc) Dataset S16: Genes Exhibiting Peak of Expression at the Segmentation Stage (142 KB DOC) Click here for additional data file.(143K, doc) Dataset S17: Genes Exhibiting Peak of Expression at the Pharyngula Stage (156 KB DOC) Click here for additional data file.(156K, doc) Dataset S22: Gene Expression Dataset of Pre-MBT and Post-MBT Stages—Maternal Dominant Genes (22 KB DOC) Click here for additional data file.(22K, doc) Dataset S23: Gene Expression Dataset of Pre-MBT and Post-MBT Stages with Onset of Expression from 64 Cells Onwards (39 KB DOC) Click here for additional data file.(40K, doc) Dataset S24: Gene Expression Dataset of Pre-MBT and Post-MBT Stages with Onset of Expression from 4 hpf Onwards (40 KB DOC) Click here for additional data file.(40K, doc) Dataset S25: Gene Expression Dataset of Pre-MBT and Post-MBT Stages with Onset of Expression from 6 hpf Onwards (100 KB DOC) Click here for additional data file.(100K, doc) Dataset S26: Putative Annotation Obtained for the Selected 3,675 Genes Using the Zebrafish Annotation Database Generated in the Work also from the NR DB Blast Search (3.5 MB XLS) Click here for additional data file.(3.3M, xls) Dataset S27: Putative Annotation Obtained for the Selected Pre-MBT and Post-MBT Genes using the Zebrafish Annotation Database Generated in this Work (525 MB XLS) Click here for additional data file.(525K, xls) Figure S1: Expression Profile of the Beta-Actin Gene in the Array All the copies of the oligos in the array showed an almost identical expression pattern, indicating the reproducibility and homogeneity of the array data. (410 KB DOC) Click here for additional data file.(410K, doc) Figure S2: Comparison of Expression Profiles Obtained for Myosin Light Chain (mlc2f) and Beta-Actin Genes Using Microarray Analysis and Real-Time PCR The patterns of expression detected in both methods are almost identical. (204 KB DOC) Click here for additional data file.(205K, doc) Figure S3: Northern Blot Analysis of RP Genes (A) Northern blot analysis to detect the transcript abundance for selected RP genes (l7a, s3a, s12, and s10) during embryogenesis. The expression pattern obtained by Northern blot analysis is similar to the data obtained from the array analysis. (B) (I). Intensity of hybridization in the Northern blots for the selected RP gene was digitized and plotted. (II). Expression data obtained from the arrays for the selected RP genes used in Northern blotting are presented. Both analyses show similarity in the pattern of transcript accumulation. (463 KB DOC) Click here for additional data file.(464K, doc) Figure S4: Distribution of Early Gene Expression at Later Stages of Embryogenesis Based upon Peak Expression and Classification of Early Genes into Functional Groups Based on GO Terminology (227 KB PDF) Click here for additional data file.(226K, pdf) Figure S5: Real-Time PCR Analysis of Early Genes Real-time PCR was done using total RNA extracted from early stages of embryonic development (one-cell, two-/four-cell, 64-/128-cell, 256-/512-cell, and 4-hpf) for eight genes from Cluster 2 (pre-MBT transcribed genes). The expression pattern of these genes by real-time PCR and microarray are compared. The expression pattern is similar in both types of analysis. (182 KB PDF) Click here for additional data file.(181K, pdf) Table S1: Comparison of the Expression Profile of Selected Genes in the Microarray Analysis and Other Methods of Transcript Detection (11 KB PDF) Click here for additional data file.(10K, pdf) Table S2: Zebrafish Pre-MBT Transcribed (Early Gene) Orthologs in Mouse and Human (9 KB PDF) Click here for additional data file.(8.6K, pdf) Table S3: Characteristics of Developmental Stages of Embryos That Were Collected for the Expression Profile Analysis in this Work (24 KB PDF) Click here for additional data file.(24K, pdf) Protocol S1: Onset and Offset Finding Algorithm (71 KB PDF) Click here for additional data file.(70K, pdf) Accession Numbers The GenBank (http://www.ncbi.nlm.nih.gov/Genbank) accession numbers for the genes and gene products discussed in this paper are alanyl-tRNA synthetase (AI793498), beta-actin (AF025305), birc5b survivin2 (AY057058), calreticulin (BM101539), catalase (AF170069), cathepsin L (BI865754), ceruloplasmin (BC048037), cyclin B1 (BC055553), cyclin D1 (BI888360 and BC075743), cyclinE (BC045842), foxc1a (AF219949), her1- (X97329), her4- (AI794276), her6- (X97333), her7- (AF240772), her9- (AF301264), lmnl3 (AF397015), mcl1a (AF302805), mef2a- (BI880399, mef2c- (U66569), mef2d- (U66570), mespa- (AB037939), mespb- (AB037940), mlc2f (AF081462), nascent-polypeptide associated complex alpha (AI626587), RNA helicase (AW154620), rpl7a- (AI964218), rps10- (BI842921), rps12- (BM070699), rps3a- (AI558833), Vsx1 (AF025348), zp2 (AW170860), zp2.4 (AF331967), zp3 (AF095457), and BM184007. Acknowledgments We thank Aradhana Rani for help in data analysis, and Ben Zion Caveri for help with the Northern analysis. The financial support received from the Biomedical Research Council of Singapore is gratefully acknowledged. Abbreviations
Footnotes Competing interests. The authors have declared that no competing interests exist. Author contributions. SM, LDM, VK, ZG, and ETL conceived and designed the experiments. SM, SGPL, AM, YT, YLW, and SHL performed the experiments. KRG performed the annotations. SM, LDM, KRKM, VK, ZG, ETL, and TL analyzed the data. KRKM, YR, VK, ZG, ETL, and TL contributed reagents/materials/analysis tools. SM, LDM, VK, ZG, ETL, and TL wrote the paper. Citation: Mathavan S, Lee SGP, Mak A, Miller LD, Murthy KRK, et al. (2005) Transcriptome analysis of zebrafish embryogenesis using microarrays. PLoS Genet 1(2): e29. References
|
PubMed related articles
Your browsing activity is empty. Activity recording is turned off. |
|||||||||||||||||||||||||||||||||||||||||
Curr Opin Genet Dev. 2000 Jun; 10(3):252-6.
[Curr Opin Genet Dev. 2000]Dev Cell. 2004 Jun; 6(6):781-90.
[Dev Cell. 2004]Proc Natl Acad Sci U S A. 1996 Jul 23; 93(15):7777-82.
[Proc Natl Acad Sci U S A. 1996]Nat Genet. 2002 Jun; 31(2):135-40.
[Nat Genet. 2002]Development. 1997 Oct; 124(20):4105-11.
[Development. 1997]Development. 1993 Oct; 119(2):447-56.
[Development. 1993]Development. 2002 Dec; 129(24):5743-52.
[Development. 2002]Proc Natl Acad Sci U S A. 2003 Feb 4; 100(3):1122-7.
[Proc Natl Acad Sci U S A. 2003]Proc Natl Acad Sci U S A. 2001 Apr 24; 98(9):5116-21.
[Proc Natl Acad Sci U S A. 2001]Biochim Biophys Acta. 1999 Jul 7; 1446(1-2):156-60.
[Biochim Biophys Acta. 1999]Biosci Biotechnol Biochem. 2001 Jun; 65(6):1328-38.
[Biosci Biotechnol Biochem. 2001]Biosci Biotechnol Biochem. 2002 Jun; 66(6):1427-30.
[Biosci Biotechnol Biochem. 2002]Development. 1993 Oct; 119(2):447-56.
[Development. 1993]Dev Biol. 2001 Jan 15; 229(2):421-31.
[Dev Biol. 2001]Curr Opin Genet Dev. 1999 Oct; 9(5):530-41.
[Curr Opin Genet Dev. 1999]Dev Cell. 2004 Jun; 6(6):771-80.
[Dev Cell. 2004]Dev Cell. 2004 Jun; 6(6):781-90.
[Dev Cell. 2004]Genes Dev. 1996 Aug 1; 10(15):1966-77.
[Genes Dev. 1996]Genetics. 2003 Jul; 164(3):989-1001.
[Genetics. 2003]Biochem Biophys Res Commun. 2000 Dec 20; 279(2):725-31.
[Biochem Biophys Res Commun. 2000]Comp Biochem Physiol B Biochem Mol Biol. 2003 Dec; 136(4):731-42.
[Comp Biochem Physiol B Biochem Mol Biol. 2003]Dev Dyn. 2002 Dec; 225(4):597-601.
[Dev Dyn. 2002]Dev Biol. 1999 May 15; 209(2):409-33.
[Dev Biol. 1999]Mech Dev. 2001 May; 103(1-2):137-9.
[Mech Dev. 2001]Mol Cell Biol. 2001 Mar; 21(5):1662-71.
[Mol Cell Biol. 2001]Dev Dyn. 2002 Dec; 225(4):511-21.
[Dev Dyn. 2002]J Embryol Exp Morphol. 1968 Sep; 20(2):141-50.
[J Embryol Exp Morphol. 1968]Biochim Biophys Acta. 1995 Dec 27; 1264(3):257-60.
[Biochim Biophys Acta. 1995]Dev Biol. 2001 Jan 15; 229(2):421-31.
[Dev Biol. 2001]Dev Cell. 2003 Jan; 4(1):67-82.
[Dev Cell. 2003]J Neurochem. 2000 Jul; 75(1):48-55.
[J Neurochem. 2000]Dev Dyn. 2000 Oct; 219(2):201-15.
[Dev Dyn. 2000]BMC Genomics. 2002 Aug 5; 3(1):21.
[BMC Genomics. 2002]Development. 1996 Jul; 122(7):2071-8.
[Development. 1996]Dev Genes Evol. 2001 Oct; 211(10):493-500.
[Dev Genes Evol. 2001]Trends Biochem Sci. 1989 May; 14(5):175-8.
[Trends Biochem Sci. 1989]Cell. 1982 Aug; 30(1):163-71.
[Cell. 1982]Dev Biol. 1992 Jan; 149(1):27-40.
[Dev Biol. 1992]Gene. 2000 May 30; 250(1-2):209-17.
[Gene. 2000]Gene. 2002 May 29; 291(1-2):177-86.
[Gene. 2002]Development. 1993 Oct; 119(2):447-56.
[Development. 1993]Dev Cell. 2003 Jan; 4(1):67-82.
[Dev Cell. 2003]J Neurochem. 2000 Jul; 75(1):48-55.
[J Neurochem. 2000]Biochemistry. 1997 Aug 26; 36(34):10526-37.
[Biochemistry. 1997]Curr Biol. 1996 Apr 1; 6(4):455-66.
[Curr Biol. 1996]Dev Biol. 2001 Jan 15; 229(2):421-31.
[Dev Biol. 2001]Proc Natl Acad Sci U S A. 1998 Nov 10; 95(23):13709-13.
[Proc Natl Acad Sci U S A. 1998]Exp Cell Res. 1999 Mar 15; 247(2):389-98.
[Exp Cell Res. 1999]J Biol Chem. 2003 Oct 3; 278(40):38847-59.
[J Biol Chem. 2003]J Biol Chem. 2003 Dec 26; 278(52):52307-14.
[J Biol Chem. 2003]Development. 2002 Dec; 129(24):5743-52.
[Development. 2002]Dev Dyn. 1995 Jul; 203(3):253-310.
[Dev Dyn. 1995]Methods Enzymol. 1999; 303():179-205.
[Methods Enzymol. 1999]Methods Enzymol. 1999; 303():179-205.
[Methods Enzymol. 1999]Cancer Cell. 2002 Nov; 2(5):353-61.
[Cancer Cell. 2002]Proc Natl Acad Sci U S A. 2003 Feb 4; 100(3):1122-7.
[Proc Natl Acad Sci U S A. 2003]Proc Natl Acad Sci U S A. 2001 Apr 24; 98(9):5116-21.
[Proc Natl Acad Sci U S A. 2001]Proc Natl Acad Sci U S A. 1998 Dec 8; 95(25):14863-8.
[Proc Natl Acad Sci U S A. 1998]Nucleic Acids Res. 1993 Mar 11; 21(5):1087-95.
[Nucleic Acids Res. 1993]