![]() | ![]() |
Formats:
|
||||||||||||||||||||
Copyright © 2005 Ruissen et al; licensee BioMed Central Ltd. Evaluation of the similarity of gene expression data estimated with SAGE and Affymetrix GeneChips 1Department of Neurogenetics, Academic Medical Center, University of Amsterdam, Meibergdreef 9, 1105 AZ Amsterdam, The Netherlands 2Department of Anatomy and Embryology, Academic Medical Center, University of Amsterdam, Meibergdreef 9, 1105 AZ Amsterdam, The Netherlands 3Department of Human Genetics, Academic Medical Center, University of Amsterdam, Meibergdreef 9, 1105 AZ Amsterdam, The Netherlands Corresponding author.#Contributed equally. Fred van Ruissen: f.vanruissen/at/amc.uva.nl; Jan M Ruijter: j.m.ruijter/at/amc.uva.nl; Gerben J Schaaf: g.j.schaaf/at/amc.uva.nl; Lida Asgharnegad: f.vanruissen/at/amc.uva.nl; Danny A Zwijnenburg: d.a.zwijnenburg/at/amc.uva.nl; Marcel Kool: m.kool/at/amc.uva.nl; Frank Baas: f.baas/at/amc.uva.nl Received October 28, 2004; Accepted June 14, 2005. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. This article has been cited by other articles in PMC.Abstract Background Serial Analysis of Gene Expression (SAGE) and microarrays have found awidespread application, but much ambiguity exists regarding the evaluation of these technologies. Cross-platform utilization of gene expression data from the SAGE and microarray technology could reduce the need for duplicate experiments and facilitate a more extensive exchange of data within the research community. This requires a measure for the correspondence of the different gene expression platforms. To date, a number of cross-platform evaluations (including a few studies using SAGE and Affymetrix GeneChips) have been conducted showing a variable, but overall low, concordance. This study evaluates these overall measures and introduces the between-ratio difference as a concordance measure pergene. Results In this study, gene expression measurements of Unigene clusters represented by both Affymetrix GeneChips HG-U133A and SAGE were compared using two independent RNA samples. After matching of the data sets the final comparison contains a small data set of 1094 unique Unigene clusters, which is unbiased with respect to expression level. Different overall correlation approaches, like Up/Down classification, contingency tables and correlation coefficients were used to compare both platforms. In addition, we introduce a novel approach to compare two platforms based on the calculation of differences between expression ratios observed in each platform for each individual transcript. This approach results in a concordance measure per gene (with statistical probability value), as opposed to the commonly used overall concordance measures between platforms. Conclusion We can conclude that intra-platform correlations are generally good, but that overall agreement between the two platforms is modest. This might be due to the binomially distributed sampling variation in SAGE tag counts, SAGE annotation errors and the intensity variation between probe sets of a single gene in Affymetrix GeneChips. We cannot identify or advice which platform performs better since both have their (dis)-advantages. Therefore it is strongly recommended to perform follow-up studies of interesting genes using additional techniques. The newly introduced between-ratio difference is a filtering-independent measure for between-platform concordance. Moreover, the between-ratio difference per gene can be used to detect transcripts with similar regulation on both platforms. Background Methods for the analysis of gene expression profiles have gone through progressive development over recent years. Traditionally, the level of transcribed mRNA has been analyzed using methods such as Northern blots, quantitative RT-PCR, differential display [1,2], representational difference analysis [3], total gene expression analysis [4] and suppressive subtractive hybridization [5,6]. All these methods, although fruitful and still in use, have a limited scope with regard to the number of genes that can be analyzed simultaneously. Because of this limitation, new methods have been developed, including serial analysis of gene expression (SAGE) [7], massive parallel signature sequencing (MPSS) [8], cDNA and oligo microarray chip technologies [9-13] and Affymetrix GeneChips [11]. SAGE is based on the high-throughput sequencing of concatemers of short (13–14 bp; recently 21–25 bp) sequence tags that originate from a known position within a transcript and therefore theoretically contain sufficient information to identify a transcript [7]. In contrast to microarrays, SAGE estimates the abundances (expression levels) of thousands of transcripts without prior knowledge of the transcripts being expressed. The proportion of the tag within the total number of tags in the library gives a direct estimate of the abundance of the transcript within a biological sample. The advantage of the SAGE technique is that it performs a random sampling from the pool of all expressed transcripts (also called a transcriptome) allowing the discovery of new transcripts. The proportional nature of the data enables easy exchange among researchers thus allowing the creation of large public SAGE data sets for numerous human tissues, both normal and diseased [14,15]. Disadvantages of SAGE are that the technique is expensive, labor-intensive and prone to sequencing errors. Moreover, the annotation of the short 10 bp sequence tags may identify more than one transcript. This can be overcome by using LongSAGE libraries that contain 17 bp tags which can be more reliably mapped to Unigene clusters or the complete genome sequence [16]. However, SAGE is not suitable for high-throughput analyses of multiple samples. In contrast to SAGE, DNA microarrays are used to measure relative expression levels between samples of thousands of known transcripts. Currently, three array variants are being used (for reviews see [17,18]) i.e. spotted cDNA microarrays, spotted oligonucleotide microarrays and synthesized oligonucleotide microarrays (Affymetrix GeneChips). The advantages of Affymetrix GeneChips are that they are highly sensitive enabling the detection of mRNAs present at levels as low as 1 transcript in 100000 [11] when the probe labeling step is not considered [19]. They are suitable for high-throughput analyses of multiple samples, and data can easily be shared and used for comparisons with other researchers using the same chips. Disadvantages of Affymetrix GeneChips are that they are only commercially available, are costly and require expensive specialized equipment and are inflexible in design (although custom design is possible at high cost). Furthermore, GeneChips only measure the expression of genes represented on the chip in contrast to SAGE, in which the expression profile of the complete transcriptome can be mapped. At present, SAGE, oligo microarrays, cDNA microarrays and Affymetrix GeneChips are the most widely used techniques for determining gene expression levels and gene expression ratios in different disease states and in cells under different physiological conditions or environmental stimuli. Often these different gene-expression profiling platforms are being used in parallel and data generated with the different techniques need to be compared, and possibly interchanged, within and between laboratories. Due to the overall difference in platform design, transcript level estimation, and gene annotation, direct comparisons are difficult and only a few attempts have been made to compare these different platforms (Figure (Figure6).6
In the current study we have determined the similarity between SAGE- and Affymetrix GeneChips-generated gene expression profiles of two independent RNA samples. One RNA sample is isolated from a Wilms' tumor; the other is the Stratagene Universal reference RNA. These expression data were then used to evaluate the annotation problems when comparing different gene profiling platforms and the methods that can be used to compare two different platforms with respect to individual gene expression measurements and with respect to between-sample gene expression ratios. Finally, it is demonstrated that the between-ratio difference can be applied to select those transcripts that display similar expression changes in both platforms. Results SAGE data analysis In order to compare SAGE with other gene expression profiling techniques we created a SAGE library with 69792 tags from a Wilms' tumor sample. SAGE data (51954 tags) for the Stratagene Universal reference RNA (GSM1734;[20]) were obtained from the NCBI website. All tag counts are after removal of duplicate dimers and linker sequences. Within the SAGE libraries we could identify 25052 and 17497 unique SAGE 10 bp tags, for the Wilms tumor sample and the Stratagene sample, respectively. Tags can be divided into tags with low abundance (1–5 tags per 100000), intermediate abundance (6–50 tags per 100000), and high abundance (more than 50 tags per 100000). In each of the libraries, these categories contained on average 84%, 15% and 1% of the total number of unique tags (Data not shown). In addition, we created a LongSAGE library of the Wilms tumor sample for annotation purposes (as described below) and not for the comparison with Affymetrix GeneChips. This library could be used as a technical replicate of the 'short' SAGE library. Comparison of the SAGE and LongSAGE libraries showed a Pearson Correlation coefficient of 0.651 (P < 0.01) and using Z-test statistics [21] the two libraries only differed significantly from each other in 3% (α = 0.05) or 0.6% (α = 0.001) of the tags (Figure (Figure2A).2A
Microarray analyses Microarray experiments were performed using Wilms' tumor RNA and the Stratagene Reference RNA. Results of biological replicas of each sample, with independent cRNA synthesis and hybridizations, showed a good reproducibility (Pearson correlation coefficients of 0.982 (n = 11938) and 0.979 (n = 10489);both P < 0.01) using intensity values for all probe sets with a "present" signal (on average 54%; absent = 44% and marginal = 2%) (Figure (Figure2B;2B Annotation problems In the comparison of data obtained by SAGE and Affymetrix GeneChips only reliably annotated tags can be included (as described in the 'Matching of platforms' paragraph of the Material and Methods section; see also Shippy et al.[23]). Annotation of SAGE tags to genes and their corresponding Unigene cluster numbers revealed that on average 30% of all tags (including low abundant tags) could be reliably annotated based on the SAGE Genie principles [24]. Annotation improves to an average of 70% for tags that have an intermediate to abundant expression level. The remainder of the tags could not reliably be associated with a gene or Unigene cluster because they were not available through the SAGE Genie site, annotated to unclustered ESTs, or their reliability was below 67% (according to the SAGE Genie principles). Additionally, we performed LongSAGE for the Wilms' tumor sample, which allows the identification of 17 bp tags instead of 10 bp tags. Theoretically, over 99.8% of the 17 bp tags are expected to occur only once in the human genome. However, analyses based on actual sequences have demonstrated that only 75% of the 17 bp tags occur only once in the human genome, with the remaining tags matching duplicated genes or repeated sequences [16]. Complete annotation of LongSAGE tags using SAGE Genie data and principles revealed that 28% of all tags could be assigned a reliable Unigene cluster. Similar to SAGE, the annotation improves to approximately 70% for tags that have an intermediate to abundant expression level. The Affymetrix HG-U133A GeneChips contained probe sets for 13727 Unigene clusters that could be identified, whereas eight percent of the probe sets (i.e. 1795 probe sets) could not be linked to a Unigene cluster because these sequences are withdrawn or because these sequences are currently under revision. Figure Figure33
Comparison of gene expression levels In the comparison of platforms, we first analyzed the similarity of gene expression levels between SAGE and Affymetrix data in one tissue sample. Both datasets were matched according to their Unigene cluster numbers. Figure Figure2C2C Comparison of between-sample expression ratios In most gene expression studies, alterations of expression levels are expressed in relation to the simultaneously determined expression level of a reference sample and conclusions are drawn based on these ratios. To this end, expression ratios were calculated between the reference RNA and the Wilms' tumor data for the SAGE tag counts as well as for Affymetrix HG-U133A GeneChips spot intensities. In this comparison the final data set containing only the between-sample ratios for unambiguous transcripts was used (Figure (Figure3),3 To enable direct comparisons of ratio measurements using different gene expression platforms, the ratios of the Affymetrix platform were scaled to those of the SAGE platform as described in Figure Figure44
Like others have demonstrated (Figure (Figure6)6
Sources of differences in gene expression ratios In an attempt to explain the difference in gene expression between SAGE and Affymetrix GeneChips we summarize different sources. Variation due to "noisy fold ratios" generated from low-intensity transcripts is a widespread cause of error when computing statistics on ratios without accounting for the intensities from which the ratios were derived [25]. Within our data set we have shown that the final data set is an unbiased selection of the total data set (Figure (Figure2D).2D In addition, it has been suggested that the GC-content of the transcripts could influence the correspondence between platforms [26]. To test this hypothesis for the final data set (n = 1094) we retrieved all transcript sequences (mostly Refseq sequences [27]) and probe set sequences and calculated the GC-content for each transcript and the average GC-content of the corresponding probe sets. The GC-contents were divided into classes (30–35%; 35–40%; 45–50% etc.) and the correlation between GC-content and the differences in expression ratios between platforms was tested. Statistical analysis showed that ratio differences did not depend on the GC-content of the transcript (Chi-square value of 25.69; df = 35; P = 0.875). However, Unigene clusters showing good agreement between platforms tend to depend on the high GC-content of the corresponding probe sets (Chi-square value of 61.114; df = 30; P = 0.001). This GC-analysis indicates that expression data from probe sets with a higher GCcontent show a better agreement with their corresponding SAGE data and are more reliable. Note in this respect that for a Unigene cluster the GC content of a probe set is not necessarily the same as that of a transcript. Discussion To answer the question whether gene expression data generated by SAGE and by Affymetrix HG-U133A GeneChips can be used interchangeably, data from these two techniques were compared using two independent RNA samples. Analysis of intra-platform variation shows good correlation for both SAGE and Affymetrix; this is also observed by others (see Figure Figure6).6 A first impression about the agreement between SAGE and Affymetrix HG-U133A GeneChips was obtained from the evaluation of the top100 of highly abundant transcripts in one RNA sample in each platform. This comparison showed that approximately 50% of the top100 of highly expressed transcripts showed a corresponding expression within the top100 of highly expressed transcripts of the other platform. This is in line with the findings of Ishii et al. [29] who compared SAGE with Affymetrix GeneChips containing approximately 6000 transcripts, and Iacobuzio-Donahue et al. [30] who showed that only genes that display robust changes in gene expression were identified by both platforms. In our current study, approximately 80% of transcripts detected in the top100 of one platform were mapped within the top1000 of the competing platform. A similar figure was presented by Evans et al. [31] who used the RG-U34A Affymetrix GeneChips. Recently, Kim [32] suggested that absolute expression analyses of SAGE and oligonucleotide microarray technology reliably detected medium-to-high abundant transcripts. For a more extensive comparison between the individual gene expression profiling platforms we used gene expression ratios between Wilms' tumor and Stratagene Universal reference RNA as determined by SAGE and Affymetrix GeneChips. The use of ratios might have the disadvantage of losing information about individual expression values. However, it corrects for platform specific variations (i.e. probe design, hybridization efficiencies etc.). By matching SAGE and Affymetrix data, an unambiguous data set was generated. On average about 30% of the unambiguous genes were observed to be expressed by both SAGE and Affymetrix GeneChips and could be included in the final comparison. Although this comparison comprised only 13% of all SAGE Unigene clusters and only 8% of the Affymetrix Unigene clusters, it was demonstrated that this selection was unbiased with respect to gene expression levels in each of the platforms. This allows the extrapolation of the conclusions to the whole platform. We looked for the correspondence in gene expression results between the two techniques using Up/Down classification (Figure (Figure1A),1A The overall similarity between SAGE and Affymetrix GeneChips is modest when expression ratios are compared. The correspondence improves to 90% when only highly expressed transcripts are included which means that noise is filtered out for both platforms. The differences between SAGE and Affymetrix GeneChips were not caused by a biased selection of the final data set, differences in GC-content of the included transcripts or extreme ratios resulting from low gene expression values. The observed cross-platform differences, arise from intrinsic properties of the platforms themselves, differences in the principle of determining the expression levels, such as absolute (SAGE) versus quantitative (microarray) mRNA levels, and/or processing and analytical evaluation [33]. These disparities of the two technical approaches are summarized in table 2 and may all contribute to the modest overall correlation of SAGE and microarray data. We cannot conclude which of the platforms performs best. These results show, as also argued by Tan and co-workers [33], that it is important to validate the results obtained with SAGE or Affymetrix GeneChips with subsequent northern blots or quantitative PCR analysis [34-36]. It was beyond the scope of our analysis to perform such a verification of expression data. Anyway, such a validation is impractical for large numbers of genes. However, it seems that the divergence of the SAGE and Affymetrix platforms in this study is for a large part due to the wide range of Affymetrix gene expression values observed for transcripts with a low gene expression level in SAGE (Figure (Figure2C).2C
Future studies should be aimed on improving the efficiency of SAGE tag annotation and avoidance of systematic bias in microarray techniques. Only then, measurements of various technologies can be directly compared and transformed to a universal gene expression catalogue. SAGE has the advantage that a whole transcriptome is analyzed, but is limited to the analysis of a small number of samples. For screening of large sets of samples SAGE cannot be the favored choice and Affymetrix GeneChips might be a good alternative. Therefore, we think that the future lies in combining the data from SAGE with Affymetrix GeneChips, custom cDNA or oligo arrays. This gives the advantage of complete expression profiling using SAGE and high-throughput array screening of a larger panel of samples allowing rapid identification and for instance validation of clinical relevant genes involved in disease onset [42,43]. Finally, the proposed ratio difference between platforms using an universal reference sample (as also indicated in [25]) can serve as a measure for interplatform correspondence per individual gene. Conclusion This paper evaluates several approaches for the comparison of different gene expression platforms, outlined using SAGE and Affymetrix GeneChips. We demonstrate that for both SAGE and Affymetrix GeneChips the intra-platform correlations are extremely good, but that the inter-platform agreement based on an unbiased selection of transcripts is modest. The agreement between platforms increases if only transcripts are included with high tag counts and high hybridisation intensities. It appears that the expression distributions are similar for each of the platforms, but that the correlation between platforms is modest due to intrinsic differences, like sensitivity, levels of noise, and gene annotation. Finally, we introduce a novel, filtering-independent approach for data analysis based on the calculation of differences between expression ratios observed in SAGE and Affymetrix GeneChips for each individual transcript. The statistical probability value that can be assigned to each individual betweenratio difference, allows the selection of individual transcripts that display similar regulation on both platforms. Methods Tissue and RNA extraction Wilms' tumor tissue was obtained from a single individual after resection of the tumor. Tissue was immediately frozen in liquid nitrogen. Informed consent to use this material for scientific research was obtained. After homogenization, total RNA was extracted using Trizol (Invitrogen, Breda, The Netherlands), dissolved in RNase free water and stored at -80°C. The Stratagene Universal reference RNA was obtained from Stratagene (Stratagene, Amsterdam, The Netherlands, catalog #740000-41). Purity and integrity of the RNA samples was confirmed on the Agilent 2100 Bioanalyzer (Agilent Technologies Netherlands B.V., Amstelveen, The Netherlands), using the LabChip® approach. Construction of SAGE libraries The SAGE library of the Wilms' tumor RNA was generated using the I-SAGE kit according to the manufacturer's instructions (Invitrogen, Breda, The Netherlands; cat. #T5000-03). A detailed protocol may be obtained as a free download [44]. For LongSAGE minor modifications were implemented in the protocol of the I-SAGE kit; i.e. the restriction enzyme BsmFI was replaced by MmeI, linkers were adapted for LongSAGE and ditags were created using sticky-end ligation. All sequence files were processed using the SAGE2000 software provided by Dr. K.W. Kinzler (see also [45]). The SAGE library from the Stratagene Universal reference RNA was obtained from the NCBI website. This library can be retrieved in the Gene Expression Omnibus under code GSM1734 [14,20]). Annotation of tags Extracted SAGE tags were annotated based on the SAGE Genie principles [24] through several stringent filters using data from the CGAP website [15]. Several databases (i.e. HsMap.txt, HsRepetitive.txt and HsDatasets.txt) were combined to a final dataset containing all information necessary for tag annotation. Tags matching to unclustered EST's were considered to be no-matches. Tags matching to Unigene clusters retrieved from low ranked databases (<67%; according to the rules set by CGAP) were not included in our comparisons. During this process tags are matched to no, one unique, or more than one Unigene cluster (Unigene Build 160, March 2003). To further identify tags matching more than one Unigene cluster, we extracted the 11th base from our original sequence files using the SAGE2000 software. This 11th base can be used to match against the deposited sequences (Genbank, EMBL etc.) and in this way one may be able to exclude Unigene clusters that contain a different 11th base in their sequence and thereby minimize the number of multiple matches. In the final comparison tags matching to multiple Unigene clusters were excluded. For annotation of LongSAGE tags we used the data available at the CGAP site for Unigene Build 170 (July 2004). These annotations were not available for Unigene Build 160. Affymetrix Affymetrix HG-U133A GeneChips were used and the hybridizations were performed according to the manufacturer's protocols and carried out at the Micro-array Department (MAD; Institute for Life Sciences, Faculty of Science, University of Amsterdam). For analysis, the MAS 5.0 software suite was used and comparisons between duplicate Wilms' tumor hybridizations and duplicate Stratagene Universal reference RNA hybridizations were made (data were deposited into the GEO under accession GSE1158). This gives four comparisons (2 Log ratios), from which the geometric mean gene expression ratio between the two samples was calculated. Probe sets on the Affymetrix chips were matched with Unigene clusters (Unigene Build 160, March 2003). Matching of platforms The matching of data from two different gene expression profiling platforms (as illustrated in figure figure3)3 Comparison of expression ratios between samples For each platform and each transcript that full-filled the matching criteria an expression ratio between Wilms' tumor and Stratagene Reference RNA was calculated. With these ratios the correspondence between platforms was estimated using the Pearson correlation coefficient, Up/Down classification and a contingency table (Figure 1A, 1B, 1C Authors' contributions MK, FB and FVR planned and designed the study. JMR and FVR analyzed the data, generated the figures and drafted the manuscript. MK and FB helped by editing the manuscript, providing overall technical guidance and coordination. LA, DAZ and FVR created the LongSAGE and SAGE libraries, and performed cloning and sequencing of the concatemers. JMR developed the new approach for the comparison of multiple platforms, performed calculations with FVR and provided guidance with the statistical analyses. GJS and FVR performed the annotation of SAGE tags. All authors read and approved the final manuscript. Grants This work was supported by the Stichting Kindergeneeskundig Kankeronderzoek (SKK) and the Dutch Cancer Society (KWF; grant UVA 2001–2558) Acknowledgements We would like to thank Dr. A.H.C. van Kampen for reading the manuscript and helpful discussions and Raymond J. Waaijer for his bioinformatics support (Bioinformatics Laboratory, Academic Medical Center, the Netherlands). References
|
PubMed related articles
Your browsing activity is empty. Activity recording is turned off. |
|||||||||||||||||||
Science. 1992 Aug 14; 257(5072):967-71.
[Science. 1992]Methods Enzymol. 1999; 303():234-58.
[Methods Enzymol. 1999]Science. 1993 Feb 12; 259(5097):946-51.
[Science. 1993]Proc Natl Acad Sci U S A. 2000 Feb 29; 97(5):1976-81.
[Proc Natl Acad Sci U S A. 2000]Proc Natl Acad Sci U S A. 1996 Jun 11; 93(12):6025-30.
[Proc Natl Acad Sci U S A. 1996]Science. 1995 Oct 20; 270(5235):484-7.
[Science. 1995]Nat Biotechnol. 2002 May; 20(5):508-12.
[Nat Biotechnol. 2002]Annu Rev Biomed Eng. 2002; 4():129-53.
[Annu Rev Biomed Eng. 2002]Cancer J. 2001 Jan-Feb; 7(1):2-15.
[Cancer J. 2001]Nat Genet. 1999 Jan; 21(1 Suppl):20-4.
[Nat Genet. 1999]Genomics. 2004 Oct; 84(4):631-6.
[Genomics. 2004]Nucleic Acids Res. 2002 Jan 1; 30(1):207-10.
[Nucleic Acids Res. 2002]Mol Biol Cell. 1999 Jun; 10(6):1859-72.
[Mol Biol Cell. 1999]Physiol Genomics. 2002 Oct 29; 11(2):37-44.
[Physiol Genomics. 2002]BMC Genomics. 2004 Sep 2; 5(1):61.
[BMC Genomics. 2004]Nat Biotechnol. 2002 May; 20(5):508-12.
[Nat Biotechnol. 2002]J Biotechnol. 2004 Sep 9; 112(3):225-45.
[J Biotechnol. 2004]Bioinformatics. 2002 Mar; 18(3):405-12.
[Bioinformatics. 2002]Nucleic Acids Res. 2003 Jan 1; 31(1):34-7.
[Nucleic Acids Res. 2003]Nat Biotechnol. 2002 May; 20(5):508-12.
[Nat Biotechnol. 2002]Genomics. 2004 Oct; 84(4):631-6.
[Genomics. 2004]Nat Genet. 2003 Nov; 35(3):197-8.
[Nat Genet. 2003]Genomics. 2000 Sep 1; 68(2):136-43.
[Genomics. 2000]Cancer Res. 2003 Dec 15; 63(24):8614-22.
[Cancer Res. 2003]Eur J Neurosci. 2002 Aug; 16(3):409-13.
[Eur J Neurosci. 2002]Exp Mol Med. 2003 Oct 31; 35(5):460-6.
[Exp Mol Med. 2003]Nucleic Acids Res. 2003 Oct 1; 31(19):5676-84.
[Nucleic Acids Res. 2003]Genomics. 2001 Jan 1; 71(1):34-9.
[Genomics. 2001]Genome Res. 2003 Jul; 13(7):1775-85.
[Genome Res. 2003]Genomics. 2004 Oct; 84(4):631-6.
[Genomics. 2004]Nucleic Acids Res. 2005 Feb 16; 33(3):e26.
[Nucleic Acids Res. 2005]Cancer Res. 1999 Nov 1; 59(21):5464-70.
[Cancer Res. 1999]Blood. 2003 Mar 15; 101(6):2285-93.
[Blood. 2003]J Biotechnol. 2004 Sep 9; 112(3):225-45.
[J Biotechnol. 2004]Nucleic Acids Res. 2002 Jan 1; 30(1):207-10.
[Nucleic Acids Res. 2002]