![]() | ![]() |
Formats:
|
||||||||||||||||||||||
Copyright © 2005 Barker et al; licensee BioMed Central Ltd. Increased DNA microarray hybridization specificity using sscDNA targets 1Gladstone Institute of Cardiovascular Disease, The J. David Gladstone Institutes, San Francisco, California 94158, USA 2San Francisco General Hospital General Clinical Research Center, University of California, San Francisco, San Francisco, California 94143, USA 3Department of Medicine, University of California, San Francisco, San Francisco California 94143, USA Corresponding author.Christopher S Barker: cbarker/at/gladstone.ucsf.edu; Chandi Griffin: aniya/at/itsa.ucsf.edu; Gregory M Dolganov: gregd/at/itsa.ucsf.edu; Kristina Hanspers: khanspers/at/gladstone.ucsf.edu; Jean Yee Hwa Yang: jean/at/biostat.ucsf.edu; David J Erle: david.erle/at/ucsf.edu Received February 17, 2005; Accepted April 22, 2005. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. This article has been cited by other articles in PMC.Abstract Background The most widely used amplification method for microarray analysis of gene expression uses T7 RNA polymerase-driven in vitro transcription (IVT) to produce complementary RNA (cRNA) that can be hybridized to arrays. However, multiple rounds of amplification are required when assaying very small amounts of starting RNA. Moreover, certain cRNA-DNA mismatches are more stable than the analogous cDNA-DNA mismatches and this might increase non-specific hybridization. We sought to determine whether a recently developed linear isothermal amplification method (ribo-SPIA) that produces single stranded cDNA would offer advantages over traditional IVT-based methods for microarray-based analyses of transcript expression. Results A single round of ribo-SPIA amplification produced sufficient sscDNA for hybridizations when as little as 5 ng of starting total RNA was used. Comparisons of probe set signal intensities obtained from replicate amplifications showed consistently high correlations (r = 0.99). We compared gene expression in two different human RNA samples using ribo-SPIA. Compared with one round IVT, ribo-SPIA had a larger dynamic range and correlated better with quantitative PCR results even though we used 1000-fold less starting RNA. The improved dynamic range was associated with decreases in hybridization to mismatch control probes. Conclusion The use of amplified sscDNA may offer substantial advantages over IVT-based amplification methods, especially when very limited amounts of starting RNA are available. The use of sscDNA targets instead of cRNA targets appears to improve hybridization specificity. Background DNA microarrays are a powerful tool for global analysis of gene transcript expression. The initial studies using arrays required large amounts of starting material in order to reliably detect sample signals. Since that time, improvements in sample preparation, amplification and labeling methods [1-5] have reduced the starting material requirement to ~1–5 μg of total RNA [6]. Efforts to use smaller amounts of starting material have focused on PCR [7,8] and multiple rounds of T7 RNA polymerase in vitro transcription [IVT] [9-12]. PCR based methods have been successfully used to amplify subnanogram quantities of RNA from as little as a single cell [13,14], but these approaches have not been widely adopted. Most attempts to perform arrays using submicrogram amounts of RNA have relied on 2 or 3 rounds of linear amplification using IVT, but this approach has proven to be time consuming and technically demanding. In our hands, two round IVT is necessary to prepare samples from 5–50 ng total RNA and the amplification typically takes 4–5 days to complete. Others have reported a 10% decrease in sensitivity in detection of differentially expressed genes with the addition of a second IVT round [15]. A new single primer, isothermal linear amplification method (ribo-SPIA) has been specifically developed for amplification of very small samples for use on DNA microarrays [16,17]. With this method (Figure (Figure1),1
Results and discussion Yield of sscDNA We used the ribo-SPIA method to amplify several different total RNA samples. RNA was obtained from a pool of human tissues (Clontech Universal Human Reference RNA, or cUHR, Experiment 1), mouse liver (Experiment 2), a second pool of human cells (Stratagene Universal Human Reference RNA, or sUHR, Experiment 3), and K562 human erythroleukemia cells (also Experiment 3). The amount of starting total RNA ranged from 5–100 ng and yields were in the range of ~6–12 μg of sscDNA (Table I). The somewhat lower yields seen in Experiment 1 are likely attributable to our unfamiliarity with the protocol, and yields improved in subsequent experiments. There was no clear relationship between the amount of starting RNA and the sscDNA yield. To determine how much sscDNA product was produced in the absence of any template, we performed two additional amplifications with no input RNA (Experiment 4). Some sscDNA was produced, although the amount was substantially less than that seen when input RNA was present (Table 1). To determine the possible impact of this template-independent product on microarray results, we hybridized the entire sscDNA product from a template-independent reaction to a U95Av2 microarray. This resulted in low overall signal intensity with only 0.6% of probe sets yielding "present" calls.
Size of sscDNA products sscDNA preparations were analyzed by electrophoresis using an Agilent 2100 BioAnalyzer. sscDNAs ranged widely in size and the median size was typically slightly greater than 1 kb (data not shown). sscDNAs were fragmented in preparation for hybridization resulting in fragments of ~50–200 bp. These results are similar to those previously obtained using this method [16,17]. Reproducibility of microarray hybridization results Each of the experiments included replicate amplifications (independent amplifications of aliquots of the same starting material). We hybridized each replicate to a separate microarray and calculated intensities for each probe set. Fig. Fig.2A2A
Differential gene expression measurement To test the ability to detect differential gene expression using amplified sscDNAs, we compared gene expression in two different RNA samples. We chose to compare K562 cell and sUHR RNAs since we have previously used these two RNAs to compare the performance of single round IVT-based amplification with other methods [25]. In Experiment 3, we did three separate sscDNA amplifications of K562 and sUHR RNAs. We used the same RNA preparations as for the previously reported IVT-based amplification experiments, but started with 1000-fold less material (10 ng instead of 10 μg). For each probe set, we used intensity values from replicate arrays to calculate relative gene expression (M, defined as log2 [mean K562 intensity/mean sUHR intensity]) and average signal intensity (A, defined as 1/2 log2 K562 mean intensity + 1/2 log2 mean sUHR intensity). M and A values obtained using sscDNA are shown in Fig. Fig.3B3B
The new observation that sscDNA gave a wider range of relative expression (M) values despite lower average intensity (A) values could be explained by improved hybridization specificity under the conditions used in this study. This is plausible because the binding energy for DNA-DNA interactions is more sensitive to base pair mismatching than the binding energy for DNA-RNA interactions [26,27]. To look for further evidence about specificity of hybridization, we took advantage of the mismatched (MM) probes included on the arrays. For each perfect match (PM) GeneChip 25 mer probe, there is a corresponding MM probe with a single base mismatch at base 13. The MM probes were included in the probe set design to allow adjustments for nonspecific hybridization. Under ideal conditions, MM probes would never give signals higher than PM probes, although in practice this does sometimes occur. MM probes would be more likely to give stronger signals than PM probes if there was more non-specific hybridization of off-target sequences to the probes. We found that MM intensities exceeded PM intensities less frequently when we used sscDNA as compared to cRNA. When sUHR RNA was used as starting material, the average number of probe sets where MM intensity exceeded PM intensity was 2247 for cRNA versus 1671 for sscDNA (34% higher, p = 0.008). MM intensity also exceeded PM intensity more frequently with cRNA probes for K562 RNA arrays (2903 vs. 2482 probe sets, 17% higher, p = 0.017). When we looked at raw signal intensity for all MM probes, we found that the cRNA MM intensity distributions were skewed compared to the sscDNA MM distribution (Fig. (Fig.4).4
Comparison of expression measurements made with sscDNA, cRNA, and qPCR We wished to compare how measurements made using amplified sscDNA and microarrays compared with measurements made using other approaches. We began by comparing results obtained using sscDNA and cRNA microarray hybridizations for all 12,625 probe sets on the arrays. Since the sscDNA and cRNA methods would be expected to introduce different systematic biases, we were not surprised that direct correlations between signal intensities obtained with the two different methods showed show relatively poor agreement (r = 0.72–0.75 for K562 and r = 0.68–0.70 for sUHR, as opposed to r = 0.98–0.99 between replicates performed using the same sample preparation method). The finding indicates that it will not be useful to directly compare one array hybridized with sscDNA to another one hybridized with cRNA. We next compared differential gene expression measures (M values) determined using sscDNA with those determined using cRNA. There was a clear correlation (r = 0.83, Fig. Fig.5A).5A
Next we generated another set of expression measurements that could be used as a basis for comparison for the sscDNA and cRNA array results. qPCR is typically used as "gold standard" to confirm putative differentially expressed genes detected with microarrays. Since we saw a subset of genes for which expression differed between sscDNA and cRNA targets, we next assessed if either method tracked more closely to qPCR. We chose qPCR primers and probe sets from a large group of >1000 sets that have been developed for various studies. From these, primers and probes for four subsets of genes were selected for qPCR. The first set included all genes with >4 fold difference in expression between K562 and sUHR samples as determined using the sscDNA method, the cRNA method, or both methods (53 primer/probe sets). The second set included all other genes in which the two methods disagreed by more than 2-fold (29 primer probe sets). The third set consisted of a group of 33 empirically-derived 'housekeeping genes.' These were all genes that were nearly equally expressed (|M| < 0.1) in K562 and sUHR samples according to both the sscDNA and cRNA methods and gave strong signals (A > 5 for both methods). The fourth set included 8 housekeeping genes that had been previously validated as controls for qPCR in other experiments. We determined the gene copy number for each qPCR primer and probe set and then calculated a measure of relative expression, M = log2 (K562 copy number)/(sUHR copy number), that could be directly compared to M values from arrays. 37 putative duplicate probe sets from 17 genes probe sets were hand-curated to confirm that they would correspond to the predicted qPCR product. In two cases probe sets were found to be misidentified in the GeneChip annotation and were removed from the analysis. In the remaining cases of duplication, the qPCR and microarray values were averaged across the duplicates. The final set of 106 curated genes and the associated data can be found at http://asthmagenomics.ucsf.edu. There were clear correlations between qPCR M values and array M values obtained using sscDNA (Fig. (Fig.6A)6A
Conclusion We examined the suitability of a new isothermal linear amplification method for application to Affymetrix GeneChip microarrays. We performed a series of tests using starting amounts of RNA ranging from 5 to 100 ng for amplification yield and reproducibility. The amplification reactions consistently produced sufficient sscDNA for multiple array hybridizations. Pairwise comparison of technical replicates hybridized to microarrays by regression analysis showed excellent consistency. When we used sscDNA to analyze differential gene expression between two samples, we found a larger dynamic range than that obtained with cRNA hybridizations. The improved performance appears to be related to increased sscDNA hybridization specificity. The data obtained using this new method also more closely matched the results from qRT-PCR than data obtained using standard IVT reactions, even though the amount of starting RNA used was 1000-fold less. This new amplification method is a useful alternative approach for preparing targets that is especially well-suited for experiments involving small amounts of starting material. Methods Test samples Clontech Human Universal Reference Pool total RNA (cUHR), derived by pooling RNA from a variety of human tissues, was purchased from BD Biosciences and used in Experiment 1. Mouse total liver RNA was isolated by standard methods from C57/BL6 mice according to procedures approved by the UCSF Committee on Animal Research and used in Experiment 2. For Experiment 3, we used Stratagene Human Universal Reference Pool and K562 erythroleukemia total RNAs from the same batches used in a previous study [25]. All samples were assessed for size and integrity using the Agilent 2100 BioAnalyzer RNA 6000 Nano LabChip assay. RNA and DNA samples were quantified using a NanoDrop ND-1000 spectrophotometer. Sample preparation sscDNA samples were prepared using the NuGEN Technologies Ovation RNA amplification and Biotin Labeling system (Version 1.0) according to the manufacturer's directions from the indicated amount of starting RNA (5–100 ng). All reactions were performed in 0.2 ml strip PCR tubes in an MJ GeneWorks PTC-100 thermocycler using recommended programs. Since the seal for PCR tubes and caps tends to deteriorate with repeated use, we replaced the caps for each tube before each resealing step in the protocol. Following amplification, sscDNA product was purified using QIAquick PCR purification kits (Qiagen). Samples were fragmented and end labeled with biotin. After stopping, each reaction was concentrated in a Microcon YM-3 column to a final volume of ~20 μl. The concentrated material was purified using a Centri-Sep 100 spin column (Princeton Separations). Negative control reactions were prepared by replacing input RNA with the appropriate volume of RNase free water. DNA microarrays All samples were placed in standard Affymetrix hybridization buffer. The sample denaturation time for the sscDNA samples was reduced from 5 to 2 minutes and hybridization time increased from 16 to 20 hours as recommended by NuGEN Technologies. cUHR gene expression was assayed using Affymetrix Human Genome U133A GeneChip arrays (Experiment 1). Mouse liver RNA was assayed using Murine Genome Mu6500A arrays (Experiment 2). K562 and sUHR RNAs were assayed using Human Genome U95Av2 arrays (Experiment 3). One template independent sample was also analyzed using a Human Genome U95Av2 array (Experiment 4). Arrays were stained with phycoerythrin-streptavidin according to the manufacturer's instructions. Metrics for all sample hybridizations including scaling factors, mean background intensities, and percent present calls have been provided (see Additional File 1). Each set of data was normalized independently using RMAExpress software http://stat-www.berkeley.edu/~bolstad/RMAExpress/RMAExpress.html. K562 and sUHR microarray data were also analyzed using Microarray Suite 4.0 in order to calculate PM-MM values for each transcript probe set. Probe level analyses were performed using the BioConductor [29] affy analysis package [30]. All microarray data have been deposited in the Gene Expression Omnibus (GEO) database under the accession numbers GSM41384 – GSM41393, GSM41433 – GSM41438 and GSM4843 – GSM4847. Real-time PCR Real-time (RT) PCR was used to measure the expression of selected genes in sUHR and K562 cells. Gene-specific primers for multiplex real time RT-PCR were designed for each gene of interest using "Primer Express" software (Perkin-Elmer) and purchased from Biosearch Technologies. Sequence data for all oligonucleotides primers has been provided (see Additional File 2). First strand cDNA synthesis was performed using total RNA, Powerscript reverse transcriptase (BD Biosciences), and random hexamer primers. Real time amplification was performed using an ABI Prizm7900 and Invitrogen Universal Master Mix. Relative gene copy numbers (GCN) were calculated as described previously [31]. GeNorm [32] was used to select the two most stable housekeeping genes across all specimens for normalization. Authors' contributions CSB conceived of the study, participated in the design and coordination, and wrote the manuscript. CG participated in the design of the study and both CG and KH performed the microarray experiments. GMD carried out the real time PCR studies. JYHY performed the statistical analyses. DJE participated in the study design and analysis and revised the manuscript. All authors read and approved the final manuscript. Additional File 1 Hybridization Metrics is a .txt file suitable for opening in Microsoft Excel containing information about each individual hybridization. Click here for file(1.2K, txt) Additional File 2 Taqman Primers provides sequence information for oligonucleotides used for qRT-PCR. Click here for file(29K, txt) Acknowledgements This study was supported in part in the General Clinical Research Center at San Francisco General Hospital and supported by Grant 5-MO1-RR00083 from the Division of Research Resources, National Institutes of Health, the UCSF NHLBI Shared Microarray Facility (NIH grant HL072301) and the UCSF Sandler Center for Basic Research in Asthma. We thank Yanxia Hao, Jennifer Gregg, Agnes Paquet and Michael Salazar for excellent technical assistance, Ron Davis for helpful discussions and Karena Essex for assistance in preparation of the manuscript. References
|
PubMed related articles
Your browsing activity is empty. Activity recording is turned off. |
|||||||||||||||||||||
Nat Med. 1999 Jan; 5(1):117-22.
[Nat Med. 1999]J Mol Diagn. 2003 Feb; 5(1):9-14.
[J Mol Diagn. 2003]J Neurosci. 2001 Nov 1; 21(21):8310-4.
[J Neurosci. 2001]Curr Biol. 2000 Mar 23; 10(6):301-10.
[Curr Biol. 2000]Nat Biotechnol. 2002 Sep; 20(9):940-3.
[Nat Biotechnol. 2002]Biotechniques. 2004 Nov; 37(5):854-7.
[Biotechniques. 2004]Biotechniques. 2004 Nov; 37(5):854-7.
[Biotechniques. 2004]Biotechniques. 2004 Nov; 37(5):854-7.
[Biotechniques. 2004]Biotechniques. 2004 Mar; 36(3):498-506.
[Biotechniques. 2004]Biotechniques. 2002 Oct; 33(4):906-12, 914.
[Biotechniques. 2002]Biotechniques. 2002 Jun; 32(6):1330-4, 1336, 1338, 1340.
[Biotechniques. 2002]Biomol Eng. 2003 Mar; 20(3):97-106.
[Biomol Eng. 2003]Genome Res. 2003 Jul; 13(7):1775-85.
[Genome Res. 2003]Nucleic Acids Res. 1977; 4(5):1539-52.
[Nucleic Acids Res. 1977]Biochemistry. 2000 Sep 19; 39(37):11270-81.
[Biochemistry. 2000]Nucleic Acids Res. 2001 Nov 15; 29(22):E112.
[Nucleic Acids Res. 2001]Genome Res. 2003 Jul; 13(7):1775-85.
[Genome Res. 2003]Genome Biol. 2004; 5(10):R80.
[Genome Biol. 2004]Bioinformatics. 2004 Feb 12; 20(3):307-15.
[Bioinformatics. 2004]Genome Res. 2001 Sep; 11(9):1473-83.
[Genome Res. 2001]Genome Biol. 2002 Jun 18; 3(7):RESEARCH0034.
[Genome Biol. 2002]