![]() | ![]() |
Formats:
|
||||||||||||
Copyright © 2004, European Molecular Biology Organization Review Article Review In control: systematic assessment of microarray performance 1Department of Biomedical Genetics, University Medical Center Utrecht, PO Box 85060, 3508 AB Utrecht, the Netherlands aTel: +31 30 253 8486; Fax: +31 30 253 90354; Email: f.c.p.holstege/at/med.uu.nl Received May 3, 2004; Accepted August 19, 2004. This article has been cited by other articles in PMC.Keywords: expression profiling, external controls, microarray, performance, quality, spikes Expression profiling using DNA microarrays is a powerful technique that is widely used in the life sciences. How reliable are microarray-derived measurements? The assessment of performance is challenging because of the complicated nature of microarray experiments and the many different technology platforms. There is a mounting call for standards to be introduced, and this review addresses some of the issues that are involved. Two important characteristics of performance are accuracy and precision. The assessment of these factors can be either for the purpose of technology optimization or for the evaluation of individual microarray hybridizations. Microarray performance has been evaluated by at least four approaches in the past. Here, we argue that external RNA controls offer the most versatile system for determining performance and describe how such standards could be implemented. Other uses of external controls are discussed, along with the importance of probe sequence availability and the quantification of labelled material. Introduction DNA microarrays are universal tools that can be applied throughout the life sciences (Brown & Botstein, 1999; Lockhart & Winzeler, 2000; Young, 2000). mRNA-expression profiling is the most frequent application. Such microarray hybridizations determine changes in mRNA levels between two samples or result in an absolute quantification that is correlated to mRNA levels. How reliable are these measurements? Given the widespread interest, it is surprising that there have been relatively few systematic analyses of microarray performance. One reason for this lack of assessment is the complicated nature of microarray technology; there is no single 'microarray technology', but rather a collection of different technology platforms. Established platforms include Affymetrix GeneChips (Santa Clara, CA, USA), PCR-product-based cDNA arrays and long oligomer arrays that are manufactured in-house or by Agilent (Palo Alto, CA, USA). New platforms are still being introduced, such as the Illumina Beadarray (San Diego, CA, USA; Fan et al, 2004) or the Universal Hexamer Array from Agilix (New Haven, CT, USA; Roth et al, 2004). To complicate matters further, many technical alternatives are possible within each platform for each of the numerous steps between sample preparation and data analysis. These include diverse methods of generating labelled material, various hybridization conditions, different microarray scanners and settings, a range of image-quantification techniques, and several approaches for determining statistically and biologically significant differential gene expression. Microarray technology is therefore an amalgamation of many different techniques, even within individual technology platforms. This complexity makes the need for comparing performance even stronger, whilst confounding such comparisons. Determining reliability is a complicated undertaking if all aspects are to be assessed in a non-arbitrary way across the different platforms and their variants. In addition, reliability is a sensitive issue for those groups that provide the technology. Finally, not every application requires reliable estimates of mRNA level changes. This should be interpreted as an indication of the power of microarray technology, as even lower quality data can yield important results. Improved performance would nevertheless benefit all applications. A high degree of reliability is a requirement if certain fields, such as systems biology (Ideker et al, 2001) or diagnostic mRNA-expression profiling (van de Vijver et al, 2002) are to mature. A strong argument can be made for investigating how the technology can be systematically assessed, given its increased usage, the costs that are involved and the fact that the aim is to determine the mRNA levels of all genes, including those that are expressed at nearly zero levels. Here, we describe approaches for determining microarray performance and propose that the use of external control RNAs is a versatile and robust method for achieving this goal. Accuracy and precision Which performance parameters should be assessed? The two main characteristics of data quality are accuracy and precision. Whereas accuracy refers to how close a measurement is to the real value, precision indicates how often a measurement yields the same result (Fig 1
Performance-assessment models There are at least two situations that require the systematic determination of performance: technology assessment and individual hybridization assessment. An example of the first case is determining which platform works best. This approach can be expanded to include technology optimization. For example, how is the accuracy of a microarray hybridization influenced by various labelling strategies? This question can be asked for every step of microarray production and experimentation, whether it is testing oligomer-probe design, assessing hybridization protocols, equipment, or even evaluating data-processing approaches. The second issue that could benefit from performance assessment is at the level of individual hybridizations: do all of the microarray hybridizations in a large experiment or project yield results of similar quality? This comes after technology optimization and is important because of the complicated nature of microarray methodology. Proceeding from biological material to differential mRNA levels involves several steps, many of which have not yet been stably optimized. Confounding artefacts are still being uncovered (Diehl et al, 2001; Ramdas et al, 2001; Chuaqui et al, 2002; Fare et al, 2003; Martinez et al, 2003; Raghavachari et al, 2003; t Hoen et al, 2003; Lyng et al, 2004). Therefore, monitoring quality would benefit individual hybridizations and projects. This could also aid in analyses of the data that are now being collected in public databases (Edgar et al, 2002; Brazma et al, 2003). In these cases, internal quality control would allow the refinement of decisions about which data to use, depending on the requirement for different quality parameters. Approaches to determining performance One method that can be used to optimize protocols is to measure and increase the signal intensity (Rickman et al, 2003; Wrobel et al, 2003). The underlying assumption is that increased signal-to-noise ratios will yield better quality hybridizations. However, an increase in signal might be aspecific; for example, owing to increased cross-hybridization or the nonspecific binding of fluorophores to nucleic-acid probes (Chuaqui et al, 2002). It is therefore risky to optimize signal-to-noise ratios without knowing whether specificity is being maintained. A second approach is to determine the correlation between new methods and an approach that is already in use. Different amplification and labelling techniques are usually assessed by comparison to a standard cDNA-synthesis protocol (Mahadevappa & Warrington, 1999; Manduchi et al, 2002; Gupta et al, 2003; t Hoen et al, 2003; Kenzelmann et al, 2004). A correlation coefficient only shows how similarly two protocols behave; it does not give information on their individual accuracy. A high correlation (Barczak et al, 2003) might therefore mean that the technologies that are being compared both suffer from the same error. Moreover, a low correlation (Tan et al, 2003) still begs the question of which technique is better. Another use of correlation is to monitor reproducibility; for example, between the two dye channels of cDNA arrays. The drawback is that the technology is being optimized for yielding identical intensities, rather than for accurately reporting what most users are interested in: differences in mRNA levels. Perfectly tight same-versussame scatter plots, which are often touted in publications or advertisements as proof of superior performance, should be treated with caution. Optimization that is based on achieving tight scatter plots can lead to a decreased ability to report changes in mRNA levels. Ideally, optimization should focus on reporting relative or absolute mRNA levels and should take into account the entire range of expression levels. A third method for performance evaluation is to use an established cell-culture experiment in which changes in mRNA levels are verified by other means, such as northern blotting analysis or quantitative reverse transcription (RT)-PCR (Taniguchi et al, 2001; Yuen et al, 2002; Polacek et al, 2003; Loguinov et al, 2004; Roth et al, 2004). Using such established differentials is a good method because it optimizes the reporting of differences in expression, which is the goal of most microarray hybridizations. One disadvantage is that verification and optimization are driven by the differences that are reported by the microarrays, rather than by all of the mRNA-level differences that are present in the experimental system. There is no test for false-negative differentials unless RT-PCR, for example, is carried out on many hundreds of genes that are not reported as being differentially expressed in the microarray experiment. A further drawback is that this method, similar to those described above, does not lend itself to the routine assessment of each individual microarray hybridization before optimization. External controls A fourth approach for assessing microarray technology is based on the use of external RNA controls, which are also known as spikes, spike-in controls, exogenous controls or standards (Lockhart et al, 1996; Eickhoff et al, 1999; Girke et al, 2000; Hughes et al, 2001; Yue et al, 2001; Dorris et al, 2002; Relogio et al, 2002; Badiee et al, 2003; Benes & Muckthaler, 2003; Wang et al, 2003). An external control approach does not have to suffer from the disadvantages discussed above; it can also lend itself to reporting accuracy in each microarray experiment after optimization (see below). External controls are RNA molecules that are synthetically produced by in vitro transcription. Control RNAs can be added in defined amounts to biological samples that are of interest. Microarray probes that correspond to the controls report the fidelity of the technology in determining the presence of the controls. The crucial feature of external RNAs is that the user has absolute control over how they are added. When they are spiked differentially between two identical RNA samples, external RNAs can mimic differentially expressed genes. The user knows that only the control RNAs are differentially present and at exactly what amounts. Owing to this versatility, external controls form an ideal benchmarking system for microarray technology. To illustrate their convenience, Figure 2
Design and handling of external controls The most important requirement for external controls is that they are representative of the endogenous mRNAs with regard to length and sequence characteristics. Similarly, the microarray probes that are used for monitoring the external control RNAs should be typical of all of the probes that are present on the microarray with regard to design, manufacture and cross-hybridization potential. Excessive cross-hybridization towards and from endogenous transcripts must be avoided. Detailed recommendations on how to achieve representative controls and probes are described in the supplementary information online. Requirements for the vectors that are used to generate external control RNA, details of how to store and handle controls, measures to avoid pipetting errors and information about how to incorporate the controls on microarrays are also described. An important consequence of some of these criteria is that several controls and probes must be used. Spiking strategies External controls can be spiked at different stages during sample processing. However, routine use dictates that external controls should be spiked early during sample processing so that as many steps as possible are monitored. The easiest way to achieve this is to spike external control mixes into total RNA samples, thereby controlling all of the downstream steps. The example in Figure 2
RNA sample preparation Stricter requirements for RNA sample preparation and the possibility of pipetting errors represent a potential 'Achilles' heel' for some applications. These are discussed below in increasing order of difficulty. Technology assessment and optimization present no challenge with regard to RNA preparation. Here, controls can be spiked differentially between aliquots of an identical total RNA sample. For hybridization assessment in a single project, heterogeneity in RNA quantification of different samples can be confounding. However, there is no reason why the quantification or quality of RNA should be heterogeneous, as long as the yields are high enough, and the preparation and handling methods are consistent. Recent improvements in low-yield RNA quantification include the introduction of microlitre-volume spectrophotometers. Provided that such samples can be used for repeat hybridizations, any inconsistencies will be obvious. The most difficult situation for hybridization assessment is when unique samples, such as biopsy material, are used that can only be obtained in low yields and are of heterogeneous quality. However, external controls could be used here as a control for differential sample quality; aberrant behaviour of the mRNA population relative to the controls indicates heterogeneity in the quality or quantification of the RNA sample. External control availability Some external control RNAs and their corresponding probes are already commercially obtainable. However, their use is restricted by cost and sequence availability. Plasmids for the external controls that are present on Affymetrix microarrays (Lockhart et al, 1996) can be obtained from the American Type Culture Collection (ATCC). Most of the controls used in the examples cited here were designed several years ago, when only limited genome sequence information was available with which to determine the cross-hybridization potential. At present, no sufficiently large set of external controls is available that fits all the criteria and can be used across several organisms and applications. This could soon change, as the industry-led External RNA Control Consortium (ERCC; http://www.affymetrix.com/community/standards/ercc.affx) is endeavouring to establish and develop a universal set of external RNA controls. Such an initiative should be wholeheartedly endorsed. Alternative uses of external controls Besides sensitivity testing (Fig 3 Insight into data quality is not always necessary We believe that all microarray applications would benefit from the wider use of external controls, especially for technology assessment and optimization. However, not all applications require quality assessment. One such example is microarray screening with the sole purpose of identifying differentially expressed genes; as long as the differential expression of these genes is verified independently, there is no requirement for knowing the data quality of the microarrays. Many studies still fall under this category even though much more is feasible (Lockhart & Winzeler, 2000; Young, 2000). A common denominator of studies that go beyond screening is comparative analysis across many experiments for the purpose of examining entire metabolic, regulatory and pathological pathways. A wider implementation of performance assessment would increase the value of screening experiments for such comparative studies and would also benefit the success of the screens themselves. Other issues that affect accuracy The wider implementation of external controls will not address all of the data-reliability issues. An obvious example is the lack of microarray-probe sequence information. Owing to the necessary revisions of genome annotation, a lack of probe sequence information confounds the analysis of experiments at present, especially across different arrays. Without probe sequence information, there is no guarantee that an oligomer or cDNA probe actually represents the gene that it is supposed to. This problem could be addressed by making it compulsory to submit probe sequence information along with microarray data. The Microarray Gene Expression Data (MGED) Society (http://www.mged.org), which is responsible for coordinating the agreement on the annotation of microarray data, is now recommending that probe sequence information is also included as part of the Minimal Information About a Microarray Experiment (MIAME) criteria (Brazma et al, 2001). A different confounding issue for performance assessment is the poor description of labelled material. At present, the absolute amount and incorporation percentage of labelled material being applied to microarrays is usually not reported. Both are important determinants of hybridization success and can be measured relatively easily (Fig 4
The data-reliability issues discussed here should not discourage the use of microarrays; rather, this is a review of approaches that might further advance an important technology. Although they were first introduced at an early stage (Lockhart et al, 1996), the widespread use of external RNA controls has been held back by several factors. Incomplete genome sequences were one such barrier, but this is no longer an issue. Another factor is the additional work that is required, many aspects of which are discussed above. The most feasible application of external controls is in technology assessment and optimization, which on its own would contribute markedly to increased microarray accuracy. Supplementary information is available at EMBO reports online (http://www.emboreports.org). Design and use of external controls in DNA microarray experiments Click here to view.(206K, pdf) Acknowledgments We thank E. Brendeford for providing Figure 2 References
|
PubMed related articles
Your browsing activity is empty. Activity recording is turned off. |
|||||||||||
Nat Genet. 1999 Jan; 21(1 Suppl):33-7.
[Nat Genet. 1999]Nature. 2000 Jun 15; 405(6788):827-36.
[Nature. 2000]Cell. 2000 Jul 7; 102(1):9-15.
[Cell. 2000]Genome Res. 2004 May; 14(5):878-85.
[Genome Res. 2004]Nat Biotechnol. 2004 Apr; 22(4):418-26.
[Nat Biotechnol. 2004]Annu Rev Genomics Hum Genet. 2001; 2():343-72.
[Annu Rev Genomics Hum Genet. 2001]N Engl J Med. 2002 Dec 19; 347(25):1999-2009.
[N Engl J Med. 2002]Nat Genet. 2002 Dec; 32 Suppl():490-5.
[Nat Genet. 2002]Nat Genet. 2002 Dec; 32 Suppl():496-501.
[Nat Genet. 2002]Nat Rev Genet. 2002 Aug; 3(8):579-88.
[Nat Rev Genet. 2002]Nucleic Acids Res. 2001 Apr 1; 29(7):E38.
[Nucleic Acids Res. 2001]Nat Genet. 2002 Dec; 32 Suppl():509-14.
[Nat Genet. 2002]Anal Chem. 2003 Sep 1; 75(17):4672-5.
[Anal Chem. 2003]Nucleic Acids Res. 2003 Feb 15; 31(4):e18.
[Nucleic Acids Res. 2003]Anal Biochem. 2003 Jan 15; 312(2):101-5.
[Anal Biochem. 2003]Nucleic Acids Res. 2003 Sep 15; 31(18):e109.
[Nucleic Acids Res. 2003]Nucleic Acids Res. 2003 Jun 15; 31(12):e67.
[Nucleic Acids Res. 2003]Nat Genet. 2002 Dec; 32 Suppl():509-14.
[Nat Genet. 2002]Nat Biotechnol. 1999 Nov; 17(11):1134-6.
[Nat Biotechnol. 1999]Physiol Genomics. 2002 Sep 3; 10(3):169-79.
[Physiol Genomics. 2002]Nucleic Acids Res. 2003 Feb 15; 31(4):e13.
[Nucleic Acids Res. 2003]Nucleic Acids Res. 2003 Mar 1; 31(5):e20.
[Nucleic Acids Res. 2003]Genomics. 2004 Apr; 83(4):550-8.
[Genomics. 2004]Genomics. 2001 Jan 1; 71(1):34-9.
[Genomics. 2001]Nucleic Acids Res. 2002 May 15; 30(10):e48.
[Nucleic Acids Res. 2002]Physiol Genomics. 2003 Apr 16; 13(2):147-56.
[Physiol Genomics. 2003]Genome Biol. 2004; 5(3):R18.
[Genome Biol. 2004]Nat Biotechnol. 2004 Apr; 22(4):418-26.
[Nat Biotechnol. 2004]Nat Biotechnol. 1996 Dec; 14(13):1675-80.
[Nat Biotechnol. 1996]Nucleic Acids Res. 1999 Nov 15; 27(22):e33.
[Nucleic Acids Res. 1999]Plant Physiol. 2000 Dec; 124(4):1570-81.
[Plant Physiol. 2000]Nat Biotechnol. 2001 Apr; 19(4):342-7.
[Nat Biotechnol. 2001]Nucleic Acids Res. 2001 Apr 15; 29(8):E41-1.
[Nucleic Acids Res. 2001]Nat Biotechnol. 1996 Dec; 14(13):1675-80.
[Nat Biotechnol. 1996]Nat Biotechnol. 1996 Dec; 14(13):1675-80.
[Nat Biotechnol. 1996]Plant Physiol. 2000 Dec; 124(4):1570-81.
[Plant Physiol. 2000]Nat Biotechnol. 2001 Apr; 19(4):342-7.
[Nat Biotechnol. 2001]Genome Res. 2002 Jun; 12(6):976-84.
[Genome Res. 2002]Nucleic Acids Res. 2002 Apr 1; 30(7):e30.
[Nucleic Acids Res. 2002]Nature. 2000 Jun 15; 405(6788):827-36.
[Nature. 2000]Cell. 2000 Jul 7; 102(1):9-15.
[Cell. 2000]Nat Genet. 2001 Dec; 29(4):365-71.
[Nat Genet. 2001]Nat Biotechnol. 1996 Dec; 14(13):1675-80.
[Nat Biotechnol. 1996]Nucleic Acids Res. 2002 May 15; 30(10):e48.
[Nucleic Acids Res. 2002]Nucleic Acids Res. 2002 Feb 15; 30(4):e15.
[Nucleic Acids Res. 2002]Nat Biotechnol. 1996 Dec; 14(13):1675-80.
[Nat Biotechnol. 1996]