- We are sorry, but NCBI web applications do not support your browser and may not function properly. More information

# Standardized determination of real-time PCR efficiency from a single reaction set-up

^{1}Institute of Agronomy and Plant Breeding, FML-Weihenstephan, Center of Life and Food Science, Technical University of Munich, Germany and

^{2}EpiGene GmbH, Biotechnology in Plant Protection, Hohenbachernstrasse 19–21, 85354 Freising, Germany

^{*}To whom correspondence should be addressed. Tel: +49 8161 713511; Fax: +49 8161 714204; Email: ed.mut.wzw@lffafp

**This article has been corrected.**See Nucleic Acids Res. 2003 November 15; 31(22): 6688.

## Abstract

We propose a computing method for the estimation of real-time PCR amplification efficiency. It is based on a statistic delimitation of the beginning of exponentially behaving observations in real-time PCR kinetics. PCR ground fluorescence phase, non-exponential and plateau phase were excluded from the calculation process by separate mathematical algorithms. We validated the method on experimental data on multiple targets obtained on the LightCycler platform. The developed method yields results of higher accuracy than the currently used method of serial dilutions for amplification efficiency estimation. The single reaction set-up estimation is sensitive to differences in starting concentrations of the target sequence in samples. Furthermore, it resists the subjective influence of researchers, and the estimation can therefore be fully instrumentalized.

## INTRODUCTION

More than 10 years of PCR-based technologies have found their place in most of the laboratories involved in biomedical science. The application of PCR in gene expression studies is an example of a fast innovating field. So far, real-time PCR in combination with array techniques is the major approach adopted in quantitative gene expression studies. The fact that several nucleic acid molecules can be amplified up to microgram amounts opens the possibility to study gene regulation even in a single cell (1). The recent introduction of various fluorescence-based monitoring detection techniques into PCR (2–7) allowed the documentation of the amplification process in the so-called real-time PCR (8–10). The amplification of nucleic acids within the range of exponential growth of the reaction trajectory can be described by a pure exponential growth (equation **1**):

*P* = *I* *E*^{n} **1**

where *P* is the amount of the PCR product of the reaction, *I* is the input nucleic acid amount, *E* is the efficiency of the reaction ranging from 1 to 2 and *n* is the number of PCR cycles.

There is a constant tendency to place the quantification into an early phase of detectable amplification. In such an early portion of PCR trajectory the amplification has the exponential character described in equation **1**. The reaction trajectory at later reaction stages significantly diverges from the exponential type, and becomes a more stochastic process. In such an early portion of the amplification kinetics, a threshold fluorescence is set. As soon as the reaction reaches this threshold fluorescence, the information necessary for the quantitative judgment about the input concentration of the target sequence has been gathered (11). The fractional cycle number of threshold value (*C*_{t}) or crossing point (CP) is then compared with the CP of control samples. There are two ways of threshold level setting. It can be done either arbitrarily by using a randomly selected threshold or by applying computing algorithms. The maximum of the second or, generally, *n*th derivative of smoothed amplification kinetics gives a good and justified threshold level within the assay (11).

Since the result of a single quantitative PCR just reflects the relative amount of target sequence in the form of fluorescence units, it must be objectified by some control. Therefore, adequate quantitative information cannot be obtained from a real-time PCR assay unless at least two samples are analyzed. To make sure that RT reactions and amplification reactions proceed in a similar way in both samples, the amplification of another target sequence present in the sample is often introduced into the assay either simultaneously or in separated runs. The expression of the standard, called the housekeeping gene or reference gene, is assumed to be uninfluenced by experimental treatment and a similar detectable amplification product should therefore be obtained (12,13). Yet, there is a lot of evidence for regulation of these genes under defined treatments (14,15).

Recently, problems have been discussed, that different sequences were often amplified with different efficiencies, causing under/overestimation of input template copy numbers in orders of magnitude. The solution is to document the amplification efficiencies (*E*) of both reactions and to apply a compensating computing algorithm (16–18). The currently used and partly automated method of determination of amplification efficiency is the method of serial dilutions, each analyzed in triplicate (11). Using this method, serial dilutions of the starting template are prepared; in these, the input nucleic acid concentration is varied over several orders of magnitude. Usually, dilution series are prepared by serially diluting the input nucleic acid five to 10 times with sterile water. Subsequently, the CP or *C*_{t} values are plotted against the log of the known starting concentration value and from the slope of the regression line the amplification efficiency is estimated (11,16). There are some variations of this method, but the serial dilution is always necessary. The method finally gives only one value of *E* for all dilution concentrations of the respective sequence. This is, however, a simplified approach, since the *E* varies considerably as the input concentration changes.

Therefore, what is required is a method of amplification efficiency determination that uses the reaction kinetics of a single sample. Since the amplification fluorescence raw data are available by data export from LightCycler (19) or ABI Prism Sequence Detection System (20) software, the efficiency estimation can be based on these data. Liu and Saint (21) suggested a method of amplification efficiency estimation based on absolute fluorescence increase in single reaction kinetics data. In this method, the portion of the data array believed to be exponentially behaving is taken, log- transformed and plotted. The authors consider the slope of the regression line the amplification efficiency. The idea behind this method is correct, but the crucial disadvantage consists of the researcher’s subjective judgment; what data are exponentially behaving and what data are not. Furthermore, the necessary subjective delimitation procedure can not be instrumentalized. Delimitation of the exponential portion of the data is done precisely at the end of it, as the reaction kinetics here strongly depart from the exponential. A similar published method (22) is also based on the absolute fluorescence increase, but it takes place around the point of inflection of the quantification trajectory where a strong decaying trend of the amplification efficiency already occurs. This method is therefore underestimating the ‘real efficiency’.

Here, we report a new method for a reliable estimation of real-time PCR efficiency, which is based on the fluorescence history of just a single reaction set-up and it resists any subjective manipulation. This method was applied on raw fluorescence data from the LightCycler real-time PCR platform.

## MATERIALS AND METHODS

### SRY plasmid DNA construction

The bovine SRY (male sex determining) gene coding sequence was cloned into pCR®4-TOPO® vector using the TOPO TA Cloning Kit for Sequencing (Invitrogen, Karlsruhe, Germany). This circular DNA construct was linearized with restriction digest and its purity was inspected on a 2% agarose gel. Exact quantification of the DNA content was done at OD_{260 nm} on a spectrophotometer (BioPhotometer®; Eppendorf, Hamburg, Germany) with UVettes (Eppendorf) in various dilutions and repeats (*n* = 12), to circumvent any source of error. For standard curve acquisition, six serial dilutions of linearized plasmid DNA ranging were prepared, representing 2.65 × 10^{2}–2.65 × 10^{7} single-stranded (ss) SRY DNA molecules, serving as DNA templates for real-time PCR.

### Real-time PCR on LightCycler

A primer pair flanking sequence within bovine SRY gene was constructed and synthesized (MWG Biotech, Ebersberg, Germany) as follows: forward primer, 5′-GAA CGC CTT CAT TGT GTG GTC-3′; reverse primer, 5′-TGG CTA GTA GTC TCT GTG CCT CCT-3′. The conditions for PCR were optimized in a gradient cycler (TGradient; Biometra, Göttingen, Germany) and subsequently in LightCycler (Roche Diagnostics, Mannheim, Germany) analyzing the melting curves of the products acquired (23). This was done with respect to primer annealing temperatures, primer concentrations, template concentrations and number of cycles applied. Real-time PCR using SYBR Green I technology (Roche Diagnostics) (10,19) with the above-mentioned primers was carried out amplifying cloned sequence in triplicate for each respective concentration. Master-mix for each PCR run was prepared as follows: 6.4 µl of water, 1.2 µl of MgCl_{2} (4 mM), 0.2 µl of each primer (4 µM), 1.0 µl of Fast Start DNA Master SYBR Green I and 2.65 × 10^{2}–2. 65 × 10^{7} copies of ss SRY linearized plasmid DNA. The following amplification program was applied: after 10 min of denaturation at 95°C, 40 cycles of four-segment amplification were accomplished with: (i) 15 s at 95°C for denaturation, (ii) 10 s at 60°C for annealing, (iii) 20 s at 72°C for elongation and (iv) determination of fluorescence at an elevated temperature of 83°C (22). Subsequently, a melting curve program was applied with continuous fluorescence measurement.

## RESULTS

After optimization of the real-time PCR assay with SRY, the gene sequence could be routinely run generating specific amplicons showing no primer dimers, a single sharp peak, identical melting points and an expected length of 164 bp in gel electrophoresis. The sensitivity of the LightCycler RT–PCR was evaluated using different starting amounts of cDNA in a standard curve. SYBR Green I fluorescence determination at the elevated temperature resulted in a reliable and sensitive cDNA quantification assay with high linearity (*r* = 0. 99) over six orders of magnitude from 2.65 × 10^{2} to 2.65 × 10^{7} recombinant standard DNA start molecules.

### Determination of fluorescence ground phase in PCR

The earliest observation of detectable growth phase above the ground phase with sufficient *n* is well suited for estimation of *E* (Fig. (Fig.1,1, inlay).

*n*>

**...**

To objectively detect the beginning of the exponential phase and to skip down the prior ground phase, a statistical method is applied. The ground phase is considered to behave linearly (equation **2**) and linear regression with intercept *i*_{lin} and slope β:

*y*_{lin} = *i*_{lin} + β × *x* **2**

Therefore, it can fit observations as long as there is no sudden significant increment of fluorescence due to reaction product generation. At the moment when the increment of fluorescence becomes a consistent trend, the beginning of the exponential phase takes place. To inspect whether each successively inspected observation still belongs to the linear ground phase or not, standardized residuals of the linear regression are computed. The last one of the regressed observations is always inspected as to whether it does or does not deviate from the linear trend. This procedure starts with the first three observations and proceeds in the way shown in the flowchart in Figure Figure22.

Computation of the studentized residual statistics is a way to obtain a test on the distribution of particular residuals. To test statistically the probability that a given residual value is an outlier we must ensure that the residual value is comparable with some defined pre-existing probability distribution (here a *t*_{n–1–p} distribution; see later).

Since observation of *x*_{i} from the data set of varying *n* is always inspected, it must be taken into account that observations further from the (mean value) have stronger influence [*h*_{ii} (leverage)] on the slope of the regression line:

Therefore, *h*_{ii} (equation **3**) is the measure of a particular influence of the respective observation *x*_{i} on the slope of the regression line.

Furthermore, the so called ‘externally studentized’ residual (24) is computed as follows:

where ε_{i} is the raw residual value, etc., the difference between the observed fluorescence (*y*_{i}) value and fitted fluorescence (*ŷ*_{i}) value, *s*_{i(n–1)} is the deviance of residuals in the regression model fitted over data with the deleted inspected observation (*n* – 1). This is computed as follows:

*MSE*_{i(n–1)} (equation **5**) is the mean square residual of the regression model with the deleted inspected data point. *n* – 2 in the denominator denotes the residual degrees of freedom of the regression model.

Each *r*_{i(n–1)} is distributed as *t*_{n–1–p} under the model. Therefore, we can test the hypothesis whether a single observation deviates from the model by comparing *r*_{i(n–1)} with the *t*_{n–1–p} distribution (equation **6**) where *F*(·) is the cumulative distribution function of the *t*_{n–1–p} distribution:

*P*-value = 2 × [1 – *F*(1 – |*r*_{i(n – 1)}|)] **6**

Note, that even if the model holds for every observation (i.e. there are no outliers), one expects ~5% of the observations to have *P*-values <0.05. Therefore, we cannot automatically call every observation with a *P* < 0.05 an outlier, especially when *n* is large. If the observation is really an outlier and the fluorescence data points are entering the exponential phase, the following observations will also be detected as residuals. Based on experience, two more data points should be inspected after the first outlier is indicated to make sure that a consistent trend takes place (Fig. (Fig.11).

### Determination of exponential observations

The start of the exponential behavior of the kinetic PCR is estimated by the described ‘externally studentized’ residual algorithm. We considered the end of the exponentially behaving observation to be just under the second derivative maximum (SDM) value as generated by LightCycler software 3.3 (Roche Diagnostics). Alternatively, from a four parametric logistic model (FPLM) with the parameters *y*_{0}, *a*, *x*_{0} and *b* (equation **7**), fitting all fluorescence observations without any background correction gives:

where *f* is the value of function computed (fluorescence at cycles *x*), *y*_{0} is the ground fluorescence, *a* is the difference between the maximal fluorescence acquired in the run and the ground fluorescence, *x* is the actual cycle number, *x*_{0} is the first derivative maximum (FDM) of the function or the inflexion point of the curve and *b* describes the slope of the curve at *x*_{0}. The FPLM maximal value of its second derivative (SDM) is computed as follows. First, second and third derivatives of the model are calculated (data not shown). To result in an SDM, the third derivative must be null, which can be achieved by computing equation **8**. Two maxima are obtained; only the first ‘positive maximum’ is relevant for the approximation of the CP:

Other ways of computing the SDM were tested: these were based on just a distinct part of amplification trajectory around the expected SDM (25) or on a four parametric sigmoidal model (FPSM). These methods yield similar results to the SDM of FPLM values obtained (26) herein.

### Estimation of amplification efficiency (*E*)

Once the beginning and the end of the exponential phase are defined, the exponential model is fitted over these data (equation **9**):

*f* = γ_{0} + α*E*^{n} **9**

The fluorescence value is represented by *f*, γ_{0} is the upward shift due to ground fluorescence, α is the fluorescence due to the nucleic acid input, *n* is the cycle number and *E* is the efficiency of amplification in the early exponential phase of real-time PCR.

### Verification of the method

Real-time PCR amplification efficiency was calculated from the given slopes in LightCycler Software 3.3 (Roche Diagnostics) (11). In the DNA calibration curve model, the efficiency per cycle was E1_{fp} = 1.95, using the ‘fit-point method’ (Table (Table1).1). The threshold fluorescence *Y* of the amplified real-time PCR product was calculated according to equation **10**:

*Y* = *I* *E*^{CP} **10**

This resulted in a distinct product threshold fluorescence *Y* at a mean concentration (*n* = 18) of 9.91 × 10^{10} ss SRY molecules/set-up for E1_{fp}, with a coefficient of variance (CV) of *Y* of 79.65%. Additionally, the SDM in the LightCycler Software 3.3 (Roche Diagnostics) (11) was performed, and resulted in 2.89 × 10^{11} ss SRY molecules/set-up for E1_{SDM} and in lower real-time PCR efficiency (E1_{SDM} = 1.92) and variation (CV = 41.48%).

Furthermore, the method of absolute fluorescence increase in the FDM (or point of inflection) of the amplification trajectory E2_{FDM} (22) and in its SDM E2_{SDM} was applied to compute the amplification efficiency E2. Briefly, the slope (or the first derivative) of the model curve at the respective maximum point is divided by the absolute fluorescence value reached at this point. These efficiencies varied between 1.351 < E2_{FDM} < 1.377 and 1.448 < E2_{SDM} < 1.484, with CVs of 159.77 and 195.92%, respectively. *Y* was also calculated and resulted in significant lower concentrations of 5.93 × 10^{8} ss SRY molecules/set-up for E2_{FDM} and 1.99 × 10^{8} ss SRY molecules/set-up for E2_{SDM}. The difference between the general efficiency calculation methods E1 and E2 is approximately three orders of magnitude.

Finally, in the new single curve estimation method by FPLM, as suggested here, the mean product threshold fluorescence was 1.05 × 10^{11} ss SRY molecules/set-up with a variation of 30.80%, comparable with E1 methods. The calculated efficiency values varied in the range 1.822 < *E*_{new} < 1.884, and lay between the evaluations described previously.

The verification method was straightforward and was based on equation **10**. At the same threshold level, the amount of nucleic acids must also be identical in samples with a different known input concentration of nucleic acid. Here, the fractional value of *n* is known as the CP. If equation **10** is computed for each sample with the respective value of *I*, *n* and *E*, identical *P*-values should be theoretically obtained. The *Y* values were computed for each three concentrations of the dilution series used. As the *E* values obtained from different computing methods were entered, the method with the lowest variance of computed *Y* was considered the most accurate (Table (Table11).

## DISCUSSION

As shown in Figure Figure1,1, fluorescence observations acquired from real-time PCR fluorescence monitoring are generally of a logistic or sigmoid shape (21,26), indicating that the PCR kinetics (27) consist of early ground phase, exponential growth phase, linear growth phase, and plateau phase. In the ground phase, the fluorescence acquisition is not detectable or just barely detectable due to the fluorescence passively emitted by the initial reaction system itself. At a certain cycle, the fluorescence emitted by the reaction product steps over the ground phase and enters the phase of growth. This phase takes several cycles and possesses a non-linear character (11). At the very beginning of this phase, the nature of the product increment can be well approximated as exponential (*r* > 0.999, *P* < 0.001). The rate of product generation slows down until the plateau phase is entered. In this phase, no more significant specific product is generated, as a consequence of reaction exhaustion (28).

Herein, we propose a method of real-time PCR amplification efficiency estimation based on single reaction kinetic observations. As shown in the theoretical work of Peccoud and Jacob (29), if the raw fluorescence observations on the PCR trajectory are available, they contain information about the amplification efficiency in itself. The pitfall in such an amplification efficiency estimation from fluorescence observations is that just a few of the reaction observations represent the initial exponential mode of the reaction. To detect where the reaction leaves its undetectable ground phase, a statistical method of residual inspections was applied. This method was robust enough to detect the first observation significantly diverging from the ground phase. In this method, no influence of the number of observations (*n*) was present, as long as very few observations were not inspected (*n* = 4). Such reaction kinetics, where the exponential phase is entered after the first three cycles, are, however, far from real usage.

The end of the exponentially behaved observations was placed into the last observation just before the SDM. This is not an arbitrary decision. After the reaction reaches the SDM, it weakens and looses its exponential character. The computing of the fractional value of the SDM for the purpose of efficiency estimation need not be of the precision demanded for threshold placement, because just discrete observations are used for the efficiency computing. In this respect, the fit of the full-observation model such as the FPLM can be used for computing the SDM. Taking LightCycler computed values of SDM (23) yields similar results. Such a delimitation of observations representing the exponentially behaved part of the PCR yields a set of observations that can be fitted by an exponential model (equation **10**) with high significance (*r*^{2} > 0.999), where the number of raw data fluorescence observations per set-up was *n* > 7.

This efficiency calculation method was tested on a further four bovine target sequences of IGF-1, TNFα, prion protein and 18S rRNA amplified in several independent runs on the LightCycler platform (Roche Diagnostics), and resulted in similar findings (data not shown). Furthermore, the method was applied to real-time fluorescence data generated during the amplification of the recombinant sequence of the *Pyrenophora teres* 18S rRNA gene on an ABI-Prism 7700 instrument (Applied Biosystems, Branchburg, USA), using either SYBR Green I dye or a FIC-labeled minor groove binding 18S rRNA probe. Good results comparable with the dilution series method were also obtained here (data not shown). Altogether, 145 reaction kinetics of various samples have been analyzed in this way, all giving consistent results.

This shows that the presented algorithm is independent of the used platform, the used fluorescence dye (SYBR Green I or FIC), the analyzed target gene and, furthermore, independent of any arbitrary decisions made by the investigator.

In conclusion, verification recalculation of the product amount at a constant threshold level of fluorescence with known efficiency showed that such computed efficiency is more accurate than the method currently used. This is above all clear in the dilution series of the same sequence as the method shows the resolution for various input target concentrations in the sample. Such a computed amplification efficiency can be output from automated platforms, as well as CP values for each sample. This is the major advantage in contrast to other real-time PCR efficiency calculation methods (11,21,22). Efficiency estimations done after the SDM are therefore underestimating the real-time PCR efficiency, whereas previously described methods using a dilution series overestimate it. The newly developed method, with values lying between those of the conventional methods, in our opinion, reflects the ‘real PCR efficiency’. The CV values for the variation of *Y* might seem to be too large (e.g. CV for E1_{fp} = 79.65%; Table Table1).1). Here, the fact must be taken into account that a great deal of the *Y* variance is caused by initial vertical shifts in the ground phase. That is, different samples have different fluorescence products already at the very beginning, before any cycling starts. This discrepancy between different samples contributes to the overall CV value for a given method (Table (Table1).1). Therefore, not the absolute CV values, but rather its order, is a measure of the applicability of a given method.

Although we want to stress the possibility of determining the amplification efficiency from just a single sample, a statistical approach with more replicates can be adopted. Herein, three replicates were investigated to confirm the stability of the described model.

## REFERENCES

*Thermus aquaticus*DNA polymerase. Proc. Natl Acad. Sci. USA, 88, 7276–7280. [PMC free article] [PubMed]

**Oxford University Press**

## Formats:

- Article |
- PubReader |
- ePub (beta) |
- PDF (148K)

- [Use of the real-time RT-PCR method for investigation of small stable RNA expression level in human epidermoid carcinoma cells A431].[Tsitologiia. 2003]
*Nikitina TV, Nazarova NIu, Tishchenko LI, Tuohimaa P, Sedova VM.**Tsitologiia. 2003; 45(4):392-402.* - Validation of an algorithm for automatic quantification of nucleic acid copy numbers by real-time polymerase chain reaction.[Anal Biochem. 2003]
*Wilhelm J, Pingoud A, Hahn M.**Anal Biochem. 2003 Jun 15; 317(2):218-25.* - [Quantitative PCR in the diagnosis of Leishmania].[Parassitologia. 2004]
*Mortarino M, Franceschi A, Mancianti F, Bazzocchi C, Genchi C, Bandi C.**Parassitologia. 2004 Jun; 46(1-2):163-7.* - [CMV DNA detection in plasma using real-time PCR based on the SYBR-Green I dye method].[Enferm Infecc Microbiol Clin. 2006]
*Varela-Ledo E, Romero-Yuste S, Ordóñez-Barbosa P, Romero-Jung P, Prieto-Rodríguez E, Aguilera-Guirao A, Regueiro-García B.**Enferm Infecc Microbiol Clin. 2006 Nov; 24(9):541-5.* - [Real-time PCR: approaches to data analysis (a review)].[Prikl Biokhim Mikrobiol. 2006]
*Rebrikov DV, Trofimov DIu.**Prikl Biokhim Mikrobiol. 2006 Sep-Oct; 42(5):520-8.*

- Selection of Reference Genes for Quantitative Real Time PCR (qPCR) Assays in Tissue from Human Ascending Aorta[PLoS ONE. ]
*Rueda-Martínez C, Lamas O, Mataró MJ, Robledo-Carmona J, Sánchez-Espín G, Jiménez-Navarro M, Such-Martínez M, Fernández B.**PLoS ONE. 9(5)e97449* - Analysis of Artifacts Suggests DGGE Should Not Be Used For Quantitative Diversity Analysis[Journal of microbiological methods. 2013]
*Neilson JW, Jordan FL, Maier RM.**Journal of microbiological methods. 2013 Mar; 92(3)256-263* - Single-base resolution of mouse offspring brain methylome reveals epigenome modifications caused by gestational folic acid[Epigenetics & Chromatin. ]
*Barua S, Kuizon S, Chadman KK, Flory MJ, Brown WT, Junaid MA.**Epigenetics & Chromatin. 73* - Evaluation of Candidate Reference Genes for Real-Time Quantitative PCR of Plant Samples Using Purified cDNA as Template[Plant Molecular Biology Reporter / Ispmb. 2...]
*Phillips MA, D’Auria JC, Luck K, Gershenzon J.**Plant Molecular Biology Reporter / Ispmb. 2009; 27407-416* - Discrimination of infectious hepatitis A virus and rotavirus by combining dyes and surfactants with RT-qPCR[BMC Microbiology. ]
*Coudray-Meunier C, Fraisse A, Martin-Latil S, Guillier L, Perelle S.**BMC Microbiology. 13216*

- Standardized determination of real-time PCR efficiency from a single reaction se...Standardized determination of real-time PCR efficiency from a single reaction set-upNucleic Acids Research. Oct 15, 2003; 31(20)e122PMC

Your browsing activity is empty.

Activity recording is turned off.

See more...