- We are sorry, but NCBI web applications do not support your browser and may not function properly. More information

# Mathematics of quantitative kinetic PCR and the application of standard curves

^{*}To whom correspondence should be addressed. Tel: +1 418 648 2582; Fax: +1 418 648 5849; Email: ac.yrtserof.lfc@egdelturb

## Abstract

Fluorescent monitoring of DNA amplification is the basis of real-time PCR, from which target DNA concentration can be determined from the fractional cycle at which a threshold amount of amplicon DNA is produced. Absolute quantification can be achieved using a standard curve constructed by amplifying known amounts of target DNA. In this study, the mathematics of quantitative PCR are examined in detail, from which several fundamental aspects of the threshold method and the application of standard curves are illustrated. The construction of five replicate standard curves for two pairs of nested primers was used to examine the reproducibility and degree of quantitative variation using SYBER® Green I fluorescence. Based upon this analysis the application of a single, well- constructed standard curve could provide an estimated precision of ±6–21%, depending on the number of cycles required to reach threshold. A simplified method for absolute quantification is also proposed, in which quantitative scale is determined by DNA mass at threshold.

## INTRODUCTION

Kinetic PCR (kPCR) allows quantification of a target DNA within a sample, with the advantage that sensitivity is independent of copy number (1–4). The key aspect differentiating kPCR from previous quantitative PCR methodologies is that target copy number is determined from the fractional cycle at which a threshold amount of amplicon DNA is reached (threshold cycle or C_{t}), set at a point where amplicon DNA just becomes detectable, but is still within the exponential phase of the amplification (5–7). This approach ensures that interfering factors associated with the late stages of amplification are minimized, and provides the potential for unprecedented precision for quantitative determinations.

Although several methods have been developed to measure C_{t}, all are based upon fluorescent monitoring of amplicon DNA generation (8–10). Absolute quantification can be achieved using a standard curve, constructed by amplifying known amounts of target DNA in a parallel group of reactions run under identical conditions to that of the sample (7,11). Standard curve preparation is both labour intensive and error prone, with quantitative accuracy being dependent on both the accuracy of DNA standard quantification and the quality of standard curve construction (1,12,13).

In this study, a detailed examination of the mathematics governing PCR yielded insights into the process of quantitative kPCR and into some of the fundamental aspects of the threshold method. This provided a foundation from which to examine the reproducibility of standard curve construction. Assessment of intra- and inter-run variation indicates that a substantial degree of precision can be achieved, even with the application of a single standard curve to multiple runs. An alternative approach is also proposed in which quantitative scale is determined by DNA mass at threshold, such that absolute quantification would only require determination of amplification efficiency.

## MATERIALS AND METHODS

The DNA standard consisted of a 218 bp amplicon produced by the K3/K2 primer pair (forward K3: GGCACCTC AGGAATGGGCTATTACAA and reverse K2: AGAATA ACACAGAAATCTGTAGGTGGAATTGAA) that was purified by chloroform extraction followed by isopropanol precipitation, and quantified by averaging three replicate A_{260} absorbance determinations conducted on two spectrophotometers. A second 102 bp amplicon was produced by pairing of K2 with another primer (forward K1: TCCTATGAGATTATGACGCATTTCTCCAAA) located near the center of the K3/K2 amplicon. The primer pair combinations of K3/K2 and the nested K1/K2 thus allowed the production of two different-sized amplicons (218 and 102 bp, respectively) using the same DNA standard dilution series.

PCR amplifications were conducted using QuantiTect™ Syber® Green PCR Kit (Qiagen Inc.) according to the manufacturer’s instructions, with 0.25 µM primers and a variable amount of DNA standard in a 35 µl final reaction volume. Thermocycling was conducted using an Opticon2 DNA Engine (MJ Research Inc.) initiated by a 15 min incubation at 94°C, followed by 45 cycles (90°C, 1 s; 62°C, 120 s) with a single fluorescent reading taken at the end of each cycle. Each run was completed with a melting curve analysis to confirm the specificity of amplification and lack of primer dimers. C_{t} values were determined by the Opticon2 software using a fluorescence threshold manually set to 0.0160 for all runs and exported into a MS Excel workbook (Microsoft Inc.) for analysis (available as Supplementary Material).

## RESULTS

### Mathematics of quantitative kPCR

The basic equation describing PCR amplification is:

N_{C} = N_{0}·(E + 1)^{C}**1**

where C is the number of thermocycles, E is amplification efficiency (also expressed as %E = E ×100%), N_{C} is the number of amplicon molecules and N_{0} is the initial number of target molecules.

In simple terms, each thermocycle produces an increase in N_{C} in proportion to amplification efficiency, such that 100% efficiency produces a doubling in the number of amplicon molecules. Additionally, the quantity of N_{C} present after any specific number of thermocycles is dependent on N_{0}. Rearrangement of equation 1 provides the mathematical relationship upon which quantitative kPCR is based:

N_{0} = N_{C}/(E + 1)^{C}**2**

Quantification of N_{C} thus allows N_{0} to be calculated if amplification efficiency is known. A major breakthrough for quantitative PCR came with the use of DNA fluorescence to monitor amplicon accumulation (3,5). Based upon this technique Higuchi *et al*. (5) developed an elegant method that simplifies N_{C} determination, such that individual amplification reactions are compared at the point at which they contain identical amounts of amplicon DNA. This is accomplished by selecting a fluorescent threshold (F_{t}) from which the fractional thermocycle (C_{t}) is calculated that defines the theoretical point at which each amplification reaction reaches fluorescence threshold.

Under this ‘threshold’ method, N_{C} becomes a constant such that equation **2** becomes:

N_{0} = N_{t}/(E + 1)^{Ct}**3**

where C_{t} is the threshold cycle and N_{t} is the number of amplicon molecules at fluorescent threshold.

Absolute quantification can be achieved using a standard curve constructed by amplification of known amounts of target DNA and plotting the resulting C_{t} values against target DNA concentration. The mathematical basis of a standard curve can be derived by taking the logarithm of equation **3**:

Log(N_{0}) = Log(N_{t}) – Log[(E + 1)^{Ct}]

Log(N_{0}) = Log (N_{t}) – Log(E + 1)·C_{t}

Log(N_{0}) = –Log(E + 1)·C_{t} + Log (N_{t})**4**

Assuming E and N_{t} are constants, equation **4** has the general structure of a line (y = mx + b) such that plotting Log(N_{0}) versus C_{t} produces a line with:

Slope = –Log(E + 1)

E_{S} = 10^{–Slope} – 1**5**

and

Intercept = Log(N_{t})

N_{t} = 10^{Intercept}**6**

where E_{S} is the slope-derived estimate of amplification efficiency.

Although the ability to derive amplification efficiency from the slope of a standard curve has been widely reported, it has not been generally recognized that the number of amplicon molecules at threshold can be directly determined from the intercept. It must also be stressed that these derivations are valid only if all PCR reactions have identical amplification efficiencies, and only if amplification efficiency is invariant over the number of thermocycles required to reach C_{t}.

Another important but often overlooked aspect of the threshold method is the interdependency of C_{t} and N_{t} on F_{t}, which has two important implications. First, C_{t} values generated from different amplification runs can be directly compared only if an identical F_{t} is used for each run. Second, the relationship between N_{t} and F_{t} is dependent on amplicon size. This is due to the fact that the underlying determinant of F_{t} is DNA fluorescence, which in turn has a linear relationship with DNA mass. As such F_{t} directly reflects DNA mass at threshold, which is related to N_{t} as described by:

M_{t} = (N_{t}·A_{S})/9.1 × 10^{11}**7**

where M_{t} is the DNA mass at threshold in nanograms, A_{S} is the amplicon size in base pairs and 9.1 × 10^{11} is the number of single base pair molecules per nanogram.

A less obvious but potentially significant extension of this is that if M_{t} is known, N_{t} can be predicted for any amplicon of known size, if it is assumed that amplicon size and base pair composition do not significantly influence DNA fluorescence. To test the general utility of PCR mathematics for standard curve evaluation and to examine the effectiveness of M_{t} for predicting N_{t}, a series of replicate standard curves was constructed for two amplicons that differ significantly in size.

### Experimental design for constructing replicate standard curves

Figure Figure11 is an example of the two types of graphic output generated by the instrument used in this study, and illustrates the two basic steps in quantitative kPCR using the threshold method, i.e. the selection of a fluorescent threshold from which C_{t} values are generated (Fig. (Fig.1A),1A), followed by linear regression analysis of a Log(N_{0}) versus C_{t} plot, from which E_{S} and N_{t} are estimated (Fig. (Fig.11B).

**A**) Plot

**...**

The major consideration for F_{t} selection is that it falls within the exponential phase of the amplification reaction, best illustrated by plotting log fluorescence versus cycle number (Fig. (Fig.1A).1A). As long as F_{t} is within this log-linear region, the absolute value of F_{t} was found to have only a modest impact on the slope-derived estimate of amplification efficiency (data not shown). However, as outlined above, F_{t} does have a direct impact on both C_{t} and N_{t} such that F_{t} must be fixed if data from multiple runs are to be directly compared.

To evaluate the reproducibility and quantitative variation of the threshold method, five replicate standard curves were generated from two pairs of nested primers (K3/K2 and K1/K2, see Materials and Methods for additional details) using a DNA standard dilution series covering six magnitudes of target DNA concentration. The use of nested primers allowed two different-sized amplicons (218 and 102 bp, respectively) to be amplified side-by-side within the same run, using the same DNA standard dilution series. Intra- and inter-run variation could then be examined for each of the two amplicons, free of errors caused by variations in the DNA standard. Using an identical F_{t} for all runs, the average C_{t} of four replicate amplifications for each DNA concentration were used in the analysis (Table (Table1).1). A spreadsheet containing the individual C_{t} values and the calculations used for their analysis is provided as Supplementary Material.

### Intra- and inter-run variation in C_{t}

As an initial step for evaluation of quantitative precision, the reproducibility of amplification under our experimental conditions was estimated, based upon the standard deviation in C_{t} values generated from replicate amplifications. Moreover, due to the exponential scale of C_{t}, the impact of its variation can be difficult to assess, and thus the standard deviations in C_{t} were also used to estimate the variation in percent molecules based upon the equation:

±%Molecules = [(E + 1)^{SD} – 1] ×100%**8**

where SD is the standard deviation in C_{t} generated from replicate amplifications.

Overall the standard deviation in C_{t} of replicate amplifications ranged from 0.036 to 0.367 cycles, with an average of 0.183 cycles (Table (Table11 and Supplementary Material). This corresponds to an estimated variation in molecules that ranges from ±2.3 to ±26.6% with an average of ±12.4%, using an amplification efficiency of 90% taken from the slope-based estimate of amplification efficiency determined below.

Based upon the average standard deviation produced from each individual run, estimates of the intra-run variation were similar for both amplicons, ranging from ±9.6% to 14.9% of molecules (runs 1–5, Table Table1).1). When combined with inter-run variation, this increased to ±17.4 and ±21.3% of molecules for each amplicon, respectively, based upon averaging the standard deviation in C_{t} for each DNA concentration from all runs (‘Combined’, Table Table1).1). These variations, although significant, indicate that C_{t} values have an acceptable level of reproduciblity over the six magnitudes of target DNA concentration that were examined.

### Standard curve construction and evaluation

Evaluation of the quantitative variation between replicate standard curves was conducted by generating N_{t} and %E_{S} values for each amplification run listed in Table Table1.1. This was done by exporting the C_{t} values into a spreadsheet, and calculating the slope and intercept for each run using linear regression analysis of log(N_{0}) versus C_{t}. Two methods were then used to assess the quantitative variation between the five replicate standard curves constructed for each of the two amplicons (Table (Table22).

Examination of the absolute values of N_{t} and %E_{S} revealed similar trends for both amplicons, with an inter-curve variation in %E_{S} of ±2.2 and ±2.1%, and variation in N_{t} of ±19.0 and ±14.7%, respectively, as based upon their standard deviations (Table (Table2).2). Taken individually, the magnitude of variation in %E_{S} and N_{t} suggests the resulting variation in N_{0} determination could be large. For example, for a C_{t} of 25 cycles, a ±2.2% variance in the estimate of amplification efficiency would produce an approximate ±33% variation in N_{0} that, when combined with the apparent ±19% variance in N_{t}, could produce an overall variation of about ±52% for N_{0}. It must be noted, however, that further examination suggests that these estimates of variance are most certainly erroneous, due to an apparent intra-curve correlation between slope and intercept.

Comparing the %E_{S} and N_{t} values generated from each individual standard curve reveals that for both amplicons, the curve that produced the highest %E_{S} also produced the highest N_{t} (Table (Table2,2, K1/K2, run 2 and K3/K2, run 1). Similarly, the standard curves producing the lowest %E_{S} also had the lowest N_{t} (Table (Table2,2, K1/K2, run 1 and K3/K2, run 5). Taken together, these trends suggest that variations in intercept and slope are not solely caused by inter-run variation in instrumentation and/or amplification, but also reflect an innate characteristic of linear regression in which variations in slope can be compensated for to some degree by a corresponding variation in intercept.

This can be best illustrated through an alternative approach to evaluating quantitative differences produced by replicate standard curves. As illustrated in Table Table2,2, inter-curve variation can be estimated by comparing the calculated N_{0} for a series of simulated C_{t} values using equation **3**. Thus, for the five standard curves constructed from the K1/K2 amplicon, the calculated N_{0} for C_{t}=10 cycles ranges from 3.67 × 10^{7} to 4.64 × 10^{7} molecules, with an average of 4.13 × 10^{7} molecules and a standard deviation corresponding to ±8.7% of molecules (Table (Table2).2). Furthermore, a general increase in variation is observed with increasing C_{t} such that for C_{t} = 30 cycles, a variation of ±18.1% of molecules is produced. Very similar results were produced by the larger K3/K2 amplicon (Table (Table22).

Overall, this analysis demonstrates that quantitative variations produced by replicate standard curves can be relatively small, ranging in this study from a low of about ±6% to a high of about ±21% depending on the number of cycles needed to reach threshold. The observed inter-curve variation in the absolute values of slope and intercept also suggests that curve-based estimates of amplification efficiency and N_{t} require a larger data set than would normally be used for construction of a single standard curve. Indeed, the relative accuracy of the N_{t} estimates for each of the two amplicons can be tested through the correlation of their respective M_{t} values, as described by equation **7**. Based upon the N_{t} values derived from each respective ‘combined’ data set, the estimated M_{t} values differ by 7.3% (Table (Table2).2). This provides support for both the optical precision of the instrumentation and similarity in the SYBER® Green I fluorescent characteristics of these two amplicons.

## DISCUSSION

Despite the extensive use of the threshold method for absolute quantification, there exists a paucity of studies that have examined the utility of the underlying mathematics. Furthermore, the general simplicity and widespread use of standard curves has led to the automation of quantitative determinations, which can obscure the mathematical principles upon which the analysis is based. Familiarity with the fundamentals of PCR mathematics cannot only yield important insights, but as well provide a foundation from which to address some of the major limitations of quantitative kPCR.

At the most basic level, the threshold method does not generally provide an effective indication of quantitative precision or accuracy. Although the standard deviation in C_{t} produced from replicate amplifications can provide an estimate of reproducibility, there is a general deficiency in reporting the errors associated with standard curve construction. This makes it difficult to evaluate the effectiveness of any specific quantitative determination, or of comparing results produced by different studies. As was demonstrated in this study, a basic assessment of standard curve construction can be conducted, if it is understood that slope and intercept are directly correlated to amplification efficiency and the number of amplicon molecules at threshold (N_{t}), respectively.

In this study, comparison of replicate standard curves revealed potentially large inter-curve variations, based upon the absolute values of slope and intercept. This initially led to the conclusion that this was caused by substantial inter-run variations in amplification and/or instrumentation. However, upon closer examination, an intra-curve correlation between slope and intercept became apparent, such that differences in slope are compensated for to a significant degree by corresponding differences in intercept.

This can be demonstrated through simple mathematical modeling, in which the initial number of target molecules (N_{0}) is calculated for a series of simulated C_{t} values (equation **3**, Table Table2).2). This showed that despite the differences in the absolute values of amplification efficiency and N_{t}, the resulting N_{0} values generated by each standard curve were unexpectedly similar. Based upon this analysis the application of a single, well-constructed standard curve could provide an estimated precision of ±6–21% of molecules, depending on the number of cycles required to reach threshold.

Notwithstanding the interrelationship of slope and intercept, it must be stressed that the mathematics of PCR dictate that amplification efficiency and N_{t} are independent entities. In reality N_{t} is determined solely by the fluorescent threshold (F_{t}), and as such its value is independent of the parameters impacting PCR amplification. Indeed, this interrelationship between N_{t} and F_{t} has important practical implications, based on the principle that F_{t} does not directly reflect the number of amplicon molecules, but rather DNA mass at fluorescent threshold (M_{t}). This in turn dictates that M_{t} could be used to predict N_{t} for any amplicon of known size, if it is assumed that amplicon size and base composition do not significantly impact DNA fluorescence. Support for the validity of this assumption was provided by the N_{t} estimates generated from the two amplicons used in this study, for which the predicted M_{t} values differ by 7.3% (equation **7**, Table Table22).

The practical significance of this becomes apparent if it is noted that N_{t} is the sole determinant of scale (equation **3**), the accuracy of which is dependent on the quantitative accuracy of the DNA standard used for standard curve construction. If, however, M_{t} can be used to predict N_{t} with sufficient precision, a common quantitative scale could be applied to all amplicons. In addition to circumventing the necessity of preparing a quantified DNA standard for each individual amplicon, the major source of variation in quantitative scale would become the optical precision of the instrument. Equally significant is that absolute quantification would be simplified, requiring only determination of amplification efficiency once M_{t} has been established.

## ACKNOWLEDGEMENTS

The authors thank Richard Hamelin, Krystyna Klimaszewska and Brian Boyle for helpful comments, and Pamela Cheers for editorial assistance. This research was supported by a grant from the National Biotechnology Strategy of Canada.

## REFERENCES

**Oxford University Press**

## Formats:

- Article |
- PubReader |
- ePub (beta) |
- PDF (103K)

- [Quantitative PCR in the diagnosis of Leishmania].[Parassitologia. 2004]
*Mortarino M, Franceschi A, Mancianti F, Bazzocchi C, Genchi C, Bandi C.**Parassitologia. 2004 Jun; 46(1-2):163-7.* - A kinetic-based sigmoidal model for the polymerase chain reaction and its application to high-capacity absolute quantitative real-time PCR.[BMC Biotechnol. 2008]
*Rutledge RG, Stewart D.**BMC Biotechnol. 2008 May 8; 8:47. Epub 2008 May 8.* - Result variation and efficiency kinetics in real-time PCR.[Acta Med Iran. 2010]
*Shahsiah R, Abdollahi A, Azmoudeh Ardalan F, Haghi-Ashtiani MT, Jahanzad I, Nassiri Toosi M.**Acta Med Iran. 2010 Sep-Oct; 48(5):279-82.* - Absolute estimation of initial concentrations of amplicon in a real-time RT-PCR process.[BMC Bioinformatics. 2007]
*Smith MV, Miller CR, Kohn M, Walker NJ, Portier CJ.**BMC Bioinformatics. 2007 Oct 23; 8:409. Epub 2007 Oct 23.* - The real-time polymerase chain reaction.[Mol Aspects Med. 2006]
*Kubista M, Andrade JM, Bengtsson M, Forootan A, Jonák J, Lind K, Sindelka R, Sjöback R, Sjögreen B, Strömbom L, et al.**Mol Aspects Med. 2006 Apr-Jun; 27(2-3):95-125. Epub 2006 Feb 3.*

- Analysis of qPCR reference gene stability determination methods and a practical approach for efficiency calculation on a turbot (Scophthalmus maximus) gonad dataset[BMC Genomics. ]
*Robledo D, Hernández-Urcera J, Cal RM, Pardo BG, Sánchez L, Martínez P, Viñas A.**BMC Genomics. 15(1)648* - Pten Regulates Development and Lactation in the Mammary Glands of Dairy Cows[PLoS ONE. ]
*Wang Z, Hou X, Qu B, Wang J, Gao X, Li Q.**PLoS ONE. 9(7)e102118* - Selection of Reference Genes for Quantitative Real Time PCR (qPCR) Assays in Tissue from Human Ascending Aorta[PLoS ONE. ]
*Rueda-Martínez C, Lamas O, Mataró MJ, Robledo-Carmona J, Sánchez-Espín G, Jiménez-Navarro M, Such-Martínez M, Fernández B.**PLoS ONE. 9(5)e97449* - An Extended ?CT-Method Facilitating Normalisation with Multiple Reference Genes Suited for Quantitative RT-PCR Analyses of Human Hepatocyte-Like Cells[PLoS ONE. ]
*Riedel G, Rüdrich U, Fekete-Drimusz N, Manns MP, Vondran FW, Bock M.**PLoS ONE. 9(3)e93031* - Evaluation of Candidate Reference Genes for Real-Time Quantitative PCR of Plant Samples Using Purified cDNA as Template[Plant Molecular Biology Reporter / Ispmb. 2...]
*Phillips MA, D’Auria JC, Luck K, Gershenzon J.**Plant Molecular Biology Reporter / Ispmb. 2009; 27(3)407-416*

- Mathematics of quantitative kinetic PCR and the application of standard curvesMathematics of quantitative kinetic PCR and the application of standard curvesNucleic Acids Research. Aug 15, 2003; 31(16)e93PMC

Your browsing activity is empty.

Activity recording is turned off.

See more...