- Journal List
- AAPS J
- v.15(3); 2013 Jul
- PMC3691426

Non-Arrhenius Protein Aggregation
Corresponding author.Abstract
Protein aggregation presents one of the key challenges in the development of protein biotherapeutics. It affects not only product quality but also potentially impacts safety, as protein aggregates have been shown to be linked with cytotoxicity and patient immunogenicity. Therefore, investigations of protein aggregation remain a major focus in pharmaceutical companies and academic institutions. Due to the complexity of the aggregation process and temperature-dependent conformational stability, temperature-induced protein aggregation is often non-Arrhenius over even relatively small temperature windows relevant for product development, and this makes low-temperature extrapolation difficult based simply on accelerated stability studies at high temperatures. This review discusses the non-Arrhenius nature of the temperature dependence of protein aggregation, explores possible causes, and considers inherent hurdles for accurately extrapolating aggregation rates from conventional industrial approaches for selecting accelerated conditions and from conventional or more advanced methods of analyzing the resulting rate data.
INTRODUCTION
In the late nineteenth century, Svante Arrhenius proposed the following equation based on the collision theory in gaseous state to describe the relationship of a fundamental rate coefficient with temperature for a chemical reaction:
Where k is the rate coefficient, T is the absolute temperature, Ea is the energy of activation for the reaction, R is the universal gas constant, and A is a pre-exponential factor related to steric effect and collision frequency. In this equation, the rate coefficient increases exponentially with the absolute temperature T. If the natural logarithm is taken on both sides,
Equation 2 shows a linear relationship between lnk and the inverse of absolute temperature, assuming Ea is independent of temperature. If rate coefficients are experimentally determined at different temperatures, a linear plot of lnk vs. 1/T can be obtained, and Ea and A can be regressed as the slope and intercept. The rate coefficient at any particular temperature can be then be calculated, provided the linear behavior is preserved when one considers extrapolation to temperatures outside the temperature range of the original experiment.
With development of the transition state theory, the Eyring equation was proposed to describe the relationship for the temperature dependence of a rate coefficient in a form that is mathematically similar to the Arrhenius equation:
Where k is the reaction rate coefficient, kB is Boltzmann’s constant (R = NAkB, with NA denoting Avogadro’s number), h is Planck’s constant, and ΔG* is the Gibbs free energy of activation. If one takes the natural log of Eq. 3, and considers the derivative of lnk with respect to 1/T, a form that is effectively the same as Eq. 2 is recovered, as one can typically neglect the temperature dependence due the factor of lnT when dealing with the relatively small range of temperatures (on an absolute scale) that are relevant for real-time and accelerated pharmaceutical product stability. In this case, Ea from Eq. 2 is replaced with ΔH‡, the activation enthalpy for the reaction, which follows from the Gibbs-Helmholtz relationship for ΔG/T with respect to 1/T (1).
Notably, the above relations were developed for relatively simple reactions, and the transition state is that for a single rate-limiting, elementary molecular step within a given mechanism (1). In either case, Ea or ΔH‡ are treated as being independent of temperature over the temperature window of interest, and thus a plot of lnk vs. 1/T (i.e., an Arrhenius plot) results in a straight line. In the remainder of this review, this will be termed Arrhenius behavior while non-Arrhenius will denote systems in which significantly nonlinear behavior is observed in an Arrhenius plot.
The significance of the Arrhenius equation for pharmaceutical companies is its ability to predict shelf lives of drug products based on short-term, accelerated storage stability studies at elevated temperatures. Such predictions can significantly shorten the development time, and products can be put on the market faster. Therefore, the Arrhenius equation has been (implicitly or explicitly) used widely for rapid and accurate assessment of stability of certain types of pharmaceutical dosage forms through accelerated aging studies (2).
The Arrhenius equation, or its mathematical form, has been able to describe the temperature-dependent behavior of many chemical reactions in both liquid and solid states. Chemical reactions (i.e., those involving making/breaking covalent bonds) for even large and complex protein molecules have been shown to follow this relationship reasonably well. Examples include oxidation of several Met residues in recombinant human interleukin-1 receptor antagonist between 5 and 45 C (3), and several Met residues in recombinant human granulocyte colony-stimulating factor between 4 and 45 C (4), Asn deamidation in recombinant human interleukin-15 (IL-15) between 6 and 40 C (5), hydrolysis of a Fc fusion protein at Ser-Met and Asp-Glu between 15 and 40 C (6), and even multiple degradations in oxytocin from pH 2–9 between 40 and 80 C (7).
Some physical reactions of protein molecules (i.e., without changes in covalent bonds) were also shown to follow the Arrhenius or Eyring relationships, perhaps most notably the process of protein unfolding when it occurs as a cooperative step with a net rate coefficient ku (8). Examples include unfolding of lysozyme between 5 and 45 C (9), RNase A between 45 and 63 C (10), α-chymotrypsinogen (aCgn) between 26 and 38 C (11), chitinase F1 between 42 and 55 C (12), several porcine PYY molecules (helical hairpin; 36 aa) between 25 and 65 C (13), β-galactosidase between 45 and 55 C (14,15), and ovalbumin between 68.5 and 80 C (16). The values of Ea or ΔH‡ one obtains for the unfolding rate coefficient are large—typically of the order of 102 kcal/mol— and are of similar magnitude to the enthalpy of unfolding. To place this in context, the largest activation energy or enthalpy values one often observes for chemical reactions (e.g., deamidation) are of the order of only a few tens of kilocalories per mole (2).
Interestingly, different types of non-Arrhenius behaviors have been observed in a number of cases. A relatively simple case is one in which lnk vs. 1/T is composed of two straight-line segments that cross each other at a specific overlap temperature. Examples include temperature-dependent concanavalin A-dextran precipitin reaction at pH 7.4 between 8.6 and 37 C (overlap point at 18 C) (17) and loss of enzymatic activity of adenylosuccinate synthetase between 20 and 85 C (overlap point at 40 C) (18). The causes for these behaviors are often either a change in reaction mechanisms (e.g., the first example above) or a change in the rate determining step (second example above).
In other cases, non-Arrhenius lnk vs. 1/T clearly follow a continuous, nonlinear functional form. These types of curves can be simply divided into two major categories—concave up and concave down, as shown in Fig. 1. If high-temperature data are used to predict the rate of degradation at a lower temperature assuming a linear relationship, the rate can be under- or overestimated, respectively, for concave-up or concave-down types of behavior. Examples of each type have been reported in the literature. Concave-down behavior was demonstrated for: refolding of hen egg lysozyme between 2 and 50 C (19), refolding of trypsin inhibitor 2 between 10 and 80 C (20), and unfolding of several mutants of λ-repressor fragment (λ6–85) in different temperature ranges (21). Concave-up behavior was demonstrated for de-excitation of Trp (triplet state) in LADH and α-crystallin between 4 and 60 C (22) (two activation energies), and unfolding of mutant T4 lysozyme in 3 M GdnHCl at pH 5 between −10 and 18 C (23). Protein aggregation often displays concave-up behavior (see the following section).

An extreme form of the non-Arrhenius behavior is an increase in the observed rate coefficient with an increase in inverse temperature. This type of curve would have a positive slope, leading to an apparent negative activation energy. This can be an extension of the concave-down type at high temperature end or the concave-up type at the low temperature end (dotted lines in Fig. 1). It was found that the observed folding rate coefficient for chymotrypsin inhibitor 2 shows a concave-down increase with increasing temperature from 25 to 50 C but decreased with increasing temperature above 50 C (24). Similar results were also observed for thermal unfolding of hen egg white lysozyme, where the observed folding rate increased from 5 to 40 C but decreased above 40 C (19), and for a RNase A mutant, where the observed unfolding rate coefficient decreased with increasing temperature in the low temperature range between 45 and 63 C (10). A rare case of nonlinearity is the zig–zag relationship for the inactivation of α-chymotrypsin between 45 and 85 C (25). This behavior was due to the presence of two stable native forms—one at a low temperature and one at a high temperature.
There are many possible causes for nonlinear Arrhenius behavior. Some include temperature-dependent changes in reaction conditions (such as phase, pH, and relative humidity or water content), reaction mechanisms, or Arrhenius equation parameters (2). Changing slopes could mean a gradual change in one or more of these aspects. The concave behavior for the refolding of trypsin inhibitor 2 is due to the activation enthalpy being temperature dependent—that is, the heat capacity difference between the “ground” state and transition state is significantly non-zero, at least putatively due to their differences in structure and solvent exposure (20).
The generalization of this observation is that an observed or effective rate coefficient k may account for a single elementary (chemical) reaction or may be a convolution of multiple steps. If any of the steps that contribute significantly to k involve a large difference in heat capacity (Δcp), then the effective Ea or ΔH‡ for k will change with temperature and non-Arrhenius behavior will be observed over a sufficiently large temperature range. Of course, if Δcp or the temperature range of interest is not large then for practical purposes k may be approximately Arrhenius (8,26). As discussed in more detail below, one way of phrasing the problem of extrapolating nonlinear behavior is that one seeks a way to “linearize” k in some simple, yet reliable, manner that would not require one to have more than the high-temperature k values—i.e., this would allow a true prediction of low-T values of k, rather than needing to have the low-T data in order correct for the non-Arrhenius behavior.
PROTEIN AGGREGATION AND ITS TEMPERATURE DEPENDENCE
Proteins have a high tendency to aggregate via a variety of mechanisms/pathways. The major pathways are shown in Fig. 2 and can be roughly divided into: (1) aggregation through unfolding intermediates and/or unfolded states; (2a) aggregation through protein self-association (colloidal interactions) in native states; (2b) aggregation upon chemical degradation directly linking protein monomers; and (3) aggregation upon chemical degradation indirectly changing the aggregation tendency of the protein. The arrows in Fig. 2 often constitute multiple steps along a given pathway and are shown as single arrows only to give simple graphical representation. Also, path 2a (native aggregation) is typically reversible, and as such requires additional steps such as structural rearrangement and/or misfolding to provide the net irreversible non-native protein–protein contacts that are more typical of the aggregates relevant to the remainder of the discussion and analysis here.
It appears that a majority of (non-native) protein aggregation pathways involve formation of an aggregation nucleus and hence, is nucleation dependent (27). However, for therapeutic proteins, the nucleus may be as small as a dimer, and the kinetics of nucleation do not need to include a “lag” phase such as observed in phase transitions (28). The process from N (or I/D) to A in Fig. 2 can be considered as the nucleation step, where A is the smallest stable (net irreversible) aggregate species. All the aggregation pathways noted earlier may lead to formation of a nucleus. The steps leading up to the nucleus are often reversible, as nucleation is a rate-limiting step in such cases. That is, nucleation is by definition the creation of the smallest stable (net irreversible) aggregate. Self-association and/or unfolding steps prior to nucleation are therefore reversible, although they are not necessarily in equilibrium (29). The next stage after nucleation is growth in aggregate size. Several terms have often been used interchangeably in the literature to describe the growth stage(s), including aggregation, elongation, fibrillation, polymerization, and condensation. More precise definitions to delineate these mechanisms have been suggested (8,30) but are beyond the scope of this report.
Nucleation-dependent protein aggregation originating from unfolded or partially unfolded intermediates has been incorporated mathematically into a number of models (for a more extensive review, see e.g., (29,31)). Except in cases where an empirical or heuristic approach is taken, most if not all of these models highlight that there are multiple steps that contribute to a measured rate or rate coefficient for aggregation. One example studied in detail by one of us is the Lumry–Eyring nucleated polymerization (LENP) model (32–34). The final working equation for the observed rate coefficient of monomer loss (kobs) incorporates multiple steps in the aggregation pathway—including protein conformational change, reversible self-association prior to nucleation, nucleation, and growth via different pathways. Aggregation of several proteins appear to follow the LENP model reasonably well. These include the non-native aggregation of α-chymotrypsinogen (aCgn) and anti-streptavidin IgG1 over a range of pH and (NaCl) conditions (11,30) (Kim et al., under review), a model IgG1 at pH 4.5 and 5.5 (35,36), and gamma-D crystallin (37). This type of aggregation can be unfolding limited, association limited, or nucleation limited (8). Depending on the rate-limiting step in this process, aggregation (monomer loss) can display first-, second-, or higher-order kinetics when monitoring monomer concentration over multiple half lives. In addition, the apparent reaction order of aggregation may vary with protein concentrations (36), and this may occur because of change in the identity of key intermediates or pre-nuclei (11).
Arguably, the most widely studied and reported example of nucleation-dependent protein/peptide aggregation is the fibrillation of β-amyloid forming peptides. The behavior of amyloid-type aggregation covers many peptides and small proteins. Their aggregation kinetics are often reported in terms of a sigmoidal curve, characterized by a so-called “lag phase,” a growth phase and a plateau phase (38,39). This type of observation is typical when one employs assays such as turbidity (optical density) or dyes that are specific to only certain aggregate states. In this case, the “lag phase” is a misnomer, as it can include a number of steps where aggregates form and rearrange, but are undetected simply due to limitations of the experimental assay in question. When one employs more sensitive techniques such as laser and neutron and/or X-ray scattering (40,41), it appears that the peptides/proteins form intermediate oligomeric aggregates before they reorganize and grow into much larger or structurally ordered species such as fibrils or macroscopically detectable aggregates (39,42).
Depending on the solution conditions, lag phases have been reported to range from a few minutes to several days. However, the relevance of true lag phases for aggregation in the context of pharmaceutical protein shelf life may be questionable once one accounts for limitations of a given assay, and because the time scales of observation in this context are extremely long (∼105 to 108 s) compared with time scales for molecular events such as (un)folding or misfolding, and molecular diffusion, unless one considers extremely small concentrations or sample volumes. In the latter scenario, one must then also observe stochastic kinetics, meaning that a set of identical samples will display aggregation rates that vary widely (factors of ten or more), and that such variations cannot be attributed to differences in the sample preparation, container defects, or other inherent sources of variability within product manufacture (29).
Elevated temperature is commonly used to probe protein conformational stability as well as to help accelerate aggregation. While increasing temperature does increase diffusion rates, this may be a relatively weak effect compared with the observed temperature dependence of kobs. With the possible exception of very high concentration protein solutions (∼100–300 g/L), the diffusion rate scales as RT/ηRh, where Rh is the hydrodynamic radius of the protein, and η is the solvent viscosity (other symbols defined previously). Rh depends weakly on T, while (on an absolute scale) T changes by less than 30% for the most extreme range one might practically select for testing with accelerated and real-time storage conditions. Therefore the primary source of changes in diffusion rates with temperature is the exponential dependence of η on temperature (η ∼ exp(−Ea,η/RT)). Using the viscosity of water as an example, the effective activation energy is Ea, η≈4 kcal/mol. Recall from above that “large” activation energy values for covalent reactions in water are of the order of 10–20 kcal/mol, and that protein aggregation Ea values are often ∼100 kcal/mol or higher. As such, it is difficult to attribute the temperature dependence of protein aggregation to that for bulk diffusion in solution.
Of course, for solid-state protein stability, or possibly for diffusion of proteins in near-glassy films at solid–liquid or air-liquid interfaces, diffusion may be sufficiently slow to be rate-limiting and more temperature sensitive. In addition, at high protein concentrations the interactions between proteins become important, and viscosity rises (albeit still remaining many orders of magnitude below that for solid or near-glassy systems). It remains an open question as to the magnitude of the temperature dependence of diffusion, and its relevance to aggregation rates, when high-concentration protein solutions are considered.
More generally, changing temperature can have an effect on each of the multiple aggregation steps involved, and thus the effective or observed activation energy (or enthalpy) is a combination of the temperature dependence of each of those steps. If the step is fast compared with kobs then it equilibrates quickly (i.e., it “pre-equilibrates”) relative to the progression of aggregation. In this case, its temperature dependence is described by the equilibrium enthalpy change (ΔH) of that process, rather than its activation energy. Because we are considering lnkobs when referring to activation energies, the overall or observed Ea is therefore a sum of the activation energy value for the rate-limiting step, and the ΔH values of the pre-equilibrated step(s) that contribute to kobs.
If one only considers a relatively small range of temperature, or if none of the steps that contribute to kobs have an Ea or ΔH that changes significantly with T, then one will observe Arrhenius behavior (to within statistical/experimental uncertainty in lnkobs) values. There are a number of published cases where protein aggregation rates fit reasonably well to the Arrhenius equation. Examples include: nucleation rates for fibrillogenesis of β-Amyloid peptide (Aβ) between 29 and 45 C (38), elongation rates for fibrillogenesis of Aβ between 4 and 40 C (43), and between 28 and 68 C (42), fibrillation (covering both elongation and thickening, and association of fibrils into bundles or clusters) of human insulin at low pH and between 50 and 70 C, and between 60 and 80 C (44,45) both nucleation and elongation rates for the (agitation-induced) fibrillation of HET-s(218–289) between 20 and 50 C (46), elongation rates for fibrillation of α-synuclein, a natively unfolded protein, between 27 and 57 C (47), and fibril nucleation rates of N47A α-spectrin SH3 domain between 47 and 55 C (48). Notably, many of these examples are proteins or polypeptides that either are natively unfolded or are unable to adopt folded monomer structures under the solution conditions of interest. Based on the discussion below regarding the effects of conformational changes on the Arrhenius or non-Arrhenius behavior of kobs, it is perhaps not surprising that the above examples show Arrhenius behavior.
For large and folded proteins, a small number of Arrhenius-like examples exist regarding solution-phase aggregation. For example, the aggregation growth rate coefficient of a human IgG follows a linear Arrhenius relationship in a narrow temperature range between 47 and 62 C at several concentrations from 1.9 to 15.4 mg/mL (49), and Yoshioka et al. (15) observed reasonably Arrhenius behavior for beta-galactosidase aggregation. However, in the former case the rate coefficient was established based on the growth of hydrodynamic radius of the aggregates. Care must be taken when using changes in aggregate size as a relevant measure of kobs in the present context, as aggregates may have higher fractal dimension, and may grow via mechanisms that do not involve monomers. In the latter case, the rates were considered over a rather limited temperature range, and that can obscure non-Arrhenius behavior (see below).
In the solid state, protein aggregation appears more often to follow the Arrhenius relationship (50,51). It was found that the rate of aggregation for a mAb in lyophilized formulations at several different moisture levels followed Arrhenius kinetics between 5 and 50 C in the glassy state (well below the effective glass transition temperature (Tg)) (52). Also, it was found that the initial rate for tetramer formation for β-lactoglobulin showed an Arrhenius temperature dependence between 60 and 100 C in pure lyophililized protein or protein-trehalose mixture formulations (53).
In the liquid state examples above, the estimated activation energies for protein aggregation seem to vary significantly. The activation energy for insulin fibrillation between 50 and 70 C was estimated to be 25 kcal/mol (45), similar to the Ea values for nucleation (17.9 kcal/mol) and elongation (20.1 kcal/mol) during fibrillation of α-synuclein (47), as well as the fibrillogenesis of Aβ (23 kcal/mol) between 4 and 40 C (43), and the nucleation (Ea = 14.4–17.0 kcal/mol) of the HET-s(218–289) between 20 and 50 C (46). Under different agitation rates, Ea for elongation of fibrils of the HET-s(218–289) at physiological pH between 20 and 50 C was only 3.3–4.3 kcal/mol (46), which is likely a diffusion-controlled process (<<10 kcal/mol). In contrast, the nucleation activation energy for fibrillogenesis of Aβ is 74.4 kcal/mol between 29 and 45 C (38). In the solid state, the Ea values for aggregation of a lyophilized mAb were 9.9 and 10.3 kcal/mol for samples containing 2% and 5% moisture, respectively, between 5 and 50 C (52), but 23 kcal/mol for the initial rate for tetramer formation for lyophilized β-lactoglobulin between 60 and 100 C (53).
These examples show a large variation in quantitative temperature dependence for protein aggregation both in liquid and solid states. In comparison, the activation energy for the oxidation of several Met residues was 9.2–13.2 kcal/mol in recombinant human interleukin-1 receptor antagonist (3), Ea ∼9.1 to 18.8 kcal/mol for Met oxidation in recombinant human granulocyte colony-stimulating factor (4). While Ea for Asn deamidation was 13.3 kcal/mol in a model peptide (54) and 22.9 kcal/mol in IL-15 (5). The much larger variation in activation energy for aggregation when compared with chemical degradation indicates the complexity of the aggregation process and potential source(s) of its temperature dependence.
The large Ea found for aggregation in the liquid state suggests a large temperature dependence and possible contributions from multiple aggregations steps—including unfolding, growth and/or nucleation. For example, most the examples above with low Ea values (∼30 kcal/mol or lower) are for natively unfolded proteins or polypeptides. In contrast, the effective Ea value for aggregation of the four-helix bundle protein bG-CSF is well over 100 kcal/mol over a range of temperatures (55), Ea for the alpha-beta protein aCgn is over 200 kcal/mol under accelerated (high T) conditions below its midpoint unfolding temperature (11), and Ea for an IgG1 is between 100 and 200 kcal/mol under typical formulation pH and salt conditions (56).
As noted above, it is more often than not that aggregation of folded proteins does not follow the Arrhenius relationship. The degree of non-linearity may vary, depending on the solution conditions (pH, salts, etc.) (26). In most cases, upward curvature is observed for temperature-dependent protein aggregation, as seen for different types of monoclonal antibodies (26,57,58). Upward curvature for lnkobsvs. 1/T leads to an overestimation of Ea values when extrapolated to low temperatures, and thus to an overestimation of the shelf life that will be gained by cooling to low-temperature conditions (26). In practical terms, this type of nonlinearity makes shelf life prediction much more difficult and less reliable based on high-temperature stability data.
POSSIBLE REASONS FOR NONLINEAR PROTEIN AGGREGATION
The temperature-dependent non-linearity in protein aggregation can be attributed to several possible factors. These factors primarily include: (1) an intrinsically temperature-dependent contribution to Ea (e.g., a ΔH that depends on temperature) or (2) a competition between different pathways such that there is a change in pathway or rate-limiting step as temperature changes. The latter was already discussed above, and some specific examples of (1) are discussed next.
Temperature-Dependent Change in Protein Stability
For aggregation in which unfolding is not rate limiting, but some degree of unfolding is required within the pathway, lnkobs will be a sum of at least two factors. One is that for the rate-limiting step (e.g., that for diffusion or for nucleating an irreversible species from an otherwise reversible oligomer). If this is diffusion then this will typically be a pseudo-Arrhenius process unless one is approaching a glass transition for the system. If this is nucleation, then it is unclear whether one should expect an intrinsically non-Arrhenius contribution to kobs, as theories regarding the nucleation event itself, at a molecular level, are not well developed in general for non-native protein aggregation.
The other factor is a free energy for reversible conformational change or (partial) unfolding, ΔGun. This is a pre-equilibrated step and so should be thought of as creating a reactive intermediate in the process, rather than creating a transition state. This ultimately gives a mathematical form that is analogous to that for the Eyring equation (Eq. 3) but now with an extra term for lnkobs, proportional to ΔGun/RT. As with the Eyring equation, when one takes the derivative of this term with respect to 1/T it gives a factor of ΔHun (the equilibrium unfolding enthalpy) in the sum of terms that make up Ea for aggregation. ΔHun is temperature dependent because the heat capacity change upon unfolding (Δcp, un) is large and positive for protein folding. Therefore the overall Ea for aggregation will have a term that is proportional to the product of Δcp, un and T. If we denote the range of temperatures over which aggregation is tested as ΔT, then one should see non-Arrhenius behavior for kobs if the product of Δcp, un and ΔT is of similar magnitude to the average value of ΔHun over this range ΔT (8). This same effect can be illustrated by considering the temperature-dependent protein conformational stability, expressed by integrating the Gibbs–Helmholtz equation with a constant Δcp, un (59,60),
where ΔHun, 0 is the unfolding enthalpy evaluated at the midpoint unfolding temperature (Tm or T0). The graphical relationship between ΔGun and T (protein conformational stability curve) is shown in Fig. 3 (61–63). The stability curve shows three characteristic temperatures—that for maximal stability (Tms), cold denaturation (Tcd) and heat denaturation (Thd or Tm). For mesophilic globular proteins, the predicted or measured Tms values typically range from −25 to +35 C (64). Other analyses show that the majority of reversible two-state proteins are maximally stable around room temperature with an average of 20 ± 8 C at or near neutral pH, irrespective of their structural properties, the melting temperature, or the living temperatures of their source organisms (63). This is in agreement with a recent suggestion that the maximal stability of a typical protein is around 17 C with ∆Gun about 7.7 kcal/mol (65). Note that the maximum Kun is not where ΔGun = 0, but rather where ΔHun = 0. In practice, this temperature differs from Tms by a few degrees (61), but does not affect the discussion below.

The particular shape of the stability curve in Fig. 3 is the result of the effect of temperature on multiple aspects of the protein structure, manifest as a large Δcp, un. Physically, a large difference in heat capacity for unfolding is attributed to the exposure of primarily hydrophobic side chains from the interior of folded domain(s) of the protein (66,67). Therefore it appears reasonable to argue that the hydrophobic effect, manifesting via the temperature dependence of protein unfolding, is a strong contribution to the non-Arrhenius behavior of aggregation rates. Note that simple models of protein aggregation often pose that aggregation is due primarily to hydrophobic attractions, however additional factors are clearly involved when one considers the importance of complementary steric packing and van der Waals forces, as well backbone–backbone hydrogen bonding (27,68,69). Here, the analysis simply is focused on why a large non-Arrhenius effect is seen, rather than assuming whether hydrophobic interactions are the dominant factor in folding or aggregation.
The above analysis is also consistent with the earlier examples based on natively unfolded proteins and other polypeptides that do not have a well-defined folded monomer state, and thus are either relatively hydrophilic or too short to fold effectively. In either case, they do not appear to require an unfolding step to enable aggregation, and therefore are lacking both the large contribution from ΔHun (∼102 kcal/mol near Tm) as well as the large value of the product Δcp, unΔT.
Temperature-Dependent Change in Aggregation Mechanisms or Solution Conditions
Temperature can enhance the rate of chemical degradation. Degraded proteins may have altered protein structure and stability for protein aggregation. It was shown that methionine oxidation in a human IgG1 Fc altered secondary and tertiary structure, reduced melting temperature (Tm) of the C(H)2 domain, and led to enhanced aggregation at 45 C (70). The free energy change between the folded and unfolded states of lysozyme decreased by the accumulation of chemical reactions (isomerization of Asp-Gly, deamidation of Asn, racemization of Asp and Asn, and cleavage of the Asp-X-peptide bond) induced at high temperature (71). Fragmentation can generate smaller proteins/peptides of different sizes. It has been shown that the length of simple polypeptides can change the aggregation rate—e.g., as occurs for polyalanine (72). Indirectly, temperature-induced hydrolysis of sucrose, a commonly used protein excipient, led to formation of protein glycation products, increasing the rate of protein aggregation (73).
It is well known that protein aggregation tendency or mechanism in solution is strongly influenced by pH. Changing pH can significantly alter the mechanism of aggregation and size/solubility of protein aggregates such as IgG1 (36). In a relatively narrow pH range, IgG1 aggregates were shown to grow primarily by monomer addition and remained small and soluble at pH 3.5 but grew by both chain polymerization and condensation and become large and insoluble at pH 4.5 and higher pH (36). Changing temperature also changes the pH of certain formulation buffers due to temperature-dependent changes in the degree of ionization (74). Chemical degradation could lead to formation of basic or acidic species, altering the solution pH, such as deamidation of asparagines (75).
POSSIBLE SOLUTIONS FOR NON-ARRHENIUS AGGREGATION KINETICS
In the development of protein biotherapeutics, accelerated conditions are often used to facilitate the assessment of protein aggregation tendencies and to attempt to predict the low-temperature aggregation behavior based on an Arrhenius-like relationship. If the actual relationship between lnk and 1/T is not linear, one may elect to consider alternatives for more accurate predictions.
As mentioned above, one type of non-linear Arrhenius curve is composed of two linear segments. The entire curve can be fit with two linear equations with two activation energies. It is noted that the overlap or transition temperature may not be easily discernable by visual inspection, especially when the number of data points is limited. For this purpose, a program has been developed for use to identify the ideal breaking point (76). Nevertheless, in accelerated stability studies, only a limited number of temperatures are tested, due to limited materials and/or resources. Therefore, it is not practical to use this method to predict low-temperature aggregation kinetics.
If the rate–temperature curves are slightly curved upward, the Erying equation or the modified Arrhenius equation could be used to describe the non-linear relationship: (77)
In this case, one more temperature term Tn with a floating n is included to the equation in fitting the nonlinear data and for extrapolating the rate coefficients (78). Addition of this extra term could improve the linearity of the curve but significant non-linear data would not fit well. The additional parameter n is purely empirical, and as such is just a an attempt to “linearize” the data, assuming that one has sufficient data that show such nonlinear behavior to be able to obtain a reliable estimate of the value of n (see also, discussion below).
For significantly nonlinear lnk vs.1/T behavior, one option is to use a mechanism-based model for the change in Ea as a function of T that explicitly includes non-Arrhenius effects such as the effects of Δcp, un. Such a nonlinear function has been included in the analysis of the LENP model, and gives a predicted concave-up lnk vs. 1/T curve based on separately measured values of Δcp, un and the values of lnk at high temperature (but well below Tm). In this scenario, one measures lnk under accelerated conditions where Ea does not have a discernable dependence on T, and then lnk is extrapolated to lower T where it is non-Arrhenius. All of the non-Arrhenius behavior is therefore presumed to result from the shifts in unfolding equilibrium with temperature (26,34). In prior work, this approach was successful to predict low-temperature rates from just accelerated data (i.e., without knowledge of the low-temperature rates) (34), but more recent work highlights that it can be difficult to directly determine Δcp, un because aggregation occurs during the measurement of unfolding (e.g., in calorimetry or other thermal scanning measurements) (36).
Alternatively, the Vogel–Fulcher–Tammann (VFT) equation could also be used to “straighten” the non-linear curve: (57)
where T0 is an adjustable reference temperature. Addition of this parameter can significantly linearize the data (see below) but for this approach to work a sufficiently large temperature range must be used, such that the non-Arrhenius behavior is already apparent.
Figure 4 reproduces two literature examples where lnk (from monomer loss data) vs. 1/T show significant non-Arrhenius behavior. One is for a single-domain four-helix bundle protein, bovine granulocyte-colony stimulating factor (bG-CSF) (55). The other is for a monoclonal antibody (MAb) (57). The main panel shows the data over the full set of temperatures reported, which include both accelerated (high-T) and real-time conditions (room T and below). The inset shows the data over just the high-T conditions that correspond to temperature ranges that are typical of industrial practice for choosing accelerated conditions—i.e., 40 C plus and additional few temperature values: for bG-CSF this is 30 to 40 C, while for the MAb it is 40 to 60 C. Notably, the set of four accelerated temperature conditions in the inset for either data set illustrate that non-Arrhenius behavior (i.e., significant curvature vs. 1/T) is not apparent. That is, one must consider a wider range of temperature and/or have additional data points in order to observe statistically significant deviations from linearity (see also discussion below).

Rather than conventionally fitting high-T data to obtain a nonlinear extrapolation, Kayser et al. (57) identified an alternative approach guided by the mathematical form of the VFT equation. They observed that by choosing a T0 value that lies at high temperature (T > Tm) their otherwise non-Arrhenius rate data for MAb aggregation would display a linear dependence if plotted on VFT temperature axis—i.e., when lnk is plotted vs. 1/(T − T0), rather than plotted vs. 1/T. Figure 5a illustrates this behavior using the same MAb data as in Fig. 4. In each panel, the data are not regressed, but rather are simply replotted using 1,000/(T − T0) on the X-axis, with a different value of T0 for each panel. The case of T0 = 0 is of course equivalent to an Arrhenius plot of the data such as shown in Fig. 4.
Interestingly, if one selects T0 lower than the experimental range of the kobs data (i.e., 5 C in the present case) then the data show upward curvature. If one instead selects T0 above the experimental range of kobs then the data show downward curvature. As T0 approaches the range of T values for the kobs data, the curvature becomes increasingly pronounced. If one chooses a T0 value within the range of the data, then there is an unphysical “break” in the data trend due to the mathematical artifact of the switch in curvature that this transform of the temperature axis causes. As such, one must select T0 such that is either far above or far below the experimental data range if one is to employ the VFT-inspired equation to linearize the data. Physically, this is not surprising since mechanistic models that display VFT behavior identify T0 as the point at which the configurational entropy of the system vanishes (e.g., as a liquid vitrifies as it is cooled extremely rapidly). In those systems, T0 is always below the experimental range, as it is physically unrealistic for the system to exist at T < T0.
Figure 5b (inset) quantifies the deviation from linearity as a function of the choice of T0 for this data set. It shows the value of the R2 goodness-of-fit parameter for regressing a straight line to lnkobs plotted vs. 1/(T − T0). The values that are close to 1 are reasonable choices for T0 if one seeks to linearize non-Arrhenius data in this manner. The inset shows the results for fits where one uses the full set of temperatures from Fig. 4. Clearly, one must avoid T0 values that lie close to the experimental range of temperatures, for the reasons noted above. Choosing high values of T0 gives the most linear results, as noted by Kayser et al. (57), and as illustrated in Fig. 5a by the T0 = 368 K example. However, choosing T0 too high induces artificial downward curvature (e.g., T0 = 400 K or 450 K in Fig. 5a). Figure 5b shows that there is a broad maximum in R2vs. T0 if one considers high T0 values, and so it could be that one could choose a T0 within this region somewhat arbitrarily and still obtain useful results.
However, one must also consider that this example employed data over the full set of temperatures, including many at much lower and higher temperatures than conventionally tested in formal accelerated stability programs in industrial laboratories. The main panel of Fig. 5b shows the analogous results if one uses only the four high-T values of kobs from the inset to Fig. 4. Again, one must avoid choosing T0 within the experimental data range (gray-shaded regions in Fig. 5b), but otherwise essentially any T0 value gives equivalent results, and thus one could easily select a T0 value that is inappropriate for linearizing the data for lower temperatures. This follows naturally because with only a small number of data points, over too small of a temperature range, the data will not show sufficiently non-Arrhenius behavior to allow any non-Arrhenius model to regress its additional parameter(s). Any approach based on regression or adjustment of all key model parameters using just the values of k vs. temperature will encounter this issue. As noted above in the context of the LENP approach, this is not an issue if there is an alternative way to estimate the parameter(s) that describe the non-Arrhenius behavior, but making such estimates may involve different difficulties (56).
Can a simpler linearization method be created for improved rate predictions, and one that can be done without have to introduce a third adjustable parameter (as in Eq. 6), or a third parameter that may be difficult to obtain from independent experiments (as in the LENP model)? A simple approach that at first glance does not involve more than two adjustable or fitted parameters is to try to empirically “linearize” the lnk data by a mathematical transform such as taking an additional logarithm, i.e., consider ln(lnkobs) vs. 1/T based on the following proposed relationship:
Unless one has a fundamental (e.g., mechanistic) reason for choosing a particular transform, this is a purely empirical approach to help eliminate curvature in lnkobs without requiring a third parameter that introduces non-linearity in the model. However, choosing a particular transform could introduce bias or hidden additional arbitrary parameters, as will be illustrated below.
To illustrate this point, consider the simple example of attempting to “linearize” the data from Fig. 4 by taking the logarithm of lnk. Figure 6 (insert) shows the k data for the same monoclonal antibody (57) as in Fig. 4, but now plotted as ln(lnk) vs. 1/T. Inspection of the data shown as squares in Fig. 6 gives the impression that the transformed rate data is indeed more linear. However, in order to be able to take this second logarithm, the k data had to be scaled to assure none of the values of lnk were below zero—i.e., mathematically, k must be above 1 in order to be able to take its logarithm more than once. As such, what is plotted in Fig. 6 is ln[ln(kY)], with Y a scaling constant. By judicious choice of units for k, one may make Y = 1, but ultimately Y is a new, semi-arbitrary constant or fitting parameter. In order for this transform of k to be reliable in practice, the choice for Y (or for the units of k) must not affect the resulting prediction when extrapolating from high T to lower T.
The other data sets (different symbols) in Fig. 6 show the effect of changing Y on the resulting behavior of ln[ln(kY)] vs. 1/T. Clearly, the choice of Y has a pronounced effect on the degree of linearity of the transformed k data at low T. But if one inspects only the high-T regime, any choice of Y seems to produce reasonably linear behavior. Therefore the choice of Y is effectively arbitrary when one has only the high-T data available (as would occur during product development). This illustrates that even simple transformations of k data suffer from significant uncertainties when one seeks to extrapolate inherently non-linear data if one does not know the inherent form of the non-linearity. Thus, transforming either the k or the T values involves similar issues when one seeks to extrapolate rather than interpolate.
Returning briefly to the question of the number of accelerated (high-T) temperature points that are used in common practice in industrial laboratories, one must realize that for linear behavior one needs two parameters (e.g., intercept and slope) to mathematically describe the behavior—and these are the fitted Ea (slope) and A (intercept) values. If one has nonlinear Arrhenius behavior, then one needs at least three (and possibly more) parameters to have any rational chance of extrapolating accurately to lower temperatures. In the above examples, the “third parameter” was, respectively, Δcp, un, n, T0, or Y. If one is to fit any three-parameter model to experimental data, then there will be statistical uncertainty or variability in that data.
Therefore, in practice it is unrealistic to use only three temperature values if one is to expect reliable predictions from such a fit. Rather, one will necessarily need significantly more conditions if one is to reliably predict low-temperature behavior from just the high-temperature kinetics, if there is significantly non-Arrhenius behavior. Purely mathematically, if one is fit three parameters then a bare minimum of four temperatures must be used if there is to be any statistical relevance to the fitted parameters, and even then the uncertainties in the statistical parameters will be so large as to make them of little use. Given that the data themselves (e.g., the kobs values) will also have statistical uncertainty, even four temperatures are not realistically enough to obtain reliable results. Secondly, one must consider that it is not just the number of temperatures that is essential; it is also the range of temperatures. As shown above, if one is to fit kobsvs. T or 1/T, then the data must extend over a large enough range of temperature that non-Arrhenius behavior is apparent within the uncertainty of the rate data. It is not possible to give a general “rule” as to how many temperatures this will require, as that will depend on the protein and underlying mechanism of the non-Arrhenius behavior. In the examples above, on the order of five to ten different temperatures, spanning a range of at least ∼20°C, were needed to achieve this.
Pragmatically, the two simplest solutions to this problem are: (1) to obtain sufficient data points and/or supplemental data from complementary experiments to capture the non-linear behavior under accelerated conditions to make a reliable low-T prediction more practical; (2) develop improved means to accurately and quickly obtain aggregation rates at conditions that do not require one to extrapolate so far in terms of the extrapolating variable (in this case, temperature). The former is illustrated by each of the approaches presented earlier in this section, while the latter has been discussed in some detail elsewhere by one of us (8,56). In either case, our argument is that current practice regarding experimental design in the industry may mimic too closely what is done for small-molecule shelf life prediction. Doing so will inherently limit, or even preclude one from accurately accounting for, and accurately predicting, non-Arrhenius behavior for protein aggregation rates.
SUMMARY
Protein aggregation is a complex, multi-step process. Nonlinear or non-Arrhenius aggregation rates are most typically observed for proteins that exist primarily in folded states in their monomeric form, although it also occurs when there is a change in rate-limiting step (e.g., via chemical modifications) with a change in temperature. In the former case, temperature influences protein conformational stability effectively “parabolically,” and this is a natural consequence of the large heat capacity changes upon unfolding and exposure of (primarily) hydrophobic residues. By analogy, the conformational changes within dimers or larger oligomers that are needed to stabilize aggregates may play a role in non-Arrhenius aggregation rates if they are involved in the rate-limiting step(s) for aggregation. Similar arguments apply for aggregation occurring in bulk solution and at interfaces. Different approaches have been offered in the literature and in this report to enable more accurate extrapolation of accelerated (high temperature) protein aggregation rates to lower temperature, but each has its limitations. Truly quantitative prediction of low-temperature aggregation rates, based on only accelerated data, remains an outstanding challenge. However, the factors and approaches reviewed here may prove useful in at least semi-quantitative extrapolation to low temperature based on accelerated stability studies. However, doing so may require researchers to incorporate significantly more temperatures and/or a wider range of temperature for accelerated stability programs than is the current the norm in their organization.
Acknowledgments
The authors thank B. Trout, V. Kayser, V. Voynov, and N. Chennamsetty for their suggestions and careful reading of this review in manuscript form. It is a pleasure to submit this review as part of the issue honoring Professor Garnet Peck.
Contributor Information
Wei Wang, Phone: +1-636-2472111, Email: moc.rezifp@gnaw.2.iew.
Christopher J. Roberts, Phone: +1-302-8310838, Fax: +1-302-8311048, Email: ude.ledu@rjc.


