![]() | ![]() |
Formats:
|
|||||||||||||||||||||||||||||
Copyright : © 2005 Wearing et al. This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. Appropriate Models for the Management of Infectious Diseases 1Institute of Ecology, University of Georgia, Athens, Georgia, United States of America, 2Department of Biological Sciences, University of Warwick, Coventry, United Kingdom Stephen P. Ellner, Academic Editor Cornell University, United States of America *To whom correspondence should be addressed. E-mail: hjw/at/uga.edu Competing Interests: The authors have declared that no competing interests exist. Author Contributions: PR conceived the idea. HJW, PR, and MJK designed the study. HJW formulated and analyzed the models. PR, HJW, and MJK contributed to writing the paper. Received December 3, 2004; Accepted April 27, 2005. This article has been corrected. See PLoS Med. 2005 August 30; 2(8): e320. See "Appropriate Modeling of Infectious Diseases" , e239. This article has been cited by other articles in PMC.Abstract Background Mathematical models have become invaluable management tools for epidemiologists, both shedding light on the mechanisms underlying observed dynamics as well as making quantitative predictions on the effectiveness of different control measures. Here, we explain how substantial biases are introduced by two important, yet largely ignored, assumptions at the core of the vast majority of such models. Methods and Findings First, we use analytical methods to show that (i) ignoring the latent period or (ii) making the common assumption of exponentially distributed latent and infectious periods (when including the latent period) always results in underestimating the basic reproductive ratio of an infection from outbreak data. We then proceed to illustrate these points by fitting epidemic models to data from an influenza outbreak. Finally, we document how such unrealistic a priori assumptions concerning model structure give rise to systematically overoptimistic predictions on the outcome of potential management options. Conclusion This work aims to highlight that, when developing models for public health use, we need to pay careful attention to the intrinsic assumptions embedded within classical frameworks. Introduction The past decade has seen a dramatic increase in the significance attached to infectious diseases from the public health perspective. This trend is due in part to the emergence of new and highly pathogenic infections such as Ebola [1], West Nile virus [2], and SARS [3]. There are also well-publicized concerns surrounding the deliberate introduction of pathogens as bioterrorism weapons [4,5], and the continued persistence and resurgence of older infections, several of which now boast strains resistant to more than one drug [6]. In addition, there have been a number of high-profile and economically expensive disease outbreaks in domestic livestock [7–9] as well as wildlife populations [10]. The effective management and control of such infections is increasingly done with substantial input from mathematical models, which are used not only to provide information on the nature of the infection itself, through estimates of key parameters such as the basic reproductive ratio R0 [11], but also to make predictions about the likely outcome of alternative courses of action [12–15]. During the 2001 outbreak of foot-and-mouth disease in the United Kingdom, for example, the former UK Ministry of Agriculture, Fisheries, and Food set up a committee that included two groups with expertise in mathematical modeling of disease dynamics. It is becoming increasingly important, therefore, that epidemiological models produce accurate quantitative predictions, and this in turn relies on accurate parameterization. Here, we examine the dynamical consequences of an unrealistic yet almost ubiquitous assumption embedded in such models concerning the distribution of the latent and infectious periods. In particular, we show that without greater care in model formulation, we may risk systematic biases when parameterizing models from data and may make overly optimistic policy recommendations. The most commonly used framework for epidemiological systems, is still the susceptible–infectious–recovered (SIR) class of models, in which the host population is categorized according to infection status as either susceptible, infectious, or recovered [16,17]. Subsequent refinements of the model have incorporated an additional exposed (infected but not yet infectious) class (susceptible–exposed–infectious–recovered [SEIR] models) (see Protocol S1 for mathematical equations). One of the fundamental mathematical assumptions in such models is that the rate of leaving the exposed or infectious class is constant, irrespective of the period already spent in that class. While mathematically very convenient, this assumption gives rise to exponentially distributed latent and infectious periods, which is epidemiologically unrealistic for most infections [18–20]. A more sensible formulation would be to specify the probability of leaving a class as a function of the time spent within the class, such that initially the chance of leaving the class is small, but the probability increases as the mean infectious/latent period is reached. This would give rise to a more realistic distribution of latent and infectious periods, with a stronger central tendency. A convenient way to describe such distributions is to write an expression for the infectious class (neglecting the latent class for this example) as follows:
which translates mathematically into
where β is the infection transmission rate and N is the total population size. The probability of remaining infectious through time is governed by the survivorship function, P(t), and as such the average infectious period, denoted by 1/γ, is given by [21]. The probability density function of the infectious period, p(t), is just the negative derivative of the survivorship function, −dP(t)/dt. Different functional forms for p(t) give rise to alternative assumptions concerning the distribution of the infectious period in the model. For example, setting p(t) equal to e−γt/γ recovers the classical exponentially distributed SIR model. More realistic distributions can be obtained by choosing p(t) to be a gamma probability density function [22–27], with parameters γ and n (see Protocol S1). An alternative (and computationally efficient) means of modeling infections with gamma distributions is to divide the infectious class into n subclasses with nγ being the rate of sequential progression through the subclasses. The advantage of this formalism is that when n = 1 we recover the exponentially distributed model, which has a large variance, while as n→∞ we obtain a fixed infectious period. The effects of n on the distribution of the infectious period are demonstrated in Figure 1
The dynamical consequences of these differences in the distribution of infectious and latent periods have received some attention over the past two decades. It has been shown, for example, that the precise distribution of the infectious period has no qualitative effects on the asymptotic values or properties of the system [21,24], though perturbations to the endemic equilibrium take longer to die out as n increases [22,26]. When contact rates vary seasonally, for example, to mimic the aggregation of children in schools [28,29], changes in p(t) are known to have important consequences for the persistence likelihood of infections [25,26,30]. An issue that has received surprisingly little attention, despite its obvious applicability to emerging infections and possible “deliberate exposure,” is the influence of latent and infectious period distributions on the invasion dynamics of an infection into a largely susceptible population. This is in contrast to the conceptually similar situation of within-host dynamics of viral disease, such as HIV, for which some models already adopt realistic distributions to describe stages in the cell life cycle [31,32]. In particular, Lloyd [33] has shown how parameter estimates made from viral load data are affected by different assumptions about these distributions. Here we are interested in the application of this work to between-host transmission dynamics. As can be seen in Figure 1 Whether these marked differences between alternate model formulations may translate into potentially important public health concerns is a key question, which we address in two ways. First, we document systematic differences in the model parameters estimated from an epidemic using the exponential and gamma-distributed models. Second, we demonstrate that the use of exponential models produces overoptimistic predictions about the low levels of control required to subdue an epidemic. Methods The Relationship between R0 and Initial Epidemic Growth Rate During the early phase of an epidemic, the observed exponential growth rate, λ, is related to the basic reproductive ratio, R0, of the infection. Mathematically, λ is just the dominant eigenvalue of the disease-free equilibrium, and one can show that λ must satisfy
for the SEIR model with gamma-distributed latent and infectious periods (further details are given in Protocol S1). This equation translates into an expression for R0 in terms of λ and the other parameters, as is presented in equation 4. Therefore, if we can estimate the growth rate λ from data, we can use equation 4 to obtain an estimate of R0. Contact Tracing and Isolation To study the effects of contact tracing and isolation, we modify the assumptions of the SEIR epidemic model, while still incorporating gamma-distributed latent and infectious periods. In the new model, isolation of newly infectious cases occurs at a daily rate of dI after a delay of τD days, which represents a period when infected individuals are infectious but asymptomatic or undetectable (IA). A fraction q of those who had contact with an infectious and symptomatic individual (IS) (but did not contract the infection) are removed to the quarantined susceptible class, SQ, where they spend exactly τQ days. An identical fraction of newly exposed individuals is also quarantined. Full details of the model equations are given in Protocol S1. Results In a typical management setting, such as the SARS outbreak of 2003, public health professionals are confronted with a novel (or perhaps a highly virulent variant of an existing) pathogen that is spreading rapidly through a predominantly susceptible population. One of the important tasks of any modeling exercise is to provide insights into some of the epidemiological characteristics of the invading infection, such as its transmissibility, virulence, and persistence dynamics. Of great interest is the estimation of the basic reproductive ratio of the infection (R0), which measures the transmission potential of the infection, and determines the degree of control required. Some of these aspects can be explored by studying the range of model parameters that give (initial) outbreak dynamics consistent with the (short-term) epidemic data thus far gathered. One approach is to fit model parameters to data by “trajectory matching,” where one looks for the combination of parameters that, in a statistically rigorous sense, give rise to dynamics most consistent with observed patterns [34,35]. Alternatively, one may use the well-established result that during the initial stages of an epidemic, the growth rate is approximately exponential [17,20], with the rate determined by R0. First, we use this approach to examine, in general, how the distribution of the latent and infectious periods may affect the estimation of R0 from initial epidemic data. To illustrate that our results are not specific to this methodology, we then take incidence data from an influenza outbreak and parameterize the epidemic models using trajectory matching. Estimating R0 from Initial Epidemic Growth Rate We may obtain an estimate for R0 by calculating the initial growth rate (λ) of an infection from data and equating it to the growth rate of the equations, calculated from the dominant eigenvalue of the disease-free equilibrium (see Methods). Such an exercise reveals that for any observed λ, the precise value of R0 estimated is crucially dependent on the fundamental assumptions made concerning the distributions of latent and infectious periods. Specifically, we find that the following equation determines the relationship between R0 and an empirically estimated epidemic growth rate, λ:
where m and n represent the number of subclasses in the exposed and infectious categories, respectively. The mean latent and infectious periods are represented by 1/σ and 1/γ, respectively, and are assumed to be known or estimated from independent data. This relationship was first determined by Anderson and Watson [36] and has recently been applied in the context of viral life cycle dynamics by Lloyd [33]. The relationship between estimated R0 and the distributions of the latent and infectious periods is demonstrated in Figure 2
Estimating R0 from Trajectory Matching While the results described in the previous section are based on the rate of epidemic take-off, we reach the same qualitative conclusions about the effects of the distributions of latent and infectious periods when estimating R0 by other data-fitting methods. For illustration, we use data from an influenza outbreak in an English boarding school [37] to estimate model parameters by trajectory matching. In the absence of independent data, this method can be used to provide estimates of the key infectious parameters. Of course, here we can also compare the parameter estimates to observed parameter ranges, since the influenza virus is known to have a latent period of between 1 and 4 d and infected individuals may transmit the virus up to 4 or 5 d after the onset of illness [38]. We determine the best fit of the model output to daily incidence data by minimizing the least squares errors for different values of the distribution parameters m and n. For comparison, we also determine the best-fit parameters in the absence of any latent period. The least squares errors and estimated R0 of the best fit for a combination of m and n are presented in Figure 3
Despite visually similar solutions, the SIR best-fit and SEIR best-fit models (Figure 3 Management Consequences The results outlined above highlight the pitfalls of making a priori assumptions concerning the distributions of latent and infectious periods when estimating parameters. Depending on the precise details, inappropriate model selection may give rise to either gross over- or underestimates for the basic reproductive ratio of an infection. However, even when parameter estimates are reliable, choice of model structure can also be very important when making recommendations concerning individual-level control strategies. Historically, it has been shown that contact tracing and the effective quarantine of infected individuals and those potentially exposed is an important means of infection management [13,39,40]. We introduce both these measures into the SEIR epidemic model, assuming that there is a small delay in detecting newly infectious individuals, which may represent an asymptomatic phase or uncertainty in diagnosing symptoms (see Methods and Protocol S1). As we show in Figure 4
The process of isolating infected individuals results in a reduction in the mean infectious period (see Protocol S1). It is much more effective when the infectious period is exponentially distributed because it essentially truncates the tail of the distribution, so that the infectious period of a few individuals is dramatically reduced. This effect is not as pronounced in the gamma-distributed models because there is less variation in the infectious periods (see Figure 1 To focus on the effects of the infectious period distribution on different courses of intervention we have assumed that all those who are quarantined and exposed are detected before the end of the quarantine period and are not released back into the general population. We have also formulated a model that takes into account the distribution of the latent period during quarantine and find similar qualitative results to those shown in Figure 4 Discussion The use of models in epidemiology dates back almost a century, and while traditional models have often been highly successful in explaining observed dynamics [17,20,28,29,41], our results show that within a strict management setting, epidemiological details can make a crucial difference. Although a body of theoretical work [25,26,30] has demonstrated the importance of incorporating realistic distributions of latent and infectious periods into models of endemic disease, few studies have considered the effects associated with making predictions for an emerging disease [42]. The large discrepancies between estimates of R0 from the exponentially distributed and gamma-distributed fits reiterate the importance of accurately determining the precise distributions of latent and infectious periods. Although the data required for such a task are often available from post hoc analyses of epidemics they are certainly lacking for a novel emerging infection. Instead, the uncertainty surrounding assumptions about the distributions should be incorporated into quantitative predictions made from epidemiological models, especially since this may well be greater than any uncertainty that arises from noise in the data. Of course, more sophisticated fitting methods than those used in this paper exist [43–46], but if the underlying structure of the model is inappropriate, the method of parameterization is largely irrelevant. The take home message from our work is that when developing models for public health use, we need to pay careful attention to the intrinsic assumptions embedded within classical frameworks. While some practitioners are already using the approach we advocate [3,15,34,39,47], the vast majority of applied epidemiological studies still use models that incorporate exponentially distributed latent and infectious periods. Perhaps this work points to the next steps in delivering quantitatively accurate epidemiological models. Protocol S1: Further Details and Analysis of the Mathematical Models (7 KB TEX). Click here for additional data file.(7.4K, tex)
Acknowledgments We are grateful to Andy Dobson for comments that helped to improve the manuscript. HJW and PR are supported by the National Institutes of Health, and MJK is supported by the Royal Society. PR would also like to thank the Ellison Medical Foundation for funding. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript. Abbreviations
Footnotes Citation: Wearing HJ, Rohani P, Keeling MJ (2005) Appropriate models for the management of infectious diseases. PLoS Med 2(7): e174. References
|
PubMed related articles
Your browsing activity is empty. Activity recording is turned off. |
||||||||||||||||||||||||||||
Lancet. 2003 Mar 22; 361(9362):1020.
[Lancet. 2003]Science. 2000 Jan 21; 287(5452):443-9.
[Science. 2000]Lancet. 2003 May 24; 361(9371):1761-6.
[Lancet. 2003]Nature. 2001 Dec 13; 414(6865):748-51.
[Nature. 2001]Science. 2002 Nov 15; 298(5597):1428-32.
[Science. 2002]Acta Biotheor. 2002; 50(3):189-204.
[Acta Biotheor. 2002]Science. 2001 Oct 26; 294(5543):813-7.
[Science. 2001]Proc Natl Acad Sci U S A. 2004 Apr 20; 101(16):6146-51.
[Proc Natl Acad Sci U S A. 2004]Am J Hyg. 1950 May; 51(3):310-8.
[Am J Hyg. 1950]J Math Biol. 1980 Mar; 9(1):37-47.
[J Math Biol. 1980]Theor Popul Biol. 1980 Oct; 18(2):204-43.
[Theor Popul Biol. 1980]Am Nat. 2004 Oct; 164(4):543-58.
[Am Nat. 2004]J Math Biol. 1980 Mar; 9(1):37-47.
[J Math Biol. 1980]IMA J Math Appl Med Biol. 1988; 5(1):1-19.
[IMA J Math Appl Med Biol. 1988]Theor Popul Biol. 1980 Oct; 18(2):204-43.
[Theor Popul Biol. 1980]Am J Epidemiol. 1973 Dec; 98(6):453-68.
[Am J Epidemiol. 1973]IMA J Math Appl Med Biol. 1984; 1(2):169-91.
[IMA J Math Appl Med Biol. 1984]Science. 2003 Jun 20; 300(5627):1961-6.
[Science. 2003]J Theor Biol. 2004 Jul 7; 229(1):119-26.
[J Theor Biol. 2004]Science. 2003 Jun 20; 300(5627):1966-70.
[Science. 2003]Science. 2001 May 11; 292(5519):1155-60.
[Science. 2001]Proc Natl Acad Sci U S A. 2002 Aug 6; 99(16):10935-40.
[Proc Natl Acad Sci U S A. 2002]Am J Epidemiol. 1973 Dec; 98(6):453-68.
[Am J Epidemiol. 1973]IMA J Math Appl Med Biol. 1984; 1(2):169-91.
[IMA J Math Appl Med Biol. 1984]Science. 1999 Oct 29; 286(5441):968-71.
[Science. 1999]Science. 1997 Jan 3; 275(5296):65-7.
[Science. 1997]Theor Popul Biol. 2001 Aug; 60(1):59-71.
[Theor Popul Biol. 2001]Lancet. 2003 May 24; 361(9371):1761-6.
[Lancet. 2003]Proc Natl Acad Sci U S A. 2004 Apr 20; 101(16):6146-51.
[Proc Natl Acad Sci U S A. 2004]Science. 2003 Jun 20; 300(5627):1961-6.
[Science. 2003]Science. 2001 May 11; 292(5519):1155-60.
[Science. 2001]