![]() | ![]() |
Formats:
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Copyright © 2008 Goovaerts and Gebreab; licensee BioMed Central Ltd. How does Poisson kriging compare to the popular BYM model for mapping disease risks? 1BioMedware, Inc., Ann Arbor, MI, USA 2Utah State University, Department of Watershed Sciences, Logan, UT, USA Corresponding author.Pierre Goovaerts: goovaerts.pierre/at/gmail.com ; Samson Gebreab: samson/at/gis.usu.edu Received January 8, 2008; Accepted February 4, 2008. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. This article has been cited by other articles in PMC.Abstract Background Geostatistical techniques are now available to account for spatially varying population sizes and spatial patterns in the mapping of disease rates. At first glance, Poisson kriging represents an attractive alternative to increasingly popular Bayesian spatial models in that: 1) it is easier to implement and less CPU intensive, and 2) it accounts for the size and shape of geographical units, avoiding the limitations of conditional auto-regressive (CAR) models commonly used in Bayesian algorithms while allowing for the creation of isopleth risk maps. Both approaches, however, have never been compared in simulation studies, and there is a need to better understand their merits in terms of accuracy and precision of disease risk estimates. Results Besag, York and Mollie's (BYM) model and Poisson kriging (point and area-to-area implementations) were applied to age-adjusted lung and cervix cancer mortality rates recorded for white females in two contrasted county geographies: 1) state of Indiana that consists of 92 counties of fairly similar size and shape, and 2) four states in the Western US (Arizona, California, Nevada and Utah) forming a set of 118 counties that are vastly different geographical units. The spatial support (i.e. point versus area) has a much smaller impact on the results than the statistical methodology (i.e. geostatistical versus Bayesian models). Differences between methods are particularly pronounced in the Western US dataset: BYM model yields smoother risk surface and prediction variance that changes mainly as a function of the predicted risk, while the Poisson kriging variance increases in large sparsely populated counties. Simulation studies showed that the geostatistical approach yields smaller prediction errors, more precise and accurate probability intervals, and allows a better discrimination between counties with high and low mortality risks. The benefit of area-to-area Poisson kriging increases as the county geography becomes more heterogeneous and when data beyond the adjacent counties are used in the estimation. The trade-off cost for the easier implementation of point Poisson kriging is slightly larger kriging variances, which reduces the precision of the model of uncertainty. Conclusion Bayesian spatial models are increasingly used by public health officials to map mortality risk from observed rates, a preliminary step towards the identification of areas of excess. More attention should however be paid to the spatial and distributional assumptions underlying the popular BYM model. Poisson kriging offers more flexibility in modeling the spatial structure of the risk and generates less smoothing, reducing the likelihood of missing areas of high risk. Background Cancer mortality maps are used by public health officials to identify areas of excess and to guide surveillance and control activities. Interpretation of those maps is frequently hampered by the presence of noise caused by unreliable extreme rates computed from sparsely populated geographical units or for diseases with a low frequency of occurrence [1]. Ignoring the uncertainty attached to rate estimates can lead to misallocation of resources to investigate unreliable clusters of high risk, while areas of real concern might go undetected. Over the years, statistical techniques of increasing complexity, usually involving Bayesian models, have been developed to improve the reliability of risk estimates by borrowing information from neighboring entities [2-6]. The implementation of these models in software such as WinBUGS [7] has fostered their adoption by both the statisticians and epidemiologists. Yet, the estimation of model parameters requires iterative procedures, such as Markov Chain Monte Carlo (MCMC) methods, that are computer intensive and require fine-tuning, which makes their application and interpretation challenging for non-statisticians [8-10]. This might explain why the majority of applications nowadays still rely on one of the early models introduced by Besag, York and Mollie in 1991 [5] which is commonly referred to as BYM model. Quoting the 2005 comparison paper by Best et al. [2]"a (non exhaustive) search of the major epidemiological journals over the past decade indicates that the BYM model appears to be the only fully Bayesian spatial model to have been used in published applications of disease mapping outside of the statistical literature". This model is overwhelmingly used with the conditional auto-regressive (CAR) model for defining the random effect associated with spatial autocorrelation [2,9,11]. The arbitrary neighborhood relationship underlying the CAR model is computationally convenient but is not well-suited to situations where geographical entities have different sizes and shapes and are not arranged in a regular pattern [12]. The choice of a neighborhood weight structure (i.e. adjacency-based versus distance-based spatial weights) is also very subjective and seldom justified despite its substantial impact on the results [13]. Last, simulation studies have demonstrated the strong smoothing effect of Bayesian disease-mapping models, in particular the BYM model, which limits their ability to detect localized increases in risk [14]. Geostatistics provides an alternative, yet still little known, model-based approach to disease mapping [15]. The first initiative to tailor geostatistical tools to the analysis of disease rates must be credited to Christian Lajaunie [16] from the Center of geostatistics in Fontainebleau, France. Although it was introduced the same year as the BYM model, the geostatistical approach, called binomial cokriging, went largely unnoticed. The rare applications of the method are the study of the risk of childhood cancer in the West Midlands of England [17-19] and the mapping of lung cancer mortality across the US [20]. More recently, a similar approach, called Poisson kriging, was developed in the field of marine ecology and generalized to the analysis of cancer mortality rates and cholera incidence data [21-23]. Poisson kriging was combined with stochastic simulation to generate multiple realizations of the spatial distribution of cancer mortality risk, allowing the propagation of uncertainty through the detection of cancer clusters and outliers [24]. Point Poisson kriging, which assigns each measured rate to the geographic centroid of the unit over which it has been recorded, was later replaced by Area-to-Area (ATA) Poisson kriging that accounts for the geometry of administrative units and the spatial repartition of the population at risk in semivariogram estimation and kriging [25]. Like fully Bayesian models, Poisson kriging yields the full posterior distribution of the risk. This distribution is however assumed to be Gaussian and fully characterized by the kriging estimate and variance, while the BYM posterior distribution is modeled by the empirical distribution of simulated risk values. The trade-off cost for the simplicity of Poisson kriging is that, unlike the full Bayesian approach, the uncertainty attached to the parameters of the correlation function is ignored in the analysis, which should lead to smaller prediction variances in general [26]. Only a few comparison studies of geostatistical and Bayesian estimators have been published in the literature. In the earliest study [27], marginal and conditional generalized linear models (GLM) were compared to three implementations of universal kriging: no nugget effect, unweighted nugget effect, and a nugget effect that is weighted inversely proportional to the expected counts. Despite their simplicity, the linear kriging models provided surprisingly good predictions, in particular the third implementation that acknowledges the larger uncertainty (noise) of rates computed from small populations (fewer expected counts). Recent studies extended the comparison to semivariogram estimators and kriging algorithms that were derived specifically for count data using the properties of binomial or Poisson distributions. First, the benefit of binomial or Poisson kriging over a few simple smoothers (i.e. population-weighted estimators and empirical Bayes smoothers) was assessed under different scenarios for the disease frequency, the population size, and the spatial pattern of risk [20,22]. Simulation studies showed that Poisson kriging outperforms other approaches for most scenarios, with a clear benefit when the risk values are spatially correlated. In a third study [26], Poisson kriging was compared to Diggle et al.'s "model-based kriging" [28] for mapping the relative abundance of species in the presence of spatially heterogeneous observation efforts and sparse animal sightings. Both methods gave equivalent results for 90% of the predictions; differences were observed for higher values with a smoothing for the kriging. Methods differed much more in terms of the prediction variance: the lognormal hypothesis underlying the Bayesian model induced similarities between the maps of the prediction variance and estimate (i.e. proportionality), while the Poisson kriging variance mainly reflects the observation effort (i.e. lower variance for longer observation times). Because this study was conducted on real data, the underlying risk pattern was unknown, precluding any objective assessment of prediction performances. This paper presents the first simulation-based evaluation of performance of Poisson kriging (both point and ATA implementations) and full Bayesian disease-mapping models. The benchmark for Bayesian methods is the BYM model since it is the most widely applied in the health literature. Both types of model are first illustrated using age-adjusted lung and cervix cancer mortality rates recorded for white females in two contrasted county geographies: 1) state of Indiana that consists of 92 counties of fairly similar size and shape, and 2) four states in the Western US (Arizona, California, Nevada and Utah) forming a set of 118 counties that are vastly different geographical units. Simulation studies are then conducted for the same two county geographies. Performance criteria include the magnitude of prediction errors of the underlying risk, the precision and accuracy of the model of uncertainty, and the ability to discriminate between background and high-risk areas. Methods Cancer data sets The basic properties of Poisson kriging and Bayesian model will be illustrated using directly age-adjusted mortality rates for a frequent (lung) and less frequent (cervix) cancer. These data were described in a previous study [25] and only their most salient features are summarized here. The analysis focuses on white female rates recorded over the 1970–1994 period and adjusted using the 1970 population pyramid. Two areas with contrasted county geographies were considered: 1) state of Indiana (Region 1), and 2) four states in the Western US (Arizona, California, Nevada and Utah) that will be referred to as Region 2. The choice of these two specific geography areas was guided by the need to compare performances in two contrasted settings: 1) all geographical units have a fairly similar size and shape, which is the "ideal" situation for point Poisson kriging or Bayesian methods implemented under the conditional auto-regressive (CAR) model that ignores the spatial support of the data, and 2) geographical units display a wide range of sizes and shapes, which should favour Area-to-Area Poisson kriging that implicitly accounts for the spatial support of the data in the analysis. The West coast provides a perfect example for the second type of geography (i.e. set of 118 vastly different counties), while Indiana includes a reasonable number (i.e. 92) of counties that are geometrically fairly similar. In addition to mortality rates, the population at risk is available at the county level and over a grid of 25 km2 cells discretizing each Region; see [25] for a description of these datasets. Figures Figures11
Simulated data sets An objective assessment of the performance of the geostatistical and Bayesian disease mapping techniques requires knowledge of the "true" underlying risk maps, which are unknown in practice. Simulation provides a way to generate multiple realizations of the spatial distribution of cancer mortality rates under specific scenarios for the underlying risk and population sizes. Predicted risks can then be compared to the risk maps used in the simulation. For both lung and cervix cancers, L = 100 maps of county-level mortality rates {z(l)(vα), α = 1,..., N; l = 1,..., L} were generated in a previous study [25] using a two-step procedure: 1) county-level mortality risks were computed as population-weighted averages of continuous risk maps created by sequential Gaussian simulation, and 2) the number of death within each county was simulated by random drawing from a Poisson distribution characterized by the county-level risk and population. The present comparison study used the odd-numbered realizations (i.e. realization #1,3,5,...,99) for each set. Area-to-Area (ATA) Poisson kriging The geostatistical methodology for the estimation of risk values from empirical frequencies, and the creation of isopleth maps, is described in details in Goovaerts [25]. This section provides a brief recall of the approach For a given number N of geographical units vα (e.g. counties), denote the observed mortality rates (areal data) as z(vα) = d(vα)/n(vα), where d(vα) is the number of recorded mortality cases and n(vα) is the size of the population at risk. The noise-filtered mortality rate for a given area vα, called mortality risk, is estimated as a linear combination of the kernel rate z(vα) and the rates observed in (K-1) neighboring entities vi: Unlike traditional kriging estimators, Poisson kriging is a non-exact interpolator since it does not return the observed rates: where μ(vα) is the Lagrange parameter, m* is the population-weighted mean of the N rates, and δij = 1 if vi = vj and 0 otherwise. The "error variance" term, m*/n(vi), derives directly from the Poisson distribution for the disease count variable D(vi). This term, which corresponds to the difference between the variances of the risk and rate variables (Equation 5 in [22]), leads to smaller weights for less reliable data (i.e. rates measured over smaller populations). In addition to the population size, the kriging system accounts for the spatial correlation among geographical units through the area-to-area covariance terms where Pi and Pj are the number of points used to discretize the two areas vi and vj, respectively. The weights wss' are computed as the product of population sizes within the square cells centred on the discretizing point us and us': In this study the discretizing points were identified with the nodes of a 5 km grid, yielding a total of 9 to 69 points per county in Indiana, and 11 to 2,082 discretizing points for the West Coast counties. When the geographical units differ by several orders of magnitude like in Region 2, it is not computationally efficient to use the same discretizing level for each unit. One solution is to use flexible discretizing grids that ensure a constant number of discretizing points within each unit. For example, in TerraSeer's STIS software [29] a given number of discretization points is distributed uniformly within each polygon according to a stratified random design. The point-support covariance of the risk C(h), or equivalently the point-support semivariogram γ(h), cannot be estimated directly from the observed rates, since only areal data are available. Thus, only the regularized semivariogram of the risk can be estimated as: where N(h) is the number of pairs of areas (vα, vβ) whose population-weighted centroids are separated by the vector h. The different spatial increments [z(vα)-z(vβ)]2 are weighted by a function of their respective population sizes, n(vα)n(vβ)/[n(vα)+n(vβ)], a term which is inversely proportional to their standard deviations. More importance is thus given to the more reliable data pairs (i.e. smaller standard deviations). Derivation of a point-support semivariogram from the model fitted to the experimental semivariogram The uncertainty about the cancer mortality risk prevailing within the geographical unit vα can be modeled using the conditional cumulative distribution function (ccdf) of the risk variable R(vα). Under the assumption of normality of the prediction errors, that ccdf is defined as: G(·) is the cumulative distribution function of the standard normal random variable, and σ(vα) is the square root of the kriging variance estimated as: where Point Poisson kriging Assimilating each unit vα to its population-weighted centroid uα greatly simplifies the implementation of the geostatistical methodology since it eliminates the need to discretize geographical units or perform a semivariogram deconvolution. Besides the gain in computational speed, the centroids-based (i.e. point) Poisson kriging can be accomplished using the public-domain executable poisson-kriging.exe described in Goovaerts [22]. This simplified version of Poisson kriging is included in the comparison study to: 1) quantify the potential loss in accuracy and precision resulting from such an approximation, and 2) apply Poisson kriging using the same point support as the BYM model. Bayesian model Clayton and Kaldor [34] first introduced empirical Bayesian inference for relative risks. It was later extended to a fully Bayesian setting by Besag et al. [5]. The BYM model is one of the most popular hierarchical Bayesian models. This model incorporates random effects due to unstructured and spatially structured heterogeneity into the log-linear model for the relative risk. The inclusion of these random effects allows smoothing relative risks at global and local levels. For a detailed description of this model, the readers are referred to Lawson et al. [35] and Wakefield et al. [36]. The three stages of the BYM model, as described in Lawson et al. [35], are briefly reviewed in this section. First-stage model For rare diseases such as cancers, the observed number of mortality cases (yi) in a geographic unit i is assumed to follow a Poisson distribution with mean eiθi:
where ei denotes the age-adjusted expected number of cases in the ith geographic unit, and θi is the "true" but unknown relative risk in that unit. Besag et al [5] proposed the following model for the log relative risk:
where the intercept μ represents the global mean, and the terms vi and ui are random effects modeling the unstructured and spatially structured heterogeneity, respectively. Second-stage model Within the Bayesian framework, a prior distribution for the random effects and intercept term needs to be specified. In this paper the intercept was assigned improper flat prior. The uncorrelated component vi does not depend on geographic location and is assumed to follow a normal distribution with zero mean and a common variance (precision parameter) τv2: The random effect associated with spatial autocorrelation, ui, is defined according to the conditional autoregressive (CAR) Gaussian prior model [5]: where the prior mean of each ui is defined as a weighted average of the other uj, j≠i. The weights wij define the relationship between the area i and its neighbors j. We adopted the common first order binary weighting scheme where wij=1 if areas i and j share a common border, and wij = 0 otherwise. The precision parameter τu2 controls the amount of variability for the random effect. Third-stage model A fully Bayesian model specification is completed by adding hyper-prior distributions on the precision parameters τv2 and τu2. With no prior estimation for precisions of the random effects, distributions with large variance (i.e. vague hyperpriors) are recommended. Following Kelsall and Wakefield's discussion [37] we adopted the Gamma distribution Γ(0.5, 0.0005) which yields a probability of 99% that the precision lies between 0.16 and 6635, with most of the probability concentrated towards 0 [9]. This prior choice is less informative and allows the likelihood data to dominate the prior information; hence, it will have minimum effect on the inference of relative risks. Implementation of the model As mentioned in the introduction, Bayesian inference of model parameters is not feasible through mathematically closed form. Instead, it requires computer intensive MCMC simulations, such as Gibbs sampler, which became affordable with recent increase in computational power. The BYM model for each data set (lung and cervix cancers) was fitted using WinBUGS version 1.4 [7] that implements the Markov Chain Monte Carlo (MCMC) methods [38]. The model was simulated using two independent chains starting with dispersed initial values. Convergence of the relative risks for the two chains was confirmed by graphing their traces, as well as computing the Gelman-Rubin (G-R) statistic and Monte Carlo errors (<5% of the posterior standard deviation). After a burn-in of 15,000 iterations, the following 10,000 iterations were sampled from the two chains, yielding a set of 20,000 values to approximate the posterior distribution of the relative risk for each geographical unit (i.e. county). For comparison with the Poisson kriging absolute risk estimate, each simulated relative risk value was multiplied by the population-weighted average of rates. The distributions of 20,000 sampled values were post-processed to derive summary statistics, such as mean, variance or the probability of exceeding specific risk thresholds. Results and Discussion Analysis of lung and cervix cancer data Mortality risks for lung and cervix cancers were estimated from the rates displayed at the top of Figures Figures11
Mortality risk estimates The two risk maps of Region 1 are very similar (correlation = 0.988); see Figure Figure11
Discrepancies between risk maps become more noticeable for the less frequent cervix cancer (Figure (Figure2,2
Prediction variance The Bayesian and geostatistical models differ much more in terms of prediction variance than estimated risk; see Figures Figures11 The smaller correlation between BYM and geostatistical estimates in Region 2, combined with the wide range of sizes and shapes of counties, enhances the differences between the prediction variance maps (Figure (Figure2,2 The spatial support (i.e. point versus area) has a much smaller impact on the prediction variance than the statistical methodology (i.e. geostatistical versus Bayesian models); see Table 1 (last two rows). For both regions the correlation between ATA and point kriging variances is strong: 0.986–0.989. Ignoring the spatial support of county-level data however leads, on average, to larger kriging variances: 3.275 vs 3.122 for Region 1 and 0.211 vs 0.187 for Region 2. This result is consistent with the "change-of-support" theory that predicts larger error variances for point versus areal estimation. The relative magnitude of the overestimation by point kriging is the largest for Region 2 that includes very large counties and has a wide range of county sizes and shapes. Decision-making Mapping cancer risk is a preliminary step towards further analysis that might highlight areas where causative exposures change through geographic space, the presence of local populations with distinct cancer incidences, or the impact of different cancer control methods. Figure Figure66
The local Moran test to evaluate local clustering or spatial autocorrelation [39] was applied to the map of Bayesian and geostatistical risk estimates. The use of PK instead of BYM estimates increases the size of the central cluster of high mortality risk and the Utah cluster of low risk, compare Figure Figure6A6A To depict the uncertainty attached to risk maps, several authors recommend mapping the 95th percentile range of the posterior distribution of risk values or the probability that the risk in each entity exceeds a specific threshold of interest [9,40]. This probability is readily derived from the PK estimate and variance using the analytical expression (6) or it can be computed from the empirical BYM posterior distribution. As an example, the population-weighted average of cervix cancer mortality rates (i.e. area-wide rate) was used as threshold of interest. The corresponding probability maps displayed in Figures Figures6C6C Simulation studies Figures Figures1,1 For example, Figure Figure7A7A
The Bayesian and geostatistical analysis of the simulated rate map confirms the conclusions drawn from the analysis of real cancer mortality rates in Figure Figure2.2
These maps are now compared to the reference risk map, and results for realization #50 in Region 2 are illustrated in Figures Figures88
Bias and accuracy of prediction The first two criteria are the mean error (ME) and mean absolute error (MAE) of prediction computed as: where r(vα) is the "true" simulated risk and Table 2 indicates that, on average over 50 realizations, all prediction methods are unbiased regardless of the weighting scheme or the cancer frequency. Methods differ more in terms of the mean absolute error of prediction, in particular for the least frequent cervix cancer (Table 3). The empirical Bayesian smoothers are consistently outperformed by other methods, although the average magnitude of the error never exceeds 5% of the mean rate. The benefit of Poisson kriging over BYM model is fairly systematic since it leads to smaller MAE values 100% of the time for cervix and 92% for lung cancer (see Figures Figures8A8A Smoothing effect and prediction variance The comparison of risk maps in Figures Figures1,1 In addition to a risk estimate, the BYM model and Poisson kriging provide a measure of the uncertainty attached to this estimate in the form of a prediction variance. For each interpolation method, the prediction variance was averaged over all counties, and the mean variances over all 50 realizations are listed in Table 4. As expected, the Poisson kriging variance increases when the estimation is based on fewer observations (i.e. adjacent counties versus 32 nearest counties) or ignores the support of the data (i.e. point versus ATA kriging). Yet, it remains on average 10–17% smaller than the prediction variance calculated for the BYM model. According to this criterion, the geostatistical model of uncertainty is more precise than the Bayesian one. Quality of the model of uncertainty The prediction variance does not fully capture the uncertainty attached to risk estimates, in particular for the BYM model where the entire probability distribution in the form of a set of simulated risk values is available for each geographic unit. From the distribution of 20,000 BYM simulated values we computed, for each of the counties, a series of symmetric p-probability intervals (PI) bounded by the (1-p)/2 and (1+p)/2 quantiles of that distribution. For example, the 0.5-PI is bounded by the lower and upper quartiles. Similar PIs were computed analytically from the Poisson kriging estimate and variance, under the assumption of normality of the prediction errors (Equation 6). A correct modeling of local uncertainty would entail that, for example, there is a 0.5-probability that the actual mortality risk falls into the 0.5-PI or, equivalently, that over the study area 50% of the 0.5-PI include the true risk value. These fractions are easily computed for simulation results since the true risk map is known. Following Deutsch [42], the agreement between observed, where wk = 1 if The accuracy plots in Figure Figure8C8C Not only should the true risk value fall into the PI according to the expected probability p, but this interval should be as narrow as possible to reduce the uncertainty about that value. In other words, among two probabilistic models with similar goodness statistic one would privilege the one with the smallest spread (more precise). Different measures of ccdf spread can be used: variance, interquartile range, and entropy. Following Goovaerts [43] the average width of the PIs that include the true value are plotted for a series of probabilities p. Once again, the geostatistical and Bayesian approaches yield fairly similar results for realization #50 (Figure (Figure8D).8D The geostatistical model of uncertainty for realization #50 presents the advantage of yielding PIs that are slightly narrower than the Bayesian ones, while being more likely to include the true value (i.e. larger observed fractions). This property however is not captured by the separate computation of the goodness statistic and average PI width. A more informative criterion is the proportion of probabilities p for which the geostatistical PI is narrower than the Bayesian PI (PK width < BYM width) while including a larger fraction of true values (PK fraction > BYM fraction). This situation corresponds to the case where both blue and red ratio curves in Figures 8E–F Classification accuracy By analogy with the classification conducted on observed cancer mortality rates in Figures Figures6E6E Results averaged over all 50 realizations confirm that, regardless the type of cancer or risk threshold, the BYM model causes a much larger proportion of false negatives than the geostatistical model (Figures 10C–F
Another way to compare the performances of Bayesian and geostatistical approaches, independently of the subjective choice of a probability threshold, is to compute the Receiver Operating Characteristic (ROC) curves. ROC curves plot the probability of false positive versus the probability of detection [44]. The probability of detection corresponds here to the proportion of true high-risk counties (i.e. counties with a mortality risk larger than a threshold) that are detected as the probability threshold decreases. In practice, the probability thresholds are identified with the probabilities of exceedence computed for these high-risk counties. For each of them the probability of false positive is calculated as the proportion of background counties that are wrongly declared as having elevated risks. Figure Figure9E9E The most efficient approach is the one that allows the detection of a larger fraction of high-risk counties at the expense of fewer false positives; that is the ROC curve should be as close as possible to the vertical axis. For example, Figure Figure9E9E Conclusion Since its early development for the assessment of mineral deposits, geostatistics has been used in a growing number of disciplines dealing with the analysis of data distributed in space and/or time. Its application to health data is fairly recent and, compared to the well established Bayesian modeling approach, the field of health geostatistics is still in its infancy. Nevertheless, the current geostatistical methodology offers a flexible framework to model the spatial variability of disease rates and estimate the underlying risk, taking into account not only the shape and size of geographical units, but also the spatial repartition of population within those units [15]. The approach described in this paper has been fully automated in TerraSeer's STIS software [29]: the "behind-the-scene" estimation, modeling, and deconvolution of risk semivariogram, as well as the automatic discretization of geographical units, makes Poisson kriging estimators as straightforward as empirical Bayesian smoothers while providing a measure of uncertainty in the form of the kriging variance. Furthermore, the comparison of point and area-to-area (ATA) Poisson kriging demonstrated the benefits of the geostatistical approach even under a simplified implementation whereby each geographical unit is assimilated with its population-weighted centroid. Over the years, statisticians have developed models of increasing complexity, combining fixed effects with both uncorrelated and spatially structured random effects, leading to mixed effects or hierarchical models. The rich class of Bayes models yields the full posterior distribution of the risk while accounting for the uncertainty in the parameters of the model. Implementation of these sophisticated methods however is still cumbersome and relies on strong distributional assumptions (e.g. lognormal hypothesis), while parameter estimation requires time-consuming iterative procedures. Another drawback is that the full Bayesian modeling approach is overwhelmingly used with the conditional auto-regressive (CAR) model for defining the random effect associated with spatial autocorrelation. This computationally convenient choice is reasonable if all geographical entities are of similar size and arranged in a regular pattern but it is not particularly attractive otherwise [12]. In particular, it prohibits any change of support and creation of isopleth maps of risk, an operation easily conducted within the framework of area-to-point kriging [25]. The objectives of this paper were two-fold: 1) to illustrate the major differences between Poisson kriging and the Bayesian disease-mapping approach in two contrasted county geographies, and 2) to present the first simulation-based comparison of performance of Poisson kriging and full Bayesian models. The comparison study is certainly not exhaustive and only the BYM model, among the many alternative formulations available for disease mapping [2,14], was considered. Yet, studies have demonstrated the robustness of the BYM model which remains the favorite among practitioners [2,45]. The analysis of lung and cervix cancer mortality rates highlighted the stronger smoothing effect of the BYM model: geostatistical risk estimates tend to be lower in low-valued areas, while they exceed BYM predictions in high-valued areas. The use of BYM results to guide surveillance and control activities will thus lead to declaring substantially fewer counties as having elevated risk. These results confirm previous studies that showed Bayesian disease-mapping models to be conservative [14]. The impact of the modeling procedure was the largest for the prediction variances that differ in both magnitude and spatial pattern. As expected, the prediction variance increases as the population at risk, hence the reliability of raw rates, decreases. Because it accounts for the shape and size of counties, the Poisson kriging variance also increases for larger spatial supports, i.e. counties of large extent. On the other hand, the lognormal hypothesis causes the BYM prediction variance to increase with the estimated risk. According to our simulation studies, the geostatistical approach yields smaller prediction errors, more precise and accurate probability intervals, and allows a better discrimination between counties with high and low mortality risks. The BYM model also generates smoother risk surfaces, leading to a much larger proportion of false negatives than the geostatistical model in particular as the risk threshold raises. The benefit of Poisson kriging is the largest for cervix cancer measured in Region 2 because of the combined effect of the low reliability of mortality rates and the wide range of county shapes and sizes. In general, the performance of Poisson kriging improves when data beyond the adjacent counties are used in the estimation. The smoothing effect of Poisson kriging is controlled by the number of neighbors used in the estimation (more neighbors induce more smoothing), the population size (unreliable rates computed from small populations are smoothed more), and the variogram model (larger nugget effect induces more smoothing). On the other hand, the smoothing effect of the Bayesian model depends on the choice of a neighborhood weight structure (i.e. adjacency-based versus distance-based spatial weights [13]), the population size (unreliable rates computed from small populations are smoothed more), the type of distribution for the relative risk (mean-based smoothing using the Gaussian distribution versus median-based smoothing for double-exponential distribution), and the type of spatial model for the risk surface (i.e. continuous surface for the BYM model versus discontinuous surface for the partition and spatial mixture models). Both Bayesian and kriging estimators in this paper are mean-based smoothers and use the same adjacent neighbors. Estimating the nugget effect of a semivariogram is always somewhat subjective, in particular when only areal data are available which precludes any sample information on the short-range variability. The nugget effect of the point-support semivariogram model was here set to zero to facilitate the convergence of the deconvolution procedure [33]. Relaxing this constraint increases the smoothing effect of Poisson kriging that becomes comparable to the BYM model in Region 1 (results now shown). It has however no impact on the results for Region 2, where discrepancies between the dispersion variances of kriging and BYM model estimates were the largest. An alternative would be to adopt semi-parametric Bayesian spatial models that allow discontinuities in the risk surface and typically produce less smoothing than the BYM model [2]. Yet, these models are less robust [45] and are unlikely to benefit the simulation results since all realizations are based on continuous risk surfaces generated using semivariograms with no nugget effect. The spatial support (i.e. point versus area) has a much smaller impact on the results than the statistical methodology (i.e. geostatistical versus Bayesian models). The trade-off cost for the easier implementation of point Poisson kriging is slightly larger kriging variances, which reduces the precision of the model of uncertainty and the ability to discriminate between background and risk-raised areas. The major drawback still remains the inability of point kriging to conduct any change of support and creation of isopleth maps of risk, an operation easily conducted within the framework of area-to-point kriging. A critical element of any simulation-based comparison study is the assumptions used to generate the reference risk maps. In the present study, the mortality rates (ratio of number of deaths over population size) were not directly simulated using a geostatistical procedure, but rather were generated according to a Poisson distribution, which is similar to the approach used in other comparison studies of Bayesian models [2,14]. Because the BYM model is not designed to pick up discontinuities in the risk surface, all simulations were generated using smooth risk surfaces. The ability of Poisson kriging to uncover more complex risk structures should be investigated in future simulation studies which will include other classes of Bayesian hierarchical spatial models that generate less smoothing. Competing interests The first author is affiliated with BioMedware a research company that also develops software for the exploratory spatial and temporal analysis of health and environmental data. With funding from the National Cancer Institute, the first author developed STIS (Space-Time Intelligence System), which is a commercial product of TerraSeer and will include a Poisson kriging function in a future release. Information about STIS software, as well as a beta-version of the geostatistical module, can be obtained by contacting the first author. Authors' contributions PG carried out all simulation studies and drafted the manuscript. SG conducted the Bayesian analysis of real and simulated data using WinBUGS. All authors read and approved the final manuscript. Acknowledgements This research was funded by grants R44-CA105819-02 and 2R44CA110281-02 from the National Cancer Institute. The views stated in this publication are those of the authors and do not necessarily represent the official views of the NCI. References
|
PubMed related articles
Your browsing activity is empty. Activity recording is turned off. |
|||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Stat Med. 1999 Dec 15; 18(23):3201-9.
[Stat Med. 1999]Stat Methods Med Res. 2005 Feb; 14(1):35-59.
[Stat Methods Med Res. 2005]Stat Methods Med Res. 2005 Feb; 14(1):17-34.
[Stat Methods Med Res. 2005]Int J Health Geogr. 2004 Dec 8; 3(1):29.
[Int J Health Geogr. 2004]Int J Health Geogr. 2007 Nov 29; 6():54.
[Int J Health Geogr. 2007]Environ Health Perspect. 2004 Jun; 112(9):1016-25.
[Environ Health Perspect. 2004]IMA J Math Appl Med Biol. 1998 Sep; 15(3):279-97.
[IMA J Math Appl Med Biol. 1998]Int J Health Geogr. 2006 Oct 13; 5():45.
[Int J Health Geogr. 2006]Int J Health Geogr. 2006 Feb 9; 5():7.
[Int J Health Geogr. 2006]Int J Health Geogr. 2006 Nov 30; 5():52.
[Int J Health Geogr. 2006]Stat Med. 2003 May 15; 22(9):1415-32.
[Stat Med. 2003]Int J Health Geogr. 2005 Dec 14; 4():31.
[Int J Health Geogr. 2005]Int J Health Geogr. 2006 Nov 30; 5():52.
[Int J Health Geogr. 2006]Int J Health Geogr. 2006 Nov 30; 5():52.
[Int J Health Geogr. 2006]Int J Health Geogr. 2006 Nov 30; 5():52.
[Int J Health Geogr. 2006]Int J Health Geogr. 2006 Nov 30; 5():52.
[Int J Health Geogr. 2006]Int J Health Geogr. 2005 Dec 14; 4():31.
[Int J Health Geogr. 2005]Int J Health Geogr. 2004 Nov 8; 3(1):26.
[Int J Health Geogr. 2004]Int J Health Geogr. 2006 Nov 30; 5():52.
[Int J Health Geogr. 2006]Int J Health Geogr. 2005 Dec 14; 4():31.
[Int J Health Geogr. 2005]Biometrics. 1987 Sep; 43(3):671-81.
[Biometrics. 1987]Int J Health Geogr. 2004 Dec 8; 3(1):29.
[Int J Health Geogr. 2004]Int J Health Geogr. 2006 Nov 30; 5():52.
[Int J Health Geogr. 2006]Int J Health Geogr. 2006 Feb 9; 5():7.
[Int J Health Geogr. 2006]Int J Health Geogr. 2004 Dec 8; 3(1):29.
[Int J Health Geogr. 2004]Environ Health Perspect. 2004 Jun; 112(9):1016-25.
[Environ Health Perspect. 2004]Int J Health Geogr. 2007 Jul 24; 6():32.
[Int J Health Geogr. 2007]Environ Health Perspect. 2004 Jun; 112(9):1016-25.
[Environ Health Perspect. 2004]Int J Health Geogr. 2006 Nov 30; 5():52.
[Int J Health Geogr. 2006]Science. 1988 Jun 3; 240(4857):1285-93.
[Science. 1988]Int J Health Geogr. 2004 Nov 8; 3(1):26.
[Int J Health Geogr. 2004]Int J Health Geogr. 2006 Nov 30; 5():52.
[Int J Health Geogr. 2006]Stat Methods Med Res. 2005 Feb; 14(1):35-59.
[Stat Methods Med Res. 2005]Environ Health Perspect. 2004 Jun; 112(9):1016-25.
[Environ Health Perspect. 2004]Stat Med. 2000 Sep 15-30; 19(17-18):2217-41.
[Stat Med. 2000]Environ Health Perspect. 2004 Jun; 112(9):1016-25.
[Environ Health Perspect. 2004]Int J Health Geogr. 2007 Nov 29; 6():54.
[Int J Health Geogr. 2007]Stat Methods Med Res. 2005 Feb; 14(1):35-59.
[Stat Methods Med Res. 2005]Stat Med. 2000 Sep 15-30; 19(17-18):2217-41.
[Stat Med. 2000]Stat Methods Med Res. 2005 Feb; 14(1):35-59.
[Stat Methods Med Res. 2005]Environ Health Perspect. 2004 Jun; 112(9):1016-25.
[Environ Health Perspect. 2004]