![]() | ![]() |
Formats:
|
||||||||||||||||||||||||||||||||||||||||||||||||||
Likelihood based tests for spatial randomness 1Department of Statistics, University of Connecticut, Storrs, CT, 06269, U.S.A. 2Department of Ambulatory Care and Prevention, Harvard Medical School and Harvard Pilgrim Health Care, 133 Brookline Avenue, 6th Floor, Boston, MA 02215, U.S.A.‡ *Correspondence to: Changhong Song, Department of Statistics, University of Connecticut, 215 glenbrook road, U-4120, Storrs, CT, 06269, USA. Email: changhon/at/stat.uconn.edu †Email: martin kulldorff/at/hms.harvard.edu SUMMARY Many different methods have been proposed to test the spatial randomness of a point pattern adjusting for an inhomogeneous background population. These tests can be classified into cluster detection tests, concerned with the detection and inference of local clusters, and global clustering tests, which collect evidence for clustering throughout the study region. This paper is mainly concerned about global clustering tests. Some tests for spatial randomness are based on likelihoods, including the spatial and space-time scan statistics with variable window size and Gangnon and Clayton’s weighted average likelihood ratio tests. Both of these tests perform well compared to other tests for cluster detection and global clustering respectively. In this study, we develop other likelihood based tests for global clustering and we explore the use of different weight functions with these tests. The power of these tests is evaluated using simulated data set and compared with existing methods. Keywords: Likelihood ratio, Weight, Power, Disease clustering, Spatial statistics, Benchmark data 1 INTRODUCTION Tests for spatial randomness are widely used in many areas of applications including agriculture [1], botany [2], ecology [3], geography [4], sociology [5] and public health [6, 7]. In most situations, the interest is to detect whether the spatial distribution of the data is random or not after adjusting for some known spatial inhomogeneity. For example, when evaluating the spatial distribution of cancer incidence, it is necessary to adjust for the uneven spatial population density since there will obviously be more cancer cases in a one square mile area in the city than on the country side, simply because there are many more people living there. Many different tests for spatial randomness have been proposed. When we are using these test statistics, it is important for us to know if they have good statistical power. The power of a test statistic depends on the type of clustering model generated under alternate hypothesis. There are two common kinds of clustering models used: localized clusters and global clustering [12, 8]. For localized clusters, the relative risk of the disease at some location and its surrounding area is larger than the risk of other areas. For global clustering, there may be a lot of locations with disease hazard rate larger than other areas. Reasons for a localized cluster may be due to a geographical feature, such as a toxic waste site or a power plant. Reasons for global clustering can be that the disease is infectious or there are a lot of health hazards scattered in the study region, such as gas stations and hazardous waste sites. Tests for spatial randomness can also be generalized into two categories: cluster detection tests and global clustering tests. The former is concerned with local clusters (or hot spot clusters), and is applied when we are interested in detecting the location of clusters and testing their statistical significance. The latter collects evidence of clustering throughout the entire region without evaluating the statistical significance of individual clusters. Examples of cluster detection test include Turnbull’s CEPP [9] and the spatial scan statistic [10]. Examples of global clustering tests are Whittemore’s Test [11], Besag-Newell’s R [12], Cuzick-Edwards’ [13] k-NN, Tango’s Maximized Excess Events Test (MEET) [14], Swartz Entropy test [15] and Gangnon and Clayton’s weighted average likelihood ratio tests [16]. This paper is mainly concerned with global clustering tests. Tests based on likelihoods have been developed for both cluster detection and global clustering evaluation, including the spatial and space-time scan statistics with variable window size and Gangnon and Clayton’s weighted average likelihood ratio tests. The spatial scan statistic uses Poisson or Bernoulli likelihood ratio and takes the test statistic as the maximum likelihood ratio over a large set of circular areas. Based on the Poisson likelihood ratio, Gangnon and Clayton proposed a weighted average likelihood ratio test (WALR)[16], as well as two likelihood based cluster detection tests, one of which is a penalized scan statistic that maximizes a weight adjusted likelihood rather than the likelihood itself [17]. For this paper, we develop a large set of likelihood based tests for global clustering, using different structural forms and different weight functions. The power is estimated for all test statistics and compared with the power of the spatial scan statistic, Gangnon and Clayton’s WALR tests and Tango’s MEET. In general, weighted likelihood tests have better power for local hot-spot clusters while weighted log likelihood tests have better power for global clustering models. 2 Likelihood Based Test Statistics Spatial public health data is usually only available at the level of some aggregated geographical units, such as census tracts, zip-codes or counties. We will without loss of generality use ‘county’ to denote the geographical unit throughout this paper. The location of each county is represented by a single lat/long location, which could be its geographical or population weighted centroid. 2.1 Notation First we label all the counties in the data with i = 1, . . ., I, where I denotes the total number of counties. For county i, we denote ci as the number of cases observed in that county, ni as its population size, and ai as its geographical area size. The distance between the centroids of counties i and j is denoted as di,j. When we order all counties by their distance from county i, we use the notation (j), j = 0, . . ., I − 1 to denote the jth closest county to county i. Here (j = 0) is county i itself since the distance is 0, (j = 1) is county i’s nearest neighbor, and so on. We denote di,(j) as the distance between county i and its jth nearest neighbor, ci,(j) as the number of cases within the area of county i and its j nearest neighbor counties, and ni,(j) as the population size in county i and its j nearest neighbor counties. Next we denote C = ∑i ci as the total number of cases, N = ∑i ni as the total population size and A = ∑i ai as the total area of all counties. Last, we define Ji as the maximum cluster size for clusters centered on county i, defined in terms of the maximum number of neighboring counties that can be included. In this paper we use Ji(p) = max{(j) : ni,(j) ≤ p*N}, where 0 < p < 1 is a fixed constant defining a maximum circle size as a percent of the total population. 2.2 Structural form Likelihood ratio based tests in this paper can be generalized into twelve different structural forms. The first one is a scan statistic defined as
where LRi,(j) shorthand for the likelihood ratio defined in the next section for the circle centered around county i and containing the j closest neighbors. The second form is
The third function is defined as
The next three forms are the same with the addition of an indicator function so that we only count the likelihood ratio if there are more cases than expected under the null hypothesis of no clustering. These are denoted by LRmmh, LRwmh and LRwwh respectively. The remaining six structural forms are obtained by replacing the likelihood ration (LR) with the log likelihood ratio (LLR), denoting those tests as LLRmm, LLRwm, LLRww, LLRmmh, LLRwmh and LLRwwh respectively. We are using the subscripts mm, wm and ww to denote different combinations of maximizations and weighted sums. 2.3 Likelihoods For each county i and its j closest neighbors, the Poisson based likelihood ratio for that potential cluster is When only using the circles with more cases than expected, we have The log likelihood ratios are LLRi,(j) = log(LRi,(j)) and LLRi,(j)h = LLRi,(j)I(ci,(j)/ni,(j) > C/N) respectively. 2.4 Weight functions Nine different weight functions are used in this paper. The first is always identical to one, three depend only on the centroid of county i while five depend on both the centroid of county i and the number of neighbors j. The first weight function is just the raw summation of likelihood ratios or log likelihood ratios, denoted as While we do not expect this weight to perform very well, we include it as a baseline comparison. Gangnon and Clayton [16] proposed the following county based (CB) weight function for one of their likelihood based test statistics Like the one above, the next two weight functions also depend only on the centroid of county i. We call one the area based weight (AB) and the other the population based weight (PB). They are defined as
and
respectively. The next weight function is called the distance based exponential weight (DE). Tango proposed this weight function for his Maximized Excess Events Test (MEET) [14] and it is defined as
where λ is a parameter reflecting the spatial scale of clustering. Usually, λ is set to range from a small value to half of the largest distance between two counties. As an alternative to the distance based exponential weight, we have proposed a population adjusted exponential weight function (PE) for Tango’s MEET [18]. This can also be used for likelihood based tests, and is denoted as
where λi = di,(ti),ti = max{j : ni,(j) ≤ k}, and k is a parameter set by the user and can be viewed as a population measure for the scale of the clustering. k can be set to range from a small number such as 1% or 0.1% to 50% of the overall population. In this study, we set k to range from 5% to 50% of the overall population, because some counties have a population size over 1% of the overall population. Based on nearest neighbor property, we can define another weight function as
where s > 0 is a spatial scal parameter set by the user. In our calculation, we set the parameter s to range from 0.1 to 8. We call this weight function the nearest neighbor based weight (NN). Gangnon and Clayton [16] proposed a distance adjusted area weight (DA) for their other likelihood based test statistic. This is defined as
The above weight function can be modified to be based on the county population size rather than geographical area size. We name it distance adjusted population weight (DP) and define it as
2.5 Maximization over weight function parameters Some of the weight functions defined above depends on a spatial scale parameter λ, k or s, and each choice of that parameter leads to a different test statistic. A combined maximized test can be constructed that evaluates a large set of parameter values, adjusting the test for the multiple testing inherent in the many parameter values considered. Proposed by Tango for a different type of test for spatial randomness, the minimum p-value over the parameters is first calculated, and this minimum p-value is used as the test statistics, whos p-value is calculated by Monte Carlo simulations. Details of this maximization have been provided by Tango [14]. The maximized versions of the test statistics are denoted as MLRww(XX) and MLLRww(XX) respectively. 2.6 Maximum number of neighbors The last parameter Ji in the formulas in section 2.2 is the number of neighbors to maximize over or to do the summation over. This parameter can be set in different ways, such that the circle includes a fixed number of counties, a fixed geographical area size or a fixed population size. In this study we chose the latter, so that In this paper we use Ji(p) = max{(j) : ni,(j) ≤ p * N}, where 0 < p < 1 is a fixed constant defining a maximum circle size as a percent of the total population. We will use two different values of p, so that either a maximum of 10 or 50 percent of the population is included. We denote these tests as LRxx(XX, 10) and LRxx(XX, 50) respectively for the likelihood ratio based test statistics and equivalently for the log likelihood ratio and maximized versions. 2.7 Existing test statistics Most of the test statistics evaluated in this paper are new, but a few of them have been proposed before. The spatial scan statistic [10] is defined by LRmm. Note that since this test takes the maximum likelihood ratio over both the j neighbors and centroids i, it does not matter whether one uses the likelihood ratio or the log likelihood ratio. While the values of the test statistic are different, the p-values are always identical. Two of the other likelihood based test statistics were proposed by Gangnon and Clayton [16], LRww(CB) and LRww(DA) respectively. These two tests, as well as the weights proposed by Tango [14] for his excess events test, served as the inspiration for many of the new tests statistics evaluated in this paper. 3 Tango’s Maximized Excess Events Test (MEET) Compared to other tests for global spatial clustering, Tango’s MEET has in other studies been shown to have very good power [8, 19]. Hence, we will for comparison purposes include this test as well, even though it is not based on a likelihood formulation. For some weight function wi,j, Tango’s Excess Events Test (EET ) is a weighted sum of excess events defined as
Tango proposed two weight functions,
Tango’s EET depends on a scale parameter λ. To be able to detect clustering irrespectively of its geographical scale, Tango [14] proposed the maximized excess events test (MEET). With the DE weight it is defined as
where eet(DE, λ) is the observed values of EET(DE, λ) conditioning on λ and U is an upper limit on λ. Basically, the maximized test is using the minimum of the profile p-values as the test statistics adjusting for the multiple testing resulting from the many parameter values considered. 4 Power evaluation 4.1 Benchmark data A simulated benchmark data set is used to evaluate the statistical power of the different test statistics. This data set is based on the 1990 female population in the 245 counties and county equivalents in the northeastern United States, consisting of the states of Maine, New Hampshire, Vermont, Massachusetts, Rhode Island, Connecticut, New York, New Jersey, Pennsylvania, Delaware, Maryland and the District of Columbia. The total population was 29,535,210. The smallest size population is 2,319 in Forest County, Pennsylvania, and the largest size population is 1,231,176 in Kings County (Brooklyn), New York. The benchmark data has been described in detail elsewhere [8], and here we just give a brief description. The benchmark data sets can be downloaded at ‘http://satscan.org/datasets/’. First, 99,999 random data sets were generated under the null hypothesis by randomly and independently allocating 600 cases to counties with probabilities proportional to their population size. The null data is used to estimate the critical values, which is the cut-off point for the significance at the 0.05 level. For each of 26 alternative clustering model, 10,000 random data sets were generated to estimate the power. Two types of clustering models were used, localized hot spot clusters and global chain clustering. For the global chain clustering process, the counties are tied together sequentially on a chain that passes through each county exactly once, after which it reconnects with the first county on the chain, forming a Hamiltonian cycle, as shown on a map in an earlier publication [8]. Spatial clustering is generated by first randomly locating 300 cases on the map according to the null hypothesis. Each of these original cases then generates one additional ’twin’ case. The first kind of global clustering model is constructed by assigning the twins to the same county, so that they are a distance zero apart. In the second type of global clustering the twin may be assigned to a neighboring county along the chain, with the county selected so that there is a fixed population adjusted distance between the twins, with 0.5%, 1%, 2%, 4% or 8% of the total population between them. For the third type of global clustering models, the distance was instead selected at random according to the exponential distribution with mean 0.5%, 1%, 2%, 4% and 8% respectively. The chain does not imply that the disease itself spreads around the chain, just that twin cases are located in either of the two directions, as defined by the chain. Hot spot clusters were generated by setting the relative risk in some counties to be larger than 1. Three different sets of hot spot clusters are constructed in a rural, urban and mixed area respectively. In each area, there are five different sized clusters with 1, 2, 4, 8 and 16 counties respectively for each of these three sets. The center of the rural cluster is Grand Isle County in Vermont. The center of the mixed cluster is Allegheny County (Pittsburgh) in Pennsylvania. The center of the urban cluster is New York County (Manhattan) in New York. Detailed information about the hot spot clusters are provided in Table 1.
4.2 Power for parameter dependent tests Three weight functions DE, PE and NN depend on a parameter. Tables 2–5 show the estimated power of these tests for different parameter values. For each test and clustering model, the highest power among the parameter values is noted in bold face. The average power of each test statistic is reported in the last column of the table. For comparison, the power of the maximized test with the same weight function is reported in the last row for each test. The maximized versions perform well in all cases, with the average power falling at most slightly below the best performing parameter specific test and in many cases considerably better than the worst performing test in the same calss.
Table 2 shows estimated power of the three weighted likelihood ratio tests for the global chain clustering models. When the distance between cases increase, which means the spatial scale of clustering is larger, LRww(DE, 50, λ) has higher power for larger values of λ and vice versa. For LRww(NN, 50, s), the same is true for smaller values of s. This is not surprising since a small s gives more weight to the counties far away. For LRww(PE, 50, k) a large k is always better, except for the zero distance clustering model. Table 3 shows the power of the three weighted log likelihood ratio tests for global chain clustering models. For LLRww(DE, 50, λ) and LLRww(PE, 50, k), a small parameter value is better for small scale clustering while a large parameter is better for large scale clustering. For LLRww(NN, 50, s), the trend is reversed.
Table 4 illustrates the power of weighted likelihood ratio tests for hot spot clusters. When the clustering models have more counties and a larger population, LRww(DE, 50, λ) and LRww(NN, 50, s) show similar trends as in Table 1, with a larger lambda and a smaller s providing higher power for large clusters. LRww(PE, 50, k) also shows a similar pattern, which indicates that a large parameter value is good for large clusters and a small parameter is good for small clusters.
Table 5 illustrates the power of weighted log likelihood tests for hot spot clusters. These tests have with some excpetions lower power than the equivalent likelihood ratio based tests in Table 4. The LLRww(DE, 50, λ) tests are good for detecting urban clusters if the right parameter value is choosen. LLRww(PE, 50, k) has good power for rural clusters when k is small, for mixed clusters when k is medium sized and for the very largest urban cluster when k is large. For LLRww(NN, 50, s), a small s is good for clusters with more counties and a large s is good for clusters with less counties. 4.3 Power for parameter independent tests Table 6 shows estimated power of the parameter independent tests for the global chain clustering. To make comparison easier, the highest power for each clustering model is highlighted. The best performing likelihood based tests were MLLRww(DE, 50) and MLLRww(NN, 50), which have similar power as Tango’s MEET(DE), which has previously been shown to have very good power for these alternative models compared to seevral other global clustering tests [19]. Several of the other likelihood based tests also perform well for global chain clustering.
Table 7 shows the estimated power for the hot-spot cluster alternatives. Overall, the best performing test was LRwmh(PB, 50), followed by LRwmh (PB, 10), LRwmh(ONE, 50), LRmmh(50) and MLRww(DE, 50). The first and third of these have higher power for large clusters while the latter two have higher power for small clusters. The log likelihood based tests have in general lower power than the equivalent likelihood based test statistics, with the exception of the scan statistics LRmm and LLRmm for which the two are identical by definition. Tests based on a maximum circle size of 10 percent have slightly higher power for the smaller and medium size hot-spot clusters but much lower power for the two largest urban clusters, compared to the equivalent tests with a 50 percent maximum.
5 Discussion In this study, we have developed and evaluated several likelihood based tests for spatial randomness. Several of the new tests perform very well, similar to the best performing tests from earlier power comparisons [8, 16, 19]. By looking at different components of the test statistics, we were in many cases able to determine what component parameters work well for different types of alternative hypotheses. In two previous studies [8, 19] we systematically evaluated eight test statistics using the same benchmark data: Besag-Newell’s R [12], Bonetti-Pagano’s M [21], Cuzick-Edwards’ k-NN [13], Moran’s I [22], Swartz’ Entropy Test [15], Spatial scan statistic, Tango’s MEET and Whittemore’s Test [11]. The power varies greatly for these different test statistics. Tango’s MEET has the best power for global clustering and the spatial scan statistic has the best power for hot-spot clusters. In this paper we have shown that a couple of likelihood based test statistics have as good power as Tango’s MEET for global clustering, while several likelihood based tests have about the same power as the spatial scan statistic for hot-spot alternatives. While several tests have very good power for the various hot-spot cluster alternatives, there is a major advantage of the spatial scan statistic is that we can pinpoint the location causing the rejection of the null hypothesis [10]. This means that the p-value of the test can be assigned to the detected cluster and not only to the study region as a whole. Since the tests based on weighted summations use evidence from all parts of the map to decide whether to reject the null hypothesis or not, no such cluster specific p-values can be assigned from those tests. Since the scan statistics also have among the best power of the tests evaluated, we recommend its use when the interest is in the detection of hot-spot clusters and the evaluation of their statistical significance. For global clustering, Tango’s MEET(DE), MLLRww(DE, 50) and MLLRww(PE, 50) all have very good power, and we recommend using either of these if the main interest is to evaluate whether there exist clustering in general throughout the study region. Note that MLLRww(DE, 50) and MLLRww(PE, 50) both depend on the maximum population bound of 50%, and the strength of these two tests varies with different choice of the population bound. When the maximum is smaller such as 10%, the power may decrease when the scale of the global clustering is large, but increase if if the scale of clustering is small. As we know, the strength of the test statistics depend not only on the clustering models, but also on the underlying population. This power evaluation used the female population in 245 counties in Northeastern United States as the underlying population. It is possible that the relative strength of the various test statistics evaluated may be different if a different underlying population is used. The same is true to an even large extent when it comes to the choice of alternative hypotheses, and there are probably some clustering models for which the results will be very different. Similar studies may be done for more typical geographical regions with different clustering models. The type of power evaluations done in this paper are, in spite of these limitations, very important though. For any practical application, a test must be chosen, and the power estimates presented in this paper provides some help with that choice. It should be noted though, that as long as one of the better tests is chosen, it may not matter much exactly which one. Footnotes ‡This study was funded by grant RO1CA095979-01 from the National Cancer Institute References 1. Moran PAP. Random associations on a lattice. Preceedings of the Cambridge Philosophical Society. 1947;43:321–328. 2. Nagashima H, Terashima I, Katoh S. Effects of plant-density on frequency-distributions of plant height in cheopodium-album stands. Annals of Botany. 1995;75:173–180. 3. Meirmans PG, Vlot EC, Den Nijs JCM, Menken SBJ. Spatial ecological and genetic structure of a mixed population of sexual diploid and apomictic triploid dandelions. Journal of Evolutionary Biology. 2003;16:343–352. [PubMed] 4. Fotheringham AS, Charlton M, Brunsdon C. The geography of parameter space: an investigation of spatial non-stationarity. International Journal of Geographical Information Systems. 1996;10:605–627. 5. Cortese CF, Falk RF, Cohen JK. Further considerations on the methodological analysis of segregation indices. American Sociological Review. 1976;41:630–637. 6. Nkhoma ET, Hsu CE , Hunt VI, Harris AM. Detecting spatiotemporal clusters of accidental poisoning mortality among Texas counties, U.S., 1980 2001International Journal of Health Geographics 2004; 3: 25. 7. Klassen AC, Kulldorff M, Curriero F. Research Geographical clustering of prostate cancer grade and stage at diagnosis, before and after adjustment for risk factors. International Journal of Health Geographics. 2005;4:1. [PubMed] 8. Kulldorff M, Tango T, Park P. Power comparisons for disease clustering tests. Computational Statistics and Data Analysis. 2003;42:665–684. 9. Turnbull BW, Iwano EJ, Burnett WS, Howe HL, Clark LC. Monitoring for clusters of disease: application to leukemia incidence in upstate New York. American Journal of Epidemiology. 1990;132(1):S136–S143. [PubMed] 10. Kulldorff M. A spatial scan statistic. Communications in Statistics: Theory and Methods. 1997;26:1481–1496. 11. Whittemore AS, Friend N, Brown BW, Holly EA. A test to detect clusters of disease. Biometrika. 1987;74:631–635. 12. Besag J, Newell J. The detection of clusters in rare diseases. Journal of the Royal Statistical Society. 1991;A154:143–155. 13. Cuzick J, Edwards R. Spatial clustering for inhomogeneous populations. Journal of the Royal Statistical Society. 1990;m2:73–104. 14. Tango T. A test for spatial disease clustering adjusted for multiple testing. Statistics in Medicine. 2000;19:191–204. [PubMed] 15. Swartz JB. An entropy-based algorithm for detecting clusters of cases and controls and its comparison with a method using nearest neighbors. Health and Place. 1998;4:67–77. [PubMed] 16. Gangnon RE, Clayton MK. A weighted average likelihood ratio test for spatial clustering of disease. Statistics in Medicine. 2001;20:2977–2987. [PubMed] 17. Gangnon RE, Clayton MK. Likelihood-based tests for localized spatial clustering of disease. Environmetrics. 2004;15:797–810. 18. Song C, Kulldorff M. Tango’s maximized excess events test with different weights Under review. 19. Song C, Kulldorff M. Power evaluation of disease clustering tests. International Journal of Health Geographics. 2003;2:9. [PubMed] 20. Tango T. A class of tests for detecting ’general’ and ’focused’ clustering of rare diseases. Statistics in Medicine. 1995;14:2323–2334. [PubMed] 21. Bonetti M, Pagano M. The interpoint distance distribution as a descriptor of point patterns, with an application to spatial disease clustering. Statistics in Medicine. 2005;24:753–773. [PubMed] 22. Moran PAP. Notes on continuous stochastic phenomena. Biometrika. 1950;37:17–23. [PubMed] |
PubMed related articles
Your browsing activity is empty. Activity recording is turned off. |
|||||||||||||||||||||||||||||||||||||||||||||||||
J Evol Biol. 2003 Mar; 16(2):343-52.
[J Evol Biol. 2003]Int J Health Geogr. 2005 Jan 13; 4(1):1.
[Int J Health Geogr. 2005]Am J Epidemiol. 1990 Jul; 132(1 Suppl):S136-43.
[Am J Epidemiol. 1990]Stat Med. 2000 Jan 30; 19(2):191-204.
[Stat Med. 2000]Health Place. 1998 Mar; 4(1):67-77.
[Health Place. 1998]Stat Med. 2001 Oct 15; 20(19):2977-87.
[Stat Med. 2001]Stat Med. 2001 Oct 15; 20(19):2977-87.
[Stat Med. 2001]Stat Med. 2001 Oct 15; 20(19):2977-87.
[Stat Med. 2001]Stat Med. 2000 Jan 30; 19(2):191-204.
[Stat Med. 2000]Stat Med. 2001 Oct 15; 20(19):2977-87.
[Stat Med. 2001]Stat Med. 2000 Jan 30; 19(2):191-204.
[Stat Med. 2000]Stat Med. 2001 Oct 15; 20(19):2977-87.
[Stat Med. 2001]Stat Med. 2000 Jan 30; 19(2):191-204.
[Stat Med. 2000]Int J Health Geogr. 2003 Dec 19; 2(1):9.
[Int J Health Geogr. 2003]Stat Med. 2000 Jan 30; 19(2):191-204.
[Stat Med. 2000]Stat Med. 1995 Nov 15-30; 14(21-22):2323-34.
[Stat Med. 1995]Stat Med. 2000 Jan 30; 19(2):191-204.
[Stat Med. 2000]Int J Health Geogr. 2003 Dec 19; 2(1):9.
[Int J Health Geogr. 2003]Stat Med. 2001 Oct 15; 20(19):2977-87.
[Stat Med. 2001]Int J Health Geogr. 2003 Dec 19; 2(1):9.
[Int J Health Geogr. 2003]Int J Health Geogr. 2003 Dec 19; 2(1):9.
[Int J Health Geogr. 2003]Stat Med. 2005 Mar 15; 24(5):753-73.
[Stat Med. 2005]Biometrika. 1950 Jun; 37(1-2):17-23.
[Biometrika. 1950]Health Place. 1998 Mar; 4(1):67-77.
[Health Place. 1998]