• We are sorry, but NCBI web applications do not support your browser and may not function properly. More information
Logo of amjphAmerican Journal of Public Health Web SiteAmerican Public Health Association Web SiteSubmissionsSubscriptionsAbout Us
Am J Public Health. 2003 October; 93(10): 1655–1671.
PMCID: PMC1448030

Race/Ethnicity, Gender, and Monitoring Socioeconomic Gradients in Health: A Comparison of Area-Based Socioeconomic Measures—The Public Health Disparities Geocoding Project


Use of multilevel frameworks and area-based socioeconomic measures (ABSMs) for public health monitoring can potentially overcome the absence of socioeconomic data in most US public health surveillance systems.

To assess whether ABSMs can meaningfully be used for diverse race/ethnicity–gender groups, we geocoded and linked public health surveillance data from Massachusetts and Rhode Island to 1990 block group, tract, and zip code ABSMs. Outcomes comprised death, birth, cancer incidence, tuberculosis, sexually transmitted infections, childhood lead poisoning, and nonfatal weapons-related injuries.

Among White, Black, and Hispanic women and men, measures of economic deprivation (e.g., percentage below poverty) were most sensitive to expected socioeconomic gradients in health, with the most consistent results and maximal geocoding linkage evident for tract-level analyses.

Despite growing interest in social inequalities in health,1–4 routine monitoring of socioeconomic gradients in health in the United States is hampered by a lack of socioeconomic data in most US public health surveillance systems.5,6 Notable exceptions include birth and death certificates, which since 1989 have included data on educational level,7 and also data obtained from specialized surveys, such as the National Health Interview Survey.8 However, surveys are incapable of monitoring important aspects of population health (e.g., cancer incidence, tuberculosis (TB), and childhood lead levels), nor can they readily provide the kinds of routine—and local—data required by health departments, especially on relatively small population subgroups, such as members of diverse racial/ethnic communities.5,9

Reflecting gaps created by unavailable socioeconomic data, the 2002 edition of Health, United States,10 the annual federal publication profiling the health of the nation, lacked socioeconomic data in 85.5% of its 71 tables on “health status and determinants”; virtually all tables, however, were stratified by “sex, race, and Hispanic origin.” Likewise, fully 70% of the 467 US public health objectives for the year 2010 had no socioeconomic targets, given a lack of baseline data.11,12 This absence of economic data from routine public health monitoring—equally evident in state health department publications—obscures socioeconomic gradients in health overall and within diverse race/ethnicity–gender groups, as well as the contribution of economic deprivation to racial/ethnic and gender inequalities in health.5,6,12–14

Fortunately, the methodology of geocoding residential addresses and using area-based socioeconomic measures (ABSMs) is a potential and relatively inexpensive solution to the problem of absent or limited socioeconomic data in US public health surveillance systems.6,12,15,16 In this approach, which draws on multilevel frameworks and area-based measures, both cases (numerators) and the catchment population (denominators) are classified by the socioeconomic characteristics of their residential area, thereby permitting calculation of rates stratified by the ABSMs.

Although this approach has been employed in US health research for over 75 years,17–20 to date there exists no consensus or standard as to which ABSMs, at which level of geography, are best suited for monitoring US socioeconomic inequalities in health.6,12,15,20 Nor, to our knowledge, have any investigations systematically assessed, empirically, whether specified ABSMs perform similarly or differently in diverse race/ethnicity–gender groups. Instead, published research has exhibited a remarkable eclecticism regarding choice of geographic level and types of ABSM used, both single variable and composite.6,12,21 Although such a plurality of measures may be useful for etiologic research, in the case of monitoring, such heterogeneity impedes comparing results across studies, outcomes, and regions and over time.

We accordingly launched the Public Health Disparities Geocoding Project to ascertain which ABSMs, at which geographic level (census block group, census tract, or zip code), would be most apt for monitoring US socioeconomic inequalities in health. To provide a robust evaluation, guided by ecosocial theory,22,23 we designed the study to encompass a wide variety of health outcomes, hypothesizing that some ABSMs and geographic levels might be more sensitive to socioeconomic gradients for some health outcomes than others.

Drawing on 1990 census data and public health surveillance systems of 2 New England states, Massachusetts and Rhode Island, we included 7 types of outcomes: mortality (all cause and cause specific), cancer incidence (all sites and site specific), low birth weight, childhood lead poisoning, sexually transmitted infections, TB, and nonfatal weaponsrelated injuries.24–26 Pertinent a priori considerations, derived in part from Rossi and Gilmartin’s criteria for valid and useful social indicators,27 included (a) external validity (do the measures find gradients in the direction reported in the literature, i.e., positive, negative, or none, and across the full range of the distribution?), (b) robustness (do the measures detect expected gradients across a wide range of outcomes?), (c) completeness (is the measure relatively unaffected by missing data?), and (d) user-friendliness (how easy is the measure to understand and explain?).

Our initial analyses focused on the total population of each state, with results suggesting that public health monitoring might be judiciously augmented by the use of census tract–level measures of economic deprivation, and specifically the measure “percentage of persons below poverty.”24–26 In this investigation, we extend our analyses by examining whether these conclusions hold for diverse race/ethnicity–gender groups.


Data Sources: Population and Health Outcomes

As described in our earlier publications,24–26 the study base comprised (a) populations and areas in Massachusetts and Rhode Island enumerated at the 1990 census and (b) health outcomes occurring in the period around the 1990 census. We obtained public health surveillance data from the Massachusetts Department of Public Health and the Rhode Island Department of Health for death, birth, cancer incidence, TB, sexually transmitted infections, childhood lead poisoning (Rhode Island only), and nonfatal weapons-related injuries (Massachusetts only).

Data for death, birth, cancer incidence, and childhood lead poisoning (among children 1 to 5 years old) were analyzed for persons. Data for TB, sexually transmitted infections, and nonfatal weapons-related injuries were analyzed for new cases only, since a given individual could experience the specified outcome more than once during the study period; data for lead poisoning were likewise analyzed only for a child’s first record in the study interval, not repeat follow-up tests. Slightly over 760 000 records were included in our final analytic data sets,24–26 restricted to records for in-state residents with health outcomes occurring during the specified study interval and not missing data on age, gender, or the specified outcome, plus additional restrictions described below. All records were geocoded to the census block group, census tract, and zip code levels by a commercial geocoding firm whose accuracy we validated (96%).28

With regard to outcomes, cause of death was categorized according to International Classification of Diseases, Ninth Revision, Clinical Modification (ICD-9-CM) codes and cancer type by standard Surveillance, Epidemiology, and End Results (SEER) site/histology definitions.24,29–34 Mortality outcomes analyzed included premature mortality (< 65 years old) and selected causes of death ranked among the top 5 causes of death in each state within one or more racial/ethnic groups,24 including mortality due to heart disease, neoplasm, diabetes, HIV, and homicide. Incidence of cancer was analyzed for all cancers combined and the 5 leading sites reported nationally34: breast, cervix, colon, lung, and prostate.24

We analyzed births to mothers aged 15 to 55 years; we report results only for singleton births, using the conventional definition of low birthweight as less than 2500 g.25,30,35 For Rhode Island’s mandatory universal childhood lead screening program, blood specimens were obtained 2 ways, at the screening physician’s discretion: venous and capillary/fingerstick.25,36 Because the latter may be subject to contamination (e.g., lead dust on the pricked finger),37 we analyzed the 2 sample types separately and present only the venous results. Following guidelines issued by the Centers for Disease Control and Prevention (CDC) in 1997,37 elevated blood lead levels were defined as 10 g/dL or above.

Cases included in the sexually transmitted infection databases for both states were identified and reported to the state health department because they (a) were symptomatic patients, (b) sought testing because they were concerned about their exposure (i.e., after unsafe sex), (c) received a complete battery of sexually transmitted infection tests as part of seeking confidential HIV testing, (d) were sexual partners of persons identified as cases, or (e) received testing as part of a routine gynecologic examination.26,38,39 Cases in both states’ TB databases were identified and reported to the state health department via designated TB clinics and additional health care providers.40,41 Finally, data on nonfatal weapons-related injury (intentional and unintentional) were obtained from the Massachusetts Department of Public Health’s Weapons-Related Injury Surveillance System, which encompasses all Massachusetts acute care hospital emergency departments.42 Fully 97% of the nonfatal weapons-related injuries were intentional; data on whether the injury was intentional or not were obtained from the respondent, if conscious, and otherwise coded as “unknown.”

The 1990 census, the source of our denominators, used self-report data to classify race/ethnicity in accordance with censusdefined categories pertaining to “race” and Hispanic “ethnicity”; data on age and gender were also obtained by self-report.43 Data on race/ethnicity, gender, and age were reported by the next of kin or recorded by the funeral director for the death data and were abstracted by registry staff from medical records for the cancer data.24,29,30,32,33 For birth certificate data, mother’s race/ethnicity and age were obtained by self-report through use of closed-format questions; for childhood lead poisoning, race/ethnicity, gender, and age of the child were reported by the child’s parent (or adult guardian).25,30,35,36 Data on race/ethnicity, gender, and age for the sexually transmitted infection, TB, and injury cases were obtained by a mixture of self-report and observer report.26,38–42 Notably, the different databases employed different approaches to categorizing racial/ethnic data: some used separate fields for data on “race” and on “Hispanic origin” (or “ancestry”), permitting these data to be cross-classified, while others used only one field for these items.

To maximize the compatibility of racial/ethnic categories employed in the numerators (cases) and denominators (population) and to ensure adequate sample size to conduct meaningful analyses, our investigation thus focused primarily on 3 racial/ethnic groups: White, Black, and Hispanic. In these analyses, the White and Black populations were mutually exclusive and included all persons regardless of census-designated “ethnicity,” while the Hispanic population included persons of all census-designated “races.” According to the 1990 census,44 fully 98% of both the Massachusetts and Rhode Island White populations in 1990 identified themselves as “nonHispanic,” as did 92% of the Massachusetts and 89% of the Rhode Island Black population. Conversely, among the Hispanic population, 43% in Massachusetts and 48% in Rhode Island identified themselves as “White,” 8% and 9% as “Black,” and 47% and 41% as “other race.” Analyses for the American Indian and the Asian and Pacific Islander populations (respectively comprising, combined, 2.5% and 2.2% of the Massachusetts and Rhode Island populations) were limited to premature mortality and low birthweight, as they were the main outcomes with sufficient data to yield interpretable results.

Data Sources: ABSMs

As described in our prior investigations,24–26 we obtained 1990 census data for census tracts and block groups from US Census Bureau Summary Tape File 3A and zip code data from Summary Tape File 3B.43 According to the US Census Bureau, census tracts on average contain 4000 persons and are a “small, relatively permanent statistical subdivision of a county . . . designed to be relatively homogeneous with respect to population characteristics, economic status, and living conditions.45(pG-10,G-11) The census tract’s subdivision, the block group, contains on average 1000 persons and is the smallest geographic census unit for which census socioeconomic data are tabulated.45(pG-6)

Zip codes, in turn, have an average population of 30 000 and are “administrative units established by the United States Postal Service . . . for the most efficient delivery of mail, and therefore generally do not respect political or census statistical area boundaries,” and they can range in size from large areas cutting across states to a single building or company with a high volume of mail.43(pA-13) Moreover, unlike with census tracts and block groups, zip code boundaries may overlap (since “carrier routes for one ZIP Code may intertwine with those of one or more ZIP Codes”),46(p22) and they can be added, eliminated, or have their codes changed or boundaries redefined in nondecennial years.47,48

Three considerations guided our development of ABSMs24–26: (a) a priori conceptual definitions of socioeconomic position (SEP) and social class,6 (b) US and UK evidence emphasizing the detrimental effects of material deprivation on health,1–4,49 and (c) the need for measures that can be meaningfully compared over time and space, so as to permit valid monitoring and contrasts in relation to time period and region.6,24–26,49 Our project generated, at each level of geography for each state, 11 single-variable and 8 composite ABSMs that met these criteria (Appendix 1; available from the first author upon request), which together reflected 6 domains of SEP: occupational class, income, poverty, wealth, education, and crowding, premised on the understanding that social class, as a social relationship, fundamentally drives the distribution of these manifest aspects of SEP6,24–26.

Of note, one ABSM we included differs from the others: the Gini coefficient, which is a measure of within-area socioeconomic inequality rather than a measure of the average socioeconomic level of an area.50 We included this measure because of concerns about its uncritical use—for example, at the block group and census tract level—since realities of economic segregation imply that the Gini coefficient should be employed only for larger aggregations.51

Among the composite variables, 2 were US analogues of the UK Townsend21,49,52 and Carstairs21,49,53 deprivation indices, 1 used the algorithm for the CDC’s Index of Local Economic Resources (developed as a county-level measure),54 and the remainder were created exclusively for our study.24–26 Two of these latter composite variables, SEP1 and SEP2, were intended to mirror the skewed population distribution of socioeconomic resources and simultaneously combined categorical data on poverty, working class, and either wealth or high income. Finally, the “SEP index,” a summed z score akin to the Townsend index, was generated through inputs identified by factor analysis,55,56 as described for our prior analyses.24–26 Cutpoints for both the single-variable and composite categorical ABSMs were based on both their percentile distribution (e.g., quintiles) and a priori considerations (e.g., the federal definition of “poverty areas” as regions where 20% or more of the population is below the US poverty line)57,58 (Appendix 2; available from the first author upon request).

Data Analysis

Our analytic plan involved 4 steps, conducted separately for each race/ethnicity–gender group. In Step 1, we assessed the data distribution, including the extent of missing data. In Step 2, we calculated the relevant age-standardized average annual incidence rate or proportion (for low birthweight and childhood lead poisoning), stratified by each ABSM at each level of geography for each state.59,60 For age standardization, we employed the year 2000 standard million,61 using 5 age groups (birth–14, 15–24, 25–44, 45–64, ≥65 years). The numerators and denominators of the calculated rates and proportions consisted of persons residing in areas identified at the specified level of geography for which data on the specified ABSMs were available. Following standard practice for rates centered around a census,62,63 we set the total number of person-years in the denominator equal to the population in that socioeconomic stratum enumerated in the 1990 census multiplied by the relevant number of years of observation.

In Step 3, we visually inspected and quantified socioeconomic gradients for each outcome, using each ABSM at each geographic level. Following standard US reporting practices,1,5 we computed the incidence rate ratio or odds ratio, as warranted, comparing people living in areas with the least and most resources. We also calculated the relative index of inequality, a measure that takes into account the proportion of the population in each stratum as well as the effect estimate for that stratum, thereby providing a single metric that can be meaningfully compared across diverse socioeconomic measures (whether using categories that emphasize the extremes or yield more equal distributions, e.g., quintiles).64–66 In Step 4, we summarized findings across ABSMs and geographic levels, in relation to our above-mentioned a priori considerations regarding external validity, robustness, completeness, and user-friendliness of each measure. All analyses were conducted in SAS.67

Finally, to consolidate our key findings, we devised a “scaled relative index of inequality plot,” in which we display the relative index of inequality for a given ABSM divided by the median value for all the ABSMs being compared. This plot facilitates determining which ABSMs were likely to detect relative indexes of inequality similar to, higher than, or lower than the median relative index of inequality, for each given outcome, at each geographic level, for each race/ethnicity–gender group. To address concerns pertaining to unreliable data, results for outcomes with less than 5 cases are suppressed, while those for outcomes with either 5 to 20 cases or 20 or more cases, and for which the width of the 95% confidence interval is 2 times or more the value of the relative index of inequality, are separately flagged.


Table 1 [triangle] presents data on the study base, defined in terms of both population and areas (block group, census tract, and zip code), as enumerated in the 1990 census for Massachusetts and Rhode Island, which respectively included approximately 6 million and 1 million residents. As shown in Table 2 [triangle], about half the White population (both women and men) in both states lived in areas where fewer than 5% of persons lived below the poverty line; by contrast, over half of the Black and Hispanic populations lived in areas where the poverty rate was 20% or more (same for women and men). Similar patterns were evident for the ABSM for low education and, more starkly, the Townsend index, with no notable difference by gender within racial/ethnic groups.

Study Population, Public Health Disparities Geocoding Project—Adults, by Race/Ethnicity and Gender, and Areas, by Level of Geography: Massachusetts and Rhode Island, 1990
Distribution of the Population for the Public Health Disparities Geocoding Project, Stratified by Block Group–, Census Tract–, and Zip Code–Level Area-Based Socioeconomic Measures, Race/Ethnicity, and Gender: Massachusetts and ...

Table 3 [triangle] provides data on the number of cases for each outcome, by race/ethnicity and gender, and the percentage geocoded to the block group, census tract, and zip code level, which displayed little variation by race/ethnicity or gender. Overall, 93.7% of records were geocoded to the block group level, 99.5% to the census tract level, and 99.5% to the zip code level; only 0.6% were not geocoded to any of these 3 levels. All records geocoded to the block group and census tract level were linked to the relevant censusdefined areas. By contrast, at the zip code level, for several outcomes (especially birth, cancer incidence, and TB), often 10% to 15% of cases could not be linked to census zip code areas, because either their zip codes were for nonresidential sites or the zip codes were created or changed after the 1990 census.

Number of Cases for the Public Health Disparities Geocoding Project: Percentage Geocoded and Percentage Matched to Census Areas, by Level of Geography and Race/Ethnicity: Massachusetts and Rhode Island , ca. 1990

Table 4 [triangle] presents results for premature mortality for the Massachusetts census tract–level ABSMs and offers an illustration of the data we generated for each outcome, at each geographic level, for each ABSM, for each race/ethnicity–gender group, for both states (Appendix 3; available from the first author upon request). For each economic stratum of each ABSM, the table (men only) displays the number of cases, the denominator, the computed age-standardized rate, and the relative index of inequality (summarizing the socioeconomic gradient across all the economic strata); the incidence rate ratio, comparing rates among persons living in areas with the least versus most resources, is available from the first author. Thus, in the case of premature mortality, the median relative index of inequality for the 11 census tract–level ABSMs typically was 2 or higher in every race/ethnicity–gender group, with measures of economic deprivation most frequently detecting the steepest gradients. Among men, reliable estimates of the relative index of inequality ranged from a low of 0.9 (Black men: wealth ABSM) to a high of 3.4 (Asian and Pacific Islander men: poverty ABSM). Among women, reliable estimates of the relative index of inequality ranged from a low of 1.2 (Black women: wealth and working class ABSM) to a high of 3.2 (Asian and Pacific Islander women: Townsend ABSM).

Premature Mortality (<65 Years Old, Per 100 000a) Among Men, by Race/Ethnicity, Stratified by Census Tract–Level Area-Based Socioeconomic Measures, Massachusetts, 1989–1991: Public Health Disparities Geocoding Project

Building on these analyses, Figure 1 [triangle] employs the scaled relative index of inequality plot to summarize our findings for the census tract–level ABSMs across all outcomes for the total population and for the different race/ethnicity–gender groups. Four results stand out, which held for analyses at each geographic level (plots at the block group and zip code level available from the first author upon request). First, within both the total population and each race/ethnicity–gender group, the relative index of inequality for most ABSMs was close to the median for virtually all outcomes, suggesting that the impact of socioeconomic position on a given health outcome is robust. Second, measures of economic deprivation—such as the percentage of persons below poverty (the dark green line) and the Townsend index (the bright red line)—were most sensitive to expected socioeconomic gradients in health, with their relative indexes of inequality routinely at or above the median. By contrast, relative indexes of inequality for measures of wealth and income inequality generally fell below the median, and those for measures of education hovered around the median. Third, these patterns were especially evident for HIV mortality, homicide, TB, and sexually transmitted infections, for which much larger (and expected24–26) gradients were detected by ABSMs for economic deprivation compared with other ABSMs. Fourth, for virtually all outcomes, the median relative index of inequality typically was largest for the White population and smallest for the Black population.

Scaled relative index of inequality (RII) plot, with ln (RII/median RII) for the 11 area-based socioeconomic measures (ABSMs) at the census tract level (Massachusetts, ca. 1990) for all health outcomes for (a) total population, (b) White women, (c) Black ...

Two findings, however, differed by geographic level (data not shown; available from the first author upon request). First, in each race/ethnicity–gender group, the block group and census tract median relative indexes of inequality for most outcomes resembled each other (with the block group estimate often slightly larger), whereas the zip code median relative index of inequality variously was larger or smaller than the block group or the census tract median relative index of inequality. Second, as reported previously for the total population,48 in each of the race/ethnicity–gender groups, only zip code–level analyses yielded parameter estimates contrary to what has been reported in the literature (e.g., they detected a positive socioeconomic gradient when most research has reported a negative gradient).


Taken together, our results suggest that aptly chosen ABSMs can meaningfully augment US public health surveillance systems to permit routine monitoring of socioeconomic inequalities in health within diverse race/ethnicity–gender groups as well as the total population. Three key findings pertinent to selecting a particular ABSM at a particular level stand out. First, census tract–level analyses yielded the most consistent results with maximal geocoding linkage (i.e., the highest proportion of records both geocoded and linked to census-defined areas). Second, measures of economic deprivation were most sensitive to expected socioeconomic gradients in health. Third, the single-variable measure “percentage of persons below poverty” performed as well as more complex, composite measures of economic deprivation (e.g., the Townsend index). These findings, in conjunction with our a priori criteria pertaining to external validity, robustness, completeness, and user-friendliness, accordingly suggest that the census tract–level measure “percentage of persons below poverty” would be a plausible candidate variable. Also important would be use of a priori categorical cutpoints (including the policy-relevant 20% below poverty57,58) to facilitate meaningful comparisons across time and space.

In evaluating our results, it is important to consider several possible sources of bias and error, as well as issues regarding interpretation and use of ABSMs. First, bias could result if a person’s socioeconomic position were associated with being included in a given public health surveillance system, having an erroneous or ungeocodable address, or living in an area missing ABSM data.6,15 If, for example, these problems occurred more frequently among poorer persons, estimates of socioeconomic gradients would be deflated; if, less plausibly, these problems chiefly affected affluent persons, the estimate would be inflated. In our prior research, however, with the 2 databases containing individual-level socioeconomic data (birth and death), we found no variation by educational level in the proportion of records geocoded at each geographic level.24,25

Second, misclassification of race/ethnicity and differential census undercounts by race/ethnicity and socioeconomic position could also affect parameter estimates.14 In both cases, however, the resultant biases would be operative at each geographic level and thus would not invalidate comparisons of socioeconomic gradients across ABSMs or geographic levels. Moreover, only a tiny proportion of areas (typically under 1%) lacked data on ABSMs and, to minimize geocoding error, we used a geocoding firm whose accuracy we validated.28 Third, from a temporal standpoint, cross-sectional analysis cannot address issues of etiologic period; simultaneity of measurement of ABSMs and health outcomes, however, is appropriate for monitoring, given the goal of quantifying the population burden of ill health in relation to socioeconomic position.24–26

Use of ABSMs nevertheless does raise several concerns. First, associations between ABSMs and health status likely reflect a complex combination of 3 factors: (1) composition (people in poor areas have poor health because poor people, as individuals, have poor health), (2) context (people in poor areas also have poor health because a concentration of poverty creates or exacerbates harmful social interactions), and (3) location of public goods or environmental conditions (poor areas are less likely to have good supermarkets and are more likely to be situated next to industrial plants, thereby harming health of their residents).12,21,68–71

Ascertaining the relative contribution of each of these factors—a task relevant for etiologic research—would necessitate multilevel models with relevant individual-level and area-based data69–72 (i.e., precisely the data that most US public health surveillance systems lack). Germane to compositional effects, however, the handful of US studies comparing effect estimates using area-based and individual-level socioeconomic data in conjunction with individual health data have found effect estimates in the same direction, often with similar magnitude (at the block group and census tract levels, but not the zip code level).15,73–75 Multilevel models, moreover, additionally suggest that, for at least some health outcomes, area- and individual-level socioeconomic factors independently and jointly shape the population distributions of disease.68–72

A second concern pertains to “ecologic fallacy,” which occurs when both the dependent and independent variables are group-level data and confounding is introduced through the grouping process.69,76 This type of aggregation bias, however, is not germane to the method of appending ABSMs to individual records, because individuals constitute the unit of observation for both the dependent variable (health outcomes) and the independent variable (exposure to area-based socioeconomic conditions).15,24–26 Instead, as for any research using areal measures, at issue is whether the areas and contingent areal measures are meaningful entities51,71,77; in the case of monitoring, this translates to whether the areas and ABSMs are apt for characterizing the social contours of the population burden of disease.15,24–26 In the case of census tracts, not only are initial boundaries delineated to demarcate relatively economically homogeneous populations, as previously noted,45(pG-10,G-11) but these administrative areas also affect residents’ lives because they are used by federal, state, and local governments to plan programs and allocate resources—for example, to define “urban empowerment zones,” to designate “medically underserved areas,” and to delimit city neighborhoods served by public health agencies.

A third related concern is whether ABSMs meaningfully measure conditions experienced by all persons residing in the specified area, especially in areas marked by socioeconomic heterogeneity.21,49,71 Indeed, it was this concern which led us to investigate whether use of different geographic areas matters, and whether different patterns are seen for members of diverse race/ethnicity–gender groups. Our principal finding—that census tract and block group ABSMs yield similar parameter estimates, whereas zip code estimates are less consistent—importantly held, however, for both the total population and specified subpopulations. These results are consonant with other studies, especially in the United Kingdom, and support the view that one advantage of ABSMs is that they can be applied equally to all persons, regardless of age, gender, and employment status, thereby avoiding well-known problems associated with occupation- and education-based measures—that is, how to classify people not in the paid-labor force (children, housewives or househusbands, unemployed or retired persons) or who have not yet completed their education (school-age children and young adults).6,15,21,53,78

Concerns about ABSMs and the evidence addressing them thus suggests that while use of ABSMs for public health monitoring requires judicious interpretation, the information gained can usefully offset current gaps in knowledge due to the absence of socioeconomic data in most US public health surveillance systems. With these data, it becomes feasible to monitor socioeconomic gradients within diverse race/ethnicity–gender groups; by extension, the magnitude of race/ethnicity–gender inequality in health within specified economic strata could be assessed—as could the contribution of economic deprivation to racial/ethnic and gender disparities in health.12,15

For example, data in our study provided noteworthy evidence that economic gradients were steepest in the White population and shallowest in the Black population. Far from suggesting that economic inequality is less of a concern for African Americans’ health, these findings chiefly resulted from absolute rates being higher among Black compared with White Americans in areas with the most resources. In addition to underscoring that both absolute and relative rates must be considered when evaluating health disparities,65,79,80 these results emphasize why analyses of racial/ethnic inequalities in health need to take into account economic disparities and, conversely, why analyses of economic inequalities in health need to take into account racial/ethnic disparities.13,14,81

Relevant to future use of ABSMs are 2 additional considerations. The first is that starting with the year 2000 census, zip code–level data are no longer available and databases with only zip codes cannot be linked to census data absent geocoding to some other geographic level for which census data are available.48 Prompting this change were “difficulties in precisely defining the land area covered by each ZIP Code,”82 leading the US Census Bureau to create a new statistical entity built from census blocks: the 5-digit Zip Code Tabulation Area (ZCTA).83 Since ZCTAs and zip codes sharing the same 5-digit code may not necessarily cover the same area,83 zip codes obtained by self-report or from addresses in medical records cannot be assumed to correspond to census-defined ZCTAs.48

Second, regarding the timeliness of census data, pending authorization and funding by Congress, the decennial census long form (source of the socioeconomic data for the ABSMs) is scheduled to be replaced by the annual American Community Survey, which will provide yearly sociodemographic estimates at the national, state, and other geographic levels.84 Data at the census tract level are anticipated to be released starting in 2008, employing annual estimates based on 5-year rolling averages; less certainly, block group–level data may also be released starting in 2009 (C. Richard, senior program analyst, American Community Survey, oral communication, January 8, 2003). Presumably, a similar methodology, employing 5-year rolling averages, could also be used for health data among smaller population subgroups, thereby enabling the routine monitoring of socioeconomic inequalities in health among all race/ethnicity–gender groups, assuming that concerns about validity and consistency of racial/ethnic data across public health data systems could be addressed.14,85

In conclusion, results of our study highlight the importance of multilevel frameworks, including ecosocial theory, for public health research and practice.22,23,68–72,86–89 Tellingly, were data constrained only to the individual level, we would remain without any practical solution for improving routine monitoring of socioeconomic inequalities in the United States, other than continuing to advocate for inclusion of individual-level socioeconomic data in diverse public health surveillance systems. Even if it were possible to overcome resistance to including such measures5,90—let alone ensure use of identical measures across diverse databases to enable meaningful comparison—it would still not be possible to monitor secular trends in socioeconomic inequalities in health, owing to the absence of the individual-level socioeconomic data from previous years. By expanding the levels of analysis to include characteristics of areas in which people reside, it is instead possible to envision—and test—an alternative solution, that of geocoding and using ABSMs. By the same logic, further elaboration of multilevel frameworks and methods is likely to aid efforts to understand and address the persistent problem of social inequalities in health in the United States.22,23,68–72,86–91 One way to begin is by ensuring that the magnitude of these disparities is duly and routinely monitored, rather than hidden from view. We suggest that this can be accomplished by geocoding US public health surveillance systems and using the census tract–level measure “percentage of persons below poverty.”


This work was funded by the National Institutes of Health’s National Institute of Child Health and Human Development (1 R01 HD36865–01) and Office of Behavioral and Social Science Research (principal investigator, N. Krieger, PhD).

We thank Dr Daniel Friedman (assistant commissioner, Bureau of Health Statistics, Research and Evaluation, Massachusetts Department of Health) and Dr Jay Buechner (chief, Office of Health Statistics, Rhode Island Department of Health) for facilitating the conduct of this study using data from their respective health departments and for their helpful comments on our manuscript. We likewise thank the following health department staff for their contributions to accessing the study’s data:

Massachusetts Department of Public Health: Alice Mroszczyk, program coordinator for 24A/B/111B Review Committee; Cancer Registry: Dr Susan Gershman, director, Mary Mroszczyk, geocoding/special projects coordinator, and Ann R. MacMillan, data analyst; Registry of Vital Records and Statistics: Elaine Trudeau, registrar of vital records, and Charlene Zion, Public Information Office; Infectious Diseases: Alfred DeMaria, assistant commissioner, Yuren Tang, chief of Surveillance Program, and Sharon Sharnprapai, TB epidemiologist; Weapons-Related Injury Surveillance System: Victoria Vespe Ozonoff, program director, Beth Hume, data manager/analyst, Patrice Cummins, epidemiologist, and Laurie Janelli, site coordinator;

Rhode Island Department of Health: Vital Statistics: Roberta Chevoya, state registrar of vital records; Division of Disease Prevention and Control: Dr John Fulton, associate director, Ted Donnelly, senior public health epidemiologist, and Michael Goscminksi, epidemiologist; Environmental Health Risk Assessment: Susan Feeley, public health epidemiologist; Childhood Lead Poisoning Prevention: Magaly Angeloni, program manager.

Human Participant Protection
Use of the data employed in this study was approved by all relevant institutional review boards/human subjects committees at the Harvard School of Public Health, the Massachusetts Department of Public Health, and the Rhode Island Department of Health.


N. Krieger conceived and designed the study, directed data analysis, and led the writing. J. T. Chen contributed to the study design, led the data analysis, and assisted with manuscript preparation. P. D. Waterman contributed to the study design, secured the data, and assisted with data analysis, interpretation, and manuscript preparation. D. H. Rehkopf assisted with the study and analyses. S. V. Subramanian assisted with the study design and data interpretation. All authors helped to conceptualize ideas, interpret findings, and review drafts of the manuscript.

Peer Reviewed


1. Health, United States, 1998 With Socioeconomic Status and Health Chartbook. Hyattsville, Md: National Center for Health Statistics; 1998.
2. Leon D, Walt G, eds. Poverty, Inequality, and Health: An International Perspective. Oxford, England: Oxford University Press; 2001.
3. Marmot M, Wilkinson RG, eds. Social Determinants of Health. Oxford, England: Oxford University Press; 1999.
4. Berkman L, Kawachi I, eds. Social Epidemiology. Oxford, England: Oxford University Press; 2000.
5. Krieger N, Chen JT, Ebel G. Can we monitor socioeconomic inequalities in health? A survey of US Health Departments’ data collection and reporting practices. Public Health Rep. 1997;112:481–491. [PMC free article] [PubMed]
6. Krieger N, Williams D, Moss N. Measuring social class in US public health research: concepts, methodologies and guidelines. Annu Rev Public Health. 1997;18:341–378. [PubMed]
7. Tolson GC, Barnes JM, Gay GA, Kowaleski JL. The 1989 revision of the US standard certificates and reports. Vital Health Stat 4. 1991;No. 28. [PubMed]
8. National Center for Health Statistics. National Health Interview Survey. Available at: http://www.cdc.gov/nchs/nhis.htm. Accessed January 1, 2003.
9. Pollack AM, Rice DP. Monitoring health care in the United States—a challenging task. Public Health Rep. 1997;112:108–113. [PMC free article] [PubMed]
10. Health, United States 2002 With Chartbook on Trends in the Health of Americans. Hyattsville, Md: National Center for Health Statistics; 2002.
11. Healthy People 2010. Conference ed, 2 vol. Washington, DC: US Dept of Health and Human Services; 2000:objective 23–3. Available at: http://www.health.gov/healthypeople. Accessed January 1, 2003.
12. Krieger N, Zierler S, Hogan JW, et al. Geocoding and measurement of neighborhood socioeconomic position: a US perspective. In: Kawachi I, Berkman L, eds. Neighborhoods and Health. Oxford, England: Oxford University Press; 2003:147–178.
13. Krieger N, Rowley D, Hermann AA, Avery B, Phillips MT. Racism, sexism, and social class: implications for studies of health, disease, and well-being. Am J Prev Med. 1993;9(suppl):82–122. [PubMed]
14. Williams DR. Race/ethnicity and socioeconomic status: measurement and methodological issues. Int J Health Serv. 1996;26:483–505. [PubMed]
15. Krieger N. Overcoming the absence of socioeconomic data in medical records: validation and application of a census-based methodology. Am J Public Health. 1992;82:703–710. [PMC free article] [PubMed]
16. Lynch J, Kaplan G. Socioeconomic position. In: Berkman L, Kawachi I, eds. Social Epidemiology. Oxford, England: Oxford University Press; 2000:13–35.
17. Nathan WB. Health Conditions in North Harlem 1923–1927. New York, NY: National Tuberculosis Association; 1932.
18. Green HW. The use of census tracts in analyzing the population of a metropolitan community. J Am Stat Assoc. 1933;28:147–153.
19. Terris M. Relation of economic status to tuberculosis mortality by age and sex. Am J Public Health. 1948;38:1061–1070. [PMC free article] [PubMed]
20. Coulter EJ, Guralnick L. Analysis of vital statistics by census tract. J Am Stat Assoc. 1959;54:730–740.
21. Carstairs V. Socio-economic factors at areal level and their relationship with health. In: Elliott P, Wakefield J, Best N, Briggs D, eds. Spatial Epidemiology: Methods and Applications. Oxford, England: Oxford University Press; 2000:51–67.
22. Krieger N. Epidemiology and the web of causation: has anyone seen the spider? Soc Sci Med. 1994;39:887–903. [PubMed]
23. Krieger N. Theories for social epidemiology for the 21st century: an ecosocial perspective. Int J Epidemiol. 2001;30:668–677. [PubMed]
24. Krieger N, Chen JT, Waterman PD, Soobader M-J, Subramanian SV, Carson R. Geocoding and monitoring US socioeconomic inequalities in mortality and cancer incidence: does choice of area-based measure and geographic level matter? The Public Health Disparities Geocoding Project. Am J Epidemiol. 2002;156:471–482. [PubMed]
25. Krieger N, Chen JT, Waterman PD, Soobader M-J, Subramanian SV, Carson R. Choosing area-based socioeconomic measures to monitor social inequalities in low birthweight and childhood lead poisoning—The Public Health Disparities Geocoding Project (US). J Epidemiol Community Health. 2003;57:186–199. [PMC free article] [PubMed]
26. Krieger N, Chen JT, Waterman PD, Soobader M-J, Subramanian SV. Monitoring socioeconomic inequalities in sexually transmitted infections, tuberculosis, and violence: geocoding and choice of area-based socioeconomic measures—The Public Health Disparities Geocoding Project (US). Public Health Rep. 2003;118:240–260. [PMC free article] [PubMed]
27. Rossi RJ, Gilmartin KJ. The Handbook of Social Indicators: Sources, Characteristics, and Analysis. New York, NY: Garland STPM Press; 1980.
28. Krieger N, Waterman P, Lemieux K, Zierler S, Hogan JW. On the wrong side of the tracts? Evaluating accuracy of geocoding for public health research. Am J Public Health. 2001;91:1114–1116. [PMC free article] [PubMed]
29. Massachusetts Dept of Public Health, Division of Research and Epidemiology. Massachusetts death data. Available at: http://www.state.ma.us/dph/bhsre/resep/resep.htm. Accessed January 3, 2003.
30. Rhode Island Dept of Health. Office of Vital Records. Available at: http://www.health.ri.org/management/vital/home.htm. Accessed January 3, 2003.
31. International Classification of Diseases, Ninth Revision, Clinical Modification [computer file]. 6th ed. Washington, DC: Centers for Disease Control and Prevention, Health Care Financing Administration; 2000.
32. Massachusetts Dept of Public Health. Massachusetts Cancer Registry. Available at: http://www.state.ma.us/dph/bhsre/mcr/canreg.htm. Accessed January 3, 2003.
33. Rhode Island Dept of Health. Cancer Control Program, Rhode Island Cancer Registry. Available at: http://www.health.state.ri.us/disease/cancer/canreg.htm. Accessed January 3, 2003.
34. Reis LAG, Eisner MP, Kosary CL, et al., eds. SEER Cancer Statistics Review, 1973–1999. Bethesda, Md: National Cancer Institute; 2002. Available at: http://seer.cancer.gov/csr/1973_1999. Accessed January 1, 2003.
35. Massachusetts Dept of Public Health, Division of Research and Epidemiology. Massachusetts birth data. Available at: http://www.state.ma.us/dph/bhsre/resep/resep.htm. Accessed January 3, 2003.
36. Rhode Island Dept of Health. Rhode Island Lead Screening Plan (October2000). Available at: http://www.health.state.ri.us/family/lead/screeningplan.pdf. Accessed January 3, 2003.
37. Screening Young Children for Lead Poisoning: Guidance for State and Local Public Health Officials. Atlanta, Ga: Public Health Service, Centers for Disease Control and Prevention; 1997.
38. Massachusetts Dept of Public Health. 1998. Annual STD Report. Bureau of Communicable Disease Control, Division of Sexually Transmitted Disease Prevention. Available at: http://www.state.ma.us/dph/cdc/std/reports/std98.pdf. Accessed January 1, 2003.
39. Rhode Island Dept of Health. Communicable diseases in Rhode Island: diseases spread through sex (sexually transmitted diseases). Available at: http://www.health.state.ri.us/disease/communicable/dis_sex.htm. Accessed January 1, 2003.
40. Massachusetts Dept of Public Health. 2000 tuberculosis overview. Bureau of Communicable Disease Control, Division of Tuberculosis Prevention and Control. Available at: http://www.state.ma.us/dph/cdc/tb/tbstat2000.pdf. Accessed January 1, 2003.
41. Rhode Island Dept of Health. Tuberculosis. Available at: http://www.healthri.org/disease/communicable/tb.htm. Accessed January 1, 2003.
42. Massachusetts Dept of Public Health. Weapon injury data (April2001). Available at: http://www.state.ma.us/dph/bhsre/isp/wrisp/pubs/windata/01wpinjdata.pdf. Accessed January 1, 2003.
43. Census of Population and Housing, 1990: Summary Tape File 3 Technical Documentation. Washington, DC: Bureau of the Census; 1991.
44. US Bureau of the Census. American Factfinder. Available at: http://factfinder.census.gov. Accessed November 7, 2002.
45. Geographical Areas Reference Manual. Washington, DC: US Bureau of the Census; 1994. Available at: http://www.census.gov/geo/www/garm.html. Accessed January 1, 2003.
46. US Bureau of Census. Geographics changes for Census 2000 + glossary. Available at: http://www.census.gov/geo/www/tiger/glossary.html. Accessed January 1, 2003.
47. Address Information Products Technical Guide. Memphis, Tenn: US Postal Service National Customer Support Center; 2001.
48. Krieger N, Waterman P, Chen JT, Soobader M-J, Subramanian SV, Carson R. Zip code caveat: bias due to spatiotemporal mismatches between zip codes and US census–defined areas—The Public Health Disparities Geocoding Project. Am J Public Health. 2002;92:1100–1102. [PMC free article] [PubMed]
49. Lee P, Murie A, Gordon D. Area Measures of Deprivation: A Study of Current Methods and Best Practices in the Identification of Poor Areas in Great Britain. Birmingham, England: Centre for Urban and Regional Studies, University of Birmingham; 1995.
50. Cowell FA. Measuring Inequality. 2nd ed. London: Prentice Hall; 1995. LSE Handbooks in Economics Series.
51. Soobader M, LeClere FB. Aggregation and the measurement of income inequality: effects on morbidity. Soc Sci Med. 1999;48:733–744. [PubMed]
52. Townsend P, Phillimore P, Beattie A. Health and Deprivation: Inequality and the North. London, England: Croom Helm; 1988.
53. Carstairs V, Morris R. Deprivation and mortality: an alternative to social class? Community Med. 1989;11:210–219. [PubMed]
54. Casper ML, Barnett E, Halverson JA, et al. Women and Heart Disease: An Atlas of Racial and Ethnic Disparities in Mortality. Morgantown: Office for Social Environment and Health Research, West Virginia University; 1999.
55. Lawley DN, Maxwell AE. Factor Analysis as a Statistical Method. New York, NY: Macmillan Publishing Co; 1971.
56. Bartholomew DJ. Latent Variable Models and Factor Analysis. London, England: Charles Griffin & Co; 1987.
57. US Bureau of the Census. Poverty areas. Available at: http://www.census.gov/population/socdemo/statbriefs/povarea.html. Accessed January 1, 2003.
58. Jargowsky PA. Poverty and Place: Ghettos, Barrios, and the American City. New York, NY: Russell Sage Foundation; 1997.
59. Breslow NE, Day NE, eds. Statistical Methods in Cancer Research, Vol. II: The Design and Analysis of Cohort Studies. Oxford, England: Oxford University Press; 1987:54.
60. Rothman KJ, Greenland S. Modern Epidemiology. 2nd ed. Philadelphia, Pa: Lippincott-Raven; 1998:263.
61. Anderson RN, Rosenberg HM. Age standardization of death rates: implementation of the year 2000 standard. Natl Vital Stat Rep. 1998;37(3). [PubMed]
62. Breslow NE, Day NE. Statistical Methods in Cancer Research: Vol. I. The Analysis of Case–Control Studies. Lyon, France: International Agency for Research on Cancer; 1980:47. IARC Scientific Publication 32.
63. Boyle P, Parkin DM. Statistical methods for registries. In: Jensen OM, Parkin DM, MacLennan R, et al., eds. Cancer Registration: Principles and Methods. Lyon, France: International Agency for Research on Cancer; 1991:126–158. IARC Scientific Publication 95.
64. Pamuk ER. Social class inequality in mortality from 1921 to 1972 in England and Wales. Popul Stud. 1985;39:17–31. [PubMed]
65. Wagstaff A, Paci P, van Doorslaer E. On the measurement of inequalities in health. Soc Sci Med. 1991;33:545–557. [PubMed]
66. Davey Smith G, Hart C, Hole D, et al. Education and occupational social class: which is the more important indicator of mortality risk? J Epidemiol Community Health. 1998;52:153–160. [PMC free article] [PubMed]
67. SAS/STAT User’s Guide, Version 6. 4th ed, vol 1. Cary, NC: SAS Institute Inc; 1989.
68. Macintyre S, Ellaway A, Cummins S. Place effects on health: how can we conceptualise, operationalise and measure them? Soc Sci Med. 2002;55:125–139. [PubMed]
69. Diez-Roux AV. Multilevel analysis in public health research. Annu Rev Public Health. 2000;21:171–192. [PubMed]
70. Jones K, Duncan C. Individuals and their ecologies: analysing the geography of chronic illness within a multilevel modelling framework. Health Place. 1995;1:27–40.
71. O’Campo P. Invited commentary: advancing theory and methods for multilevel models of residential neighborhoods and health. Am J Epidemiol. 2003;157:9–13. [PubMed]
72. Subramanian SV, Jones K, Duncan C. Multilevel methods for public health research. In: Kawachi I, Berkman L, eds. Neighborhoods and Health. Oxford, England: Oxford University Press; 2003:65–111.
73. Krieger N. Women and social class: a methodological study comparing individual, household, and census measures as predictors of black/white differences in reproductive history. J Epidemiol Community Health. 1991;45:35–42. [PMC free article] [PubMed]
74. Geronimus AT, Bound J, Neidert LJ. On the validity of using census geocode data to proxy individual socioeconomic characteristics. J Am Stat Assoc. 1996;91:529–537.
75. Soobader M, LeClere FB, Hadden W, Maury B. Using aggregate geographic data to proxy individual socioeconomic status: does size matter? Am J Public Health. 2001;91:632–636. [PMC free article] [PubMed]
76. Alker HR Jr. A typology of ecologic fallacies. In: Doggan M, Rokkan S, eds. Social Ecology. Cambridge, Mass: MIT Press; 1969:69–86.
77. Oppenshaw S, Taylor PJ. The modifiable areal unit problem. In: Wrigley N, Bennett RJ, eds. Quantitative Geography. London, England: Routledge & Kegan Paul; 1981:60–69.
78. Sorensen A. Women, family, and class. Annu Rev Sociol. 1994;20:27–47.
79. Lahelma E, Manderbacka K, Rahkonen O, Karisto A. Comparisons of inequalities in health: evidence from national surveys in Finland, Norway and Sweden. Soc Sci Med. 1994;38:517–524. [PubMed]
80. Borrell C, Rue M, Pasarin MI, Benach J, Kunst AE. The measurement of inequalities in health. Gac Sanit. 2000;14(suplemento 3):20–33. [PubMed]
81. Krieger N. Does racism harm health? Did child abuse exist before 1962? On explicit questions, critical science, and current controversies: an ecosocial perspective. Am J Public Health. 2003;93:194–199. [PMC free article] [PubMed]
82. US Bureau of the Census. Census 2000 ZIP Code Tabulation Areas (ZCTAs). Available at: http://www.census.gov/geo/ZCTA/zcta.html. Accessed January 1, 2003.
83. US Bureau of the Census. Census 2000 ZCTAs ZIP Code Tabulation Areas technical documentation. Available at: http://www.census.gov/geo/ZCTA/zcta_tech_doc.pdf. Accessed January 1, 2003.
84. US Bureau of the Census. American Community Survey: what are the supplementary surveys? Available at: http://www.census.gov/acs/www/Sbasics/What/What2.htm. Accessed December 14, 2002.
85. Laws MB, Heckscher RA. Racial and ethnic identification practices in public health data systems in New England. Public Health Rep. 2002;117:50–61. [PMC free article] [PubMed]
86. Anderson NB. Levels of analysis in health science. A framework for integrating sociobehavioral and biomedical research. Ann N Y Acad Sci. 1998;840:563–576. [PubMed]
87. McMichael AJ. Prisoners of the proximate: loosening the constraints on epidemiology in an age of change. Am J Epidemiol. 1999;149:887–897. [PubMed]
88. Susser M, Susser E. Choosing a future for epidemiology, II: from black boxes to Chinese boxes and eco-epidemiology. Am J Public Health. 1996;86:674–677. [PMC free article] [PubMed]
89. McKinlay JB, Marceau LD. To boldly go. . . . Am J Public Health. 2000;90:25–33. [PMC free article] [PubMed]
90. Krieger N, Fee E. Measuring social inequalities in health in the United States: an historical review, 1900–1950. Int J Health Serv. 1996;26:391–418. [PubMed]
91. Friedman DJ, Hunter EL, Parrish RG. Shaping a vision of health statistics for the 21st century. Washington, DC: Centers for Disease Control and Prevention, National Center for Health Statistics, National Committee on Vital and Health Statistics; 2002. Available at: http://www.ncvhs.hhs.gov/hsvision. Accessed March 8, 2003.

Articles from American Journal of Public Health are provided here courtesy of American Public Health Association
PubReader format: click here to try


Related citations in PubMed

See reviews...See all...

Cited by other articles in PMC

See all...


Recent Activity

Your browsing activity is empty.

Activity recording is turned off.

Turn recording back on

See more...