Logo of nihpaAbout Author manuscriptsSubmit a manuscriptHHS Public Access; Author Manuscript; Accepted for publication in peer reviewed journal;
Cancer. Author manuscript; available in PMC 2015 Jan 8.
Published in final edited form as:
PMCID: PMC4287245
NIHMSID: NIHMS652331
PMID: 24889136

The Children’s Oncology Group Childhood Cancer Research Network (CCRN)

Case Catchment in the United States

Abstract

BACKGROUND

The Childhood Cancer Research Network (CCRN) was established within the Children’s Oncology Group (COG) in July 2008 to provide a centralized pediatric cancer research registry for investigators conducting approved etiologic and survivorship studies. The authors conducted an ecological analysis to characterize CCRN catchment at >200 COG institutions by demographic characteristics, diagnosis, and geographic location to determine whether the CCRN can serve as a population-based registry for childhood cancer.

METHODS

During 2009 to 2011, 18,580 US children newly diagnosed with cancer were registered in the CCRN. These observed cases were compared with age-specific, sex-specific, and race/ethnicity-specific expected numbers calculated from Surveillance, Epidemiology, and End Results (SEER) Program cancer incidence rates and 2010 US Census data.

RESULTS

Overall, 42% of children (18,580 observed/44,267 expected) who were diagnosed with cancer at age <20 years were registered in the CCRN, including 45%, 57%, 51%, 44%, and 24% of those diagnosed at birth, ages 1 to 4 years, ages 5 to 9 years, ages 10 to 14 years, and ages 15 to 19 years, respectively. Some malignancies were better represented in the CCRN (leukemia, 59%; renal tumors, 67%) than others (retinoblastoma, 34%). There was little evidence of differences by sex or race/ethnicity, although rates in nonwhites were somewhat lower than rates in whites.

CONCLUSIONS

Given the low observed-to-expected ratio, it will be important to identify challenges and barriers to registration to improve case ascertainment, especially for rarer diagnoses and older age groups; however, it is encouraging that some diagnoses in younger children are fairly representative of the population. Overall, the CCRN is providing centralized, real-time access to cases for research and could be used as a model for other national cooperative groups.

Keywords: childhood cancer, United States, clinical trials, incidence, catchment

INTRODUCTION

In the United States, approximately 15,780 incident cancers are diagnosed annually in individuals aged <20 years.1 It is noteworthy that cancer is the second leading cause of death among those aged <15 years in the United States.2 Although survival rates have improved substantially over the last several decades, incidence rates have either remained steady or increased, and the etiology of most childhood cancers remains largely unknown.3,4 Etiologic investigations of childhood cancer face challenges, primarily because of its relative rarity; thus, collaboration across many hospitals and institutions is essential to obtain sufficient numbers of cases to conduct highly powered studies.

In the United States and Canada, the Children’s Oncology Group (COG) was created in 2000 through a merger of 4 pediatric clinical trials groups: the Children’s Cancer Group (CCG), the Pediatric Oncology Group (POG), the National Wilms Tumor Study Group, and the Intergroup Rhabdomyosarcoma Study Group.5 Currently, there are more than 200 COG member institutions in the United States and Canada and a smaller number outside of North America (eg Australia). A previous ecologic study of catchment reported that approximately 94% of children aged <15 years who were diagnosed with cancer in the United States were seen at a POG or CCG institution.6 Nevertheless, in a 1-to-1 record linkage study between CCG/POG cases and cases from the National Cancer Institute’s Surveillance, Epidemiology, and End Results (SEER) Program for an adjacent time period (1992–1997), Liu et al reported lower enrollment rates of approximately 71% for children aged <15 years and 24% for adolescents ages 15 to 19 years.7

In recent decades, case recruitment in the pediatric clinical trials groups was not conducive to nontherapeutic studies because of administrative burdens in gaining local institutional review board approvals, difficulty obtaining local institutional consent from families, and lower priority compared with clinical trials. The Childhood Cancer Research Network (CCRN) was established within COG in 2008 to ameliorate some of the logistic and regulatory challenges encountered. The CCRN is a collaborative effort to register all patients with childhood cancer (diagnosis at age <20 years) who are treated at a COG institution in the United States and Canada through the addition of an informed authorization process to the already established COG patient registration system.8 Newly diagnosed patients in COG and/or their parents are presented with 3 enrollment options: 1) refuse CCRN participation entirely, 2) register with the CCRN but refuse future contact for research purposes, and 3) both register with the CCRN and agree to possible future contact about participating in nontherapeutic research studies.8

Although the CCRN is presumed to be the sole source for conducting (nearly) population-based etiologic studies of childhood cancers in the United States, given the lack of a national pediatric cancer registry, case capture has not been examined before the current analysis. A pilot study of 1990 patients diagnosed between March 2002 and April 2006 suggests that CCRN enrollment rates are high once patients and/or their guardian are identified and approached for registration by staff at individual COG institutions, with 96% (n = 1901) providing consent to release of personal identifiers and potential future contact, 3% (n = 65) agreeing to release of identifiers only, and only 1% (n = 24) refusals.8 Although these percentages are encouraging, data are not available to confirm that all eligible patients and their parents are being approached for participation in the CCRN. Gaps in ascertainment of important subgroups based on patient age, sex, race/ethnicity, or geographic region also remain unknown. We assessed the completeness of enrollment in the CCRN registry during the first 3 years of operation (January 1, 2009 through December 31, 2011), both overall and by demographic and geographic characteristics. We hypothesized that the group ages 15 to 19 years would be under-represented in all geographic areas, given that many of these patients are seen by adult oncologists. In addition, we hypothesized that patients from rural geographic areas with limited access to a COG institution would be under-represented.

MATERIALS AND METHODS

To assess the catchment of the CCRN registry, we compared the observed numbers of pediatric cancer cases in the CCRN with the expected numbers based on data from SEER and the 2010 US Census Bureau.9,10 Observed-to-expected ratios were calculated overall and according to International Classification of Childhood Cancer (ICCC) category,11 geographic region, sex, race/ethnicity, and age at diagnosis. Analyses also were conducted of selected cancer subtypes to evaluate whether the case population in the CCRN is similar to the SEER population.

Observed Numbers

In total, 18,580 patients were diagnosed with incident pediatric cancers at US COG institutions between January 1, 2009 and December 31, 2011 and were enrolled in the CCRN. Any new patient who attended a COG institution with a cancer diagnosis who had an International Classification of Diseases for Oncology histologic behavior code of 2 (carcinoma in situ) or 3 (malignant) was eligible for CCRN enrollment. In addition, central nervous system (CNS) tumors were included regardless of histologic behavior. In this analysis, to be more consistent with the cases included in SEER, we excluded non-CNS tumors with a histology behavior code of 2, and only a small number (n = 5) of CCRN registrants had tumors with that code. For each patient, we obtained data on age, sex, age (birth date), race/ethnicity, treating institution, tumor histology/location, and zip code. Institutional review boards at all participating COG institutions approved the CCRN.

Missing Data

Because excluding patients who had missing zip code, race/ethnicity, age, or sex data would result in an under-representation of observed data and would artificially attenuate observed-to-expected ratios, we created an algorithm using the SAS software package (version 9.2; SAS Institute, Inc., Cary, NC) to impute and assign missing values based on the proportional distribution of the missing factor at the treating institution. In this way, assignment of the missing factor is done randomly, like flipping a coin, but the coin is biased to favor the more common outcome. For each missing factor, each patient would be randomly assigned a number drawn from a uniform distribution between zero and 1. Cutoff points proportional to the distribution of levels of the missing factor would then determine the assignment of the missing factor. For example, if the missing factor was sex and the treating institution had 10 female patients and 5 male patients, then the missing sex would be assigned as male if the random number was between zero and 0.33 and as female if the number was between 0.34 and 1.0, because approximately 33% of the patients at that institution were male and approximately 66% were female.

Expected Numbers

Age-specific, sex-specific, race-specific (white vs non-white), and ethnicity-specific (Hispanic vs non-His-panic) populations in the United States and age-specific, sex-specific, race-specific, and histology-specific incidence rates were used to calculate expected counts.9,10 Using nationally based SEER rates would have resulted in a loss of regional specificity in expected rates. Age-specific, sex-specific, and race/ethnicity-specific population counts were obtained from the 2010 US Census for each of the 33,120 zip code tabulation areas.12 Age-specific (ages <1 year, 1–4 years, 5–9 years, 10–14 years, and 15–19 years), race-specific (white vs nonwhite), ethnicity-specific (Hispanic vs non-Hispanic), diagnosis-specific (leukemia, lymphoma, CNS, peripheral nervous system, retinoblastoma, hepatic, renal, bone, soft tissue, and germ cell/gonadal), and sex-specific cancer incidence rates for the years 2007 to 2009 were obtained from the SEER Program (all 18 registries), which collects incidence and survival data from population-based cancer registries that cover approximately 28% of the total US population.13

For the zip code-specific analyses, we used the method described by Ross et al.6 Specifically, the assignment of each case to a SEER registry was based on whichever registry was most similar to the patient’s location both in terms of geographic proximity, as determined by distance, and population composition, as determined by a US almanac. SEER registry-specific rates were applied to produce more locally representative expected numbers for selected geographic areas, whereas combined rates of all SEER registries were applied to areas in which the appropriate specific registry was not clear. For example, all zip codes in the State of Iowa had the rates for the Iowa registry applied, whereas we used combined SEER registry rates for zip codes in Arkansas. The assignment of an area to a specific SEER registry was based on both the proximity to the registry as well as the similarity in population composition.

Age-specific, sex-specific, and race/ethnicity-specific expected numbers were computed by multiplying the census-based subpopulation counts for a given geographic area by the corresponding SEER-tabulated subpopulation cancer incidence rates. We then multiplied each expected number by 3.0, because we were comparing expected numbers with 3 years of observed data from the CCRN. Observed numbers were divided by expected numbers to produce observed-to-expected ratios, and corresponding 95% confidence intervals (CIs) were calculated following the method described by Vandenbroucke.14

Geographic Comparisons

A detailed description of case assignment to a particular SEER registry is provided online (see online supporting information). Briefly, we defined geographic areas as follows: First, we assigned all Metropolitan Statistical Areas defined in the 2010 US Census that had a population of at least 100,000 individuals aged <20 years as geographic unit. Next, to produce large enough population totals to yield a meaningful analysis, we grouped the remaining zip codes on the basis of geographic contiguity so that larger areas with approximately 100,000 individuals aged <20 years were produced. In total, 337 areas were defined using this method. Observed-to-expected ratios and 95% CIs were computed for each area according to the methods described above. Cases were assigned to a geographic area based on zip code at diagnosis.

Histologic and Topographic Subtypes

For an assessment of case representation in the CCRN, we also conducted cancer histology-specific and topography-specific analyses. We performed subanalyses for the most common forms of the most common childhood cancers (leukemia: acute lymphoblastic leukemia and acute myeloid leukemia; lymphoma: Hodgkin lymphoma and non-Hodgkin lymphoma [NHL]; CNS tumors: astrocytoma, primitive neuroectodermal tumors, other gliomas, and ependymoma). For disease sites, we chose cancers with high enough incidence rates to have stable expected and observed numbers (ie, >5 expected cases). Hence, we performed subanalyses by site for osteosarcoma and neuroblastomas.

RESULTS

Of the 18,580 patients with childhood cancers identified from the CCRN, 44.2% were female, 25.5% were non-white, and 23.1% were Hispanic (Table 1). Observed-to-expected CCRN enrollment ratios were similar for His-panics and non-Hispanics, whereas the ratio for nonwhites was lower than that for whites. The enrollment ratio was lower in males than in females.

TABLE 1

Sex-Specific, Race-Specific, Ethnicity-Specific, and Age-Specific Observed and Expected Values of Pediatric Cancer from January 2009 to December 2011 in the United States

VariableObserved
Expected
Observed/Expected Ratio95% CI
No.%No.%
Sex
 Males10,36555.4323,87653.940.460.45–0.47
 Females821544.5720,39146.060.430.42–0.45
Race
 White13,84174.4930,87069.740.450.44–0.46
 Other473925.5113,39730.260.350.34–0.36
Ethnicity
 Hispanic429023.09993722.450.430.42–0.44
 Non-Hispanic14,29076.9134,33077.550.420.41–0.43
Age group, y
 <113027.0128716.490.450.43–0.48
 1–4615133.1110,72724.230.570.55–0.59
 5–9399021.47780617.630.510.49–0.53
 10–14371720.01853419.280.440.42–0.45
 15–19342018.4114,32932.370.240.22–0.26

Abbreviations: CI, confidence interval.

Complete data were available for 17,651 (95%) of the identified patients. Missing data included zip code (n = 316; 1.7%), race (n = 743; 4%), ethnicity (n = 613; 3.3%), age at diagnosis (n = 167; 0.09%), or sex (n = 93; 0.05%), and 111 patients (0.6%) were missing more than 1 item. Missing values were imputed as described above.

Across all age groups, substantially fewer patients were enrolled compared with the expected numbers (Table 1). When the analysis was restricted to patients aged <15 years, the 15,160 cases observed represented only 50% of the expected total. With the exception of infants, there was also a trend of decreasing percentage of expected cases with increasing age: The group ages 15 to 19 years had substantially lower observed enrollment compared with all younger age groups.

Figure 1 illustrates the observed-to-expected ratios for all childhood cancers combined across all ages and races by geographic area. For example, the shading for Minnesota indicates that the majority of that state had 75% to 100% of the expected numbers of cases; whereas the lightest shaded regions—areas of Nevada, Arizona, Kansas, northern Oklahoma, and Arkansas and regions of Georgia—indicate areas with the lowest observed-to-expected ratios.

An external file that holds a picture, illustration, etc.
Object name is nihms652331f1.jpg

Observed-to-expected ratios of pediatric cancer cases in the United States enrolled in the Children’s Oncology Group Childhood Cancer Research Network from January 2009 to December 2011 are illustrated according to location of residence at diagnosis. The 4 levels of solid shaded areas reflect areas with observed-to-expected ratios of <0.25 (lightest shading), from ≥0.25 to <0.5, from ≥0.5 to <0.75, and ≥0.75 (darkest shading), whereas white areas represent regions with insufficient numbers of expected cases to generate reliable estimates.

Table 2 provides observed-to-expected ratios by ICCC subtype and age group. Some variation was present with respect to ICCC subtype. For example, the ratios for renal tumors were closer to 1 overall, whereas those for retinoblastoma were lower. Ratios for the groups ages 1 to 4 years and 5 to 9 years at diagnosis tended to be higher than ratios for the other age groups. Comparison of the disease subtype and tumor location for the cancers with sufficient numbers suggested some differences between the CCRN data and the SEER registry data (Table 3). For acute leukemia, 85.2% of the CCRN cases were of lymphoid lineage, and 14.8% were acute myeloid leukemia. This is similar to the distribution in SEER, in which lymphoid and myeloid leukemias comprise 80.2% and 19.8% of childhood acute leukemia diagnoses, respectively. Likewise, the distributions of CNS tumors were similar in CCRN and SEER. In contrast, 53.3% of CCRN lymphoma cases were NHL (excluding Burkitt lymphoma), which is much higher than in SEER, in which NHL comprises 39.2% of childhood lymphoma diagnoses. Table 3 provides the location-specific subanalyses for neuroblastoma and osteosarcoma. Overall, the distributions were similar between the CCRN and SEER, with the exception of over-representation of mediastinum neuroblastoma (16.2% of CCRN cases vs 7.3% of SEER cases) and under-representation of peripheral neuroblastoma (3.8% of CCRN cases vs 18.7% of SEER cases).

TABLE 2

Age-Specific and Site-Specific Observed and Expected Values of Pediatric Cancer From January 2009 to December 2011 in the United States

Cancer Type According to Age Group, ya,bObserved
Expected
Observed/Expected Ratio95% CI
No.%No.%
I. Leukemias, myeloproliferative diseases, and myelodysplastic diseases
 <12633.676205.100.420.37–0.48
 1–4299841.89457437.630.660.63–0.68
 5–9176524.66264821.790.670.64–0.70
 10–14122917.17211217.370.580.55–0.62
 15–1990212.60219918.070.410.38–0.44
II. Lymphomas and reticuloendothelial neoplasms
 <1150.61610.960.240.14–0.39
 1–42219.034507.100.490.43–0.56
 5–942717.45100115.760.430.39–0.47
 10–1472529.63154624.360.470.43–0.50
 15–19105943.28329251.850.320.30–0.34
III. CNS and miscellaneous intracranial and intraspinal neoplasms
 <11636.174636.040.350.30–0.41
 1–486232.65210627.510.410.38–0.44
 5–979230211627.630.370.35–0.40
 10–1453420.23163721.380.330.30–0.36
 15–1928910.45133417.430.220.19–0.24
IV. Neuroblastoma and other peripheral nervous cell tumors
 <140532.0258229.760.700.63–0.77
 1–465251.5498950.560.660.61–0.71
 5–915712.4124912.710.630.53–0.74
 10–14342.69844.280.410.28–0.56
 15–19171.34542.690.330.18–0.49
V. Retinoblastoma
 <112345.0533140.590.370.31–0.44
 1–413750.1842651.990.320.27–0.38
 5–9114.03424.800.280.13–0.44
 10–1410.37121.290.100.00–0.33
 15–1910.3700--
VI. Renal tumors
 <112110.8319111.380.640.52–0.75
 1–468761.5094256.130.730.67–0.79
 5–924121.5833720.150.710.63–0.81
 10–14423.76593.540.710.51–0.95
 15–19262.331448.580.180.12–0.26
VII. Hepatic tumors
 <16722.9511318.650.600.46–0.75
 1–415653.4229749.320.530.44–0.61
 5–9248.22406.490.610.38–0.87
 10–143010.27579.260.540.35–,0.74
 15–19155.149515.710.160.09–0.25
VIII. Malignant bone tumors
 <150.4570.310.730.22–1.50
 1–4454.03662.970.680.49–0.90
 5–918916.9237616.870.500.43–0.58
 10–1446641.7290640.590.510.47–0.56
 15–1941236.8887639.250.470.43–0.52
IX. Soft tissue and other extraosseous sarcomas
 <11038.122297.240.450.37–0.54
 1–427021.2956317.880.480.42–0.54
 5–926020.5054417.270.480.42–0.54
 10–1434327.0574323.590.460.41–0.51
 15–1931024.44106633.860.290.26–0.32
X. Germ cell tumors, trophoblastic tumors, and neoplasms of gonads
 <1467.742367.430.200.14–0.26
 1–46811.452016.300.340.26–0.43
 5–96310.612186.850.290.22–0.37
 10–1415826.6047214.890.340.28–0.39
 15–1925943.60203264.310.130.11–0.14

Abbreviations: CI, confidence interval; CNS, central nervous system.

aTumor categories were defined according to the International Classification of Childhood Cancer, 3rd edition.10
bAge group was based on age at diagnosis.

TABLE 3

Location-Specific Counts of Childhood Cancers From the Surveillance, Epidemiology, and End Results Program and the Children’s Oncology Group Childhood Cancer Research Network

Cancer Type and LocationNo. of Patients (%)
SEERCCRN
Leukemia
 ALL2514 (81.97)3196 (74.71)
 AML553 (18.03)1082 (25.29)
 Total3067 (100)4278 (100)
Lymphoma
 HL932 (60.80)1007 (46.69)
 NHL601 (39.20)1150 (53.31)
 Total1533 (100)2157 (100)
Neuroblastoma
 Mediastinum41 (7.31)137 (16.16)
 Peripheral nerves105 (18.72)32 (3.77)
 Retroperitoneum73 (13.01)165 (19.46)
 Connective tissue subcutaneous41 (7.31)37 (4.36)
 Kidney, unspecified12 (2.14)20 (2.36)
 CNS20 (3.57)10 (1.18)
 Adrenal269 (47.95)447 (52.71)
 Total561 (100)848 (100)
Osteosarcoma
 Long bone upper limb50 (14.12)85 (14.43)
 Long bone lower limb290 (81.92)489 (83.02)
 Pelvis, sacrum, coccyx14 (3.95)15 (2.55)
 Total354 (100)589 (100)

Abbreviations: ALL, acute lymphoblastic leukemia; AML, acute myeloid leukemia; CCRN, Children’s Oncology Group Childhood Cancer Research Network; CNS, central nervous system; HL, Hodgkin lymphoma; NHL, non-Hodgkin lymphoma; SEER, Surveillance, Epidemiology, and End Results.

DISCUSSION

In this study, we estimated the disparity between anticipated and actual enrollment of childhood cancers in the COG CCRN to quantify how representative registrants are of all childhood cancer cases in the US population and to characterize cases identified using the newly implemented consent system. We observed low observed-to-expected enrollment ratios for nearly all patient subgroups examined, although there was substantial variation by geographic location. When we analyzed subtype-specific and age-specific observed-to-expected ratios (Table 2), there were some differences with respect to disease subtype. For example, ratios were highest for leukemias, peripheral nervous system tumors, and renal tumors; whereas ratios were lowest for germ cell tumors, CNS tumors, and retinoblastomas. Overall, the groups ages 1 to 4 years and 5 to 9 years at diagnosis groups tended to have observed-to-expected ratios closest to 1, whereas the ratios were lowest for the oldest (15–19 years) and youngest (<1 year) age groups.

Although ratios were well below 1 for all cancer subtypes, ages, race/ethnicity groups, and sexes, there were some noteworthy trends. Age at diagnosis after infancy demonstrated a downward trend in enrollment, and the group ages 15 to 19 years had substantially lower ratios than all younger age groups. This is consistent with earlier reports15,16 and is attributed to the finding that older adolescents are often treated by adult oncologists.15 This difference in enrollment by age group may be responsible in part for some of the trends by cancer subtype. For example, a majority of germ cell tumors are diagnosed in adolescent boys16; thus, the predicted ratios for this tumor type would be much lower.

The identification of sufficient numbers of patients with childhood and adolescent cancers for the conduct of well powered, informative research studies is a global issue. In the United States, where there is no national cancer registry, the high-quality SEER Program data have historically been used to monitor trends in incidence and survival, both overall and among demographic and clinical subgroups.3,4,1720 More recently, the National Program of Cancer Registries of the Centers for Disease Control and Prevention has partnered with SEER in assembling data covering >90% of the US population,21 which will allow for increased power to monitor rarer subgroups as additional annual data are amassed. Similarly, the European Union developed the Automated Childhood Cancer Information System to conduct cancer incidence and survival surveillance in the region; the original data spanned from 1970 to 199922,23 and are currently being updated (available at: http://accis.iarc.fr/callfor-data/index.php; Accessed July 17, 2013). In addition, the EUROCARE Project has more recent population-based survival data (1978–2007).24 Some investigators, such as those in the United Kingdom and in Nordic countries, have been able to leverage national identification numbers to link data from birth, hospital, and cancer registries to conduct etiologic studies.25,26 Unlike the CCRN, those registries have nearly complete case ascertainment; however, studies based on those data are limited in the questions that can be answered based on which data are present. In addition, linked biologic specimens are not generally readily available for these cases, with some exceptions (eg, dried blood spots in some Nordic countries27). Thus, the collaboration of epidemiologists, bio-statisticians, and clinicians at treating institutions in cooperative networks like the COG remains the most promising avenue for large-scale nontherapeutic research, particularly because the networks also are needed for therapeutic studies.

Currently, the CCRN is the best source for conducting etiologic studies of childhood cancers in the United States. Because pilot data suggest that enrollment rates are high once patients and/or their guardian are identified and approached for enrollment,8 it is likely that the observed gap in ascertainment is because not all eligible patients and their families are being approached by COG institutions regarding CCRN participation. The differences observed by cancer subtype could be because of differential referral patterns—for example, patients with retinoblastoma being treated more often by an eye specialist—or perhaps because of increased recruitment of particular subtypes for which there were also concurrent open COG treatment protocols. The very low ratios in the group ages 15 to 19 years at diagnosis suggest that this subgroup requires particular attention in the future. In subgroup analyses of the most common cancers, the CCRN case distribution was similar to national rates with respect to leukemia and CNS tumors,11 but not lymphoma, in which there was over-representation of NHL in the CCRN. It is noteworthy that lymphomas tend to present later in life compared with leukemias; thus, enrollment data for the lymphoma patient population may not be as representative because of declines in registration with increasing patient age.28 Indeed, the observation that 66% of patients with Hodgkin lymphoma were in the group ages 15 to 19 years at diagnosis, whereas those with NHL comprised more patients aged <10 years at diagnosis, helps to explain this discrepancy.

Because of the low ratios, the CCRN cannot currently be considered a population-based pediatric cancer registry. The primary importance of being population-based lies in the theoretical framework of the classic case-control etiologic study—the only practicable study design for rare diseases like childhood cancer. In such a study, when the goal is to make inferences regarding exposures and disease in a given population, the controls must be drawn from the population that gave rise to the cases.29,30 That is, anyone who would have been identified as a case had they had developed the disease in question must have a chance of being selected as a control. Hence, a population-based registry is ideal, because corresponding control participants may be sampled from the general population rather than identifying a complex sampling frame that defines the source population (for example, defining the catchment area of a particular hospital where all of the cases are being recruited).

Nevertheless, the CCRN remains a useful tool for the etiologic study of childhood malignancies. Although the CCRN is not population-based, it does not necessarily follow that the CCRN is biased in its sampling. Studies with underascertainment or low enrollment still can provide unbiased effect estimates as long as the participation rates are not associated with prognosis or the exposure of interest.30,31 The assumption that study enrollment is not associated with an exposure of interest may have been more tenuous in COG-sponsored etiologic studies in the past, when exposures of interest were behavioral or environmental exposures measured through parental report, such as maternal alcohol consumption or diet during pregnancy—factors that carry sociocultural implications with a potential impact on a family’s decision to participate in a study.30,32 However, a large number of current etiologic studies of childhood cancer that use the CCRN focus on biologic factors like common genetic polymorphisms and epigenetics, for which it is highly unlikely that participation would be differential. Furthermore, many new study designs are family based, including the case-parent–triad design, in which traditional controls are not selected.33 These shifts in epidemiologic study design mean that the patients currently being enrolled in the CCRN are suitable for nontherapeutic studies.

The CCRN provides researchers with a substantial number of potential cases for diseases that are extremely rare. For example, in 3 years, the CCRN enrolled greater than 1000 children with a bone tumor, nearly 750 with a germ cell tumor or other gonadal tumor, and nearly 300 with retinoblastoma. Nearly all families (approximately 95%) have agreed to future contact regarding new research and, thus, are eligible to be contacted for participation in newly developed studies. Such studies would be extremely well powered compared with previous studies conducted on these malignancies in the United States, with sample sizes ranging from 24 to 278 patients.3436

Our study is the first to assess the completeness of CCRN enrollments. The use of the 2010 Census, and particularly the zip code tabulation area, to determine geographic areas allowed for improved matching of observed and expected populations, thereby improving the precision of the estimated expected counts. The SEER registry remains the best source for cancer rates in the United States. That we also were able to use multiple, recent, and overlapping years of enrollment in the CCRN as well as data from SEER means it is unlikely that our results are biased because of particularly low or high rates in a single year.

There are also some important limitations that must be considered. Of primary concern is the incomplete 1-to-1 mapping of case counts to expected rates by geographic area, because SEER does not universally capture every area of the United States; hence, there could be some accuracy lost in estimating the expected counts. We used region-specific rates only when an appropriate registry could be identified; otherwise, we used SEER averages.

Despite the overall low observed enrollment rates, the CCRN, if maintained and improved, will remain a valuable tool in the study of childhood cancer. Given that there is no comparable mechanism currently in place as an alternative, there is a need to continue enrollment to allow researchers to access willing patients as new etiologic questions are developed. The CCRN is an especially valuable source for extremely rare cancers, and particular attention should be paid to enrolling these patients. The CCRN currently captures minimal data (eg patient’s sex, age [birth date], race/ethnicity, treating institution, tumor histology/location, and zip code) because of limitations in personnel for collecting additional information at individual institutions. Etiologic investigations of childhood cancer using the CCRN rely on external funding to support and collect important environmental, biologic, and genetic information directly from the families. Ideally, in the future, the CCRN could capture additional etiologic information that could be useful in and of itself or as a basis for selecting patients for study (eg minimal family history, twinship, birth weight, place of birth, parental age, etc). Notably, there are plans in place in the COG to activate a new protocol, called EveryChild, which would include the CCRN current process along with biospecimen collection at the time of enrollment.

In conclusion, despite the overall low observed-to-expected ratios, the current results provide useful insight into the enrollment patterns by geography, age, sex, race, and histology in the completeness of case ascertainment of childhood cancer in the CCRN. This analysis demonstrates how the CCRN population differs from the entire population of childhood and adolescent cancers in the United States, so that researchers can adapt their enrollment strategies as needed to capture the patient population of interest. Steps must now be taken to improve the enrollment rates of the CCRN to maximize its utility as a research tool and fully realize its potential as a resource in the study of childhood cancer etiology.

Supplementary Material

Supplementary Information

Acknowledgments

FUNDING SUPPORT

This work was supported by grants from the National Institutes of Health (T32 CA099936, K05 CA157439, and U10 CA98543), the CureSearch Foundation, and the Children’s Cancer Research Fund (Minneapolis, Minn).

Footnotes

CONFLICT OF INTEREST DISCLOSURES

The authors made no disclosures.

Additional Supporting Information may be found in the online version of this article.

References

1. American Cancer Society; American Cancer Society, editor. Cancer Facts & Figures 2014. Atlanta, GA: American Cancer Society; 2014. Special section: cancer in children & adolescents; pp. 25–42.
2. Siegel R, Naishadham D, Jemal A. Cancer statistics, 2013. CA Cancer J Clin. 2013;63:11–30. [PubMed]
3. Linabery AM, Ross JA. Childhood and adolescent cancer survival in the US by race and ethnicity for the diagnostic period 1975–1999. Cancer. 2008;113:2575–2596. [PMC free article] [PubMed]
4. Linabery AM, Ross JA. Trends in childhood cancer incidence in the US (1992–2004) Cancer. 2008;112:416–432. [PubMed]
5. O’Leary M, Krailo M, Anderson JR, Reaman GH. Progress in childhood cancer: 50 years of research collaboration, a report from the Children’s Oncology Group. Semin Oncol. 2008;35:484–493. [PMC free article] [PubMed]
6. Ross JA, Severson RK, Pollock BH, Robison LL. Childhood cancer in the United States. A geographical analysis of cases from the Pediatric Cooperative Clinical Trials groups. Cancer. 1996;77:201–207. [PubMed]
7. Liu L, Krailo M, Reaman GH, Bernstein L. Childhood cancer patients’ access to cooperative group cancer programs: a population-based study. Cancer. 2003;97:1339–1345. [PubMed]
8. Steele JR, Wellemeyer AS, Hansen MJ, Reaman GH, Ross JA. Childhood cancer research network: a North American Pediatric Cancer Registry. Cancer Epidemiol Biomarkers Prev. 2006;15:1241–1242. [PubMed]
9. Surveillance, Epidemiology, and End Results (SEER) Program. SEER*Stat Database: Incidence-SEER 18 Regs Research Data + Hurricane Katrina Impacted Louisiana Cases, November 2012 Submission (2000–2010) <Katrina/Rita Population Adjustment>-Linked to County Attributes-Total US, 1969–2011 Counties. Bethesda, MD: National Cancer Institute, DCCPS, Surveillance Research Program, Surveillance Systems Branch; [Accessed March 7, 2014]. [released April 2013, based on the November 2012 submission]. Available at: www.seer.cancer.gov.
10. US Census Bureau. 2010 Census Summary File 1. Bethesda, MD: US Census Bureau; 2012.
11. Steliarova-Foucher E, Stiller C, Lacour B, Kaatsch P. International Classification of Childhood Cancer, third edition. Cancer. 2005;103:1457–1467. [PubMed]
12. US Census Bureau. 2010 US Census Data. Washington, DC: US Census Bureau; 2010. [Accessed August 17, 2013]. Available at: http://2010.census.gov/2010census/data.
13. Howlader N, Noone AM, Krapcho M, et al. SEER Cancer Statistics Review, 1975–2010. Bethesda, MD: National Cancer Institute; 2012. [Accessed June 15, 2013]. [based on the November 2012 SEER data submission, posted to the SEER web site, April 2013]. Available at: http://seer.cancer.gov/csr/1975_2010/
14. Vandenbroucke J. A shortcut method for calculating the 95 percent confidence interval of the standardized mortality ratio. Am J Epidemiol. 1982;115:303–304.
15. Bleyer WA. The impact of childhood cancer on the United States and the world. CA Cancer J Clin. 1990;40:355–367. [PubMed]
16. Poynter JN, Amatruda JF, Ross JA. Trends in incidence and survival of pediatric and adolescent patients with germ cell tumors in the United States, 1975 to 2006. Cancer. 2010;116:4882–4891. [PMC free article] [PubMed]
17. Breslow NE, Langholz B. Childhood cancer incidence: geographical and temporal variations. Int J Cancer. 1983;32:703–716. [PubMed]
18. Ries LG, Pollack ES, Young JL., Jr Cancer patient survival: Surveillance, Epidemiology, and End Results Program, 1973–1979. J Natl Cancer Inst. 1983;70:693–707. [PubMed]
19. Gurney JG, Severson RK, Davis S, Robison LL. Incidence of cancer in children in the United States. Sex-, race-, and 1-year age-specific rates by histologic type. Cancer. 1995;75:2186–2195. [PubMed]
20. Linet MS, Ries LA, Smith MA, Tarone RE, Devesa SS. Cancer surveillance series: recent trends in childhood cancer incidence and mortality in the United States. J Natl Cancer Inst. 1999;91:1051–1058. [PubMed]
21. Li J, Thompson TD, Miller JW, Pollack LA, Stewart SL. Cancer incidence among children and adolescents in the United States, 2001–2003 [serial online] Pediatrics. 2008;121:e1470–e1477. [PubMed]
22. Steliarova-Foucher E, Stiller C, Kaatsch P, et al. Geographical patterns and time trends of cancer incidence and survival among children and adolescents in Europe since the 1970s (the ACCIS project): an epidemiological study. Lancet. 2004;364:2097–2105. [PubMed]
23. Steliarova-Foucher E, Kaatsch P, Lacour B, et al. Quality, comparability and methods of analysis of data on childhood cancer in Europe (1978–1997): report from the Automated Childhood Cancer Information System project. Eur J Cancer. 2006;42:1915–1951. [PubMed]
24. Gatta G, Botta L, Rossi S, et al. Childhood cancer survival in Europe 1999–2007: results of EUROCARE-5—a population-based study. Lancet Oncol. 2014;15:35–47. [PubMed]
25. Bjorge T, Sorensen HT, Grotmol T, et al. Fetal growth and childhood cancer: a population-based study [serial online] Pediatrics. 2013;132:e1265–e1275. [PMC free article] [PubMed]
26. Kendall GM, Little MP, Wakeford R, et al. A record-based case-control study of natural background radiation and the incidence of childhood leukaemia and other cancers in Great Britain during 1980–2006. Leukemia. 2013;27:3–9. [PMC free article] [PubMed]
27. Norgaard-Pedersen B, Hougaard DM. Storage policies and use of the Danish Newborn Screening Biobank. J Inherit Metab Dis. 2007;30:530–536. [PubMed]
28. Percy CL, Smith MA, Linet M, Ries LAG, Friedman DL. Lymphomas and reticuloendothelial neoplasms. In: Ries LAG, Smith MA, Gurney JG, et al., editors. Cancer Incidence and Survival Among Children and Adolescents: United States SEER Program 1975–1995. Bethesda, MD: National Cancer Institute, SEER Program; 1999. pp. 35–49.
29. Szklo M, Nieto J. Epidemiology: Beyond the Basics. 2. Boston, MA: Jones and Bartlett Publishers; 2006.
30. Rothman KJ, Greenland S, Lash TL. Modern Epidemiology. 3. Philadelphia, PA: Wolters Kluwer Health/Lippincott Williams & Wilkins; 2008.
31. Greenland S, Robins JM. Identifiability, exchangeability, and epidemiological confounding. Int J Epidemiol. 1986;15:413–419. [PubMed]
32. Galea S, Tracy M. Participation rates in epidemiologic studies. Ann Epidemiol. 2007;17:643–653. [PubMed]
33. Weinberg CR. Less is more, except when less is less: studying joint effects. Genomics. 2009;93:10–22. [PMC free article] [PubMed]
34. Shu XO, Nesbit ME, Buckley JD, Krailo MD, Robinson LL. An exploratory analysis of risk factors for childhood malignant germ-cell tumors: report from the Children’s Cancer Group (Canada, United States) Cancer Causes Control. 1995;6:187–298. [PubMed]
35. Shankar S, Davies S, Giller R, et al. In utero exposure to female hormones and germ cell tumors in children. Cancer. 2006;106:1169–1177. [PubMed]
36. Fustino N, Rakheja D, Ateek CS, Neumann JC, Amatruda JF. Bone morphogenetic protein signalling activity distinguishes histological subsets of paediatric germ cell tumours [serial online] Int J Androl. 2011;34(4 pt 2):e218–e233. [PMC free article] [PubMed]