NCBI » Bookshelf » Health Services/Technology Assessment Text (HSTAT) » AHRQ Evidence Reports » Evaluation of Cervical Cytology: Evidence Report/Technology Assessment Number 5
 
hserta
AHRQ Evidence Reports
public health

Chapter  5:  Evaluation of Cervical Cytology: Evidence Report/Technology Assessment Number 5

A6784

THIS EVIDENCE REPORT IS OUTDATED AND IS NO LONGER VIEWED AS GUIDANCE FOR CURRENT MEDICAL PRACTICE. IT IS MAINTAINED FOR ARCHIVAL PURPOSES ONLY.

Prepared for:
Agency for Health Care Policy and Research

U.S. Department of Health and Human Services
2101 East Jefferson Street
Rockville, MD 20852
http://www.ahcpr.gov

Contract No. 290-97-0014

Prepared by:
Duke University, Durham, NC
Douglas C. McCrory, MD, MHSc
David B. Matchar, MD
Co-Project Directors
Lori Bastian, MD
Santanu Datta, MS, MBA
Victor Hasselblad, PhD
Jason Hickey, MS
Evan Myers, MD, MPH
Kavita Nanda, MD
Investigators

AHCPR Publication No. 99-E010

February 1999

THIS EVIDENCE REPORT IS OUTDATED AND IS NO LONGER VIEWED AS GUIDANCE FOR CURRENT MEDICAL PRACTICE. IT IS MAINTAINED FOR ARCHIVAL PURPOSES ONLY.

Prepared for:
Agency for Health Care Policy and Research

U.S. Department of Health and Human Services
2101 East Jefferson Street
Rockville, MD 20852
http://www.ahcpr.gov

Contract No. 290-97-0014

Prepared by:
Duke University, Durham, NC
Douglas C. McCrory, MD, MHSc
David B. Matchar, MD
Co-Project Directors
Lori Bastian, MD
Santanu Datta, MS, MBA
Victor Hasselblad, PhD
Jason Hickey, MS
Evan Myers, MD, MPH
Kavita Nanda, MD
Investigators

AHCPR Publication No. 99-E010

February 1999

Preface

The Agency for Health Care Policy and Research (AHCPR), through its Evidence-based Practice Centers (EPCs), sponsors the development of evidence reports and technology assessments to assist public- and private-sector organizations in their efforts to improve the quality of health care in the United States. The reports and assessments provide organizations with comprehensive, science-based information on common, costly medical conditions and new health care technologies. The EPCs systematically review the relevant scientific literature on topics assigned to them by AHCPR and conduct additional analyses when appropriate prior to developing their reports and assessments.

To bring the broadest range of experts into the development of evidence reports and health technology assessments, AHCPR encourages the EPCs to form partnerships and enter into collaborations with other medical and research organizations. The EPCs work with these partner organizations to ensure that the evidence reports and technology assessments they produce will become building blocks for health care quality improvement projects throughout the Nation. The reports undergo peer review prior to their release.

AHCPR expects that the EPC evidence reports and technology assessments will inform individual health plans, providers, and purchasers as well as the health care system as a whole by providing important information to help improve health care quality.

We welcome written comments on this evidence report. They may be sent to: Director, Center for Practice and Technology Assessment, Agency for Health Care Policy and Research, 6010 Executive Blvd., Suite 300, Rockville, MD 20852.

John M. Eisenberg, M.D.Douglas B. Kamerow, M.D.
AdministratorDirector, Center for Practice and Technology Assessment
Agency for Health Care Policy and ResearchAgency for Health Care Policy and Research
The authors of this report are responsible for its content. Statements in the report should not be construed as endorsement by the Agency for Health Care Policy and Research or the U.S. Department of Health and Human Services of a particular drug, device, test treatment, or other clinical service.

Structured Abstract

Evaluation of Cervical Cytology

Objective

This report compares new technologies for cervical cytological screening with conventional Papanicolaou (Pap) test screening in terms of diagnostic accuracy, costs, effectiveness, and cost-effectiveness in adult women of average cervical cancer risk

Search Strategy

Published literature on the accuracy of cervical cytological screening, costs of screening and treatment, and cost-effectiveness were identified in MEDLINE, CINAHL, CancerLit, EconLit, HealthSTAR, and EMBASE databases.

Selection Criteria

Diagnostic test studies were included if they compared cervical cytology diagnosis with concurrent colposcopy or biopsy and provided estimates of sensitivity and specificity. For the new technologies, studies were also included that used a cytology reference standard and allowed estimation of either sensitivity or specificity. Articles on costs and health outcomes were selected if they assessed the effect of screening on life expectancy or quality of life, number of cases of cervical cancer, or total health care costs.

Data Collection and Analysis

For diagnostic test studies, paired reviewers independently abstracted sensitivity and specificity data from each study. Quality scores were assessed on blind interpretation of screening test results, histological reference standard, verification of test negative subjects, description of disease spectrum, avoidance of bias in sample selection, publication type, and source of support. Diverse articles on costs and health outcomes were summarized and quality-scored according to criteria published by an expert panel.

Supplemental analyses include a meta-analysis to generate summary estimates of Pap test discrimination; cost analysis using claims databases to generate costs of treatment and screening; and a Markov model to estimate the effectiveness and costs of different technologies and clinical strategies.

Main Results

Conventional Pap smear screening, based on the few studies that avoided severe biases, showed specificity of 98 percent (95 percent confidence interval (CI); 97-99 percent) and sensitivity of 51 percent (95 percent CI; 37-66 percent). The sample prevalence of disease is strongly related to between-study variability in Pap test sensitivity and specificity and may reflect bias. Other indicators of study quality were not significant when prevalence was controlled. The Pap test is more accurate when a high-grade squamous intraepithelial lesion threshold is used with the goal of detecting a high-grade lesion than when lower thresholds, such as a low-grade squamous intraepithelial lesion (LSIL) or atypical squamous cells of uncertain significance (ASCUS), are used with the goal of detecting low- or high-grade dysplasia. Few studies of the new technologies used histology or colposcopy as a reference standard or allowed estimates of both sensitivity and specificity. In studies using a cytology reference standard, each of the new technologies appears to significantly improve sensitivity relative to conventional Pap smear screening; however, little information is available on the effects on specificity.

Cost-effectiveness ratios from published models comparing Pap smear screening with no screening fall into an acceptable range, but these models used parameters that overstate Pap test accuracy.

Base case estimates of the incremental cost-effectiveness of conventional Pap screening every 3 years compared with no Pap screening is $4,097 per life-year saved. A technology applied to the initial step in Pap screening that reduces the false negative rate by a factor of 0.6 at an incremental cost per slide of $10 has an incremental cost of $22,010 per life-year saved when performed every 3 years. With more frequent screening intervals, the cost per life-year saved is greater than $50,000. Technologies that allow 100 percent rescreening of slides initially read as normal by conventional Pap screening, at a reduction in false negative rate of 0.85 or higher, are more effective than technologies that improve initial screening with a reduction of 0.6. At these reductions in false negative rate, with identical incremental costs of $10 per slide and a screening interval of every 3 years, the cost per life-year saved of rescreening technologies compared with improved initial screening is $45,375. Findings were relatively insensitive to assumptions about cervical cancer incidence, cost of technologies, diagnostic strategies for abnormal screening results, and age at onset of screening. Findings were sensitive to both the reduction in false negative rate (i.e., improvement in sensitivity) and the relative specificity of the technologies compared with conventional Pap.

Conclusions

Estimates of the sensitivity of the conventional Pap test are biased in most studies; based on the least biased studies, sensitivity is near 50 percent, much lower than generally believed. Newer technologies improve sensitivity compared with conventional Pap screening; however, there are no precise estimates for their effect on specificity. Under assumptions favorable to improved initial screening technologies and rescreening technologies, either approach can result in acceptable cost per life-year saved at 3-year Pap screening intervals. However, the imprecision in estimates of effectiveness and cost of the new technologies makes drawing firm conclusions about their relative cost-effectiveness problematic.

This document is in the public domain and may be used and reprinted without permission, except for those copyrighted materials noted for which further reproduction is prohibited without the specific permission of copyright holders.

Suggested Citation:

McCrory DC, Matchar DB, Bastian L, et al. Evaluation of Cervical Cytology. Evidence Report/Technology Assessment No. 5. (Prepared by Duke University under Contract No. 290-97-0014.) AHCPR Publication No. 99-E010. Rockville, MD: Agency for Health Care Policy and Research. February 1999.

Summary

Overview

Worldwide, carcinoma of the cervix is one of the most common malignancies in women. It is estimated that approximately 13,700 new cases of the disease will occur in the United States in 1998. A woman's lifetime risk of being diagnosed with cervical cancer in the United States is currently 0.83 percent, and the risk of dying from the disease is 0.27 percent.

The incidence of cervical cancer and associated mortality have each decreased over 40 percent since 1973; the decreases are largely attributable to the success of mass screening using the Papanicolaou (Pap) test to diagnose premalignant or early-stage cases. The decreases in invasive cervical cancer incidence and mortality since the introduction of the Pap smear have been so dramatic that it is one of the few interventions to receive an "A" recommendation from the U.S. Preventive Services Task Force even though there are no randomized trials demonstrating its effectiveness.

Despite the indisputably dramatic impact of Pap screening, there is still uncertainty about the details of Pap smear performance, and much could be done to improve the performance of the test and followup of patients after screening. Controversy about the details of Pap smear performance is manifest in differing recommendations about the frequency of screening and the age (if any) at which screening may safely be stopped. A significant proportion of patients and providers fail to comply with even the least demanding recommendations for Pap screening frequency. Numerous barriers to screening have been identified that reduce access to Pap smears and other preventive services.

Recently, efforts to improve Pap smear performance have focused on reducing the number of false negative smears, that is, cases in which premalignant or malignant cells have been misdiagnosed as normal. Measures adopted to improve laboratory performance on this point include manual rescreening of a portion of slides initially evaluated as negative, an approach mandated by Federal law (Clinical Laboratory Improvement Amendments [CLIA]). Recently, several technologies have been developed to optimize Pap test screening by reducing the false negative rate. These technologies are a major focus of this report.

Reporting the Evidence

The report addresses three main questions:

  • 1

    What is the accuracy of cervical cytology using conventional Pap smears and new technologies (thin-layer cytology, computer rescreening, algorithm-based decision making technology) for detecting cervical cancer and its precursors?

  • 2

    What are the direct medical costs associated with cervical cancer screening, evaluation, treatment, and followup of cervical cytological abnormalities and treatment and followup of cervical cancer?

  • 3

    What are the effects on total health care cost, morbidity, and mortality of regular cervical cytological screening using thin-layer cytology and computer rescreening using neural network or algorithm-based decisionmaking technology compared with the conventional Pap smear in women participating in a screening program?

On the first point, the report will review published studies comparing cervical cytological diagnosis with clinical diagnosis based on colposcopy or biopsy. The results of this review will form the basis for a meta-analysis.

On the second point, the report will identify and examine current claims data and other datasets to estimate empirically costs associated with cervical cytological screening.

On the third point, the report will review the literature on the effectiveness and cost-effectiveness of cervical cytology screening and use these data to develop a comprehensive cost-effectiveness model to examine the impact of the newer screening technologies. In the absence of definitive clinical trials on key questions of cervical cancer screening, policymakers have relied on decision-modeling studies to integrate epidemiological data on the natural history of cervical cancer precursors, data on the performance of diagnostic tests for early cervical cancer or cervical cancer precursors, and data on cost. These models estimate the efficacy of various screening programs, balance estimated efficacy against estimated cost, and lead to decisions about appropriate screening intervals and age cutoffs.

New Technologies Assessed

Recent developments in specimen processing and interpretation may substantially improve the Pap smear as a diagnostic test for cervical cancer and cancer precursors. Three new devices recently approved by the Food and Drug Administration (FDA) are considered in this report: ThinPrep®, Papnet®, and AutoPap®. The three devices employ three different types of technology: thin-layer cytology (ThinPrep®) and computerized rescreening utilizing neural-network technology (Papnet®) or algorithmic classification (AutoPap®).

Each of these technologies was developed to reduce the false negative rate associated with cervical cytological screening. The two major components to this false negative rate are false negatives related to sampling error and false negatives related to detection error. About two-thirds of false negatives are a result of sampling error and the remaining one-third a result of detection error. Each of the new technologies is directed at one of these components of false negatives. Thin-layer cytology aims primarily to fix sampling error, whereas computerized rescreening targets detection error. This implies that neither technology will be able to reduce false negatives beyond a certain threshold.

One newly approved device, Papnet®, uses neural-network computerized rescreening of Pap smears initially read as negative by a cytotechnologist.The system works by using automated computerized imaging of Pap smear slides and interpretation of images using a computerized algorithm to identify slides that are likely to contain abnormal cells. The Papnet® system (Neuromedical Systems, Inc.) identifies cells or clusters of cells that require review and can display up to 128 images of the slide likely to contain abnormalities. These images can be reviewed by a cytotechnologist who can decide whether or not to review the slide using light microscopy.

AutoPap® 300 QC system (Neopath, Inc.), an algorithm-based decisionmaking technology, identifies slides exceeding a certain threshold for the likelihood of abnormal cells. The laboratfory can select different thresholds corresponding to 10, 15, and 20 percent review rates. In contrast to random rescreening, the population of slides selected by the AutoPap® 300 QC system is enriched with abnormalities and, at the 10-15 percent sort rate, this population of slides should contain 70-80 percent of the slides containing abnormalities missed by manual screening.

A variety of other technologies or clinical strategies have been proposed to improve Pap testing including various devices for collecting a cytological sample from the cervix. Still other technologies have been proposed to augment or replace cervical cytological screening, including colposcopic photographs for review by experts (cervicography) and DNA testing for specific human papillomavirus (HPV). These technologies are not considered in the present report.

Patient Population and Settings

The primary target population for this evidence report is women of average cervical cancer risk in the United States who are candidates for Pap smear screening. For the purposes of our analysis, candidates for Pap smear screening include women between the age of onset of sexual activity and the age of 85.

Although a large proportion of cervical cancer occurs in women with very limited or no screening, we did not examine programs or policies designed to improve screening compliance. Some previous studies have focused on special populations such as elderly women and elderly women who have not previously been screened.

The principal practice setting considered is the primary care practice in the United States (general internal medicine, family practice, adolescent medicine, and obstetrics/gynecology) and government and nongovernment family planning clinics (e.g., Planned Parenthood, public health clinics).

Methodology

The comprehensive review of the literature, from identification of databases through abstraction of individual articles into the evidence tables, was a multistep, sequential process. This process is detailed below

Literature Sources Used

MEDLINE, CancerLit, HealthSTAR, CINAHL, EMBASE, and EconLit computerized database searches, supplemented by manual journal searches and querying experts and device manufacturers, were the sources used to identify English language reports on the accuracy of cervical cytological screening, costs associated with screening and treatment, and cost-effectiveness.

Citations for the review of accuracy of cervical cytological testing were retrieved with a search strategy that combined various text word and index terms for cervical cytological tests with cervical cancer or dysplasia and sensitivity and specificity. The strategy to retrieve articles on the costs and health outcomes associated with cervical cancer screening combined cervical cytological test terms with terms describing cost analysis and mathematical modeling. Experienced librarians assisted with the design and translation of these search strategies for each database searched.

Screening of Articles

Separate sets of criteria for including articles in the evidence report were developed for the two topics that were the subject of literature reviews (diagnostic testing and cost and health outcomes). In each case, final screening criteria were developed through an iterative process. Each iteration of criteria was pilot-tested by each reviewer/abstractor on a subset of randomly chosen articles.

Articles on diagnostic testing were first screened based on information available through the online databases (primarily title, authors, and abstract when available). Citations were eliminated in Step 1 of the screening process if cervical cytology was not evaluated as a screening test or if the screening test results were not compared with a reference standard. In Step 2 of the screening process, full texts of articles were reviewed to select articles in which a reference standard of colposcopy or histology was used, the screening test and references standard were reasonably concurrent (i.e., within 3 months), and sufficient data to calculate both sensitivity and specificity were provided (i.e., all cells of a two-by-two table). Of the 939 bibliographic references reviewed, 561, or approximately 60 percent, were excluded during the first screening, and another 293, or 31 percent, during the second screening. Eighty-six articles were included according to these criteria: 84 studies of conventional Pap screening and one study each of ThinPrep® and Papnet®. Because so few studies of the new technologies met the original criteria, we modified the criteria to include studies of the new technologies that used a cytology reference standard and allowed estimation of either sensitivity or specificity. We considered a total of 59 studies (12 on AutoPap®, 27 on Papnet®, and 20 on ThinPrep®) during this final stage of the screening process (Step 3). The net result was the inclusion of 6 studies of AutoPap®, 11 of Papnet®, and 8 of ThinPrep®.

Articles on cost and health outcomes of cervical cytological screening were selected if they assessed the effect of screening on life expectancy or quality, number of cases of cervical cancer, or total health care costs for any of the following cytological screening technologies: conventional Pap smears, thin-layer cytology, or Pap smears with computerized rescreening. Of the 672 articles identified, 638, or 95 percent, were eliminated during the screening process. Thirty-four articles were included in the review.

Data Abstraction Process

Key information was abstracted onto specially designed forms and verified by either duplicate abstraction (two-by-two tables) or overreading by paired clinician-abstractors. Differences were resolved by consensus.

For the diagnostic testing articles, both members of each abstractor team also independently completed two-by-two tables for each study, extracting the key data to calculate sensitivity, specificity, and prevalence and other data to be used in the meta-analysis. The main outcome measures considered were the sensitivity and specificity of cytological abnormality by Pap test for detecting cases, where "cytological abnormality" was defined by one of three thresholds ranging from atypical squamous cells of uncertain significance (ASCUS) (threshold 1) to low-grade squamous intraepithelial lesion (LSIL) (threshold 2) to high-grade squamous intraepithelial lesion (HSIL) (threshold 3), and where a "case" was defined as a histological diagnosis of dysplasia or carcinoma. Equivalent categories in other classification schemes were also used. Two-by-two tables were constructed for four different combinations of cytological versus histological thresholds: ASCUS/ cervical intraepithelial neoplasia (CIN1), LSIL/CIN1, LSIL/CIN2-3, and HSIL/CIN2-3.

Criteria for Evaluating the Quality of Articles

Quality scores for articles on diagnostic testing were assigned according to predetermined methodological criteria based on blind interpretation of screening test results, use of a reference standard of histology, selection of test negative patients for verification, avoidance of bias in sample collection, description of the spectrum of disease in the sample, publication as a full report (as opposed to abstract), and source of support.

The quality of articles on costs and health outcomes was described according to recently published criteria by an expert panel on cost and effectiveness in medicine.

Supplemental Analyses

Meta-analysis of Pap Test Accuracy

We used the effectiveness score to combine data from multiple studies describing the performance of the conventional Pap test in discriminating between patients with and without cervical lesions. The effectiveness score takes account of both sensitivity and specificity by fitting a receiver operating characteristic (ROC) curve through a logistic odds transformation of the two and thus accounts for their interdependence. The effectiveness score is more normally distributed than either sensitivity or specificity and can be thought of as a gauge of the overall discriminatory ability of the test. Standardized effectiveness scores can be interpreted across different diagnostic tests. In general, a score of 3 reflects a test with good discrimination, whereas a score of 1 reflects a test that does not discriminate between disease positives and disease negatives.

We used maximum likelihood estimation techniques and a random effects model to calculate summary measures of effectiveness at each of the four explicit diagnostic thresholds (ASCUS/CIN1, LSIL/CIN1, LSIL/CIN2-3, HSIL/CIN2-3). We further evaluated the effect of variations in disease prevalence and in quality of study design and reporting on test discrimination.

Cost Analysis

Several available datasets were analyzed to estimate direct medical costs of screening, diagnosing, and treating cervical cancer, calculating separate estimates for women 20-64 years of age and those 65 years and older (eligible for Medicare). For women 20-64, the unit cost of screening, diagnosis, and treatment of cervical cancer was estimated from MEDSTAT data from 1992, 1993, and 1994, inflated to reflect 1994 charges and converted to costs using 1994 cost-to-charge ratios published by the American Hospital Association.

For women over 65, Medicare's resource-based relative value scale (RBRVS) fee schedule for physician services, Medicare's clinical laboratory fee schedule for laboratory services, and national average Diagnosis-related group (DRG) payments for hospital admissions were used to identify the payments associated with services received for cervical cancer screening, diagnosis, and treatment. Charges and payment information obtained from all sources were then converted to reflect costs associated with the services provided and all costs were inflated to 1997 dollars.

Cost-effectiveness Model

We constructed a 20-state Markov model that follows a cohort of women from age 15 to 85 and assumes that there are no prevalent cases of HPV infection or squamous intraepithelial lesion (SIL) at age 15. Cycle lengths are 1 year long. No Pap smear screening is compared with the following screening strategies: conventional Pap smears at 1-, 2- and 3-year intervals, thin-layer cytology smears at 1-, 2- and 3-year intervals, and 100 percent computerized rescreening at 1-, 2- and 3-year intervals.

We used a U.S. health system perspective and evaluated the direct and health-care specific costs associated with screening, diagnosis, and treatment of cervical cancer and its precursors. We did not consider other societal costs such as work loss. The model considers the following outcomes: cost per year of life saved, cost per cervical cancer death prevented and per cervical cancer case prevented, and the number of morbid therapies avoided.

We discounted costs and years of life at 3 percent annually in the base case and varied the discount rate from 0 to 5 percent in a sensitivity analysis. Specific parameter estimates were derived from a preliminary literature assessment conducted for this report and prior published models of cervical cancer screening.

Findings

Important findings regarding the discrimination about the accuracy of cervical cytological screening include the following:

  • Despite the demonstrated ability of cervical cytological screening in reducing cervical cancer mortality, the conventional Pap test is less sensitive than it is generally believed to be.

  • Few studies of primary screening were unaffected by "workup" bias, but the few that were provided estimates of the specificity of Pap smear screening of 0.98 (95 percent confidence interval; 0.97-0.99) and sensitivity of 0.51 (95 percent confidence interval; 0.37-0.66).

  • The Pap test is more accurate when a higher cytological threshold (HSIL) is used with the goal of detecting a high-grade lesion. Lower test thresholds or use of the Pap test for detecting low-grade dysplasia results in poorer discrimination.

The accuracy of the Pap test is strongly affected by disease prevalence. Higher disease prevalence is associated with higher estimates of sensitivity and lower estimates of specificity (with a greater effect on specificity). These findings are consistent with prevalence as a marker for workup bias and perhaps also reflect an imperfect reference standard that is more specific than sensitive.

  • Quality of the studies reviewed, based on previously described criteria, varied widely; however, quality score did not explain a statistically significant amount of the between-study variation in discrimination when the variation in the prevalence of disease was controlled.

  • Existing information fails to provide accurate estimates for specificity of thin-layer cytology or computerized rescreening technologies. Our initial requirement for verification of test negatives with colposcopy or histology led to the exclusion of all but one study each of ThinPrep® and Papnet® and all studies of AutoPap®. The values reported for sensitivity and specificity in the few studies that use histological or colposcopic reference standards are well within the range of sensitivity and specificity reported for the conventional Pap test. However, including studies that directly compare these new technologies with conventional Pap smear testing (screening or rescreening) using a cytological reference standard results in significant improvements in sensitivity.

Important findings regarding the costs of cervical cytological screening and cervical cancer diagnosis and treatment include the following:

  • Pap smear screening cost is somewhat higher in older women than younger women chiefly because physician and total time spent in obtaining Pap smears during office visits is longer for older women.

  • Estimated costs of cervical cancer treatment calculated from episodes of care are substantially higher than estimates based on average procedure-specific costs because of both the provision of related services and the effect of complicated cases with unusually high costs. Estimates based on procedure-related costs alone will underestimate the true direct medical costs.

Important findings from a review of previously published models of the cost and effectiveness of cervical cytological screening include the following:

  • Published models examining the cost and effectiveness of Pap smear screening have consistently found Pap screening to have a significant impact on the incidence and mortality of cervical cancer and to have an acceptable range of cost-effectiveness ratios when compared with no screening.

  • Estimates of Pap test accuracy used in these models generally overestimated Pap test performance, as determined by recent unbiased studies and the findings of this report, and a previously published meta-analyses. Best estimates of Pap test performance fall outside the range used in sensitivity analyses of some models.

Important findings from a new model of cost and effectiveness of cervical cytological screening include the following:

  • The cost-effectiveness of either a technology that improves primary screening sensitivity (e.g., thin-layer cytology), or one that improves rescreening sensitivity (e.g., computerized rescreening), is directly related to the frequency of screening -- longer intervals result in lower estimates of cost per life year saved.

  • Our findings were relatively insensitive to assumptions about cervical cancer incidence, the cost of technologies, diagnostic strategies for abnormal screening results, age at onset of screening, or most other variables tested.

  • There is substantial uncertainty about the estimates of sensitivity and specificity of thin-layer cytology and computerized rescreening technologies compared with each other and with conventional Pap testing. The uncertainty is not reflected in the point estimates for effectiveness or cost-effectiveness. Although it is clear that both thin-layer cytology and computerized rescreening technologies provide an improvement in effectiveness at higher cost, the imprecision in estimates of effectiveness makes drawing conclusions about the relative cost-effectiveness of thin-layer cytology and computerized rescreening technologies problematic.

Future Research

Our research suggests several areas for possible future study.

  • Future decision models, cost-effectiveness studies, and health policy decisions should consider the sensitivity of Pap smear screening close to 50 percent.

  • Thin-layer cytology technology (ThinPrep®), the computerized rescreening device (AutoPap®), and the algorithm-based decisionmaking technology (Papnet®) have received regulatory approval from the FDA based on their demonstration of improved sensitivity compared with conventional Pap smear techniques. However, the evidence currently available does not fully describe the impact of these technologies on the specificity of the screening process. It is possible that a new technology might simultaneously raise both sensitivity and specificity; however, this has not been conclusively demonstrated for the devices reviewed in this report. Future studies of these technologies should include verification of test negative subjects to allow estimation of specificity.

  • Comparisons with cytological reference standards attest to the validity of the new technologies compared with optimal Pap screening, but comparison with a histological reference standard provides a more relevant outcome for clinical decisionmakers, since histological diagnosis forms the basis of most clinical management decisions. Further research is needed to validate negative cytological diagnoses made with the new technologies with colposcopy, in both low-prevalence and high-prevalence populations. This could be accomplished by subjecting a random sample of cytology-negative women to colposcopy, which would permit statistical correction for workup bias and estimation of test specificity.

  • Further research is needed to quantify the effect of cervical cancer and premalignant cervical lesions and various treatments for cervical cancer or dysplasia on quality of life. These data will allow a more comprehensive assessment of the impact of technologies for cervical cytological screening.

Introduction

Background

Worldwide, carcinoma of the cervix is one of the most common malignancies in women. It is estimated that approximately 13,700 new cases of the disease will occur in the United States in 1998 (Landis, Murray, Bolden et al., 1998). A woman's lifetime risk of being diagnosed with cervical cancer in the United States is currently 0.83 percent, and the risk of dying from the disease is 0.27 percent (Ries, Miller, Hankey et al., 1997).

The incidence of cervical cancer and associated mortality have both decreased over 40 percent since 1973. The discrepancy between incidence and mortality risk, and the decrease in both over time, are largely attributable to the success of mass screening using the Papanicolaou (Pap) test to diagnose premalignant or early-stage disease (Cannistra and Niloff, 1996; Ries, Kosary, Hankey et al., 1998; Shingleton, Patrick, Johnston et al., 1995; U.S. Preventive Services Task Force, 1996). The decrease in invasive cervical cancer incidence and mortality since the introduction of the Pap smear has been so dramatic that it is one of the few interventions to receive an "A" recommendation from the U.S. Preventive Services Task Force even though there are no randomized trials demonstrating its effectiveness (U.S. Preventive Services Task Force, 1996).

Despite the indisputably dramatic impact of Pap screening, there is still uncertainty about the details of Pap smear performance, and much could be done to improve the performance of the test and followup of patients after screening. Controversy about the details of Pap smear performance is manifest in differing recommendations about the frequency of screening and the age (if any) at which screening may safely be stopped (American Academy of Pediatrics, 1988; American Cancer Society, 1993; American College of Obstetricians and Gynecologists, 1995; American College of Physicians, 1991; American Medical Association, 1994; Canadian Task Force on the Periodic Health Examination, 1994; Green, 1994).

A significant proportion of patients and providers fail to comply with even the least demanding recommendations for Pap screening frequency. Numerous barriers to screening have been identified that reduce access to Pap smears and other preventive services (Womeodu and Bailey, 1996). Organized screening programs implemented in other countries have shown higher compliance with recommended screening rates than ad hoc screening programs (Koopmanschap, van Oortmarssen, van Agt et al., 1990; van Ballegooijen, van den Akker-van Marle, Warmerdam et al., 1997).

Efforts have also been made to improve test performance by insuring appropriate followup of results. In the case of abnormal test results, efforts have been focused on improving patient compliance. In the case of normal results, models that vary the frequency of testing intervals have been used to determine the optimum timing of screening visits, although there is disagreement on the interpretation of the models and on the recommendations of authoritative bodies.

More recently, efforts to improve Pap smear performance have focused on reducing the number of false negative smears, that is, cases in which premalignant or malignant cells have failed to be diagnosed either because of sampling error (abnormal cells failed to be placed on the slide) or detection error (abnormal cells were misdiagnosed as normal). Measures adopted to improve laboratory performance on this point include manual rescreening of a portion of slides initially evaluated as negative, an approach mandated by Federal law (Clinical Laboratory Improvement Amendments [CLIA]). Recently, several technologies have been developed to optimize the Pap test by reducing sampling or detection error. These technologies are a major focus of this report.

Scope and Purpose

The purpose of the present report is threefold:

To determine the accuracy of cervical cytology using conventional Pap smears and newer technologies (thin-layer cytology, computerized rescreening, algorithm-based decisionmaking technology) for detecting cervical cancer and its precursors.

To estimate the direct medical costs associated with cervical cancer screening and evaluation, treatment and followup of cervical cytological abnormalities, and treatment and followup of cervical cancer.

To estimate the effects on total health care cost, morbidity, and mortality of regular cervical cytological screening using newer screening technologies (thin-layer cytology, computer rescreening, algorithm-based decisionmaking technology) compared with conventional Pap smear in women participating in a screening program.

On the first point, the report will review published studies comparing cervical cytological diagnosis with clinical diagnosis based on colposcopy or biopsy. The results of this review will form the basis for a meta-analysis.

On the second point, the report will identify and examine current claims data and other datasets to empirically estimate costs associated with cervical cytological screening.

On the third point, the report will review the literature on the effectiveness and cost-effectiveness of cervical cytology screening and use these data to develop a comprehensive cost-effectiveness model to examine the impact of the newer screening technologies. In the absence of definitive clinical trials on key questions of cervical cancer screening, policymakers have relied on decision-modeling studies to integrate epidemiological data on the natural history of cervical cancer precursors, data on the performance of diagnostic tests for early cervical cancer or cervical cancer precursors, and data on cost. These models estimate the efficacy of various screening programs, balance estimated efficacy against estimated cost, and lead to decisions about appropriate screening intervals and age cutoffs (Eddy, 1990; Fahs, Mandelblatt, Schechter et al., 1992; Mandelblatt and Fahs, 1988; Schechter, 1996; U.S. Congress Office of Technology Assessment, 1981, 1990).

New Technologies Assessed

Three new devices recently approved by the Food and Drug Administration (FDA) are considered in this report: ThinPrep®, Papnet®, and AutoPap®. The three devices employ three different types of technology: thin-layer cytology (ThinPrep®), and computerized rescreening utilizing neural-network technology (Papnet®) or algorithmic classification (AutoPap®).

Each of these technologies was developed to reduce the false negative rate associated with cervical cytological screening. The two major components to this false negative rate are false negatives related to sampling error and false negatives related to detection error. About two-thirds of false negatives are due to sampling error and the remaining one-third due to detection error. Each of the new technologies is directed at one these components of false negatives. Thin-layer cytology aims primarily to fix sampling error, whereas computerized rescreening targets detection error. This implies that neither technology will be able to reduce false negatives beyond a certain threshold.

Thin-layer cytology is a new technology for processing cytological samples. The sample is collected as in the conventional Pap test using a cervical broom or cervical spatula and endocervical brush, but rather than smearing the cytological sample directly onto a microscope slide, this new method suspends the sample cells in a fixative solution, disperses them, and then selectively collects cells on a filter. The cells are then transferred to a microscope slide for cytological interpretation. Because cytological samples are fixed immediately after collection, there are fewer artifacts in cellular morphology. Fewer cells on the slide are obscured, both because the process reduces artifactual material such as blood and mucous and because cells are deposited on the slide in a monolayer. Clinical studies of the ThinPrep® 2000 (Cytyc Corporation, Boxborough, MA) have shown that the sensitivity is improved compared with conventional Pap smears; however, few data are available on the specificity of this technology compared with conventional Pap smears. The improvement in sensitivity appears to be greater in populations with a low prevalence of cytological abnormalities.

Papnet® is a newly approved device that uses neural-network computerized rescreening of Pap smears initially read as negative by a cytotechnologist. The system works by using automated imaging of Pap smear slides and computerized interpretation of images. The Papnet® system (Neuromedical Systems, Inc.) identifies cells or clusters of cells that require review and displays up to 128 images of the slide likely to contain abnormalities. These images must be reviewed by a cytotechnologist who can decide whether to review the slide using light microscopy.

AutoPap® 300 QC system (Neopath, Inc.), which uses algorithm-based decisionmaking technology, identifies slides exceeding a certain threshold for the likelihood of abnormal cells. The laboratory can select different thresholds, corresponding to 10, 15, and 20 percent review rates. In contrast to random rescreening, the population of slides selected by the AutoPap® 300 QC system is enriched with abnormalities and should contain 70-80 percent of the slides containing abnormalities missed by manual screening.

A variety of other technologies or clinical strategies have been proposed to improve Pap testing, including various devices for collecting a cytological sample from the cervix. Still other technologies have been proposed to augment or replace cervical cytological screening; for example, colposcopic photographs for review by experts (cervicography) and DNA testing for specific human papillomavirus (HPV). These technologies are not considered in the present report.

Cervical Cancer

Incidence and Prevalence

The majority of cases of invasive cervical cancer occur in women who have either never had a Pap smear or have not had a smear in the previous 5 years (Shingleton, Patrick, Johnston et al., 1995). Lack of screening may explain many of the differences in cervical cancer rates among ethnic groups (Miller, Kolonel, Bernstein et al., 1998); for example, overall mortality from cervical cancer in black women is higher than in white women; stage-specific survival is identical, but black women present with more advanced disease (Ries, Kosary, Hankey et al., 1997).

Although all sexually active women are at risk for cervical cancer, the disease is more common in women of low socioeconomic status and in those who smoke, have a history of multiple sex partners, or have early onset of sexual intercourse.

Certain types of HPV have a strong epidemiological association with cervical cancer. Types 16 and 18 are two of several oncogenic strains of the more than 70 types of HPV. Both incidence and prevalence of HPV are greatest in women under the age of 25 (Koutsky, 1997). Ho, Bierman, Beardsley et al. (1998) have recently reported a cumulative 3-year incidence of 43 percent, or an average annual incidence of 14 percent, in a cohort of college women. Prevalence of HPV DNA in women with normal cervical cytology decreases with age in a variety of populations (Bauer, Hildesheim, Schiffman et al., 1993; Coker, Jenkins, Busnardo et al., 1993; ; Figueroa, Ward, Luthi et al., 1995; Hildesheim, Gravitt, Schiffman et al., 1993; Wheeler, Parmenter, Hunt et al., 1993). A second peak is seen after age 40 in some studies (Figueroa, Ward, Luthi et al., 1995); whether this is due to reinfection, or a cohort effect secondary to differences in age at onset of sexual activity, is unclear. Data on postmenopausal women are rare. Reported prevalence in peri- and postmenopausal women ranges from 1 percent (Ferenczy, Gelfand, Franco et al., 1997) to 38.1 percent (Smith, Johnson, Figuerres et al., 1997), a difference that may be attributable to differences in populations, cohort effects, or viral assay techniques. An international study found prevalence in women between 50 and 59 ranging from 4.1 percent in Spain to 13.6 percent in Brazil (Munoz, Kato, Bosch et al., 1996). Prevalence in a German cohort of women over 55 was 3.2 percent (De Villiers, Wagner, Schneider et al., 1992).

The natural history of HPV infection is complex, with clearance and persistence of viral DNA, along with progression to squamous intraepithelial lesion (SIL), varying depending on the viral type, patient characteristics such as age, and study design and assay methods (Herrero, 1996; Kiviat, 1996; Koutsky, 1997; Mitchell, Tortolero-Luna, Wright et al., 1996 ; Schiffman, Bauer, Hoover et al., 1993).

Disease Biology

Although a small percentage of cervical cancers do not have detectable HPV DNA, even with sensitive assays, there is consensus that HPV infection is the causative agent for the vast majority of cervical cancers (Herrero, 1996; Kiviat, 1996; Koutsky, 1997; Schiffman, Bauer, Hoover et al., 1993). Certain HPV types are clearly more likely to progress to cancer than others, and identification of these types in cervical cells may have a role in determining optimal diagnostic and treatment strategies for patients with abnormal Pap smears (Cox, Lorincz, Schiffman et al., 1995).

Natural History

Invasive cervical cancer usually is preceded by a long, premalignant phase during which it is readily treatable (Cannistra and Niloff, 1996; Wright and Richart, 1992; Wright, Kurman, and Ferenczy,1994). Even if this premalignant phase progresses to invasive cancer, early-stage disease is highly curable: 5-year survival rates for early-stage disease are above 90 percent, but if the disease has spread outside of the pelvis (Stage IV), cure rates are only 10-15 percent.

Epidemiological data suggest that a substantial proportion of patients with low-grade squamous intraepithelial lesions (LSIL) will have regression if the lesion are not treated, a finding that supports a "watchful waiting" strategy involving retesting.

Burden of Illness

Despite the success of Pap smear screening, women continue to develop cervical cancer. Treatment of early-stage disease is effective, but both of the alternatives, radical hysterectomy or radiation therapy, can have significant short- and long-term morbidity (Cannistra and Niloff, 1996; Landoni, Maneo, Columbo et al., 1997). Mortality from late-stage disease remains high.

Patient Population and Settings

Target Populations

The primary target population for this evidence report is women of average cervical cancer risk in the United States who are candidates for Pap smear screening. For the purposes of our analysis, candidates for Pap smear screening include women between the age of onset of sexual activity and the age of 85.

Although a large proportion of cervical cancer occurs in women with very limited or no screening, we did not examine programs or policies designed to improve screening compliance. Some previous studies have focused on special populations such as elderly women (Fahs, Mandelblatt, Schechter et al., 1992; U.S. Congress Office of Technology Assessment, 1990) and elderly women who have not previously been screened (Mandelblatt and Fahs, 1988).

Practice Settings

The principal practice setting considered will be the primary care practice in the United States (general internal medicine, family practice, adolescent medicine, and obstetrics/gynecology) and government and nongovernment family planning clinics (e.g., Planned Parenthood, public health clinics).

Methodology

This section documents explicitly the methods and procedures used to develop the evidence report, emphasizing the comprehensive evaluation of the evidence and the analytic framework used. The section begins with a description of the research questions and evidence model that guided our work and proceeds to a detailed description of the techniques and approaches used in the literature review, including descriptions of the literature search and review parameters, MeSH terms used, types of study design included, number and identity of databases searched, and years included in the search. The methodologies used to produce cost estimates and carry out the meta-analysis and cost-effectiveness analysis are then described. Finally, the role of the report partner (American College of Obstetricians and Gynecologists) and quality control methods are described.

Evidence Report Questions and Evidence Model

The report addresses three main questions:

  • 1

    What is the accuracy of cervical cytology using conventional Pap smears and new technologies (thin-layer cytology, computerized rescreening using neural network or algorithm-based technology) for detecting cervical cancer and its precursors?

  • 2

    What are the direct medical costs associated with cervical cancer screening, evaluation, treatment, and followup of cervical cytological abnormalities and treatment and followup of cervical cancer?

  • 3

    What are the effects on total health care cost, morbidity, and mortality of regular cervical cytological screening using thin-layer cytology, computerized rescreening using neural network technology, and computerized rescreening using algorithm-based technology compared with conventional Pap smear in women participating in a screening program

An external file that holds a picture, illustration, etc., usually as some form of binary object. The name of referred object is f9_F001.jpg.

   Figure 1. Evidence model

The evidence model or "causal pathway" used, which is illustrated in Figure 1 , identifies the population, critical interventions, and outcomes (intermediate, surrogate, beneficial health outcomes, and harms) considered and also indicates explicitly the relationships among these factors (Woolf, 1994). Arrows link the intervention to outcomes either directly or through intermediate steps. Different linkages may have different levels of evidence supporting whether an association is present. In some cases, linkages may be indirectly supported by studies in slightly different populations.

We drafted and revised the evidence model, soliciting input from the study's Advisory Panel of Technical Experts regarding the outcomes and harms of interest, and identified the type of evidence that exists to support each linkage. The specific linkages identified as key questions were the subject of a systematic literature review.

The project focused on several linkages that are crucial to the question of the cost an effectiveness of several new technologies for cervical cytological screening. The overall question of the effectiveness of cervical cytological screening on morbidity and mortality is described by the overarching linkages between cervical cytological screening and reductions in mortality and morbidity, as well as influences on cost and other health outcomes such as quality of life. Strong but indirect evidence supports the effectiveness of cervical cytological screening in reducing cervical cancer morbidity and mortality. This evidence includes quasi-experimental observational studies describing cervical cancer incidence before and after implementation of screening programs. Additional indirect evidence comes from decision- and simulation-modeling of cervical cancer screening.

Current costs associated with cervical cytological screening are not well described in the literature, and in this project an empirical investigation of health care claims and other databases was undertaken to address this linkage in the evidence model.

The link between cervical cytological screening and detection of abnormalities is investigated in a literature review on the diagnostic accuracy of conventional Pap smear screening and the new technologies. Discrimination, sensitivity, and specificity are the most relevant data for the assessment of the new technologies.

We did not specifically address the well-established linkages between detection of abnormal cytology or early cervical cancer and reductions in cervical cancer morbidity and mortality through early treatment of cervical lesions.

The evidence model also functioned as a conceptual basis for the construction of a working mathematical model of the cost and effectiveness of cervical cytological screening that synthesizes all the linkages depicted regarding cervical cancer screening. Our approach thus extends the idea of the evidence model to a mathematical model that allows us to examine the impact of various cervical cancer screening strategies on the health and economic outcomes depicted in the evidence model. Both models provide an inventory of the inputs relevant to a screening decision and give a sense of which screening questions are tractable and for which questions reasonable scientific data exist. The models thus also guided the design of our literature search to focus on those inputs that most strongly influence the relative desirability of alternative decision strategies to various clinical decisionmakers.

Methodology: Literature Review

The comprehensive review of the literature, from identification of databases through abstraction of individual articles into the evidence tables, was a multistep, sequential process. This process is detailed below.

Literature Sources Used

Primary sources of literature for the two key research questions that were the subject of literature reviews were six of the most widely used online bibliographic databases: MEDLINE, CancerLit, HealthSTAR, CINAHL, EMBASE, and EconLit. Although there is considerable overlap among these databases, each contains unique information.

Produced by the U.S. National Library of Medicine, the MEDLINE database is widely recognized as the premier source for bibliographic and abstract coverage of biomedical literature. MEDLINE coverage began in 1966. More than 8.7 million records from more than 3,600 journals are indexed, plus selected monographs of congresses and symposia. Abstracts are included for about 67 percent of the records.

Produced by the U.S. National Cancer Institute, CancerLit is an important source of bibliographic records (most with abstracts) beginning in 1983 and pertaining to all aspects of cancer therapy, including experimental and clinical cancer therapy; chemical, viral, and other cancer-causing agents; mechanisms of carcinogenesis; biochemistry, immunology, and physiology of cancer; and mutagen and growth factor studies. Approximately 200 core journals contribute a large percentage of the records. Other entries are drawn from proceedings of meetings, government reports, symposia reports, theses, and selected monographs. Indexed materials include articles from journals, abstracts of papers presented at professional meetings, government and technical reports, dissertations, and monographs.

HealthSTAR, produced cooperatively by the U.S. National Library of Medicine and the American Hospital Association, contains citations to the published literature on health services, technology, administration, and research from 1975 to the present. Topics included that were of particular interest to us are evaluation of patient outcomes; effectiveness of procedures, programs, products, services, and processes; health economics and financial management; and quality assurance. This database contains citations and abstracts (when available) to journal articles, monographs, technical reports, meeting abstracts and papers, book chapters, government documents, and newspaper articles.

The Cumulative Index to Nursing & Allied Health (CINAHL) provides authoritative coverage of the literature related to nursing and allied health from 1983 to the present. CINAHL was chosen for the literature searches on cervical cytology primarily because it indexes several medical laboratory journals.

EMBASE is a comprehensive pharmacological and biomedical database containing over 3 million records from 1980 to the present, indexed from 3,500 journals published in 70 countries. More than 65 percent of the records contain abstracts.

EconLit contains abstracts of journal articles, books, and working papers, book reviews, and citations to articles in collective volumes. It covers economic literature broadly, from 1969 to the present.

Computerized searches of these online databases were supplemented by secondary searches, including the manual scanning of newly published journal issues that had not yet been indexed in the online databases. This scanning was accomplished through online searches of journal websites as well as searches of printed journals obtained through subscriptions and the Duke University Medical Center Library. In addition, the World Wide Web of the manufacturers of the automated cytology devices reviewed in this report were checked several times a week, along with those of relevant professional societies.

Search Strategies for Computerized Databases

The two strategies used to search the above computerized databases were developed in consultation with a Duke University Medical Center librarian who specializes in evidence-based medicine research. There were several iterations of each search strategy before the final strategies were agreed upon. The final strategy for articles on diagnostic testing for cervical cancer was:

  • 1

    Vaginal smears/

  • 2

    ((pap or papan$) and (smear$ or test$)).tw

  • 3

    (papnet or autopap or thinprep).tw.

  • 4

    1 or 2 or 3

  • 5

    exp Cervix neoplasms/

  • 6

    Cervix dysplasia/

  • 7

    Cervical Intraepithelial Neoplasia/

  • 8

    dyskaryo$.tw.

  • 9

    5 or 6 or 7 or 8

  • 10

    exp "Sensitivity and Specificity"/

  • 11

    (sensitivity or specificity).tw.

  • 12

    exp Diagnostic errors/

  • 13

    4 and (10 or 11 or 12)

  • 14

    13 and 9

  • 15

    limit 14 to (human and english language)

  • 16

    Papillomavirus, Human/

  • 17

    15 not 16

The final search strategy for articles on the costs and health outcomes associated with cervical cancer screening was as follows:

  • 1

    Vaginal smears/

  • 2

    ((pap or papan$) and (smear$ or test$)).tw

  • 3

    (papnet or autopap or thinprep).tw.

  • 4

    1 or 2 or 3

  • 5

    exp "costs and cost analysis"/

  • 6

    (cost$ or expenditure$).tw.

  • 7

    ec.fs.

  • 8

    5 or 6 or 7

  • 9

    4 and 8

  • 10

    limit 9 to (human and english language)

  • 11

    exp Decision Support Techniques/

  • 12

    exp Models, Statistical/

  • 13

    Technology assessment, biomedical/

  • 14

    Monte carlo method/ or Survival Analysis/

  • 15

    11 or 12 or 13 or 14

  • 16

    4 and 15

  • 17

    limit 16 to (human and english language)

  • 18

    10 or 17

These strategies were initially developed using the National Library of Medicine MeSH key word nomenclature developed for MEDLINE. The Duke University Medical Center librarians assisted with the translation of these search strategies to the key word structures used by other databases. The basic search terms remained the same. Each online search began with the first year of the database (see individual database descriptions above).

After the initial runs in January 1998, the computerized searches were updated twice, in February and March. The final yield of the two searches was 1,538 unduplicated articles--939 articles were identified by the diagnostic testing searches and 671 by the cost and health outcomes searches, including articles identified through manual searches. Approximately 67 percent of all articles were identified through MEDLINE, 23 percent through EMBASE, and 5 percent through manual searches, with the remaining 5 percent divided among CancerLit, HealthSTAR, and CINAHL. EconLit did not identify any relevant or unique articles.

Development of a Bibliographic Database

As online literature searches were conducted, results were imported into Pro-Cite Version 2.0, a database software program for storing, managing, and retrieving bibliographic information. Each article was given a unique identifier within the Pro-Cite database. An ongoing search for duplicates was conducted. Initially, each article was coded to indicate its source and key question(s). As screening and data abstraction progressed, each article was coded for the stage at which it was excluded from the evidence report.

Screening of Articles

Separate sets of criteria for including articles in the evidence report were developed for the two topics that were the subject of literature reviews (diagnostic testing and cost and health outcomes). In each case, final screening criteria were developed through an iterative process. Each iteration of criteria was pilot-tested by each reviewer/abstractor on a subset of randomly chosen articles.

Articles on Diagnostic Testing

These articles were clinically and methodologically complex and thus required a two-step screening process. In Step 1, articles were screened on the basis of the basic information available through the online databases (primarily title, authors, and abstract when available). Articles at this step were eliminated from further consideration based on answers to three questions:

  • 1

    Was cervical cytology evaluated as a screening test? If no, then article excluded.

  • 2

    Was the screening test for primary screening only, rescreening only, or primary screening with rescreening? If none of these, then article excluded.

  • 3

    Was the reference standard histology, histology or negative colposcopy, or cytology? If none of these, then article excluded.

At this stage, the following information was also recorded:

  • 4

    Type of rescreening included: none, manual, interactive, or independent.

  • 5

    Description of target population.

Bibliographic records were screened by three pairs of clinicians, two pairs consisting of a general internist and obstetrician/gynecologist and one pair consisting of a medical student and obstetrician/gynecologist. The decisions of the two partners in each pair were compared, and a kappa statistic was calculated to estimate the strength of the agreement between them. Differences of opinion were then reconciled. Only when both clinician-reviewers agreed to include or exclude an article was the decision entered into the Pro-Cite database. Copies of the full text of all articles passing the initial screening were obtained. Each included article was subjected to a further screening (Step 2) by the same pair of reviewers who had performed the initial screening of the corresponding bibliographic record (Step 1).

As in Step 1, the criteria used in Step 2 went through several iterations to assure that the most salient articles would be included. The four questions used for excluding articles at Step 2 were:

  • 1

    Did the study use a reference standard? If not, then article excluded.

  • 2

    Was the reference standard histology or colposcopy? Histology and colposcopy qualified the article for inclusion; articles with cytology reference standard were excluded.

  • 3

    For studies comparing histology or colposcopy with cytology as a screening test: were these tests reasonably concurrent, i.e., up to 3 months apart? If not, then article excluded.

  • 4

    Can all cells of a 2-by-2 table be completed? If not, then article excluded. The 2-by-2 table is defined as a tabulation of the number of subjects with normal and abnormal screening test results according to the specified screening test threshold versus the number of subjects with normal or abnormal reference standard results according to the reference standard threshold.

The review process was the same as in the first step, with the two clinicians in each team comparing their decisions and reaching agreement to include or exclude the article. The decision to exclude at Step 2 was then entered into the Pro-Cite database. The two major reasons for excluding an article at this stage were lack of histology/colposcopy as a reference standard and the lack of sufficient data to complete a 2-by-2 table.

Of the 939 bibliographic references reviewed, 561, or approximately 60 percent, were excluded during Step 1 screening, and another 293, or 31 percent, during Step 2 screening. Eighty-six articles passed both screens.

Few articles pertaining to the new technologies were included among the 86 articles passing both screens. In most cases, this was due to the lack of a histology/colposcopy reference standard. We sought guidance from cytology society guidelines (Intersociety Working Group for Cytology Technologies, 1998 ) and FDA documents (Food and Drug Administration, 1994) about acceptable reference standards other than histology or colposcopy. These resources suggested that cytology would be an acceptable surrogate reference standard, but only when an independent panel of cytology professionals was used to arrive at a consensus diagnosis. Furthermore, the Intersociety Working Group guidelines called for histological validation of a significant proportion of high-grade cytological findings.

We also explored the methodological literature on diagnostic tests to determine techniques for dealing with lack of verification of test negative subjects. Information about the incremental test performance characteristics can be obtained from a direct comparison of independently applied conventional and new tests, in which case it is not necessary to obtain reference standard results for subjects who are negative on both tests (Chock, Irwig, Berry et al.,1997). Using this method, one cannot calculate sensitivity and specificity directly, but a relative true positive rate (TPR) and a relative false positive rate (FPR) can be calculated. These statistics can be used to modify estimates of the conventional test performance to yield estimates of the performance of the new test relative to the reference standard. These methods apply only when the tests being compared are used independently. For example, studies that applied initial manual screening followed by computerized rescreening could not be evaluated with these methods because the use of rescreening was conditional on a negative initial screen and thus the two tests were not applied independently. These methods would apply primarily to studies of alternatives to conventional Pap when used as primary screeners or studies that use independent comparisons of computerized versus manual rescreening.

This review of methodological issues led to the development of a separate set of screening criteria for studies of the new technologies (Step 3 screen). We reevaluated articles on monolayer slide preparation and neural-network or algorithm-based screening technologies that had been excluded in Step 2 either because they used a cytology reference standard or because they failed to allow estimates of both sensitivity and specificity. Again, two clinicians in each team compared their decisions and reached agreement on whether to include or exclude an article. The Step 3 screening criteria were as follows:

  • 1

    Did the study use a two-armed prospective design? If not, then article excluded. (A two-arm design implies that the tests being compared are applied to the same set of subjects or slides.)

  • 2

    Were discordant results from the two study arms adjudicated by an independent panel of experienced cytology professionals? If not, then article excluded.

  • 3

    Were the majority of those testing positive for HSIL verified with histology or colposcopy?Studies which did not verify at least half of those with screening tests positive for HSIL were excluded.

  • 4

    Did the study design allow for separate analyses of sensitivity (or relative TPR) and specificity (or relative FPR)? If not, then article excluded.

A total of 59 articles were considered at this stage (12 on AutoPap, 27 on Papnet®, and 20 on ThinPrep ®). Forty-three of these fit the description provided above (excluded during Stage 2 screen because of cytology reference standard or failure to allow estimates of sensitivity and specificity). Sixteen new articles were also brought to our attention by reviewers or manufacturers. Of these 16 articles, 7 were unpublished manuscripts and 2 were published only in abstract form.

Applying Step 3 screening criteria to these 59 articles led to the inclusion of only one study of the new technologies: an evaluation of ThinPrep® (Roberts, Gurley, Thurloe et al.,1997). To provide some description of the available literature on the effectiveness of the new technologies, we have included in Evidence Table 1 and described in the text of this report all those studies considered for Step 3 that also met the criterion of a concurrent reference standard (histology, coposcopy, or cytology), even though they did not permit estimates of specificity or relative FPR. The net result was the inclusion of 6 studies of AutoPap®, 11 of Papnet ®, and 8 of ThinPrep®.

Articles on Cost and Health Outcomes

Articles were excluded on the basis of responses to two questions:

  • 1

    Was the screening test AutoPap ®, Papnet ® , ThinPrep ® , or conventional Pap? If none of these, then article excluded.

  • 2

    Does the article assess the effect of screening on life expectancy or quality, number of cervical cancer cases avoided, or total health care costs? If none of these, then article excluded. Note: Studies that reported costs of screening test alone, without subsequent evaluation- and management-related costs, were excluded.

Bibliographic records (title, author, and abstract, when available) from the Pro-Cite database were screened by two pairs of reviewers, each including a doctoral-level economics graduate student and either a clinician or health policy analyst. Decisions by the partners were recorded, a kappa statistic was calculated, and agreement was reached where discrepancies existed. If the abstract satisfied the screen, then the full article was reviewed with respect to the same two criteria. The decision to exclude an article was entered into the computer database after the abstract and/or the full article had been screened.

Of the 672 articles identified, 638, or 95 percent, were eliminated during the screening process. Thirty-four articles were included in the review.

Study Design of Included Articles

At the time the literature searches were conducted, there were no completed randomized clinical trials comparing conventional Pap tests with new technologies to detect cervical cancer and its precursors. Study designs included in the diagnostic testing review were diagnostic test evaluations, meta-analyses, and review articles. Cost and health outcomes articles included were cost studies, typically cost-effectiveness and cost-benefit analyses.

Data Abstraction Process

The review team responsible for including a given article was also responsible for abstracting data and other key information from that article. The goal of the data abstraction process was to abstract essential information directly into the template of the evidence table in the format required by the Agency for Health Care and Research (AHCPR).

The evidence table entries were created by one member of each review team, over-read by the other team member, and then revised. For the cost and health outcomes articles, each evidence table entry was again over-read by a clinician.

Figure 2. Map of cervical cytology classification schemes
Classification SystemWithin Normal LimitsBenign Cellular ChangesEpithelial Abnormalities
The Bethesda System (National Cancer Institute Workshop, 1993)NormalInfection reactive repairASCUSSquamous intraepithelial lesion (SIL)Invasive carcinoma
Low grade (LSIL)High grade (HSIL)
Richart (1973) Condylom aCervical intraepithelial neoplasia (CIN)
Grade 1 Grade 2Grade 3
Reagan (1979) (WHO)AtypiaMild dysplasiaModerate dysplasiaSevere dysplasiaIn situ carcinoma
Papanicolaou (Nyirjesy, 1972)IIIIIIIVV

ASCUS = atypical squamous cell of uncertain significance.

For the diagnostic testing articles, both members of each paired clinician-reviewer team also independently completed 2-by-2 tables for each study, extracting the key data permitting calculation of sensitivity, specificity, prevalence, and other data to be used in the meta-analysis. The main outcome measures considered were the sensitivity and specificity of cytologic abnormality on Pap test for detecting cases, where "cytologic abnormality" was defined by one of three thresholds ranging from atypical squamous cells of uncertain significance (ASCUS) (threshold 1) to LSIL (threshold 2) to HSIL (threshold 3), and where a "case" was defined as a histologicdiagnosis of dysplasia or carcinoma. Equivalent categories in other classification schemes were also used according to a mapping of cervical cytology classification schemes (Figure 2). Data were abstracted according to four different combinations of cytological versus histological thresholds: ASCUS/CIN1, LSIL/CIN1, LSIL/CIN2-3, and HSIL/CIN2-3. Indeterminant cases (e.g., due to uninterpretable cytology or lack of reference standard confirmation) were noted in the data abstraction process but excluded from the calculation of sensitivity and specificity. Where more than one study population was included in a single report with data provided separately, they were treated as individual studies. All the 2-by-2 table data were entered into a computer database, and data from the two members of each reviewing team were compared and reconciled.

Some studies that failed to permit calculation of sensitivity and specificity directly nonetheless allowed calculation of relative TPR and relative FNR (false negative rate) according to the method of Chock, Irwig, Berry et al. (1997). This method was applied to studies comparing two independently applied screening tests in which verification was obtained on any patient testing positive on either test (but not those testing negative on both tests).

Criteria for Evaluating the Quality of Articles

Given the two very distinct sets of articles reviewed for the evidence report, two separate sets of criteria to judge their quality were needed.

Articles on Diagnostic Testing

The goal was to develop a single numeric or alphabetic summary of the methodological quality of individual studies. A systematic approach was used to reach consensus on rigorous, predetermined methodological criteria and the relative numeric weights to be assigned to each. The participants were nine members of the study's work group (six clinicians, two economists, one health policy analyst). Each member of the group was familiar with the diagnostic testing articles. The group initially identified more than a dozen evaluation criteria; the consensus process narrowed the list to seven. The weight given to each criterion was determined by each participant writing his/her numerical assessment of the value of that criterion (blinded to the rest of the group). The means of these "votes" were calculated. Each participant received a copy of his/her own responses, graphically depicted in relation to the mean of all responses, and was requested to confirm or reconsider his/her responses. The revised responses were then calculated and the means affirmed by the group as the weight. The quality review criteria and their weights (in parentheses) are:

  • 1

    Were the test and reference standard measured independently (blind) of each other?yes (2); no (0)

  • 2

    Was the test compared with a valid reference standard?histology (2); histology or negative colposcopy (1); cytology (0)

  • 3

    Was the choice of patients who were assessed by the reference standard independent of the test's results? all test positives and test negatives verified (2); test positives and random fraction of test negatives verified (1); test positives and selected sample or none of test negatives verified (0)

  • 4

    How was the study sample collected?consecutive or random (2); other (0)

  • 5

    Was the spectrum of disease/nondisease defined? yes (1); no (0)

  • 6

    How was the study published? paper (1); abstract (0)

  • 7

    What was the industry relationship to the study? neither done nor supported by a manufacturer (1); supported by a manufacturer (1/2); done by a manufacturer (0)

Articles on Cost and Health Outcomes

The major difficulty with this set of articles was that the two types of studies (health outcomes and costs), sometimes distinct and sometimes overlapping, generated many criteria. There were several expert sources of quality criteria by which to judge these articles that were readily affirmed by the work group and reaffirmed by the study's Advisory Panel of Technical Experts (Drummond, Richardson, O'Brien et al., 1997 ; Gold, Siegel, Russell et al., 1996 ; O'Brien, Heyland, Richardson et al., 1997 ; Siegel, Weinstein, Russell et al., 1996).

Letters representing the criteria that were not satisfied by a given article are shown in the last column of the evidence table. The criteria were:

For articles that modeled health outcomes:

  • 1

    There was a description or diagram of a decision model.

  • 2

    The clinical strategies were described in detail.

  • 3

    A comprehensive literature review was conducted.

  • 4

    The control group was described.

  • 5

    Sources of data for probability estimates were identified.

  • 6

    The source of the utility scale or measurement method was given.

  • 7

    Clinically relevant outcomes of interest were described in detail.

  • 8

    Results of a sensitivity analysis were presented.

For articles that modeled costs:

  • 9

    There was a description of the framework for the cost analysis.

  • 10

    Modeling assumptions were stated explicitly.

  • 11

    There was sufficient description of estimates of types of resources used, amount of resource use, and unit costs, and their sources.

  • 12

    There was a critique of data quality.

  • 13

    The cost data were current, i.e., 1988 or more recent.

  • 14

    There was a description of a method to adjust costs for inflation.

  • 15

    There was a statement of discount rates used (for multiperiod or longitudinal studies).

For articles that compared health outcomes with and without Pap smear screening or with and without use of new Pap technology:

  • 16

    Study population was representative or clearly defined.

  • 17

    Screened and unscreened populations were concurrent (e.g., before-after design).

  • 18

    Screened and unscreened populations were randomly allocated.

  • 19

    The method of ascertainment of health outcomes was the same in the screened and unscreened groups.

  • 20

    The ascertainment of health outcomes was complete (loss to followup <5 percent ).

Methodology: Cost Estimates

The costs of screening, diagnosing, and treating cervical cancer were estimated by Health Economics Research, Inc., utilizing both claims and secondary data sources. A comprehensive approach was used to incorporate costs associated with all medical services provided. For instance, the costs associated with both physician and pathology services were included in the total cost of screening for cervical cancer. In the case of diagnosis and treatment, all costs of providing outpatient and inpatient services were estimated. However, societal costs or indirect costs, such as those associated with lost wages or with waiting time at the physician's office, were not included. The methodology employed is discussed in detail below.

Separate estimates are provided for those 20-64 years of age, and those 65 years and older (eligible for Medicare). The costs for the under-65 population were estimated from MEDSTAT data, and for the older subgroup Medicare's resource-based relative value scale (RBRVS) fee schedule, clinical laboratory fee schedule, diagnosis-related group payment rates, and ambulatory surgery center payment rates were utilized. Charges and payment information obtained from these sources were then converted to reflect costs associated with the services provided. Finally, all costs were inflated to 1997 dollars. The methodology used to estimate the cost for each group is discussed separately, starting with the estimates for women 20-64 years.

Cost Estimates for Women 20-64 Years

To obtain cost estimates for this age group, Health Economics Research, Inc., analyzed the MarketScan database created by the MEDSTAT Group and previously used in cost analyses of AHCPR practice guidelines. The MEDSTAT MarketScan database provides claims information for a large sample of privately insured individuals employed at large private companies across the country. The claims data are collected from over 100 different insurance companies, Blue Cross and Blue Shield plans, and third-party administrators (for self-insured employers). The data represent the medical experiences of insured active employees, early retirees, Consolidated Omnibus Budget Reconciliation Act (COBRA) participants, and their dependents. These data were used to estimate screening and treatment costs for the population under the age of 65. For our analyses, we used four types of claims files: inpatient claims, physician claims, outpatient department claims, and pharmaceutical claims. Information on service dates, procedures, diagnoses, and payments was abstracted. We analyzed data for a 3-year period, 1991-93, which represented the medical experience for roughly 2.5 million privately insured beneficiaries with more than 500,000 inpatient admissions and over 76 million outpatient claims.

Although the MarketScan database is a rich source of medical utilization information for the under-65 population, a number of issues arise when these data are used that may or may not affect our analyses. First, our ability to track treatment costs over time due to individuals' dropping in and out of the MarketScan database because of employment changes is limited. Thus, the data were used to estimate average costs of treatment and screening at an aggregated level. It is also important to note that the MarketScan data represent a nonrandom sample of the under-65 population. The MarketScan database includes a disproportionate number of persons employed in large firms, especially in California, Michigan, Ohio, and Texas. To the extent that large employers generally offer more generous insurance benefits than do smaller firms, utilization estimates derived from the MarketScan database are likely to overstate the general under-65 utilization experience. This bias would most directly affect cost estimates for cancer treatment, rather than cost estimates for specific diagnostic procedures or therapeutic services, as episode of care estimates are most affected by differential utilization patterns. Better insured women may use more services as well as more expensive services than less well-insured women. Last, the MarketScan database reflects the mix of fee-for-service and managed care arrangements present in the early 1990s. To the extent that the risk sharing and payment arrangements for the set of large firms in the MarketScan database differ systematically from those observed for the general under-65 population, there might be biases in the derived cost estimates. However, the direction of the bias is not readily known.

We used the MEDSTAT data to estimate the unit cost of screening, diagnosis, and treatment of cervical cancer. Since 3 years of data (1992, 1993, and 1994) were analyzed, all 1992 and 1993 charges were inflated to reflect 1994 charges. The national medical care services (includes hospital and physician services) inflation rate of 12.02 percent for 1992 charges and 5.17 percent for 1993 charges were utilized (CPI-U, U.S. Bureau of Labor Statistics, 1982-84).

The costs associated with hospital and physician services were derived from the MEDSTAT data estimates by utilizing appropriate adjustments. In the case of services provided in a hospital setting, the total covered charge for each service submitted by the hospital was identified from the MEDSTAT data. The inpatient hospital charges include room and board and ancillary charges. To convert these charges to costs, 1994 cost-to-charge ratios published by the American Hospital Association (AHA) (obtained by AHA through its Annual Survey of Hospitals) were utilized. The 1994 cost-to-charge ratio was 0.58. That is, on average across the Nation, the cost associated with the inpatient services provided was 58 percent of the charge.

Since no similar cost-to-charge ratio was available for physician charges, it was assumed that the payments made by the health care plans closely reflected the costs associated with providing these services. Thus, payments were used as a proxy for costs. This approach provides a "total" cost estimate for a given procedure or set of procedures. It not only includes the amount reimbursed by the insurance plan but also any copayments made by the patient. These payments differ from charges because they include only the amount eligible for payment under the benefit plan after pricing guidelines such as discounts and fee schedules are applied.

Cost Estimates for Women 65 Years and Older

Medicare's RBRVS fee schedule for physician services, Medicare's Clinical Laboratory fee schedule for laboratory services, national average diagnosis-related group (DRG) payments for hospital admissions, and ambulatory surgery center payment rates for outpatient procedures were used to identify the payments associated with services received for cervical cancer screening, diagnosis, and treatment. The 1994 Medicare payments were used to allow for comparability in the estimates between the over-65 and the under-65 age groups (MEDSTAT data estimates). The Medicare payments were assumed to closely approximate the national average cost of providing the service, and therefore no conversion factors were employed (Eisenberg, 1989).

Cost estimates for both the under-65 and the over-65 age groups were updated to reflect 1997 dollars. Separate inflators were employed for services provided at institutional and noninstitutional settings. Services received at institutions were updated to 1997 dollars using the DRI-Prospective Payment System (PPS) Hospital Market Index and Health Care Financing Administration (HCFA) Capital Index. A rate of 9.85 percent was utilized. The rate was calculated using DRI-PPS Hospital Consumer Price Survey (CPS) Market Index (DRI/McGraw Hill) for operating costs, and HCFA Capital Index (Federal Register) for capital cost. A rate combining the operating and capital inflator was obtained by assuming that 90 percent of costs were related to operations and 10 percent to capital. The Medicare Economic Index (MEI) was used to inflate 1994 costs of all noninstitutional services to 1997 dollars. The MEI of 8.40 percent was utilized.

Methodology: Meta-Analysis of Pap Test Accuracy

To combine data from multiple studies describing the performance of the conventional Pap test in discriminating between patients with and without cervical lesions, we used the effectiveness score, a measure described by Hasselblad and Hedges (1995) and Fahey, Irwig, and Macaskill (1995). The effectiveness score takes account of both sensitivity and specificity. As is well known, sensitivity and specificity are interdependent; both vary according to the test threshold used, but in opposite directions. The effectiveness score allows us to consider both measures by fitting a receiver operating characteristic (ROC) curve (Swets, 1988) through a logistic odds transformation of sensitivity and specificity, thus accounting for the interdependence of the two.

The classifications used to describe the results of cervical cytology in the clinical research literature have an ordered categorical response instead of a dichotomous one. Such a response permits the use of multiple discrete cut-points for defining an abnormal Pap test. In addition to these explicit thresholds, there may be differences in the implicit thresholds used by cytologists in labeling a slide as abnormal. One of the main advantages of using the effectiveness score is that it is often independent of the cut-points used for the test.

The effectiveness score is more normally distributed than either sensitivity or specificity and can be thought of as a gauge of the overall discriminatory ability of the test. To standardize the effectiveness score, we adjusted the standard deviation of the logistic normal distribution curve by the square root (3)/pi to get the following measure:

delta = (square root [3]/pi) * (log(sensitivity/[1-sensitivity])
+ log(specificity/[1-specificity]))

Because it is standardized, the effectiveness score can be interpreted across different diagnostic tests. In general, a score of 3 reflects a test with good discrimination, whereas a score of 1 reflects a test that does not discriminate between disease positives and disease negatives.

Another measure, the log odds ratio, may be more familiar to some readers:

log odds ratio = log(sensitivity/[1-sensitivity]) + log(specificity/[1-specificity]).

This measure differs from the effectiveness score by multiplication by a constant, square root (3)/pi.

We calculated summary measures of effectiveness at each of the four explicit diagnostic thresholds (ASCUS/CIN1, LSIL/CIN1, LSIL/CIN2-3, HSIL/CIN2-3), using maximum likelihood estimation techniques and a random effects model. We chose a random effects model because of the wide variation in reported sensitivity and specificity among the included studies. We also evaluated the effect of differences in implicit threshold on effectiveness scores by including the following variable, S, in the model:

S = log(sensitivity/[1-sensitivity]) - l og(specificity/[1-specificity])

Because the model results were not changed by including this value, we used the parameter estimates from the simpler one-parameter (intercept only) logistic equations to generate summary estimates of sensitivity and specificity and to produce ROC curves.

Our data included studies with different reference standards, primarily histology (cone biopsy, hysterectomy, or punch biopsy) and negative colposcopy. These reference standards are themselves subject to inaccuracy, and one problem that may occur when a diagnostic test is compared with an imperfect independent reference standard is underestimation of sensitivity and specificity (Boyko, 1996). In such a case, the sensitivity and specificity of the test may vary with prevalence of disease. To address the possible imperfect nature of histology/colposcopy, we evaluated the effect of disease prevalence on our summary estimates of effectiveness. Multiple logistic regression was used to estimate the variation in overall effectiveness due to variations in prevalence. We also evaluated the effect of study quality on summary effectiveness scores, initially using individual components of the score, then using the total score both as a continuous and a dichotomous (quality score <7, >7) variable.

Methodology: Cost-Effectiveness Analysis

Introduction

This section describes our response to the question, "What are the effects on total health care costs, morbidity, and mortality of regular cervical cytological screening using thin-layer cytology or computer rescreening compared with conventional Pap smear in women participating in a screening program?"

Our basic approach to this question was to create a decision analytic model to simulate the health and economic impact of various cervical cancer screening strategies based on the results of our literature review and meta-analysis. The model, borrowing from previous similar efforts, was created to generate a wide variety of health and cost outcome projections, including survival, cumulative cost, and incremental cost-effectiveness. However, because limited data were available regarding the natural history of cervical cancer and the costs and effectiveness of cervical cancer screening and treatment, we concluded that the sort of precise projections that would be of use to decisionmakers would not be possible. In particular, we determined that the aggregate effect of uncertainty in the model inputs (see below) led to substantial uncertainty in the outputs, such that a point estimate of incremental cost-effectiveness could not be taken as a reliable representation of the value of many of the screening strategies under investigation. As a consequence, we focused the model on the general question, "What characteristics would a cervical cancer screening strategy need to have in order to be cost-effective?"The results of this investigation are intended to guide future research regarding cervical cancer screening strategies.

Two model inputs that are crucial to the precise estimation of the cost-effectiveness of new screening technologies are particularly uncertain, namely, test operating characteristics (sensitivity and specificity) and test costs. Evaluation of the sensitivity and specificity of new screening technologies is particularly problematic given the discrepancy in reference standards. As previously stated, because the clinical management of the patient depends on cytology, colposcopy, and histology taken together (and certainly not cytology alone), histologic diagnosis, or at least colposcopic diagnosis, is the most appropriate reference standard. However, given the practical and methodological difficulties in applying this reference standard, and the paucity of studies available that used such a standard, we attempted to identify other studies that met alternative criteria, as outlined above. Unfortunately, there were few studies that met even these alternative criteria, and the resulting estimates of relative true positive rates and false positive rates had very wide confidence intervals.

Our estimates for the costs associated with implementation of ThinPrep ®, AutoPap®, and Papnet® are based on a variety of sources, including the manufacturers. There is a wide range of estimates, and issues such as training costs, amortization of equipment, and labor costs are contestable. Our original estimate for one technology, based primarily on information from the manufacturer, was thought to be unacceptably low by every reviewer of the draft report except that manufacturer.

Because of the difficulty in estimating operating characteristics and costs that have face validity for all readers, we decided that presenting cost-effectiveness estimates for generic technologies across the range reported would ultimately prove more useful to readers than selecting base case estimates that were uncertain and lacked face validity. Our alternative approach is to provide cost-effectiveness estimates across a range of test characteristics and incremental costs and to identify ranges of these parameters (thresholds) that lead to cost-effective cervical cancer screening. This allows readers to use their own judgment about which costs and effectiveness measures are most appropriate. The thresholds also provide a "target" for future studies -- convincing evidence of either cost or effectiveness that meets these threshold values will be evidence for acceptable cost-effectiveness. To perform this analysis, we considered two generic strategies for improving Pap sensitivity -- 3/4improving the sensitivity of the initial screening test, or allowing 100 percent rescreening with improved sensitivity. By varying sensitivity, specificity, incremental cost, and screening frequency, we sought to identify threshold values where the incremental cost per life-year saved was within generally acceptable limits (in this case, we used a threshold value of $50,000/life-year).

To address the general question of what would constitute a substantially valuable improvement in Pap technology, we reframed the specific question to be addressed by the cost-effectiveness modeling to: "What are the ranges of incremental cost, sensitivity, specificity, and screening frequency that meet conventional levels of cost per life-year saved for technologies that improve Pap test performance by (1) improving the sensitivity of the initial screening step, or (2) allowing 100 percent rescreening at improved sensitivity?"

Also, since some decisionmakers prefer presentation of specific outcomes rather than an aggregate outcome, the incremental cost-effectiveness ratio, we pursued a secondary question: "What are the expected number of cervical cancer cases, deaths, and treatments prevented at different levels of test sensitivity and screening frequency?"

Structure of the Model

For the cost-effectiveness analysis, we constructed a model with two major components. The first component is a 20-state Markov model (Sonnenberg and Beck, 1993) intended to simulate the natural history of cervical cancer in the absence of screening. The second component is an intervention model that represents possible screening strategies. The model was developed in DATA 3.0 software (Boston, MA: TreeAge Software, Inc).

Natural History

Table 1. Markov Model Description
StateDefinitionAllowable Transitions
WellNever infected with HPV, no history of SILWell, benign hysterectomy, death from other cause, undetected HPV
Benign hysterectomyHysterectomy for cause other than cervical cancer or SILBenign hysterectomy, death from other cause
Death from other causeDeath from cause other than cervical cancerAbsorbing state
Death from cervical cancerDeath from cervical cancerAbsorbing state
Undetected HPVUndiagnosed cervical HPV infectionWell, death from other cause, benign hysterectomy, undetected HPV, LSIL
Detected HPVDiagnosed cervical HPV infection or posttreatment for SILWell, detected HPV, death from other cause, benign hysterectomy LSIL, HSIL
LSILLow-grade squamous intraepithelial lesionLSIL, death from other cause, benign hysterectomy, undetected HPV, detected HPV, well, HSIL
HSILHigh-grade squamous intraepithelial lesionHSIL, death from other cause, benign hysterectomy, detected HPV, undetected HPV, LSIL, well
Unknown Stage I Undiagnosed Stage I cervical cancer Unknown Stage I, detected Stage I, death from other cause, unknown Stage II
Detected Stage IStage I cancer diagnosed by Pap or symptomsDetected Stage I, death from cervical cancer (over 5 years), death from other cause
Stage I survivor5 years after initial diagnosis of Stage IStage I survivor, death from other cause
Unknown Stage II Undiagnosed Stage II cervical cancer Unknown Stage II, detected Stage II, death from other cause, unknown Stage III
Detected Stage IIStage II cancer diagnosed by Pap or symptomsDetected Stage II, death from cervical cancer (over 5 years), death from other cause
Stage II survivor5 years after initial diagnosis of Stage IIStage II Survivor, death from other cause
Unknown Stage III Undiagnosed Stage III cervical cancer Unknown Stage III, detected Stage III, death from other cause, Unknown Stage IV
Detected Stage IIIStage III cancer diagnosed by Pap or symptomsDetected Stage III, death from cervical cancer (over 5 years)
Stage III survivor5 years after initial diagnosis of Stage IIIStage III Survivor, death from other cause
Unknown Stage IVUndiagnosed Stage IV cervical cancer Unknown Stage IV, detected Stage IV, death from other cause
Detected Stage IVStage IV cancer diagnosed by Pap or symptomsDetected Stage IV, death from cervical cancer (over 5 years)
Stage IV survivor5 years after initial diagnosis of Stage IVStage IV survivor, death from other cause
The model follows a cohort of women from age 15 to 85 and assumes that, at the beginning of the simulation, the prevalence of HPV infection is 10 percent and the prevalence of LSIL is 1 percent. Cycle lengths are 1 year long. Women infected with HPV can undergo regression, no change, or progression. Although most progress from HPV to LSIL, a proportion progresses directly to HSIL. Women with LSIL can undergo regression (to either "Well" or the HPV-infected state), no change, or progression to HSIL. Women with HSIL can regress, stay in the same state, or progress to Stage I cancer. Women with cancer either become symptomatic or progress through Stages II-IV. Once a cancer diagnosis is made, the probability of survival is stage-specific. Women without cancer are at risk for hysterectomy for other causes, and all women are at risk for death from other causes. The states and transitions of the natural history model are shown in Table 1.

Table 2. FIGO Staging of Cervical Cancer
StageDefinition
0 Carcinoma in situ (corresponds to CIN III or HSIL) Ia Confined to the cervix, detectable by microscopy only, depth of invasion not greater than 3 mm in depth or 7 mm in width Ib Confined to the cervix, larger than Ia IIa Involvement of upper 2/3 of vagina, no parametrial involvement IIb Involvement of parametria but not to pelvic sidewall IIIaInvolvement of lower 1/3 of vagina but not to pelvic sidewall if parametria involved
IIIbExtension to pelvic sidewall and/or hydronephrosis or nonfunctioning kidney
IVaInvolvement of mucosa of bladder or rectum
IVbDistant metastasis or disease outside of pelvis
Cervical cancer is staged clinically. The two most commonly used staging systems are the Federation Internationale de Gynecologie et Obstetriques (FIGO) staging system and the Tumor, Nodes and Metastases (TNM) staging system of the American Joint Committee on Cancer (AJCC). The two systems are compatible as described in the AJCC Manual (American Joint Committee on Cancer, 1997). The FIGO staging system is described in Table 2.

Because cervical cancer is clinically staged, findings at exploratory surgery or during subsequent treatment do not change the stage. Progression of disease also does not change the stage. In addition, imaging modalities such as computerized axial tomography (CAT) scanning or magnetic resonance imaging (MRI), while useful in determining the extent of the disease, are not permitted to be used in assigning stage.

The FIGO staging system is not used by the Surveillance, Epidemiology, and End Results (SEER) Program of the National Cancer Institute, the source of the most reliable estimates of cervical cancer incidence and mortality. SEER reports cancer stages as "Local," "Regional," and "Distant." This classification is also used in the MEDSTAT data we used to generate cost estimates associated with cervical cancer. Although cervical cancer staging does not create as many difficulties in synthesizing data for the purposes of constructing a model as does the variety of classification systems for preinvasive cervical changes, there are still some issues that must be addressed.

First, the definition of Stage 0 has changed with time. Initially, this referred only to carcinoma in situ (CIS). Subsequently, when carcinoma in situ was included as CIN3, lesions that previously would not have been included as cervical cancer cases were classified as such at some SEER sites. Adaptation of The Bethesda System further increases the potential number of cases, since lesions previously classified as CIN2 are included along with CIN3 and CIS in the HSIL classification. Because of this, cases that would not meet FIGO criteria for invasive cancer may be included in SEER data (Noller, 1996).

Second, it is unclear whether the Local/Regional/Distant classification is based on clinical, radiological, or surgical findings. Cervical cancer spreads most commonly by local extension and/or by lymphatic spread. The likelihood of pelvic lymph node metastases increases with advancing FIGO stage (DiSaia and Creasman, 1997), but if the diagnosis of nodal metastases is made on the basis of surgery or imaging modalities such as CAT or MRI, the initial FIGO stage should not change.

Interventions

Screening strategies

An external file that holds a picture, illustration, etc., usually as some form of binary object. The name of referred object is f9_F003.jpg.

   Figure 3. Distribution of Pap results based on primary and rescreening steps. The model assumes that the sensitivity of cytological screening using an ASCUS threshold is identical for all histological states

An external file that holds a picture, illustration, etc., usually as some form of binary object. The name of referred object is f9_F009.jpg.

   Figure 9. Distribution of cytological results for women with the cancer. For a sensitivity of 51 percent, 49 percent of results after the primary screening step will be normal, 6.2 percent ASCUS, 3.7 percent LSIL, 10.6 percent HSIL, and 30.6 percent cancer

We compared no Pap smear screening to the following screening strategies: (a) conventional Pap smears at 1-, 2-, and 3-year intervals; (b) a technology that improves the initial screening step, increasing overall test sensitivity by 40, 60, or 90 percent at 1-, 2-, and 3-year intervals; and (c) a technology that allows 100 percent rescreening, again increasing overall test sensitivity by 40, 60, or 90 percent at the same screening intervals. Strategies for the evaluation of abnormal cytological results are discussed below. The intervention strategies are illustrated in Figures 3 - 9 . We compared screening with the different technologies at intervals of 1, 2, and 3 years, since this reflects current recommended maximum screening intervals in the United State (U.S. Preventive Services Task Force, 1996). Although several European countries recommend screening at 5-year intervals (Koopmanschap, van Oortmarssen, van Agt et al., 1990), no American organization recommends a screening interval of longer than 3 years.

We did not model combinations of screening intervals--for example, every 3 years after three consecutive negative annual Pap tests as recommended by the American Cancer Society (U.S. Preventive Services Task Force, 1996). Modeling such strategies significantly increases the complexity of the model. A model derived from age-specific incidence rates in unscreened populations suggests that the age at which screening occurs, rather than the interval between screens or the number of preceding negative smears, is the most important determinant of screening efficiency (Gustafsson and Adami, 1992). This is consistent with the natural history of the disease, where the incidence and prevalence of low-grade lesions that are unlikely to progress are high in younger ages, and high-grade lesions have peak incidence and prevalence between ages 30 and 45. Thus, three negative annual smears at age 18 or 20, just when HPV incidence is peaking, might have a low predictive value for the development of lesions 15-20 years later and would not significantly improve screening efficiency.

Diagnostic Strategies for Abnormal Pap Tests

We assumed that any cytologic result of HSIL or cancer will result in immediate referral for colposcopy and biopsy if warranted and that endocervical curettage is performed routinely as part of the colposcopic evaluation. Because not every patient will undergo endocervical curettage, this assumption may overestimate the sensitivity of colposcopy and overestimate the average cost of colposcopy. However, because we use the global estimate for the evaluation of LSIL and HSIL in the base case and vary these costs in sensitivity analysis, this assumption should not have a major impact on the cost-effectiveness estimates. We also assumed that a colposcopic or histologic diagnosis that is significantly less advanced than the cytologic result (for example, a biopsy result of LSIL in the setting of a cytologic diagnosis of cancer, or a biopsy result of normal in the setting of cytologic HSIL) would result in either a cone biopsy or a loop electrosurgical excision procedure (LEEP) (DiSaia and Creasman, 1997) after the exclusion ofdisease elsewhere in the lower genital tract.

We compared two possible strategies for the management of ASCUS and LSIL. In the base case, we assumed that all patients with LSIL or higher will receive colposcopy, whereas those with ASCUS will receive a repeat smear within 6 months, followed by a colposcopy if abnormal. Because this strategy depends on a Pap of high sensitivity (since colposcopy depends on the persistence of an abnormality), we used this in the base case. We also evaluated an "aggressive" strategy where any Pap with a result of ASCUS or higher is referred for colposcopy as the base case, since this should result in the highest detection rate. Both of these strategies are within current American College of Obstetricians and Gynecologists (ACOG) recommendations

Measures of Effectiveness

We estimated the cost-effectiveness of cervical cytology using several outcomes. First, we calculated the cost per year of life saved. This allows comparison with other health interventions and with other cost-effectiveness analyses of cervical cancer screening. However, use of life expectancy, or life-years saved, may underestimate some of the benefits of screening. Early-stage cervical cancer is slow-growing and has a very high cure rate (approximate 85-90 percent 5-year survival for Stage I tumors). Thus, failure to detect preinvasive lesions that are then detected as Stage I tumors will not result in much decrement in life expectancy. However, the therapeutic options for all but the most minimally invasive Stage I tumors are radical hysterectomy or radiation therapy, both of which have substantial short- and long-term morbidity that may have significant impact on quality of life.

We were unable to identify any data that would allow calculation of quality-adjusted life years. We therefore calculated other outcomes that can be used to draw some inferences about morbidity. We produced estimates of cost per cervical cancer death prevented and per cervical cancer case prevented. In addition, by estimating the number of cases presenting in each stage and using data on the probability of specific treatments by stage, we also estimated the number of morbid therapies avoided using different screening strategies.

Assumptions of the Model

Perspective

We used a U.S. health system perspective and evaluated the direct health care specific costs associated with screening, diagnosis, and treatment of cervical cancer and its precursors. We did not consider other societal costs such as direct nonmedical costs (e.g., costs attributable to work lost by patients or caregivers due to screening, diagnosis, treatment, or premature mortality). A societal perspective that included these costs would be preferable (Gold, Siegel, Russell et al., 1996); however, estimating these costs (for example, wage/wage equivalent costs, transportation time, and time lost while waiting for an appointment) would be extremely difficult. Invasive cervical cancer is a disease affecting mostly midlife and older women, whereas the much more common HPV infection and SIL, most of which do not progress to cervical cancer, are seen primarily in younger women; therefore, it seems likely that the nonhealth care costs associated with screening and treatment of SIL are greater than those associated with the diagnosis and treatment of invasive cervical cancer. If this is true, then the cost-effectiveness of screening, and of new technologies to improve screening performance, will be more favorable from a health care system perspective than from a societal perspective, especially if new technologies increase sensitivity at the expense of specificity.

Discount Rate

We discounted costs and years of life at 3 percent annually in the base case and varied the discount rate from 0 to 5 percent in sensitivity analysis (Gold, Siegel, Russell et al., 1996).

Population

Our model follows a cohort of U.S. women from age 15 through 85. We calculated screening strategies as described above beginning at age 15, at age 18 (as recommended by ACOG), at age 20 (Eddy, 1990), at age 35 (Gustaffson and Adami, 1992), at age 50, and at age 65 (Fahs, Mandelblatt, Schechter et al., 1992). We also varied the incidence and progression rates of HPV and SIL to reflect higher risk populations. However, it is likely that other parameters in the model would also be affected in these populations (for example, smoking, which is associated with higher rates of preinvasive and invasive cervical cancer, is also associated with a higher risk of mortality from other causes).

Histological Subtypes of Invasive Cervical Cancer

Squamous cancer of the cervix accounts for approximately 80-85 percent of invasive cervical cancer cases. Adenocarcinoma, which accounts for another 10-15 percent, may be increasing in incidence (DiSaia and Creasman, 1997). Cervical cytology may also be less sensitive for adenocarcinoma. However, we did not distinguish between histologic subtypes in any of our estimates for screening or treatment. This is consistent with other models.

Similarly, the clinical implications of a cytological diagnosis of ASCUS (atypical SQUAMOUS cells of uncertain significance) and AGUS (atypical GLANDULAR cells of uncertain significance) appear to be quite different. A cytological diagnosis of AGUS may be associated with premalignant or malignant changes of the cervix or uterus in as many as 30-35 percent of cases (Veljovoich, Slater, Anderson et al., 1998). The diagnostic evaluation of AGUS usually involves endometrial biopsy as well as colposcopy and endocervical curettage. When ASCUS and AGUS are combined for purposes of conditional probabilities, the proportion of true histological lesions with cytologic diagnoses of atypia is increased. Using ASCUS as our threshold has the overall effect of increasing the sensitivity of screening. Given the relatively low proportion of AGUS as a diagnosis (0.3%), this is unlikely to have a significant impact on our cost-effectiveness estimates.

Patient and Provider Behavior

Our model assumes that all women in the cohort receive the screening test at the appropriate interval and that all patients receive appropriate diagnostic and therapeutic interventions based on the results of the screening tests. Lack of appropriate followup on the part of the patient or provider accounts for a significant proportion of cervical cancer cases (Janerich, Hadjimichael, Schwartz et al., 1995), and lack of patient compliance with treatment has been shown to increase the marginal cost-effectiveness of other screening tests (Myers, Thompson, and Simpson, 1998). Between 30 and 75 percent of women with abnormal Pap smears results do not return for followup (Paskett and Rimer, 1995; Paskett, Carter, Chu et al., 1990). We did not directly test the impact of postscreening behavior on cost-effectiveness estimates. However, patient failure to return for followup visits, or inappropriate management by providers, clearly decreases the effectiveness of a screening program. The actual cost-effectiveness of any screening program will vary depending on patient and provider behavior.

We also did not test the impact of complete lack of screening on the overall cost-effectiveness of different screening strategies. A model based on Dutch data showed that increasing participation was significantly more cost-effective than decreasing screening intervals (Koopmanschap, van Oortmarssen, van Agt et al., 1990). The proportion of women with no prior screening in series of women with invasive cervical cancer is between 30 and 40 percent (Bearman, MacMillan, and Creasman 1987; Janerich, Hadjimichael, Schwartz et al., 1995; Pretorius, Semrad, Watring et al., 1991; Schwartz, Hadjimichael, Lowell et al., 1996). Current Public Health Service estimates are that 5-10 percent of women have never had a Pap smear (Martin, Calle, Wingo et al., 1996). Clearly, the issue of the appropriate allocation of resources for improving the sensitivity of current screening tests versus improving the utilization of any screening test in women who are unscreened or underscreened is important for policymakers to consider. Given reliable estimates of the costs and effectiveness of interventions designed to improve screening rates, this model could be further developed to explore this issue. However, estimating the effectiveness of programs to improve participation in screening is difficult and is beyond the scope of the current report.

Further Assumptions

Specific assumptions associated with individual model parameters are discussed in detail in the appropriate sections. Our overall aim in selecting base case estimates was to bias the model in favor of more sensitive screening strategies. In addition to the use of a health system perspective (likely creating a bias in favor of more sensitive screening strategies as mentioned above), the following assumptions were incorporated into the cost-effectiveness model:

  • 1

    We chose an age-specific incidence rate for cervical cancer 30 percent higher than that reported in the unscreened U.S. population of the 1930s.

  • 2

    We chose estimates for progression rates from HPV to LSIL, from LSIL to HSIL, and from HSIL to cancer at the higher end of the reported ranges.

  • 3

    We used a base case estimate of conventional Pap sensitivity that is considerably lower than that used in most previous models of cervical cancer screening.

  • 4

    Our cost estimates for the diagnosis and treatment of cervical cancer are based on average global diagnosis and treatment charges. Because these incorporate the full range of outcomes, including complications, they are considerably higher than estimates based on average charges for the components of care, the technique used in previous models. The higher the estimated cost of treating cancer, the greater the potential savings from etection and treatment of premalignant lesions.

  • 5

    We assumed that the diagnostic evaluation of abnormal cervical cytology would detect all true histological abnormalities. Failure of colposcopy and biopsy to detect true premalignant or malignant lesions would decrease the clinical effectiveness of screening, therefore increasing the cost-effectiveness ratio.

  • 6

    We assumed that all patients with abnormalities on cervical cytology would receive appropriate followup and treatment. Failure of the patient to return for appropriate followup, or of the provider to institute appropriate followup, decreases the clinical effectiveness of screening, again increasing the cost-effectiveness ratio.

  • 7

    We assumed in the base case that new technologies would increase sensitivity without any decrement in specificity.

  • 8

    Although Pap smears can detect vaginal or cervical infections with organisms such as Candida, Trichomonas, or Chlamydia, we did not consider the impact of detection of these diseases. Other tests are available for the detection of these organisms that are more sensitive and specific than Pap screening.

These assumptions favor strategies that increase the detection of premalignant and early malignant changes; therefore, our assumptions tend to favor more sensitive screening strategies.

Parameter Estimates from Available Data: Natural History

The sources and rationale for specific transition probability estimates and ranges for sensitivity analysis are discussed in detail below. Some transition probabilities are based on large datasets using standard reporting methods (e.g., age-specific death or hysterectomy rates, or stage-specific mortality for cervical cancer); in such cases, estimation was relatively straightforward. However, some transition probabilities had to be estimated from smaller datasets that used nonstandard reporting; in these cases, it was difficult to estimate transition probabilities related to HPV and SIL from reports of incidence, regression, persistence, and progression rates probabilities (Miller and Homan, 1994). Regression and progression are usually reported as percentages of either prevalent or incident infections, with followup time varying widely among studies. In addition, very few data are available that allow calculation of transition probabilities for untreated cervical cancer, such as the annual stage-specific probability of symptom development or the annual probability of progression between stages. Estimates for these probabilities were derived from the literature and prior published models of cervical cancer screening, especially those of Eddy (1990) and Muller, Mandelblatt, Schechter et al. (1990), which have served as the basis for several subsequent models (Brown and Garber, 1998; Radensky and Mango, 1998; Schechter, 1996).

Hysterectomy for Benign Disease

Age-specific hysterectomy rates from the National Hospital Discharge Survey (Lepine, Hillis, Marchbanks et al., 1997) and Maryland discharge data (Kjerluff, Guzinski, Langenberg et al., 1993) were used to estimate age-specific hysterectomy rates. We did not correct these rates for procedures performed for cervical cancer or preinvasive disease for two reasons. First, because the rates do not include radical hysterectomies for invasive cervical cancer, which have a separate International Classification of Diseases, ninth revision (ICD-9) code. Second, the proportion of procedures performed for preinvasive diseases is relatively small, less than 2 percent of all procedures over a 6-year period in North Carolina (E. Myers, unpublished data, 1998).

We did not correct for hysterectomy prevalence in our natural history model, since most population-based cancer incidence rates do not do so. However, since the relative likelihood of hysterectomy with removal of the cervix clearly affects an individual's risk for developing cervical cancer, we included this factor in the cost-effectiveness model and tested the impact of doing so on cost-effectiveness estimates. We did not specifically address continued cytological screening after hysterectomy because of questions about its effectiveness (Fetters, Fischer, and Reed, 1996) and because posthysterectomy Pap smears are screening tests for vaginal cancer, not cervical cancer.

We also did not consider the effectiveness of conventional Pap smears or the new technologies in the followup of treatment for invasive cervical cancer. Our analysis is limited to the use of cervical cytology in screening for previously undetected disease, not the detection of persistent or recurrent cancer after treatment. Costs for followup smears are included in the global estimates of treatment and followup costs.

Human Papillomavirus (HPV) Infection

Although not essential for the current analysis, HPV is included in the model. In this section, we document this component of our cervical cancer screening model is documented.

Incidence of HPV infection

The model assumes that all cases of cervical cancer begin with infection with human papillomavirus. We have incorporated HPV status into the model for two reasons. First, although a small percentage of cervical cancers do not have detectable HPV DNA, even with sensitive assays, there is consensus that HPV infection is the causative agent for the vast majority of cervical cancers (Herrero, 1996; Kiviat, 1996; Koutsky, 1997; Schiffman, Bauer, Hoover et al., 1993). Second, certain HPV types are clearly more likely to progress to cancer than others, and identification of these types in cervical cells may have a role in determining optimal diagnostic and treatment strategies for patients with abnormal Pap smears (Cox, Lorincz, Schiffman et al., 1995). Although this role is still unclear, the current model can be adapted in the future to allow consideration of technologies such as HPV testing in screening for cervical cancer and its precursors. In addition, incorporating HPV data also allows our model to be used for assessing the cost-effectiveness of vaccines against HPV.

The natural history of HPV infection is complex, with clearance and persistence of viral DNA, along with progression to SIL, varying depending on the viral type, patient characteristics such as age and immune status, and study design and assay methods (Herrero, 1996; Kiviat, 1996; Koutsky, 1997; Mitchell, Tortolero-Luna, Wright et al., 1996; Schiffman, Bauer, Hoover et al., 1993). We do not distinguish between different types of HPV. Our incidence, progression, and regression estimates are averages for all viral types. The risk of developing cervical cancer after infection is clearly related to HPV type. With further data on the test characteristics of screening tests incorporating HPV typing, this model can be adapted for estimating the utility and cost-effectiveness of such tests.

Table 3. Age-specific Annual Incidence of HPV Infection
AgeIncidence (percent per year)
150.1
160.1
170.12
180.15
190.17
200.15
210.12
220.1
230.1
24-290.05
30-490.01
50+0.005
We used the following baseline age-specific estimates for HPV incidence in the model and varied them over ranges that, combined with our estimates of regression and progression, resulted in age-specific prevalence of HPV infection and age-specific incidence of cervical cancer within the ranges reported in the literature (Table 3). We varied the incidence rates by factors of 0.5-2 to examine the effects of changing HPV incidence on cervical cancer incidence in unscreened populations. We also varied the age of peak incidence from 20 to 30 in order to examine the effects of a later onset of sexual activity on age-specific cervical cancer incidence.

Regression, persistence, and progression of HPV infection

Estimates of regression and progression rates of HPV infection are subject to variability in study design, patient population, and viral assay techniques and are complicated by the mathematical difficulty of converting rates to probabilities. Reported regression rates for prevalent cases include 70 percent after 2 years in a cohort of adolescents and college-age women (Moscicki, Shiboski, Broering et al., 1998), 68 percent over 14 months for women under 25, and 35 percent for women over 30 (Hildesheim, Schiffman, Gravitt et al., 1994). Ho, Bierman, Beardsley et al. (1998) reported a 1-year regression rate of 70 percent. All of these results were in women with normal cytology. The overall regression rate in a large Finnish cohort was 42.8 percent over 50 months (Kataja, Syrjanen, Mantyjarvi et al., 1989; Kataja, Syrjanen, Syrjanen et al., 1990). However, determination of disease status in this group was made on the basis of cytological evidence of HPV infection. Under the Bethesda System, these would be classified as LSIL.

As with incidence, regression rates and possibly progression rates appear to vary with age. Whether this is truly related to age or is a function of duration of infection (the longer HPV persists, the less likely it is to regress) is unclear. Again, this question can be further explored with the current model. For the purposes of a cohort simulation, there is not much difference between an age-based or duration-based progression rate as long as the age-specific incidence is constant.

Table 4. Regression and Progression Rates of HPV Infection
RateEstimateRange
Regression:
Age 15-240.7/18 months0.60-0.9
Age 25-290.50/18 months0.45-0.6
Age 30-390.25/18 months0.10-0.2
40-490.15/18 months0.10-0.2
Age 50 +0.05/18 months0.05-0.2
Progression:0.2/36 months0.15-0.3
Proportion progressing to HSIL directly0.10.05-0.5
Progression rates are equally difficult to determine. Moscicki, Shiboski, Broering et al.(1998) reported progression to SIL in 15.7 percent over 40 months, and 7.6 percent of these progressed directly to HSIL. Ho, Bierman, Beardsley et al. (1998) reported a cumulative 36-month incidence of SIL of approximately 25 percent, and 6.5 percent of these were high-grade lesions. Koutsky, Holmes, Critchlow et al. (1992) found a 2-year cumulative incidence of HSIL of 28 percent in women with HPV DNA, and only 36 percent of these had had a prior LSIL smear. Consistent with our other estimates, our base case estimates (Table 4) are derived from the higher end of the reported ranges. Cases that progress directly to HSIL are similar to the "rapidly progressive" cases of the Eddy (1990) model. Again, we varied age-specific regression rates in order to achieve an age-specific cancer incidence curve similar to that seen in unscreened populations.

Because there is some uncertainty in the literature about whether treatment of SIL results in total eradication of HPV, we created a Diagnosed HPV state to represent women who have been treated for SIL. We assumed that the probability of progression was 5 percent of the age-specific progression rates. This is equivalent to a 95 percent cure rate for SIL treatment used in other models (Eddy, 1990; Fahs, Mandelblatt, Schechter et al., 1992; Schechter, 1996).

Low-grade and High-grade Squamous Intraepithelial Neoplasia

Table 5. Progression and Regression Estimates for LSIL and HSIL
EstimateRange
LSIL
Regression
Age 15-340.65/72 months0.6-0.8
Age 35 +0.4/72 months0.3-0.6
Progression
Age 15-340.1/72 months0.1-0.3
Age 35 +0.35/72 months0.3-0.5
HSIL
Regression0.35/72 months0.3-0.5
Progression0.4/120 months0.3-0.5
Determining transition probabilities from the literature that accurately reflect natural history is as problematic for SIL as it is for HPV infection. The difficulties in converting rates collected over varying, often unspecified times and in heterogeneous populations are further magnified by differences in terminology. For example, many studies report transitions from HPV-associated cytologic changes to CIN 1 to CIN 2 to CIN 3 to CIS, which may be difficult to translate into the Bethesda System terminology of LSIL and HSIL. For our base case, we use the estimates of Syrjanen, Kataja, Yliskoski et al. (1992), the largest cohort that reports results using the Bethesda System. We assume an age dependence in regression and progression rates (Hildesheim, Schiffman, Gravitt et al., 1994; Munoz, Kato, Bosch et al., 1996; van Oortmarssen and Habbema, 1991). The length of time for HSIL progression is more difficult to estimate from these data than the other parameters; in this cohort, all patients who progressed to carcinoma in situ under an older classification system were treated. Prior models have estimated the duration from severe dysplasia/CIS to invasive cancer of 10-15 years. We use 12 years for the base case (Table 5), since this interval resulted in an age-specific incidence of cervical cancer most consistent with observed data.

Conditional Probabilities of Cytologic Results for Given Histologic States

Because the model is based on transitions from true histological states of LSIL, HSIL, and invasive cancer and Pap smear results are categorical, the probability of a particular cytological diagnosis for a given histological state requires estimates beyond "sensitivity" and "specificity" for cytology.

In the model, any Pap smear result of ASCUS or above is considered abnormal. For patients without histological changes (Well, Unknown HPV Infection, and Detected HPV Infection), the probability of a normal smear will be the specificity of the Pap smear or other test for a cytological threshold of ASCUS or above, since specificity is defined as the probability of a normal test result given no pathology, whereas the probability of an abnormal result, or false positive, is 1.0 minus the specificity.

For patients with histological changes (LSIL and higher), our base case estimate of the overall probability of an abnormal result is equal to our sensitivity estimate, using an ASCUS threshold, from the three studies by Baldauf, Dreyfus, Lehmann et al. (1995); Davison and Marty (1994); and Hockstad (1992). This estimate was 51 percent. We varied the sensitivity up to 78 percent, the mean estimate of the meta-analysis.

For the base case for conventional Pap smears, we used an estimate of specificity based on the three studies of the use of conventional Pap smear in primary screening that were unaffected by verification bias (Baldauf, Dreyfus, Lehmann et al., 1995; Davison and Marty, 1994; Hockstad, 1992). This estimate is 97 percent, which we varied down to 0.88 percent, the mean specificity estimate for all of the studies included in the meta-analysis. For the base case, we assumed that new technologies would have no impact on specificity. We varied the relative specificity of the new technologies from a 50 percent increase in specificity to a 50 percent decrease in specificity.

An external file that holds a picture, illustration, etc., usually as some form of binary object. The name of referred object is f9_F004.jpg.

   Figure 4. Effect of a new technology that improves the sensitivity of the primary screening step on the distribution of normal and abnormal Pap results. The model assumes that 10 percent of slides intially read as normal are rescreened at the sensitivity and specificity of the new technology.

An external file that holds a picture, illustration, etc., usually as some form of binary object. The name of referred object is f9_F005.jpg.

   Figure 5. Effect of a new technology that improves the sensitivity of the rescreening step on the distribution of Pap results. 100 percent of slides that are read as normal by conventional Pap are re-read at the sensitivity and specificity of the new technology

An external file that holds a picture, illustration, etc., usually as some form of binary object. The name of referred object is f9_F006.jpg.

   Figure 6. Distribution of Pap results when underlying histology is normal(including patients with HPV infection but no cytological or histological change) (Jones and Novis, 1996). If specificity of conventional Pap is 97 percent, then 97 percent of cytology results will be normal, 3 percent X 52.5 percent = 1.6 percent will be ASCUS, 1.2 percent LSIL, 0.3 percent HSIL, and 0.01 percent cancer after the primary screening step

An external file that holds a picture, illustration, etc., usually as some form of binary object. The name of referred object is f9_F007.jpg.

   Figure 7. Distribution of specific abnormal Pap results when underlying pathology is LSIL. If sensitivity is 51 percent, then 49 percent of patients with LSIL will have initially Paps, 51% X 23.5% = 12.0 percent will have ASCUS, 34.8 percent LSIL, 4.1 percent HSIL, and 0.08 percent cancer as the cytological diagnosis.

An external file that holds a picture, illustration, etc., usually as some form of binary object. The name of referred object is f9_F008.jpg.

   Figure 8. Distribution of cytological resutls for underlying histological diagnosis of HSIL. At sensitivity of 51 percent, 49 percent of Paps will be normal, 5.3 percent will be ASCUS, 15.3 percent LSIL, 29.9 percent HSIL, and 0.56 percent cancer.

Abnormal results on Pap testing include ASCUS, LSIL, HSIL, and cancer. For each histological state, we calculated the proportion of all abnormal results represented by each possible cytological diagnosis using data from a large data set of biopsy-cytology pairs (Jones and Novis, 1996). For our model, we collapsed the categories as follows: ASCUS and AGUS = ASCUS (this is not meant to imply comparability in the relative clinical significance of these two diagnoses, but rather only to simplify the calculations, as discussed below), indeterminate SIL and LSIL = LSIL, any malignancy or glandular carcinoma-in-situ = cancer. Because data correlating ThinPrep®, AutoPap®, or Papnet® results with histological results were limited, we assumed that these proportions are similar for new technologies. We present these proportions graphically in Figures 3 - 9

We assumed that the sensitivity of the screening test is independent of the underlying histology. In other words, the sensitivity using an ASCUS threshold is identical for LSIL, HSIL, or cancer. This assumption may not be true. This is a complex issue that would be difficult to model in a useful way. In keeping with other models, we have assumed constant sensitivity and specificity across histological states. Misclassification of cases, whether to a higher or lower stage, could significantly alter procedure or cost parameters.

Natural History of Invasive Cancer

Table 6. Progression Rates for Cervical Cancer Stages I through IV
Cancer StageProgression RateAnnual Probability of SymptomsProportion of Cases in Unscreened Population Predicted by ModelProportion Reported in Literature
I0.9/4 years0.150.4630.34-0.55
II0.9/3 years0.2250.2700.13-0.28
III0.9/1.25 years0.60.1810.12-0.19
IVN/A0.90.0850.06-0.10
As stated above, there are almost no data for estimating the rates of progression from Stage I through Stage IV cervical carcinoma. There is also no way to determine the likelihood of developing symptoms. Since the distribution of cases by stage in an unscreened population should be a function of the progression rate and the likelihood of symptoms (since cases would only be detected upon presentation with symptoms), we have adopted the approach taken by others (Eddy, 1990; Fahs, Mandelblatt, Schechter et al., 1992) and adjusted these estimates so that the proportion of cases represented by each stage is similar to that seen in cervical cancer patients who have never been screened (Bearman, MacMillan, and Creasman, 1987; Janerich, Hadjimichael, Schwartz et al., 1995; Pretorius, Semrad, Watring et al., 1991; Schwartz, Hadjimichael, Lowell et al., 1996). Our base case estimates are described in Table 6.

Stage-Specific Treatment Probabilities

Table 7. Stage-specific Treatment Probabilities
TreatmentStage IStage IIStage IIIStage IV
Simple hysterectomy0.1460.0060.0040.003
Radical hysterectomy0.3550.0430.0130.009
Radiation therapy0.2040.6470.5370.398
Radiation therapy and surgery0.1460.0720.0460.017
Radiation therapy and chemotherapy0.0220.1610.3200.307
Chemotherapy only0.0020.0040.0040.103
Other therapy0.0380.0280.0270.020
No Therapy0.0870.0390.0490.143
The proportion of women in each stage of cervical cancer who received a specific treatment was estimated using 1990 data from the American College of Surgeons' Patient Care Evaluation (PCE) study for cervical cancer (A. Fremgen, personal communication to E. Myers, 1998; Jones, Shingleton, Russell et al., 1995; Russell, Shingleton, Jones et al., 1996). Table 7 shows the stage-specific data. Of note, 21.8 percent of the patients in this dataset had an unknown stage. Of those, almost one-half were treated with either a simple or radical hysterectomy, suggesting that most were earlier stage disease.

These proportions were used to calculate the number of specified procedures resulting from each diagnosed case of a specific stage of cervical cancer predicted by the model. These proportions were also used to generate estimates of stage-specific treatment costs (see below).

Stage-Specific Survival

Survival probabilities at 1, 2, 3, 4, and 5 years postdiagnosis for each stage were obtained from PCE data (Fremgen, personal communication; Jones, Shingleton, Russell et al., 1995). These values were chosen because they represent data from a wide range of facilities treating women with cervical cancer. Five-year survival rates based on these data were Stage I, 86.0 percent; Stage II, 62.5 percent; Stage III, 37.9 percent; and Stage IV, 11.3 percent.

We assumed that there was no cancer-related mortality after 5 years. Although the PCE data show some deaths after 5 years in all stages, they are relatively rare compared with those in the first 5 years. Other models have also used 5-year survivals. The PCE data are disease-specific; patients are also at risk for other causes of death during the 5-year postdiagnosis period.

Non-Cervical Cancer Mortality

Mortality from causes other than cervical cancer was estimated by subtracting age-specific cervical cancer mortality rates from age-specific all-cause mortality rates using U.S. life tables for 1995 (obtained from the National Center for Health Statistics World Wide Web page: http://www.cdc.gov/nchswww/daa/gm291_1.pdf).

Parameter Estimates from Available Data: Test Characteristics

We derived our estimates for conventional Pap sensitivity and specificity from the systematic literature review and meta-analysis. For sensitivity and specificity of the new technologies, we used articles reviewed under the revised criteria, the review of Brown and Garber (1998), and estimates provided by the manufacturers. The derivation of these estimates is described in greater detail below.

Conventional Pap Smear Test Characteristics

For conventional Pap smears, we used estimates of sensitivity and specificity from the meta-analysis for a cytological threshold of ASCUS and a histological diagnosis of LSIL or higher. For the base case, we used an estimate derived from the three studies not affected by verification bias (Baldauf, Dreyfus, Lehmann et al., 1995; Davison and Marty, 1994; Hockstad, 1992). These values have a sensitivity of 51 percent and a specificity of 97 percent. We varied this level up to the mean values of the test characteristics from the meta-analysis, 78.8 percent for sensitivity and 88.8 percent for specificity. However, these estimates, like those used in all previous models, assume that there are no joint effects for sensitivity and specificity, an assumption that is not confirmed by the meta-analysis.

The prevalence of underlying cervical neoplasia was a significant predictor of test sensitivity in both the meta-analysis of Fahey, Irwig, and Macaskill (1995) and in ours. We used sensitivity estimates based on fairly low prevalence estimates for the base case. We also did not account for the effect of increasing the screening interval on Pap sensitivity: Because prolonging the screening interval should increase the prevalence of the disease, Pap test performance should be better at longer screening intervals compared with shorter. We made no adjustment for this bias against longer screening intervals.

We assumed in the base case that the combination of history, pelvic examination, and Pap have a sensitivity of 100 percent in the diagnosis of Stage II, III, or IV cervical cancer (cancer that has spread beyond the cervix). Because, by definition, these tumors involve the vagina and/or parametrial tissues, the combination of a speculum and bimanual examination and cervical cytology should, in theory, detect all disease. This is especially true because cervical cancer is clinically staged. Therefore, the failure to detect malignancy cannot be attributed solely or, arguably, even primarily to a failure of cytology. We used a sensitivity of 85 percent as the lower bound.

Adequacy of Specimens

The adequacy of a cervical slide for interpretation can be affected by the presence of inflammatory cells, bloods, vaginal or cervical secretions, the number or type of cells collected, and other factors. The Bethesda System requires a statement of adequacy by the pathologist, along with the reason for any inadequacy. One category used is "Satisfactory but limited by" (SBLB)," which states the reason the interpretation of the slide may be compromised. The American College of Obstetricians and Gynecologists recommends that the decision to repeat a smear in such cases be made at the discretion of the individual practitioner.

A technology that improves the adequacy of the specimen may be superior to conventional techniques in two ways. First, the sensitivity of the smear may be enhanced because the slide may be easier to read and interpret. Second, the number of inadequate slides, and subsequent need for repeating the smear, may be reduced. New thin-layer technologies may produce both types of improvement. For example, in a large study of ThinPrep®, the number of "Satisfactory but limited by" smears was reduced from 27.8 percent with conventional smears to 19.8 percent (Lee, Ashfaq, Birdsong et al., 1997). However, of note, the number of smears that were inadequate because of a lack of endocervical cells was higher in the ThinPrep® group in this same study (15.8 percent versus 9.4 percent for conventional smears). Many clinicians would argue that the absence of endocervical cells is of potentially greater significance than "Satisfactory but limited by" smears, since it may indicate that sampling of the transformation zone was inadequate or that lesions were missed in the endocervical canal. Although this finding may be a result of technical factors in the design of the study, the main issue is that the clinical significance of an "unsatisfactory" slide is widely variable.

Although we do not specifically address the issue of specimen adequacy in our model, we do address its consequences. First, we assume that the primary screening technology has improved sensitivity compared with conventional Pap. Second, reducing the number of "Satisfactory but limited by" smears may reduce costs by reducing the number of repeat smears. However, we are unaware of any data on the actual proportion of "Satisfactory but limited by" smears that are repeated in clinical practice. This will probably vary widely, depending on patient populations, perception of malpractice risk, reimbursement issues, etc. Previously published models of cervical cancer screening suggest that the risk of an undetected premalignant or malignant lesion in a patient with a history of regular normal smears is very small. It is likely that the number of repeat smears actually saved by reducing the "Satisfactory but limited by" category is less than the full 10 percent reported by Lee, Asfaq, Birdsong et al. (1997). There are several reasons for this. First, the compliance rate with followup for Pap smears that are truly abnormal is only 30-70 percent (Paskett and Rimer, 1995). Second, the most compliant patients are likely to be those at lowest risk. Third, ACOG has left the decision to repeat the smear up to the individual practitioner. A small pilot study by Paskett, Carter, Chu et al (1990) found that patient perception of the accuracy and severity of an abnormal report was a significant predictor of the likelihood of returning for followup. Even in the best case scenario, where 100 percent of providers would order a repeat smear for a "Satisfactory but limited by" result, the actual reduction in repeat smears would not be 0.08 (0.278-0.198) but instead would be between 0.027 (0.3 X 0.08) and 0.063 (0.7 X 0.08). In any event, the effect on the need for repeat specimens is primarily a cost issue, which we address by decreasing the extra cost of the new technology in sensitivity analysis.

New Screening Technologies

Because our estimates of conventional Pap smear sensitivity and specificity are based on a histological gold standard, there are limited data that allow direct comparison of the new technologies with conventional Pap smears. We have modeled the potential impact of these technologies on the cost-effectiveness of various screening strategies by assigning two separate sensitivity and specificity estimates, one for initial screening and one for rescreening. We assumed that 10 percent of all slides initially read as normal using either a conventional Pap or a hypothetical new technology will be rescreened. We also assumed that the new technology improves the sensitivity of the initial screening step with no decrement in specificity. We incorporated the cost of rescreening 10 percent of slides into the global cost estimate for Pap tests and further assumed that, with either of these procedures, the sensitivity and specificity of the rescreening step is identical to the initial step. We also tested the impact of 100 percent rescreening using an automated rescreening device, again with improved sensitivity and no decrement in specificity. Because of significant uncertainty about the actual changes in sensitivity and specificity of the new technologies, we have modeled their test characteristics as functions of the sensitivity and specificity of the conventional Pap smear.

Calculating Improvement in Test Characteristics

We approached the problem of calculating improvement in test characteristics in two ways:

  • 1

    As part of the reevaluation of new technology studies, we calculated relative true positive rates for studies where relative performance could be estimated. The relative TPR is the ratio of the true positive rate of a new technology to the true positive rate of the conventional test (TPRnt/TPRpap). For the studies of Roberts, Gurley, Thurloe et al. (1997) and Bolick and Hellman (1998), we calculated a relative TPR for ThinPrep® of 1.16, with confidence limits from 0.9 to 1.4.

  • 2

    From the review of Brown and Garber (1998), we calculated the reduction in false negative rate based on their base case estimates of sensitivity and specificity for conventional Pap smears and the new technologies. Sensitivity of the new technology was related to the sensitivity of conventional Pap smear as a function of the parameter x using the following formula:

Sensitivity of New Technology = Sensitivity of Conventional Pap +
[x * (1-Sensitivity of Conventional Pap)]
(equation a).

This variable, x, is useful for modeling probabilities. We refer to this variable in the Results section as the reduction in false negative rate. For example, if x = 0.4, then there is a 40 percent reduction in false negative rate.

Note that since TPR is the same as test sensitivity, x is algebraically related to relative TPR and sensitivity of conventional Pap smear. That is,

x = [Sensitivity of Conventional Pap * (relative TPR -1)] / (1-Sensitivity of Conventional Pap)
(equation b).

For example, Bolick and Hellman (1998) calculated a sensitivity of 0.94 for ThinPrep® and 0.85 for conventional Pap. The relative true positive rate would be 1.11 (0.94/0.85). On the basis of equation a, these numbers correspond to a reduction in FNR (x) of 0.6, since

0.94 = 0.85 + [x * (1- 0.85)],

so,

x = (0.94 - 0.85) / 0.15 = 0.6.

From equation b, the reduction in FNR would also be 0.6, since

[0.85 * (1.11-1)]/(1 - 0.85) = 0.6.

Sensitivity Estimates for Initial Screening Step

Table 8. Calculations of Reduction in False Negative Rate for Initial Screening Technologies
SourceThin Prep® SensitivityConventional Pap SensitivityCalculated Reduction in False Negative Rate
Roberts, Gurley, Thurloe et al (1997)Relative TPR 1.13NA0.91 (assumes same sensitivity and specificity as Bolick and Hellman, 1998)
Bolick and Hellman, (1998)0.940.850.6
Brown and Garber (1998)0.910.80.55
One way to improve Pap test sensitivity is to improve the sensitivity of the initial screening step, either through reducing potential sampling errors and interpretive errors, such as monolayer preparations, or through interpretive errors only, such as computerized technologies. We modeled an improvement in sensitivity by using the reduction in false negative rate (see Figures 3 - 5). As mentioned previously, we assumed that 10 percent of normal smears would be rescreened at the same sensitivity and specificity as primary screening. As a starting point, we used data from studies of ThinPrep® included in our meta-analysis as well as estimates of reduction in FNR used by Brown and Garber (1998) (Table 8).

We therefore use 0.6 as our base case reduction in FNR for the initial primary step. For sensitivity analysis, we varied the estimate of reduction in FNR from 0.4 to 0.9. This range is consistent with the "50 percent increase in detection of disease" cited by the manufacturer (R.Silverman, personal communication to D.C. McCrory, 1998 ).

Because these values are calculated relative to the sensitivity of the conventional Pap testing using adjudicated review, they should, in theory, be independent of the actual sensitivity of the conventional Pap testing. In fact, this argument is the justification for not requiring histological confirmation studies for FDA approval of these technologies. Therefore, in our model, the base case sensitivity of the initial screening technology will be

0.51 + (0.6 * 0.49) = 0.804.

We assume that 10 percent of slides will be rescreened at the same improved sensitivity, for an overall sensitivity of

0.51 + (0.6 * 0.49) + (0.1 * 0.6 * 0.49) = 0.833.

It is important to note that these estimates do not distinguish test sensitivity for screening and rescreening, nor do they differentiate sampling error and interpretive error. Test performance is likely to be different in a rescreening application, since the population subjected to rescreening is different from the primary screening population. For practical purposes, if sensitivity is improved sufficiently to make a new screening technology cost-effective, then it should not matter if that differential is due to technological superiority or difference in the type of error that is reduced. However, to examine the impact of this difference, we model differential sensitivities between initial and rescreening testing. Regarding different types of test errors, sampling errors may result from either poor technique on the part of the person obtaining the smear, or placement of cells on the slide which may not be representative of the potential areas of abnormality. Determining the relative effect of errors in technique (inadequate visualization of the cervix, inadequate sampling of the transformation zone, failure to clear off blood and mucous prior to obtaining the smear) versus errors in cell sampling from the collection device itself would require more histological confirmation than is currently available. Again, to permit examination of this issue, we model the effect of changes in differential sensitivities between initial and rescreening technologies.

Sensitivity Estimates for Rescreening Step

We also examined the potential impact of technologies that improve the sensitivity of the rescreening step. We assumed that the sensitivity of the primary screening step was that of the conventional Pap and that 100 percent of smears read as normal would be rescreened at the improved sensitivity and specificity (see Figure 5). We used a similar approach to the one described above.

For rescreening technologies, the overall improvement in sensitivity can be calculated as either

Sensitivity Conventional Pap + {1.0 * [Sensitivity Conventional Pap +x (1-Sensitivity
Conventional Pap)] * (1-Sensitivity Conventional Pap)}

or as

Sensitivity Conventional Pap + [1.0 * Sensitivity Rescreening Device * (1-Sensitivity
Conventional Pap)].

Table 9. Calculation of Reduction in False Negative Rate for Rescreening Technologies
SourceRescreening SensitivityCalculated Reduction in False Negative Rate
AutoPap® (Rescreening Only)
Stevens, Milne, James et al. (1997)0.430.43
Patten, Lee, Wilbur et al. (1997a)0.510.51
Colgan, Patten, and Lee (1995)0.6630.663
Brown and Garber (1998)0.770.77
Papnet®
Kaufman, Schreiber, and Carter (1998)0.38-0.410.38-0.41
Jenny, Isenegger, Boon et al. (1997)0.890.89
Brown and Garber (1998)0.880.88
Table 9 shows the sources and calculated reduction in false negative rate used to derive our estimated range for rescreening technologies.

Again, we chose 0.6 as the base case for the rescreening technology and varied it between 0.4 and 0.9. The base case estimate was chosen because it was in the mid-range of estimates. By beginning at identical values, identification of thresholds between the two types of technologies was simplified.

Specificity of New Technologies

Our base case assumes that the new technologies do not affect the specificity of Pap screening. We varied the relative specificity of the technologies from 0.9 to 1.0. However, since these are relative sensitivities and specificity, one would expect that the actual test characteristics of the new technologies will vary depending on the "true" sensitivity and specificity of conventional Pap testing when a histological gold standard is used. Thus, the lower the "true" conventional sensitivity, the lower the "true" sensitivity of the new technology.

Another issue is the relative contribution of sampling errors (failure to adequately collect abnormal cells) and interpretation errors (overlooking or falsely classifying abnormal cells on a smear) to the overall sensitivity of the test. There are several considerations. First, a certain unknown percentage of sampling errors will be attributable to the provider obtaining the smear; changes in Pap technology will not improve the ability of a provider to adequately visualize, clean, and sample the cervix. Second, although sampling errors may be more common, the relative impact on overall test sensitivity of different technologies would need to be demonstrated by direct comparison of smears obtained from conventional Paps, monolayers, and selected by rescreening devices, preferably with histological confirmation. It is difficult to see how the reduction in sampling error for a given technology can be determined without histological, or at least colposcopic, confirmation. We therefore did not consider sampling versus interpretive errors in the base case analysis. However, because it is possible that a differential impact on false negatives due to sampling or interpretation could have an impact on the relative cost-effectiveness of different technologies, we tested this in sensitivity analysis. We divided the false negative rate, 1-Sensitivity Conventional Pap, into two components as follows:

False Negative Rate =
[Relative Contribution Sampling Error * (1-Sensitivity)] +
[Relative Contribution Interpretive Error * (1-Sensitivity)].

We assumed in this sensitivity analysis that a rescreening technology would not affect false negatives attributable to sampling, but that a primary screening technology such as ThinPrep® could potentially reduce both types of errors. Again, the relative contributions of sampling and interpretive errors on the relative sensitivities of different technologies can ultimately only be answered by direct comparison of the technologies, preferably with colposcopic and histological confirmation.

Parameter Estimates from Available Data: Cost

Cost estimates for cervical cancer screening and for the diagnosis and treatment of LSIL, HSIL, and invasive cervical cancer were derived primarily from the review of MEDSTAT data conducted by Health Economics Research, Inc.

Costs of Screening

Table 10. Costs of Screening Tests
TestBase CaseRangeMedicare Cost
Conventional Pap, 20-64$38.68$35.32-43.7
Conventional Pap, 65+$47.73$44.24-56.98$35.01 (range: $34.55-36.92)
New technology which improves primary screening sensitivityIncremental cost: $10.00$5.00-15.00Not available
New technology which improves rescreening sensitivityIncremental cost: $10.00$5.00-15.00Not available
Estimates for the incremental cost per slide for conventional Pap smears, Paps smears obtained using ThinPrep® media, and Paps with 100 percent rescreening with AutoPap® or Papnet® were assigned on the basis of data from Health Economics Research, Inc., and estimates made by Brown and Garber (1998), Schechter (1996), Radensky and Mango (1998), peer reviewers of the draft evidence report, and the manufacturers. These costs are summarized in Table 10.

Costs of Diagnostic Evaluation for LSIL, HSIL, and Invasive Cancer

Diagnostic costs were estimated in two ways. First, the mean cost for diagnosis and treatment of LSIL, HSIL, and Stages I, II, III, and IV cervical cancer from MEDSTAT data was used to assign a global diagnosis and treatment cost to each level of pathology. Sensitivity analyses were run using the 25th and 75th percentiles as the minimum and maximum values. The MEDSTAT dataset did not contain information specific to FIGO stage, only World Health Organization (WHO) stage (Local, Regional, Distant). We assumed that "local" disease equaled Stage I, "regional" equaled Stage II or III, and "distant" equaled Stage IV.

Table 11. Procedure-specific Costs for Diagnostic and Treatment Procedures
Diagnostic ProcedureBase CaseRangeMedicare Cost
Colposcopy + biopsy + endocervical curettage$375$244-477$244
Colposcopy + biopsy only$276.57$195-351$172.63
LEEP$564$352-710$205
Cone biopsy$919$623-1114$409
The model was also evaluated using procedure-specific costs for diagnostic and treatment procedures. Means and 25th and 75th percentiles from the MEDSTAT data were used to establish base estimates and ranges for colposcopy and biopsy, LEEP, and cone biopsy (Table 11).

Table 12. Proportion of Patients in Each Stage Undergoing Specified Diagnostic Procedures
ProcedureAverage CostStage IStage IIStage IIIStage IV
Examination under anesthesia$2010.4140.4580.4620.462
Cystoscopy$9820.2010.3880.4540.454
Proctoscopy$7740.1520.2810.3160.316
Chest X-ray$430.7270.7810.7790.779
Intravenous pyelogram$3390.3130.3270.3110.311
Barium enema$1080.1360.1990.2130.213
Bone scan$2360.0450.1120.1360.136
CT scan$3120.4080.6960.7680.768
MRI scan$5630.0460.0670.0710.071
LymphangiogramNA0.0130.0320.0390.039
Fine needle biopsyNA0.0170.0350.0420.042

NA = not available.

The diagnostic costs associated with each stage of cervical cancer were estimated by calculating a weighted average cost using the proportion of patients within each stage who underwent a given diagnostic procedure in 1990 (Russell, Shingleton, Jones et al., 1996) and procedure-specific MEDSTAT costs (Table 12).

Table 13. Stage-specific Cost Estimates of a Diagnostic Workup
Cervical Cancer StageBase CaseRangeMedicare Costs
I$714$274-1042$348
II$1138$429-1642$520
III$1257$470-1837$563
IV$1257$470-1837$563
The base case estimates (Table 13) were derived using the average MEDSTAT costs, with lower and upper estimates calculated using the 25th and 75th percentiles. Similar estimates were made based on Medicare costs.

Because MEDSTAT data did not include professional charges for anesthesia for pelvic examination under anesthesia, or charges for lymphangiography or fine needle biopsy, the diagnostic costs are likely to underestimate the true average costs of diagnosis of cervical cancer to some extent.

Costs of Treatment

Table 14. Global Costs of Diagnosis and Treatment
Pathological FindingsBase CaseRange
LSIL$1728$675-2274
HSIL$3049$1384-4203
Stage I$17,645$9,439-21,946
Stage II-III$27,069$11,734-29,089
Stage IV$40,280$12,670-46,509
Like diagnostic costs, treatment costs were estimated in two ways. Global diagnostic and treatment costs estimated from MEDSTAT data (Table 14) were one source.

Table 15. Procedure-specific Costs
Therapeutic ProcedureBase CaseRangeMedicare Costs
LEEP$564$352-710$205
Cryotherapy$156$108-184$111
Laser ablation$730$390-976$205
Procedure-specific costs were also calculated and used to generate a second set of estimates (Table 15). Costs for procedures used in the treatment of SIL were obtained directly from MEDSTAT and Medicare data.

As noted elsewhere in the report, mean costs for cervical cancer treatment were often substantially higher than median costs, especially for more advanced procedures. This probably represents the effects of varying severity of disease within stages or of complications of treatment, which substantially increase costs. We have chosen to use mean costs for our base case estimate in order to include the economic impact of varying disease severity, comorbidities, and complications. Because of the extremely wide standard deviations resulting from the effect of outliers, we use the 25th and 75th percentiles as endpoints in sensitivity analysis.

Table 16. Number of Inpatient and Outpatient Treatment Procedures
TreatmentNumber of Inpatient ProceduresNumber of Outpatient Procedures
Simple hysterectomy only10
Radical hysterectomy only10
Surgery + radiation1 Radical hysterectomy or 1 Simple hysterectomy + 2 radiation treatments20 outpatient
Radiation + chemotherapy2 radiation treatments and 0 inpatient chemotherapy, or 6 inpatient chemotherapy6 outpatient (if no inpatient)
Chemotherapy66 (if no inpatient)
Radiation therapy220
Estimates of the proportion of patients undergoing a specified treatment in each stage were calculated from the American College of Surgeons data (see section on Stage-specific Treatment Probabilities above). Because the data were not specific about the exact radiation therapy or chemotherapy regimens used, we assumed that patients received standard regimens of each, which are described in Table 16 (A. Berchuk and D.L. Clarke-Pearson, personal communication to E. Myers, 1998; DiSaia and Creasman, 1997).

The cost estimates resulting from these assumptions about treatment regimens for each stage were then multiplied by the proportion of patients in each stage receiving each treatment in the American College of Surgeons PCE dataset to arrive at the following weighted averages for treatment of cervical cancer (Tables 17 and 18):

Table 18. Procedure-specific Costs for Diagnosis and Treatment Compared with MEDSTAT Global Estimates
StageProcedure-Specific EstimatesMEDSTAT Global Estimate
DiagnosisTreatmentTotal
I$714$12,243$12,957$17,645
II$1138$15,244$16,382$27,069
III$1257$15,244$16,501$27,069
IV$1257$17,431$18,688$40,280
Table 17. Stage-specific Costs for Treatment (based on MEDSTAT procedure-specific cost data, standard treatment regimens, and distribution of treatments in American College of Surgeons dataset)
StageBase CaseRangeMedicare
I$12,243$7320-15185$7724
II$15,244$7196-21,660$10,105
III$15,244$7526-25,144$10,159
IV$17,431$9296-34,365$8,001
The mean MEDSTAT global estimates (see Table 18) are substantially higher than the estimates derived by applying procedure-specific MEDSTAT costs to the distribution of treatments reported in the American College of Cancer data set (see Table 17). This may reflect differences in patient populations, professional charges, changes in practice patterns between 1990 and 1994, or additional costs, such as antiemetic treatment for chemotherapy patients, or followup outpatient clinic visits, not captured using procedure-specific costs. Again, to bias the model in favor of more sensitive screening strategies, we used the global costs for the base case estimate. In the base case, we also did not use the lower Medicare estimates for patients over 65, again to deliberately bias the results in favor of more sensitive screening strategies.

Costs of Terminal Care

The MEDSTAT data do not allow direct attribution of a cost associated with terminal care for cervical cancer patients. Eddy (1990) estimated a cost of $22,150 in 1989 dollars, based on 1987 Medicare data. The MEDSTAT data do report a category of "All stages (final diagnosis Stage 4)," with an average incremental cost of $16,530 above the average cost for Stage IV cancers. It is unclear whether these represent terminal care, recurrent cases, or progressive cases, since stage, once assigned, should not change.

Eddy's cost, inflated to 1997 dollars, is $34,805. This figure represents charges. Since we use a 0.5 cost/charge ratio, the MEDSTAT costs above represent charges of $33,060. Because of the similarity of these figures, we assume that the cost of terminal care for cervical cancer is $16,530 and assign this cost to all patients dying of cervical cancer.

Calibration Against Epidemiological Data

Epidemiological data on the incidence and prevalence of cervical cancer and its precursors (LSIL, HSIL, and HPV infection) were used to calibrate the cost-effectiveness model.

Converting reported incidence, regression, persistence, and progression rates for HPV and SIL to reliable transition probabilities for the model was difficult because of the way most of these values are reported in the literature. Regression and progression rates are usually reported as percentages of either prevalent or incident infections, but followup time varied widely among studies. Data are rarely provided in a way that easily allows conversion of these rates to yearly transition probabilities (Miller and Homan, 1994). In addition, very few data are available that allow calculation of transition probabilities for untreated cervical cancer, such as the annual stage-specific probability of symptom development or the annual probability of progression between stages. Estimates for these probabilities were derived from the epidemiological literature and prior published models of cervical cancer screening, especially those of Eddy (1990) and Muller, Mandelblatt, Schechter et al. (1990).

An external file that holds a picture, illustration, etc., usually as some form of binary object. The name of referred object is f9_F10.jpg.

   Figure 10. Predicted age-specific incidence of cervical cancer

The model was calibrated by adjusting natural history parameters to recreate the age-specific incidence of cervical cancer in an unscreened population (Figure 10) (Gustafsson, Ponten, Bergstrom et al., 1997). Data from several unscreened populations show a striking similarity in the pattern of age-specific incidence (Gustafsson and Adami, 1992 ; Gustafsson, Ponten, Bergstrom et al., 1997; Gustafsson, Ponten, Zack et al., 1997; Parkin, 1997)

Gustafsson, Ponten, Bergstrom et al. (1997) described two separate curves, one with a peak between ages 40 and 50 with a more rapid decline, and one similar to the U.S. data with a peak between ages 50 and 65 and a more gradual decline. Their modeling suggests that some of the difference results from increased incidence in successive birth cohorts. The early peak curves primarily were observed in western European countries with data collected predominantly between 1950 and 1975. Conversely, the U.S. data comes from Connecticut between 1935 and 1949. It is likely that early sexual activity, and therefore exposure to HPV, was more frequent in western Europe and Scandinavia during the 1950s and 1960s than in 1940s Connecticut. In addition, there are likely to be differences in access to care, or willingness to seek care for symptoms, across time and cultures.

An external file that holds a picture, illustration, etc., usually as some form of binary object. The name of referred object is f9_F11.jpg.

   Figure 11. Change in peak HPV prevalence

An external file that holds a picture, illustration, etc., usually as some form of binary object. The name of referred object is f9_F12.jpg.

   Figure 12. Effect of peak age of HPV on cervical cancer incidence

An external file that holds a picture, illustration, etc., usually as some form of binary object. The name of referred object is f9_F013.jpg.

   Figure 13. Effect of age of peak HPV incidence and delay in seeking medical attention on cancer incidence

We tested the impact of a later age of onset of sexual activity, and thus a later age of peak HPV incidence, on the age-specific incidence of cervical cancer, holding all other variables constant. Figure 11 shows the differences in age-specific HPV incidence between the base case model and one in which peak HPV incidence is shifted forward to ages 25-30, and Figure 12 shows the resulting age-specific cervical cancer incidences. Later onset of HPV shifts the cervical cancer curve to the right, as expected, although the age of peak incidence is similar. We then decreased the annual probability of presenting with symptoms for Stage I from 15 percent to 1 percent, and for Stage II from 22.5 percent to 10 percent, with the resulting later age of peak incidence (Figure 13). The later peak incidence of cervical cancer seen in older unscreened cohorts is consistent with both later onset of sexual activity (and HPV infection) and delay in seeking medical attention for symptoms (or in access to care). Other combinations of variables, such as regression and progression rates for HPV and SIL, are also likely to affect the shape of the cancer incidence curve.

We elected to use values resulting in an earlier onset for the base case scenario. Prevention of fatal disease occurring at younger ages results in greater increases in life expectancy compared with prevention of disease at older ages; thus, our base case model is biased in favor of screening strategies that have increased sensitivity.

The peak incidence of cervical cancer used in the model (81/100,000) is somewhat higher than that observed in an unscreened U.S. population in the 1930s (approximately 60/100,000) but lower than that seen in developing nations (Gustafsson, Ponten, Bergstrom et al., 1997). Because there have been significant increases in sexual activity in adolescents over time (Centers for Disease Control and Prevention, 1991), it seems likely that the actual expected rate in an unscreened population would be higher than the rate observed 60 years ago, especially since mortality from other causes, particularly in younger women, has declined. This higher incidence estimate is consistent with our choice of other base case estimates for the model, in that we have tried to bias the model in favor of screening strategies with increased sensitivity compared with conventional Pap smears at longer screening intervals.

In an unscreened population, age-specific incidence of cervical cancer will be dependent both on the rate of progression of the disease and the stage-specific probability of developing symptoms that lead to diagnosis. Very few data are available to validate parameter estimates for rates of progression between stages of cancer or the probability of developing symptoms in any given stage. Therefore, these parameters were adjusted to result in a distribution of cases by stage that approximates that seen in patients without Pap smear screening (Bearman, MacMillan, and Creasman, 1987; Eddy, 1990). These proportions are also similar to those observed in larger series of cervical cancer patients (Jones, Shingleton, Russell et al., 1995; Pretorius, Semrad, Watring et al., 1991), presumably because at least 30-40 percent of cervical cancer patients have never been screened or have not been screened within the past 5 years.

Role of Report Partner

The American College of Obstetricians and Gynecologists formed the "private" half of the "public-private partnership" fostered by AHCPR through the Evidence-based Practice Centers initiative. ACOG played a significant role in this study by nominating evaluation of cervical cytology as a task order topic for funding through the EPC program. Dr. Stanley Zinberg, ACOG's Vice President of Practice Activities, is a member of the study's Advisory Panel of Technical Experts and Peer Review Panel. Through Dr. Zinberg's participation on these panels, ACOG was involved at key decision stages of the study, providing consultation on the key literature search questions and reviewing the first drafts of the evidence tables, the plans for the cost-effectiveness analysis and meta-analysis, and the draft evidence report. ACOG also assisted in writing a dissemination plan for this evidence report for AHCPR.

Quality Control

Internal and external quality control mechanisms were incorporated into each phase of the development of the evidence report.

Internal Quality Control Mechanisms

Quality control procedures were integrated into each step of the literature review. The search strategies used were checked and finalized by two professional medical librarians. For additional checks on completeness, the articles cited in reference lists of reviewed articles, particularly meta-analyses, were compared with those in the Pro-Cite literature database developed for this evidence report, and when appropriate, articles were copied, reviewed, and added to the Pro-Cite database. Further, the manufacturers of AutoPap®, Papnet®, and ThinPrep® were asked to submit articles and other documents that met the specified inclusion criteria for the evidence report.

With regard to the content of the evidence tables and data analyses, quality control procedures included: screening of all articles by at least two clinicians, abstracting of information into the evidence report by two clinicians, and independent construction of each 2-by-2 table by two clinicians and over-reading by a third. No data or other information was inserted into the evidence table or data analyses without absolute agreement by two, and often three, clinicians. Further, once the content was agreed upon by the clinicians, the information for the evidence table and data for the meta-analysis and cost-effectiveness analysis were double-entered and compared, and any discrepancies reconciled. Where appropriate, kappa statistics were calculated to show the strength of agreement between clinician-reviewers, particularly in the article screening stage and the construction of the 2X2 tables. When the strength of agreement was low, all reviewers met to discuss the nature of the discrepancies and to come to common agreement on the interpretation of the criteria and their application to the articles as well as agreeing on responses to articles that were in a "gray" area. In the latter case, the decision was to err on the side of including the article, recognizing that the data abstraction process and the development of the 2X2 tables would further illuminate the inclusion/exclusion status of the article.

Quality control in the cost-effectiveness analysis was provided by extensive sensitivity analyses, by validation with epidemiological data, and by comparisons with previously published cost-effectiveness models.

External Quality Control Mechanisms

Two external oversight and review panels were established. The Advisory Panel of Technical Experts was initiated early in the 11-month time frame for the study. The Advisory Panel of Technical Experts reviewed progress on the evidence report at four key stages of its development: the identification of the key literature search questions, the first drafts of the evidence tables, the plans for the cost-effectiveness analysis and meta-analysis, and the draft of the evidence report. The Technical Experts were sent draft documents that were discussed as a group in two conference calls and also in individual phone calls and written communications. The eight-member Panel consisted of clinical and methodological experts in relevant specialty areas, including clinical and laboratory pathology, obstetrics/gynecology, family practice, oncology, women's health, meta-analysis, cost-effectiveness analysis, and epidemiology. Consumers were represented on the Advisory Board of Technical Experts by Family Health International, a not-for-profit organization committed to improving the health of women and children; helping women and men have access to safe, effective, acceptable, and affordable family planning methods; and preventing the spread of AIDS and other sexually transmitted diseases (STDs).

The primary function of the second group, the Peer Review Panel, was to review and comment on the complete draft of the evidence report. The 23-member panel, which included AHCPR staff and the Advisory Panel of Technical Experts, consisted of clinical and methodological experts in relevant specialty areas, including clinical and laboratory pathology, obstetrics/gynecology, family practice, oncology, women's health, meta-analysis, cost-effectiveness analysis, and epidemiology, as well as the manufacturers of the Pap screening devices reviewed. The report was modified on the basis of their comments, with close attention to relevant studies not included in the draft report, misinterpretation of findings, and other issues deserving revision within the constraints of the methodology, time frame, and budget. The format for the report was designed by AHCPR.

Results

This section describes results on the diagnostic accuracy of cervical cytology screening tests, the meta-analysis of Pap test accuracy, and cost estimates of screening, diagnosis, and treatment of cervical cancer and its precursors. A description is also provided of the review of studies and models of effectiveness and cost-effectiveness of cervical cancer screening. Finally, results are provided from the cost-effectiveness analysis.

Results: Diagnostic Accuracy of Cervical Cytology Screening Tests

Table 19. Distribution of Articles in Literature Screening and Abstraction Process
Total Number of Articles939
Articles eliminated in initial screening561
Articles eliminated in second screening292
Articles included and abstracted86
Of the 939 citations identified by the literature search, approximately 60 percent were eliminated at screening Step 1 either because they did not give results on Pap smear screening for cervical cancer or precursors, or because the Pap smears were not compared with a reference standard. In Step 2, another 31 percent of citations were excluded because cytological screening was compared with a reference standard other than histology or colposcopy (e.g., cytology), the screening test and reference standard were not reasonably concurrent (i.e., more than 3 months apart), or the study failed to report data on both sensitivity and specificity (i.e., a 2-by-2 table could not be completed) (Table 19).

New Technologies

Because very few studies of the new technologies met the original Step 1 and Step 2 screening criteria (no studies of AutoPap®, one study of Papnet®, and one study of ThinPrep®), we modified the criteria to include studies pertaining to any of the new technologies that used a cytological reference standard if applied by an independent panel and if at least half of high-grade cytological results were verified histologically, as suggested in guidelines produced by the Intersociety Working Group for Cytology Technologies (1998). We further modified the screening criteria to include studies that failed to verify diagnosis in patients negative on two independent tests being compared. Though such studies do not estimate sensitivity and specificity, they can still provide estimates of relative TPR and relative FPR, as described by Chock, Irwig, Berry et al. (1997).

We considered a total of 59 studies (12 on AutoPap®, 27 on Papnet® and 20 on ThinPrep®) during this final stage of the screening process (Step 3). Forty-three of these articles had initially been excluded in Step 2, and 16 were brought to our attention at this time by reviewers or manufacturers (7 unpublished manuscripts and 2 published only in abstract form). Ultimately, we decided to describe in the evidence table and the text of this report all the studies considered for Step 3 that also met the criteria requiring the application of a concurrent reference standard (histology, colposcopy, or cytology). The net result was the inclusion of 6 studies of AutoPap®, 11 of Papnet®, and 8 of ThinPrep®.

Six studies of either AutoPap 300 QC® or the AutoPap Primary Screening System® permitted estimates of sensitivity. Little information was available on which to base an estimate of the specificity of rescreening. In three studies of the performance of AutoPap 300 QC® in slides manually screened as negative, the estimated sensitivity for detecting ASCUS or worse at the 20 percent review rate was 0.43 (Stevens, Milne, James et al., 1997), 0.51 (Patten, Lee, Wilbur et al., 1997a), and 0.663 (Colgan, Patten, and Lee, 1995). The estimated sensitivity for detecting LSIL or worse at the 20 percent review rate was 0.66 and 0.77 in the latter two studies. Studies suggested that sensitivity at the 20 percent review rate for ASCUS or worse was higher when these technologies were used for screening than when they were used for rescreening: 0.86 (Wilbur, Bonfiglio, Rutkowski et al., 1996) for the AutoPap 300 QC®, and 0.86 (Wilbur, Prey, Miller et al., 1998) and 0.92 (Lee, Kuan, Oh et al., 1998) for the AutoPap Primary Screening System®. The estimated sensitivity for detecting LSIL or worse at the 20 percent review rate was 0.92 (Wilbur, Prey, Miller et al., 1998). An estimate of specificity of 0.98 was obtained for screening use of the AutoPap Primary Screening System® (Lee, Kuan, Oh et al., 1998); however, this estimate was based on initial screening rather than rescreening. Because the spectrum of disease is different in patients whose smears are initially manually screened as negative compared with patients undergoing initial screening, this specificity estimate may be inaccurate. Furthermore, the Primary Screening System uses a somewhat different algorithm for classification than does the AutoPap 300 QC® and therefore could be expected to have a different specificity.

For Papnet®, one study provided estimates of both sensitivity and specificity (Kaufman, Schreiber, and Carter, 1998). This study estimated sensitivity at 0.38-0.41 and specificity at 0.82-0.92, depending on whether the screening test threshold was ASCUS or LSIL. Other available studies, described in Evidence Table 1, provided data on the impact of Papnet® rescreening on estimated sensitivity. For slides that had initially been read as normal, studies suggested that the yield of Papnet® rescreening (that is, the number of false negatives [FNs] detected divided by the number of slides rescreened) was between 0.3 percent and 3.3 percent. One study reported a reference standard diagnosis on all slides rescreened, permitting an estimated sensitivity of the rescreening step for detecting FN slides of 0.89 (Jenny, Isenegger, Boon et al., 1997). Other studies using a histological reference standard obtained lower estimates (Mango and Radensky, 1998). Mango and Radensky (1998) provide a comprehensive review of all studies of the accuracy of Papnet® screening and rescreening, including unpublished studies reviewed by the FDA as part of the premarketing application.

Two studies of ThinPrep® permitted estimation of both sensitivity and specificity (Bolick and Hellman, 1998; Roberts, Gurley, Thurloe et al., 1997). Bolick and Hellman (1998) compared ThinPrep® Pap smear diagnoses of LSIL or higher with a histological reference standard of CIN2-3 or higher, permitting direct estimation of the sensitivity of 0.94 and specificity of 0.58. In the same report, conventionally prepared Pap smears achieved a sensitivity of 85 percent and a specificity of 36 percent according to the same thresholds. Roberts, Gurley, Thurloe et al. (1997) compared conventional and ThinPrep® slides prepared with a split sample technique. Any positive result on either test was verified either cytologically or histologically; histological verification was obtained on a majority of HSIL samples. Although sensitivity and specificity could not be directly estimated, the relative performance of ThinPrep® to conventional Pap smears could be estimated. The relative TPR was 1.13, indicating that ThinPrep® had higher sensitivity, and the relative FPR was 1.12, indicating that ThinPrep® had slightly lower specificity. Details of all these studies are provided in Evidence Table 1.

Conventional Pap Test

Table 20. Quality of Selected Studies
Quality CriterionNumber of Studies (N = 84)Percent
Reference standard
Histology7083
Histology or negative colposcopy1417
Independence of assessments Blinded Not blinded 22 62 26 74
Verification All test positives and test negatives verified Test positives and random fraction of test negatives verified Test positives and selected sample or none of test negatives verified 23 3 58 27 4 69
Study sample Consecutive or random Other 73 11 87 13
Spectrum of disease/nondisease Defined Not defined 9 75 11 89
Publication type Paper Abstract 84 0 100
Industry relationship Neither done nor supported by a manufacturer Supported by a manufacturer Done by a manufacturer 82 1 1 98 1 1
Characteristics of the 84 studies that met the inclusion criteria for the meta-analysis of conventional Pap test accuracy are displayed in Table 20. Sample sizes in these studies ranged from 14 to 22,412, with a median of 194. The majority of studies (83%) used histology as a reference standard, but only 31 percent obtained verification of all or a random fraction of test negative subjects. A quarter of the studies did not ensure independent assessment of the test and reference standard or blinding. A majority (87%) of studies selected consecutive subjects. All included studies were published in full-length reports; abstracts identified in the screening process uniformly failed to provide sufficient data to meet inclusion criteria. In contrast to the studies of the new technologies, few of these studies were performed or supported by a medical device manufacturer.

Data were available according to different combinations of test thresholds and reference standard thresholds: ASCUS/CIN1 (31 studies), LSIL/CIN1 (69 studies), LSIL/CIN2-3 (43 studies), and HSIL/CIN2-3 (45 studies). In two studies, the test and reference standard thresholds were not explicitly stated and could not be precisely inferred. Most studies permitted us to construct 2-by-2 tables using more than one combination of test and reference standard threshold.

Sensitivity estimates from these studies covered nearly the entire spectrum, ranging from 0.06 to 0.99. Similarly, specificity estimates ranged from 0.06 to 0.99. Because sensitivity and specificity are interdependent, simple statistics such as means do not provide an accurate description of the diagnostic performance of the test. We used several analytical strategies to determine the cause of the large amount of variability in sensitivity and specificity estimates and to ascertain a reasonable estimate of the performance of Pap tests in a low prevalence screened population. These analyses are described in the section on "Meta-analysis of Pap Smear Accuracy."

The setting in which most of these studies were conducted has important implications for interpreting their results and explaining the differences among them. Most of the studies were conducted in colposcopy clinics in patients who had been referred for colposcopy because of a cytological abnormality on an initial Pap smear screening. In such studies, the repeat Pap smear taken at the time of colposcopy usually served as the "test" result that was compared to the result of either colposcopy (available for all subjects) or histology (available only for women who underwent biopsy; that is, women with a negative colposcopy would be systematically excluded).

The assembly of the study sample from a population referred because of cytological abnormality on initial Pap screening can be expected to bias the spectrum of disease. Women who have a negative Pap smear at the time of colposcopy (following an initial positive smear) are likely to be different from women with a negative result on initial smear. Some have suggested that the removal of cervical epithelium with the initial Pap smear might remove enough abnormal cells that the subsequent Pap may be normal. Our analysis assumes that women with subsequent normal Pap smears had an initial false positive smear. If untrue, this assumption would lead to underestimation of Pap test accuracy.

Returning to the above example, when disease status verification is preferentially obtained in women with a colposcopically visible lesion who undergo biopsy, the study sample is further biased. This selection of subjects for verification results not only in a high prevalence of histological abnormalities in the study sample, but also in "workup" bias (Ransohoff and Feinstein, 1978). Although sensitivity and specificity are relatively invariant to prevalence, they are subject to workup bias. This bias can be expected to lead to elevated estimates of sensitivity and lowered estimates of specificity (Feinstein, 1985).

Many studies comparing conventional Pap screening with new technologies applied the reference standard (adjudicated cytology or histology) only to discrepant cases. Concordant positive and concordant negative test results are assumed to be true positives and true negatives, respectively, but may actually be concordant false results. This study design has a consistent underlying bias that can be expected to overestimate sensitivity and specificity of the new test (Miller, 1998). When the conventional and new test are conditionally dependent, as when tests may have similar problems with sample collection or interpretation of mild disease, this bias can be substantial.

Our goal was to obtain estimates of Pap test performance that are applicable to a low prevalence screened population. However, only three studies identified patients undergoing initial Pap smear screening and verified all (or a random fraction) of test negative subjects. The study of Baldauf, Dreyfus, Lehmann et al. (1995) was the largest of these, comprising 1,539 women. In this study, a 10 percent random sample of Pap negative women underwent colposcopy and biopsy to verify their disease status; this allowed for correction of any verification bias. Davison and Marty (1994) studied 200 women, and Hockstad (1992) tested 73 women; all test negative women in these two studies had their disease status verified with the reference standard test. These studies estimated sensitivity at 56 percent, 53 percent, and 29 percent, respectively, and specificity at 98 percent, 100 percent, and 97 percent, respectively.

Results: Meta-analysis of Pap Smear Accuracy

Table 21. Studies Meeting Inclusion Criteria for Meta-analysis of Conventional Pap Smear Accuracy
First AuthorYearTest ThresholdReference Standard ThresholdNumber of SubjectsTrue PositivesFalse NegativesFalse PostivesTrue NegativesPrevalence of Disease 1
Anderson1992HSILCIN2-3228609922470.7
Andrews1989ASCUSCIN13536256891460.33
LSILCIN13533187292060.33
Baldauf1995ASCUSCIN11178412570.73
HSILCIN2-311724213690.38
LSILCIN111779622100.73
LSILCIN2-311742359130.38
Baldauf1995ASCUSCIN13243527242380.19
ASCUSCIN2-3324158442560.07
Beeby1993LSILCIN11000417218802850.64
LSILCIN2-31000420513042250.47
Bigrigg1990HSILCIN2-39815671401171570.72
LSILCIN19819003431160.95
LSILCIN2-398169116240340.72
Bolick1998LSILCIN18957101480.75
Cecchini1993ASCUSCIN2-348653694090.02
LSILCIN2-348653154630.02
Chomet1987HSILCIN2-314210110.21
HSILCIN2-313014445670.45
LSILCIN1130633114220.72
LSILCIN11436140.64
LSILCIN2-3130441433390.45
LSILCIN2-31421290.21
Coibion1994LSILCIN116327966340.75
LSILCIN2-31631410191200.15
Cox1992ASCUSCIN1482113241112340.28
LSILCIN14826077273180.28
Cox1995ASCUSCIN12173020381290.23
LSILCIN1217193171600.23
Davis1981HSILCIN2-38740313130.82
LSILCIN1875725050.94
LSILCIN2-38751206100.82
Davison1994LSILCIN1196161401660.15
Del Priore1995LSILCIN1521648240.38
LSILCIN2-3525119270.12
DiBonito1993ASCUSCIN191819059476220.27
ASCUSCIN2-39188841496770.1
LSILCIN191815297296400.27
LSILCIN2-3918884937330.1
Fahim1991LSILCIN1757275481652690.43
Ferenczy1996ASCUSCIN136413056451330.51
ASCUSCIN2-336467141081750.22
Ferris1998ASCUSCIN127992887920.36
ASCUSCIN2-3279194160960.08
LSILCIN12793763261530.36
LSILCIN2-3279518581980.08
Frisch1994LSILCIN1511426380.78
Fung1997LSILCIN1301188298760.72
LSILCIN2-33019614100910.37
Garutti1994LSILCIN12003042181100.36
Germain1994LSILCIN14308040652450.28
Giles1989LSILCIN111261140370.67
LSILCIN2-311238323480.37
Giles1988LSILCIN12001715251430.16
Glenthoj1988HSILCIN2-3936421170.91
LSILCIN1937116060.94
LSILCIN2-3937015170.91
Gonzalez1996ASCUSCIN16217417240.34
LSILCIN1628135360.34
LSILCIN2-3621012490.02
Gundersen1988LSILCIN1564223270.46
LSILCIN2-356324470.09
Haddad1988HSILCIN2-31214700470.61
LSILCIN11219114970.87
LSILCIN2-312169531160.61
Hellberg1987HSILCIN2-3513110280.8
LSILCIN151405240.88
LSILCIN2-351374550.8
Helmerhorst1987HSILCIN2-313241611290.77
Herrington1995ASCUSCIN114110325670.91
ASCUSCIN11651032515220.78
HSILCIN2-3141192101010.28
HSILCIN2-3165192101250.24
LSILCIN11418840580.91
LSILCIN1165884011260.78
LSILCIN2-314135558430.28
LSILCIN2-316535564610.24
Hirschowitz1992HSILCIN2-3111761112120.78
Hockstad1992ASCUSCIN170252610.1
Johansen1979ASCUSCIN11821111117430.67
LSILCIN11821101217430.67
Jones1996ASCUSCIN122412149481444320628140.73
LSILCIN122412122104182152644940.73
Jones1992HSILCIN2-314371801180.17
LSILCIN114330528530.57
Jones1987HSILCIN2-32363722240.04
LSILCIN1236104841740.25
LSILCIN2-32365592170.04
Kaufman1997ASCUSCIN1462158167361010.7
ASCUSCIN2-346242251522430.15
HSILCIN2-34621750173780.15
Kealy1986HSILCIN2-33006311102160.25
LSILCIN13008013251820.31
LSILCIN2-3300704351910.25
Koonings1992HSILCIN2-3147391911780.39
HSILCIN2-3143342811700.43
LSILCIN1143612720350.62
LSILCIN1147621620490.53
LSILCIN2-3143481433480.43
LSILCIN2-314749933560.39
Korach1995NSNS503136100.68
NSNS9531320410.36
Korn1994HSILCIN2-38510173550.32
HSILCIN2-373450640.12
LSILCIN18542245140.78
LSILCIN17322136320.48
LSILCIN2-3851985260.32
LSILCIN2-37347221430.12
Kwikkel1986HSILCIN2-3184792126580.54
LSILCIN118414433430.8
LSILCIN2-31849917950.54
Lederer1973HSILCIN2-33073267993726700.12
Lozowski1982HSILCIN2-3155662025440.55
LSILCIN1155107820200.74
LSILCIN2-315583344250.55
MacCormac1988ASCUSCIN16680201622137240710.33
LSILCIN16680189334423942040.33
Maggi1989HSILCIN2-3142121721110.2
LSILCIN1142401243470.37
LSILCIN2-314224559540.2
Mann1993ASCUSCIN124392012130.12
Mayeaux1995ASCUSCIN142814918020790.77
HSILCIN2-34282783183000.26
LSILCIN142814418519800.77
LSILCIN2-34286545982200.26
Melnikow1997LSILCIN177158375100130.85
LSILCIN16018583075100130.98
Morrison1988ASCUSCIN111517478160.18
Morrison1992HSILCIN2-31553250.53
LSILCIN115102030.8
LSILCIN2-31580250.53
Naslund1986HSILCIN2-3361924110.58
LSILCIN1362303100.64
LSILCIN2-3362105100.58
Nyirjesy1972LSILCIN1136664413130.81
Okagaki1991HSILCIN2-3354534027136625680.17
Oyer1986LSILCIN14022237422830.74
Pang1977HSILCIN2-352403360.83
Parham1991HSILCIN2-312626241661313410.63
LSILCIN11262108817129280.88
LSILCIN2-312627828435370.63
Parker1986LSILCIN136510235351930.38
Ramirez1990HSILCIN2-31873440.56
Rasbridge1995HSILCIN2-3595197104762180.51
LSILCIN15953745881820.73
LSILCIN2-3595269321861080.51
Ritter1988ASCUSCIN11911113911300.79
Ronco1996HSILCIN2-35020124140.64
Singh1985ASCUSCIN1107992510.94
HSILCIN2-31071780190.91
LSILCIN11077625420.94
LSILCIN2-31077324730.91
Skehan1990HSILCIN2-397402018190.62
LSILCIN19756122450.7
LSILCIN2-3975192980.62
Slawson1992ASCUSCIN1121283817380.55
Smith1987LSILCIN1122712013180.75
Soost1991ASCUSCIN120861486173344830.8
LSILCIN1208612051864542410.67
Soutter1986HSILCIN2-3172614214550.6
LSILCIN1172891837280.62
LSILCIN2-3172861740290.6
Spinillo1998ASCUSCIN1932391638740.06
HSILCIN2-3932831688250.11
ASCUSCIN1202181941610.18
HSILCIN2-3202471741340.32
Spitzer1987ASCUSCIN197322319230.57
LSILCIN19710452400.57
Stafl1981LSILCIN12679190.62
Syrjanen1987LSILCIN138511844401830.42
Tait1988HSILCIN2-3127202120660.32
LSILCIN1127381314620.4
LSILCIN2-312732920660.32
Tawa1988ASCUSCIN13971467252910.2
Tay1987HSILCIN2-3443150260.41
LSILCIN1441224080.82
LSILCIN2-3447115210.41
Upadhyay1984HSILCIN2-3308222068180.72
LSILCIN1308239153150.78
LSILCIN2-3308222070160.72
Walker1986LSILCIN12141404215170.85
Wetrich1986HSILCIN2-316074912501647020.46
LSILCIN116079542211432890.73
LSILCIN2-31607657844404260.46
Wheelock1989ASCUSCIN12731013273670.49
HSILCIN2-3273176991780.32
LSILCIN12736469261140.49
LSILCIN2-32734838421450.32
Wright1995ASCUSCIN139812356941250.45
Wright1994bASCUSCIN135387223160.04
LSILCIN135341153330.04
Young1993ASCUSCIN14122765242420.8
HSILCIN2-34125643113020.24
LSILCIN14122369223610.8
LSILCIN2-34129181681450.24

1 Disease defined according to reference standard threshold.

ASCUS = atypical squamous cells of undetermined significance; CIN = Cervical intraepithelial neoplasia; LSIL = low-grade squamous intraepithelial lesion; HSIL = High-grade squamous intraepithelial lesion

Table 22. Summary Effectiveness Scores from Single-Parameter Model
Test Thresholds (screening/reference)Summary Effectiveness Score95% Confidence IntervalLog Odds Ratio95% Confidence Interval
ASCUS/CIN11.0270.777, 1.1441.8631.409; 2.075
LSIL/CIN11.0840.942, 1.2261.9661.709; 2.224
LSIL/CIN2-31.0620.872, 1.2521.9261.582; 2.271
HSIL/CIN2-31.2871.075, 1.4992.3341.950; 2.719

ASCUS = atypical squamous cells of undetermined significance; CIN = cervical intraepithelial neoplasia; LSIL = low grade squamous intraepithelial lesion; HSIL = high grade squamous intraepithelial lesion

Data from the 84 studies of conventional Pap tests included in the meta-analysis are summarized in Table 21. These data were examined in a series of logistic regression models, one for each of the four screening test/reference test threshold combinations, and one for each of the outcomes sensitivity, specificity, and effectiveness score (D). Data from studies using a colposcopy reference standard were combined with those from studies using a histological reference standard. To attempt to explain between-study variation in effectiveness score, we added a term describing an implicit threshold (S) to the models; however, it did not explain a statistically significant amount of variation in the outcomes; therefore, the simpler one-parameter models are displayed. The summary effectiveness score and log odds ratio for each of the four thresholds are shown in Table 22. The summary effectiveness scores ranged from 1.03 (95% confidence interval [CI]; 0.78 to 1.14) for ASCUS/CIN1 to 1.29 (95% CI; 1.08 to 1.50) for HSIL/CIN2-3. Although the effectiveness score for each threshold was relatively low, better discrimination was seen with higher cytological and histological thresholds.

An external file that holds a picture, illustration, etc., usually as some form of binary object. The name of referred object is f9_F014.jpg.

   Figure 14. Summary ROC curve of studies reporting on ASCUS/CIN1 threshold

An external file that holds a picture, illustration, etc., usually as some form of binary object. The name of referred object is f9_F015.jpg.

   Figure 15. Summary ROC curve of studies reporting on LSIL/CIN1 threshold

An external file that holds a picture, illustration, etc., usually as some form of binary object. The name of referred object is f9_F016.jpg.

   Figure 16. Summary ROC curve of studies reporting on LSIL/CIN2-3 threshold.

An external file that holds a picture, illustration, etc., usually as some form of binary object. The name of referred object is f9_F017.jpg.

   Figure 17. Summary ROC curve of studies reporting on HSIL/CIN2-3 threshold

Effectiveness score estimates were used to derive the ROC curves displayed in Figures 14-17, with each figure representing a different test threshold/reference standard threshold combination. Points representing sensitivity and specificity combinations from individual studies are also plotted to describe the range of data from which the ROC curve was generated. It is important to note in interpreting these figures that the movement along a single ROC curve does not correspond to movement across different grades of cytological abnormality (e.g., ASCUS through CIN2/3) because each ROC curve describes studies using a single combination of test threshold and reference standard threshold. Rather, the change in sensitivity and specificity should reflect differences in "implicit" threshold.

Table 23. Parameter Estimates From Model That Includes Disease Prevalence
ThresholdParameterEstimate95% Confidence Interval
ASCUS/CIN1Prevalence-0.704-1.772 to 0.364
Intercept1.3461.058 to 1.634
LSIL/CIN1Prevalence-0.756-1.212 to -0.300
Intercept1.5371.346 to 1.728
LSIL/CIN2-3Prevalence-0.906-1.483 to -0.330
Intercept1.4011.292 to 1.510
HSIL/CIN2-3Prevalence-2.107-2.692 to -1.522
Intercept2.2572.122 to 2.391

ASCUS = atypical squamous cells of undetermined significance; CIN = cervical intraepithelial neoplasia; LSIL = low grade squamous intraepithelial lesion; HSIL = high grade squamous intraepithelial lesion

Effectiveness score = intercept + beta 1 X prevalence

An external file that holds a picture, illustration, etc., usually as some form of binary object. The name of referred object is f9_F018.jpg.

   Figure 18. Summary ROC curve of studies reporting On ASCUS/CIN1 threshold with varying prevalence of disease

An external file that holds a picture, illustration, etc., usually as some form of binary object. The name of referred object is f9_F019.jpg.

   Figure 19. Summary ROC curve of studies reporting on LSIL/CIN1 threshold with varying prevalence of disease

An external file that holds a picture, illustration, etc., usually as some form of binary object. The name of referred object is f9_F020.jpg.

   Figure 20. Summary ROC curve of studies reporting on LSIL/CIN2-3 threshold with varying prevalence of disease.

An external file that holds a picture, illustration, etc., usually as some form of binary object. The name of referred object is f9_F021.jpg.

   Figure 21. Summary ROC curve of studies reporting on HSIL/CIN2-3 threshold with varying

Next, we attempted to explain between-study variation in effectiveness score by including the prevalence of "disease" according to the reference standard threshold as an independent variable in each logistic regression model. This proved to be highly statistically significant. The proportion of "diseased" subjects in the included studies ranged from 0.02 to 0.98. The parameter estimates from the model are shown in Table 23. The negative parameter estimate for prevalence term indicates that as prevalence increases, the effectiveness score decreases, sensitivity increases, and specificity decreases. This effect is consistent with the idea that prevalence is a marker of workup bias. To illustrate its effect, we calculated separate ROC curves for disease prevalences of 0.10, 0.25, and 0.50. As seen in Figures 18-21, as prevalence increases, test effectiveness decreases.

We next examined the effect of the various determinants of the quality of studies on effectiveness scores. We examined the presence of blinded evaluation of the screening test in a model that included prevalence for each of the four test threshold/reference standard threshold combinations. A significant effect was observed only for the HSIL/CIN2-3 threshold; the parameter estimate of 0.940 (95% CI; 0.455 to 1.425) was positive, indicating that studies employing blinded evaluation of cytology demonstrated better discrimination.

The effects of verification of test negative subjects and type of reference standard on between-study variation in effectiveness scores were difficult to assess together with prevalence because there were only a few studies and because of collinearity, leading to nonconvergence of maximum likelihood estimates for most thresholds. Because of these problems, we evaluated the effects of these variables separately.

We also evaluated the effect of the quality score on sensitivity, specificity, and effectiveness. Neither verification nor the numerical dichotomous quality score variables were significantly associated with the test operating characteristics. The type of reference standard was significantly associated with effectiveness only at the HSIL/CIN2-3 thresholds, where a histology reference standard led to improved effectiveness scores compared with a colposcopy reference standard.

To calculate estimates for Pap test performance that are applicable to a low-prevalence screened population we had two options. One option was to use predictions from the regression model that included prevalence, substituting appropriately low values for prevalence. Then from the resulting effectiveness score, we would need to choose an appropriate combination of sensitivity and specificity. The other option was to use the subset of studies that were conducted in low-prevalence screened populations and avoid workup bias.

We chose the latter option for two reasons. First, we suspected that the prevalence term was more an indicator of biases than of different prevalence in the underlying population of interest; thus, adjusting for "low prevalence" might have uncertain effects. Second, we had no clear rationale for choosing any particular combination of sensitivity and specificity from the joint distribution described by the adjusted effectiveness score that would result from the first option.

Only three studies identified patients undergoing initial Pap smear screening and verified all (or a random fraction of) test negative subjects. In this subgroup, the combined sensitivity was 0.51 (95 percent CI; 0.37-0.66) and the combined specificity was 0.98 (95% CI; 0.97-0.99). The summary effectiveness score was 2.185 (95% CI; 1.658-2.712). Prevalence of disease in these studies ranged from 10 percent to 19 percent. The data points representing these three studies are circled in Figure 14.

Results: Cost Estimates of Screening, Diagnosis, and Treatment

The following sections describe cost estimates for using the Pap smear (and other technologies) to screen for, diagnose, and treat cervical cancer and its precursors.

Cost of Screening

Estimating the costs of using the Pap smear to screen for cervical cancer is a complex process because costs need to be estimated for both obtaining and processing the smear. Estimates of these screening costs, and the methods used to obtain the estimates, are described below. Estimates of the total cost of Pap smear screening and of the cost of new screening techniques are also provided.

Cost of Obtaining Pap Smears

Several steps were taken in this study to estimate the cost of collecting cytology samples. In the past, studies have used either the cost of an entire office visit or the amount of reimbursement made for the sample collection as a proxy. This overestimates the cost, as Pap smears are usually performed as part of an annual gynecological exam, which involves other procedures as well. In this study, attempts were made to arrive at more accurate estimates. Only the cost of the proportion of the office visit that was directly attributable to obtaining a Pap smear was included. To do so, first the time spent by the physician and other health care staff in obtaining smears was estimated. Second, the cost per minute of office visit time was calculated. Finally, the time devoted to collecting the samples during the office visit was multiplied by the cost per minute to obtain the cost attributable to the collection process.

The physician's time spent to obtain the smears was estimated from the National Ambulatory Medical Care Survey (NAMCS). This survey provides data on ambulatory medical care provided in physicians' offices and is based on a sample of nationally representative patient visits. The survey contains information on age, race, and sex of the patient, organized by selected physician characteristics such as type of specialty and geographic location. Information provided about the clinical substance of the visit includes the patient's problem, the physician's diagnosis (ICD-9-CM), the diagnostic/screening services provided, the surgical procedures performed, and the duration of the visit. Health Economics Research, Inc., analyzed the 1992 NAMCS data, which contained 34,606 patient records from 1,558 doctors who participated in the survey.

The NAMCS data provide information on the amount of time (minutes) spent in direct face-to-face contact with the physician. Separate estimates of physician time were obtained for patients who were 20-64 years old and for those who were 65 years or older. A regression technique was employed to derive the proportion of time spent obtaining a Pap smear. The model used is shown below:

Log (MINUTES) = B1 + B2 PAP_TEST

where

PAP_TEST = 1 if performed; = 0 otherwise

The B1 value is an intercept, and the B2 value is the proportion of time spent obtaining the smears. A log distribution was used to model the skewed distribution of office visit minutes.

Table 24. Estimates of Physician and Total Time Spent Obtaining Pap Smears During Office Visits
Age (yr)Percent. of Time Spent Obtaining Pap Smear 1 (%)XAverage Duration of Pap Smear Visit (minutes)=Physician Time Spent Obtaining Pap Smear (minutes)+Other Medical Staff Time Spent 2 (minutes)=Total Time Spent Obtaining Pap Smears (minutes)
Age 20-647.6718.781.445.867.30
Age 65 and older27.020.075.425.8611.28
1

Percentage increase in time was obtained by econometric estimation: log(MINUTES) = B1 + B2 PAP_TEST, where PAP_TEST = 1 if performed; 0 otherwise.

2

Other medical staff time was calculated as total time minus physician time (7.30 - 1.44 = 5.86 minutes). For total time spent obtaining Pap smear, see Waugh, Smith, Robertson et al. 1996a

SOURCE: Analysis by Health Economics Research, Inc., of data from the National Ambulatory Medical Care Survey.

Table 24 shows the calculations used to derive estimates of time spent by physicians to obtain Pap smears during office visits. The physician time spent in obtaining the smears was 1.44 minutes for women under the age of 65 and 5.42 minutes for those 65 and older. The difference in the amounts of physician time reflects the difficulties that may be experienced in obtaining cytology samples from women as they age.

The NAMCS data only contain information on time the doctor spent with the patient, not on time the patient spent receiving care from someone else, for instance, a nurse or a technician. Therefore, time spent by other medical staff who assisted in obtaining the smears was estimated by subtracting physician time from total time, calculated by Waugh, Smith, Robertson et al. (1996a), for collecting smears from the under-65 population. When the time spent by other staff was included, the total time spent obtaining the smears was estimated as 7.30 minutes for the younger age group and 11.28 minutes for the older age group.

Table 25. Calculating of Office Visits for Pap Smears
Age 20-64Age 65 & Older
CPT CodeNo. of Pap Visits% SmearsCost per Visit (MEDSTAT)Time (minutes)No. of Pap Visits% SmearsCost per Visit (MEDSTAT)Cost per Visit (Medicare)Time (minutes)
992112,5391.9$32.095262.2$32.09$13.895
9921210,6518.038.5910413.538.5924.8510
9921341,18830.950.621527823.950.6235.4515
9921444,05933.167.212536631.567.2154.8325
9921516,29212.289.654034629.889.6586.6240
992016900.554.631080.754.6328.8810
992022,0311.560.2720100.960.2746.4220
992036,4864.975.0130232.075.0163.6030
992046,4294.894.9645332.894.9695.0345
992052,7252.0113.0660302.6113.06119.1660
Average -- -- $64.3523.93 -- -- $70.11$60.4227.52
Range -- -- $50.62-$89.6515-40 -- -- $50.62-$89.65 $35.45-$86.6215-40
Cost per minute -- -- $2.70 -- -- -- $2.55$2.20 --
Cost per minute- range -- -- $2.24-$3.37 -- -- -- $2.24-$3.37$2.6-$2.37 --

NOTE: All cost estimates are reported in 1997 dollars. CPT = Current Procedures Terminology. SOURCE: Analysis by Health Economics Research, Inc., of MEDSTAT data.

In Table 25, the cost per minute of office visits for Pap smears is shown. For the 20- to 64-year-old age group, office visit cost was obtained from MEDSTAT data; for patients 65 years old and older, data from both MEDSTAT and Medicare payments were used. Weighted average costs per visit and time spent were obtained using the distribution of Pap smears by type of office visit. The cost per minute derived from these estimates was $2.70 for those 20-64 years old (from MEDSTAT data), and $2.55 and $2.20 for the 65 years and older group (from MEDSTAT data and from the Medicare Fee Schedule, respectively). Ranges of the cost estimates were also provided to allow for sensitivity analyses.

Cost of Processing Pap Smears

Table 26. Average Cost for Processing Cytology Samples from Pap Smears
CPT CodeDescription% of SmearsAge 20-64 MEDSTAT CostsAge 65 & Older Medicare Payments
88150Screening by technician under physician supervision80.7$18.75$8.33
88151Requiring interpretation by physician6.1$19.08$36.16
88155With definitive hormonal evaluation13.2$20.81$9.65
-- Average cost per smear -- $18.97$10.19
-- STD -- $6.94 --
-- Range +/- STD -- $12.03-$25.91 --

All cost estimates are reported in 1997 dollars.

CPT = Current Procedures Terminology; RBRVS = resource based relative value scale; STD = standard deviation.

SOURCE: MEDSTAT data analysis; Medicare's RBRVS fee schedule and Clinical Laboratory fee schedule.

The average cost of processing cytology samples from Pap smears is shown in Table 26. The proportion of smears requiring physician interpretation and the proportion of those with definitive hormonal evaluation are presented, along with the charges associated with processing the cytology samples. The laboratory charges were derived from MEDSTAT data and from Medicare's Clinical Laboratory fee schedule and from the resource-based relative value scale fee schedule (RBRVS). The weighted average cost per smear read from the MEDSTAT data was $18.97, with a standard deviation of $6.94, for patients aged 20 to 64. The average Medicare reimbursement for processing Pap smears was estimated to be $10.19 for patients aged 65 years and older.

Total Cost of Pap Smear Screening

Table 27. Total Cost of Pap Smear Screening
Age (yrs)Total Time Spent Obtaining Pap Smear (minutes)XCost per Minute of Office Visit Time=Office Visit Cost for Pap Smear+Pap Smear Lab Processing cost=Total Cost of Performing Pap Smears
Age 20-647.30$2.70$19.71$18.97$38.68
Range -- $2.24-$3.37$16.35-$24.60 -- $35.32-$43.57
Age 65 and Older
MEDSTAT11.28$2.55$28.76$18.97$47.73
Range -- $2.24-$3.37$25.27-$38.01 -- $44.24-$56.98
Medicare11.28$2.20$24.82$10.19$35.01
Range -- $2.6-$2.37$24.36-$26.73 -- $34.55-$36.92

All cost estimates are reported in 1997 dollars.

SOURCE: Analysis by Health Economics Research, Inc., of National Ambulatory Medical Care Survey and MEDSTAT data, and Clinical Laboratory fee schedule.

In Table 27, the total cost of performing Pap smears is presented. Total cost, which included the office visit cost and the processing cost, was calculated for the two groups separately. For women ages 20-64 years, the total cost in 1997 dollars was $38.68. For women 65 years or older, the MEDSTAT estimate was $47.73 and the Medicare estimate was $35.01. Range estimates were provided for all cost estimates to allow for sensitivity analyses.

Cost of New Screening Techniques

Table 28. Estimates of Screening Costs for Cervical Cancer
Screening ProcedureOffice Visit Cost for Pap TestCost of Conventional Pap SmearCost of New TechnologyTotal Cost Estimate
Conventional Pap Age 20-64 Average Range $19.71 $16.35-$24.80 $18.97 -- -- -- $38.68 $35.32-$43.77
Age 65 and older -- MEDSTAT Average Range $28.76 $25.27-$38.01 $18.97 -- -- -- $47.73 $44.24-$56.98
Age 65 and older -- Medicare Average Range $24.82 $24.36-$26.73 $10.19 -- -- $35.01 $34.55-$36.92
AutoPap® Age 20-64 Average Range $19.71 $16.35-$24.80 $18.97 -- $7.58 $7.15-$8.00 $46.26 $42.47-$51.77
Age 65 and older -- MEDSTAT Average Range $28.76 $25.27-$38.01 $18.97 -- $7.58 $7.15-$8.00 $55.31 $51.39-$64.98
Age 65 and older -- Medicare Average Range $24.82 $24.36-$26.73 $10.19 -- $7.58 $7.15-$8.00 $42.59 $41.70-$44.92
Papnet® Age 20 to 64 Average Range$19.71 $16.35-$24.80$18.97 -- $15.00 $10.00-$18.00 $53.68 $45.32-$61.77
Age 65 and older -- MEDSTAT Average Range $28.76 $25.27-$38.01 $18.97 -- $15.00 $10.00-$18.00$62.73 $54.24-$74.98
Age 65 and older -- Medicare Average Range $24.82 $24.36-$26.73 $10.19 -- $15.00 $10.00-$18.00 $50.01 $44.55-$54.92
ThinPrep® Age 20 to 64 Average Range $19.71 $16.35-$24.80 -- -- $24.57 $15.00-$30.00 $44.28 $31.35-$54.80
Age 65 and older -- MEDSTAT Average Range $28.76 $25.27-$38.01 -- -- $24.57 $15.00-$30.00 $53.33 $40.27-$68.01
Age 65 and older -- Medicare Average Range $24.82 $24.36-$26.73 -- -- $24.57 $15.00-$30.00 $49.39 $39.36-$56.73
®

SOURCE: Analysis by Health Economics Research, Inc., of MEDSTAT data from the National Ambulatory Medical Care Survey, and information from ThinPrep, AutoPap, and Papnetdevelopers.

Numerous technological advances have been made in the collection of specimens and in the interpretation of Pap smears. Table 28 contains the cost or charge per smear, using various screening procedures available in the market today -- conventional Pap, ThinPrep®, AutoPap®, and Papnet®. Cost estimates were provided for the conventional Pap smear, and charges were presented for the others (cost information was not available).

ThinPrep®

The costs to perform a ThinPrep® Pap test can be broken into the following categories for a clinical laboratory: supplies, preparation costs, evaluation costs, and indirect expenses. The laboratory supplies consist of the ThinPrep® Pap test kit that is distributed to clinicians along with stains and fixatives. The current retail price to laboratories for the ThinPrep® Pap test kit and related supplies is $9.75. Preparation costs include the ThinPrep® 2000 Processor cost of $45,000 as well as costs associated with personnel for preparing the slide for processing and disposing of waste after processing. Amortizing the cost of the ThinPrep® 2000 Processor over a 5-year period and assuming that 40,000 slides are processed annually yields an estimated $0.22 equipment cost.

The last two categories of costs, evaluation and indirect expenses, as well as the personnel costs for preparing the slide for processing and disposing of waste, are essentially equivalent between the ThinPrep® technology and the conventional Pap smear technology. A recent study conducted by the College of American Pathologists (CAP) estimated these costs for conventional Pap smears to be approximately $14.60. Thus, a total cost estimate of $24.57 can be derived by summing the components.

This estimate, however, excludes the cost savings attributed to a reduction in the number of Pap smears that may be repeated because the specimen is determined to have a "Satisfactory but limited by" (SBLB) diagnosis. The manufacturer of ThinPrep® argues that specimen quality is significantly improved with this new technology, and it can reduce the number of specimens labeled SBLB (currently estimated to be 40 percent) as well as the number of repeat Pap smears that result from this labeling. Cytyc Corporation estimates a 50 percent reduction in the number of specimens labeled as SBLB; however, estimates on the number of women who return for a repeat Pap smear as a result of an SBLB diagnosis are not well known. The cost estimate of $24.57 does not include any adjustments to reflect these potential cost savings. Ignoring these costs will tend to favor conventional Pap smears over the new ThinPrep® technology, and this is a conservative assumption.

However, the cost analysis has been conducted from the perspective of the health care system, with the majority of costs estimated using payments or allowed charges as a proxy for costs. Thus, it seems reasonable to take a similar approach to the estimation of costs for the new technologies. Cytyc Corporation estimates from a large number of private insurers that the average payment to laboratories for primary screening using the ThinPrep® technology ranges from $15.00 to $30.00 and includes all nonoffice costs associated with the processing and evaluation of the ThinPrep® smears, including pathologist fees for reviewing abnormal slides. Because Medicare does not currently reimburse laboratories at a different level from conventional Pap smears when ThinPrep® smears are performed, we do not have current payment estimates for the Medicare population. Thus, the ThinPrep® payment rates from the private sector will be used as an estimate for the Medicare population as well.

Using the payment levels from the private sector for the ThinPrep® technology and office visit payment estimates derived from MEDSTAT data and Medicare's RBRVS physician fee schedule (see Table 27), the cost estimates for ThinPrep® smears were obtained (Table 28).

Papnet®

The incremental costs associated with the Papnet® technology may be broken down into the following categories: the costs of preparation and shipment of the "normal" slides to a central facility for processing, the costs associated with the Papnet® neural network computer system's processing of the slide at a central facility, the costs of a viewing station to review the data shipped back to the laboratory on compact disks, and the costs of triaging and reviewing selected slides by the cytotechnologist. Neuromedical Systems Inc. (NSI) estimates that the average cost to laboratories for these costs ranges from $12.00 to $18.00 depending on the laboratory's volume. These costs are in addition to the cytotechnologist and pathologist costs associated with the manual review of normal and abnormal slides and the costs of supplies for obtaining and processing the sample, costs that are already occurring with conventional Pap smears.

The charge for Papnet® rescreening has been reported in the literature and from NSI to range from $30.00 (Schechter, 1996) to $40.00 (literature provided by NSI). This charge is in addition to charges for the conventional Pap smear, which includes all nonoffice costs associated with the processing and evaluation of the conventional Pap smear, including pathologist fees for reviewing abnormal slides. However, we were unable to obtain information from NSI regarding current average payments for rescreening with Papnet® technology. Because cost estimates for the competing technologies were derived using average payments, rather than charges, we believe that it would be inappropriate to use charges for Papnet®. Thus, we model Papnet®'s cost-effectiveness using a cost estimate that more closely approximates the projected costs, rather than reflecting current charges (see Table 28).

AutoPap®

The incremental costs associated with the AutoPap® technology may be broken down into the following categories: equipment, software, service, and software licensing. NeoPath, Inc., estimates the total average cost to laboratories for these four cost components to be $4.50, with a range from $3.75 to $4.50. These costs are in addition to the cytotechnologist and pathologist costs associated with manual review of normal and abnormal slides and the costs of supplies for obtaining and processing the sample, which are already occurring with conventional Pap smears. When AutoPap® is used for quality control purposes, there are no additional incremental costs or savings associated with labor. In contrast, when it is used as a primary screening vehicle, NeoPath, Inc., estimates that there is an overall reduction in cytotechnologist labor of about 20 percent across all slides. The reduction is a function of fewer slides being processed for review by the laboratory's cytotechnologist.

This savings in cytotechnologist labor can be demonstrated through a simple example. Assuming a laboratory currently processes 100 patient slides using the conventional technique of manual review of all 100 slides by a cytotechnologist, the cytotechnologist must actually review 110 slides because of the CLIA quality control requirement of a 10 percent re-review. In contrast, the AutoPap® scores all 100 slides and archives 25 percent of them as the "most normal" of all slides. There is no manual review of these slides by the cytotechnologist. There is manual review of the remaining 75 slides as well as a review of 15 of the slides to meet CLIA quality control standards (set at 15 percent for AutoPap® versus 10 percent for conventional manual review). Thus, the total number of slides undergoing manual review by the cytotechnologist is actually 90 when the AutoPap® technology is used versus 110 when conventional manual review is done. Ignoring this cost will tend to favor conventional Pap smears relative to AutoPap®.

The cost analysis has been conducted from the perspective of the health care system, however, with the majority of costs estimated using payments or allowed charges as a proxy for costs. Thus, it seems reasonable to take a similar approach to the estimation of costs for the new technologies. NeoPath, Inc., estimates that average payment to laboratories for quality control screening using the AutoPap® technology ranges from $7.15 to $8.00 from a large number of public and private insurers, respectively. This payment is in addition to payment for the conventional Pap smear, which includes all nonoffice costs associated with the processing and evaluation of the conventional Pap smears, including pathologist fees for reviewing abnormal slides. Because Medicare does not currently reimburse laboratories at a different level from conventional Pap smears when AutoPap® is used to identify slides for review, we do not have current payment estimates for the Medicare population. Thus, the AutoPap® payment rates from the non-Medicare public and private sector will be used as an estimate for the Medicare population as well.

Using the above-referenced payment levels for the AutoPap® technology, conventional Pap smear payment rates to account for cytotechnologist and pathologist review costs and slide preparation costs, and office visit payment estimates derived from MEDSTAT data and Medicare's RBRVS physician fee schedule (see Table 27), the cost estimates for AutoPap® smears were obtained (see Table 28).

Cost of Diagnostic Procedures and Treatment

A claims analysis was performed to obtain the cost of diagnostic procedures and treatments for women 20-64 years old. Because no primary data analysis was undertaken for patients 65 years old and older, Medicare's RBRVS fee schedule, Clinical Laboratory fee schedule, diagnosis-related group payment rates, and ambulatory surgery center payment rates were reported.

All outpatient and inpatient claims for women (20-64 years old) with ICD-9 diagnosis codes for cervical cancer, carcinoma in situ, and dysplasia were included in the sample analyzed. The specific diagnosis codes used are shown below:

Cost of Diagnostic Procedures and Treatment
ICD-9 CodeDescription
180.xNeoplasm of cervix, primary (malignant)
233.1Carcinoma in situ of cervix
622.1Dysplasia of cervix

Pregnant women were excluded from the analysis, as costs associated with diagnosing and treating this group differ from those of the general population. Cases with only an ICD-9 code of 180.8, "other specified sites of cervix," and no other relevant diagnostic codes (180.0, 180.1, or 180.9) were excluded from the analysis as they may have indicated cases requiring specialized treatment. Very few cases were excluded because of this selection criterion.

Costs for procedures performed on an inpatient and on an outpatient basis are provided. Certain procedures are provided only on an inpatient basis (for example, exenteration), though others (colposcopy, cone biopsy) may be performed in either an inpatient or an outpatient setting. For the procedures in the latter category, the appropriate setting or the setting where the procedure was most often performed for women with cervical cancer was utilized to derive the estimates. Most of these procedures were estimated using outpatient (physician and hospital outpatient department) costs. For procedures where both inpatient and outpatient settings were deemed reasonable, the cost of the procedure was estimated for both the outpatient and the inpatient setting.

The analysis was limited to those individuals with a single procedure on a given day or admission to minimize differences related to multiple procedures. Thus, individuals who had both cone biopsy and LEEP performed on the same day would not have been included in the estimation because accurate allocation of costs to each specific procedure was not always possible. For outpatient procedures, in addition to the costs specifically related to the procedure (identified by the procedure code), we also estimated the cost of related services inherent in the performance of the procedure. Therefore, we evaluated all other outpatient services that were performed on the same day to determine if they were appropriate to include in our cost definitions. For instance, costs associated with anesthesia were added to the cost estimates to capture the total cost of performing the procedure. Supplementing procedure-specific costs with other related services provided allowed for a more comprehensive estimate of the costs associated with the procedure. In addition, all separately billed pathology services (CPT codes 88305 and 88307) were included in the final procedure cost.

Table 29. Cost Estimates for Diagnostic Procedures for Cervical Cancer
Age 20-64Age 65 and Older Medicare Payments
ProcedureNo. of Obs.MEDSTAT Average CostStandard Deviation25% Q150% Med.75% Q3
Colposcopy
Exploration5,412$142.6373.32$81$130$179$68.60
With Biopsy16,878$276.57101.30$195$267$351172.63
Cervical Biopsy1,266$189.81121.64$115$164$217131.46
1

All costs presented are in 1997 dollars.

Med. = median; Obs. = observations; Q1 = quartile 1; Q3 = quartile 3; RBRVS = resource based relative value scale.

SOURCE: Analysis by Health Economics Research, Inc., of MEDSTAT data and Medicare's RBRVS fee schedule.

Table 30. Cost Estimates for Cervical Cancer Treatment1
Age 20 to 64Age 65 and Older Medicare Payments
TreatmentNo. of Obs.MEDSTAT Average CostStandard DeviationDays25% Q150% Med75% Q3
1. Cryotherapy5,659$156$63 -- $108$142$184$111
2. Loop electrode excision procedure1,972564268 -- 352542710205
3. Laser ablation1,393730457 -- 390650976627
4. Cone biopsy
Oupatient4,856919443 -- 6238211,114831
Inpatient104,0522,2001.402,6403,2684,5642,459
5. Simple hysterectomy5185,8992,3483.054,3645,4076,9345,694
6. Radical hysterectomy11013,0776,3956.548,68811,45315,1058,535
7. Exenteration951,41361,66321.7817,01119,37353,2619,370
8. Radiation therapy
Inpatient (per admission)3,653
STAGE I723,7661,6312.382,6163,3974,537 --
STAGE II-III173,8832,3352.822,4393,8614,363 --
STAGE IV147,0745,8107.363,3324,69510,838 --
Outpatient (per service)4,531410468 -- 133258526196 2
9. Chemotherapy
Inpatient (per admission)2,523
All cases515,2264,9703.632,2533,4926,228 --
Less than 5 days393,5431,6762.642,2123,2463,932 --
Outpatient (per service)2,756191241 -- 5610522256 3
1

All costs presented are in 1997 dollars.

2

Medicare payment for most common radiation treatment received by women with cervical cancer -- weekly therapy (CPT code 77430).

3

Medicare payment for most common chemotherapy treatment received by women with cervical cancer -- infusion technique (CPT code 96410). This estimate does not include the cost of the chemotherapeutic agent.

CPT = Current Procedures Terminololgy; DRG = diagnostic-related group; Med. = median; Obs. = observations; Q1 = quartile 1; Q3 = quartile 3; RBRVS = resource based relative value scale.

SOURCE: Analysis by Health Economics Research, Inc., of MEDSTAT data and Medicare's RBRVS fee schedule/DRG payment.

Table 31. Cost Estimates for Other Procedures Used to Diagnose and Treat Cervical Cancer 1 (Continued)
ProceduresAge 20-64Age 65 and Older Medicare Payments
No. of Obs.MEDSTAT Average CostStandard Deviation25% Q150% Med.75% Q3
Endocervical curettage2,5469875$49$81$126$71
Dilation and curettage92525273$163$280$322642
Dilation of cervical canal9711988$75$82$13652
Endometrial sampling (biopsy)2,8331996$162$199$23658
Cauterization of cervix703178247$55$108$20195
Pelvic exam under anesthesia199201170$81$184$341136
Chest X-ray5,5344358$24$31$4835
CAT scan2,104312248$170$193$379271
Barium enema1,20410856$56$107$132101
Bone scan23236251$56$143$261122 2
Magnetic resonance imaging156563453$314$490$916497
Intravenous pyelogram7339267$105$248$477149
Cystoscopy28982851$226$674$1,546184 3
Proctoscopy7774543$335$672$1,179196 3
1

All costs presented are in 1997 dollars.

2

This estimate does not include the cost of the scanning agent.

3

These estimates only reflect costs associated with the professional component.

Med. = median; Obs. = observations; Q1 = quartile 1; Q3 = quartile 3; RBRVS = resource based relative value scale; CAT = computerized axial tomography.

SOURCE: Analysis by Health Economics Research, Inc., of MEDSTAT data and Medicare's RBRVS fee schedule.

Cost estimates in 1997 dollars for diagnosis and treatment of cervical cancer are provided in Tables 29 and 30 respectively. In Table 31, additional cost estimates are provided for other relevant procedures associated with the diagnosis and treatment of cervical cancer. These include procedures such as endocervical curettage, CAT scans, and cytoscopy. In each table, Medicare cost estimates are provided for physician services (Medicare's RBRVS fee schedule), and hospital admissions (DRG payments).

For the cost estimates derived from MEDSTAT data, the number of observations available for analysis and the standard deviation associated with the average cost are provided (see Tables 29, 30, and 31). For inpatient procedures, the number of hospital days is shown (see Table 30). In certain cases, the standard deviation is very large and, therefore, the values for the first quartile, the median, and the third quartile are also provided (see Tables 29, 30, and 31). These values may be used to perform sensitivity analyses. In addition, comparing these values with the mean provides a good indication of the distribution (skewness) of the cost for the procedure. For instance, the average cost for LEEP ($564) and the median ($542) is similar. This indicates that the costs are evenly distributed (closely approximating a normal distribution). On the other hand, the mean cost for exenteration ($51,413) is much larger than the median ($19,373), indicating a skewed distribution where a few large charges are influencing the mean.

Cost by Stage of Disease

Table 32. Identifying Stages of Cervical Cancer
StagePrimary ICD-9 CodesSecondary ICD-9 Codes
LSIL622.1No diagnosis indicating cancer or CIS
HSIL233.1No diagnosis indicating cancer
Stage I180.0, 180.1, 180.9No diagnosis indicating other forms of cancer
Stage II180.0, 180.1, 180.9198.82, 198.6, 179, 181, 182.x, 183.x, 184.x
Stage III180.0, 180.1, 180.9198.82, 198.6, 179, 181, 182.x, 183.x, 184.x and either of these codes 593.5, 599.6, 593.89, 591
Stage IV180.0, 180.1, 180.9199.1, 199.0, 140.x - 176.x, 188.x - 195.x, 197.x , 198.0, 198.1, 198.2, 198.3, 198.4, 198.5, 198.7, 198.81, 198.89

HSIL = high-grade squamous intraepithelial lesions; ICD-9 = International Classification of Diseases, version 9; LSIL = low-grade squamous intraepithelial lesions; CIS = Carcinoma in situ

SOURCE: ICD-9-CM Code Book, St. Anthony Publishing.

Costs associated with six categories were identified. These categories included: (1) low-grade squamous intraepithelial lesions (LSIL); (2) high-grade squamous intraepithelial lesions (HSIL); (3) Stage I; (4) Stage II; (5) Stage III; and (6) Stage IV. Table 32 specifies the ICD-9 codes used to distinguish the stages of cervical cancer.

Table 33. Cost Associated with Diagnosis, Treatment, & Follow-up of Cervical Cancer by Stage of Disease
Lesions/Stage of DiseaseNo. of Obs.Average Inpatient CostAverage Outpatient CostAverage Total CostHospital Days
LSIL9,329 -- $1,728$1,728 --
HSIL
With oupatient services only1,257 -- 3,0493,049 --
With oupatient & inpatient Services337$6,3522,8069,1593.17
Stage I24810,9486,69717,6456.31
Stage II/III
Only Stage II/III costs3920,2986,77127,0699.59
Stages I and II/III5419,73411,43931,17313.17
Stage IV
Only Stage IV costs1731,1159,16440,28020.29
All stages (final diagnosis Stage IV)7037,25819,55256,81028.46

All costs presented are in 1997 dollars; costs are associated with women ages 20-64 years old diagnosed with cervical cancer.

HSIL = high-grade squamous intraepithelial lesions; LSIL = low-grade squamous intraepithelial lesions; Obs. = observations.

SOURCE: Analysis by Health Economics Research, Inc., of MEDSTAT data.

The cost by stage of cancer was only estimated for the 20-64 age group, as current treatment protocols were not analyzed for women 65 and older. To estimate the total cost associated with treating cervical cancer, all costs for inpatient admissions and outpatient visits with the relevant diagnosis were included. The costs by stage, estimated from the MEDSTAT data, are presented in Table 33. The average inpatient, outpatient, and total costs are provided for each stage. In addition, the number of hospital days is also indicated.

Very few cases were identified as Stage III and, therefore, a combined cost estimate is provided for patients with Stage II and III cervical cancer. Because ICD-9 codes do not include stage, there may be some misclassification of recurrent or persistent disease as higher stage cancers. The costs associated with all the treatments women received were included in a single estimate. Thus, in Table 33, for Stage II/III and Stage IV, two separate cost estimates were provided: one indicating costs specifically related to that particular stage and another that included costs associated with treatment of earlier stages as well. Because stage does not change once initially assigned, these estimates presumably represent women with recurrent or persistent disease. As would be expected, the average total cost increased as the severity of the disease increased. The cost for diagnosing, treating, and providing follow-up care for LSIL is $1,728, compared with $17,645 and $40,280 for Stages I and IV, respectively.

Table 34. Mean and Standard Deviation of Total Cost for Cervical Cancer by Stage of Disease1
Lesions/Stage of DiseaseTotal CostStandard Deviation25% Q150% Med.75% Q3
LSIL$1,728$1,498$675$1,143$2,274
HSIL
With oupatient services only3,0492,0331,3842,7334,203
With oupatient & inpatient services9,1594,2076,7108,55611,540
Stage I17,64517,5729,43914,00521,946
Stage II/III
Only Stage II/III costs27,06955,95311,73415,39229,089
Stages I and II/III31,17351,23113,30322,63132,858
Stage IV
Only Stage IV costs40,28053,39012,67017,77246,509
All stages (final diagnosis Stage IV)56,81043,93027,59550,70575,350
1

All costs presented are in 1997 dollars; costs are associated with women 20-64 years old diagnosed with cervical cancer.

HSIL = high-grade squamous intraepithelial lesions; LSIL = low-grade squamous intraepithelial lesions; Med. = median; Q1 = quartile 1; Q3 = quartile 3. SOURCE: Analysis by Health Economics Research, Inc., of MEDSTAT data.

In Table 34, the standard deviation of the cost estimates and the values for the first quartile, the median, and the third quartile are provided. The mean costs associated with the lower stages are very close to the median values, indicating that these cost values closely approximate a normal distribution. On the other hand, the costs for the later stages of cervical cancer (Stages II, III, and IV) are very skewed, and the mean value is determined to a large extent by a few cases with large average treatment costs. Again, this probably includes the care of women with persistent or recurrent disease, especially women with recurrent Stage I disease who subsequently undergo radical therapy such as exenteration. Because of the lack of precision in the ICD-9 codes, there is no way to determine the actual clinical scenario with certainty. It is therefore important to use a range of cost estimates to perform sensitivity analyses when these average cost estimates are used. For the cost-effectiveness analysis, we used higher estimates for the treatment of cancer cases in order to bias the analysis in favor of more sensitive screening strategies.

Results: Review of Studies and Models of Effectiveness and Cost-effectiveness

Table 35. Studies Meeting Inclusion Criteria for Evidence Table 2
StudyInterventionStudy DesignEconomic Impact?Health Impact?Cost-Effective?
Barnes, 1981ScreeningMathematical modelYesYesYes
Barnes and Barnes, 1977ScreeningMathematical modelYesYesCost-benefit
Bergstrom, Adami, Gustafsson et al., 1993ScreeningPopulation-based cohort-Yes-
Bethwaite, Rayner, and Bethwaite, 1986Screening intervalMathematical modelYesYesYes
Boon, de Graaff Guilloud, 1981ScreeningMathematical modelYesYesYes
Brown and Garber, 1998Screening test (Papnet® vs. AutoPap® vs. ThinPrep® vs. conventional Pap)Mathematical modelYesYesYes
Carter, Coburn, and Luszczak, 1993Screening in pregnancyMathematical modelYesYesYes
Cecchini, Bonardi, Iossa et al., 1997Various screening strategiesMathematical modelYesYesYes
Chesebro and Everett, 1996Treatment of abnormal PapMathematical modelYesYesYes
Dickinson, 1972ScreeningMathematical modelYesYesYes
Eddy, 1990Screening intervals, age of onsetMathematical modelYesYesYes
Fahs, Mandelblatt, Schechter et al., 1992Screening intervalsMathematical modelYesYesYes
Johnson, Sutton, Thornton et al., 1993Treatment of abnormal PapMathematical model-Yes-
Koopmanschap, van Oortmarssen, van Agt et al., 1990Screening program (organized vs. spontaneous)Mathematical modelYesYesYes
Luce, 1981Screening intervalsMathematical modelYesYesYes
Mandelblatt and Fahs, 1988ScreeningMathematical modelYesYesYes
Mandelblatt, Freeman, Winczewski et al., 1997Opportunistic screening in emergency room>Mathematical modelYesYesYes
Matsunaga, Tsuji, Sato et al., 1997ScreeningMathematical modelYesYesYes
Melnikow, Nuovo, and Paliescheskey, 1996Treatment of abnormal PapMathematical modelYesYesYes
Prabhakar, 1992ScreeningMathematical modelYesYesYes
Raab, 1997Screening, rescreening strategiesMathematical modelYesYesYes
Radensky and Mango, 1998Screening and rescreening of negativesMathematical modelYesYesYes
Richart and Barron, 1981Screening intervalsMathematical model-Yes-
Schechter, 1996Papnet® vs. conventional Pap screeningMathematical modelYesYesYes
Schweitzer, 1974ScreeningMathematical modelYesYesYes
Schweitzer and Luce, 1978Screening intervalsMathematical modelYesYesYes
Sherlaw-Johnson, Gallivan, Jenkins et al., 1994Treatment of abnormal PapMathematical modelYesYesYes
van Ballegooijen, Habbema, van Oortmarssen et al., 1992Screening program (organized vs. spontaneous)Mathematical modelYesYesYes
van Ballegooijen, van den Akker-van Marle, Warmerdam et al., 1997Screening (Pap vs. Pap plus human papillomavirus)Mathematical modelYesYesYes
van Nagell, Dudik, Frank et al., 1987ScreeningMathematical modelYesYesYes
van Oortmarssen, Habbema, and van Ballegooijen, 1992ScreeningMathematical modelYesYesYes
van Oortmarssen and Habbema, 1995ScreeningMathematical model-Yes-
Waugh and Robertson, 1996bScreening intervalMathematical modelYesYesYes
Waugh, Smith, Robertson et al., 1996Screening program (organized vs. spontaneous)Mathematical modelYesYesYes
Because no definitive randomized controlled trials have been published on the effectiveness of cervical cancer screening, policymakers have relied on decision modeling studies to integrate epidemiological data on the natural history of cervical cancer precursors, performance data on diagnostic tests for early cervical cancer or cervical cancer precursors, and cost data to draw conclusions about the relative benefits and costs of various screening strategies. Table 35, Studies Meeting Inclusion Criteria for Evidence Table 2, identifies 34 studies that modeled health or economic outcomes of alternative cervical cancer screening programs or alternative strategies for managing cytological abnormalities. These studies were found through a search of MEDLINE and a search of citations from relevant publications.

Alternative screening programs or strategies addressed by the studies included the following:

  • 1

    Pap screening compared with no use of Pap screening

  • 2

    Screening intervals for repeating Pap smear screening

  • 3

    Rescreening of negative Pap smears

  • 4

    Management of women found to have abnormal Pap smears

The studies used various approaches to address the question of the impact of Pap screening on health and economic outcomes. Some studies used cost-benefit analysis, and others used Markov modeling to estimate cost-effectiveness. Some studies used data from actual populations, and others used hypothetical cohorts combined with incidence and prevalence data. The best-quality models (Eddy, 1990 ; Fahs, Mandelblatt, Schechter et al., 1992) are the most often cited and have both been used for updated analyses of related questions. Eddy (1990) initially addressed age of onset for Pap screening and Pap screening intervals, which was reused for the Blue Cross and Blue Shield's Technical Evaluation Center analysis of Papnet®, AutoPap®, and ThinPrep® (Brown and Garber, 1998) and Radensky and Mango's (1998) analysis of Papnet® rescreening versus conventional Pap testing. The analysis by Fahs, Mandelblatt, Schechter et al. (1992), based on a 1990 report for the U.S. Congress Office of Technology Assessment (OTA), was reused for the analysis by Schechter (1996) of Papnet® cost-effectiveness.

Cost-effectiveness of Conventional Pap Testing

A series of comprehensive models of the cost-effectiveness of cervical cancer screening was initiated with a 1981 OTA report (U.S. Congress Office of Technology Assessment, 1981). Subsequently, efforts by Eddy (1990), Mandelblatt and Fahs (1988), and OTA (1990) (and a related study by Fahs, Mandelblatt, Schechter et al., 1992) addressed similar questions in somewhat different populations and perspectives. Eddy's target population for initial screening was asymptomatic 20-year-old women of average risk, whereas Fahs, Mandelblatt, Schechter et al. (1992) assessed women over the age of 65. Both analyses were conducted from the payer's perspective. All of these models used Markov processes, with the modification that transition probabilities change as patients or population age. Of all the models, Eddy's is considered to be the most general.

There are significant discrepancies in the results of different models of cervical cancer screening. For example, both Eddy (1990) and Fahs, Mandelblatt, Schechter et al. (1992) reported the marginal cost-effectiveness of triennial Pap smear screening for women age 65 or over under two conditions: (1) assuming no prior screening and (2) assuming previous regular screening. Triennial Pap screening beginning at age 65 in women with no prior screening was estimated to cost $22,448 per year of life saved, according to Eddy (1990), but was actually cost-saving in the Fahs, Mandelblatt, Schechter et al. (1992) analysis. When considering screening in women with prior regular screening, Eddy and Fahs et al. estimated different marginal cost-effectiveness ratios: $52,241 per year of life saved versus $33,572 per year of life saved, respectively (Eddy, 1990; Fahs, Mandelblatt, Schechter et al., 1992). It is difficult to assess which differences in model assumptions account for the differing results. However, sensitivity of Pap smears is known to strongly influence cost-effectiveness ratios. Eddy used 85 percent sensitivity in his base case analysis, whereas Fahs et al. used 75 percent sensitivity in their model. Another principal difference between Eddy (1990) and Fahs, Mandelblatt, Schechter et al. (1992) was whether the models assessed implementing a change in screening practices that might detect a large, one-time benefit by detecting prevalent cases in an unscreened population versus assessing the ongoing benefit one might see in a continuing policy. By attempting to recreate the results of Eddy (1990) and Fahs, Mandelblatt, Schechter et al. (1992) with the model created for this report, we were able to identify other differences, which are discussed in greater detail below.

Cost-Effectiveness of Three New Technologies

Two analyses have addressed the cost-effectiveness of Papnet®, which provides interactive neural-network rescreening, versus unassisted manual interpretation of Pap smears (Schechter, 1996; Radensky and Mango, 1998). Schechter (1996) revised a previously published model constructed for OTA that examined the cost-effectiveness of implementing Pap screening as a Medicare benefit (Fahs, Mandelblatt, Schechter et al., 1992; U.S. Congress Office of Technology Assessment, 1990). Parameters in the model were updated to reflect the general U.S. population of women undergoing Pap testing, and the addition of a Papnet® strategy with information on the sensitivity of the new technology was supplied by the manufacturer. The primary result of this study, $48,474 per life-year saved for biennially screened women, is given in terms of the overall cost-effectiveness ratio (which compares Papnet® screening to no screening) rather than in terms of the conventional marginal (or incremental) cost-effectiveness ratio (which would require comparing Papnet® screening with conventional Pap screening). This makes it appear that Papnet® is highly cost-effective when, in fact, most of the cost-effectiveness is from conventional Pap testing; the incremental cost-effectiveness ratio for Papnet®-assisted compared with that for conventional Pap testing is much higher.

Radensky and Mango (1998) adapted Eddy's (1990) model to estimate the marginal cost-effectiveness ratio of using interactive neural network-assisted (INNA) rescreening of negative smears compared with unassisted manual examination of cervical smears at a frequency of every 3 years. The cost-effectiveness ratios ranged from $39,087 to $79,440, depending on the sensitivity of the Pap smear examination and on the specificity of INNA rescreening. More frequent screening intervals than every 3 years were not examined in this analysis.

Brown and Garber (1998) performed a cost-effectiveness analysis of Papnet®, AutoPap® and ThinPrep® for cervical cancer screening for the Blue Cross and Blue Shield Association's Technology Evaluation Center. This analysis was based upon the model by Eddy (1990). Results from this model show that Papnet®, AutoPap®, and ThinPrep® modestly improve on the diagnostic accuracy of conventional Pap screening. These screening technologies are likely to increase life expectancy by only 1 or 2 days for most women and only by a few hours for those women who already get Pap tests frequently. The marginal cost-effectiveness ratios of the new technologies were found to exceed $80,000 per additional year of life saved with biannual screening and far greater with annual screening. However, the cost-effectiveness ratios are sensitive to the costs of each technology, sensitivity of the initial test, prevalence of cervical cancer, and proportion of tumors that grow rapidly.

Results: Cost-Effectiveness Analysis

For the cost-effectiveness analysis, we compared conventional Pap smears at 1-, 2-, and 3-year intervals, with 10 percent manual rescreening of smears initially read as normal, with two types of technologies: one that improves the sensitivity of the initial screening step (10 percent of initially normal smears are rescreened), and one in which the sensitivity of the initial screening step is unchanged but 100 percent of initially normal smears are rescreened with improved sensitivity.

We attempted to define thresholds of cost, sensitivity, and specificity at which improved initial or rescreening technology would produce cost-effectiveness ratios of $50,000 per life-year or less. Because of the significant uncertainty surrounding both the effectiveness and the incremental costs of the new technologies, we did not attempt to estimate cost-effectiveness ratios for any specific technology.

The section begins with tables illustrating the effect of increasing conventional Pap sensitivity. We examine the effect of reducing the false negative rate by 40 percent, 60 percent, and 90 percent, using any technology, on life expectancy, cases of cancer, cancer deaths, and morbid treatments predicted by the model at 1-, 2-, and 3-year screening intervals. We also present estimates of the cumulative number of cases and deaths predicted for various age groups with each alternative screening strategy.

Because increasing the frequency of Pap smear screening is a legitimate option for increasing the sensitivity of a screening program, the results as incremental costs per life-year saved comparing all possible combinations of technology and frequency.

Terminology

The sensitivity of a test is defined as the conditional probability that, given the presence of disease, the test will be positive. In the setting of our model, where the presence of disease is defined by the various Markov states, the sensitivity is equivalent to the true positive rate. The FNR is equal to the quantity:
1 - sensitivity.

Because modeling probabilities is easiest with variables between 0 and 1, we expressed the potential improvement in Pap test performance as a reduction in the FNR:
X (1 -sensitivity).

A reduction in the FNR is equivalent to improving sensitivity within the context of the model. The two expressions are used interchangeably throughout this section of the report.

Base Case Parameter Estimates

Table 36. Probability Estimates for Natural History Model
ParameterBase CaseRangeReferences
Prevalence of HPV infection, age 150.100-0.25Assumption
Prevalence of LSIL, age 150.010-0.1Assumption
Age-specific incidence of HPV infectionKiviat, 1996; Koutsky, 1997;Ho, Bierman, Beardsley et al., 1998;Hildesheim, Schiffman, Gravitt et al., 1994
150.10.5-2 X base estimate
160.1
170.12
180.15
190.17
200.15
210.12
220.10
230.10
24-290.05
30-490.01
50 +0.005
Age-specific regression rate, HPV infection (HPV to Well)Ho, Bierman, Beardsley, et al, 1998;Moscicki, Shiboski, Broering et al., 1998; Koutsky, Holmes, Critchlow et al., 1992
15-240.7/18 months0.6-0.9
25-290.5/18 months0.45-0.6
30+0.15/18 months0.1-0.2
Progression rate (HPV to LSIL)0.2/36 months0.15-0.3
Proportion of infections progressing directly to HSIL0.10.05-0.5
Probability of progression after treatment for SIL0.05Eddy 1990; Fahs, Mandelblatt, Schechter et al., 1992; Schechter 1996
Regression rate (LSIL to unknown HPV) Syrjanen, Kataja, Yliskoski et al., 1992;van Oortmarssen and Habbema 1991; Hildesheim, Schiffman, Gravitt et al., 1994; Munoz, Kato, Bosch et al., 1996;Bearman, MacMillan, and Creasman, 1987; Pretorius, Semrad, Watring et al., 1991; Janerich, Hadjimichael, Schwartz et al., 1995; Schwartz, Hadjimichael, Lowell et al., 1996
Age 15-340.65/72 months0.6-0.8
35+0.4/72 months0.3-0.6
Progression rate (LSIL to HSIL)
Age 15-340.1/72 months0.1-0.3
35+0.35/72 months0.3-0.5
Regression rate (HSIL to LSIL)0.35/72 months0.3-0.5
Progression rate (HSIL to Stage I cancer)0.4/120 months0.3-0.5
Progression rates and probability of symptoms in unscreened patients with cancer
Stage I
Progression rate (Stage I to Stage II)0.9/4 years
Annual probability of symptoms0.15
Stage II
Progression rate (Stage II to Stage III)0.9/3 years
Annual Probability of Symptoms0.225
Stage III
Progression Rate (Stage III to Stage IV)0.9/2 years
Annual probability of symptoms0.6
Stage IV
Annual probability of symptoms0.9
Annual probability of survival after diagnosis by stageFremgen, personal communication; Jones, Shingleton, Russell et al., 1995
Stage I
Year 10.97
Year 20.97
Year 30.97
Year 40.97
Year 50.97
Stage II
Year 10.90
Year 20.89
Year 30.91
Year 40.95
Year 50.96
Stage III
Year 10.70
Year 20.74
Year 30.86
Year 40.93
Year 50.83
Stage IV
Year 10.40
Year 20.50
Year 30.86
Year 40.85
Year 50.90
Table 37. Probability Estimates for Screening
ParameterBase Case RangeReferences
Conventional Pap
Sensitivity, cytology ASCUS+ for histology LSIL + 0.510.51-0.788See meta-analysis
Specificity, cytology ASCUS+ for histology LSIL +0.970.888-0.97See meta-analysis
Percentage of smears read as normal in initial screening step re-read at same sensitivity and specificity0.10NANA
Technology improving initial screening step
Relative decrease in false negative rate of initial screening step0.60.4-0.9Roberts, Gurley, Thurloe et al., 1997;Bolick and Hellman, 1998; Brown and Garber, 1998
Relative specificity10.9-1.0Assumption
Percentage of smears read as normal in initial screening step re-read at same sensitivity and specificity10%NANA
Technology improving rescreening step
Relative decrease in false negative rate of rescreening step0.60.4-0.9Stevens, Milne, James et al., 1997;Patten, Lee, Wilbur et al., 1997aColgan, Patten, and Lee, 1995;Brown and Garber, 1998; Kaufman, Schreiber and Carter, 1998; Jenny, Isenegger, Boon et al., 1997
Relative specificity of rescreening step10.9-1.0Assumption
Percentage of smears read as normal in initial screening step re-read at same sensitivity and specificity100%NANA

NA = not available.

Sources of parameter estimates are described in detail in the section on Methodology. In Tables 36 and 37 these estimates are summarized for the base case implementation of the model.

Table 38. Total Sensitivity of Conventional Pap with 10 percent Rescreening, Improved Initial Screening with 10 percent Rescreening, and Improved Rescreening of 100 percent of Initially Normal Smears
StrategyPrimary StepRescreening Step Overall Sensitivity
SensitivitySensitivity X False negative rateSensitivity + (proportion rescreened X Sensitivity X False negative rate)
Conventional Pap0.510.51 X (1-0.51)10% Rescreening: 0.51 + (0.1) X (0.51) X (0.49) = 0.535
Improved initial screening[0.51 + (0.6) X (1-0.51)][0.51 + (0.6) X (1-0.51)] X (1-0.51)10% Rescreening: [0.51 + (0.6) X (1-0.51)] + {(0.1) X [0.51 + (0.6) X (1-0.51)] X 0.49} = 0.843
Improved rescreening0.51(0.6 X 0.49)0.51 + [(1.0) X (0.6 X 0.49] =0.804
Table 38 illustrates the calculation of the total sensitivity and specificity for conventional Pap testing with 10 percent rescreening of initially normal smears, improved initial screening with 10 percent rescreening of normal smears, and improved rescreening of 100 percent of initially normal smears.

Effect of Increasing Conventional Pap Sensitivity

Although life expectancy is the most common measure of effectiveness used in economic analyses of health policies, there are other important measures. We first present the predicted effects of various screening strategies on cervical cancer incidence, mortality, and treatment. Then we present the potential impact of improving the sensitivity of cervical cancer screening strategies on life expectancy, cost, and cost-effectiveness.

Cancer Cases and Mortality Avoided

Table 39. Effect of Reducing Pap Smear False Negative Rate by 40 percent, 60 percent, and 90 percent at Various Screening Intervals on Lifetime Cancer Incidence and Mortality
StrategyCervical Cancer Cases/100,000Cancer Cases PreventedCervical Cancer Deaths/100,000Cancer Deaths Prevented
No Pap3014.61057.3
Pap every 3 years:
Base case5062509115.7941.6
FNR Reduced 0.432218468.347.4
FNR Reduced 0.62467649.818.5
FNR Reduced 0.91618629.919.9
Pap every 2 years:
Base case30565
FNR Reduced 0.418112435.929.1
FNR Reduced 0.61324925.210.7
FNR Reduced 0.979541411.2
Pap every 1 year:
Base case10920.9
FNR Reduced 0.455549.911
FNR Reduced 0.633225.84.1
FNR Reduced 0.99241.54.3

FNR = false negative rate

Table 40. Cumulative Number of Cancer Cases and Deaths by Ages 50, 65, and 85 by Reducing Pap Smear False Negative Rate 40 percent, 60 percent, and 90 percent at Various Screening Intervals
StrategyCumulative Cases/100,000Cumulative Deaths/100,000
15-5015-6515-8515-5015-6515-85
No Pap1438.22970.23014.6450.1786.01057.3
Pap every 3 years:
Base case321.9403.650674.293.5115.7
FNR Reduced 0.4209.7258.2322.245.255.668.3
FNR Reduced 0.6162.6198.4246.233.540.849.8
FNR Reduced 0.9108.8130.8160.620.724.829.9
Pap every 2 years:
Base case200.7249.9305.442.852.765
FNR Reduced 0.4121.7149.6181.324.229.435.9
FNR Reduced 0.690.2110.0132.417.220.725.2
FNR Reduced 0.955.166.478.99.811.614
Pap every 1 year:
Base case74.389.810914.317.220.9
FNR Reduced 0.438.145.654.96.98.29.9
FNR Reduced 0.623.027.332.84.14.85.8
FNR Reduced 0.96.47.59.01.11.31.5

FNR = false negative rate.

Table 39 shows the effect of reducing the Pap smear false negative rate by 40 percent, 60 percent, and 90 percent on lifetime cancer incidence and mortality. Table 40 shows the predicted cumulative number of cervical cancer cases and deaths per 100,000 between ages 15 and 50, 15 and 65, and 16 and 85 under each strategy.

Table 39 illustrates that, as expected, progressively better screening methods lead to fewer cervical cancer cases and deaths, as does progressively more frequent screening. In Table 40 we see that at every level of improved test performance, the majority of cervical cancer cases and deaths in screened women occur in those individuals less than 50 years old. The proportion of cases in younger women actually increases as sensitivity increases. For example, without screening, 47.7 percent of cases and 42.6 percent of deaths occur in younger women. With Pap screening every 2 years, the proportions are 65.7 percent of cases and 65.8 percent of deaths. For women screened yearly with a test 1.9 times more sensitive than conventional Pap (i.e., FNR reduced 0.9), the proportions are 71.1 percent of cases and 73.3 percent of deaths.

In infrequently screened populations, part of this phenomenon is a result of detection of early cancer prior to the development of symptoms. In frequently screened populations, this is more consistent with variation in tumor biology, where some tumors will be more rapidly growing. It is also consistent with length bias, where screening tests will be more likely to detect slower growing, less aggressive tumors. This prediction has several implications. First, improving the sensitivity of cervical cancer screening will reduce the overall numbers of cervical cancer cases and deaths, but it will not substantially change the proportion of cases and deaths that occur in younger women. Second, changes in the age distribution of cancer cases may be related to the success of screening programs as well as to changes in the epidemiology of HPV.

Major Interventions Avoided

Table 41. Expected Number of Lifetime Cervical Cancer Treatments by Reducing Pap False Negative Rate by 40 percent, 60 percent, and 90 percent at Various Screening Intervals
Cumulative Treatments/100,000, ages 15-85
StrategySimple Hysterec-tomyRadical Hysterec-tomyRadiation OnlyRadiation + SurgeryRadiation + Chemo-therapyChemo-therapy Only
No Pap211539120929641635
Pap every 3 years:
Base case5212717161372
FNR reduced 0.4369010141222
FNR reduced 0.62972713213<1
FNR reduced 0.9215141227<1
Pap every 2 years:
Base case34849639191
FNR reduced 0.4226651249<1
FNR reduced 0.6174234185<1
FNR reduced 0.9112618113<1
Pap every 1 year:
Base case1433301551
FNR reduced 0.47181382<1
FNR reduced 0.6511851<1
FNR reduced 0.91321<1<1

FNR = false negative rate

A successful screening strategy will shift the distribution of cases primarily to Stages I and II. The model predicts that very sensitive strategies will result in 100 percent Stage I tumors. Table 41 shows the expected number of lifetime cervical cancer treatments resulting from progressively better screening tests.

These results show that increasing the sensitivity of the screening strategy by increasing the frequency and/or decreasing the FNR of the screening test results in significant decreases in morbid treatments. The relative proportion of treatments for early stage disease -- simple and radical hysterectomy, and radiation only -- also increases with increased sensitivity, as the distribution of cases shifts to primarily Stage I and II. As with all other outcomes, the incremental improvement with each successive strategy is small.

The shifts in stage distribution resulting from improved screening sensitivity would clearly reduce the number of cervical cancer treatments, many of which have significant short- and long-term morbidity. Given the potential impact of these treatments on quality of life, it is possible that adjustment for the effects of morbidity associated with cervical cancer treatment would alter cost-effectiveness estimates based on life expectancy alone. Further research on measurement of quality of life in women with cervical cancer is clearly needed.

Cost-Effectiveness: Costs per Life-Year Saved

Table 42. Base Case: Costs per Life-year Saved, 60 Percent Reduction in False Negative Rate
StrategyAverage Cost 1Incremental Cost 1Incremental Life Expectancy (days) 1Incremental Cost/Life-year 1
No Pap$893
Pap every 3 years$1,108$21419.2$4079
Improved primary screening every 3 years$1,240$1322.2$22,010
Pap every 2 years$1,255$15-0.65Dominated
Improved rescreening every 3 years$1,276$210.46$16,396
Improved initial screening every 2 years$1,433$1580.98$58,731
Improved rescreening every 2 years$1,490$56-0.11Dominated
Pap every 1 year$1,702$2120.17$448,469
Improved initial screening every 1 year$2,000$2980.63$173,484
Improved rescreening every 1 year$2,128$129-0.05Dominated
1

Discounted at 3 percent annually

In Table 42, we present the model results for the base case cost-effectiveness of various combinations of screening strategy and frequency. These results are based on the assumption that the relative reduction in FNR for each step is identical for each technology. However, as illustrated by Table 38, we assumed that the FNR reduction of the rescreening step for the improved initial screening technology is identical to that of the initial screening step, which means that this technology will have an overall sensitivity that is slightly higher than that seen with conventional Pap testing and 100 percent rescreening. Furthermore, it is based on an estimated incremental cost for each improved test of $10. This cost estimate is within the range of estimated incremental costs for all three of the currently available technologies.

Table 43. Screening Strategies Ranked by Cost-Effectiveness
StrategyCost/Life-Year Saved
Conventional Pap every 3 years$4,079
Improved initial screening every 3 years$22,010
Improved initial screening every 2 years$89,993
Improved initial screening every 1 year$302,277
Because improved initial screening with 10 percent rescreening has a slightly higher sensitivity than improved rescreening only, it is both less expensive and more effective under the assumptions of identical incremental costs and relative specificity. In addition, the slightly higher specificity compared with improved rescreening also reduces relative costs. Eliminating strategies dominated by conventional or extended dominance, the ranking of screening strategies in terms of cost-effectiveness is shown in Table 43.

Table 44. Screening Strategies Ranked by Cost-effectiveness, for Increasing Reductions in FNR
StrategyCost/Life-year Saved
Reduction in FNR 0.4Reduction in FNR 0.6Reduction in FNR 0.9
Conventional Pap every 3 years$4,079$4,079$4,079
Improved initial screening every 3 years$26,678$22,010$19,480
Improved initial screening every 2 years$71,118$89,993$129,300
Improved initial screening every 1 year$234,911$302,277$439,605

FNR = false negative rate.

We can see from this table that in the base case, all strategies involving improved rescreening are dominated. These results were not affected by varying the relative improvement in sensitivity from 0.4 to 0.9; however, the cost per life-year saved varied substantially (Table 44).

We can explain these findings as follows: At screening frequencies of less than every 3 years, the majority of lesions detected by improving sensitivity are LSIL lesions which would be less likely to progress. Therefore, the extra costs of diagnosis and treatment of low-grade (LSIL) lesions outweigh the increase in life expectancy and decreases in costs related to invasive cervical cancer diagnosis and treatment, thus resulting in higher cost per life-year as sensitivity increases. With every 3-year screening, enough significant lesions are detected that increasing sensitivity (i.e., reducing FNR) improves life expectancy. Similar results were obtained using an incremental cost of $5 per slide; incremental cost-effectiveness ratios were less than $50,000 per life-year at screening intervals of 3 years and decreased as the incremental sensitivity increased, but cost-effectiveness ratios were consistently above $50,000 per life-year at screening intervals of 1 or 2 years and increased as sensitivity increased.

Improving the sensitivity of the initial screening step had consistently favorable cost-effectiveness ratios compared with improving the sensitivity of the rescreening step provided that (1) a certain proportion of smears read as normal with the improved initial technology were rescreened at the same improved sensitivity, (2) the incremental costs per slide of both screening technologies were identical, (3) the sensitivity of the initial screening technology was at least as high as that of the rescreening technology, and (4) the specificity of the two technologies relative to conventional Pap smears was identical. We next present the results of threshold analyses performed to identify thresholds where (1) a greater reduction in FNR, (2) higher specificity relative to improved initial screening, or (3) lower cost for a rescreening technology would favor it over an initial screening technology, followed by sensitivity analyses of model and test parameters on cost-effectiveness.

Threshold Analysis

Improved Reduction in FNR

Table 45. Parameter Inputs for Threshold Analysis of Improved Rescreening Technology Reduction In FNR
Parameter Inputs
TechnologyReduction in FNRSpecificityIncremental Cost
Conventional Pap0.510.97 --
Improved initial screening step0.6 X (0.49)1.0 X 0.97$10
Improved rescreening step0.85 X (0.49)1.0 X 0.97$10

FNR = false negative rate.

Table 46. Incremental Cost/Life-year Saved with Reduction in FNR of Rescreening Technology of 0.85, Compared with Initial Screening Technology Reduction in FNR of 0.6
StrategyCost/Life-year Saved
Conventional Pap every 3 years $4,079
Improved initial screening every 3 years $22,010
Improved rescreening every 3 years $45,375
Improved rescreening every 2 years$121,103
Improved initial screening every 1 year$403,973
Improved rescreening every 1 year$511,993

FNR = false negative rate.

If both types of technology add $10 to the cost of a Pap test, and the FNR of the initial screening technology is 0.6 times the FNR of conventional Pap smear, then the threshold for FNR reduction of the rescreening technology is 0.85 times the FNR of conventional Pap. Inputs are shown in Table 45. Results are shown in Table 46 with dominated strategies eliminated.

Varying the incremental cost of the technologies from $5 to $15 did not significantly change the threshold value. With incremental costs at $10 for both technologies, and a relative improvement of 1.4 for the initial screening technology, a relative reduction in FNR of 0.5 for the rescreening technology resulted in cost-effectiveness ratios of less than $50,000 per life-year at 3-year intervals. At a reduction in FNR of 0.9 for the initial screening technology, a relative reduction of more than 0.99 for the rescreening technology would be needed.

Decreased Specificity of Initial Screening Step

Table 47. Parameter Inputs for Threshold Analysis of Relative Specificity
Parameter Inputs
TechnologyReduction in FNRSpecificityIncremental Cost
Conventional Pap0.510.97 --
Improved initial screening step0.6 X (0.49)0.96 X 0.97$10
Improved rescreening step0.6 X (0.49)1.0 X 0.97$10

FNR = false negative rate.

Table 48. Incremental Cost per Life-year Saved with Relative Specificity of Initial Screening Step of 0.96 Compared With Conventional Pap or Rescreening Technology
StrategyCost/Life-year Saved
Conventional Pap every 3 years$4,079
Conventional Pap every 2 years$34,940
Improved rescreening every 3 years$16,396
Improved initial screening every 3 years$99,280
Improved rescreening every 2 years$87,754
Improved initial screening every 2years$243,017
Conventional Pap every 1 year$863,580
Improved rescreening every 1 year$270,452
Improved initial screening every 1 year$1,367,852
Changing the assumption of equal specificity of the two types of technology relative to conventional Pap substantially changed the rankings. Inputs are shown in Table 47. Results are shown in Table 48 with dominated strategies eliminated.

Because the overall sensitivity of the improved initial screening step is slightly higher than improved rescreening at every screening level, the overall increase in life expectancy will be slightly higher. However, with only a 4 percent reduction in relative specificity, the excess diagnostic costs make improved rescreening at 3-year intervals the only strategy with a cost-effectiveness ratio of less than $50,000 per life-year. Given that the overwhelming majority of screened patients will not have LSIL, HSIL, or cancer, a small decrease in relative specificity can greatly increase the overall costs of screening of the cohort.

Incremental Costs

Table 49. Parameter Inputs for Threshold Analysis on Incremental Cost per Slide of Rescreening Technology
Parameter Inputs
TechnologyReduction in FNRSpecificityIncremental Cost
Conventional Pap0.510.97 --
Improved initial screening step0.6 X (0.49)1.0 X 0.97$10
Improved rescreening step0.6 X (0.49)1.0 X 0.97$3

FNR = false negative rate.

Table 50. Incremental Cost per Life-year Saved of Screening Strategies Using Incremental Cost of Rescreening of $3 per Slide Compared with $10 per Slide for Initial Screening Technology
StrategyCost/Life-year Saved
Conventional Pap every 3 years$4,079
Conventional Pap every 2 years$34,940
Improved rescreening every 3 years$16,396
Improved initial screening every 3 years$99,280
Improved rescreening every 2 years$87,754
Improved initial screening every 2 years$243,017
Conventional Pap every 1 year$863,580
Improved rescreening every 1 year$270,452
Improved initial screening every 1 year$1,367,852
We next identified a cost threshold where, holding sensitivity and specificity equal, improved rescreening would be favored. Inputs are shown in Table 49. Results are shown in Table 50 with dominated strategies eliminated.

Summary of Threshold Analysis

Table 51. Threshold Value Where Improved Rescreening Every 3 years Favored Over Improved Initial Screening at Cost per Life-year of $50,000 or Less
Model ParameterThreshold Value where Improved Rescreening every 3 years Favored at Cost/Life-year of $50,000 or less
Reduction in FNR of Initial Screening Step 0.4Reduction in FNR of Initial Screening Step 0.6Reduction in FNR of Initial Screening Step 0.9
Reduction in FNR of rescreening technology 0.480.85>0.99
Relative specificity of primary screening technologyConventional Pap every 2 years favored over both technologies below 0.950.960.98
Incremental cost of rescreening technology$2$3$4.50

FNR = false negative rate.

Given equivalent improvements in sensitivity, equivalent incremental costs, and equivalent relative specificities, a technology that improves the sensitivity of the primary screening step will always be more effective if a certain proportion of slides initially read as "normal" are rescreened. In this scenario, improved primary screening every 3 years is the only strategy with an incremental cost per life year of $50,000 or less. We were able to identify thresholds of improved sensitivity (reduced FNR), improved relative specificity, or reduced cost where improved rescreening every 3 years was the preferred strategy, at a cost per life-year of $50,000 or less. We could not identify any strategy where screening more than every 2 years, using any technology, including conventional Pap smear screening, had a cost-effectiveness ratio of less than $50,000. Table 51 summarizes our threshold values for an incremental cost of $10 for the initial rescreening technology.

There are several patterns that emerge from these results. First, as the reduction in FNR of the initial screening technology increases, the additional relative reduction in FNR required for a rescreening technology to be favored also increases. Proportionately greater improvements in detection are needed as the initial screening technology detection rate increases. Second, the relative decrease in specificity required from the initial screening technology in order to favor the rescreening technology decreases as sensitivity increases -- as the overall diagnostic costs increase, relatively small increases in false positive rates have a greater impact on costs. Finally, if test characteristics are equivalent, the incremental cost of the rescreening technology must be substantially lower in order for rescreening to be favored. The threshold differences decrease as sensitivity increases -- again, since most of the lesions detected by increased sensitivity do not contribute substantially to mortality risk, relatively small reductions in overall costs can cause large changes in the cost-effectiveness ratios.

These results have several important implications. First, small decreases in specificity can significantly affect the cost-effectiveness estimates of any technology that improves sensitivity. This makes sense intuitively: Since a large majority of the population, even a "high-risk" population, will be "normal" at any given point in time, a small increase in the false positive rate can lead to substantial increases in cost. Because the incremental gain in life expectancy is small (because of the relative rarity of the disease and the high survival rate for early invasive disease), the cost-effectiveness ratio increases rapidly as specificity decreases. Second, there are likely to be significant interactions between sensitivity, specificity, and cost-effectiveness thresholds. We did not perform formal two-way analyses, but clearly there will be combinations of sensitivity and specificity on either side of the threshold that result in one strategy being favored over the other. Third, the range of relative sensitivity, specificity, and incremental cost for the initial and rescreening technologies that we evaluated is well within the range reported in the literature or claimed by the manufacturers of ThinPrep®, AutoPap®, and Papnet®. Given the uncertainty surrounding these estimates, it is possible that all three technologies fall within accepted ranges of cost-effectiveness at 3-year screening intervals. No strategy or technology used for screening more often than every 3 years results in estimates of less than $50,000 per life-year. Under some scenarios, decreasing the screening interval to every 3 years and using an improved technology was more effective, at an acceptable cost, than using conventional Pap screening every 2 years.

Sensitivity Analyses

To see the relative effects of varying other model parameters on cost-effectiveness estimates, we elected to use sensitivity and specificity estimates for initial and rescreening technologies that resulted in incremental cost-effectiveness ratios of $50,000 per life-year or less for both technologies. Based on the threshold analysis above, these estimates were a reduction in FNR of 0.6 for initial screening and 0.85 for rescreening. Incremental costs for both were set at $10 and relative specificities were set at 1.0. Thus, the sensitivity analyses on model parameters were performed using parameters for the two types of technologies that resulted in cost-effectiveness ratios of $50,000 per life-year or less for both types in the base case model.

Diagnostic Strategies

Our threshold analyses suggested that a large portion of the excess costs associated with reduction of FNR is a result of the cost of diagnosis and treatment of low-grade lesions. Our base case assumption was that smears originally read as ASCUS would be repeated in 6 months, an assumption that should reduce overall diagnostic costs, but that might miss high-grade histologic lesions with cytologic diagnoses of ASCUS.

Table 52. Comparison of Cost per Life-year Gained for Different Strategies for Managing ASCUS Smears: Repeat Pap Smear versus Immediate Colposcopy
Cost/Life-year Gained
StrategyRepeat Pap in 6 MonthsImmediate Colposcopy
Conventional Pap every 3 years$4,079$5,254
Improved initial screening every 3 years$22,010$22,333
Conventional Pap every 2 yearsDominatedDominated
Improved rescreening every 3 years$11,035$12,707
Improved initial screening every 2 years$129,912$163,788
Improved rescreening every 2 years$105,540$150,517
Conventional Pap every 1 yearDominatedDominated
Improved initial screening every 1 year$173,484$178,367
Improved rescreening every 1 year$511,993$702,748

ASCUS = atypical squamous cells of uncertain significance.

We compared an aggressive diagnostic strategy of colposcopy for all Pap smears with ASCUS or higher to one where initial ASCUS results were repeated after 6 months. However, given the results of the threshold analysis above, it is important to bear in mind that, if the reduction in FNR of the rescreening step is less than 0.85 or the relative specificity of the initial screening step is less than 0.97, then only the alternate strategy would have a cost per life-year saved of less than $50,000 at 3-year intervals. The results are seen in Table 52.

Changing the strategy does not change the rankings. Again, because the bulk of the lesions detected through improved sensitivity will be low-grade lesions, an aggressive diagnostic workup will increase costs with minimal gain in life expectancy.

Cost of Diagnosis and Treatment of Low-Grade Lesions

Table 53. Effect of Reducing the Cost of Diagnosis and Treatment of LSIL on Incremental Cost-Effectiveness
StrategyCost/Life-year Gained
Base Case: Cost of Managing LSIL $1,728Cost of Managing LSIL $675
Conventional Pap every 3 years$4,079$1810
Improved initial screening every 3 years$22,010$15,525
Conventional Pap every 2 yearsDominatedDominated
Improved rescreening every 3 years$11,035$4,119
Improved initial screening every 2 years$129,912$116,211
Improved rescreening every 2 years$105,540$100,162
Conventional Pap every 1 yearDominatedDominated
Improved initial screening every 1 year$173,484$159,828
Improved rescreening every 1 year$511,993$486,790

LSIL = low-grade squamous intraepithelial lesion.

We next varied the cost of diagnosis and treatment of low-grade lesions from the base case value of $1,728, the mean value of the MEDSTAT data, to $675, the 25th percentile. The results are summarized in Table 53.

Decreasing the cost of managing LSIL significantly reduces the overall cost per life-year gained, especially at lower screening frequencies, but does not alter the ranking or put any other strategies below the $50,000 per life-year strategy.

Incidence of Cervical Cancer

Table 54. Effect on Cost per Life-year Gained of Increasing Incidence of HPV Compared with Increasing HSIL Progression Rate as Mechanisms for Increasing Cervical Cancer Incidence
StategyCost/Life-year Gained
Base Case: Peak Incidence 81/100,000Peak Incidence 106/100,000; HPV incidence increased 1.5XPeak Incidence 106/100,000; HSIL progression rate increased
Conventional Pap every 3 years$4,079$2,123$1,042
Improved initial screening every 3 years$22,010$7,801(compared with Pap every 2 years)$13,377
Conventional Pap every 2 yearsDominated$22,931 (compared with conventional Pap every 3 years)Dominated
Improved rescreening every 3 years$11,035$30,328$6,179
Improved initial screening every 2 years$129,912$89,520$85,166
Improved rescreening every 2 years$105,540$6,991$70,229
Conventional Pap every 1 yearDominatedDominatedDominated
Improved initial screening every 1 year$173,484$135,216$116,780
Improved rescreening every 1 year$511,993$341,279$344,290

HPV = human papillamavirus; HSIL = high-grade squamous intraepithelial lesion.

We varied the incidence of cervical cancer in two ways: by increasing the age-specific incidence rate of HPV infection 1.5 times and by decreasing the progression rate from HSIL to Stage I cancer from 40 percent in 12 years to 40 percent in 8 years. The results are seen in Table 54.

When the increase in peak incidence is due to an increase in HPV incidence, improved initial screening every 3 years is more expensive but more effective than conventional Pap smears every 2 years. When the increase in peak incidence (and lifetime risk) is due to a more rapid progression of HSIL lesions to cancer, improved initial screening every 3 years is less expensive and more effective than conventional Pap every 2 years. These results illustrate several points. First, although the cost-effectiveness ratios decrease with increasing cancer incidence, very sensitive tests at frequent intervals are still well above the $50,000 threshold. Second, lifetime risk of cancer alone is not the only important component in determining cost-effectiveness. Clearly, the natural history of premalignant changes plays an important role in determining the degree of benefit obtained from a screening program. Third, changing different parameters within the model can result in similar patterns of cancer incidence. This may well explain some of the difference in results between models. Additional data on the natural history of HPV infection and premalignant and malignant changes, as well as further modeling efforts, are needed.

Baseline Sensitivity and Specificity of Conventional Pap Smears

Table 55. Effect on Cost per Life-year Gained of Changing Sensitivity and Specificity of Conventional Pap Smear Screening
StrategyCost/Life-year Gained
Base Case: Sensitivity 51%, Specificity 97%Alternative: Sensitivity 78.8%, Specificity 88.8%
Conventional Pap every 3 years$4,079$8,272
Improved initial screening every 3 years$22,010$73,300
Conventional Pap every 2 yearsDominated$219,637
Improved rescreening every 3 years$11,035Dominated
Improved initial screening every 2 years$129,912$117,206
Improved rescreening every 2 years$105,540$975,591
Conventional Pap every 1 yearDominated$515,216
Improved initial screening every 1 year$173,484$711,828
Improved rescreening every 1 year$511,993$4,482,673
We tested the impact of changing the sensitivity and specificity of conventional Pap to 78.8 percent sensitivity and 88.8 percent specificity, the mean results for each test characteristic in our meta-analysis (Table 55). These estimates are also close to those of Brown and Garber (1998).

Technologies that reduce the FNR of the conventional Pap smear when the FNR is 21.2 percent (sensitivity = 0.788) result in increased cost-effectiveness ratios, and no strategy falls below the $50,000 per life-year threshold. If relative specificity were also lowered, ratios would be even higher. These higher base-case estimates also explain some of the differences between our findings and those of other authors.

Another important point is that increasing the sensitivity and decreasing the specificity, even without incremental costs associated with a new technology, significantly increases the cost-effectiveness ratio.

Effect of Hysterectomy Incidence

Table 56. Effect of Adjustment for Hysterectomy Rate on Cost per Life-year Gained
StrategyCost/Life-year Gained
Base Case: Adjusted for HysterectomyNo Adjustment for Hysterectomy
Conventional Pap every 3 years$4,079$3,579
Improved initial screening every 3 years$22,010$21,746
Conventional Pap every 2 yearsDominatedDominated
Improved rescreening every 3 years$11,035$8,585
Improved initial screening every 2 years$129,912 $139,536
Improved rescreening every 2 years$105,540$111,657
Conventional Pap every 1 yearDominatedDominated
Improved initial screening every 1 year$173,484$180,965
Improved rescreening every 1 year$511,993$549,649
We also examined the effect of our inclusion of hysterectomy risk on cost-effectiveness. The results are seen in Table 56.

At higher screening frequencies, the cost-effectiveness ratios increase somewhat when hysterectomy rates are not included. This is because including hysterectomy reduces the number of tests during the lifetime of the cohort, an effect that is most obvious at higher screening frequencies, and also because the actual risk of cervical cancer in women without hysterectomy is higher than the population-based risk that does not correct for hysterectomy, since the denominator in the incidence fraction is smaller. However, the inclusion of hysterectomy risk does not change the rankings of strategies.

Discount Rate

Table 57. Effect of Varying Discount Rates on Cost-Effectiveness
StrategyCost per Life-year Gained
Discount Rate 0%Discount Rate 3%Discount Rate 5%
Conventional Pap every 3 years$545$4,079$10,157
Improved initial screening every 3 years$9,90322,010$51,756
Conventional Pap every 2 yearsDominated (by improved rescreening every 3 years)Dominated$6,537
Improved rescreening every 3 years$26,984 (compared with improved initial screening every 3 years)$11,035$67,787
Improved initial screening every 2 years$23,512$129,912$171,985
Improved rescreening every 2 years$67,274$105,540$148,306
Conventional Pap every 1 yearDominatedDominatedDominated
Improved initial screening every 1 year104,905$173,484$250,175
Improved rescreening every 1 year355,979$511,993$669,308
We examined the effect of varying the discount rate from 0 to 5 percent (Table 57). As expected, the discount rate does affect the size of the cost-effectiveness estimate. For improved rescreening every 3 years, or for improved initial screening every 2 years, the discount rate does affect whether the strategy meets the $50,000 per life-year threshold.

Age at Beginning of Screening

Table 58. Effect of the Age at Which Cervical Cytological Screening Begins on Incremental Cost-Effectiveness
Age at Beginning Screening
15 (base case)1820355065
Conventional Pap every 3 years$545$366,$275$246$3,681$35,951
Improved initial screening every 3 years$9,903$9,450$8,897$6,074$11,022$33,855
Improved rescreening every 3 years$26,984 (compared with improved initial screening every 3 years)$25,254$23,800$20,302$23,348Dominated (by Conventional Pap every 2 years)
Conventional Pap every 2 yearsDominatedDominatedDominatedDominatedDominated$34,766 (compared with improved initial screening every 3 years)
Improved initial screening every 2 years$23,512$21,959$21,014$16,042$23,567$48,462
Improved rescreening every 2 years$67,274$61,792$59,227$52,293$53,809$89,147
Conventional Pap every 1 yearDominatedDominatedDominatedDominated$127,659Dominated
Improved initial screening every 1 year104,905$99,011$93,726$71,088$81,633$151,864
Improved rescreening every 1 year355,979$334,433$317,291$259,147$224,587$349,767
We compared the cost per life-year saved (at a 0% discount rate) at various starting ages for screening: 15 (the base case), 18 (ACOG recommendations), 20 (the starting point for other models), 35, 50, and 65 (the starting point for the model of Fahs, Mandelblatt, Schechter et al. (1992). The results are shown in Table 58. Dominated strategies are not excluded; with exclusion of strategies that are dominated by conventional or extended dominance, no strategy at a frequency of more than every 3 years costs less than $50,000 per life-year.

Results were similar at discount rates of 3 percent and 5 percent: cost-effectiveness ratios decrease until age 35 and then increase with increasing age. These findings are consistent with those of Gustaffson and Adami (1992). Using age-specific cancer incidence curves similar to ours, their model predicted that focusing screening efforts in the mid-30 age range was the most efficient strategy. Given the natural history of the disease, this is not a surprising finding. HPV infection and LSIL peak in the early 1920s, whereas HSIL is a disease primarily of the 1930s, and cancer a disease of the late 1940s and 1950s. A large proportion of LSIL will spontaneously regress, the majority of the HSIL lesions that progress do so slowly, and cancer incidence peaks in the late 1940s. Beginning screening at the time when those lesions destined to regress have done so, and before the cancer incidence begins to reach its peak, is the most efficient strategy for increasing life expectancy. This is especially true given the prediction of our model that those cancers that do occur in younger women are less likely to be detected by any regular screening program (see Table 40).

Fixed Screening Frequencies

Table 59. Cost per Life-year Gained of an Improved Initial Screening Technology and an Improved Rescreening Technology Compared with Conventional Pap Smear at Fixed Screening Intervals
StrategyCostIncremental CostDays of Life GainedCost/Life-year
Pap every 3 years:
Conventional Pap$1,107.74$214.40 (compared with no Pap)19.2 (compared with no Pap)$4,079
Initial screening technology$1,239.90$132.162.2$22,010
Rescreening technology$1,285.94$46.040.4$45,375
Pap every 2 years:
Conventional Pap$1,254.99$361.65 (compared with no Pap)20.7 (compared with no Pap)$6,370
Initial screening technology$1,433.29$178.301.4$45,265
Rescreening technology$1,500.59$67.310.2$105,450
Pap every 1 year:
Conventional Pap$1,701.76$808.4222.2$13,281
Initial screening technology$1,999.64$297.890.6$173,484
Rescreening technology$2,137.96$138.320.1$511,993
As demonstrated above, there is an interaction between screening frequency and test characteristics in determining cost-effectiveness. There may be some settings where reducing screening frequency is not a practical option. We therefore present estimates for the cost-effectiveness of an improved initial screening technology and an improved rescreening technology at fixed screening intervals. Conventional Pap smear screening is compared with no screening, not to conventional Pap smear screening at more or less frequent intervals. Costs and days of life are discounted at 3 percent, the incremental cost for each technology is $10, the relative specificity of each is 1.0, the rescreening technology has a reduction in FNR of 0.85, and the initial screening technology has a reduction in FNR of 0.6. Results are summarized in Table 59.

The addition of either technology to annual screening does not meet the $50,000 per life-year threshold. At every 3 years, both technologies do meet that threshold under these assumptions but, as illustrated above, reasonable changes in costs or test characteristics can change both the actual cost estimate and the favored strategy. At every 2 years, it is possible that, given these assumptions and no comparison to less frequent screening at higher sensitivity, one (or both) technologies might meet the $50,000 threshold. However, in several scenarios above, less frequent testing with a more sensitive test was both less expensive and more effective than alternatives with higher screening frequencies. Policymakers considering this information should look at all reasonable alternatives for improving the cost-effectiveness of cervical cancer prevention programs.

Summary Tables

Table 60. Average Costs, Average Life Expectancy, Number of Cervical Cancer Cases Expected/100,000, and Number of Cancer Deaths/100,000, at Various Levels of Test Sensitivity and Specificity (Incremental cost to increase test sensitivity: $10. Costs and life expectancy discounted 3 percent)
StrategyAverage Cost ($)Average Life ExpectancyCancer Cases/100,000Cancer Deaths/100,000
No Pap893.3427.70381933014.61057.3
Every 3 years:
Conventional Pap1,107.7427.7563797506115.7
FNR reduction 0.41,258.3727.758156832268.3
FNR reduction 0.61,262.4927.759226124649.8
FNR reduction 0.91,268.7627.760604616129.9
Every 2 years:
Conventional Pap1,254.9927.760594030565
FNR reduction 0.41,468.8427.761817618135.9
FNR reduction 0.61,474.2827.762531813225.2
FNR reduction 0.91,481.9727.76342957914
Every 1 year:
Conventional Pap1,701.7627.764689510920.9
FNR reduction 0.42,107.3127.7651945559.9
FNR reduction 0.62,113.2527.7655211335.8
FNR reduction 0.92,121.1027.765922591.5
Table 61. Average Costs, Average Life Expectancy, Number of Cervical Cancer Cases Expected/100,000, and Number of Cancer Deaths/100,000, at Various Levels of Test Sensitivity and Specificity (Incremental cost to increase test sensitivity: $5. Disount rate for costs and life expectancy 3 percent)
StrategyAverage Cost ($)Average Life ExpectancyCancer Cases/ 100,000Cancer Deaths/100,000
No Pap893.3427.70381933014.61057.3
Every 3 years:
Conventional Pap1,107.7427.7563797506115.7
FNR reduction 0.41,210.1827.758156832268.3
FNR reduction 0.61,214.2927.759226124649.8
FNR reduction 0.91,220.5627.760604616129.9
Every 2 years:
Conventional Pap1,254.9927.760594030565
FNR reduction 0.41,405.9827.761817618135.9
FNR reduction 0.61,468.8427.762531813225.2
FNR reduction 0.91,481.9727.76342957914
Every 1 year:
Conventional Pap1,701.7627.764689510920.9
FNR reduction 0.41,978.0927.7651945559.9
FNR reduction 0.61,984.0127.7655211335.8
FNR reduction 0.91,991.8227.765922591.5
Table 62. Average Costs, Average Life Expectancy, Number of Cervical Cancer Cases Expected/100,000, and Number of Cancer Deaths/100,000, at Various Levels of Test Sensitivity and Specificity (Incremental cost to increase test sensitivity: $15. Discount rate for costs and life expectancy 3 percent)
StrategyAverage Cost ($)Average Life ExpectancyCancer Cases/100,000Cancer Deaths/100,000
No Pap893.3427.70381933014.61057.3
Every 3 years:
Conventional Pap1,107.7427.7563797506115.7
FNR reduction 0.41,306.5727.758156832268.3
FNR reduction 0.61,310.6927.759226124649.8
FNR reduction 0.91,216.9527.760604616129.9
Every 2 years:
Conventional Pap1,254.9927.760594030565
FNR reduction 0.41,468.8427.761817618135.9
FNR reduction 0.61,481.9727.762531813225.2
FNR reduction 0.91,542.5827.76342957914
Every 1 year:
Conventional Pap1,701.7627.764689510920.9
FNR reduction 0.42,236.5327.7651945559.9
FNR reduction 0.62,242.4927.7655211335.8
FNR reduction 0.92,250.3727.765922591.5
Given the degree of uncertainty surrounding estimates of both the costs and effectiveness of the new technologies, we present tables that allow calculation of predicted cost-effectiveness ratios under varying assumptions (Tables 60-62). Each table presents the average costs and life expectancy (discounted at 3 percent), number of cervical cancer cases, and number of cervical cancer deaths, for a cohort of women followed from ages 15 to 85. These outcomes are tabulated for no screening; conventional Pap screening every 1, 2, and 3 years; and for reductions in conventional Pap false negative rates of 0.4, 0.6, and 0.9. The three tables present the costs associated with incremental costs for these reductions in FNR of $10 (our base case), $5, and $15.

In using these tables to estimate cost-effectiveness of a technology that improves the overall sensitivity of conventional Pap smears (i.e., reduces the FNR), the reader can choose an estimate for incremental cost and reduction in FNR and then use the table to calculate incremental costs per life-year gained, cancer case prevented, or cancer death prevented.

For example, if the estimated incremental cost of a new technology is $10 per slide, and the estimated reduction in FNR rate is 0.6, then the incremental cost per life-year saved for that technology every 3 years would be:
(Average cost for every 3-year screening at FNR reduction of 0.6-Average cost for every 3-year conventional Pap)/(Life expectancy for every 3-year screening at FNR reduction of 0.6-Life expectancy for every 3-year conventional Pap).

Validation of the Model

The model was validated against two types of external data: epidemiological data and previously published models of cervical cytological screening.

Validation Against Epidemiological Data

The model predictions of age-specific prevalence of HPV infection, LSIL, and HSIL were compared with external epidemiological data from the literature and expected consequences of changing various parameters.

Age-specific prevalence of HPV infection in women with normal cytology

An external file that holds a picture, illustration, etc., usually as some form of binary object. The name of referred object is f9_F22.jpg.

   Figure 22. Age-specific prevalence of HPV

The age-specific prevalence of HPV infection in women with normal cytology predicted by the model using base case estimates is shown in Figure 22. These prevalence figures are consistent with cross-sectional data in low-risk populations (Bauer, Hildesheim, Schiffman et al., 1993). The estimate for HPV prevalence in women over 50 years of age is at the high end of the reported range, but high estimates would result in a higher incidence of cancer at later ages, thus increasing the overall lifetime risk of cancer and favoring more sensitive screening strategies.

Age-specific prevalence of LSIL and HSIL

An external file that holds a picture, illustration, etc., usually as some form of binary object. The name of referred object is f9_F23.jpg.

   Figure 23. Predicted prevalence of LSIL and HSI

An external file that holds a picture, illustration, etc., usually as some form of binary object. The name of referred object is f9_F24.jpg.

   Figure 24. Predicted prevalence of HPV, LSIL, and HSIL by age

Figure 23 shows the age-specific prevalence of LSIL and HSIL predicted by the model. Figure 24 shows the relationship between HPV, LSIL, and HSIL prevalence predicted by the model. Distributions are similar to those reported in other studies and are consistent with estimates for HPV incidence and for progression and regression rates for HPV and SIL. In particular, the shape of the LSIL curve is similar to that reported by Carson and DeMay (1993).

Sensitivity analyses

An external file that holds a picture, illustration, etc., usually as some form of binary object. The name of referred object is f9_F25.jpg.

   Figure 25. Effect of HPV incidence on cervical cancer incidence

We tested the impact of varying the age-specific incidence of HPV from one-half to twice the base case estimates. As shown in Figure 25, the overall shape of the cancer incidence curve does not change, although the peak incidence and overall risk of cervical cancer varies with HPV incidence. This graph suggests the potential impact of primary prevention on cervical cancer incidence by such measures as increased use of barrier methods of contraception or, potentially, vaccination against HPV.

An external file that holds a picture, illustration, etc., usually as some form of binary object. The name of referred object is f9_F26.jpg.

   Figure 26. Effect of baseline HPV/LSIL prevalence on cancer incidence

We also tested the impact of varying the prevalence of HPV and LSIL at age 15 on subsequent cervical cancer incidence (Figure 26). Increasing prevalence at younger ages without any changes in other parameters increases the overall incidence and decreases the youngest ages at which cancer appears. Delaying onset of sexual activity, especially with multiple partners, would also have a significant impact on cervical cancer risk.

Table 63. Parameters, Input Range for Senstivity Analysis, and Range of Lifetime Cervical Cancer Risk
ParameterBase Case RangeLifetime Risk of Cervical Cancer (Base Case 3.67%)
Relative risk of HPV infection (age-specific)1.00.5-2.02.15-6.00%
Proportion of HPV progressing directly to HSIL0.10-0.32.84-5.35%
Proportion of LSIL regressing to Well instead of HPV0.050-13.61-4.27%
Proportion of HSIL regressing to Well instead of LSIL0.010.13.47-3.67%
HPV prevalence at age 150.10-0.153.57-3.72%
HPV progression rate0.2/36 months0.15-0.32.5-4.7%
LSIL progression rate0.1-0.3/72 months (age dependent)0.1-0.52.42-5.88%
LSIL regression rate0.4-0.65/120 months (age dependent)0.3-0.82.92-4.83%
HSIL progression rate0.35/72 months0.3-0.52.91-4.1%
HSIL regression rate0.4/120 months0.3-0.52.98-3.8%
Finally, we tested the impact of our natural history estimates on the lifetime risk of cervical cancer in the absence of screening. Table 63 presents the parameters, the input range for sensitivity analysis, and the resulting range of lifetime cervical cancer risk. Cervical cancer risk is most sensitive to HPV incidence, to the proportion of HPV infections progressing directly to HSIL, and to LSIL progression rates. Changes in these parameters result in twofold to threefold differences in cervical cancer risk. Changes in LSIL regression rates, and HSIL progression and regression rates, resulted in 50-75 percent differences in cancer risk. The proportion of LSIL regressing directly to the "Well" state instead of to the "Unknown HPV" state, and the proportion of HSIL regressing to "Well" instead of to LSIL had minimal impact on cervical cancer risk.

Additional epidemiological data on the natural history parameters that affect cervical cancer risk in the absence of screening are needed. Different combinations of parameter estimates can result in similar predictions of cancer risk. Further refinement of this model may help to determine which parameters are most important in determining cervical cancer incidence.

Validation Against Other Models

Direct comparison with other models is difficult because of differences in terminology, assumptions, parameter estimates, and modeling techniques. However, we adjusted our model estimates to approximate previously published models as both a means of comparison and a validation technique.

Table 64. Differences Between Current Model and Eddy (1990) and Fahs, Muller, Mandelblatt et al. (1992)
Model CharacteristicEddy (1990)Fahs et al. (1992)Current Model
Model typeMarkov cohortMarkov cohortMarkov cohort
Population20-7565-10415-85
Natural History Parameters:
HPV effectNoNoYes
Incidence in unscreened population3 X age-specific incidence in US, 1988Based on incidence, regression, progression rates; prevalent cases at start of cohortCalibrated to shape of incidence curve in unscreened populations (Gustafsson, Ponten, Bergstrom et al., 1997; Gustafsson, Ponten, Zack et al., 1997)
Proportion of rapidly progressive lesions0.05Not given0.1 (Proportion of HPV progressing directly to HSIL)
Average duration of preclinical lesionsLong interval: 8 years, range 0-16 Rapidly progressive:1 year, 0-2Based on regression and progression ratesBased on regression and progression rates
Preinvasive regression ratesNot givenCIN: 3.81%/year CIS: 0/yearLSIL: 0.4-0.65/10 years (age dependent) HSIL: 0.4/10 years
Preinvasive progression ratesNot givenWell to CIN: 0.09/yearCIN to CIS: 17.8%/year CIS to Stage I: 26.1%/yearLSIL: 0.1-0.3/6 years (age dependent) HSIL: 0.35/6 years
Cancer progression ratesNot givenStage I-Stage II-IV:39%/yearStage I-Stage II: 90%/4 years Stage II to Stage III: 90%/3 years Stage III to Stage IV: 90%/2 years
Symptom recognition rateNot givenStage I: 12%/year Stage II-IV: 80%/yearStage I: 15%/year Stage II: 22.5%/year Stage III: 60%/year Stage IV: 90%/year
Proportion of cases in each stageStage I: 46% Stage II: 26% Stage III: 16% Stage IV: 12%Stage I: 50% Stage II-IV: 50%(Predicted): Stage I: 46.4% Stage II: 27.0% Stage III: 18.1% Stage IV: 8.5%
5-year survival by stageStage I: 86% Stage II:58% Stage III: 40% Stage IV: 18%Stage I: age-specific, varies from 67.7% to 52% Stage II-IV: age-specific, varies from 44.4%-15%Stage I: 83.9% Stage II: 65.7% Stage III: 37.9% Stage IV: 11.3%
Lifetime risk, cervical cancer2.5% (3 X SEER)Not given3.67%
Model CharacteristicEddy (1990)Fahs et al. (1992)Current Model
Lifetime risk, cervical cancer death1.18%Not given1.26%
Discount Rate5%5%3%, 5%
Screening Parameters:
Pap sensitivity0.80.80.535
Pap specificity0.9950.950.967
Sensitivity of combination of Pap and pelvic exam for Stages II-IV0.80.81.0
Atypia/ASCUS considered?NoNoYes
Rescreening of smears initially read as normal?NoNoYes
Costs:
Treatment of preinvasive lesionsNo difference between CIN/CIS, based on inpatient costsCIN, CIS differentiated, outpatient and inpatient costs includedLSIL, HSIL differentiated, outpatient and inpatient costs included
We attempted to systematically identify differences in predicted natural history between our model and the two most widely-used models: Eddy (1990) and Fahs, Mandelblatt, Schechter et al. (1992). Table 64 summarizes these differences.

Eddy (1990) Model

An external file that holds a picture, illustration, etc., usually as some form of binary object. The name of referred object is f9_F27.jpg.

   Figure 27. Age-specific incidence of cervical cancer by screening frequency

This widely-cited model is the basis for other recently published cost-effectiveness analyses (Brown and Garber, 1998; Radensky and Mango, 1998). We attempted to recreate Eddy's results using our model. Although Eddy's model parameters were adjusted to fit International Agency for Research on Cancer (IARC) data (IARC Working Group on Evaluation of Cervical Cancer Screening Programmes, 1986), the incidence of invasive cervical cancer in an unscreened U.S. population was estimated by assuming that it would be three times higher than that observed in a partially screened population. However, this assumption does not account for the fact that 30-50 percent of cancer cases in the United States are from an unscreened population. Figure 27 shows the effects of different screening frequencies on the age-specific incidence of cervical cancer using a cohort simulation. Increasing screening frequency will lead to earlier detection of earlier stage disease at younger ages than would be expected in an unscreened population. Because the incidence of cervical cancer in the United States reflects both screened and unscreened populations, as well as the effect of different cohorts with varying exposure to HPV, simply increasing the age-specific incidence by three-fold will overestimate the expected incidence in unscreened patients at younger ages. If the distribution of stages is not changed, then the ratio of incidence to mortality will be overestimated, since the ratio of early-stage cases in the SEER registries is much higher in younger women than in older women (66 percent localized in women under 50 compared with 37 percent in women over 50) (Ries, Kosary, Hankey et al., 1998).

Table 65. Recreating SEER Lifetime (ages 15-85) Risks of Cervical Cancer Diagnosis and Mortality Using Current Model Parameters
Screening FrequencyProportion of CohortLifetime Risk of CancerLifetime Risk of Cancer Mortality
None12.5%3.57%1.22%
Every 5 years7.5%0.98%0.25%
Every 3 years20%0.58%0.13%
Every 2 years40%0.35%0.07%
Every 1 year20%0.12%0.02%
Total for cohort0.7980.23%
Calculated from SEER data0.79%0.26%
To determine the effect of heterogeneous screening frequencies within a population on the relationship between cervical cancer incidence and mortality, we ran simulations varying the proportion of the cohort who received no screening and Pap smear screening every 5, 3, 2, and 1 years, at a Pap sensitivity of 51 percent and specificity of 97 percent, and we also attempted to match the lifetime risk of cervical cancer diagnosis (0.79 percent) and mortality (0.29 percent) predicted by SEER data (Ries, Kosary, Hankey et al., 1998). The results are summarized in Table 65.

We were able to approximate lifetime risks for cervical cancer diagnosis and mortality based on current SEER data by simulating a cohort with a reasonable distribution of screening frequencies. The proportion of women not having had a Pap smear in the preceding 3 years is estimated at 5-10 percent (Martin, Calle, Wingo et al., 1996). Because our model assumes perfect patient and provider compliance with appropriate treatment, a higher proportion of unscreened and underscreened women are needed to approximate observed incidence and death rates. This provides further evidence for the overall validity of our model as well as our base case estimate of Pap sensitivity and specificity.

An external file that holds a picture, illustration, etc., usually as some form of binary object. The name of referred object is f9_F28.jpg.

   Figure 28. Incidence of cervical cancer predicted by current model, threefold increase in 1988 SEER data, and current model adjusted to fit Eddy estimation

Table 66. Comparison of Current Model Adjusted to Fit Eddy Estimates with Eddy (1990) Model
No PapPap Every 4 YearsPap Every 3 YearsPap Every 2 YearsPap Every 1 Year
Cases cancer/10,000:
Eddy25043403428
Current model, Eddy estimates2522017104.2
Cancer deaths/10,000:
Eddy11814131110
Current model, Eddy estimates1146420.8
Increase in life expectancy in days: 1
Eddy9.549.729.8810.07
Current model, Eddy estimates8.599.289.569.77
Average cost: 1
Eddy$264$355$439$1,093
Current model, Eddy estimates$913$1,1301,355$1,924
Incremental cost/life-year: 1
Eddy$10,000$184,528$262,800>$1,000,000
Current model, Eddy estimates$25,726$114,485 $296,915$971,377
1

Discounted at 5 percent

We were able to approximate the lifetime cervical cancer risk of Eddy (2.5 percent) by altering our HPV incidence. Figure 28 shows the age-specific incidence predicted by our model, the age-specific incidence predicted by increasing 1988 SEER incidence rates threefold, and the adjustment in our model resulting in a 2.5 percent lifetime risk. However, the lifetime mortality risk predicted by this model is 0.88 percent, substantially lower than Eddy's estimate of 1.18 percent. We then adjusted rates for progression between cancer stages and symptoms to obtain similar incidence and mortality risks. By changing the progression rates to 90 percent in 2.5 years for Stage I to Stage II, 75 percent in 1 year from Stage II to Stage III, and 100 percent in 1 year for Stage III to Stage IV, and changing the probability of symptoms for Stage III to 35 percent, we obtained a lifetime risk of 2.52 percent and a mortality risk of 1.14 percent. In addition, since Eddy's model does not account for nonspecific cytological diagnoses such as ASCUS, we combined the diagnosis of ASCUS and LSIL in our conditional probabilities. Although these progression rates are substantially higher than those used in other models, they do replicate Eddy's results. We then tried this model, along with Eddy's cost and test characteristics estimates, to compare results (Table 66).

By attempting to replicate Eddy's results, we were able to identify several features that may affect conclusions about the cost-effectiveness of cervical cancer screening strategies when Eddy's model is used. First, the age-specific incidence in younger women may be overestimated, since the majority of cases in younger women in a partially screened population (the basis for Eddy's estimates) will be early-stage disease detected through screening. This in turn will contribute to an overestimation of the case-fatality rate if the distribution of stages is not adjusted. Second, because cytological atypia reported as ASCUS is not considered, the overall screening costs will be underestimated.

Fahs, Mandelblatt, Schechter et al. (1992) Model

We adjusted our model to include only two stages of cancer, "early" and "late," and used the parameters cited in the article, including the age-specific survival for early and late cervical cancer and began the cohort simulation at age 65. Using these parameters, we found that Pap smear screening every 5 or every 3 years results in cost-savings compared with no screening. Despite differences in total cost and effectiveness, our adjustments resulted in incremental costs for more frequent screening, which were of the same order of magnitude as those of Fahs, Mandelblatt, Schechter et al. (1992). For screening every 3 years versus every 5 years, our model predicted an incremental cost per life-year of $5,581 compared with the published value of $5,956. For screening every year versus every 3 years, our model predicted an incremental cost per life-year of $74,615 compared with the published value of $39,693.

Brown and Garber (1998) Model

Table 67. Comparison of the Current Model, Adjusted to Eddy's Parameters, and Brown and Garber
Screening StrategyIncremental Cost/Life-year Saved
Brown and GarberCurrent Model Adjusted to Eddy Parameters
Screening every 3 years:
Conventional Pap (compared with no screening)$8,996$8,393
ThinPrep®$37,074 (dominated by AutoPap®)$67,124 (dominated by AutoPap®)
AutoPap®$16,259 (compared with conventional Pap)$48,455 (compared with conventional Pap)
Papnet®$146,783$230,902
Screening every 2 years:
Conventional Pap (compared with no screening)$13,334$13,712
ThinPrep®$94,258 (dominated by AutoPap®)$181,705 (dominated by AutoPap®)
AutoPap®$42,666 (compared with conventional Pap)$132,705 (compared with conventional Pap)
Papnet®$343,444$624,870
Screening every 1 year:
Conventional Pap (compared with no screening)$26,8881$30,655
ThinPrep®$369,893 (dominated by AutoPap®)$979,183 (dominated by AutoPap®)
AutoPap®$166,474 (compared with conventional Pap)$726,610 (compared with conventional Pap)
Papnet®$1,069,660$3,314,970
We used our model, adjusted to approximate Eddy's results, in an attempt to replicate the results of Brown and Garber (1998). Our results, using their cost and probability estimates, are found in Table 67.

Discussion

By making adjustments to certain natural history parameters in our basic model, and by using the cost and sensitivity parameters of Eddy (1990) and Brown and Garber (1998), we were able to reasonably approximate the cost-effectiveness estimates for varying frequencies of conventional Pap smears of these models. We were also able to come relatively close to the incremental cost-effectiveness values of different screening frequencies in the model of Fahs, Mandelblatt, Schechter et al. (1992), although we found cost-savings for every 3- and 5-year Pap smear screening compared with no screening using their cost and probability estimates. We were able to replicate the relative ranking for ThinPrep®, AutoPap®, and Papnet® from Brown and Garber (1998); however, our adjusted model resulted in significantly higher estimates of incremental cost-effectiveness for these technologies at each level of screening frequency.

Although we have been unable to identify specific characteristics of our model to explain these differences, there are several potential factors that may explain some of these observed differences.

Our model consistently produced lower gains in life expectancy than other models for a given increase in screening sensitivity. This may be because of differences in the source of other-cause mortality estimates, different distribution of cases within stages between different models, or the inclusion of other Markov states such as those indicating HPV infection. Given these lower gains in life expectancy, it is not surprising that the cost-effectiveness estimates are somewhat higher.

We used specific 1-, 2-, 3-, 4-, and 5-year survival rates rather than 5-year survival rates averaged over 5 years. Because mortality for Stages II, III, and IV is higher in the first 2 years after diagnosis than in the next 3 years, discounting may affect the marginal gains in life expectancy. At the very low gains that all of the models found, this may be sufficient to change cost-effectiveness ratios appreciably.

None of the other models considered the effect of ASCUS diagnoses. We attempted to correct for this by changing the relative probability of an LSIL cytological diagnosis within each histological state. However, this correction might result in higher diagnostic and treatment costs (our average costs were substantially higher than those seen in all the models, although our incremental costs were similar) and, subsequently, higher cost-effectiveness ratios.

Despite these differences, we were able to replicate the ranking of preferred screening strategies and reasonably approximate the incremental cost per life-year saved of other previously published models for conventional Pap smears. Further studies with the current model should be able to further elucidate the reasons for the differences between our model and those previously published.

Although there are numerous differences between our model and other previously published models, there are striking similarities. All of the models show little marginal gains in life expectancy at screening intervals of more than 3 years. All of the models show only modest improvements in life expectancy from increasing the sensitivity of conventional Pap smears and, for those that evaluate new technologies, cost-effectiveness ratios for technologies that improve sensitivity (or reduce false negative rates) when screening is performed at frequent intervals.

Summary of Results of Cost-Effectiveness Analysis

We found that the incremental cost per life-year gained increased dramatically as screening intervals increased, as have prior cost-effectiveness analyses of Pap smear screening. We also found that the specificity of the test has profound implications for cost-effectiveness: Because the majority of women screened will be normal, the number of additional diagnostic tests generated by a small decrease in specificity will be quite large and will in fact exceed the number of potentially significant lesions detected by increases in sensitivity in many populations.

We were able to identify thresholds of cost, reduction in false negative rate, and relative specificity compared with conventional Paps where both technologies to improve initial screening and to improve rescreening meet conventional cost-effectiveness thresholds at every 3-year screening intervals. However, under most scenarios, each technology resulted in cost-effectiveness estimates of more than $50,000 per life-year when used every 1 or 2 years. Further studies are needed in order to provide more precise estimates of the costs and effectiveness of specific technologies in order to make more reliable estimates of cost-effectiveness.

Conclusions

Accuracy of Cervical Cytological Screening Tests

Recent studies provide important new evidence with which to estimate the accuracy of cervical cytological screening. Despite the demonstrated capability of such screening to reduce cervical cancer mortality, conventional Pap smear screening has a poorer capability of discriminating between diseased and nondiseased patients than has generally been believed to be true. Pap smear screening is more accurate when a higher cytological threshold (HSIL) is used to detect a high-grade lesion. Lower test thresholds, or use of the Pap smear to detect low-grade dysplasia, result in poorer discrimination.

The accuracy of Pap smear screening is strongly dependent on study characteristics. In the studies reviewed for this report, prevalence of disease was the strongest and most consistent factor associated with between-study variation in effectiveness scores. Higher disease prevalence was associated with higher estimates of sensitivity and lower estimates of specificity (with a greater effect on specificity). These findings are consistent with prevalence as a surrogate for "workup" bias and perhaps also reflect an imperfect reference standard that is more specific than sensitive.

The quality of the studies reviewed in this report varied widely; however, quality scores did not explain a statistically significant amount of the between-study variation in effectiveness scores, sensitivity, or specificity. It is possible that the separate components of the quality score affected the test effectiveness scores in different directions, leading to an overall lack of significance. However, when controlling for prevalence of disease, because of small numbers and collinearity, we were unable to separately assess the effects of verification of test-negative subjects and type of reference standard on between-study variation in sensitivity and specificity.

Few studies of initial screening were unaffected by workup bias, but the few that were provided estimates of the specificity of Pap smear screening of 97 to 100 percent and sensitivity of 29 to 56 percent, indicating sensitivity estimates much lower than those generally believed to be true.

Most of the studies reviewed were conducted on women referred for evaluation of a cytological abnormality. The cervical cytological test repeated at the time of colposcopy was often the cytological result reported, and that result was then compared with colposcopy or histology results. The fact that cytology-"negative" women in many of these studies had often had a recent positive Pap smear suggested that they may have been systematically different from women who had received a negative Pap test on initial screening.

Based on studies using a cytological or histological reference standard, ThinPrep® demonstrated improved sensitivity compared with conventional Pap smears. The only study allowing estimation of the relative FPR suggested that ThinPrep® had slightly lower specificity than did conventional smear; however, this difference was not statistically significant.

Both computer rescreening technologies (AutoPap®, Papnet®) have been shown to be capable of improving the diagnostic yield of false negatives, although the magnitude of the false negative reduction under different circumstances is difficult to estimate precisely. Similarly, estimates of the effect of rescreening on specificity are unreliable.

The evidence regarding the accuracy of the newer technologies on cervical cytological screening is insufficient for several reasons. First, little evidence is available with which to assess the effects of thin-layer cytology or computer rescreening on specificity. Second, most estimates of effect on sensitivity (or reducing the false negative rate) are based on a surrogate reference standard: cytology. Standards for cytological reference standards put forth by the FDA and the Intersociety Working Group for Cytology Technologies improve the validity of the cytological reference standard by requiring consensus among an independent panel of cytology professionals; however, this standard is often applied only to discrepant cases, and histological verification of high-grade lesions is also often lacking. Both of these deficiencies lead to overestimation of diagnostic performance.

Costs of Cervical Cytological Screening

Our analysis provides cost estimates in 1997 dollars for procedures associated with screening, diagnosis, and treatment of cervical cancer for women ages 20-64 years and those 65 and older. Previous cost-effectiveness analyses of cervical cancer diagnosis and treatment in the United States have relied on cost estimates derived from 1988 data (U.S. Congress Office of Technology Assessment, 1990). The present analysis adds substantially to the existing cost literature on cervical cancer because it is the first to estimate costs associated with younger (20-64 years) and older women (65 years and older) separately.

The cost of Pap smear screening was somewhat higher in older women than in younger women, chiefly because physician and total time spent in obtaining Pap smears during office visits was greater for older women. For costs associated with cervical cancer treatment, estimates were substantially lower for older women because the cost estimates for the under-65 age group were derived from primary analysis of claims data, whereas cost estimates for the 65-and-older age group were obtained from Medicare payments. Although the same time period was used to enhance comparability between the costs provided for the two groups, the use of different data sources likely affected the estimates derived. The claims data analyses reflect more comprehensive costs of providing the services than do Medicare payments, since costs for related services were not always added to the cost of the primary service.

Cost estimates for diagnosis and treatment of cervical dysplasia or cancer calculated from episodes of care are substantially higher than are estimates based on average procedure-specific costs because of both the provision of related services and the effect of complicated cases with unusually high costs. Estimates based on procedure-related costs alone will underestimate the true direct medical costs.

Cost-Effectiveness of New Technologies

Literature Review

Published models examining the cost and effectiveness of Pap smear screening have consistently found Pap screening to have a significant impact on the incidence and mortality of cervical cancer and to have an acceptable range of cost-effectiveness ratios when compared with no screening. Existing models were all consistent in this finding despite different modeling techniques, parameters, and assumptions.

Estimates of Pap test accuracy used in these models generally overestimated Pap test performance, as determined by recent unbiased studies and previous meta-analyses. Pap test performance was not found to be a key determinant of cost-effectiveness in studies that included sensitivity analyses of Pap test accuracy; however, our best estimates of Pap test performance fall outside the range used in sensitivity analyses in some models.

The studies suggest that screening programs become less cost-effective when the age of onset for Pap screening is lowered (e.g., to 17 years), when applied to low-risk subpopulations (e.g., previously screened pregnant women), or when the screening interval is more frequent (e.g., annual). Screening programs were found to be more cost-effective when applied to higher risk subpopulations (e.g., previously unscreened women, older women) or at less frequent screening intervals (e.g., every 3-5 years).

The several high-quality existing models of effectiveness and costs associated with Pap testing for cervical cancer screening provided a valuable resource for this project. These models provided a conceptual framework for the assumptions, modeling techniques, and literature evidence. The number of similar models and their degree of detail were sufficient to support a significant effort to describe and critique them. These models indicated that high-quality cost-effectiveness analysis is possible, and they not only provided resources to support the creation of such a model, but also indicated areas for improvement (e.g., better cost parameters).

Cost-Effectiveness Model

We found that under favorable assumptions, the use of technologies that improve initial screening sensitivity or rescreening sensitivity can have acceptable cost-effectiveness compared with conventional Pap smear screening at a frequency of every 3 years. However, the cost-effectiveness of these new technologies (and of conventional Pap smear screening) is directly related to the frequency of screening, with longer intervals resulting in lower cost-effectiveness estimates. Our findings were relatively insensitive to assumptions about cervical cancer incidence, the cost of technologies, diagnostic strategies for abnormal screening results, age at onset of screening, or most other variables tested. However, there is substantial uncertainty about the estimates of sensitivity and specificity of the new technologies compared with each other and with conventional Pap testing. It is clear from our sensitivity analysis that both sensitivity and specificity are important in determining cost-effectiveness. Although it is clear that both types of technology provide an improvement in effectiveness at higher cost, the imprecision in estimates of their effectiveness makes drawing conclusions about the relative cost-effectiveness of thin-layer cytology and computerized rescreening technologies problematic.

Although a societal perspective would be preferable, the inherent difficulties in estimating nondirect costs such as lost productivity led us to choose the simpler health care perspective. Invasive cervical cancer affects mostly women in midlife and older, whereas the much more common HPV infection and SIL (most of which do not progress to cervical cancer) are primarily seen in younger women. It therefore seems likely that the nonhealth care costs associated with screening and treatment of SIL may be greater than those associated with the diagnosis and treatment of invasive cervical cancer. If this is true, then the cost-effectiveness of screening, and of new technologies to improve screening performance, will be more favorable from a health care system perspective than from a societal perspective, especially if new technologies increase sensitivity at the expense of specificity.

Natural History of Cervical Cancer

Our model predicts an age-specific incidence of cervical cancer similar to that reported in unscreened populations. Our peak incidence, 81/100,000 at age 50, is higher and comes at an earlier age than that reported in an unscreened population in the United States in the 1930s (60/100,000 at age 60); is similar to that seen in India in the 1960s, Canada in the 1960s, or Singapore in the 1970s; and is somewhat lower than in Germany in the 1960s (peak 110/100,000) (Gustafsson, Ponten, Bergstrom et al., 1997). Our estimates of age-specific prevalence of HPV, LSIL, and HSIL are also within reported ranges. We are therefore confident that our model represents a reasonable approximation of the natural history of cervical cancer. Clearly, further refinements can be made, especially in the modeling of transitions between states. Further development of this model could be useful for analyzing the natural history of cervical cancer in greater detail, even at the molecular level. The model could also be used for future technology assessments of HPV testing or HPV vaccination programs.

Effects of Screening Programs on Outcomes

Our findings that the incremental gain in life expectancy decreases dramatically when the screening interval is lengthened are consistent with those of other previously published models. We found a similar pattern for cervical cancer deaths, cervical cancer cases, and morbidity associated with cervical cancer. Although further research on quality-of-life issues associated with cervical cancer screening and treatment is clearly needed, it seems unlikely that data on quality-of-life issues would significantly change our findings. Given the rarity of cervical cancer relative to HPV infection and SIL, the inconvenience, potential discomfort, and psychological distress (Paskett and Rimer, 1995) associated with screening and treatment of cancer precursors (many of which will never progress to become cancer) might well outweigh the negative impact of cancer itself on women's quality of life at the population level.

Our model also predicts that, even with the most sensitive screening program, there will continue to be cervical cancer cases and that the bulk of these cases will be in younger women. There are clearly differences in biological behavior of different tumors. The occurrence of tumors in women under 30 is itself evidence that there are some rapidly growing tumors. Since one would expect screening to preferentially detect slower growing tumors, the proportion of younger women among cervical cancer cases should increase as screening sensitivity increases.

Several authors have discussed the difficulties involved in modeling cervical cancer (Prorok, 1986; Wilson and Woodman, 1995). In addition to the uncertainty surrounding almost all of the parameter estimates, which are discussed in detail elsewhere, and given our assumptions about natural history, diagnosis, and treatment, there are limitations inherent in the underlying choice of a Markov model for our cost-effectiveness analysis.

First, we assumed that the probabilities of regression and progression of HPV and SIL, although varying with age, are constant within specified age ranges. Because the progression from HPV to SIL to invasive cancer appears to be dependent on a series of specific mutations in the HPV genome (Mitchell, Tortolero-Luna, Wright et al., 1996), it is likely that the true transition probabilities are not constant, but a function of the specific HPV viral type and host co-factors. A different model structure, using either stochastic modeling or Monte Carlo simulation, might result in a closer approximation of the true natural history. We chose a Markov cohort simulation because (1) it allowed faster computation of results, an advantage given the large number of variables, clinical strategies, and sensitivity analyses to be evaluated; (2) the interpretation of the model and its results may be more intuitive to readers with less experience in decision-analytic techniques; and (3) we lack data that would allow more precise estimation of the distribution of regression and progression probabilities.

Given the robustness of our findings to variation over a wide range of possible parameter values, it seems unlikely that a different model structure would have resulted in substantially different conclusions.

Second, like all Markov models, our model also assumes that transitions occur only at the beginning of each cycle. For life expectancy, we applied the half-cycle correction (Sonnenberg and Beck, 1993); however, this assumption may affect some of the results with certain diagnostic strategies. For example, if a Pap test with a cytological result of ASCUS or LSIL is followed by a repeat Pap test in 6 months rather than by immediate colposcopy, the underlying histological state may progress, regress, or persist during that time period, whereas our model assumes that, in the absence of treatment, a given state persists for at least 1 year. However, because regression or persistence is more likely than progression for LSIL or HSIL within a given year, this assumption results in an overestimation of the likelihood of finding SIL on the subsequent Pap. This, in turn, is a bias in favor of screening and in favor of any strategy that improves the sensitivity of screening.

Our validation technique does not account for the fact that age-specific incidence data collected at any one point in time may be dependent on the distribution in the population of different cohorts, each with its own age-specific incidence (Gustafsson, Ponten, Bergstrom et al., 1997). Our choice of an earlier onset of sexual activity and HPV infection is consistent with current demographic trends, and, again, is biased in favor of screening.

Clinical Implications

Many issues have been raised to justify the use of improved technologies for cervical cytological screening. This report does not address them all. Attention has been focused on quality control in cytopathology laboratories in an attempt to reduce the problem of false negative Pap smear tests; in fact, this has been one of the primary motivations for the development of computerized rescreening technologies. Variability in sensitivity and specificity between laboratories could be expected to have a profound effect on the cost-effectiveness, yet our model assumed uniform performance among the simulated cohort. In theory, the 10 percent manual rescreening mandated by CLIA could be a useful tool to assure uniform quality among cytopathology laboratories; however, it is an ineffective strategy alone for solving the problem of false negative Pap smear screening tests. Computerized rescreening technologies offer, in addition to improvements in performance, the advantage of greater uniformity across laboratories.

The shortage of cytotechnologists and the difficulties in implementing screening recommendations in medically underserved areas are additional factors that may provide incentives for using computerized rescreening or initial screening technologies. These issues were not considered in this analysis because many of these factors are difficult to quantify and model.

Another set of issues not included in the model relate to compliance with Pap smear screening recommendations. We assumed uniform compliance by patients and providers; however, increased costs of screening from implementation of these new technologies could result in worse compliance with cervical cytological screening and, on a national level, result in no reductions in cervical cancer incidence.

Our model suggests that improving the sensitivity of screening, even at no additional cost, might actually increase the cost-effectiveness of a cervical cancer screening program. Most of the abnormalities found are low-grade, a finding that leads to costs for diagnosis and treatment, but yields little benefit in terms of reduced cervical cancer incidence or life-years saved. Along these lines, there may not be a price at which improved initial or rescreening technologies can be cost-effective according to years of life saved or cervical cancer incidence. Only if one places a high value on detecting and treating low-grade cytological abnormalities can improvements in Pap smear screening be justified. No analyses, including this one, have taken such factors as psychological distress, quality of life, or costs of litigation for failure to diagnose cervical lesions into account; it may be factors such as these that are needed to justify the implementation of these technologies that hover near the threshold of acceptable cost-effectiveness.

Summary

The currently available data on the accuracy of thin-layer cytology and computerized rescreening technologies used for cervical cytological screening fail to describe reliable estimates for test specificity. The increased sensitivity permitted by both types of technologies has implications for the cost-effectiveness of screening that, according to our model, would result in moderate improvements in life expectancy at much higher cost than conventional Pap screening alone. The level of precision in estimates of their accuracy precludes determining the relative cost-effectiveness of thin-layer cytology and computer rescreening. However, our assumptions about their diagnostic performance do allow us to use the model to provide estimates of the cost-effectiveness of these new technologies relative to conventional Pap screening or no screening. Especially when considered as part of a strategy with screening intervals of 3 years, the new technologies have incremental cost-effectiveness ratios that are within the range of accepted health care practices.

Future Research

Performance and Cost-Effectiveness of Screening Tests

Few studies of primary Pap screening were conducted in low-prevalence populations and unaffected by "workup" bias, but these few studies provided estimates of the specificity of Pap smear screening of 98 percent and sensitivity 51 percent. The sensitivity estimate is much lower than that generally believed to be true. Future decision models, cost-effectiveness studies, and health policy decisions should consider this lower estimate.

Thin-layer cytology technology (ThinPrep®), the computerized rescreening device (Papnet®), and the algorithmic classifier (AutoPap®), have received regulatory approval from the FDA based on their demonstration of improved sensitivity compared with conventional Pap smear techniques. However, the evidence currently available does not fully describe the impact of these technologies on the specificity of the screening process. It is possible that a new technology might simultaneously raise both sensitivity and specificity; however, this has not been conclusively demonstrated for the devices reviewed in this report.

Thin-layer cytology and computerized rescreening devices primarily aim to reduce different sources of error: sampling error and detection error, respectively. Combined use of thin-layer cytology and computerized rescreening has the potential to exceed the effectiveness of either technology alone. Although with current pricing, this strategy would not be economically feasible, further research could lead to further improvements in sensitivity.

Comparisons with cytological reference standards attest to the validity of the new technologies compared with optimal Pap screening. Comparison with a histological reference standard provides a more relevant outcome for clinical decisionmakers, since histological diagnosis forms the basis of most clinical management decisions. Further research is needed on validating negative cytological diagnoses made with the new technologies with colposcopy in both low-prevalence and high-prevalence populations. This could be accomplished by subjecting a random sample of cytology-negative women to colposcopy, which would permit statistical correction for "workup" bias and estimation of test specificity. Precise estimates of the relative performance of new versus conventional technologies require verification of all patients in whom at least one test is positive. Methods such as discrepant analysis, in which subjects with concurrent test results (negative or positive) are not verified, should be avoided in future studies.

Differences in the performance of Pap testing have been observed between different types of laboratories. Attempts at quality control aimed at assessing and reducing between-laboratory variation in the interpretation of cervical cytology have had mixed success. It has been suggested that the 10 percent manual rescreening mandated by federal law is costly and ineffective in improving sensitivity; however, it may have an important role to play in assessing and maintaining quality in cervical cytology interpretation. The computerized rescreening strategy reviewed in this report functions as a quality control mechanism and will likely reduce between-laboratory variation in interpretation. Further research in this area may be warranted.

Patient and Provider Screening Behaviors

Many of the existing models of cervical cancer screening have provided insights into the effect of various screening intervals and strategies on cancer incidence and mortality and on the cost-effectiveness of screening. Despite consistent demonstration of extremely high marginal cost-effectiveness ratios for annual screening compared with less frequent biennial or triennial screening, many agencies recommend annual screening. This recommendation may be based on other considerations, including improving patient and provider compliance, providing marginal economies of scale, or even attempting to improve compliance with other health screening practices, such as breast or colorectal cancer screening. However, it is possible that a recommendation for annual screening might actually prevent those women who are apprehensive about pelvic examinations or Pap smears from receiving any care. Although this topic was beyond the scope of our analysis, increasing the number of women who have regular cervical cytological screening would appear to have a greater impact on mortality than improving the sensitivity of the screening test. Further research is needed in ways to improve patient and provider compliance with performing screening at appropriate intervals and in assuring followup of abnormal results, especially when screening is performed at less frequent intervals.

Quality of Life

Deaths from cervical cancer have been substantially reduced since the introduction of the Pap test. By limiting one's assessment of the impact of new technologies for cervical cancer screening to reduction in cervical cancer mortality, one fails to consider a number of other health outcomes that may be quite important. Pap screening has resulted in three trends: an increased proportion of preinvasive lesions, an increased proportion of earlier stage invasive cancer and, as suggested by our model, an increasing proportion of cases among younger women. Although these trends improve the likelihood of curative treatment, they also lead to an increase in treatment and to a longer life living with the potential sequelae of treatment. The shift toward diagnosis of more premalignant lesions, and the inconvenience, potential discomfort, and psychological distress associated with screening and treatment of precursors (many of which will never progress to become cancer), might well outweigh the negative impact of cancer itself on quality of life at the population level.

Further research on quality of life issues associated with cervical cancer screening and treatment is needed to quantify screening behaviors, quality of life associated with various treatments for cervical cancer, and quality of life associated with low-grade lesions.

Role of HPV Testing

Our current understanding of the biology and natural history of cervical cancer places HPV as the main causative agent. HPV testing may have an important role in cervical cancer screening or in triage of women with abnormal cytology. Studies are currently under way to evaluate the utility of HPV testing in predicting progression of atypia and low-grade lesions. The results of these and future studies may substantially change the clinical practice of cervical cancer screening. Our model has been constructed to allow assessment of the potential role of HPV testing or vaccination on various strategies for cervical cancer prevention.

Proposed New Primary Screening Devices

A new computerized device not reviewed in this report, the AutoPap® Primary Screening System, has received FDA approval to be used for primary screening rather than rescreening of Pap smears. Because of lack of data, we did not specifically assess the cost-effectiveness of using this type of computerized primary screening. Our model suggests that improving the sensitivity of the primary screening step is more cost-effective at a given screening interval than improving rescreening sensitivity, so it is possible that this strategy might compare favorably with the strategies we evaluated. However, more reliable estimates of the incremental cost, sensitivity, and specificity of the AutoPap® Primary Screening System are needed before meaningful comparisons can be made. Results of cervical cytological screening with this new device should also be compared with colposcopy or a histology reference standard, with verification of a random sample of test negative women.

Acronyms and Abbreviations

ACHPR: Agency for Health Care Policy and Research

ACOG: American Collage of Obstetricians and Gynecologists

AGUS: atypical glandular cells of uncertain significance

AHA: American Hospital Association

AJCC: American Joint Committee on Cancer

ASCUS: atypical squamous cells of uncertain significance

CAP: College of American Pathologists

CAT: computerized axial tomography

CDC: Centers for Disease Control

CE: cost-effectiveness

CER: cost-effectiveness ratio

CI: confidence interval

CIN: cervical intraepithelial neoplasia

CINAHL: Cumulative Index to Nursing & Allied Health

CIS: carcinoma in situ

CLIA: Clinical Laboratory Improvement Amendments

COBRA: Consolidated Omnibus Budget Reconciliation Act

CPS: Consumer Price Survey

CPT: Current Procedures Terminology

DRG: diagnosis-related group

EPC: Evidence-based Practice Center

FDA: Food and Drug Administration

FIGO: Federation Internationale de Gynecologie et Obstetriques

FN: false negative

FNR: false negative rate

FPR: false positive rate

HCFA: Health Care Financing Administration

HPV: human papilloma virus

HSIL: high-grade squamous intraepithelial lesion

IARC: International Agency for Research on Cancer

ICD: International Classification of Diseases

INNA: interactive neural network-assisted

LEEP: loop electrosurgical excision procedure

LSIL: low-grade squamous intraepithelial lesion

Med.: median

MEI: Medicare Economic Index

MRI: magnetic resonance imaging

NAMCS: National Ambulatory Medical Care Survey

NSI: Neuromedical Systems Inc.

Obs.: observations

OTA: Office of Technology Assessment

Pap: Papanicolaou

PCE: patient care evaluation

Q1: quartile 1

Q3: quartile 3

RBRVS: resource-based relative value scale

ROC: receiver operating characteristic

SBLB: Satisfactory but limited by

SEER: Surveillance, Epidemiology, and End Results

SIL: squamous intraepithelial lesion

STD: sexually transmitted disease

TNM: Tumor, Nodes and Metastases

TPR: true positive rate

WHO: World Health Organization

Contributors

Duke Evidence-based Practice Center (EPC) Investigators

  • Douglas C. McCrory, MD, MHSc

  • Task Order Director and EPC Co-Director

  • Duke Center for Clinical Health Policy Research;

  • Department of Medicine, Division of General Internal Medicine;

  • Durham Veterans Affairs Medical Center

  • Lori Bastian, MD

  • Department of Medicine, Division of General Internal Medicine;

  • Durham Veterans Affairs Medical Center

  • Santanu Datta, MS, MBA

  • Duke Center for Clinical Health Policy Research

  • Vic Hasselblad, PhD

  • Department of Community and Family Medicine

  • Jason Hickey, MS

  • Duke University Medical School

  • David B. Matchar, MD

  • EPC Co-Director

  • Duke Center for Clinical Health Policy Research;

  • Department of Medicine, Division of General Internal Medicine;

  • Durham Veterans Affairs Medical Center

  • Evan Myers, MD, MPH

  • Department of Obstetrics & Gynecology

  • Kavita Nanda, MD

  • Durham Veterans Affairs Medical Center;

  • Department of Obstetrics & Gynecology

  • (currently at Family Health International, Durham, NC)

Duke EPC Support Staff

  • Paul Abrahamse, MS

  • Duke Center for Clinical Health Policy Research

  • (currently in Ann Arbor, MI)

  • Ruth Goslin, MAT

  • Duke Center for Clinical Health Policy Research

  • Rebecca N. Gray, D.Phil.

  • Duke Center for Clinical Health Policy Research

  • Jane T. Kolimaga, MA

  • Project Manager

  • Duke Center for Clinical Health Policy Research

Health Economics Research, Inc

  • Nancy McCall, PhD

  • Sujha Subramanian, PhD

  • (currently at Boston Scientific Corporation)

Advisory Panel of Technical Experts

  • Leslie Dodd, MD

  • Duke University Medical Center

  • Margaret Gradison, MD

  • Duke University Medical Center

  • Andrea W. McChesney, FNP

  • Durham Veterans Affairs Medical Center

  • Charles S. Morrison, PhD

  • Family Health International

  • Kenneth L. Noller, MD

  • University of Massachusetts

  • Joanne T. Piscitelli, MD

  • Duke University Medical Center

  • David N. Sundwall, MD

  • American Clinical Laboratory Association

  • Stanley Zinberg, MD, MS, FACOG

  • American College of Obstetricians and Gynecologists

Peer Reviewers

  • Adalsteinn D. Brown, AB

  • University of Western Ontario

  • Charles Dunton, MD

  • Thomas Jefferson University Hospital

  • Alex Ferenczy, MD

  • Jewish General Hospital, Quebec

  • Daron G. Ferris, MD

  • Medical College of Georgia

  • Bruce Flamm, MD

  • Kaiser Permanente, Riverside, CA

  • Amy Fremgen, PhD

  • American College of Surgeons

  • Alan M. Garber, MD, PhD

  • Stanford University

  • Les Irwig, MBBCh, PhD

  • University of Sydney

  • Paul Krieger, MD

  • Quest Diagnostics, Inc.

  • Nancy C. Lee, MD

  • Centers for Disease Control and Prevention

  • James Linder, MD

  • University of Nebraska; Cytyc Corporation

  • Jeanne Mandelblatt, MD

  • Georgetown University

  • Laurie J. Mango, MD

  • Neuromedical Systems, Inc.

  • C. Jay Marshall, MD

  • ARUP Laboratories

  • Charles S. Morrison, PhD

  • Family Health International

  • Kenneth L. Noller, MD

  • University of Massachusetts

  • Jeffrey F. Peipert, MD, MPH

  • Womens & Infants Hospital, Providence, RI

  • Carolyn D. Runowicz, MD

  • Albert Einstein College of Medicine

  • David N. Sundwall, MD

  • American Clinical Laboratory Association

  • Robin T. Vollmer, MD

  • Durham Veterans Affairs Medical Center

  • David C. Wilbur, MD

  • University of Rochester; NeoPath, Inc.

  • Stanley Zinberg, MD, MS, FACOG

  • American College of Obstetricians and Gynecologists

Agency for Health Care Policy and Research Project Officer

  • Jean Slutsky, PA, MSPH

Evidence Tables

[Quality Scoring Key]

1 Code appears when article fails to meet criterion.

For articles that model health outcomes:
AThere was a description or diagram of decision model.
BThe clinical strategies were described in detail.
CA comprehensive literature review was conducted.
DThe control group was described
ESources of data for probability estimates were identified.
FThe source of the utility scale or measurement method was given.
GClinically relevant outcomes of interest were described in detail.
HResults of a sensitivity analysis were presented.
For articles that model costs:
IThere was a description of the framework for the cost analysis.
JModeling assumptions were stated explicitly.
KThere was sufficient description of estimates of types of resources used, amount of resource use, and unit costs, and their sources.
LThere was a critique of data quality.
MThe cost data were current, i.e., 1988 or more recent.
NThere was a description of a method to adjust costs for inflation.
OThere was a statement of discount rates used (for multiperiod or longitudinal studies).
For articles that compare health outcomes with and without Pap smear screening, or with and without use of new Pap technology:
PStudy population was representative or clearly defined.
QScreened and unscreened populations were concurrent (e.g. before-after design).
RScreened and unscreened populations were randomly allocated.
SThe method of ascertainment of health outcomes was the same in the screened and unscreened groups.
TThe ascertainment of health outcomes was complete (loss to followup >5%).

References
American Academy of Pediatrics. Guidelines for health supervision II. Elk Grove Village, IL: The Academy; 1988.
American Cancer Society. Guidelines for the cancer-related checkup: An update. New York: The Society; 1993.
American College of Obstetricians and Gynecologists. ACOG Committee Opinion: Recommendations on frequency of Pap test screening. Washington, DC: American College of Obstetricians and Gynecologists; 1995. Publication No. 152.
American College of Physicians. Screening for cervical cancer. In: Eddy DM, editor. Common screening tests. Philadelphia: American College of Physicians; 1991, p. 413-4.
American Joint Committee on Cancer. Cervix uteri.In: AJCC cancer staging manual. Philadelphia: Lippincott-Raven Publishers, 5th edition; 1997. P. 189-94.
American Medical Association. AMA guidelines for adolescent preventive services (GAPS): Recommendations and rationale. Chicago: American Medical Association; 1994.
Anderson DJ, Flannelly GM, Kitchener HC, Fisher PM, Mann EM, Campbell MK, Templeton A. Mild and moderate dyskaryosis: Can women be selected for colposcopy on the basis of social criteria? BMJ. 1992; 305: 847. [PubMed] [Free Full Text in PMC icon.Free Full text in PMC]
Andrews S, Hernandez E, Miyazawa K. Paired Papanicolaou smears in the evaluation of atypical squamous cells. Obstet Gynecol. 1989; 73(5 Pt 1): 74750. [PubMed]
Ashfaq R, Liang Y, Saboorian MH. Evaluation of PAPNET system for rescreening of negative cervical smears. Diagn Cytopathol. 1995; 13: 316. [PubMed]
Ashfaq R, Solares B, Saboorian MH. Detection of endocervical component by PAPNET system on negative cervical smears. Diagn Cytopathol. 1996; 15: 1213. [PubMed]
Baldauf JJ, Dreyfus M, Ritter J, Philippe E. Colposcopy and directed biopsy reliability during pregnancy: A cohort study. Eur J Obstet Gynecol Reprod Biol. 1995; 62: 316. [PubMed]
Baldauf JJ, Dreyfus M, Lehmann M, Ritter J, Philippe E. Cervical cancer screening with cervicography and cytology. Eur J Obstet Gynecol Reprod Biol. 1995 Jan; 58: 339. [PubMed]
Barnes BA. Papanicolaou cervical smears for screening in asymptomatic women. Prim Care. 1981; 8: 13140. [PubMed]
Barnes BA, Barnes AB. Evaluation of surgical therapy by cost-benefit analysis. Surgery. 1977; 82: 2133. [PubMed]
Bauer HM, Hildesheim A, Schiffman MH, Glass AG, Rush BB, Scott DR, Cadell DM, Kurman RJ, Manos MM. Determinants of genital human papillomavirus infection in low-risk women in Portland, Oregon. Sex Transm Dis. 1993; 20: 2748. [PubMed]
Bearman DM, MacMillan JP, Creasman WT. Papanicolaou smear history of patients developing cervical cancer: An assessment of screening protocols. Obstet Gynecol. 1987; 69: 1515. [PubMed]
Beeby AR, Keating PJ, Wagstaff TI, Wilson EMJ, Sims PF, Manning PJ, Wadehra V. The quality and accuracy of cervical cytology: A study of five sampling devices in a colposcopy clinic. J Obstet Gynaecol. 1993; 13: 27681.
Bergstrom R, Adami HO, Gustafsson L, Ponten J, Sparen P. Detection of preinvasive cancer of the cervix and the subsequent reduction in invasive cancer. J Natl Cancer Inst. 1993; 85: 10507. [PubMed]
Bethwaite J, Rayner T, Bethwaite P. Economic aspects of screening for cervical cancer in New Zealand. N Z Med J. 1986; 99: 74751. [PubMed]
Bigrigg MA, Codling BW, Pearson P, Read MD, Swingler GR. Colposcopic diagnosis and treatment of cervical dysplasia at a single clinic visit. Experience of low-voltage diathermy loop in 1000 patients. Lancet. 1990; 336: 22931. [PubMed]
Bolick DR, Hellman DJ. Laboratory implementation and efficacy assessment of the ThinPrep cervical cancer screening system. Acta Cytol. 1998; 42: 20913. [PubMed]
Boon ME, de Graaff Guilloud JC. Cost effectiveness of population screening and rescreening for cervical cancer in the Netherlands. Acta Cytol. 1981; 25: 53942. [PubMed]
Boyko EJ. Meta-analysis of Pap test accuracy. Am J Epidemiol. 1996; 143: 4067. [PubMed]
Brown AD, Garber AM. The cost-effectiveness of three new technologies to enhance Pap testing. Chicago, IL: Blue Cross Blue Shield Association Technical Evaluation Center Assessment Program Special Assessment; 1998 April.
Bureau of Labor Statistics. CPI-U. Medicare Services Average Annual Index, Base Period, 1982-84.
Canadian Task Force on the Periodic Health Examination. Canadian guide to clinical preventive health care. Ottawa: Health Canada; 1994.
Cannistra SA, Niloff JM. Cancer of the uterine cervix. N Engl J Med. 1996; 334: 10308. [PubMed]
Carson HJ, DeMay RM. The mode ages of women with cervical dysplasia. Obstet Gynecol. 1993; 82: 4304. [PubMed]
Carter PM, Coburn TC, Luszczak M. Cost-effectiveness of cervical cytologic examination during pregnancy. J Am Board Fam Pract. 1993; 6: 53745. [PubMed]
Cecchini S, Bonardi R, Mazzotta A, Grazzini G, Iossa A, Ciatto S. Testing cervicography and cervicoscopy as screening tests for cervical cancer. Tumori. 1993; 79: 225. [PubMed]
Cecchini S, Bonardi R, Iossa A, Zappa M, Ciatto S. Colposcopy as a primary screening test for cervical cancer. Tumori. 1997; 83: 8103. [PubMed]
Centers for Disease Control and Prevention. Premarital sexual experience among adolescent women -- United States, 1970-1988. MMWR. 1991; 39: 92932. [PubMed]
Chesebro MJ, Everett WD. A cost-benefit analysis of colposcopy for cervical squamous intraepithelial lesions found on Papanicolaou smear. Arch Fam Med. 1996; 5: 57681. [PubMed]
Chock C, Irwig I, Berry G, Glasziou P. Comparing dichotomous screening tests when individuals negative on both tests are not verified. J Clin Epidemiol. 1997; 50: 12117. [PubMed]
Chomet J. Screening for cervical cancer: A new scope for general practitioners? Results of the first year of colposcopy in general practice. BMJ. 1987; 294: 13268. [PubMed] [Free Full Text in PMC icon.Free Full text in PMC]
Coibion M, Autier P, Vandam P, Delobelle A, Huet F, Hertens D, Vosse M, Andry M, De Sutter P, Heimann R, et al. Is there a role for cervicography in the detection of premalignant lesions of the cervix uteri? Br J Cancer. 1994; 70: 1258. [PubMed]
Coker AL, Jenkins GR, Busnardo MS, Chambers JC, Levine LZ, Pirisi L. Human papillomaviruses and cervical neoplasia in South Carolina. Cancer Epidemiol Biomarkers Prev. 1993; 2: 20712. [PubMed]
Colgan TJ, Patten SF Jr, Lee JS. A clinical trial of the AutoPap 300 QC system for quality control of cervicovaginal cytology in the clinical laboratory. Acta Cytol. 1995; 39: 11918. [PubMed]
Cox JT, Schiffman MH, Winzelberg AJ, Patterson JM. An evaluation of human papillomavirus testing as part of referral to colposcopy clinics. Obstet Gynecol. 1992; 80(3 Pt 1): 38995. [PubMed]
Cox JT, Lorincz AT, Schiffman MH, Sherman ME, Cullen A, Kurman RJ. Human papillomavirus testing by hybrid capture appears to be useful in triaging women with a cytologic diagnosis of atypical squamous cells of undetermined significance. Am J Obstet Gynecol. 1995; 172: 94654. [PubMed]
Davis JR, Hindman WM, Paplanus SH, Trego DC, Wiens JL, Suciu TN. Value of duplicate smears in cervical cytology. Acta Cytol. 1981; 25: 5338. [PubMed]
Davison JM, Marty JJ. Detecting premalignant cervical lesions. Contribution of screening colposcopy to cytology. J Reprod Med. 1994; 39: 38892. [PubMed]
Del Priore G, Maag T, Bhattacharya M, Garcia PM, Till M, Lurain JR. The value of cervical cytology in HIV-infected women. Gynecol Oncol. 1995; 56: 3958. [PubMed]
de Villiers EM, Wagner D, Schneider A, Wesch H, Munz F, Miklaw H, zur Hausen H. Human papillomavirus DNA in women without and with cytological abnormalities: Results of a 5-year follow-up study. Gynecol Oncol. 1992; 44: 339. [PubMed]
DiBonito L, Falconieri G, Tomasic G, Colautti I, Bonifacio D, Dudine S. Cervical cytopathology. An evaluation of its accuracy based on cytohistologic comparison. Cancer. 1993; 72: 30026. [PubMed]
Dickinson L. Evaluation of the effectiveness of cytologic screening for cervical cancer. 3. Cost-benefit analysis. Mayo Clin Proc. 1972; 47: 5505. [PubMed]
DiSaia P, Creasman W. Clinical gynecologic oncology. St. Louis: Mosby; 1997.
Drummond MF, Richardson WS, O'Brien BJ, Levine M, Heyland D. Users' guides to the medical literature. XIII. How to use an article on economic analysis of clinical practice. A. Are the results of the study valid? For the Evidence-Based Medicine Working Group. JAMA. 1997; 277(19): 15527. [PubMed]
Duggan MA, Brasher P. Paired comparison of manual and automated Pap test screening using the PAPNET system. Diagn Cytopathol. 1997; 17: 24854. [PubMed]
Eddy DM. Screening for cervical cancer. Ann Intern Med. 1990; 113: 21426. [PubMed]
Eisenberg JN. Clinical economics. A guide to the economic analysis of clinical practices. JAMA. 1989; 262: 287986. [PubMed]
Fahey MT, Irwig L, Macaskill P. Meta-analysis of Pap test accuracy. Am J Epidemiol. 1995; 141: 6809. [PubMed]
Fahim HI, Faris R, Arab SE, Sammour MB. An epidemiologic study of Papanicolaou smear data at Ain-Shams University hospitals. J Egypt Public Health Assoc. 1991; 66: 97111. [PubMed]
Fahs MC, Mandelblatt J, Schechter C, Muller C. Cost effectiveness of cervical cancer screening for the elderly. Ann Intern Med. 1992; 117: 5207. [PubMed]
Farnsworth A, Chambers FM, Goldschmidt CS. Evaluation of the PAPNET system in a general pathology service. Med J Aust. 1996; 165: 42931. [PubMed]
Feinstein AR. Clinical epidemiology: The architecture of clinical research. Philadelphia: Pennsylvania:W.B. Saunders Company; 1985. p. 434.
Ferenczy A, Robitaille J, Franco E, Arseneau J, Richart RM, Wright TC. Conventional cervical cytologic smears vs. ThinPrep smears. A paired comparison study on cervical cytology. Acta Cytol. 1996; 40: 113642. [PubMed]
Ferenczy A, Gelfand MM, Franco E, Mansour N. Human papillomavirus infection in postmenopausal women with and without hormone therapy. Obstet Gynecol. 1997; 90: 711. [PubMed]
Ferris DG, Wright TC, Litaker MS, Richart RM, Lorincz AT, Sun X, Borgatta L, Buck H, Kramer L, Rubin R. Triage of women with ASCUS and LSIL on Pap smear reports: Management by repeat Pap smear, HPV DNA testing, or colposcopy? J Fam Pract. 1998; 46: 12534. [PubMed]
Fetters MD, Fischer G, Reed BD. Effectiveness of vaginal Papanicolaou smear screening after total hysterectomy for benign disease. JAMA. 1996; 275: 9407. [PubMed]
Figueroa JP, Ward E, Luthi TE, Vermund SH, Brathwaite AR, Burk RD. Prevalence of human papillomavirus among STD clinic attenders in Jamaica: Association of younger age and increased sexual activity. Sex Transm Dis. 1995; 22: 1148. [PubMed]
Food and Drug Administration. Points to consider for cervical cytology devices, Version 7/25/94. Washington, DC: FDA Center for Devices and Radiological Health, Document No. 968; 1994.
Frisch LE, Milner FH, Ferris DG. Naked-eye inspection of the cervix after acetic acid application may improve the predictive value of negative cytologic screening. J Fam Pract. 1994; 39: 45760. [PubMed]
Fung Kee Fung M, Senterman M, Eid P, Faught W, Mikhael NZ, Wong PT. Comparison of Fourier-transform infrared spectroscopic screening of exfoliated cervical cells with standard Papanicolaou screening. Gynecol Oncol. 1997; 66: 105. [PubMed]
Garutti P, Immovilli S, Chiari U, Segala V, Mollica G. Cervical screening for detection of cervical HPV. Cervix Lower Female Genital Tract. 1994; 12: 314.
Germain M, Heaton R, Erickson D, Henry M, Nash J, O'Connor D. A comparison of the three most common Papanicolaou smear collection techniques. Obstet Gynecol. 1994; 84: 16873. [PubMed]
Giles JA, Hudson E, Crow J, Williams D, Walker P. Colposcopic assessment of the accuracy of cervical cytology screening. BMJ. 1988; 296: 1099102. [PubMed] [Free Full Text in PMC icon.Free Full text in PMC]
Giles JA, Deery A, Crow J, Walker P. The accuracy of repeat cytology in women with mildly dyskaryotic smears. Br J Obstet Gynaecol. 1989; 96: 106770. [PubMed]
Glenthoj A, Rank F, Peen U, Bostofte E. Diagnostic efficiency of brush cytology from the uterine cervix.In: Gierttler K, Fuchter G, Witte S, editors. New frontiers in cytology. Berlin: Springer; 1988. p. 421-4.
Gold MR, Siegel JE, Russell LB, Weinstein MC, editors. Cost-effectiveness in health and medicine.New York: Oxford University Press; 1996.
Gonzalez D, Hernandez E, Anderson L, Heller P, Atkinson BF. Clinical significance of a cervical cytologic diagnosis of atypical squamous cells of undetermined significance. Favoring a reactive process or low grade squamous intraepithelial lesion. J Reprod Med. 1996; 41: 71923. [PubMed]
Green M, editor. Bright futures: Guidelines for health supervision of infants, children and adolescents. Arlington, VA: National Center for Education in Maternal and Child Health; 1994.
Gundersen JH, Schauberger CW, Rowe NR. The Papanicolaou smear and the cervigram. A preliminary report. J Reprod Med. 1988; 33: 468.
Gustafsson L, Adami HO. Optimization of cervical cancer screening. Cancer Causes Control. 1992; 3: 12536. [PubMed]
Gustafsson L, Ponten J, Bergstrom R, Adami HO. International incidence rates of invasive cervical cancer before cytological screening. Int J Cancer. 1997; 71: 15965. [PubMed]
Gustafsson L, Ponten J, Zack M, Adami HO. International incidence rates of invasive cervical cancer after introduction of cytological screening. Cancer Causes Control. 1997; 8: 75563. [PubMed]
Haddad NG, Hussein I, Livingstone JR, Smart GE. Colposcopy in teenagers. BMJ. 1988; 297: .
Halford JA, Wright RG, Ditchmen EJ. Quality assurance in cervical cytology screening. Comparison of rapid rescreening and the PAPNET Testing System. Acta Cytol. 1997; 41: 7981. [PubMed]
Hasselblad V, Hedges L. Meta-analysis of diagnostic tests. Psychometric Bull. 1995; 11: 16778.
Hellberg D, Axelsson O, Gad A, Nilsson S. Conservative management of the abnormal smear during pregnancy. A long-term follow-up. Acta Obstet Gynecol Scand. 1987; 66(3): 1959. [PubMed]
Helmerhorst TJ, Dijkhuizen GH, Calame JJ, Kwikkel HJ, Stolk JG. The accuracy of colposcopically directed biopsy in diagnosis of CIN. Eur J Obstet Gynecol Reprod Biol. 1987; 24: 2219. [PubMed]
Herrero R. Epidemiology of cervical cancer. J Natl Cancer Inst Monogr. 1996; 21: 16. [PubMed]
Herrington CS, Evans MF, Hallam NF, Charnock FM, Gray W, McGee JD. Human Papillomavirus status in the prediction of high-grade cervical intraepithelial neoplasia in patients with persistent low-grade cervical cytological abnormalities. Br J Cancer. 1995; 71: 2069. [PubMed]
Hildesheim A, Gravitt P, Schiffman MH, Kurman RJ, Barnes W, Jones S, Tchabo JG, Brinton LA, Copeland C, Epp J, et al. Determinants of genital human papillomavirus infection in low-income women in Washington, DC. Sex Transm Dis. 1993; 20: 27985. [PubMed]
Hildesheim A, Schiffman MH, Gravitt PE, Glass AG, Greer CE, Zhang T, Scott DR, Rush BB, Lawler P, Sherman ME, et al. Persistence of type-specific human papillomavirus infection among cytologically normal women. J Infect Dis. 1994; 169: 23540. [PubMed]
Hirschowitz L, Raffle AE, Mackenzie EF, Hughes AO. Long term follow up of women with borderline cervical smear test results: Effects of age and viral infection on progression to high grade dyskaryosis. BMJ. 1992; 304: 120912. [PubMed] [Free Full Text in PMC icon.Free Full text in PMC]
Ho GYF, Bierman R, Beardsley L, Chang CJ, Burk RD. Natural history of cervicovaginal papillomavirus infection in young women. N Engl J Med. 1998; 338: 4238. [PubMed]
Hockstad RL. A comparison of simultaneous cervical cytology, HPV testing, and colposcopy. Fam Pract Res J. 1992; 12: 5360. [PubMed]
IARC Working Group on Evaluation of Cervical Cytology Cancer Screening Programmes. Screening for squamous cervical cancer: Duration of low risk after negative results of cervical cytology and its implication for screening policies. BMJ. 1986; 293: 65964. [PubMed] [Free Full Text in PMC icon.Free Full text in PMC]
Inhorn SL, Wilbur D, Zahniser D, Linder J. Validation of the ThinPrep Pap test for cervical cancer diagnosis. J. Lower Genital Tract Dis 1998; in press.
Intersociety Working Group for Cytology Technologies. Proposed guidelines for secondary screening (rescreening) instruments for gynecologic cytology. Acta Cytol. 1998; 42: 2736. [PubMed]
Janerich DT, Hadjimichael O, Schwartz PE, Lowell DM, Meigs JW, Merino MJ, Flannery JT, Polednak AP. The screening histories of women with invasive cervical cancer, Connecticut. Am J Public Health. 1995; 85: 7914. [PubMed] [Free Full Text in PMC icon.Free Full text in PMC]
Jenny J, Isenegger I, Boon ME, Husain OA. Consistency of a double PAPNET scan of cervical smears. Acta Cytol. 1997; 41: 827. [PubMed]
Johansen P, Arffmann E, Pallesen G. Evaluation of smears obtained by cervical scraping and an endocervical swab in the diagnosis of neoplastic disease of the uterine cervix. Acta Obstet Gynecol Scand. 1979; 58: 26570. [PubMed]
Johnson N, Sutton J, Thornton JG, Lilford RJ, Johnson VA, Peel KR. Decision analysis for best management of mildly dyskaryotic smear. Lancet. 1993; 342(8863): 916. [PubMed]
Jones BA, Novis DA. Cervical biopsy-cytology correlation: A College of American Pathologists Q-Probes study of 22,439 correlations in 348 laboratories. Arch Pathol Lab Med. 1996; 120: 52331. [PubMed]
Jones DE, Creasman WT, Dombroski RA, Lentz SS, Waeltz JL. Evaluation of the atypical Pap smear. Am J Obstet Gynecol. 1987; 157: 5449. [PubMed]
Jones MH, Jenkins D, Cuzick J, Wolfendale MR, Jones JJ, Balogun-Lynch C, Usherwood MM, Singer A. Mild cervical dyskaryosis: Safety of cytological surveillance. Lancet. 1992; 339: 14403. [PubMed]
Jones WB, Shingleton HM, Russell A, Chmiel JS, Fremgen AM, Clive RE, Zuber-Ocwieja KE, Winchester DP. Patterns of care for invasive cervical cancer: Results of a national survey of 1984 and 1990. Cancer. 1995; (suppl);76: 193447. [PubMed]
Kataja V, Syrjanen K, Mantyjarvi R, Vayrynen M, Syrjanen S, Saarikoski S, Parkkinen S, Yliskoski M, Salonen JT, Castren O. Prospective follow-up of cervical HPV infections: Life table analysis of histopathological, cytological, and colposcopic data. Eur J Epidemiol. 1989; 5: 17. [PubMed]
Kataja V, Syrjanen K, Syrjanen S, Mantyjarvi R, Yliskoski M, Saarkoski S, Salonen JT. Prospective follow-up of genital HPV infections:survival analysis of the HPV typing data. Eur J Epidemiol. 1990; 6: 914. [PubMed]
Kaufman RH, Adam E, Icenogle J, Reeves WC. Human papillomavirus testing as triage for atypical squamous cells of undetermined significance and low-grade squamous intraepithelial lesions: Sensitivity, specificity, and cost-effectiveness. Am J Obstet Gynecol. 1997; 177: 9306. [PubMed]
Kaufman RH, Schreiber K, Carter T. Analysis of atypical squamous (glandular) cells of undetermined significance smears by neural network-directed review. Obstet Gynecol. 1998; 91: 55660. [PubMed]
Kealy WF. Correlation of cervical cytodiagnosis and histopathology -- an exercise in quality control. Ir J Med Sci. 1986; 155: 3818. [PubMed]
Kiviat N. Natural history of cervical neoplasia: Overview and update. Am J Obstet Gynecol. 1996; 175: 1099104. [PubMed]
Kjerulff KH, Guzinski GM, Langenberg PW, Stolley PD, Moye NEA, Kazandjian VA. Hysterectomy and race. Obstet Gynecol. 1993; 82: 75764. [PubMed]
Koonings PP, Dickinson K, d'Ablaing G 3d, Schlaerth JB. A randomized clinical trial comparing the Cytobrush and cotton swab for Papanicolaou smears. Obstet Gynecol. 1992; 80: 2415. [PubMed]
Koopmanschap MA, van Oortmarssen GJ, van Agt HM, van Ballegooijen M, Habbema JD, Lubbe KT. Cervical-cancer screening: Attendance and cost-effectiveness. Int J Cancer. 1990; 45: 4105. [PubMed]
Korach J, Menczer J, Lusky A. A comparison between referral and hospital cervical cytology laboratory reports. Isr J Med Sci. 1995; 31: 3525. [PubMed]
Korn AP, Autry M, DeRemer PA, Tan W. Sensitivity of the Papanicolaou smear in human immunodeficiency virus-infected women. Obstet Gynecol. 1994; 83: 4014. [PubMed]
Koutsky L. Epidemiology of genital human papillomavirus infection. Am J Med. 1997; 102(5A): 38. [PubMed]
Koutsky LA, Holmes KK, Critchlow CW, Stevens CE, Paavonen J, Beckmann AM, DeRouen TA, Galloway DA, Vernon D, Kiviat NB. A cohort study of the risk of cervical intraepithelial neoplasia grade 2 or 3 in relation to papillomavirus infection. N Engl J Med. 1992; 327: 12728. [PubMed]
Kwikkel HJ, Quaak MJ, de With C. Predictive value of the abnormal Pap smear: A retrospective analysis of error rates. Eur J Obstet Gynecol Reprod Biol. 1986; 21: 10112. [PubMed]
Landis SH, Murray T, Bolden S, Wingo PA. Cancer statistics, 1998. CA Cancer J Clin. 1998; 48: 629. [PubMed]
Landoni F, Maneo A, Columbo A, Placa F, Milani R, Perego P, Favini P, Fern L, Mangioni C. Randomised study of radical surgery versus radiotherapy for stage Ib-IIa cervical cancer. Lancet. 1997; 350: 53240.
Lederer H, Lambourne A. The results of screening by cervical cytology and of histological examination of gynaecological operation specimens. J Obstet Gynaecol Br Commonw. 1973; 80: 6771. [PubMed]
Lee JSJ, Kuan L, Oh S, Patten FW, Wilbur DC. A feasibility study of the AutoPap System location-guided screening. Acta Cytol. 1998; 42: 2216. [PubMed]
Lee KR, Ashfaq R, Birdsong GG, Corkill ME, McIntosh KM, Inhorn SL. Comparison of conventional Papanicolaou smears and a fluid-based, thin-layer system for cervical cancer screening. Obstet Gynecol. 1997; 90: 27884. [PubMed]
Lepine LA, Hillis SD, Marchbanks PA, Koonin LM, Morrow B, Kieke BA, Wilcox LS. Hysterectomy surveillance -- United States, 1980-1993. CDC Surveillance Summaries, August 8, 1997. MMWR. 1997; 46(No. SS-4): 114.
Lozowski MS, Mishriki Y, Talebian F, Solitare G. The combined use of cytology and colposcopy in enhancing diagnostic accuracy in preclinical lesions of the uterine cervix. Acta Cytol. 1982; 26: 28591. [PubMed]
Luce BR. The implications of cost-effectiveness analysis of medical technology. Background paper No. 2: Case studies of medical technologies. case study No. 7: Allocating costs and benefits in disease prevention programs: an application to cervical cancer screening. U.S. Congress, Office of Technology Assessment; 1981 Jun. 43 p.
MacCormac L, Lew W, King G, Allen PW. Gynaecological cytology screening in South Australia: A 23-year experience. Med J Aust. 1988; 149: 5306. [PubMed]
Maggi R, Zannoni E, Giorda G, Biraghi P, Sideri M. Comparison of repeat smear, colposcopy, and colposcopically directed biopsy in the evaluation of the mildly abnormal smear. Gynecol Oncol. 1989; 35: 2946. [PubMed]
Mandelblatt JS, Fahs MC. The cost-effectiveness of cervical cancer screening for low-income elderly women. JAMA. 1988; 259: 240913. [PubMed]
Mandelblatt J, Freeman H, Winczewski D, Cagney K, Williams S, Trowers R, Tang J, Gold K, Lin TH, Kerner J. The costs and effects of cervical and breast cancer screening in a public hospital emergency room. The Cancer Control Center of Harlem. Am J Public Health. 1997; 87(7): 11829. [PubMed]
Mango LJ, Radensky PW. Interactive neural network-assisted screening. A clinical assessment. Acta Cytol. 1998; 42: 23345. [PubMed]
Mango LJ, Valente PT. Neural network-assisted analysis and microscopic rescreening in presumed negative cervical cytologic smears. A comparison. Acta Cytol. 1998; 42: 22732. [PubMed]
Mann W, Lonky N, Massad S, Scotti R, Blanco J, Vasilev S. Papanicolaou smear screening augmented by a magnified chemiluminescent exam. Int J Gynaecol Obstet. 1993; 43: 28996. [PubMed]
Martin L, Calle E, Wingo P, Heath C. Comparison of mammography and Pap test use from the 1987 and 1992 National Health Interview Surveys: Are we closing the gaps? Am J Prev Med. 1996; 12: 8290.
Matsunaga G, Tsuji I, Sato S, Fukao A, Hisamichi S, Yajima A. Cost-effectiveness analysis of mass screening for cervical cancer in Japan. J Epidemiol. 1997; 7: 13541. [PubMed]
Mayeaux EJ Jr, Harper MB, Abreo F, Pope JB, Phillips GS. A comparison of the reliability of repeat cervical smears and colposcopy in patients with abnormal cervical cytology. J Fam Pract. 1995; 40: 5762. [PubMed]
Melnikow J, Nuovo J, Paliescheskey M. Management choices for patients with squamous atypia on Papanicolaou smear. A toss up? Med Care. 1996; 34: 33647. [PubMed]
Melnikow J, Nuovo J, Paliescheskey M, Stewart GK, Howell L, Green W. Detection of high-grade cervical dysplasia: Impact of age and Bethesda System terminology. Diagn Cytopathol. 1997; 17: 3215. [PubMed]
Miller BA, Kolonel LN, Bernstein L, Young JL, Swanson GM, West D, Key CR, Liff JM, Glover CS, Alexander GA et al. Racial/ethnic patterns of cancer in the United States 1988-1992. Bethesda, MD: U.S. Department of Health and Human Services, National Institutes of Health; 1998.
Miller DK, Homan SM. Determining transition probabilities: Confusion and suggestions. Med Decis Making. 1994; 14: 528. [PubMed]
Miller WC. Bias in discrepant analysis: When two wrongs don't make a right. J Clin Epidemiol. 1998; 51: 21931. [PubMed]
Mitchell H, Medley G. Detection of unsuspected abnormalities by PAPNET-assisted review. Acta Cytol. 1998; 42: 2604. [PubMed]
Mitchell M, Tortolero-Luna G, Wright T, Sarkar A, Richards-Kortum R, Hong WK, Schottenfeld D. Cervical human papillomavirus infection and intraepithelial neoplasia: A review. J Natl Cancer Inst Mongr. 1996; 21: 1725.
Morrison BW, Erickson ER, Doshi N, Russo JF. The significance of atypical cervical smears. J Reprod Med. 1988; 33: 80912. [PubMed]
Morrison EA, Goldberg GL, Hagan RJ, Kadish AS, Burk RD. Self-administered home cervicovaginal lavage: A novel tool for the clinical-epidemiologic investigation of genital human Papillomavirus infections. Am J Obstet Gynecol. 1992; 167: 1047. [PubMed]
Moscicki AB, Shiboski S, Broering J, Powell K, Clayton L, Jay N, Darragh TM, Brescia R, Kanowitz S, Miller SB, et al. The natural history of human papillomavirus infection as measured by repeated DNA testing in adolescent and young women. J Pediatr. 1998; 132: 27784. [PubMed]
Muller C, Mandelblatt J, Schechter CB, Power EJ, Duffy BM, Wagner JL. Costs and effectiveness of cervical cancer screening in elderly women. Washington, DC: US Congress, Office of Technology Assessment; 1990.
Munoz N, Kato I, Bosch FX, Eluf-Neto J, SanJose SD, Ascunce N, Gili M, Izarzugaza I, Viladiu P, Tormo M-J, et al. Risk factors for HPV DNA detection in middle-aged women. Sex Transm Dis 1996:504-10.
Myers ER, Thompson JW, Simpson K. Cost-effectiveness of mandatory compared with voluntary screening for human immunodeficiency virus in pregnancy. Obstet Gynecol. 1998; 91: 17481. [PubMed]
Naslund I, Auer G, Pettersson F, Sjovall K. Evaluation of the pulse wash sampling technique for screening of uterine cervical carcinoma. Acta Radiol Oncol. 1986; 25: 1316. [PubMed]
National Cancer Institute Workshop. The Bethesda System for reporting cervical/vaginal cytologic diagnoses: Revised after the second National Cancer Institute Workshop, April 29-30, 1991. Acta Cytol. 1993; 37: 11524. [PubMed]
National Center for Health Statistics. http://www.cdc.gov/nchswww/daa/gm291_1.pdf.
Noller KL. Incident and demographic trends in cervical neoplasia. Am J Obstet Gynecol. 1996; 175(4 Pt 2): 108890. [PubMed]
Nyirjesy I. Atypical or suspicious cervical smears. An aggressive diagnostic approach. JAMA. 1972; 222: 6913. [PubMed]
O'Brien BJ, Heyland D, Richardson WS, Levine M; Drummond MF. Users' guides to the medical literature. XIII. How to use an article on economic analysis of clinical practice. B. What are the results and will they help me in caring for my patients? For the Evidence-Based Medicine Working Group. JAMA. 1997; 277: 18026. [PubMed]
Okagaki T, Zelterman D. Information, discrimination and divergence in cytology. II. Total discrimination as a measure of performance. Acta Cytol. 1991; 35: 259. [PubMed]
O'Leary TJ, Tellado M, Buckner SB, Ali IS, Stevens A, Ollayos CW. PAPNET-assisted rescreening of cervical smears. Cost and accuracy compared with a 100% manual rescreening strategy. JAMA. 1998; 279: 2357. [PubMed]
Oyer R, Hanjani P. Endocervical curettage: Does it contribute to the management of patients with abnormal cervical cytology? Gynecol Oncol. 1986; 25: 20411. [PubMed]
Pang JC, Hsu C, Kwan-Chau KM. Cytology and colposcopy in the diagnosis of cervical neoplasia. Gynecol Oncol. 1977; 5: 13441. [PubMed]
Parham DM, Wiredu EK, Hussein KA. The cytological prediction of cervical intraepithelial neoplasia in colposcopically directed biopsies. Cytopathology. 1991; 2: 28590. [PubMed]
Parker A, Ueki M. A comparison of preoperative exfoliative cervical cytology with subsequent histology. N Z Med J. 1986; 99: 4146. [PubMed]
Parkin DM. The epidemiological basis for evaluating screening policies. New developments in cervical cancer screening and prevention. Franco E and Monsonego J, editors. Oxford, England: Blackwell Science; 1997.
Paskett ED, Rimer BK. Psychosocial effects of abnormal Pap tests and mammograms: A review. J Women's Health. 1995; 4: 7381.
Paskett ED, Carter WB, Chu J, White E. Compliance behavior in women with abnormal Pap smears. Developing and testing a decision model. Med Care. 1990; 28: 64356. [PubMed]
Patten SF Jr, Lee JSJ, Wilbur DC, Bonfiglio TA, Colgan TJ, Richart RM, Cramer H, Moinuddin S. The AutoPap 300 QC System multicenter clinical trials for use in quality control rescreening of cervical smears. I. A prospetive intended use study. Cancer. 1997a; 81: 33742. [PubMed]
Patten SF Jr, Lee JSJ, Wilbur DC, Bonfiglio TA, Colgan TJ, Richart RM, Cramer H, Moinuddin S. The AutoPap 300 QC System multicenter clinical trials for use in quality control rescreening of cervical smears. II. Prospective and archival sensitivity studies. Cancer. 1997b; 81: 3437. [PubMed]
Prabhakar AK. Cervical cancer in India -- strategy for control. Indian J Cancer. 1992; 29: 10413. [PubMed]
Pretorius R, Semrad N, Watring W, Fotheringham N. Presentation of cervical cancer. Gynecol Oncol. 1991; 2: 4853.
Prorok PC. Mathematical models and natural history in cervical cancer screening. Screening for cancer of the uterine cervix. Hakama M, Miller AB, Day NE, editors. Lyon (France): International Agency for Research on Cancer 1986;76:185-98.
Raab SS. The cost-effectiveness of cervical-vaginal rescreening. Am J Clin Pathol. 1997; 108: 52536. [PubMed]
Radensky PW, Mango LJ. Interactive neural network-assisted screening. An economic assessment. Acta Cytol. 1998; 42: 24652. [PubMed]
Ramirez EJ, Hernandez E, Miyazawa K. Cervical conization findings in women with dysplastic cervical cytology and normal colposcopy. J Reprod Med. 1990; 35: 35961. [PubMed]
Ransohoff DF, Feinstein AR. Problems of spectrum and bias in evaluating the efficacy of diagnostic tests. N Engl J Med. 1978; 299: 92630. [PubMed]
Rasbridge SA, Nayagam M. Discordance between cytologic and histologic reports in cervical intraepithelial neoplasia. Results of a one-year audit. Acta Cytol. 1995; 39: 64853. [PubMed]
Reagan JW, Fu YS. Histologic types and prognosis of cancers of the uterine cervix. Int J Radiat Oncol Biol Phys. 1979; 5: 101520. [PubMed]
Richart RM. Cervical intraepithelial neoplasia. In: Pathology Annual. East Norwalk, CT: Appleton-Century-Crofts; 1973. p. 301-23.
Richart RM, Barron BA. Screening strategies for cervical cancer and cervical intraepithelial neoplasia. Cancer. 1981; 47(5 Suppl): 117681. [PubMed]
Ries LAG, Kosary CL, Hankey BF, Miller BA, Harras A, Edwards BK. SEER cancer statistics review, 1973-1994. Bethesda, MD: National Cancer Institute; 1997. NIH Pub No. 97-2789.
Ries LAG, Kosary CL, Hankey BF, Miller BA, Edwards BK, editors. SEER cancer statistics review, 1973-1995. Bethesda, MD: National Cancer Institute; 1998.
Ritter DB, Kadish AS, Vermund SH, Romney SL, Villari D, Burk RD. Detection of human papillomavirus deoxyribonucleic acid in exfoliated cervicovaginal cells as a predictor of cervical neoplasia in a high-risk population. Am J Obstet Gynecol. 1988; 159: 151725. [PubMed]
Roberts JM, Gurley AM, Thurloe JK, Bowditch R, Laverty CRA. Evaluation of the ThinPrep Pap test as an adjunct to the conventional Pap smear. Med J Aust. 1997; 167: 4669. [PubMed]
Ronco G, Montanari G, Aimone V, Parisio F, Segnan N, Valle A, Volante R. Estimating the sensitivity of cervical cytology: Errors of interpretation and test limitations. Cytopathology. 1996; 7: 1518. [PubMed]
Russell AH, Shingleton HM, Jones WB, Fremgen A, Winchester DP, Clive R, Chmiel JS. Diagnostic assessements in patients with invasive cancer of the cervix: A national patterns of care study of the American College of Surgeons. Gynecol Oncol. 1996; 63: 15965. [PubMed]
Schechter CB. Cost-effectiveness of rescreening conventionally prepared cervical smears by PAPNET testing. Acta Cytol. 1996; 40: 127282. [PubMed]
Schiffman MH, Bauer HM, Hoover RN, Glass AG, Cadell DM, Rush BB, Scott DR, Sherman ME, Kurman RJ, Wacholder S, et al. Epidemiologic evidence showing that human papillomavirus infection causes most cervical intraepithelial neoplasia. J Natl Cancer Inst. 1993; 85: 95864. [PubMed]
Schwartz PE, Hadjimichael O, Lowell DM, Merino MJ, Janerich D. Rapidly progressive cervical cancer: The Connecticut experience. Am J Obstet Gynecol. 1996; 175: 11059. [PubMed]
Schweitzer SO. Cost effectiveness of early detection of disease. Health Serv Res. 1974; 9: 2232. [PubMed] [Free Full Text in PMC icon.Free Full text in PMC]
Schweitzer SO, Luce BR. A cost-effective approach to cervical cancer detection: Final report and executive summary. Rockville, MD: Agency for Health Care Policy and Research (formerly the National Center for Health Services Research); 1978 Apr. National Center for Health Services Research Publication 79-82. 86 p. Available from: NTIS, Springfield, VA; PB292685.
Sherlaw-Johnson C, Gallivan S, Jenkins D, Jones MH. Cytological screening and management of abnormalities in prevention of cervical cancer: An overview with stochastic modelling. J Clin Pathol. 1994; 47: 4305. [PubMed] [Free Full Text in PMC icon.Free Full text in PMC]
Sherman ME, Kelly D. High-grade squamous intraepithelial lesions and invasive carcinoma following the report of three negative Papanicolaou smears: Screening failures or rapid progression? Mod Pathol. 1992; 5: 33742. [PubMed]
Sherman ME, Mendoza M, Lee KR, Ashfaq R, Birdsong, Corkill ME, McIntosh KM, Inhorn SL, Baber G, Barber C, Stoler M. Peformance of liquid-based, thin-layer cervical cytology: Correlation with reference diagnosis and HPC testing. Mod Pathol, in press.
Shingleton HM, Patrick RL, Johnston WW, Smith RA. The current status of the Papanicolaou smear. CA Cancer J Clin. 1995; 45: 30520. [PubMed]
Siegel JE, Weinstein MC, Russell LB, Gold MR. For the Panel on Cost-Effectiveness in Health and Medicine. Recommendations for reporting cost-effectiveness analyses. JAMA. 1996; 276: 133941. [PubMed]
Singh P, Pang M, Ratnam SS. Selective cervical conization following colposcopy: A critical evaluation of 107 cases. Singapore Med J. 1985; 26: 28994. [PubMed]
Skehan M, Soutter WP, Lim K, Krausz T, Pryse-Davies J. Reliability of colposcopy and directed punch biopsy. Br J Obstet Gynaecol. 1990; 97: 8116. [PubMed]
Slawson DC, Bennett JH, Herman JM. Are Papanicolaou smears enough? Acetic acid washes of the cervix as adjunctive therapy: A HARNET study. Harrisburg Area Research Network. J Fam Pract. 1992; 35: 2717. [PubMed]
Slawson DC, Bennett JH, Simon LJ, Herman JM. Should all women with cervical atypia be referred for colposcopy: A HARNET study. J Fam Pract. 1994; 38: 38792. [PubMed]
Smith EM, Johnson SR, Figuerres EJ, Mendoza M, Fedderson D, Haugen TH, Tureck LP. The frequency of human papillomavirus detection in postmenopausal women on hormone replacement therapy. Gynecol Oncol. 1997; 65: 4416. [PubMed]
Smith JJ, Cowan L, Hemstreet G 3d, Smith ML, Pippitt CH Jr, Doggett R, McGill L. Automated quantitative fluorescent image analysis of cervical cytology. Gynecol Oncol. 1987; 28: 24154. [PubMed]
Sonnenberg FA, Beck JR. Markov models in medical decision making: A practical guide. Med Decis Making. 1993; 13: 32238. [PubMed]
Soost HJ, Lange HJ, Lehmacher W, Ruffing-Kullmann B. The validation of cervical cytology. Sensitivity, specificity and predictive values. Acta Cytol. 1991; 35: 814. [PubMed]
Soutter WP, Wisdom S, Brough AK, Monaghan JM. Should patients with mild atypia in a cervical smear be referred for colposcopy? Br J Obstet Gynaecol. 1986; 93: 704. [PubMed]
Spinillo A, Capuzzo E, Tenti P, De Santolo A, Piazzi G, Iasci A. Adequacy of screening cervical cytology among human immunodeficiency virus-seropositive women. Gynecol Oncol. 1998; 69: 10913. [PubMed]
Spitzer M, Krumholz BA, Chernys AE, Seltzer V, Lightman AR. Comparative utility of repeat Papanicolaou smears, cervicography, and colposcopy in the evaluation of atypical Papanicolaou smears. Obstet Gynecol. 1987; 69: 7315. [PubMed]
Stafl A. Cervicography: A new method for cervical cancer detection. Am J Obstet Gynecol. 1981; 139(7): 81525. [PubMed]
Stevens MW, Milne AJ, James KA, Brancheau D, Ellison D, Kuan L. Effectiveness of automated cervical cytology rescreening using the AutoPap 300 QC System. Diagn Cytopathol. 1997; 16: 50512. [PubMed]
Swets, JA. Measuring the accuracy of diagnostic systems. Science. 1988; 240: 128593. [PubMed]
Syrjanen KJ, Mantyjarvi R, Vayrynen M, Yliskoski M, Syrjanen SM, Saarikoski S, Nurmi T, Parkkinen S, Castren O. Cervical smears in assessment of the natural history of human papillomavirus infections in prospectively followed women. Acta Cytol. 1987; 31: 85565. [PubMed]
Syrjanen K, Kataja V, Ykliskoski M, Chang F, Syrjanen S, Saarikoski S. Natural history of cervical human papillomavirus lesions does not substantiate the biologic relevance of the Bethesda System. Obstet Gynecol. 1992; 79: 67582. [PubMed]
Tait IA, Alawattegama AB, Rees E. Screening for cervical dysplasia in department of genitourinary medicine. Genitourin Med. 1988; 64: 2558. [PubMed]
Tawa K, Forsythe A, Cove JK, Saltz A, Peters HW, Watring WG. A comparison of the Papanicolaou smear and the cervigram: Sensitivity, specificity, and cost analysis. Obstet Gynecol. 1988; 71: 22935. [PubMed]
Tay SK, Jenkins D, Singer A. Management of squamous atypia (borderline nuclear abnormalities): Repeat cytology or colposcopy? Aust N Z J Obstet Gynaecol. 1987; 27: 1401. [PubMed]
U.S. Congress Office of Technology Assessment. Allocating costs and benefits in disease prevention programs: An application to cervical cancer screening. Washington, DC: U.S. Government Printing Office; 1981.
U.S. Congress Office of Technology Assessment. The costs and effectiveness of cervical cancer screening in elderly women. OTA-BP-H-65. Washington, DC: U.S. Government Printing Office; 1990 Feb.
U.S. Preventive Services Task Force. Guide to clinical preventive services. Baltimore: Williams & Wilkins; 1996.
Upadhyay SN, Jha RS, Sinha TK, Mishra NK. Accuracy of cytology in screening for cervical cancer. Indian J Med Res. 1984; 80: 45762. [PubMed]
van Ballegooijen M, Habbema JD, van Oortmarssen GJ, Koopmanschap MA, Lubbe JT, van Agt HM. Preventive Pap-smears: Balancing costs, risks and benefits. Br J Cancer. 1992; 65: 9303. [PubMed]
van Ballegooijen M, van den Akker-van Marle ME, Warmerdam PG, Meijer CJ, Walboomers JM, Habbema JD. Present evidence on the value of HPV testing for cervical cancer screening: A model-based exploration of the (cost-)effectiveness. Br J Cancer. 1997; 76: 6517. [PubMed]
van Nagell JR Jr, Dudik LM, Frank AL, Allen DT, Leigh JP, Engelberg J. Cervical cancer. South Med J. 1987; 80: 7581. [PubMed]
van Oortmarssen G, Habbema J. Epidemiological evidence for age-dependent regression of pre-invasive cervical cancer. Br J Cancer. 1991; 64: 55965. [PubMed]
van Oortmarssen GJ, Habbema JDF. Duration of preclinical cervical cancer and reduction in incidence of invasive cancer following negative pap smears. Int J Epidemiol. 1995; 24: 3007. [PubMed]
van Oortmarssen GJ, Habbema JD, van Ballegooijen M. Predicting mortality from cervical cancer after negative smear test results. BMJ. 1992; 305: 44951. [PubMed] [Free Full Text in PMC icon.Free Full text in PMC]
Veljovich DS, Stoler MH, Anderson WA, Covell JL, Rice LW. Atypical glandular cells of undetermined significance: A five-year retrospective histopathological study. Am J Obstet Gynecol. 1998; 179(2): 38290. [PubMed]
Walker EM, Dodgson J, Duncan ID. Does mild atypia on a cervical smear warrant further investigation? Lancet. 1986; 2: 6723. [PubMed]
Waugh N, Robertson A. Costs and benefits of cervical screening. II. Is it worthwhile reducing the screening interval from 5 to 3 years? Cytopathology. 1996; 7: 2418. [PubMed]
Waugh N, Smith I, Robertson A, Reid GS, Halkerston R, Grant A. Costs and benefits of cervical screening. I. The costs of the cervical screening programme. Cytopathology. 1996a; 7: 23140. [PubMed]
Waugh N, Smith I, Robertson A, Reid GS, Halkerston R, Grant A. Costs and benefits of cervical screening. III. Cost/benefit analysis of a call of previously unscreened women. Cytopathology. 1996b; 7: 24955. [PubMed]
Weintraub J, Morabia A. Efficacy of the ThinPrep Pap test for the cytological detection of cervical cancer precursors in a low risk outpatient population: A large-scale investigator sponsored prospective study. Unpublished manuscript.
Wetrich DW. An analysis of the factors involved in the colposcopic evaluation of 2194 patients with abnormal Papanicolaou smears. Am J Obstet Gynecol. 1986; 154: 133949. [PubMed]
Wheeler CM, Parmenter CA, Hunt WC, Becker TM, Greer CE, Hildesheim A, Manos M. Determinants of genital human papillomavirus infection among cytologically normal women attending the University of New Mexico Student Health Center. Sex Transm Dis. 1993; 20: 2869. [PubMed]
Wheelock JB, Kaminski PF. Value of repeat cytology at the time of colposcopy for the evaluation of cervical intraepithelial neoplasia on Papanicolaou smears. J Reprod Med. 1989; 34: 8157. [PubMed]
Wilbur DC, Bonfiglio TA, Rutkowski MA, Atkison KM, Richart RM, Lee JS, Patten SF Jr. Sensitivity of the AutoPap 300 QC System for cervical cytologic abnormalities. Biopsy data confirmation. Acta Cytol. 1996; 40: 12732. [PubMed]
Wilbur DC, Prey MU, Miller WM, Pawlick GF, Colgan TJ. The AutoPap System for primary screening in cervical cytology. Comparing the results of a prospective, intended-use study with routine manual practice. Acta Cytol. 1998; 42: 21420. [PubMed]
Wilson S, Woodman C. Assessing the effectiveness of cervical screening. Clin Obset Gynecol. 1995; 38: 57784.
Woolf SH. An organized analytic framework for practice guideline development: Using the analytic logic as a guide for reviewing evidence, developing recommendations, and explaining the rationale. In: McCormick KA, Moore SR, Siegel RA, editors. Clinical practice guideline development: methodology perspectives. Rockville, MD: Agency for Health Care Policy and Research; 1994 Nov. AHCPR Pub. No. 95-0009.
Womeodu RJ, Bailey JE. Barriers to cancer screening. Med Clin North Am. 1996; 80: 11533. [PubMed]
Wright TC Jr, Ellerbrock TV, Chiasson MA, Van Devanter N, Sun XW. Cervical intraepithelial neoplasia in women infected with human immunodeficiency virus: Prevalence, risk factors, and validity of Papanicolaou smears. New York Cervical Disease Study. Obstet Gynecol. 1994b; 84: 5917. [PubMed]
Wright TC, Richart RM. Pathogenesis and diagnosis of preinvasive lesions of the lower genital tract. Hoskins WJ, Perez CA, Young RC, editors. In: Principles and practice of gynecologic oncology. Philadelphia: Lippincott; 1992.
Wright TC, Kurman RJ, Ferenczy A. Precancerous lesions of the cervix.Kurman RJ, editor. In: Blaustein's pathology of the female genital tract, 4 th edition. New York: Springer-Verlag; 1994a. p. 229-78.
Wright TC, Sun XW, Koulos J. Comparison of management algorithms for the evaluation of women with low-grade cytologic abnormalities. Obstet Gynecol. 1995; 85: 20210. [PubMed]
Young NA, Naryshkin S, Bowman RL. Value of repeat cervical smears at the time of colposcopic biopsy. Diagn Cytopathol. 1993; 9(6): 6469. [PubMed]
Bibliography
Adachi A, Fleming I, Burk RD, Ho GY, Klein RS. Women with human immunodeficiency virus infection and abnormal Papanicolaou smears: A prospective study of colposcopy and clinical outcome. Obstet Gynecol. 1993; 81(3): 3727. [PubMed]
Alanen KW, Elit LM, Molinaro PA, McLachlin CM. Assessment of cytologic follow-up as the recommended management for patients with atypical squamous cells of undetermined significance or low grade squamous intraepithelial lesions. Cancer. 1998; 84: 510. [PubMed]
Allahverdian V, Valaitis J, Kalis O, Pearlman S. Cytology and colposcopy in the diagnosis and management of outpatients with cervical intraepithelial neoplasia. J Reprod Med. 1980; 24: 14. [PubMed]
Allen KA, Zaleski S, Cohen MB. Review of negative Papanicolaou tests: Is the retrospective 5-year review necessary? Am J Clin Pathol. 1994; 101: 1921. [PubMed]
Alloub MI, Barr BB, McLaren KM, Smith IW, Bunney MH, Smart GE. Human papillomavirus infection and cervical intraepithelial neoplasia in women with renal allografts. BMJ. 1989; 298: 1536. [PubMed] [Free Full Text in PMC icon.Free Full text in PMC]
Alons-van Kordelaar JJ, Boon ME. Diagnostic accuracy of squamous cervical lesions studied in spatula-cytobrush smears. Acta Cytol. 1988; 32: 8014. [PubMed]
American Academy of Pediatrics. Guidelines for health supervision II. Elk Grove Village, IL: The Academy; 1988.
American Cancer Society. Guidelines for the cancer-related checkup: An update. New York: The Society; 1993.
American College of Obstetricians and Gynecologists. Cervical cytology: Evaluation and management of abnormalities. Washington, DC: American College of Obstetricians and Gynecologists; 1993 Aug. ACOG Technical Bulletin No. 183.
American College of Obstetricians and Gynecologists. ACOG Committee Opinion: Colposcopy training and practice. Washington, DC: American College of Obstetricians and Gynecologists; 1994. Publication No. 133.
American College of Obstetricians and Gynecologists. ACOG Committee Opinion: Recommendations on frequency of Pap test screening. Washington, DC: American College of Obstetricians and Gynecologists; 1995. Publication No. 152.
American College of Obstetricians and Gynecologists. Ambulatory care criteria set. Washington, DC: American College of Obstetricians and Gynecologists; 1995. Publication No. 6.
American College of Obstetricians and Gynecologists CoGP. ACOG Committee Opinion: Routine cancer screening. Number 185, September 1997 (replaces no. 128, October 1993). Int J Gynecol Obstet. 1997; 59: 15761.
American College of Physicians. Screening for cervical cancer. In: Eddy DM, editor. Common screening tests. Philadelphia: American College of Physicians; 1991. p. 413-4.
American Joint Committee on Cancer. Cervix uteri.In: AJCC cancer staging manual. Philadelphia: Lippincott-Raven Publishers, 5th edition, 1997. p. 189-94.
American Medical Association. AMA guidelines for adolescent preventive services (GAPS): Recommendations and rationale. Chicago: American Medical Association; 1994.
Amschler DH. Pap tests every three years: Cost-effective in the long run? Health Educ. 1983; 14: 425. [PubMed]
Andersen W, Frierson H, Barber S, Tabbarah S, Taylor P, Underwood P. Sensitivity and specificity of endocervical curettage and the endocervical brush for the evaluation of the endocervical canal. Am J Obstet Gynecol. 1988; 159: 7027. [PubMed]
Anderson DJ, Flannelly GM, Kitchener HC, Fisher PM, Mann EM, Campbell MK, Templeton A. Mild and moderate dyskaryosis: Can women be selected for colposcopy on the basis of social criteria? BMJ. 1992; 305: 847. [PubMed] [Free Full Text in PMC icon.Free Full text in PMC]
Anderson DJ, Strachan F, Parkin DE. Cone biopsy: Has endocervical sampling a role? Br J Obstet Gynaecol. 1992; 99: 66870. [PubMed]
Anderson G, Macaulay C, Matisic J, Garner D, Palcic B. The use of an automated image cytometer for screening and quantitative assessment of cervical lesions in the British Columbia Cervical Smear Screening Programme. Cytopathology. 1997; 8: 298312. [PubMed]
Anderson M, Jones BA. False positive cervicovaginal cytology: A follow-up study. Acta Cytol. 1997; 41: 1697700. [PubMed]
Andrews S, Miyazawa K. The significance of a negative Papanicolaou smear with hyperkeratosis or parakeratosis. Obstet Gynecol. 1989; 73(5 Pt 1): 7513. [PubMed]
Andrews S, Hernandez E, Miyazawa K. Paired Papanicolaou smears in the evaluation of atypical squamous cells. Obstet Gynecol. 1989; 73(5 Pt 1): 74750. [PubMed]
Anonymous. Population screening for cervical cancer in The Netherlands. A report by the Evaluation Committee. Int J Epidemiol. 1989; 18: 77581. [PubMed]
Anonymous. Pap smears cost-effective screening for high-risk elderly. Geriatrics. 1991; 45: 1823.
Anonymous. PAPNET cytological screening system. Lab Med. 1991; 22: 27680.
Aponte-Cipriani SL, Teplitz C, Rorat E, Savino A, Jacobs AJ. Cervical smears prepared by an automated device versus the conventional method. A comparative analysis. Acta Cytol. 1995; 39: 62330. [PubMed]
Ashfaq R, Liang Y, Saboorian MH. Evaluation of PAPNET system for rescreening of negative cervical smears. Diagn Cytopathol. 1995; 13: 316. [PubMed]
Ashfaq R, Solares B, Saboorian MH. Detection of endocervical component by PAPNET system on negative cervical smears. Diagn Cytopathol. 1996; 15: 1213. [PubMed]
Ashfaq R, Thomas S, Saboorian MH. Efficiency of PAPNET in detecting infectious organisms in cervicovaginal smears. Acta Cytol. 1996; 40: 8858. [PubMed]
Ashfaq R, Saliger F, Solares B, Thomas S, Liu G, Liang Y, Saboorian MH. Evaluation of the PAPNET system for prescreening triage of cervicovaginal smears. Acta Cytol. 1997; 41: 105864. [PubMed]
Audet-Lapointe P. Detection of cervical cancer in a women's prison. Can Med Assoc J. 1971; 104: 50911. [PubMed] [Free Full Text in PMC icon.Free Full text in PMC]
August N. Cervicography for evaluating the atypical Papanicolaou smear. J Reprod Med. 1991; 36: 8994. [PubMed]
Austin RM. Results of blinded rescreening of Papanicolaou smears versus biased retrospective review. Arch Pathol Lab Med. 1997; 121: 3114. [PubMed]
Austin RM, Ramzy I. Increased detection of epithelial cell abnormalities by liquid-based gynecologic cytology preparations. A review of accumulated data. Acta Cytol. 1998; 42: 17884. [PubMed]
Autier P, Coibion M, Huet F, Grivegnee AR. Transformation zone location and intraepithelial neoplasia of the cervix uteri. Br J Cancer. 1996; 74: 48890. [PubMed]
Awen C, Hathway S, Eddy W, Voskuil R, Janes C. Efficacy of ThinPrep preparation of cervical smears: A 1,000-case, investigator-sponsored study. Diagn Cytopathol. 1994; 11: 336; discussion 36-7. [PubMed]
Aziz DC, Ferre F, Robitaille J, Ferenczy A. Human papillomavirus testing in the clinical laboratory. Part I: Squamous lesions of the cervix. J Gynecol Surg. 1993 Spring; 9: 17. [PubMed]
Bacus JW. Cervical cell recognition and morphometric grading by image analysis. J Cell Biochem Suppl. 1995; 23: 3342. [PubMed]
Baker A, Melcher D, Smith R. Role of re-screening of cervical smears in internal quality control. J Clin Pathol. 1995; 48: 10024. [PubMed] [Free Full Text in PMC icon.Free Full text in PMC]
Baker RW, Wadsworth J, Brugal G, Coleman DV. An evaluation of "rapid review" as a method of quality control of cervical smears using the AxioHOME microscope. Cytopathology. 1997; 8: 8595. [PubMed]
Baldauf JJ, Dreyfus M, Ritter J, Philippe E. Colposcopy and directed biopsy reliability during pregnancy: A cohort study. Eur J Obstet Gynecol Reprod Biol. 1995 Sep; 62(1): 316.
Baldauf JJ, Dreyfus M, Lehmann M, Ritter J, Philippe E. Cervical cancer screening with cervicography and cytology. Eur J Obstet Gynecol Reprod Biol. 1995; 58: 339. [PubMed]
Baldauf JJ, Dreyfus M, Ritter J, Meyer P, Philippe E. Screening histories of incidence cases of cervical cancer and high grade SIL. A comparison. Acta Cytol. 1997; 41: 14318. [PubMed]
Bamber, D. The area above the ordinal dominance graph and the area below the receiver operating characteristic graph. J Math Psychol. 1975; 12: 387415.
Barnes BA. Papanicolaou cervical smears for screening in asymptomatic women. Prim Care. 1981; 8: 13140. [PubMed]
Barnes BA, Barnes AB. Evaluation of surgical therapy by cost-benefit analysis. Surgery. 1977; 82: 2133. [PubMed]
Barton SE, Jenkins D, Hollingworth A, Cuzick J, Singer A. An explanation for the problem of false-negative cervical smears. Br J Obstet Gynaecol. 1989; 96: 4825. [PubMed]
Bartoo GT, Lee JS, Bartels PH, Kiviat NB, Nelson AC. Automated prescreening of conventionally prepared cervical smears: A feasibility study. Lab Invest. 1992; 66: 11622. [PubMed]
Bauer HM, Hildesheim A, Schiffman MH, Glass AG, Rush BB, Scott DR, Cadell DM, Kurman RJ, Manos MM. Determinants of genital human papillomavirus infection in low-risk women in Portland, Oregon. Sex Transm Dis. 1993; 20: 2748. [PubMed]
Bearman DM, MacMillan JP, Creasman WT. Papanicolaou smear history of patients developing cervical cancer: An assessment of screening protocols. Obstet Gynecol. 1987; 69: 1515. [PubMed]
Beeby AR, Keating PJ, Wagstaff TI, Wilson EMJ, Sims PF, Manning PJ, Wadehra V. The quality and accuracy of cervical cytology: A study of five sampling devices in a colposcopy clinic. J Obstet Gynaecol. 1993; 13: 27681.
Beeby AR, Wadehra V, Keating PJ, Wagstaff TI. A retrospective analysis of 94 patients with CIN and false negative cervical smears taken at colposcopy. Cytopathology. 1993; 4: 3317. [PubMed]
Begg, CB. Advances in statistical methodology for diagnostic medicine in the 1980's. Stat Med. 1991; 10: 188795. [PubMed]
Begg, CB, Greenes RA. Assessment of diagnostic tests when disease verification is subject to selection bias. Biometrics. 1983; 39: 20715. [PubMed]
Beilby JO, Bourne R, Guillebaud J, Steele ST. Paired cervical smears: A method of reducing the false-negative rate in population screening. Obstet Gynecol. 1982; 60: 468. [PubMed]
Berger BM. Using the Pathfinder system to reduce missed abnormal cervical cytologic smear cases in a rescreening program. Acta Cytol. 1997; 41: 17381. [PubMed]
Bergeron C, Debaque H, Ayivi J, Amaizo S, Fagnani F. Cervical smear histories of 585 women with biopsy-proven carcinoma in situ. Acta Cytol. 1997; 41: 167680. [PubMed]
Berget A, Olsen J, Poll P. Sensitivity and specificity of screening by cervico-vaginal cytology. Dan Med Bull. 1977; 24: 269. [PubMed]
Bergstrom R, Adami HO, Gustafsson L, Ponten J, Sparen P. Detection of preinvasive cancer of the cervix and the subsequent reduction in invasive cancer. J Natl Cancer Inst. 1993; 85: 10507. [PubMed]
Berkeley AS, LiVolsi VA, Schwartz PE. Advanced squamous cell carcinoma of the cervix with recent normal Papanicolaou tests. Lancet. 1980; 2: 3756. [PubMed]
Berkowitz RS, Ehrmann RL, Lavizzo-Mourey R, Knapp RC. Invasive cervical carcinoma in young women. Gynecol Oncol. 1979; 8: 3116. [PubMed]
Bernstein A, Vitner S, Webber JM. Evaluation of a new tampon device for cytologic autocollection and mass screening of cervical cancer and its precursors. Am J Obstet Gynecol. 1985; 151: 3515. [PubMed]
Bethwaite J, Rayner T, Bethwaite P. Economic aspects of screening for cervical cancer in New Zealand. N Z Med J. 1986; 99: 74751. [PubMed]
Bigelow JH, Ellis DB. The false-negative rate in screening for cervical cancer. In: Proceedings of the Joint IIASA/WHO Workshop on Screening for Cervical Cancer. Laxenburg (Austria): International Institute for Applied Systems Analysis; 1975. p. 34-46.
Bigelow JH. The natural history of cervical cancer. In: Proceedings of the Joint IIASA/WHO Workshop on Screening for Cervical Cancer. Laxenburg, Austria: International Institute for Applied Systems Analysis; 1975.
Bigrigg MA, Codling BW, Pearson P, Read MD, Swingler GR. Colposcopic diagnosis and treatment of cervical dysplasia at a single clinic visit. Experience of low-voltage diathermy loop in 1000 patients. Lancet. 1990; 336: 22931. [PubMed]
Bishop JW, Bigner SH, Colgan TJ, Husain M, Howell HP, McIntosh KM, Taylor DA, Sadeghi MH. Multicenter masked evaluation of AutoCyte PREP thin layers with matched conventional smears. Including initial biopsy results. Acta Cytol. 1998; 42: 18997. [PubMed]
Bishop JW, Hartinger JS, Pawlick GF. Time interval effect on repeat cervical smear results. Acta Cytol. 1997; 41: 26976. [PubMed]
Bishop JW. Comparison of the CytoRich system with conventional cervical cytology. Preliminary data on 2,032 cases from a clinical trial site. Acta Cytol. 1997; 41: 1523. [PubMed]
Blakely DD, Oddone EZ, Hasselblad V, Simel DL, Matchar DB. Noninvasive carotid artery testing. Ann Intern Med. 1995; 122: 3607. [PubMed]
Block B, Branham RA. Efforts to improve the follow-up of patients with abnormal Papanicolaou test results. J Am Board Fam Pract. 1998; 11: 111. [PubMed]
Blue Cross Blue Shield Association. Monolayer slide preparation and automated slide reading systems for cervical cancer screening -- clinical effectiveness analysis. Chicago, IL: Blue Cross Blue Shield Association Technical Evaluation Center Assessment Program; Vol. 15, No. 1; 1998 April.
Bocking A, Hilgarth M, Auffermann W, Hack-Werdier C, Fischer-Becker D, von Kalkreuth G. DNA-cytometric diagnosis of prospective malignancy in borderline lesions of the uterine cervix. Acta Cytol. 1986; 30: 60815. [PubMed]
Boden E, Evander M, Wadell G, Bjersing L, von Schoultz B, Rylander E. Detection of human papilloma virus in women referred for colposcopy. A comparison between different diagnostic methods. Acta Obstet Gynecol Scand. 1990; 69: 1539. [PubMed]
Bolger BS, Lewis BV. A prospective study of colposcopy in women with mild dyskaryosis or koilocytosis. Br J Obstet Gynaecol. 1988; 95: 11179. [PubMed]
Bolick DR, Hellman DJ. Laboratory implementation and efficacy assessment of the ThinPrep cervical cancer screening system. Acta Cytol. 1998; 42: 20913. [PubMed]
Bonfiglio TA. Cervical cytology: Perspectives from both sides of the Atlantic. Human Pathol. 1997; 28: 1179. [PubMed]
Boon ME, de Graaff Guilloud JC. Cost effectiveness of population screening and rescreening for cervical cancer in the Netherlands. Acta Cytol. 1981; 25: 53942. [PubMed]
Boon ME, de Graaff Guilloud JC, Rietveld WJ, Wijsman-Grootendorst A. Effect of regular 3-yearly screening on the incidence of cervical smears: The Leiden experience. Cytopathology. 1990; 1: 20110. [PubMed]
Boon ME, Kok LP. Neural network processing can provide means to catch errors that slip through human screening of pap smears. Diagn Cytopathol. 1993; 9: 41116. [PubMed]
Boon ME, Kleinschmidt-Guy ED, Ouwerkerk-Noordam E. PAPNET for analysis of proliferating (MIB-1 positive) cell populations in cervical smears. Eur J Morphol. 1994; 32: 7885. [PubMed]
Boon ME, Kok LP, Nygaard-Nielsen M, Holm K, Holund B. Neural network processing of cervical smears can lead to a decrease in diagnostic variability and an increase in screening efficacy: A study of 63 false-negative smears. Mod Pathol. 1994; 7: 95761. [PubMed]
Boon ME, Kok LP, Beck S. Histologic validation of neural network-assisted cervical screening: comparison with the conventional procedure. Cell Vision. 1995; 2: 237.
Boon ME. Assessing costs and benefits of PAPNET. Acta Cytol. 1996; 40: 110911. [PubMed]
Boyko EJ. Meta-analysis of Pap test accuracy. Am J Epidemiol. 1996; 143: 4067. [PubMed]
Brown AD, Garber AM. The cost-effectiveness of three new technologies to enhance Pap testing. Chicago, IL: Blue Cross Blue Shield Association Technical Evaluation Center Assessment Program Special Assessment; April 1998.
Bryant J, Stevens A. Cervical screening interval. Bristol, England: U.K. National Health Service, South and West Regional Health Authority; 1995. 10 p.
Buchanan M, Husain OA, Butler EB. Costs and benefits of cervical screening. Cytopathology. 1997; 8: 45.
Buntinx F, Brouwers M. Relation between sampling device and detection of abnormality in cervical smears: A meta-analysis of randomised and quasi-randomised studies. BMJ. 1996; 313: 128590. [PubMed]
Buntinx F, Knottnerus JA, Crebolder HF, Essed GG, Schouten H. Relation between quality of cervical smears and probability of abnormal results. BMJ. 1992; 304: .
Bur M, Knowles K, Pekow P, Corral O, Donovan J. Comparison of ThinPrep preparations with conventional cervicovaginal smears. Practical considerations. Acta Cytol. 1995; 39: 63142. [PubMed]
Bureau of Labor Statistics. CPI-U. Medicare Services Average Annual Index, Base Period 1982-84.
Byrne P, Jordan J, Williams D, Woodman C. Importance of negative result of cervical biopsy directed by colposcopy. BMJ. 1988; 296: .
Canadian Coordinating Office for Health Technology Assessment. Assessment of techniques for cervical cancer screening. Ottawa: CCOHTA; 1997 May. CCOHTA Report 1997:2E.
Canadian Task Force on the Periodic Health Examination. Canadian guide to clinical preventive health care. Ottawa: Health Canada; 1994.
Cannistra SA, Niloff JM. Cancer of the uterine cervix. N Engl J Med. 1996; 334: 10308. [PubMed]
Cantor SB, Mitchell MF, Tortolero-Luna G, Bratka CS, Bodurka DC, Richards-Kotum R. Cost-effectiveness analysis of diagnosis and management of cervical squamos intraepithelial lesions. Obstet Gynecol. 1998; 91: 2707. [PubMed]
Carson HJ, DeMay RM. The mode ages of women with cervical dysplasia. Obstet Gynecol. 1993; 82: 4304. [PubMed]
Carter PG, Harris VG, Wilson POG. The management of the abnormal cervical smear: A comparative study between loop diathermy alone and laser ablation preceded by colposcopically directed punch biopsy. J Obstet Gynaecol. 1993; 13: 5963.
Carter PM, Coburn TC, Luszczak M. Cost-effectiveness of cervical cytologic examination during pregnancy. J Am Board Fam Pract. 1993; 6: 53745. [PubMed]
Cecchini S, Iossa A, Grazzini G. Estimate of clinical human papilloma virus infection prevalence in a population-based screening. Cervix Lower Female Genital Tract. 1998; 6: 2137.
Cecchini S, Bonardi R, Iossa A, Zappa M, Ciatto S. Colposcopy as a primary screening test for cervical cancer. Tumori. 1997; 83: 8103. [PubMed]
Cecchini S, Bonardi R, Mazzotta A, Grazzini G, Iossa A, Ciatto S. Testing cervicography and cervicoscopy as screening tests for cervical cancer. Tumori. 1993; 79: 225. [PubMed]
Celentano DD, deLissovoy G. Assessment of cervical cancer screening and follow-up programs. Public Health Rev. 1989; 17: 173240. [PubMed]
Centers for Disease Control and Prevention. Current trends: Premarital sexual experience among adolescent women -- United States, 1970-1988. MMWR. 1991; 39: 92932. [PubMed]
Chakrabarti S, Guijon FB, Paraskevas M. Brush vs.spatula for cervical smears: Histologic correlation with concurrent biopsies. Acta Cytol. 1994; 38: 3158. [PubMed]
Chesebro MJ, Everett WD. A cost-benefit analysis of colposcopy for cervical squamous intraepithelial lesions found on Papanicolaou smear. Arch Fam Med. 1996; 5: 57681. [PubMed]
Chia KV, Fayle RJ, Sobowale O. Efficacy of large loop excision of the transformation zone for cervical intraepithelial neoplasia. Aust N Z J Obstet Gynaecol. 1993; 33: 2879. [PubMed]
Chock C, Irwig L, Berry G, Glasziou P. Comparing dichotomous screening tests when individuals negative on both tests are not verified. J Clin Epidemiol. 1997; 50: 12117. [PubMed]
Chomet J. Screening for cervical cancer: A new scope for general practitioners?Results of the first year of colposcopy in general practice. BMJ. 1987; 294: 13268. [PubMed] [Free Full Text in PMC icon.Free Full text in PMC]
Chou P. Review on cervical cancer screening. Chung Hua I Hsueh Tsa Chih (Taipei). 1991; 48: 112. [PubMed]
Chou P, Lai MY, Chang HJ. Accuracy of screening for cervical cancer in Taiwan, 1974-1984. Chung Hua I Hsueh Tsa Chih (Taipei). 1990; 45: 14756. [PubMed]
Clement KD, Christenson PD. Papanicolaou smear cell recovery techniques used by primary care physicians. J Am Board Fam Pract. 1990; 3: 2538. [