Statistical methods to correct for verification bias in diagnostic studies are inadequate when there are few false negatives: a simulation study

Angel M Cronin; Andrew J Vickers

doi:10.1186/1471-2288-8-75

Statistical methods to correct for verification bias in diagnostic studies are inadequate when there are few false negatives: a simulation study

BMC Med Res Methodol. 2008 Nov 11:8:75. doi: 10.1186/1471-2288-8-75.

Authors

Angel M Cronin¹, Andrew J Vickers

Affiliation

¹ Department of Epidemiology and Biostatistics, Memorial Sloan-Kettering Cancer Center, NY, NY 10021, USA. serioa@mskcc.org

Abstract

Background: A common feature of diagnostic research is that results for a diagnostic gold standard are available primarily for patients who are positive for the test under investigation. Data from such studies are subject to what has been termed "verification bias". We evaluated statistical methods for verification bias correction when there are few false negatives.

Methods: A simulation study was conducted of a screening study subject to verification bias. We compared estimates of the area-under-the-curve (AUC) corrected for verification bias varying both the rate and mechanism of verification.

Results: In a single simulated data set, varying false negatives from 0 to 4 led to verification bias corrected AUCs ranging from 0.550 to 0.852. Excess variation associated with low numbers of false negatives was confirmed in simulation studies and by analyses of published studies that incorporated verification bias correction. The 2.5th - 97.5th centile range constituted as much as 60% of the possible range of AUCs for some simulations.

Conclusion: Screening programs are designed such that there are few false negatives. Standard statistical methods for verification bias correction are inadequate in this circumstance.

Publication types

Research Support, N.I.H., Extramural
Research Support, Non-U.S. Gov't

MeSH terms

Area Under Curve*
Bias
Computer Simulation
Coronary Artery Disease / diagnostic imaging
Diagnostic Tests, Routine / standards*
False Negative Reactions
Female
Humans
Male
Predictive Value of Tests
Prostatic Neoplasms / diagnosis
Radionuclide Imaging
Uterine Cervical Neoplasms / diagnosis

Abstract

Publication types

MeSH terms

Grants and funding