• We are sorry, but NCBI web applications do not support your browser and may not function properly. More information
Logo of bmjLink to Publisher's site
BMJ. May 15, 1999; 318(7194): 1322–1323.

Reporting of precision of estimates for diagnostic accuracy: a review

Robert Harper, principal optometrista and Barnaby Reeves, senior lecturerb

Diagnostic accuracy is usually characterised by the sensitivity and specificity of a test, and these indices are most commonly presented when evaluations of diagnostic tests are reported. It is important to emphasise that, as in other empirical studies, specific values of diagnostic accuracy are merely estimates. Therefore, when evaluations of diagnostic accuracy are reported the precision of the sensitivity and specificity estimates or likelihood ratios should be stated.13 If sensitivity and specificity estimates are reported without a measure of precision, clinicians cannot know the range within which the true values of the indices are likely to lie.

Confidence intervals are widely used in medical literature, and journals usually require confidence intervals to be specified for other descriptive estimates and for epidemiological or experimental analytical comparisons. Journals seem less vigilant, however, for evaluations of diagnostic accuracy. For example, a recent review of compliance with methodological standards in diagnostic test research found that for the period 1978-93 only 12 of 112 studies published in the New England Journal of Medicine, JAMA, the BMJ, and the Lancet reported the precision of the estimates of diagnostic accuracy.3 We have found that the reporting of 95% confidence intervals for estimates is somewhat better in a more recent 2 year interval for studies published in the BMJ but still far from ideal.

Methods and results

We searched the Medline database (for 1996 and 1997) for reports of diagnostic evaluations in the BMJ. After we excluded letters, case reports, and review or education articles we identified 16 studies (references supplied on request). Only eight (95% confidence interval 25% to 75%) papers reported precision for the estimates of diagnostic accuracy, with two of these studies providing confidence intervals only for either predictive power values or likelihood ratios but not for the sensitivity or specificity estimates also reported.


Evaluations of diagnostic accuracy should be prescribed with confidence intervals. We have also recently reviewed the extent of compliance with the reporting of confidence intervals in the ophthalmic literature and concluded that evaluations of diagnostic tests in this specialty are similarly flawed.4 The omission of the precision of estimates for diagnostic accuracy can make a considerable difference to a clinician's interpretation of the findings of a study. For example, an evaluation of the sensitivity and specificity of an imaging system for the optic nerve head for the detection of glaucoma reported estimates of 89% and 78%, respectively5; the 95% confidence intervals of these estimates (not reported in the paper) ranged from 80% to 98% for sensitivity and from 66% to 90% for specificity. For a test with poorer diagnostic accuracy, these 95% confidence intervals would have been even larger for an equivalent sample size because of the dependence of the standard error of a proportion on the proportion itself (figure). The figure shows how the precision of the sensitivity or specificity estimate varies as a function of both the point estimate itself and the sample size.

Most statistical packages will generate exact binomial confidence intervals. Approximate confidence intervals can easily be calculated by using the formula for the SE of a proportion (√pq/n), which is based on a binomial approximation to the normal distribution and can be used to calculate 95% confidence intervals for sensitivity and specificity (for instance, p±1.96√pq/n, where p represents either sensitivity or specificity, q=1−p, n is the sample size, and where n×p is >10).

To enhance the quality of information on diagnostic tests made available to clinicians we recommend that 95% confidence intervals are supplied with estimates of diagnostic accuracy. Referees and journal editors should enforce this requirement in the same way as they routinely do for other descriptive or comparative estimates.

Breadth of exact binomial 95% confidence intervals as function of sample estimate of proportion of interest and sample size; from outside to centre, pairs of lines represent sample sizes of 20, 40, 60, 100, 200, and 500. Note 95% confidence ...


Funding: No external funding.

Competing interests: None declared.


1. Jaeschke A, Guyatt GH, Sackett DL.for the Evidence-based Medicine Working Group. Users’ guides to the medical literature. III. How to use an article about a diagnostic test. A. Are the results of the study valid? JAMA 1994. 271389–391.391 [PubMed]
2. Jaeschke A, Guyatt GH, Sackett DL.for the Evidence-based Medicine Working Group. Users' guides to the medical literature. III. How to use an article about a diagnostic test B. What are the results and will they help me in caring for patients? JAMA 1994. 271703–707.707 [PubMed]
3. Reid MC, Lachs MS, Feinstein AR. Use of methodological standards in diagnostic test research: getting better but still not good. JAMA. 1995;274:645–651. [PubMed]
4. Harper R, Reeves B. Compliance with methodological standards when evaluating ophthalmic diagnostic tests. Optom Vis Sci. 1998;75:78. [PubMed]
5. Mikelberg FS, Parfitt CM, Swindale SL, Graham SL, Drance SM, Gosine R. Ability of the Heidelberg retina tomograph to detect early glaucomatous visual field loss. J Glaucoma. 1995;4:242–247. [PubMed]

Articles from BMJ : British Medical Journal are provided here courtesy of BMJ Group
PubReader format: click here to try


Related citations in PubMed

See reviews...See all...

Cited by other articles in PMC

See all...


  • PubMed
    PubMed citations for these articles

Recent Activity

Your browsing activity is empty.

Activity recording is turned off.

Turn recording back on

See more...