The kappa statistic in rehabilitation research: an examination

Arch Phys Med Rehabil. 2004 Aug;85(8):1371-6. doi: 10.1016/j.apmr.2003.12.002.

Abstract

The number and sophistication of statistical procedures reported in medical rehabilitation research is increasing. Application of the principles and methods associated with evidence-based practice has contributed to the need for rehabilitation practitioners to understand quantitative methods in published articles. Outcomes measurement and determination of reliability are areas that have experienced rapid change during the past decade. In this study, distinctions between reliability and agreement are examined. Information is presented on analytical approaches for addressing reliability and agreement with the focus on the application of the kappa statistic. The following assumptions are discussed: (1) kappa should be used with data measured on a categorical scale, (2) the patients or objects categorized should be independent, and (3) the observers or raters must make their measurement decisions and judgments independently. Several issues related to using kappa in measurement studies are described, including use of weighted kappa, methods of reporting kappa, the effect of bias and prevalence on kappa, and sample size and power requirements for kappa. The kappa statistic is useful for assessing agreement among raters, and it is being used more frequently in rehabilitation research. Correct interpretation of the kappa statistic depends on meeting the required assumptions and accurate reporting.

Publication types

  • Research Support, Non-U.S. Gov't
  • Research Support, U.S. Gov't, P.H.S.
  • Review

MeSH terms

  • Activities of Daily Living
  • Analysis of Variance
  • Data Collection
  • Data Interpretation, Statistical*
  • Evidence-Based Medicine / methods*
  • Evidence-Based Medicine / standards
  • Humans
  • Observer Variation
  • Outcome Assessment, Health Care
  • Prevalence
  • Psychometrics
  • Rehabilitation* / standards
  • Reproducibility of Results
  • Research / standards
  • Research Design* / standards
  • Sample Size
  • Statistics, Nonparametric