Reliability of the European Society of Human Reproduction and Embryology/European Society for Gynaecological Endoscopy and American Society for Reproductive Medicine classification systems for congenital uterine anomalies detected using three-dimensional ultrasonography

Fertil Steril. 2015 Sep;104(3):688-97.e8. doi: 10.1016/j.fertnstert.2015.06.019. Epub 2015 Jul 7.

Abstract

Objective: To estimate the inter-rater/intrarater reliability of the European Society of Human Reproduction and Embryology/European Society for Gynaecological Endoscopy (ESHRE-ESGE) classification of congenital uterine malformations and to compare the results obtained with the reliability of the American Society for Reproductive Medicine (ASRM) classification supplemented with additional morphometric criteria.

Design: Reliability/agreement study.

Setting: Private clinic.

Patient(s): Uterine malformations (n = 50 patients, consecutively included) and normal uterus (n = 62 women, randomly selected) constituted the study. These were classified based on real-time three-dimensional ultrasound single volume transvaginal (or transrectal in the case of virgins, 4 cases) ultrasonography findings, which were assessed by an expert rater based on the ESHRE-ESGE criteria. The samples were obtained from women of reproductive age.

Intervention(s): Unprocessed three-dimensional datasets were independently evaluated offline by two experienced, blinded raters using both classification systems.

Main outcome measure(s): The κ-values and proportions of agreement.

Result(s): Standardized interpretation indicated that the ESHRE-ESGE system has substantial/good or almost perfect/very good reliability (κ >0.60 and >0.80), but the interpretation of the clinically relevant cutoffs of κ-values showed insufficient reliability for clinical use (κ < 0.90), especially in the diagnosis of septate uterus. The ASRM system had sufficient reliability (κ > 0.95).

Conclusion(s): The low reliability of the ESHRE-ESGE system may lead to a lack of consensus about the management of common uterine malformations and biased research interpretations. The use of the ASRM classification, supplemented with simple morphometric criteria, may be preferred if their sufficient reliability can be confirmed real-time in a large sample size.

Keywords: Müllerian ducts; agreement; inter-rater; intrarater; septate uterus.

Publication types

  • Comparative Study
  • Evaluation Study

MeSH terms

  • Case-Control Studies
  • Consensus
  • Female
  • Humans
  • Image Interpretation, Computer-Assisted
  • Imaging, Three-Dimensional*
  • Observer Variation
  • Predictive Value of Tests
  • Prospective Studies
  • Reproducibility of Results
  • Terminology as Topic*
  • Ultrasonography
  • Urogenital Abnormalities / classification*
  • Urogenital Abnormalities / diagnostic imaging*
  • Uterus / abnormalities*
  • Uterus / diagnostic imaging*