Demonstration of the potential of white-box machine learning approaches to gain insights from cardiovascular disease electrocardiograms

PLoS One. 2020 Dec 17;15(12):e0243615. doi: 10.1371/journal.pone.0243615. eCollection 2020.

Abstract

We present the results from a white-box machine learning approach to detect cardiac arrhythmias using electrocardiographic data. A C5.0 is trained to recognize four classes using common features. The four classes are (i) atrial fibrillation and atrial flutter, (ii) tachycardias (iii), sinus bradycardia and (iv) sinus rhythm. Data from 10,646 subjects, 83% of whom have at least one arrhythmia and 17% of whom exhibit a normal sinus rhythm, are used. The C5.0 is trained using 10-fold cross-validation and is able to achieve a balanced accuracy of 95.35%. By using the white-box machine learning approach, a clear and comprehensible tree structure can be revealed, which has selected the 5 most important features from a total of 24 features. These 5 features are ventricular rate, RR-Interval variation, atrial rate, age and difference between longest and shortest RR-Interval. The combination of ventricular rate, RR-Interval variation and atrial rate is especially relevant to achieve classification accuracy, which can be disclosed through the tree. The tree assigns unique values to distinguish the classes. These findings could be applied in medicine in the future. It can be shown that a white-box machine learning approach can reveal granular structures, thus confirming known linear relationships and also revealing nonlinear relationships. To highlight the strength of the C5.0 with respect to this structural revelation, the results of further white-box machine learning and black-box machine learning algorithms are presented.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Adult
  • Aged
  • Aged, 80 and over
  • Algorithms
  • Arrhythmias, Cardiac / diagnosis*
  • Arrhythmias, Cardiac / physiopathology
  • Electrocardiography / methods
  • Female
  • Humans
  • Machine Learning*
  • Male
  • Middle Aged

Associated data

  • figshare/10.6084/m9.figshare.c.4560497.v2

Grants and funding

This research is funded by the Carl Zeiss Foundation and the German Federal Ministry of Education and Research (13FH4E05IA). The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.