Machine Learning Based on Multi-Parametric MRI to Predict Risk of Breast Cancer

Front Oncol. 2021 Feb 26:11:570747. doi: 10.3389/fonc.2021.570747. eCollection 2021.

Abstract

Purpose: Machine learning (ML) can extract high-throughput features of images to predict disease. This study aimed to develop nomogram of multi-parametric MRI (mpMRI) ML model to predict the risk of breast cancer.

Methods: The mpMRI included non-enhanced and enhanced T1-weighted imaging (T1WI), T2-weighted imaging (T2WI), apparent diffusion coefficient (ADC), K trans, K ep, V e, and V p. Regions of interest were annotated in an enhanced T1WI map and mapped to other maps in every slice. 1,132 features and top-10 principal components were extracted from every parameter map. Single-parametric and multi-parametric ML models were constructed via 10 rounds of five-fold cross-validation. The model with the highest area under the curve (AUC) was considered as the optimal model and validated by calibration curve and decision curve. Nomogram was built with the optimal ML model and patients' characteristics.

Results: This study involved 144 malignant lesions and 66 benign lesions. The average age of patients with benign and malignant lesions was 42.5 years old and 50.8 years old, respectively, which were statistically different. The sixth and fourth principal components of K trans had more importance than others. The AUCs of K trans, K ep, V e and V p, non-enhanced T1WI, enhanced T1WI, T2WI, and ADC models were 0.86, 0.81, 0.81, 0.83, 0.79, 0.81, 0.84, and 0.83 respectively. The model with an AUC of 0.90 was considered as the optimal model which was validated by calibration curve and decision curve. Nomogram for the prediction of breast cancer was built with the optimal ML models and patient age.

Conclusion: Nomogram could improve the ability of breast cancer prediction preoperatively.

Keywords: breast cancer; machine learning; multi-parametric MRI; nomogram; risk prediction.