Integrating image and gene-data with a semi-supervised attention model for prediction of KRAS gene mutation status in non-small cell lung cancer

PLoS One. 2024 Mar 11;19(3):e0297331. doi: 10.1371/journal.pone.0297331. eCollection 2024.

Abstract

KRAS is a pathogenic gene frequently implicated in non-small cell lung cancer (NSCLC). However, biopsy as a diagnostic method has practical limitations. Therefore, it is important to accurately determine the mutation status of the KRAS gene non-invasively by combining NSCLC CT images and genetic data for early diagnosis and subsequent targeted therapy of patients. This paper proposes a Semi-supervised Multimodal Multiscale Attention Model (S2MMAM). S2MMAM comprises a Supervised Multilevel Fusion Segmentation Network (SMF-SN) and a Semi-supervised Multimodal Fusion Classification Network (S2MF-CN). S2MMAM facilitates the execution of the classification task by transferring the useful information captured in SMF-SN to the S2MF-CN to improve the model prediction accuracy. In SMF-SN, we propose a Triple Attention-guided Feature Aggregation module for obtaining segmentation features that incorporate high-level semantic abstract features and low-level semantic detail features. Segmentation features provide pre-guidance and key information expansion for S2MF-CN. S2MF-CN shares the encoder and decoder parameters of SMF-SN, which enables S2MF-CN to obtain rich classification features. S2MF-CN uses the proposed Intra and Inter Mutual Guidance Attention Fusion (I2MGAF) module to first guide segmentation and classification feature fusion to extract hidden multi-scale contextual information. I2MGAF then guides the multidimensional fusion of genetic data and CT image data to compensate for the lack of information in single modality data. S2MMAM achieved 83.27% AUC and 81.67% accuracy in predicting KRAS gene mutation status in NSCLC. This method uses medical image CT and genetic data to effectively improve the accuracy of predicting KRAS gene mutation status in NSCLC.

MeSH terms

  • Biopsy
  • Carcinoma, Non-Small-Cell Lung* / diagnostic imaging
  • Carcinoma, Non-Small-Cell Lung* / genetics
  • Humans
  • Image Processing, Computer-Assisted
  • Lung Neoplasms* / diagnostic imaging
  • Lung Neoplasms* / genetics
  • Mutation
  • Proto-Oncogene Proteins p21(ras) / genetics

Substances

  • Proto-Oncogene Proteins p21(ras)
  • KRAS protein, human

Grants and funding

This work was supported by the National Natural Science Foundation of China (Grant No. U21A20469); the National Natural Science Foundation of China (Grant No. 61972274); the Central Government Guides Local Science and Technology Development Fund Project (Grant No. YDZJSX2022C004); the Natural Science Foundation of Shanxi Province (Grant No. 202103021224066); and NHC Key Laboratory of Pneumoconiosis Shanxi China Project, (Grant No.2020-PT320-005), the Non-profit Central Research Institute Fund of Chinese Academy of Medical Science. The funders had a role in decision to publish and preparation of the manuscript.