LDAH2V: Exploring Meta-Paths Across Multiple Networks for lncRNA-Disease Association Prediction

IEEE/ACM Trans Comput Biol Bioinform. 2021 Jul-Aug;18(4):1572-1581. doi: 10.1109/TCBB.2019.2946257. Epub 2021 Aug 6.

Abstract

Accumulating evidence has demonstrated that dysfunctions of long non-coding RNAs (lncRNAs) are involved in various complex human diseases. However, the relationships between lncRNAs and diseases remain unknown in most cases, and developing effective computational approaches to identify potential lncRNA-disease associations has become an active research topic. Existing network-based approaches usually focus on the intrinsic features of lncRNAs and diseases but ignore the heterogeneous information in biological networks. To address these limitations, we propose LDAH2V, an efficient computational framework for predicting potential lncRNA-disease associations. LDAH2V uses HIN2Vec to calculate the meta-path and feature vector for each lncRNA-disease pair in a heterogeneous information network (HIN) that consists of a lncRNA similarity network, a disease similarity network, a miRNA similarity network, and the associations between them. A Gradient Boosting Tree (GBT) classifier is then built on these feature vectors to predict lncRNA-disease associations. The results show that LDAH2V performs significantly better than four existing state-of-the-art methods and achieves an AUC of 0.97 in 10-fold cross-validation. Furthermore, the lncRNAs identified in case studies of colon cancer and ovarian cancer were confirmed by related databases and the medical literature.
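
To make the notion of a meta-path between a lncRNA-disease pair concrete, the following minimal sketch enumerates meta-path instances in a toy heterogeneous network. It is not HIN2Vec and not the authors' implementation: the node names, the edges, the `max_hops` limit, and the helper functions `node_type`, `build_adjacency`, and `count_meta_paths` are all hypothetical and chosen only for illustration. In a pipeline like the one described, learned feature vectors (rather than these raw counts) would be passed to the classifier.

```python
from collections import defaultdict

# Hypothetical toy network. Every node and edge here is invented for
# illustration; none of it comes from the paper's data.
EDGES = [
    ("lncRNA:L1", "miRNA:M1"),     # lncRNA-miRNA association
    ("miRNA:M1", "disease:D1"),    # miRNA-disease association
    ("lncRNA:L1", "lncRNA:L2"),    # lncRNA similarity
    ("lncRNA:L2", "disease:D1"),   # known lncRNA-disease association
    ("disease:D1", "disease:D2"),  # disease similarity
    ("lncRNA:L1", "disease:D2"),   # known lncRNA-disease association
    ("miRNA:M1", "miRNA:M2"),      # miRNA similarity
    ("miRNA:M2", "disease:D1"),    # miRNA-disease association
]


def node_type(node):
    """Return the type prefix of a node label such as 'lncRNA:L1'."""
    return node.split(":", 1)[0]


def build_adjacency(edges):
    """Build an undirected adjacency map from a list of (u, v) edges."""
    adj = defaultdict(set)
    for u, v in edges:
        adj[u].add(v)
        adj[v].add(u)
    return adj


def count_meta_paths(adj, source, target, max_hops=3):
    """Count simple paths from source to target, grouped by meta-path.

    A meta-path is the sequence of node types along a path, for example
    'lncRNA-miRNA-disease'. Paths stop as soon as they reach the target
    and never use more than max_hops edges.
    """
    counts = defaultdict(int)

    def walk(node, path, types):
        if node == target and len(path) > 1:
            counts["-".join(types)] += 1
            return
        if len(path) - 1 == max_hops:
            return
        for nxt in sorted(adj[node]):
            if nxt not in path:  # keep paths simple (no revisits)
                walk(nxt, path + [nxt], types + [node_type(nxt)])

    walk(source, [source], [node_type(source)])
    return dict(counts)


if __name__ == "__main__":
    adjacency = build_adjacency(EDGES)
    result = count_meta_paths(adjacency, "lncRNA:L1", "disease:D1")
    for meta_path, n in sorted(result.items()):
        print(meta_path, n)
```

Each key in the result is one meta-path type connecting the pair, and its value is the number of path instances of that type. Capping the path length keeps the enumeration tractable on larger networks.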

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Algorithms
  • Colonic Neoplasms / genetics
  • Colonic Neoplasms / metabolism
  • Computational Biology / methods*
  • Female
  • Genetic Predisposition to Disease / genetics*
  • Humans
  • MicroRNAs / genetics
  • Ovarian Neoplasms / genetics
  • Ovarian Neoplasms / metabolism
  • RNA, Long Noncoding* / genetics
  • RNA, Long Noncoding* / metabolism

Substances

  • MicroRNAs
  • RNA, Long Noncoding