LUNCRW: Prediction of potential lncRNA-disease associations based on unbalanced neighborhood constraint random walk

Anal Biochem. 2023 Oct 15:679:115297. doi: 10.1016/j.ab.2023.115297. Epub 2023 Aug 22.

Abstract

Accumulating evidence suggests that long non-coding RNAs (lncRNAs) are associated with various complex human diseases. They can serve as disease biomarkers and hold considerable promise for the prevention and treatment of various diseases. The traditional random walk algorithms generally exclude the effect of non-neighboring nodes on random walking. In order to overcome the issue, the neighborhood constraint (NC) approach is proposed in this study for regulating the direction of the random walk by computing the effects of both neighboring nodes and non-neighboring nodes. Then the association matrix is updated by matrix multiplication for minimizing the effect of the false negative data. The heterogeneous lncRNA-disease network is finally analyzed using an unbalanced random walk method for predicting the potential lncRNA-disease associations. The LUNCRW model is therefore developed for predicting potential lncRNA-disease associations. The area under the curve (AUC) values of the LUNCRW model in leave-one-out cross-validation and five-fold cross-validation were 0.951 and 0.9486 ± 0.0011, respectively. Data from published case studies on three diseases, including squamous cell carcinoma, hepatocellular carcinoma, and renal cell carcinoma, confirmed the predictive potential of the LUNCRW model. Altogether, the findings indicated that the performance of the LUNCRW method is superior to that of existing methods in predicting potential lncRNA-disease associations.

Keywords: AUC; Association prediction; LNS; LncRNA-disease; Neighborhood constraint; Unbalanced random walk.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Algorithms
  • Area Under Curve
  • Humans
  • Kidney Neoplasms*
  • RNA, Long Noncoding* / genetics
  • Walking

Substances

  • RNA, Long Noncoding