Feedback

Drug–disease association prediction with literature based multi-feature fusion

Affiliation
Department of Biomedical Engineering ,School of Life Science ,Beijing Institute of Technology ,Beijing ,China
Kang, Hongyu;
Affiliation
Institute of Medical Information ,Chinese Academy of Medical Sciences and Peking Union Medical College ,Beijing ,China
Hou, Li;
Affiliation
Institute of Medical Information ,Chinese Academy of Medical Sciences and Peking Union Medical College ,Beijing ,China
Gu, Yaowen;
Affiliation
Department of Biomedical Engineering ,School of Life Science ,Beijing Institute of Technology ,Beijing ,China
Lu, Xiao;
Affiliation
Institute of Medical Information ,Chinese Academy of Medical Sciences and Peking Union Medical College ,Beijing ,China
Li, Jiao;
Affiliation
Department of Biomedical Engineering ,School of Life Science ,Beijing Institute of Technology ,Beijing ,China
Li, Qin

Introduction: Exploring the potential efficacy of a drug is a valid approach for drug development with shorter development times and lower costs. Recently, several computational drug repositioning methods have been introduced to learn multi-features for potential association prediction. However, fully leveraging the vast amount of information in the scientific literature to enhance drug-disease association prediction is a great challenge. Methods: We constructed a drug-disease association prediction method called Literature Based Multi-Feature Fusion (LBMFF), which effectively integrated known drugs, diseases, side effects and target associations from public databases as well as literature semantic features. Specifically, a pre-training and fine-tuning BERT model was introduced to extract literature semantic information for similarity assessment. Then, we revealed drug and disease embeddings from the constructed fusion similarity matrix by a graph convolutional network with an attention mechanism. Results: LBMFF achieved superior performance in drug-disease association prediction with an AUC value of 0.8818 and an AUPR value of 0.5916. Discussion: LBMFF achieved relative improvements of 31.67% and 16.09%, respectively, over the second-best results, compared to single feature methods and seven existing state-of-the-art prediction methods on the same test datasets. Meanwhile, case studies have verified that LBMFF can discover new associations to accelerate drug development. The proposed benchmark dataset and source code are available at: https://github.com/kang-hongyu/LBMFF .

Cite

Citation style:
Could not load citation form.

Access Statistic

Total:
Downloads:
Abtractviews:
Last 12 Month:
Downloads:
Abtractviews:

Rights

License Holder: Copyright © 2023 Kang, Hou, Gu, Lu, Li and Li.

Use and reproduction: