Literature DB >> 29228285

Matrix factorization-based data fusion for the prediction of lncRNA-disease associations.

Guangyuan Fu1, Jun Wang1, Carlotta Domeniconi2, Guoxian Yu1.   

Abstract

Motivation: Long non-coding RNAs (lncRNAs) play crucial roles in complex disease diagnosis, prognosis, prevention and treatment, but only a small portion of lncRNA-disease associations have been experimentally verified. Various computational models have been proposed to identify lncRNA-disease associations by integrating heterogeneous data sources. However, existing models generally ignore the intrinsic structure of data sources or treat them as equally relevant, while they may not be.
Results: To accurately identify lncRNA-disease associations, we propose a Matrix Factorization based LncRNA-Disease Association prediction model (MFLDA in short). MFLDA decomposes data matrices of heterogeneous data sources into low-rank matrices via matrix tri-factorization to explore and exploit their intrinsic and shared structure. MFLDA can select and integrate the data sources by assigning different weights to them. An iterative solution is further introduced to simultaneously optimize the weights and low-rank matrices. Next, MFLDA uses the optimized low-rank matrices to reconstruct the lncRNA-disease association matrix and thus to identify potential associations. In 5-fold cross validation experiments to identify verified lncRNA-disease associations, MFLDA achieves an area under the receiver operating characteristic curve (AUC) of 0.7408, at least 3% higher than those given by state-of-the-art data fusion based computational models. An empirical study on identifying masked lncRNA-disease associations again shows that MFLDA can identify potential associations more accurately than competing models. A case study on identifying lncRNAs associated with breast, lung and stomach cancers show that 38 out of 45 (84%) associations predicted by MFLDA are supported by recent biomedical literature and further proves the capability of MFLDA in identifying novel lncRNA-disease associations. MFLDA is a general data fusion framework, and as such it can be adopted to predict associations between other biological entities. Availability and implementation: The source code for MFLDA is available at: http://mlda.swu.edu.cn/codes.php? name = MFLDA. Contact: gxyu@swu.edu.cn. Supplementary information: Supplementary data are available at Bioinformatics online.

Entities:  

Mesh:

Substances:

Year:  2018        PMID: 29228285     DOI: 10.1093/bioinformatics/btx794

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


  34 in total

Review 1.  RWSF-BLP: a novel lncRNA-disease association prediction model using random walk-based multi-similarity fusion and bidirectional label propagation.

Authors:  Guobo Xie; Bin Huang; Yuping Sun; Changhai Wu; Yuqiong Han
Journal:  Mol Genet Genomics       Date:  2021-02-15       Impact factor: 3.291

2.  ncRPheno: a comprehensive database platform for identification and validation of disease related noncoding RNAs.

Authors:  Wenliang Zhang; Guocai Yao; Jianbo Wang; Minglei Yang; Jing Wang; Haiyue Zhang; Weizhong Li
Journal:  RNA Biol       Date:  2020-03-26       Impact factor: 4.652

3.  M2PP: a novel computational model for predicting drug-targeted pathogenic proteins.

Authors:  Shiming Wang; Jie Li; Yadong Wang
Journal:  BMC Bioinformatics       Date:  2022-01-04       Impact factor: 3.169

4.  HBRWRLDA: predicting potential lncRNA-disease associations based on hypergraph bi-random walk with restart.

Authors:  Guobo Xie; Yinting Zhu; Zhiyi Lin; Yuping Sun; Guosheng Gu; Jianming Li; Weiming Wang
Journal:  Mol Genet Genomics       Date:  2022-06-25       Impact factor: 2.980

5.  Heterogeneous graph neural network for lncRNA-disease association prediction.

Authors:  Hong Shi; Xiaomeng Zhang; Lin Tang; Lin Liu
Journal:  Sci Rep       Date:  2022-10-20       Impact factor: 4.996

6.  DSCMF: prediction of LncRNA-disease associations based on dual sparse collaborative matrix factorization.

Authors:  Jin-Xing Liu; Ming-Ming Gao; Zhen Cui; Ying-Lian Gao; Feng Li
Journal:  BMC Bioinformatics       Date:  2021-05-12       Impact factor: 3.169

7.  Dual Attention Mechanisms and Feature Fusion Networks Based Method for Predicting LncRNA-Disease Associations.

Authors:  Yu Liu; Yingying Yu; Shimin Zhao
Journal:  Interdiscip Sci       Date:  2022-01-24       Impact factor: 2.233

8.  A Novel Probability Model for LncRNA⁻Disease Association Prediction Based on the Naïve Bayesian Classifier.

Authors:  Jingwen Yu; Pengyao Ping; Lei Wang; Linai Kuang; Xueyong Li; Zhelun Wu
Journal:  Genes (Basel)       Date:  2018-07-08       Impact factor: 4.096

9.  Computational Methods and Applications for Identifying Disease-Associated lncRNAs as Potential Biomarkers and Therapeutic Targets.

Authors:  Congcong Yan; Zicheng Zhang; Siqi Bao; Ping Hou; Meng Zhou; Chongyong Xu; Jie Sun
Journal:  Mol Ther Nucleic Acids       Date:  2020-05-21       Impact factor: 8.886

10.  PWCDA: Path Weighted Method for Predicting circRNA-Disease Associations.

Authors:  Xiujuan Lei; Zengqiang Fang; Luonan Chen; Fang-Xiang Wu
Journal:  Int J Mol Sci       Date:  2018-10-31       Impact factor: 5.923

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.