| Literature DB >> 32585624 |
Congcong Yan1, Zicheng Zhang1, Siqi Bao1, Ping Hou1, Meng Zhou1, Chongyong Xu2, Jie Sun3.
Abstract
Long non-coding RNAs (lncRNAs) have been recognized as critical components of a broad genomic regulatory network and play pivotal roles in physiological and pathological processes. Identification of disease-associated lncRNAs is becoming increasingly crucial for fundamentally improving our understanding of molecular mechanisms of disease and developing novel biomarkers and therapeutic targets. Considering lower efficiency and higher time and labor cost of biological experiments, computer-aided inference of disease-associated RNAs has become a promising avenue for facilitating the study of lncRNA functions and provides complementary value for experimental studies. In this study, we first summarize data and knowledge resources publicly available for the study of lncRNA-disease associations. Then, we present an updated systematic overview of dozens of computational methods and models for inferring lncRNA-disease associations proposed in recent years. Finally, we explore the perspectives and challenges for further studies. Our study provides a guide for biologists and medical scientists to look for dedicated resources and more competent tools for accelerating the unraveling of disease-associated lncRNAs.Entities:
Keywords: bioinformatics; computational method; disease; lncRNA-disease association; long non-coding RNAs
Year: 2020 PMID: 32585624 PMCID: PMC7321789 DOI: 10.1016/j.omtn.2020.05.018
Source DB: PubMed Journal: Mol Ther Nucleic Acids ISSN: 2162-2531 Impact factor: 8.886
Figure 1Schematic Workflow of Matrix Completion-Based Methods
Three matrices (including the lncRNA-disease association matrix, lncRNA-lncRNA matrix, and disease-disease matrix) were first obtained as the input data. Then, feature extraction was accomplished based on the above three matrices to obtain lncRNA feature vectors and disease feature vectors. Finally, matrix completion methods were performed on the lncRNA-disease association matrix to acquire the lncRNA-disease association.
Overview of Categories and Corresponding Method/Tool for Acquiring lncRNA-lncRNA Association
| Categories | Method/Tool | Data Types | Data Resources | References |
|---|---|---|---|---|
| Sequence similarity | EMBOSS Needle tool | lncRNA sequence | LncRNADisease, UCSC, LNCipedia | Needleman and Wunsch |
| Functional similarity | LNCSIM | lncRNA-disease association, MeSH descriptors | LncRNADisease, Lnc2Cancer, MNDR, MeSH | Chen et al. |
| Functional similarity | ILNCSIM | lncRNA-disease association, MeSH descriptors | MNDR, Lnc2Cancer, MeSH | Huang et al. |
| Functional similarity | NA | lncRNA-gene association, protein-protein interaction | LncRNA2Target, StarBase, HPRD | Paik et al. |
| Functional similarity | NA | lncRNA-miRNA association | StarBase | Zhao et al. |
| Expression similarity | Spearman/Pearson correlation | lncRNA expression profiles | Array Express, UCSC Genome Bioinformatics | Chen and Yan |
| Cosine similarity | cosine similarity | lncRNA-disease association | MNDR, Lnc2Cancer, LncRNADisease | Cheng et al. |
NA, not applicable.
Figure 2Schematic Workflow of Resource Allocation-Based Methods
Multi-type data source matrices were first obtained as the input data. Then, a heterogeneous multilayer network is constructed, and the edges are weighted by the corresponding values of the matrix. Finally, the lncRNA-disease scoring matrix was produced by post-processing resource allocation on the heterogeneous network.
Overview of Categories and Corresponding Method/Tool for Acquiring Disease-Disease Association
| Categories | Method/Tool | Data Types | Data Resources | References |
|---|---|---|---|---|
| Semantic similarity | R package DOSE | MeSH descriptor | Disease Ontology, MeSH | Yu and Wang |
| Semantic similarity | NA | MeSH descriptor, Disease Ontology terms | MeSH, DincRNA | Chen et al. |
| Functional similarity | Jaccard coefficient | disease-gene association, gene-Gene Ontology terms association | Ensembl, DisGeNET | Mathur and Dinakarpandian |
| Functional similarity | NA | disease-miRNA association | HMDD | Zhao et al. |
| Gaussian interaction profile kernel similarity | Gaussian interaction profile kernel similarity/radial basis function (RBF) kernel similarity | disease-miRNA association, disease-gene association, lncRNA-disease association, sequence, expression | DisGeNet, HMDD, MNDR, Lnc2Cancer, LncRNADisease | Chen and Yan |
| Cosine similarity | Cosine similarity | lncRNA-disease association | MNDR, Lnc2Cancer, LncRNADisease | Hamaneh and Yu |
NA, not applicable.
Figure 3Schematic Workflow of Recommendation Algorithm-Based Methods
Multi-type data source matrices were first obtained as the input data. Then, recommendation matrices at multiple levels (e.g., lncRNAs, miRNAs) are obtained by applying a recommendation system algorithm. Finally, the possibility of the potential relationship between lncRNA and disease is measured through the combination of the recommendation matrices.
Overview of Matrix Completion-Based Computational Methods for Inferring lncRNA-Disease Association
| Method Name | Computational Principle | Data Types | Available Tool (Package or Code) | References |
|---|---|---|---|---|
| SIMCLDA | inductive matrix completion, singular value decomposition | lncRNA-disease association, lncRNA-lncRNA association, disease-disease association | code ( | Lu et al. |
| LDAPM | inductive matrix completion, singular value decomposition | lncRNA-disease association, lncRNA-lncRNA association, disease-disease association | NA | Fraidouni and Zaruba |
| FRMCLDA | faster randomized matrix completion, faster singular value threshold | lncRNA-disease association, lncRNA-lncRNA association, disease-disease association | code ( | Li et al. |
| TSSR | sparse self-representation | lncRNA-disease association, lncRNA-lncRNA association, disease-disease association | code ( | Ou-Yang et al. |
NA, not applicable.
Overview of Resource Allocation-Based Computational Methods for Inferring lncRNA-Disease Association
| Method Name | Computational Principle | Data Types | Available Tool (Package or Code) | References |
|---|---|---|---|---|
| BPLLDA | paths together with a decay function | lncRNA-disease association, lncRNA-lncRNA association, disease-disease association | NA | Xiao et al. |
| TPGLDA | resource allocation | disease-gene association, lncRNA-disease association, lncRNA-lncRNA association, disease-disease association | code ( | Ding et al. |
| IDHI-MIRW | positive pointwise mutual information, random walk with restart algorithm | lncRNA-disease association, lncRNA-lncRNA association, disease-disease association | IDHI-MIRW ( | Fan et al. |
| Lap-BiRWRHLDA | Laplacian normalization, random walks | lncRNA-disease association, lncRNA-lncRNA association, disease-disease association | NA | Wen et al. |
| IIRWR | random walk with restart algorithm | lncRNA-disease association, lncRNA-lncRNA association, disease-disease association | code ( | Wang et al. |
| LLCLPLDA | label propagation algorithm, locality-constrained linear coding | lncRNA-disease association, lncRNA-lncRNA association, disease-disease association | NA | Xie et al. |
| LION | network diffusion approach | lncRNA-protein interaction, protein-protein interaction, protein-disease interaction | NA | Sumathipala et al. |
| NBLDA | label propagation algorithm | lncRNA-disease association, lncRNA-lncRNA association, disease-disease association | NA | Liu et al. |
| DislncRF | random forest | RNA sequencing data, disease-protein coding gene association, lncRNA-disease association | code ( | Pan et al. |
| NA | DeepWalk and a rule-based inference method | lncRNA-disease association, lncRNA-miRNA association, miRNA-disease association | code ( | Zhang et al. |
| NA | ncPred | disease-target association, target-ncRNA association, ncRNA-ncRNA association, target-target association | NA | Mori et al. |
| BiWalkLDA | Laplacian normalization, random walks | lncRNA-disease association, lncRNA-lncRNA association, disease-disease association | code ( | Gao et al. |
NA, not applicable.
Overview of Recommendation Algorithm-Based Computational Methods for Inferring lncRNA-Disease Association
| Method Name | Computational Principle | Data Types | Available Tool (Package or Code) | References |
|---|---|---|---|---|
| ILDMSF | similarity network fusion | lncRNA-disease association, lncRNA-lncRNA association, disease-disease association | NA | Chen et al. |
| NBCLDA | naive Bayesian, collaborative filtering | miRNA-disease association, miRNA-lncRNA association, lncRNA-disease association, disease-disease association | NA | Yu et al. |
| NCPLDA | network consistency projection | lncRNA-disease association, lncRNA-lncRNA association, disease-disease association | code ( | Li et al. |
| MFLDA | matrix factorization | lncRNA-miRNA association, lncRNA-gene association, lncRNA-Gene Ontology (GO) association, lncRNA-disease association, miRNA-gene association, miRNA-disease association, gene-disease association, gene-gene association, gene-drug association, drug-drug association, gene-GO association | code ( | Fu et al. |
| WMFLDA | matrix factorization | lncRNA-miRNA association, lncRNA-gene association, lncRNA-GO association, lncRNA-disease association, miRNA-gene association, miRNA-disease association, gene-disease association, gene-gene association, gene-drug association, drug-drug association, gene-GO association | code ( | Wang et al. |
| PMFILDA | probabilities matrix factorization | lncRNA-disease association, lncRNA-lncRNA association, disease-disease association, miRNA-disease association, miRNA-lncRNA association | NA | Xuan et al. |
| DNILMF-LDA | logistic matrix factorization, Bayesian optimization | lncRNA-disease association, lncRNA-lncRNA association, disease-disease association | NA | Li et al. |
| DSCMF | collaborative matrix factorization | lncRNA-disease association, lncRNA-lncRNA association, disease-disease association | NA | Gao et al. |
| NNLDA | matrix factorization | lncRNA-disease association | code ( | Hu et al. |
| NA | SimRank measure, common neighbor-based | lncRNA-disease association | NA | Ping et al. |
| CFNBC | naive Bayesian, collaborative filtering | miRNA-disease association, miRNA-lncRNA association, lncRNA-disease association, disease-disease association | code ( | Yu et al. |
| DCSLDA | distance correlation set | disease-disease association, lncRNA-disease association, miRNA-disease association, miRNA-LncRNA association, lncRNA-lncRNA association | NA | Zhao et al. |
| SKF-LDA | similarity kernel fusion | lncRNA-disease association, lncRNA-lncRNA association, disease-disease association | NA | Xie et al. |
| BLM-NPAI | bipartite local model | lncRNA-disease association, lncRNA-lncRNA association, disease-disease association | NA | Cui et al. |
NA, not applicable.
Overview of Multi-Model Integration-Based Computational Methods for Inferring lncRNA-Disease Association
| Method Name | Computational Principle | Data Types | Available Tool (Package or Code) | References |
|---|---|---|---|---|
| NA | weighted bagging lightGBM model | lncRNA-disease association, lncRNA-lncRNA association, disease-disease association | NA | Chen and Liu |
| LDASR | rotation forest | lncRNA-disease association | NA | Guo et al. |
| ECLDA | extreme learning machine, convolutional neural networks | lncRNA-disease association, lncRNA-lncRNA association, disease-disease association | NA | Guo et al. |
| CNNLDA | convolutional neural networks, attention mechanisms | lncRNA-disease association, lncRNA-lncRNA association, disease-disease association, miRNA-disease association, miRNA-lncRNA association, miRNA-miRNA association | NA | Xuan et al. |
| CNNDLP | convolutional neural networks, attention mechanisms | lncRNA-disease association, lncRNA-lncRNA association, disease-disease association, miRNA-disease association, miRNA-lncRNA association, miRNA-miRNA association | NA | Xuan et al. |
| GCNLDA | convolutional neural networks, graph convolutional network | lncRNA-disease association, lncRNA-lncRNA association, disease-disease association, miRNA-disease association, miRNA-lncRNA association, miRNA-miRNA association | NA | Xuan et al. |
| LDAPred | convolutional neural networks, information flow propagation | lncRNA-disease association, lncRNA-lncRNA association, disease-disease association, miRNA-disease association, miRNA-lncRNA association, miRNA-miRNA association | NA | Xuan et al. |
NA, not applicable.