| Literature DB >> 31510011 |
Ping Xuan1,2, Lan Jia3, Tiangang Zhang4, Nan Sheng5, Xiaokun Li6, Jinbao Li7.
Abstract
Long non-coding RNAs (lncRNAs) play a crucial role in the pathogenesis and development of complex diseases. Predicting potential lncRNA-disease associations can improve our understanding of the molecular mechanisms of human diseases and help identify biomarkers for disease diagnosis, treatment, and prevention. Previous research methods have mostly integrated the similarity and association information of lncRNAs and diseases, without considering the topological structure information among these nodes, which is important for predicting lncRNA-disease associations. We propose a method based on information flow propagation and convolutional neural networks, called LDAPred, to predict disease-related lncRNAs. LDAPred not only integrates the similarities, associations, and interactions among lncRNAs, diseases, and miRNAs, but also exploits the topological structures formed by them. In this study, we construct a dual convolutional neural network-based framework that comprises the left and right sides. The embedding layer on the left side is established by utilizing lncRNA, miRNA, and disease-related biological premises. On the right side of the frame, multiple types of similarity, association, and interaction relationships among lncRNAs, diseases, and miRNAs are calculated based on information flow propagation on the bi-layer networks, such as the lncRNA-disease network. They contain the network topological structure and they are learned by the right side of the framework. The experimental results based on five-fold cross-validation indicate that LDAPred performs better than several state-of-the-art methods. Case studies on breast cancer, colon cancer, and osteosarcoma further demonstrate LDAPred's ability to discover potential lncRNA-disease associations.Entities:
Keywords: convolutional neural network; deep learning; information flow propagation; lncRNA–disease association; network topological structure
Mesh:
Substances:
Year: 2019 PMID: 31510011 PMCID: PMC6771133 DOI: 10.3390/ijms20184458
Source DB: PubMed Journal: Int J Mol Sci ISSN: 1422-0067 Impact factor: 5.923
Figure 1(a) Receiver operating characteristic (ROC) curves of LDAPred and the other four methods. (b) Precision–recall (PR) curves of LDAPred and the other four methods.
Area under ROC curves (AUC) of LDAPred and other methods for all diseases and 10 well-characterized diseases.
| Disease Name | Percentage of Disease-Related lncRNAs | AUC | ||||
|---|---|---|---|---|---|---|
| LDAPred | SIMCLDA | Ping’s Method | MFLDA | LDAP | ||
| Respiratory system cancer | 1.1% |
| 0.789 | 0.911 | 0.719 | 0.891 |
| Organ system cancer | 1.6% |
| 0.820 | 0.950 | 0.729 | 0.884 |
| Intestinal cancer | 2.3% |
| 0.811 | 0.909 | 0.559 | 0.905 |
| Prostate cancer | 1.0% |
| 0.873 | 0.826 | 0.553 | 0.711 |
| Lung cancer | 1.1% | 0.833 | 0.790 |
| 0.676 | 0.883 |
| Breast cancer | 0.1% |
| 0.742 | 0.871 | 0.517 | 0.830 |
| Reproductive organ cancer | 1.1% |
| 0.707 | 0.818 | 0.741 | 0.742 |
| Gastrointestinal system cancer | 0.1% |
| 0.784 | 0.896 | 0.582 | 0.867 |
| Liver cancer | 1.5% |
| 0.799 | 0.910 | 0.634 | 0.898 |
| Hepatocellular carcinoma | 1.5% | 0.867 | 0.765 |
| 0.688 | 0.902 |
The bold values indicate the higher AUCs.
Area under PR curves (AUPR) of LDAPred and other methods for all diseases and 10 well-characterized diseases.
| Disease Name | AUPR | ||||
|---|---|---|---|---|---|
| LDAPred | SIMCLDA | Ping’s Method | MFLDA | LDAP | |
| Respiratory system cancer | 0.178 | 0.149 |
| 0.072 | 0.303 |
| Organ system cancer | 0.029 | 0.411 |
| 0.338 | 0.628 |
| Intestinal cancer |
| 0.141 | 0.252 | 0.042 | 0.246 |
| Prostate cancer |
| 0.176 | 0.333 | 0.095 | 0.297 |
| Lung cancer |
| 0.138 | 0.334 | 0.008 | 0.094 |
| Breast cancer | 0.125 | 0.445 |
| 0.476 | 0.629 |
| Reproductive organ cancer |
| 0.047 | 0.403 | 0.031 | 0.396 |
| Gastrointestinal system cancer |
| 0.130 | 0.271 | 0.104 | 0.238 |
| Liver cancer |
| 0.201 | 0.526 | 0.086 | 0.498 |
| Hepatocellular carcinoma | 0.198 | 0.096 |
| 0.082 | 0.303 |
The bold values indicate the higher AUPRs.
Results of a paired Wilcoxon-test for LDAPred and four other contrast methods in terms of AUCs and AUPRs.
| SIMCLDA | Ping’s Method | MFLDA | LDAP | |
|---|---|---|---|---|
| 2.4816 × 10−17 | 0.0079 × 10−15 | 1.2144 × 10−15 | 0.0033 × 10−14 | |
| 0.0118 × 10−14 | 0.3000 × 10−13 | 0.0030 × 10−14 | 0.9211 × 10−11 |
Figure 2Recall values of the top k candidates of LDAPred and the other four methods.
Candidate long non-coding RNAs (lncRNAs) associated with breast cancer, colon cancer, and osteosarcoma.
| Disease Name | Rank | LncRNA Name | Description | Rank | LncRNA Name | Description |
|---|---|---|---|---|---|---|
| Breast cancer | 1 | AFAP1-AS1 | Lnc2Cancer, lncRNADisease | 9 | CECR7 | Unconfirmed |
| 2 | LINC00675 | Literature | 10 | DBET | lncRNADisease_P | |
| 3 | H19 | Lnc2Cancer, lncRNADisease_P | 11 | CARMN | lncRNADisease_P | |
| 4 | HOTTIP | Lnc2Cancer, lncRNADisease_P | 12 | DISC1FP1 | lncRNADisease_P | |
| 5 | HCG9 | lncRNADisease_P | 13 | VLDLR-AS1 | lncRNADisease_P | |
| 6 | MEG8 | Literature | 14 | PWAR5 | Literature | |
| 7 | LINC00315 | lncRNADisease_P | 15 | LINC00479 | lncRNADisease_P | |
| 8 | GABPB1-AS1 | Unconfirmed | ||||
| Colon cancer | 1 | NPSR1-AS1 | GEO | 9 | LINC00477 | lncRNADisease_P |
| 2 | MEG3 | Lnc2Cancer, lncRNADisease | 10 | PARD6G-AS1 | lncRNADisease_P | |
| 3 | H19 | Lnc2Cancer, lncRNADisease | 11 | OIP5-AS1 | lncRNADisease_P | |
| 4 | CCAT2 | Lnc2Cancer, lncRNADisease | 12 | LINC01184 | lncRNADisease_P | |
| 5 | HOTAIR | Lnc2Cancer, lncRNADisease | 13 | CARMN | lncRNADisease_P | |
| 6 | CCAT1 | Lnc2Cancer, lncRNADisease | 14 | MEG8 | lncRNADisease_P | |
| 7 | MALAT1 | Lnc2Cancer, lncRNADisease | 15 | GABPB1-AS | lncRNADisease_P | |
| 8 | GATA3-AS1 | lncRNADisease_P | ||||
| Osteosarcoma | 1 | HOTAIR | Lnc2Cancer, lncRNADisease | 9 | MEG8 | lncRNADisease_P |
| 2 | LINC00673 | Lnc2Cancer, lncRNADisease | 10 | GNAS-AS1 | lncRNADisease_P | |
| 3 | MIR17HG | lncRNADisease_P | 11 | PTCSC2 | lncRNADisease_P | |
| 4 | HULC | Lnc2Cancer, lncRNADisease_P | 12 | LINC00319 | Unconfirmed | |
| 5 | TUSC7 | Lnc2Cancer, lncRNADisease | 13 | GABPB1-AS1 | Unconfirmed | |
| 6 | HOTTIP | Lnc2Cancer, lncRNADisease | 14 | LINC00473 | Lnc2Cancer, lncRNADisease_P | |
| 7 | MEG3 | Lnc2Cancer, lncRNADisease | 15 | VLDLR-AS1 | lncRNADisease | |
| 8 | BANCR | Lnc2Cancer, lncRNADisease |
Figure 3Construction of the original eigenmatrix of lncRNA l2 and disease d3.
Figure 4Construction of the topological feature matrix of lncRNA l2 and disease d3.
Figure 5lncRNA–disease association prediction framework based on a dual CNN.