| Literature DB >> 31001321 |
Ping Luo1, Qianghua Xiao2, Pi-Jing Wei1,3, Bo Liao4, Fang-Xiang Wu1,4,5,6.
Abstract
Complex diseases are known to be associated with disease genes. Uncovering disease-gene associations is critical for diagnosis, treatment, and prevention of diseases. Computational algorithms which effectively predict candidate disease-gene associations prior to experimental proof can greatly reduce the associated cost and time. Most existing methods are disease-specific which can only predict genes associated with a specific disease at a time. Similarities among diseases are not used during the prediction. Meanwhile, most methods predict new disease genes based on known associations, making them unable to predict disease genes for diseases without known associated genes.In this study, a manifold learning-based method is proposed for predicting disease-gene associations by assuming that the geodesic distance between any disease and its associated genes should be shorter than that of other non-associated disease-gene pairs. The model maps the diseases and genes into a lower dimensional manifold based on the known disease-gene associations, disease similarity and gene similarity to predict new associations in terms of the geodesic distance between disease-gene pairs. In the 3-fold cross-validation experiments, our method achieves scores of 0.882 and 0.854 in terms of the area under of the receiver operating characteristic (ROC) curve (AUC) for diseases with more than one known associated genes and diseases with only one known associated gene, respectively. Further de novo studies on Lung Cancer and Bladder Cancer also show that our model is capable of identifying new disease-gene associations.Entities:
Keywords: disease gene identification; disease module theory; gene ontology; manifold learning; multi-task learning
Year: 2019 PMID: 31001321 PMCID: PMC6454152 DOI: 10.3389/fgene.2019.00270
Source DB: PubMed Journal: Front Genet ISSN: 1664-8021 Impact factor: 4.599
Figure 1ROC curves of the three competing algorithms on multiple-gene diseases.
Figure 2ROC curves of the three competing algorithms on single-gene diseases.
Top 10 predictions for lung cancer and bladder cancer.
| SEMA4A | |
| KCNK9 | Sun et al., |
| MYL2 | Che et al., |
| DENND5A | |
| HTRA1 | Esposito et al., |
| GABRA1 | |
| ATP6AP1 | Sabrkhany et al., |
| KCTD17 | |
| HFE | McLarty et al., |
| BCS1L | |
| PDYN | |
| DKC1 | |
| SMAD3 | Tong et al., |
| MCC | |
| DMP1 | Peng et al., |
| MGP | |
| CALR | Kageyama et al., |
| CASQ2 | |
| SOX18 | |
| GATM | |