| Literature DB >> 32164526 |
Ping Luo1, Li-Ping Tian2, Bolin Chen3, Qianghua Xiao4, Fang-Xiang Wu5,6,7,8.
Abstract
BACKGROUND: Disease gene prediction is a critical and challenging task. Many computational methods have been developed to predict disease genes, which can reduce the money and time used in the experimental validation. Since proteins (products of genes) usually work together to achieve a specific function, biomolecular networks, such as the protein-protein interaction (PPI) network and gene co-expression networks, are widely used to predict disease genes by analyzing the relationships between known disease genes and other genes in the networks. However, existing methods commonly use a universal static PPI network, which ignore the fact that PPIs are dynamic, and PPIs in various patients should also be different.Entities:
Keywords: Disease gene prediction; Ensemble learning; Network centrality; Protein-protein interaction network; Sample-based networks
Mesh:
Substances:
Year: 2020 PMID: 32164526 PMCID: PMC7068856 DOI: 10.1186/s12859-020-3346-8
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
Fig. 1Work flow of the algorithm. (a) Obtain gene expression data of case samples; (b) Construct single sample-based networks; (c) Fuse sample-based networks based on the clustering results; (d) Perform prediction on each fused network; (e) Combine the prediction results in (d) to generate the final prediction
Fig. 2Hierarchical clustering dendrogram for BC
Fig. 3Hierarchical clustering dendrogram for TC
Fig. 4Hierarchical clustering dendrogram for AD
Sensitivity analysis
| k | |||||
|---|---|---|---|---|---|
| 1 | 2 | 3 | 4 | ||
| 1.0 | 0.1 | 0.867 | 0.961 | 0.873 | 0.878 |
| 1.0 | 0.2 | 0.869 | 0.966 | 0.889 | 0.870 |
| 1.1 | 0.1 | 0.883 | 0.967 | 0.890 | 0.903 |
| 1.1 | 0.2 | 0.881 | 0.909 | 0.896 | |
| 1.2 | 0.1 | 0.845 | 0.957 | 0.877 | 0.898 |
| 1.2 | 0.2 | 0.846 | 0.958 | 0.892 | 0.894 |
| 1.3 | 0.1 | 0.787 | 0.938 | 0.819 | 0.842 |
| 1.3 | 0.2 | 0.787 | 0.940 | 0.841 | 0.842 |
| 1.5 | 0.1 | 0.777 | 0.938 | 0.813 | 0.775 |
| 1.5 | 0.2 | 0.777 | 0.938 | 0.786 | 0.816 |
The resulted AUC values obtained with different combinations of hyperparameters for BC
The highest AUC value is marked in boldface
Sensitivity analysis
| k | |||||
|---|---|---|---|---|---|
| 1 | 2 | 3 | 4 | ||
| 1.0 | 0.1 | 0.716 | 0.966 | 0.839 | 0.790 |
| 1.0 | 0.2 | 0.713 | 0.967 | 0.795 | 0.802 |
| 1.1 | 0.1 | 0.729 | 0.800 | 0.746 | |
| 1.1 | 0.2 | 0.728 | 0.969 | 0.744 | 0.779 |
| 1.2 | 0.1 | 0.809 | 0.954 | 0.748 | 0.776 |
| 1.2 | 0.2 | 0.808 | 0.953 | 0.652 | 0.792 |
| 1.3 | 0.1 | 0.621 | 0.962 | 0.779 | 0.786 |
| 1.3 | 0.2 | 0.620 | 0.960 | 0.662 | 0.794 |
| 1.5 | 0.1 | 0.412 | 0.965 | 0.809 | 0.720 |
| 1.5 | 0.2 | 0.411 | 0.963 | 0.645 | 0.679 |
The resulted AUC values obtained with different combinations of hyperparameters for TC
The highest AUC value is marked in boldface
Sensitivity analysis
| k | |||||
|---|---|---|---|---|---|
| 1 | 2 | 3 | 4 | ||
| 1.0 | 0.1 | 0.808 | 0.964 | 0.809 | 0.763 |
| 1.0 | 0.2 | 0.809 | 0.764 | 0.705 | |
| 1.1 | 0.1 | 0.665 | 0.956 | 0.757 | 0.685 |
| 1.1 | 0.2 | 0.665 | 0.957 | 0.596 | 0.636 |
| 1.2 | 0.1 | 0.564 | 0.938 | 0.809 | 0.605 |
| 1.2 | 0.2 | 0.563 | 0.939 | 0.608 | 0.596 |
| 1.3 | 0.1 | 0.508 | 0.914 | 0.810 | 0.674 |
| 1.3 | 0.2 | 0.508 | 0.914 | 0.608 | 0.614 |
The resulted AUC values obtained with different combinations of hyperparameters for AD
The highest AUC value is marked in boldface
Fig. 5ROC curves for BC
Fig. 6ROC curves for TC
Fig. 7ROC curves for AD
Top 10 unknown genes
| Gene Name | Function | Reference |
|---|---|---|
| CREBBP | Potential disease gene | [ |
| NBN | Potential disease gene | [ |
| PARP1 | Potential biomarker | [ |
| NCOR2 | Potential biomarker | [ |
| RXRA | Potential therapeutic target | [ |
| WRN | Potential disease gene | [ |
| EXO1 | Potential disease gene | [ |
| NCOA3 | Potential disease gene | [ |
| RMI2 | Potential disease gene | [ |
| TOPBP1 | Potential therapeutic target | [ |
| HRAS | Potential disease gene | [ |
| HAUS7 | ||
| CEP72 | ||
| GTF2I | Potential disease gene | [ |
| BCLAF1 | Potential disease gene | [ |
| HAUS3 | ||
| FGFR1OP | Potential disease gene | [ |
| CEP131 | ||
| GPR83 | ||
| ALMS1 | Potential disease gene | [ |
| MAP2 | Potential disease gene | [ |
| DPYSL3 | ||
| ERRFI1 | Potential disease gene | [ |
| DAB2 | Potential disease gene | [ |
| AMPH | Potential disease gene | [ |
| SYN1 | Potential disease gene | [ |
| SYT9 | Potential disease gene | [ |
| AXIN1 | ||
| PRNP | Potential disease gene | [ |
| AAK1 | Potential disease gene | [ |