| Literature DB >> 30598085 |
Ke Zhang1, Wei Geng1, Shuqin Zhang2.
Abstract
BACKGROUND: Many mathematical and statistical models and algorithms have been proposed to do biomarker identification in recent years. However, the biomarkers inferred from different datasets suffer a lack of reproducibilities due to the heterogeneity of the data generated from different platforms or laboratories. This motivates us to develop robust biomarker identification methods by integrating multiple datasets.Entities:
Keywords: Data integration; Logistic regression; Meta-analysis; Network penalty
Mesh:
Substances:
Year: 2018 PMID: 30598085 PMCID: PMC6311907 DOI: 10.1186/s12918-018-0657-8
Source DB: PubMed Journal: BMC Syst Biol ISSN: 1752-0509
Prediction results for simulation setting 1
| Prediction | ||||
|---|---|---|---|---|
| Method | Sensitivity | Specificity | Accuracy | AUC |
| LASSO | 0.63(0.05) | 0.62(0.04) | 0.62(0.02) | 0.66(0.02) |
| Enet | 0.65(0.05) | 0.64(0.05) | 0.63(0.02) | 0.68(0.02) |
| Network | 0.82(0.06) | 0.82(0.06) | 0.81(0.05) | 0.89(0.05) |
| Abs-Network | 0.82(0.05) | 0.82(0.06) | 0.81(0.04) | 0.89(0.04) |
| Merge-LASSO | 0.65(0.04) | 0.65(0.06) | 0.63(0.02) | 0.68(0.02) |
| Merge-Enet | 0.65(0.05) | 0.64(0.05) | 0.63(0.02) | 0.68(0.02) |
| Merge-Network | 0.87(0.04) | 0.88(0.03) | 0.88(0.03) | 0.95(0.02) |
| Merge-Abs-Network | 0.88(0.04) | 0.88(0.03) | 0.88(0.02) | 0.95(0.02) |
| Int-LASSO | 0.88(0.02) | 0.88(0.02) | 0.88(0.02) | 0.96(0.01) |
| Int-Enet | 0.88(0.02) | 0.88(0.02) | 0.88(0.02) | 0.96(0.01) |
| Int-Network | 0.89(0.02) |
| 0.89(0.01) | 0.96(0.01) |
| Int-Abs-Network |
|
|
|
|
| MetaLasso | 0.75(0.05) | 0.76(0.04) | 0.76(0.04) | 0.84(0.04) |
β is shown in (9),
The maximum value for each measure is highlighted using boldface font
Variable selection results for simulation setting 1
| Variable selection | |||
|---|---|---|---|
| Method | Precision | Recall | |
| LASSO | 0.93(0.02) | 0.26(0.06) | 0.60(0.06) |
| Enet | 0.90(0.04) | 0.41(0.06) | 0.61(0.06) |
| Network | 0.85(0.02) | 0.91(0.05) | 0.80(0.06) |
| Abs-Network | 0.82(0.02) | 0.95(0.05) | 0.81(0.06) |
| Merge-LASSO | 0.94(0.02) | 0.49(0.05) | 0.62(0.05) |
| Merge-Enet | 0.94(0.02) | 0.56(0.04) | 0.61(0.07) |
| Merge-Network |
| 0.94(0.03) | 0.87(0.03) |
| Merge-Abs-Network |
|
| 0.88(0.03) |
| Int-LASSO | 0.95(0.01) | 0.49(0.05) | 0.88(0.02) |
| Int-Enet | 0.96(0.01) | 0.65(0.04) | 0.88(0.02) |
| Int-Network | 0.94(0.04) | 0.96(0.03) | 0.89(0.01) |
| Int-Abs-Network | 0.91(0.05) |
|
|
| MetaLasso | 0.94(0.01) | 0.05(0.02) | 0.75(0.04) |
β is shown in (9),
The maximum value for each measure is highlighted using boldface font
Prediction results for simulation setting 2
| Prediction | ||||
|---|---|---|---|---|
| Method | Sensitivity | Specificity | Accuracy | AUC |
| LASSO | 0.63(0.05) | 0.64(0.08) | 0.62(0.02) | 0.66(0.03) |
| Enet | 0.61(0.04) | 0.63(0.06) | 0.61(0.03) | 0.65(0.03) |
| Network | 0.83(0.04) | 0.85(0.06) | 0.84(0.04) | 0.92(0.04) |
| Abs-Network | 0.85(0.05) | 0.84(0.05) | 0.84(0.03) | 0.92(0.03) |
| Merge-LASSO | 0.63(0.05) | 0.63(0.06) | 0.61(0.02) | 0.66(0.02) |
| Merge-Enet | 0.62(0.04) | 0.63(0.07) | 0.61(0.02) | 0.66(0.02) |
| Merge-Network | 0.82(0.04) |
| 0.84(0.03) | 0.93(0.02) |
| Merge-Abs-Network | 0.81(0.04) | 0.86(0.04) | 0.83(0.03) | 0.92(0.03) |
| Int-LASSO | 0.82(0.03) | 0.89(0.03) | 0.85(0.03) | 0.93(0.02) |
| Int-Enet | 0.82(0.04) | 0.89(0.03) | 0.85(0.03) | 0.94(0.02) |
| Int-Network | 0.88(0.04) |
| 0.87(0.02) |
|
| Int-Abs-Network |
| 0.87(0.04) |
|
|
| MetaLasso | 0.81(0.03) | 0.82(0.04) | 0.81(0.04) | 0.90(0.03) |
The sign of β is shown in (10),
The maximum value for each measure is highlighted using boldface font
Variable selection results for simulation setting 2
| Variable selection | |||
|---|---|---|---|
| Method | Precision | Recall | |
| LASSO | 0.91(0.04) | 0.28(0.06) | 0.60(0.07) |
| Enet | 0.91(0.04) | 0.35(0.07) | 0.62(0.06) |
| Network | 0.85(0.03) | 0.74(0.12) | 0.84(0.05) |
| Abs-Network | 0.83(0.03) | 0.77(0.13) | 0.84(0.03) |
| Merge-LASSO | 0.95(0.01) | 0.42(0.08) | 0.60(0.06) |
| Merge-Enet | 0.95(0.01) | 0.50(0.07) | 0.61(0.05) |
| Merge-Network |
| 0.74(0.08) | 0.84(0.03) |
| Merge-Abs-Network |
| 0.77(0.08) | 0.84(0.03) |
| Int-LASSO | 0.95(0.01) | 0.43(0.09) | 0.85(0.02) |
| Int-Enet | 0.96(0.01) | 0.64(0.07) | 0.86(0.03) |
| Int-Network | 0.93(0.03) | 0.83(0.07) | 0.87(0.03) |
| Int-Abs-Network | 0.92(0.04) |
|
|
| MetaLasso | 0.94(0.01) | 0.04(0.02) | 0.81(0.04) |
The sign of β is shown in (10),
The maximum value for each measure is highlighted using boldface font
Datasets summary [14]
| Dataset | Publication | # Patients | Classification | # patients |
|---|---|---|---|---|
| GSE2034 | [ | 242 | time to relapse ≤ 5y & relapse=True | 95 |
| time to relapse > 7y & relapse=False | 147 | |||
| GSE1456 | [ | 111 | time to relapse ≤ 5y & relapse=True | 35 |
| time to relapse > 7y & relapse=False | 76 |
Real data results. MetaLasso achieved the AUC 0.62(0.02), and selected 3 genes as biomarkers
| Data Merging | Our model | |||
|---|---|---|---|---|
| Penalty | AUC | # Genes | AUC | # Genes |
| LASSO | 0.67(0.01) | 59 | 0.70(0.03) | 122 |
| Enet | 0.67(0.01) | 306 | 0.69(0.02) | 104 |
| Network | 0.58(0.01) | 255 | 0.70(0.04) | 214 |
| Abs-Network | 0.59(0.03) | 285 | 0.67(0.01) | 270 |
Fig. 1Identified subnetwork biomarkers using network-based integrative logistic regression with Abs-Network penalty
Fig. 2One identified subnetwork biomarker using network-based integrative logistic regression with Abs-Network penalty