| Literature DB >> 32615919 |
Shaima Belhechmi1,2, Riccardo De Bin3, Federico Rotolo4, Stefan Michiels5,6.
Abstract
Entities:
Keywords: Biomarker selection; Cox model; High-dimensional; Lasso penalty; Pathways; Precision medicine; Stratified medicine
Mesh:
Substances:
Year: 2020 PMID: 32615919 PMCID: PMC7331150 DOI: 10.1186/s12859-020-03618-y
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
Fig. 1Simulation results at the biomarker level. False Discovery Rate (FDR) against False Negative Rate (FNR) of biomarkers in alternative scenarios for the simulations (a) with n=500 patients, p=1000 biomarkers and R=20 biomarker groups of equal size. Average quantities across 500 replications. Abbreviations: l, number of active groups; q, number of active biomarkers; AC: Average Coefficients; SW: Single Wald; ASW: Average Single Wald; ASW*SW: Average Single Wald product Single Wald; MSW: Max Single Wald; MSW*SW: Max Single Wald product Single Wald
Fig. 2Simulation results at the biomarker level. Boxplots of the number of selected biomarkers according to the different scenarios, the box bounds the interquartile range (IQR) and contains a horizontal line corresponding to the median; outside of the box, the Tukey-style whiskers extend to a maximum of 1.5*IQR beyond the box. The black diamonds are the average number of selected biomarkers. The black dashed lines indicate the true number of active biomarkers in each scenario for the simulations (a) with n=500 patients, p=1000 biomarkers and R=20 biomarker groups of equal size. Abbreviations: l, number of active groups; q, number of active biomarkers; AC: Average Coefficients; SW: Single Wald; ASW: Average Single Wald; ASW*SW: Average Single Wald product Single Wald; MSW: Max Single Wald; MSW*SW: Max Single Wald product Single Wald
Fig. 3Simulation results at the group level. False Discovery Rate (FDR) against False Negative Rate (FNR) of biomarker groups in alternative scenarios for the simulations (a) with n=500 patients, p=1000 biomarkers and R=20 biomarker groups of equal size. Average quantities across 500 replications. Abbreviations: l, number of active groups; q, number of active biomarkers; AC: Average Coefficients; SW: Single Wald; ASW: Average Single Wald; ASW*SW: Average Single Wald product Single Wald; MSW: Max Single Wald; MSW*SW: Max Single Wald product Single Wald
The average and the empirical standard error (ESE) of the AUC at 5 years in alternative scenarios for the simulations (a) with n=500 patients, p=1000 biomarkers and R=20 biomarker groups of equal size. Average quantities across 500 replications
| ( | ||||||
|---|---|---|---|---|---|---|
| HR ∼ | HR ∼ | |||||
| Standard Lasso | 0.88 (0.02) | 0.87 (0.03) | 0.75 (0.04) | 0.75 (0.04) | 0.67 (0.05) | 0.61 (0.05) |
| Adaptive lasso | ||||||
| AC | 0.88 (0.02) | 0.88 (0.03) | 0.78 (0.04) | 0.77 (0.04) | 0.68 (0.05) | 0.61 (0.05) |
| PCA | 0.87 (0.06) | 0.85 (0.05) | 0.74 (0.09) | 0.73 (0.07) | 0.64 (0.07) | 0.58 (0.06) |
| Lasso+PCA | 0.88 (0.04) | 0.86 (0.05) | 0.74 (0.08) | 0.73 (0.06) | 0.66 (0.06) | 0.61 (0.06) |
| SW | 0.88 (0.02) | 0.88 (0.03) | 0.75 (0.04) | 0.75 (0.04) | 0.65 (0.05) | 0.59 (0.05) |
| ASW | 0.88 (0.02) | 0.88 (0.03) | 0.78 (0.03) | 0.77 (0.04) | 0.68 (0.05) | 0.61 (0.05) |
| ASW*SW | 0.89 (0.02) | 0.88 (0.02) | 0.78 (0.04) | 0.77 (0.04) | 0.66 (0.05) | 0.59 (0.05) |
| MSW | 0.88 (0.02) | 0.88 (0.02) | 0.78 (0.04) | 0.77 (0.04) | 0.68 (0.05) | 0.62 (0.05) |
| MSW*SW | 0.89 (0.02) | 0.88 (0.02) | 0.77 (0.04) | 0.76 (0.04) | 0.67 (0.05) | 0.60 (0.05) |
| cMCP | 0.87 (0.03) | 0.86 (0.03) | 0.75 (0.04) | 0.74 (0.04) | 0.67 (0.05) | 0.61 (0.06) |
| gel | 0.88 (0.02) | 0.88 (0.02) | 0.74 (0.08) | 0.60 (0.08) | 0.58 (0.07) | 0.57 (0.06) |
| SGL | 0.88 (0.02) | 0.88 (0.02) | 0.77 (0.04) | 0.76 (0.04) | 0.69 (0.05) | 0.62 (0.06) |
| IPF-Lasso | ||||||
| IPF-Lasso1 | 0.88 (0.02) | 0.88 (0.03) | 0.78 (0.04) | 0.77 (0.04) | 0.69 (0.05) | 0.63 (0.05) |
| IPF-Lasso2 | 0.88 (0.02) | 0.88 (0.03) | 0.76 (0.04) | 0.75 (0.04) | 0.68 (0.05) | 0.62 (0.06) |
Abbreviations: l, number of active groups; q, number of active biomarkers; AC: Average Coefficients; SW: Single Wald; ASW: Average Single Wald; ASW*SW: Average Single Wald product Single Wald; MSW: Max Single Wald; MSW*SW: Max Single Wald product Single Wald.
IPF-Lasso1: pflist1 = list(c(1,rep(2,19)), c(2,1,rep(2,18)), c(rep(2,10),1,rep(2,9)), c(1,1,rep(2,18)), c(1,rep(2,9),1,rep(2,9)), c(2,1,rep(2,8),1,rep(2,9)), c(1,1,rep(2,8),1,rep(2,9))).
IPF-Lasso2: pflist2 = list(c(2,rep(1,19)), c(1,2,rep(1,18)), c(rep(1,10),2,rep(1,9)), c(2,2,rep(1,18)), c(2,rep(1,9),2,rep(1,9)), c(1,2,rep(1,8),2,rep(1,9)), c(2,2,rep(1,8),2,rep(1,9)))
Fig. 4The identified biomarkers by the different lasso-based variable selection procedures on the training set of the breast cancer gene expression study for relapse-free survival. Abbreviations: AC, Average Coefficients; SW, Single Wald; ASW, Average Single Wald; ASW*SW, Average Single Wald product Single Wald; MSW, Max Single Wald; MSW*SW, Max Single Wald product Single Wald
The average AUC and the empirical standard error (ESE) for 5-year relapse-free survival prediction obtained from 500 random training-validation sets using the different variables selection procedures
| AUC | 0.62 | 0.61 | 0.63 | 0.61 | 0.62 | 0.62 | 0.62 | 0.63 | 0.61 | 0.57 | 0.62 | 0.62 | 0.60 |
| ESE | 0.04 | 0.04 | 0.04 | 0.05 | 0.04 | 0.04 | 0.04 | 0.04 | 0.04 | 0.05 | 0.04 | 0.04 | 0.05 |