| Literature DB >> 29491388 |
Jeremy Wang1, Surbhi Jain1, Dion Chen2, Wei Song1, Chi-Tan Hu3, Ying-Hsiu Su4,5.
Abstract
Hepatocellular carcinoma is one of the fastest growing cancers in the US and has a low survival rate, partly due to difficulties in early detection. Because of HCC's high heterogeneity, it has been suggested that multiple biomarkers would be needed to develop a sensitive HCC screening test. This study applied random forest (RF), a machine learning technique, and proposed two novel models, fixed sequential (FS) and two-step (TS), for comparison with two commonly used statistical techniques, logistic regression (LR) and classification and regression trees (CART), in combining multiple urine DNA biomarkers for HCC screening using biomarker values obtained from 137 HCC and 431 non-HCC (224 hepatitis and 207 cirrhosis) subjects. The sensitivity, specificity, area under the receiver operating curve, and variability were estimated through repeated 10-fold cross-validation to compare the models' performances in accuracy and robustness. We show that RF and TS have higher accuracy and stability; specifically, they reach 90% specificity and 86%/87% sensitivity respectively along with 15% higher sensitivity and 10% higher specificity than LR in cross-validation. The potential of RF and TS to develop a panel of multiple biomarkers and the possibility for self-training, cloud-based models for HCC screening are discussed.Entities:
Mesh:
Substances:
Year: 2018 PMID: 29491388 PMCID: PMC5830457 DOI: 10.1038/s41598-018-21922-9
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Study population.
| Variables | HCC (n = 137) | Non-HCC (n = 431) | p-value | |
|---|---|---|---|---|
| Age (years) | n (missing 55) | 137 | 376 | <0.0001 |
| Mean (SD) | 61.3 (11.4) | 55.3 (10.7) | ||
| Median (Range) | 61.0 (26.0–88.0) | 55.5 (24.0–81.0) | ||
| Gender | n (missing 3) | 137 | 428 | 0.1315 |
| Male: n (%) | 107 (78%) | 304 (71%) | ||
| HBV | n (missing 46) | 129 | 393 | <0.0001 |
| Positive: n (%) | 75 (58%) | 308 (78%) | ||
| HCV | n (missing 51) | 112 | 405 | 0.0604 |
| Positive: n (%) | 37 (33%) | 96 (31%) | ||
Figure 1Box plot of each biomarker in HCC and Non-HCC. Each biomarker value was plotted by disease group. “0” indicates non-HCC (n = 431) and “1” indicates HCC (n = 137). P-values were generated via Wilcoxon rank sum test.
Summary of relationships between demographics and biomarkers.
| Correlation Coefficients | ||
|---|---|---|
| Age (Spearson’s ρ, n = 513) | Gender (Point Biserial w/ Log Transform, n = 565) | |
| TP53.249 T | 0.125 | −0.144 |
| mRASSF1A | 0.143 | −0.058 |
| mGSTP1 | 0.107 | 0.007 |
| Serum.AFP | 0.137 | −0.055 |
Figure 2Performance of each biomarker in HCC classification as evaluated by Univariate ROC curves and AUC (n = 568; HCC 137, Non-HCC 431).
Figure 3ROC curves and AUCs generated using multivariate models LR, CART, FS, RF, and TS for distinguishing HCC (n = 137) from non-HCC (n = 431).
Summary of multivariate models’ AUCs from cross-validation.
| AUC | Model Building | Cross Validation | ||
|---|---|---|---|---|
| Mean (95% CI) | Median (Range) | Mean (95% CI) | Median (Range) | |
| LR | 0.928 (0.927–0.928) | 0.928 (0.927–0.929) | 0.926 (0.921–0.931) | 0.926 (0.915–0.934) |
| CART | 0.910 (0.909–0.912) | 0.910 (0.905–0.914) | 0.897 (0.886–0.908) | 0.897 (0.856–0.910) |
| FS | 0.934 (0.934–0.934) | 0.934 (0.933–0.934) | 0.933 (0.928–0.937) | 0.932 (0.925–0.940) |
| RF | 0.950 (0.949–0.951) | 0.950 (0.948–0.952) | 0.938 (0.932–0.945) | 0.938 (0.927–0.949) |
| TS | 0.945 (0.944–0.946) | 0.945 (0.943–0.947) | 0.935 (0.930–0.940) | 0.935 (0.923–0.946) |
Summary of multivariate models’ sensitivities at different cutoffs of specificity from the 1,000 iterations of 10-fold CV for both model building and validation data.
| Specificity Cutoff | Model | Model Building | Cross Validation | ||
|---|---|---|---|---|---|
| Mean (95% CI) | Median (Range) | Mean (95% CI) | Median (Range) | ||
| 85% | LR | 0.873 (0.869–0.878) | 0.873 (0.865–0.882) | 0.870 (0.853–0.886) | 0.869 (0.842–0.898) |
| CART | 0.861 (0.857–0.864) | 0.861 (0.849–0.868) | 0.766 (0.738–0.795) | 0.766 (0.667–0.798) | |
| FS | 0.873 (0.870–0.875) | 0.873 (0.870–0.877) | 0.871 (0.863–0.879) | 0.869 (0.860–0.885) | |
| RF | 0.910 (0.907–0.913) | 0.910 (0.904–0.915) | 0.900 (0.889–0.910) | 0.898 (0.882–0.914) | |
| TS | 0.915 (0.913–0.917) | 0.915 (0.912–0.918) | 0.906 (0.894–0.919) | 0.906 (0.884–0.920) | |
| 90% | LR | 0.789 (0.781–0.797) | 0.789 (0.777–0.840) | 0.782 (0.757–0.807) | 0.781 (0.743–0.820) |
| CART | 0.861 (0.856–0.865) | 0.861 (0.843–0.865) | 0.766 (0.738–0.795) | 0.766 (0.667–0.798) | |
| FS | 0.846 (0.843–0.849) | 0.846 (0.840–0.851) | 0.838 (0.826–0.851) | 0.839 (0.823–0.863) | |
| RF | 0.870 (0.866–0.874) | 0.870 (0.862–0.877) | 0.862 (0.848–0.876) | 0.862 (0.831–0.877) | |
| TS | 0.890 (0.883–0.891) | 0.887 (0.880–0.893) | 0.871 (0.857–0.885) | 0.870 (0.837–0.899) | |
| 95% | LR | 0.655 (0.647–0.662) | 0.655 (0.643–0.668) | 0.645 (0.622–0.669) | 0.644 (0.610–0.686) |
| CART | 0.809 (0.793–0.825) | 0.809 (0.780–0.829) | 0.664 (0.571–0.757) | 0.664 (0.481–0.734) | |
| FS | 0.750 (0.745–0.753) | 0.749 (0.745–0.758) | 0.738 (0.723–0.752) | 0.737 (0.715–0.760) | |
| RF | 0.790 (0.781–0.799) | 0.790 (0.776–0.809) | 0.766 (0.746–0.786) | 0.766 (0.727–0.797) | |
| TS | 0.827 (0.818–0.835) | 0.827 (0.811–0.840) | 0.798 (0.775–0.822) | 0.800 (0.750–0.831) | |
Summary of multivariate models’ specificities at different cutoffs of sensitivity from the 1,000 iterations of 10-fold CV for both model building and validation data.
| Sensitivity Cutoff | Model | Model Building | Cross Validation | ||
|---|---|---|---|---|---|
| Mean (95% CI) | Median (Range) | Mean (95% CI) | Median (Range) | ||
| 90% | LR | 0.823 (0.816–0.830) | 0.823 (0.812–0.832) | 0.823 (0.811–0.836) | 0.824 (0.800–0.842) |
| CART | 0.01 (0.000–0.049) | 0.01 (0.000–0.102) | 0.005 (0.903–0.920) | 0.005 (0.893–0.928) | |
| FS | 0.812 (0.809–0.816) | 0.812 (0.807–0.820) | 0.812 (0.803–0.821) | 0.812 (0.798–0.828) | |
| RF | 0.867 (0.861–0.874) | 0.867 (0.857–0.876) | 0.856 (0.844–0.869) | 0.856 (0.833–0.873) | |
| TS | 0.888 (0.884–0.893) | 0.888 (0.881–0.896) | 0.879 (0.869–0.889) | 0.879 (0.856–0.900) | |
| 95% | LR | 0.640 (0.631–0.648) | 0.639 (0.630–0.654) | 0640 (0.627–0.652) | 0.638 (0.622–0.673) |
| CART | 0.01 (0.000–0.011) | 0.01 (0.000–0.014) | 0.005 (0.901–0.923) | 0.005 (0.884–0.928) | |
| FS | 0.660 (0.651–0.670) | 0.660 (0.644–0.680) | 0.659 (0.644–0.674) | 0.659 (0.633–0.687) | |
| RF | 0.738 (0.727–0.749) | 0.738 (0.716–0.752) | 0.724 (0.705–0.742) | 0.724 (0.692–0.752) | |
| TS | 0.739 (0.730–0.749) | 0.739 (0.725–0.758) | 0.730 (0.713–0.746) | 0.729 (0.698–0.759) | |
| 99% | LR | 0.233 (0.217–0.250) | 0.234 (0.212–0.253) | 0.233 (0.210–0.256) | 0.232 (0.204–0.281) |
| CART | 0.01 (0.000–0.011) | 0.01 (0.000–0.014) | 0.005 (0.901–0.923) | 0.005 (0.884–0.928) | |
| FS | 0.223 (0.206–0.239) | 0.218 (0.216–0.259) | 0.223 (0.204–0.241) | 0.220 (0.211–0.269) | |
| RF | 0.365 (0.293–0.438) | 0.367 (0.218–0.455) | 0.356 (0.279–0.433) | 0.357 (0.197–0.459) | |
| TS | 0.271 (0.186–0.356) | 0.269 (0.132–0.451) | 0.263 (0.174–0.351) | 0.263 (0.125–0.450) | |
Figure 4Fixed Sequential Flowchart Data are first separated based on the AASLD standard cutoff of 20 ng/mL. The AFP-positive subjects are predicted as HCC-positive (p = 1), while the AFP-negative subset is run through a logistic regression algorithm that provides a final classification.