| Literature DB >> 29884785 |
Lujun Shen1,2, Qi Zeng3, Pi Guo4, Jingjun Huang5, Chaofeng Li2,6, Tao Pan7, Boyang Chang1,2, Nan Wu8, Lewei Yang3, Qifeng Chen1,2, Tao Huang1,2, Wang Li9,10, Peihong Wu11,12.
Abstract
Patients with hepatocellular carcinoma (HCC) always require routine surveillance and repeated treatment, which leads to accumulation of huge amount of clinical data. A predictive model utilizes the time-series data to facilitate dynamic prognosis prediction and treatment planning is warranted. Here we introduced an analytical approach, which converts the time-series data into a cascading survival map, in which each survival path bifurcates at fixed time interval depending on selected prognostic features by the Cox-based feature selection. We apply this approach in an intermediate-scale database of patients with BCLC stage B HCC and get a survival map consisting of 13 different survival paths, which is demonstrated to have superior or equal value than conventional staging systems in dynamic prognosis prediction from 3 to 12 months after initial diagnosis in derivation, internal testing, and multicentric testing cohorts. This methodology/model could facilitate dynamic prognosis prediction and treatment planning for patients with HCC in the future.Entities:
Mesh:
Year: 2018 PMID: 29884785 PMCID: PMC5993743 DOI: 10.1038/s41467-018-04633-7
Source DB: PubMed Journal: Nat Commun ISSN: 2041-1723 Impact factor: 14.919
Baseline characteristics of derivation, validation, and testing cohort at initial diagnosis
| Variable | Derivation cohort no. (%) | Internal testing cohort no. (%) | Multicenter testing cohort no. (%) | ||
|---|---|---|---|---|---|
| Age (years) | 0.441 | <0.001 | |||
| ≤50 | 370 (37.8) | 249 (39.7) | 199 (48.1) | ||
| >50 | 609 (62.2) | 378 (60.3) | 215 (51.9) | ||
| Gender | 0.113 | 0.043 | |||
| Male | 889 (90.8) | 554 (88.4) | 361 (87.2) | ||
| Female | 90 (9.2) | 73 (11.6) | 53 (12.8) | ||
| HBV infection | 0.086 | 0.813 | |||
| No | 33 (3.4) | 32 (5.1) | 15 (3.6) | ||
| Yes | 946 (96.6) | 595 (94.9) | 399 (96.4) | ||
| AFP (IU/ml) | 0.493 | 0.314 | |||
| <25 | 318 (32.5) | 214 (34.1) | 146 (35.3) | ||
| ≥25 | 661 (67.5) | 413 (65.9) | 268 (64.7) | ||
| Child-Pugh class | 0.815 | 0.602 | |||
| A | 841 (85.9) | 536 (85.5) | 360 (87.0) | ||
| B | 138 (14.1) | 91 (14.5) | 54 (13.0) | ||
| Tumor size (cm) | 0.727 | 0.939 | |||
| Mean ± SD | 7.20 ± 3.57 | 7.07 ± 3.48 | 7.12 ± 3.51 | ||
| ≤5 | 329 (33.6) | 216 (34.4) | 140 (33.8) | ||
| >5 | 650 (66.4) | 411 (65.6) | 274 (66.2) | ||
| Number of lesions | 0.428 | 0.956 | |||
| ≤3 | 391 (39.9) | 238 (38.0) | 166 (40.1) | ||
| >3 | 588 (60.1) | 389 (62.0) | 248 (59.9) |
All values are presented as numbers of patients followed by percentages in the parentheses. P values were calculated by comparing categorical variables between testing cohorts and derivation cohort with chi-square test
Fig. 1The survival path system constructed for BCLC stage B HCC patients. Using the selected features identified at each time slice, the population was divided into cascades of subgroups, which was further visualized by two-dimensional graph, with the time slices on x-axis and median OS time on y-axis. A total of 13 different paths were constructed
Comparison of c-index between the survival path system, BCLC staging system, AJCC staging system, and ART score at each time slice in the derivation cohort
| Time slice | Number of cases (modeling/all)a | Survival path system | BCLC staging system | AJCC staging system | ART class | ||||
|---|---|---|---|---|---|---|---|---|---|
| Number of nodes | Number of classes | Number of classes | Number of classes | ||||||
| No.1 | 979/979 | 2 | 0.624 (0.623–0.625) | 1 | — | 2 | 0.602 (0.601–0.603) | 1 | — |
| No.2 | 822/822 | 4 | 0.695 (0.693–0.697) | 5 | 0.696 (0.694–0.698) | 6 | 0.702c (0.702–0.704) | 2 | 0.528b (0.526–0.530) |
| No.3 | 506/513 | 7 | 0.733 (0.730–0.736) | 4 | 0.725b (0.722–0.728) | 6 | 0.733 (0.730–0.736) | 2 | 0.536b (0.533–0.539) |
| No.4 | 374/390 | 10 | 0.760 (0.756–0.764) | 4 | 0.724b (0.720–0.728) | 6 | 0.727b (0.723–0.731) | 2 | 0.572b (0.568–0.576) |
| No.5 | 307/336 | 12 | 0.768 (0.763–0.773) | 4 | 0.731b (0.726–0.736) | 6 | 0.737b (0.731–0.743) | 2 | 0.589b (0.584–0.594) |
| No.6 | 245/294 | 12 | 0.771 (0.764–0.778) | 5 | 0.749b (0.742–0.756) | 6 | 0.757b (0.750–0.764) | 2 | 0.535b (0.528–0.542) |
| No.7 | 199/246 | 10 | 0.792 (0.783–0.801) | 4 | 0.764b (0.755–0.773) | 6 | 0.766b (0.756–0.776) | 2 | 0.563b (0.554–0.572) |
| No.8 | 167/221 | 10 | 0.811 (0.799–0.823) | 4 | 0.773b (0.762–0.784) | 6 | 0.769b (0.757–781) | 2 | 0.541b (0.530–0.550) |
| No.9 | 128/202 | 7 | 0.830 (0.816–0.844) | 4 | 0.769b (0.752–0.786) | 6 | 0.802 (0.785–0.819) | 2 | 0.549b (0.531–0.567) |
aNodes of survival path system with less than six cases were excluded from the computing of c-index. Therefore, the number of cases in modeling is less than the number of all cases with effective data
bThe c-index of the interested system was lower than survival path system, with P < 0.006
cThe c-index of the interested system was higher than the survival path system, with P < 0.006
Fig. 2Kaplan–Meier plots in the derivation cohort. Kaplan–Meier plots showed OS divided by the survival path system, BCLC staging system, AJCC staging system, and ART score, respectively, at time slice No.1 (a), time slice No.3 (b), time slice No.5 (c), and time slice No.7 (d) in the derivation cohort
Fig. 3Kaplan–Meier plots in the validation cohorts. Kaplan–Meier plots showed OS divided by the survival path system in the internal testing cohort and multicenter testing cohort, respectively, at time slice No.1 (a, b), time slice No.3 (c, d), and time slice No.5 (e, f). Note: nodes of paths with <6 cases in the testing cohorts were regarded unstable and not included in the analysis
Comparison of c-index between the survival path system, BCLC staging system, and AJCC staging system in the internal testing cohort and multicenter testing cohort
| Time slice | Number (modeling/all) | Survival path system | BCLC staging system | AJCC staging system | |||
|---|---|---|---|---|---|---|---|
| Number of nodes | Number of classes | Number of classes | |||||
| Internal testing cohort | |||||||
| No.1 | 627/627 | 2 | 0.634 (0.632–0.636) | 1 | — | 6 | 0.634 (0.632–0.636) |
| No.2 | 562/562 | 4 | 0.695 (0.692–0.698) | 5 | 0.724a (0.721–0.727) | 6 | 0.733a (0.730–0.736) |
| No.3 | 367/367 | 8 | 0.747 (0.722–0.752) | 4 | 0.729b (0.725–0.733) | 6 | 0.737b (0.732–0.742) |
| No.4 | 271/277 | 10 | 0.774 (0.766–0.782) | 4 | 0.751b (0.743–0.759) | 6 | 0.737b (0.729–0.745) |
| No.5 | 210/222 | 11 | 0.764 (0.755–0.773) | 4 | 0.760 (0.749–0.771) | 6 | 0.728b (0.712–0.739) |
| No.6 | 171/181 | 11 | 0.756 (0.743–0.769) | 5 | 0.785a (0.755–0.775) | 6 | 0.746 (0.732–0.760) |
| No.7 | 125/148 | 8 | 0.820 (0.803–0.837) | 4 | 0.817 (0.801–0.833) | 6 | 0.824 (0.808–0.840) |
| Multicenter testing cohort | |||||||
| No.1 | 414/414 | 2 | 0.631 (0.628–0.634) | 1 | — | 3 | 0.602b (0.599–0.605) |
| No.2 | 359/359 | 4 | 0.689 (0.685–0.693) | 4 | 0.698a (0.694–0.702) | 6 | 0.715a (0.711–0.719) |
| No.3 | 233/234 | 7 | 0.725 (0.718–0.732) | 4 | 0.715 (0.709–0.721) | 6 | 0.720 (0.714–0.726) |
| No.4 | 181/189 | 8 | 0.790 (0.781–0.799) | 4 | 0.752b (0.742–0.762) | 6 | 0.759b (0.749–0.769) |
| No.5 | 131/149 | 7 | 0.778 (0.765–0.791) | 4 | 0.757b (0.745–0.769) | 6 | 0.754b (0.743–0.765) |
| No.6 | 113/128 | 7 | 0.769 (0.750–0.788) | 4 | 0.714b (0.695–0.733) | 6 | 0.734 (0.716–0.752) |
aThe c-index of the interested system was higher than the survival path system, with P < 0.006
bThe c-index of the interested system was lower than the survival path system, with P < 0.006
Hazard ratio and significance of upper node versus lower node at each path bifurcation in the derivation, internal testing, and multicenter testing cohorts
| Bifurcation node | Derivation cohort | Internal testing cohort | Multicenter testing cohort | |||
|---|---|---|---|---|---|---|
| HR (95% CI) | HR (95% CI) | HR (95% CI) | ||||
|
| 2.33 (1.97–2.75) | <0.001 | 2.58 (2.00–3.32) | <0.001 | 2.34 (1.80–3.05) | <0.001 |
|
| 3.70 (2.78–4.93) | <0.001 | 3.69 (2.40–5.69) | <0.001 | 2.79 (0.171–4.53) | <0.001 |
|
| 3.50 (2.26–5.43) | <0.001 | 2.52 (1.29–4.91) | 0.007 | 5.95 (2.98–11.91) | <0.001 |
|
| 5.25 (2.97–9.31) | <0.001 | 4.02 (1.57–10.31) | 0.004 | 12.27 (4.53–33.26) | <0.001 |
|
| 5.08 (2.24–11.53) | <0.001 | 6.35 (1.51–26.75) | 0.012 | — | —b |
|
| 2.73 (2.04–3.66) | <0.001 | 3.31 (2.06–5.32) | <0.001 | 2.47 (1.64–3.73) | <0.001 |
|
| 6.45 (3.29–12.62) | <0.001 | 7.16 (2.00–25.60) | 0.002 | 6.12 (2.69–13.94) | <0.001 |
|
| 5.61 (2.05–15.34) | <0.001 | — | 0.052a | — | —b |
|
| 4.26 (2.35–7.72) | <0.001 | 3.83 (1.84–7.98) | <0.001 | 3.48 (1.33–9.11) | 0.011 |
|
| 10.35 (3.17–33.82) | <0.001 | — | —b | — | —b |
|
| 6.89 (1.47–32.14) | 0.005 | — | —b | — | —b |
|
| 4.45 (1.67–11.85) | 0.003 | 3.67 (1.03–8.71) | 0.048 | — | —b |
p: path, ts: time slice
aNo deaths were recorded in one node and Kaplan–Meier Method with log rank test was utilized
bSample size in one node of the two comparators was <6
The correlation between surgery/ablation and path transfer in KEY nodes
| Nodes, | With surgery/ablation | Without surgery/ablation | |||||
|---|---|---|---|---|---|---|---|
| Go up ( | Go down ( | Died/NS ( | Go up ( | Go down ( | Died/NS ( | ||
|
| 20 (83.3) | 2 (8.3) | 2 (8.3) | 37 (46.8) | 2 (2.5) | 40 (50.6) | <0.001 |
|
| 11 (84.6) | 1 (7.7) | 1 (7.7) | 20 (50.0) | 5 (12.5) | 15 (37.5) | 0.072a |
|
| 13 (81.3) | 2 (12.5) | 1 (6.3) | 16 (45.7) | 12 (34.3) | 7 (20.0) | 0.070a |
|
| 79 (71.2) | 24 (21.6) | 8 (7.2) | 50 (15.2) | 203 (61.5) | 77 (23.3) | <0.001 |
|
| 17 (56.7) | 7 (23.3) | 6 (20.0) | 28 (14.2) | 38 (19.3) | 131 (66.5) | <0.001 |
p: path, ts: time slice, NS: no surveillance
aFisher’s exact test
Variables and methods of dichotomization for construction of the survival paths
| Categories and variables | Methods of dichotomization |
|---|---|
| Laboratory tests | |
| Serum AFP level (IU/ml) | <200 vs. ≥200; <400 vs. ≥400 |
| Child-Pugh class | Class B/C vs. class A; class C vs. class A/B |
| Imaging examination | |
| Diameter of main lesion (mm) | ≤50 vs. >50; ≤70 vs. >70; ≤100 vs. >100 |
| Number and size of hepatic lesions | ≤1 lesion/2–3 lesions, |
| Vascular invasion | With vs. without |
| Distant metastasis | With vs. without |
| Vascular invasion/N1/M1 | With vs. without |
| Change of lesions | With viable lesion vs. without viable lesion |
| Performance status | 0–2 vs. >2 |
Fig. 4Flowchart of study design. The time-series data of HCC patients were converted into data of time slices with constant time interval. Time slices in each case with complete data were enrolled for further analysis (a). Data in the first time slice of the whole population would initially undergo the processing cycle (PC) for feature selection and subgroup subdivision. Then data of the next time slice in each subdivided subgroup sequentially undergo PC; the analytical cycles continue until the completion of the last time slice (b)