| Literature DB >> 35207736 |
Yuze Li1, Ziming Xu1, Chao An2, Huijun Chen1, Xiao Li3.
Abstract
This study aimed to develop a deep learning-based model to simultaneously perform the objective response (OR) and tumor segmentation for hepatocellular carcinoma (HCC) patients who underwent transarterial chemoembolization (TACE) treatment. A total of 248 patients from two hospitals were retrospectively included and divided into the training, internal validation, and external testing cohort. A network consisting of an encoder pathway, a prediction pathway, and a segmentation pathway was developed, and named multi-DL (multi-task deep learning), using contrast-enhanced CT images as input. We compared multi-DL with other deep learning-based OR prediction and tumor segmentation methods to explore the incremental value of introducing the interconnected task into a unified network. Additionally, the clinical model was developed using multivariate logistic regression to predict OR. Results showed that multi-DL could achieve the highest AUC of 0.871 in OR prediction and the highest dice coefficient of 73.6% in tumor segmentation. Furthermore, multi-DL can successfully perform the risk stratification that the low-risk and high-risk patients showed a significant difference in survival (p = 0.006). In conclusion, the proposed method may provide a useful tool for therapeutic regime selection in clinical practice.Entities:
Keywords: deep learning; liver neoplasms; treatment outcome
Year: 2022 PMID: 35207736 PMCID: PMC8875107 DOI: 10.3390/jpm12020248
Source DB: PubMed Journal: J Pers Med ISSN: 2075-4426
Figure 1The diagram of the patient inclusion, model construction, and performance evaluation.
Figure 2The structure of the proposed multi-DL (multi-task deep learning) model.
The detailed network structure of multi-DL (multi-task deep learning).
| Block Name | Layer | Parameter |
|---|---|---|
| Encoder-1 | 2 × (Conv2D + BN + LReLU) + Max-pooling | Conv: 3 × 3 × 32 filter, stride 1, same padding; |
| Encoder-2 | 2 × (Conv2D + BN + LReLU) + Max-pooling | Conv: 3 × 3 × 64 filter, stride 1, same padding; |
| Encoder-3 | 2 × (Conv2D + BN + LReLU) + Max-pooling | Conv: 3 × 3 × 128 filter, stride 1, same padding; |
| Encoder-4 | 2 × (Conv2D + BN + LReLU) + Max-pooling | Conv: 3 × 3 × 256 filter, stride 1, same padding; |
| Encoder-5 | Conv2D + BN + LReLU | Conv: 3 × 3 × 128 filter, stride 1, same padding; |
| Encoder-6 | Conv2D + BN + LReLU | Conv: 3 × 3 × 64 filter, stride 1, same padding; |
| Encoder-7 | Conv2D + BN + LReLU | Conv: 3 × 3 × 32 filter, stride 1, same padding; |
| Decoder-1 | (Conv2D + BN + LReLU) + (DeConv2D + BN + LReLU) | Conv: 3 × 3 × 32 filter, stride 1, same padding; |
| Decoder-2 | (Conv2D + BN + LReLU) + (DeConv2D + BN + LReLU) | Conv: 3 × 3 × 64 filter, stride 1, same padding; |
| Decoder-3 | (Conv2D + BN + LReLU) + (DeConv2D + BN + LReLU) | Conv: 3 × 3 × 128 filter, stride 1, same padding; |
| Decoder-4 | (Conv2D + BN + LReLU) + (DeConv2D + BN + LReLU) | Conv: 3 × 3 × 256 filter, stride 1, same padding; |
Patient characteristics in the training, internal validation, and external testing cohorts.
| Training | Internal Validation | External Testing |
| |
|---|---|---|---|---|
| Mean age (years) | 56.9 ± 11.9 | 57.9 ± 10.9 | 57.7 ± 13.3 | 0.831 |
| F/M ratio | 14:122 | 1:49 | 4:58 | 0.221 |
| Child–Pugh class | 0.397 | |||
| A | 116 (85.3) | 41 (82.0) | 48 (77.4) | |
| B | 22 (14.7) | 9 (18.0) | 14 (22.6) | |
| BCLC stage | 0.129 | |||
| A | 50 (36.8) | 18 (36.0) | 14 (22.6) | |
| B | 86 (63.2) | 32 (64.0) | 48 (77.4) | |
| HBV | 0.730 | |||
| Presence | 117 (86.0) | 45 (90.0) | 55 (88.7) | |
| Absence | 19 (14.0) | 5 (10.0) | 7 (11.3) | |
| ALB (g/L) | 38.8 ± 4.1 | 38.2 ± 5.1 | 38.1 ± 5.8 | 0.589 |
| ALT (U/mL) | 65.7 ± 58.5 | 77.1 ± 93.8 | 68.8 ± 76.6 | 0.630 |
| AST (U/mL) | 111.4 ± 235.4 | 110.8 ± 89.2 | 125.0 ± 112.7 | 0.882 |
| PT, seconds | 12.2 ± 1.7 | 12.0 ± 1.0 | 12.5 ± 1.5 | 0.188 |
| PLT × 109/L | 195.1 ± 78.1 | 203.8 ± 90.3 | 205.5 ± 120.0 | 0.715 |
| TBil (umol/L) | 18.6 ± 24.3 | 16.9 ± 8.1 | 17.9 ± 13.5 | 0.869 |
| AFP (ng/mL) | 0.202 | |||
| ≤400 | 71 (52.2) | 31 (62.0) | 40 (64.5) | |
| >400 | 65 (47.8) | 19 (38.0) | 22 (35.5) | |
| Tumor maximum diameter (cm) | 9.0 ± 3.6 | 9.3 ± 3.9 | 9.5 ± 4.4 | 0.652 |
| Multiple tumors | 0.098 | |||
| Single | 54 (39.7) | 19 (40.0) | 15 (24.2) | |
| Multiple | 82 (60.3) | 31 (60.0) | 47 (75.8) | |
| Tumor response | 0.773 | |||
| OR | 44 (32.4) | 17 (34.0) | 21 (33.9) | |
| Non-OR | 92 (67.6) | 33 (66.0) | 41 (66.1) |
Abbreviations: HBV, hepatitis B virus; AFP, alpha fetoprotein; ALT, alanine aminotransferase; AST, aspartate aminotransferase; ALB, albumin; PT, pro-thrombin time; TBil, total bilirubin; PLT, platelet; OR, objection response.
Uni- and multivariable regression analysis of predictors of OR in the training cohort.
| Clinical Variables | β | Odds Ratio (95% CI) | β | Odds Ratio (95% CI) | ||
|---|---|---|---|---|---|---|
| Mean age (years) | −0.014 | 0.986 (0.950–1.024) | 0.470 | |||
| Sex (Female/Male) | 0.142 | 1.152 (0.306–4.342) | 0.834 | |||
| Child–Pugh class | 0.139 | 1.150 (0.424–3.120) | 0.784 | |||
| BCLC Stage (B/A) | −1.697 | 0.183 (0.062–0.542) | 0.002 * | −1.556 | 0.211 (0.079–0.562) | 0.002 * |
| HBV (Presence/Absence) | −0.587 | 0.556 (0.172–1.792) | 0.325 | |||
| ALB (g/L) | −0.033 | 0.967 (0.864–1.084) | 0.567 | |||
| ALT (U/mL) | −0.004 | 0.996 (0.987–1.004) | 0.303 | |||
| AST (U/mL) | 0.000 | 1.000 (0.996–1.003) | 0.759 | |||
| PT, seconds | 0.188 | 1.206 (0.835–1.743) | 0.318 | |||
| PLT × 109/L | 0.004 | 1.003 (0.997–1.008) | 0.371 | |||
| TBil (umol/L) | 0.011 | 1.011 (0.981–1.041) | 0.470 | |||
| AFP (>400 ng/mL/≤400 ng/mL) | −0.361 | 0.697 (0.297–1.634) | 0.406 | |||
| Tumor maximum diameter (>5 cm/≤5 cm) | −1.399 | 0.247 (0.073–0.835) | 0.024 * | −1.654 | 0.191 (0.065–0.562) | 0.003 * |
| Multiple tumor (Single/Multiple) | 1.298 | 3.664 (1.222–10.983) | 0.020 * | 1.059 | 2.884 (1.071–7.764) | 0.036 * |
* indicated p < 0.05.
Figure 3Training curve of multi-DL. (A) Cross-entropy vs. training epochs. (B) Accuracy vs. training epochs.
Figure 4ROCs of OR prediction and risk stratification. (A) ROCs and AUCs of multi-DL, single-DL-Pre, ResNet50 and clinical model. (B) Survival curve of high- and low-risk patients stratified by the multi-DL model.
Performance for multi-DL and compared methods.
| Method | AUC | ACC (%) | Dice (%) |
|---|---|---|---|
| OR Prediction | |||
| Clinical model | 0.739 | 71.0 | N/A |
| ResNet50 [ | 0.859 | 80.6 | N/A |
| Single-DL-Pre | 0.858 | 70.9 | N/A |
| Tumor Segmentation | |||
| CNN [ | N/A | N/A | 63.2 |
| Encoder–decoder [ | N/A | N/A | 66.7 |
| Ours | |||
| Multi-DL | 0.871 | 83.9 | 73.6 |
N/A: Not applicable.
Figure 5The confusion matrix for the clinical model, ResNet50, single-DL and multi-DL models.
Figure 6Tumor segmentation for three cases with reference manual segmentation, results of multi-DL, CNN, and encoder–decoder methods.