| Literature DB >> 32594660 |
Fang Zhang1,2, Su Yao3, Zhi Li3, Changhong Liang2, Ke Zhao2,4, Yanqi Huang2,5, Ying Gao1, Jinrong Qu6, Zhenhui Li7, Zaiyi Liu2.
Abstract
Quantitative features extracted from biopsy digital pathology images can provide predictive information for neoadjuvant chemoradiotherapy (nCRT) in local advanced rectal cancer (LARC) Machine learning technologies are applied to build the digital-pathology-based pathology signature The pathology signature is an independent predictor of treatment response to nCRT in LARC.Entities:
Year: 2020 PMID: 32594660 PMCID: PMC7403709 DOI: 10.1002/ctm2.110
Source DB: PubMed Journal: Clin Transl Med ISSN: 2001-1326
Clinical characteristic in the primary and validation datasets
| Primary dataset | Validation dataset | |||||
|---|---|---|---|---|---|---|
| Characteristic | Non‐PR | PR |
| Non‐PR | PR |
|
| Age, mean ± SD | 56.0 ± 11.4 | 55.4 ± 10.9 | .465 | 51.7 ± 11.8 | 60.4 ±9.18 | .012 |
| Gender, No. (%) | .401 | .800 | ||||
| Male | 38 (62.3%) | 42 (71.2%) | 8 (72.7%) | 14 (70.0%) | ||
| Female | 23 (37.7%) | 17 (28.8%) | 3 (27.3%) | 6 (30.0%) | ||
| T staging, No. (%) | .698 | .378 | ||||
| T0 | 0 (0%) | 0 (0%) | 0 (0%) | 0 (0%) | ||
| T1 | 0 (0%) | 0 (0%) | 0 (0%) | 0 (0%) | ||
| T2 | 2 (3.3%) | 1 (1.7%) | 1 (9.1%) | 0 (0%) | ||
| T3 | 23 (37.7%) | 26 (44.1%) | 4 (36.4%) | 7 (35.0%) | ||
| T4 | 36 (59.0%) | 32 (54.2%) | 6 (54.5%) | 13 (65.0%) | ||
| N staging, No. (%) | .015 | .521 | ||||
| N0 | 6 (9.8%) | 17 (28.8%) | 2 (18.2%) | 4 (20.0%) | ||
| N1 | 46 (75.4%) | 31 (52.5%) | 8 (72.7%) | 11 (55.0%) | ||
| N2 | 9 (14.8%) | 11 (18.6%) | 1 (9.1%) | 5 (25.0%) | ||
| CEA level, No. (%) | .134 | .724 | ||||
| Normal | 30 (49.2%) | 38(64.4%) | 5 (45.5%) | 9 (45.0%) | ||
| Abnormal | 31 (50.8%) | 21 (35.6%) | 6 (54.5%) | 11 (55.0%) | ||
| Tumor location (cm) | .313 | .298 | ||||
| < 5 | 36(59.1%) | 40 (67.8%) | 8 (72.7%) | 9 (45.0%) | ||
| 5‐10 | 25 (40.9%) | 18 (30.5%) | 3 (27.3%) | 10 (50.0%) | ||
| ≥10 | 0 (0%) | 1 (1.7%) | 0 (0.0%) | 1 (5.0%) | ||
| Length of tumor (cm), mean ± SD | 4.87 ± 1.88 | 4.92 ± 1.52 | .483 | 4.56 ± 1.19 | 4.89 ± 1.70 | .917 |
| Thickness of tumor (cm), mean ± SD | 1.77 ± 0.69 | 1.76 ± 0.65 | .945 | 1.27 ± 0.38 | 1.76 ± 0.53 | .03 |
| Pathology score, median (interquartile range) | 0.128 (0.089 to 0.321) | 0.891 (0.626 to 0.929) | <.001 | 0. 207 (0.130 to 0.483) | 0.814(0.481 to 0.874) | <.001 |
Note. P‐value is derived from the univariable association analyses between each of the clinicopathological variables and treatment response. The clinical characters were the data from the initial diagnosis. The threshold value for CEA level was ≤5 ng/mL and >5 ng/mL according to the universally normal range used.
Abbreviations: CEA, pre‐treatment carcinoembryonic antigen; SD, standard deviation.
P < .05.
FIGURE 1Pathology signature construction in hematoxylin and eosin stained whole slide images (WSIs). With manually annotated tumor areas, we cropped WSIs into tile images; quantitative features were extracted and reduced from the selected patches of tumor cell dense area. Next, we built a tile‐level classifier via a support vector machine (SVM) model, and then the pathology signature was constructed with a logistic regression model. Finally, the predictive power of the signature was evaluated
FIGURE 2Texture feature selection and pathology signature's performance. A, The parameter alpha selection in the least absolute shrinkage and selection operator (LASSO) model used the 10‐fold cross‐validation. B, LASSO coefficient profiles of the 104 texture features using 10‐fold cross‐validation. C, the receiver operating characteristics (ROC) curve of the tile‐level classifier in the primary and validation dataset. D, ROC curve of the pathology signature in the primary and validation datasets. E, calibration curves of the pathology signature in the primary and validation datasets. F, Decision curve analysis for the pathology signature in the primary and validation datasets
Performance of pathology signature
| Accuracy(95%CI) | Sensitivity(95% CI) | Specificity(95% CI) | F1‐score(95% CI) | AUC(95%CI) | |
|---|---|---|---|---|---|
| TL‐p |
0.790 (0.756‐0.82) |
0.760 (0.714‐0.811) |
0.826 (0.778‐0.872) |
0.796 (0.760‐0.833) |
0.887 (0.858‐0.916) |
| TL‐v |
0.732 (0.655‐0.803) |
0.753 (0.663‐0.833) |
0.688 (0.553‐0.821) |
0.793 (0.720‐0.853) |
0.797 (0.718‐0.866) |
| P‐p |
0.792 (0.708‐0.858) |
0.780 (0.661‐0.873) |
0.803 (0.692‐0.897) |
0.786 (0.685‐0.855) |
0.930 (0.883‐0.966) |
| P‐v |
0.710 (0.548‐0.871) |
0.700 (0.476‐0.889) |
0.727 (0.444‐1.0) |
0.757 (0.564‐0.878) |
0.877 (0.719‐0.97) |
Abbreviations: CI, confidence interval; P‐p, the performance of primary dataset in pathology signature; P‐v, the performance of validation dataset in pathology signature; TL‐p, the performance of primary dataset in tile‐level classifier; TL‐v, the performance of validation dataset in tile‐level classifier