| Literature DB >> 35007352 |
Neeraj Kumar1, Ruchika Verma2, Chuheng Chen2, Cheng Lu2, Pingfu Fu3, Joseph Willis4,5, Anant Madabhushi2,6.
Abstract
We assessed the utility of quantitative features of colon cancer nuclei, extracted from digitized hematoxylin and eosin-stained whole slide images (WSIs), to distinguish between stage II and stage IV colon cancers. Our discovery cohort comprised 100 stage II and stage IV colon cancer cases sourced from the University Hospitals Cleveland Medical Center (UHCMC). We performed initial (independent) model validation on 51 (143) stage II and 79 (54) stage IV colon cancer cases from UHCMC (The Cancer Genome Atlas's Colon Adenocarcinoma, TCGA-COAD, cohort). Our approach comprised the following steps: (1) a fully convolutional deep neural network with VGG-18 architecture was trained to locate cancer on WSIs; (2) another deep-learning model based on Mask-RCNN with Resnet-50 architecture was used to segment all nuclei from within the identified cancer region; (3) a total of 26 641 quantitative morphometric features pertaining to nuclear shape, size, and texture were extracted from within and outside tumor nuclei; (4) a random forest classifier was trained to distinguish between stage II and stage IV colon cancers using the five most discriminatory features selected by the Wilcoxon rank-sum test. Our trained classifier using these top five features yielded an AUC of 0.81 and 0.78, respectively, on the held-out cases in the UHCMC and TCGA validation sets. For 197 TCGA-COAD cases, the Cox proportional hazards model yielded a hazard ratio of 2.20 (95% CI 1.24-3.88) with a concordance index of 0.71, using only the top five features for risk stratification of overall survival. The Kaplan-Meier estimate also showed statistically significant separation between the low-risk and high-risk patients, with a log-rank P value of 0.0097. Finally, unsupervised clustering of the top five features revealed that stage IV colon cancers with peritoneal spread were morphologically more similar to stage II colon cancers with no long-term metastases than to stage IV colon cancers with hematogenous spread.Entities:
Keywords: colon cancer; computational pathology; hematogenous spread; peritoneal spread; quantitative histomorphometric image analysis
Mesh:
Substances:
Year: 2022 PMID: 35007352 PMCID: PMC9007877 DOI: 10.1002/path.5864
Source DB: PubMed Journal: J Pathol ISSN: 0022-3417 Impact factor: 9.883
Figure 1Patient inclusion criteria and distribution of cases in the training and validation cohorts.
Figure 2Illustration of the tumor and nucleus segmentation modules.
Figure 3Illustration of the features extracted from the segmented tumor nuclei for stage II CC and stage IV CC with both peritoneal and hematogenous metastases. (A) Patches of size 1000 × 1000 pixels were extracted from the tumor region of the input WSI. (B) An input patch to the nuclei segmentation module. (C) Output of the nuclei segmentation module, where each nucleus is shown using different colors to show the separation of touching and overlapping nuclei. (D) Nuclear shape features quantifying attributes such as circumference, area, length of major axis, etc. and (E) nuclear orientation features quantifying the direction (in red arrows) of the major axis of each of the segmented nuclei are shown as an illustration.
Figure 4K–M curves for overall survival on the independent validation set of the TCGA‐COAD cohort. (A) The Cox proportional hazards model generated low‐risk and high‐risk categories using the top five quantitative features. (B) Classes generated using unsupervised hierarchical clustering based on the top five quantitative nuclear features. (C) Original stage labels available in the TCGA‐COAD cohort's clinical data.
Figure 5Comparison of hematogenous and peritoneal metastases of stage IV CC with stage II CC. (A) UMAP illustration of hematogenous versus peritoneal metastases of stage IV CC. (B) Unsupervised clustering‐based heatmap of the top five features that generated maximum cluster separation. True class labels are shown on the left vertical bar beside the heatmap – stage II CCs are shown in green, while blue and red represent stage IV CC with peritoneal and hematogenous metastases, respectively.
Figure 6Violin plots of the top discriminatory features between stage II and stage IV CC with both peritoneal and hematogenous metastases. The best‐performing feature from each of the top feature families is shown in this illustration – average nuclear area from nuclear shape features, variance of nuclear orientation from cell‐graph tensor (CGT) features, and variance in local nuclear contrast from local cell‐cluster co‐occurrence nuclear morphology matrix (cCCM) based features.