| Literature DB >> 35739342 |
Yanan Wang1, Yu Guang Wang2,3,4, Changyuan Hu1, Ming Li5, Yanan Fan4, Nina Otter6, Ikuan Sam7, Hongquan Gou7, Yiqun Hu7, Terry Kwok1,8, John Zalcberg9,10, Alex Boussioutas11, Roger J Daly1, Guido Montúfar3,6, Pietro Liò12, Dakang Xu13, Geoffrey I Webb14,15, Jiangning Song16,17.
Abstract
Gastric cancer is one of the deadliest cancers worldwide. An accurate prognosis is essential for effective clinical assessment and treatment. Spatial patterns in the tumor microenvironment (TME) are conceptually indicative of the staging and progression of gastric cancer patients. Using spatial patterns of the TME by integrating and transforming the multiplexed immunohistochemistry (mIHC) images as Cell-Graphs, we propose a graph neural network-based approach, termed Cell-Graph Signature or CGSignature, powered by artificial intelligence, for the digital staging of TME and precise prediction of patient survival in gastric cancer. In this study, patient survival prediction is formulated as either a binary (short-term and long-term) or ternary (short-term, medium-term, and long-term) classification task. Extensive benchmarking experiments demonstrate that the CGSignature achieves outstanding model performance, with Area Under the Receiver Operating Characteristic curve of 0.960 ± 0.01, and 0.771 ± 0.024 to 0.904 ± 0.012 for the binary- and ternary-classification, respectively. Moreover, Kaplan-Meier survival analysis indicates that the "digital grade" cancer staging produced by CGSignature provides a remarkable capability in discriminating both binary and ternary classes with statistical significance (P value < 0.0001), significantly outperforming the AJCC 8th edition Tumor Node Metastasis staging system. Using Cell-Graphs extracted from mIHC images, CGSignature improves the assessment of the link between the TME spatial patterns and patient prognosis. Our study suggests the feasibility and benefits of such an artificial intelligence-powered digital staging system in diagnostic pathology and precision oncology.Entities:
Year: 2022 PMID: 35739342 PMCID: PMC9226174 DOI: 10.1038/s41698-022-00285-5
Source DB: PubMed Journal: NPJ Precis Oncol ISSN: 2397-768X
Fig. 1An overall workflow of graph neural network-based prognosis prediction using Cell-Graphs.
a Specimen processing: the tumor tissues were extracted from gastric cancer, and stained with seven different biomarkers including DAPI, Pan-CK, CD8, CD68, CD163, Foxp3, and PD-L1. b Image pre-processing: sub-sampling and cell-graph construction were conducted for image pre-processing. c An illustration for the cohort, 172 gastric cancer patients were collected. d Data split. The training, validation, and testing datasets were split with the percentages of 64%, 16%, and 20%, respectively. e Model construction: four different GNN model architectures, including GCNSag, GCNTopK, GINSag, and GINTopK, were constructed and compared. Multi-run model training, fivefold cross-validation, and independent tests were conducted to evaluate the performance of the constructed GNN models. f Data binning: overall survival time ranged from 0 to 88 months, and two data-binning strategies were applied to generate binary- and ternary-class datasets. g Model architecture: The four models shared the same architecture but employed different types of convolutional unit and pooling layer, which consists of four consecutive convolutional layer and pooling layer blocks, followed by a summary layer and three fully connected layers, prior to the generation of the final classification outcome. The architecture of the best-performing GINTopK model is illustrated herein, which outperformed the other three model architectures and also achieved the best performance on the test dataset. The corresponding number of hidden layers or feature dimensions is indicated at the bottom of each box. Here, FC stands for “fully connected layer”.
The list of node attributes and their variable types.
| Feature name | Feature type |
|---|---|
| DAPI positive | Boolean |
| DAPI positive nucleus | Boolean |
| DAPI positive cytoplasm | Boolean |
| DAPI nucleus Intensity | Float |
| DAPI cytoplasm intensity | Float |
| PD-L1 (Opal 520) positive | Boolean |
| PD-L1 (Opal 520) positive nucleus | Boolean |
| PD-L1 (Opal 520) positive cytoplasm | Boolean |
| PD-L1 (Opal 520) nucleus intensity | Float |
| PD-L1 (Opal 520) cytoplasm intensity | Float |
| CD68 (Opal 540) positive | Boolean |
| CD68 (Opal 540) positive nucleus | Boolean |
| CD68 (Opal 540) positive cytoplasm | Boolean |
| CD68 (Opal 540) nucleus intensity | Float |
| CD68 (Opal 540) cytoplasm intensity | Float |
| Foxp3 (Opal 570) positive | Boolean |
| Foxp3 (Opal 570) positive nucleus | Boolean |
| Foxp3 (Opal 570) positive cytoplasm | Boolean |
| Foxp3 (Opal 570) nucleus intensity | Float |
| Foxp3 (Opal 570) cytoplasm intensity | Float |
| CD8 (Opal 620) positive | Boolean |
| CD8 (Opal 620) positive nucleus | Boolean |
| CD8 (Opal 620) positive cytoplasm | Boolean |
| CD8 (Opal 620) nucleus intensity | Float |
| CD8 (Opal 620) cytoplasm intensity | Float |
| Pan-CK (Opal 690) positive | Boolean |
| Pan-CK (Opal 690) positive nucleus | Boolean |
| Pan-CK (Opal 690) positive cytoplasm | Boolean |
| Pan-CK (Opal 690) nucleus intensity | Float |
| Pan-CK (Opal 690) cytoplasm intensity | Float |
| Cell area (μm2) | Float |
| Cytoplasm area (μm2) | Float |
| Nucleus area (μm2) | Float |
| Nucleus perimeter (μm) | Float |
| Nucleus roundness | Float |
Each type of feature is comprised of three Boolean variables and two float variables. These Boolean variables were identified by the pathology software based on the float values of Nucleus Intensity and Cytoplasm Intensity of each biomarker. Moreover, five different morphology features were extracted as the node attributes.
Fig. 2Model performance of four GNNs on fivefold cross-validation.
a, b show the Boxplots of performance metrics of Accuracy, F1-score, and MCC on fivefold cross-validation. c, d illustrate the ROCs of GINTopK binary- and ternary models on fivefold cross-validation. In the boxplot, the center line marks the mid-point of the data; the top and bottom lines show the maximum and minimum non-outlier data; the upper and lower bounds of the box indicate the third quartile and first quartile of the data; the height of the notch indicates the 95% confidence interval of the median point; small circles represent outliers.
Fig. 3Performance assessment of the GINTopK model in terms of ROC curves and confusion matrix on the independent test.
The left column shows the ROC curves of a binary- and c ternary classification, while the right column displays the confusion matrix of the model predictions on the b binary- and d ternary classification tasks.
Ablation studies of the major types of features used by the GNN models in both binary and ternary classification.
| Feature sets | ACC of binary | ACC of ternary |
|---|---|---|
| All-features | 0.917 ± 0.012 | 0.719 ± 0.020 |
| No-DAPI | 0.882 ± 0.022 | 0.710 ± 0.018 |
| No-PD-L1 | 0.921 ± 0.014 | 0.718 ± 0.006 |
| No-CD68 | 0.911 ± 0.011 | 0.717 ± 0.010 |
| No-FOXP3 | 0.918 ± 0.017 | 0.719 ± 0.016 |
| No-CD8 | 0.910 ± 0.006 | 0.732 ± 0.023 |
| No-Pan-CK | 0.927 ± 0.023 | 0.714 ± 0.016 |
| No-morphology | 0.892 ± 0.009 | 0.706 ± 0.021 |
The relative importance and contribution of the features were measured by the accuracy change compared with that of the all-feature model.
Fig. 4Kaplan–Meier survival analysis of patient overall survival based on the “digital grade” (patient-level predictions) produced by CG.
a Kaplan–Meier survival analysis results based on the binary-classification. b Kaplan–Meier survival analysis results based on the ternary classification.
Univariate and multivariable Cox regression analysis of overall survival (Cox proportional hazards regression model) based on the predictions of binary- and ternary classification by CG.
| Univariate analysis | Multivariate analysis | ||||||
|---|---|---|---|---|---|---|---|
| Variable | C-Index1 (95% CI) | HR2 (95% CI) | C-Index1 (95% CI) | HR2 (95% CI) | |||
| Binary-class test cohort | pT | - | - | - | 0.363 | ||
| pN | - | - | - | 0.378 | |||
| TNM-23 | 0.659 (0.577–0.740) | 5.276 (2.147–12.966) | <1e-4*** | 0.998 | |||
| TNM-34 | - | - | - | 0.998 | |||
| TNM-65 | 0.714 (0.623–0.805) | 1.873 (1.388–2.529) | 8.1e-4*** | 0.135 | |||
| 0.699 (0.637–0.762) | 0.217 (0.108–0.438) | <1e-4*** | 0.804 (0.728–0.881) | 0.037 (0.003–0.422) | 0.007** | ||
| 0.740 (0.661–0.819) | 2.412 (1.650–3.525) | <1e-4*** | 0.146 | ||||
| Ternary-class test cohort | pT | - | - | - | 0.097 | ||
| pN | - | - | - | 0.053 | |||
| TNM-23 | - | - | - | 0.998 | |||
| TNM-34 | 0.632 (0.510–0.753) | 3.169 (1.335–7.522) | 0.019* | 0.998 | |||
| TNM-66 | 0.681 (0.535–0.827) | 1.708 (1.212–2.407) | 0.028* | 0.066 | |||
| 0.823 (0.748–0.899) | 0.204 (0.107–0.389) | <1e-4*** | 0.883 (0.820–0.947) | 0.190 (0.087–0.414) | 2.94e-05*** | ||
1C-Index: concordance index; 2HR: Hazard ratio; 3I+II vs. III; 4I vs. II vs. III; 5I, IIA, IIB, IIIA, IIIB, IIIC; 6Low vs. high; 7Low vs. medium vs. High.
The classification results were compared with Harrell’s Concordance Index (C-Index), Hazard Ratio (HR), and p value. For the convenience of survival analysis comparison, the variables of TNM stages were regrouped into TNM-2 (I+II vs. III), TNM-3 (I vs. II vs. III), and TNM-6 (I, IIA, IIB, IIIA, IIIB, IIIC), while “CG+TNM-2" denotes a four-class variable by combining the classes of TNM-2 and binary-class CG.
Search space for hyperparameters of GNN models.
| Hyperparameter | Searching space |
|---|---|
| Learning rate | 10−4, 5 × 10−4, 10−3 |
| Weight decay ( | 10−4, 5 × 10−4, 10−3 |
| Hidden units | 256, 512 |
| Pooling ratio | 0.5, 0.65, 0.75 |