| Literature DB >> 34903781 |
Anne Laure Le Page1, Elise Ballot2, Caroline Truntzer2,3, Valentin Derangère2,3,4,5, Alis Ilie2, David Rageot2,4, Frederic Bibeau1, Francois Ghiringhelli6,7,8,9,10.
Abstract
Histological stratification in metastatic non-small cell lung cancer (NSCLC) is essential to properly guide therapy. Morphological evaluation remains the basis for subtyping and is completed by additional immunohistochemistry labelling to confirm the diagnosis, which delays molecular analysis and utilises precious sample. Therefore, we tested the capacity of convolutional neural networks (CNNs) to classify NSCLC based on pathologic HES diagnostic biopsies. The model was estimated with a learning cohort of 132 NSCLC patients and validated on an external validation cohort of 65 NSCLC patients. Based on image patches, a CNN using InceptionV3 architecture was trained and optimized to classify NSCLC between squamous and non-squamous subtypes. Accuracies of 0.99, 0.87, 0.85, 0.85 was reached in the training, validation and test sets and in the external validation cohort. At the patient level, the CNN model showed a capacity to predict the tumour histology with accuracy of 0.73 and 0.78 in the learning and external validation cohorts respectively. Selecting tumour area using virtual tissue micro-array improved prediction, with accuracy of 0.82 in the external validation cohort. This study underlines the capacity of CNN to predict NSCLC subtype with good accuracy and to be applied to small pathologic samples without annotation.Entities:
Mesh:
Year: 2021 PMID: 34903781 PMCID: PMC8669012 DOI: 10.1038/s41598-021-03206-x
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Description of learning and external validation cohorts.
| Non squamous cell carcinoma | Squamous cell carcinoma | |
|---|---|---|
| Training sample (n = 78) | 39 patients (77 640 tiles) | 39 patients (62 496 tiles) |
| Validation sample (n = 26) | 13 patients (23 750 tiles) | 13 patients (21 801 tiles) |
| Test sample (n = 28) | 14 patients (22 911 tiles) | 14 patients (20 322 tiles) |
| External sample (n = 65) | 45 patients (464 022 tiles) | 20 patients (106 727 tiles) |
Accuracy achieved by the different strategies at tile level.
| Training | Validation | Test | External validation | TCGA | |||||
|---|---|---|---|---|---|---|---|---|---|
| Overall | Overall | By class | Overall | By class | Overall | By class | Overall | By class | |
| 0.75 | 0.64 | NS: 0.70 S: 0.57 | 0.58 | NS: 0.66 S: 0.49 | 0.68 | NS: 0.73 S: 0.44 | 0.53 | NS: 0.62 S: 0.44 | |
| NS = 0.9, S = 0.9 | 0.99 | 0.87 | NS: 0.88 S: 0.87 | 0.85 | NS: 0.84 S: 0.86 | 0.85 | NS: 0.89 S: 0.72 | 0.75 | NS: 0.77 S: 0.73 |
| 0.84 | 0.71 | NS: 0.79 S: 0.62 | 0.63 | NS: 0.71 S: 0.53 | 0.71 | NS: 0.77 S: 0.48 | 0.53 | NS: 0.69 S: 0.38 | |
| 0.78 | 0.69 | NS: 0.74 S: 0.63 | 0.65 | NS: 0.68 S: 0.62 | 0.66 | NS: 0.77 S: 0.49 | 0.57 | NS: 0.59 S: 0.55 | |
| NS = 0.9, S = 0.9 | 0.99 | 0.83 | NS: 0.84 S: 0.82 | 0.88 | NS: 0.77 S: 0.92 | 0.92 | NS: 0.92 S: 0.94 | 0.83 | NS: 0.55 S: 0.94 |
| 0.88 | 0.79 | NS: 0.83 S: 0.76 | 0.73 | NS: 0.75 S: 0.71 | 0.71 | NS: 0.81 S: 0.56 | 0.57 | NS: 0.66 S: 0.50 | |
Figure 1Evaluation of abilities of the different strategies to predict the class of tiles in the external validation cohort. ROC curves at tile level (a), patient level according to majority voting (b) and max pooling (c). Green, blue and red lines correspond respectively to threshold equal to 0.5, threshold equal to 0.9 and re-estimation using filter kernel. Solid and dashed lines respectively correspond to the prediction of tiles from whole slides and TMA.
Accuracy achieved by the different strategies at patient level.
| Max pooling | Majority voting | |||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Test | External validation | TCGA | Test | External validation | TCGA | |||||||
| Overall | By class | Overall | By class | Overall | By class | Overall | By class | Overall | By class | Overall | By class | |
| 0.71 | NS: 0.71 S: 0.71 | 0.74 | NS: 0.69 S: 0.85 | 0.64 | NS: 0.47 S: 0.81 | 0.68 | NS: 0.79 S: 0.57 | 0.71 | NS: 0.80 S: 0.50 | 0.54 | NS: 0.73 S: 0.35 | |
| NS = 0.9, S = 0.9 | 0.69 | NS: 0.69 S: 0.69 | 0.74 | NS: 0.69 S: 0.85 | 0.64 | NS: 0.47 S: 0.81 | 0.73 | NS: 0.77 S: 0.69 | 0.78 | NS: 0.76 S: 0.85 | 0.64 | NS: 0.57 S: 0.71 |
| 0.61 | NS: 0.57 S: 0.64 | 0.81 | NS: 0.77 S: 0.90 | 0.54 | NS: 0.73 S: 0.32 | 0.71 | NS: 0.79 S: 0.64 | 0.73 | NS: 0.80 S: 0.60 | 0.52 | NS: 0.77 S: 0.29 | |
| 0.79 | NS: 0.79 S: 0.79 | 0.79 | NS: 0.79 S: 0.80 | 0.72 | NS: 0.57 S: 0.87 | 0.82 | NS: 0.93 S: 0.71 | 0.73 | NS: 0.83 S: 0.50 | 0.58 | NS: 0.7 S: 0.47 | |
| NS = 0.9, S = 0.9 | 0.68 | NS: 0.67 S: 0.70 | 0.80 | NS: 0.79 S: 0.83 | 0.73 | NS: 0.52 S: 0.92 | 0.68 | NS: 0.67 S: 0.70 | 0.82 | NS: 0.81 S: 0.83 | 0.73 | NS: 0.52 S: 0.92 |
| 0.79 | NS: 0.79 S: 0.79 | 0.77 | NS: 0.79 S: 0.75 | 0.67 | NS: 0.83 S: 0.50 | 0.82 | NS: 0.86 S: 0.79 | 0.73 | NS: 0.79 S: 0.60 | 0.57 | NS: 0.77 S: 0.37 | |
Figure 2Prediction procedure for a squamous tumour biopsy section. (a) Representation of original slide. (b,c) Steps for prediction analysis based on (b) Whole Slide Image (WSI) or (c) virtual TMA strategy. Heatmaps show for each tile probability of being predicted as squamous tumour (red) or non-squamous tumour (green); the highest probability was kept for coloration; grey was used for tiles non selected by the pathologist or removed by denoising step (tile containing more than 2/3 of white background).
Figure 3Predictions of tumour biopsy section containing non-squamous tumour. (a) Representation of original slide. (b,c) Steps for prediction analysis based on (b) whole Slide Image (WSI) or (c) virtual TMA strategy. Heatmaps show for each tile probability of being predicted as squamous tumour (red) or non-squamous tumour (green); the highest probability was kept for coloration; grey was used for tiles non selected by the pathologist or removed by denoising step (tile containing more than 2/3 of white background).