| Literature DB >> 31057753 |
Vincent Gaydou1, Myriam Polette2,3, Cyril Gobinet1, Claire Kileztky2, Jean-François Angiboust1, Philippe Birembaut2,3, Vincent Vuiblet1,3, Olivier Piot1,4.
Abstract
Spectral histopathology, based on infrared interrogation of tissue sections, proved a promising tool for helping pathologists in characterizing histological structures in a quantitative and automatic manner. In cancer diagnosis, the use of chemometric methods permits establishing numerical models able to detect cancer cells and to characterize their tissular environment. In this study, we focused on exploiting multivariate infrared data to score the tumour aggressiveness in preneoplastic lesions and squamous cell lung carcinomas. These lesions present a wide range of aggressive phenotypes; it is also possible to encounter cases with various degrees of aggressiveness within the same lesion. Implementing an infrared-based approach for a more precise histological determination of the tumour aggressiveness should arouse interest among pathologists with direct benefits for patient care. In this study, the methodology was developed from a set of samples including all degrees of tumour aggressiveness and by constructing a chain of data processing steps for an automated analysis of tissues currently manipulated in routine histopathology.Entities:
Year: 2019 PMID: 31057753 PMCID: PMC6471539 DOI: 10.1039/c8sc04320e
Source DB: PubMed Journal: Chem Sci ISSN: 2041-6520 Impact factor: 9.825
Elaboration of the aggressiveness scale on the basis of the histopathological characterisation of the lesions
| Lesion type | Phenotypic characteristics | Aggressiveness score based on histopathology criteria |
| Hypertrophy | An increase in cell size and/or functional activity in response to a stimulus: score of 1 to 3 | 1 to 3 |
| Hyperplasia | An increase of cell numbers, | 2 to 4 |
| Proliferative lesion | An increase in cell growth and lication that is not dependent on an external stimulus | 3 to 5 |
| Metaplasia | A reversible process in which one mature cell type is replaced by another mature cell type (adaptive response to a stimulus) | 4 to 6 |
| Dysplasia | Reversible, irregular, atypical, proliferative cellular changes in response to irritation or inflammation | 5 to 7 |
| Severe dysplastic lesion | A lack of differentiation of tissue cells; a tumor with fewer differentiated cells is more malignant | 6 to 8 |
|
| A local SCC tumor | 7 to 8 |
| Invasive and micro-invasive carcinoma | An irregular, atypical, proliferative SCC tumor | 8 to 9 |
Fig. 2Results of unsupervised and supervised models constructed from the calibration. Representative calibration samples of NSCLC corresponding to the 6 phenotypes included in the analysis are presented. For each sample, three images are depicted: HE staining image revealing the tissue morphology, ASK image recovering the histological structures on the basis of their infrared signature and colour-coded image indicating the pixels of interest (associated with epithelial cells) scored according to the aggressiveness scale. From this last image, a graph is extracted for quantifying the numbers of pixels as a function of the aggressiveness scale. The last column of this figure shows the colour scale adopted to highlight the PLS pixel scoring.
Description of the calibration sample set and results of the internal leave-one-image-out-cross validation
| Experimental references | References used for model calibration (pathologist empirical attribution) | Result of internal validation (leave one image out cross validation) | |||||
| Sample image number | Lame & spot numbers | Pathologist expertise attribution | Reference aggressiveness | Predicted aggressiveness (mean of pixel-scores/images) | Relative RMSE | Standard deviation (at the pixel level) | Bias (at the image level) |
| 1 | L1S07 |
| 7.0 | 6.1 | 10.1 | 0.45 | –0.93 |
| 2 | L1S08 | Dysplasia (medium/serious) | 6.5 | 5.6 | 10.6 | 0.72 | –0.92 |
| 3 | L1S10 | Infiltrating SCC | 9.0 | 7.0 | 20.6 | 0.37 | –2.04 |
| 4 | L1S14 | Micro-invasive SCC | 8.0 | 8.11 | 4.8 | 0.85 | 0.13 |
| 5 | L1S17 | Malpighian metaplasia + slight dysplasia | 6.0 | 6.5 | 8.1 | 0.51 | 0.45 |
| 6 | L2S01 | Malpighian metaplasia + slight dysplasia | 6.0 | 7.0 | 11.1 | 0.64 | 1.01 |
| 7 | L2S03 | Hyperplasia of basal cell + muco-secretion | 5.0 | 6.0 | 11.8 | 0.95 | 0.98 |
| 8 | L2S05 | Malpighian metaplasia | 5.5 | 7.0 | 16.7 | 0.87 | 1.52 |
| 9 | L2S08 | Infiltrating SCC | 9.0 | 7.6 | 14.7 | 0.52 | –1.37 |
| 10 | L2S11 | Malpighian metaplasia | 5.5 | 7.6 | 21.6 | 0.63 | 2.09 |
| 11 | L2S15 | Infiltrating SCC | 9.0 | 8.3 | 8.3 | 0.67 | –0.69 |
| 12 | L2S16 |
| 7.0 | 8.6 | 16.6 | 0.55 | 1.58 |
| 13 | L3S08 | Malpighian metaplasia + slight dysplasia | 6.0 | 6.3 | 6.3 | 0.67 | 0.34 |
| 14 | L3S11 | Hyperplasia of basal cell | 4.5 | 5.4 | 10.1 | 0.48 | 0.91 |
| 15 | L3S12 | Hyperplasia of basal cell + mucosecretion | 5.0 | 5.6 | 7.1 | 0.58 | 0.64 |
| 16 | L3S13 | Hyperplasia of basal cell + mucosecretion | 5.0 | 6.0 | 10.3 | 0.51 | 0.96 |
| 17 | L3S14 | Normal epithelium + malpighian metaplasia | 3.0 | 4.0 | 12.6 | 0.85 | 1.03 |
| 18 | L3S17 | Normal epithelium + malpighian metaplasia | 3.0 | 3.2 | 7.3 | 0.91 | 0.17 |
| 19 | L3S18 | Malpighian metaplasia + slight dysplasia | 6.0 | 4.8 | 13.4 | 0.95 | –1.20 |
| 20 | L3S19 | Malpighian metaplasia + slight dysplasia | 6.0 | 4.7 | 14.5 | 0.69 | –1.31 |
| 21 | L4S02 | Normal epithelium + malpighian metaplasia | 3.0 | 3.2 | 6.2 | 0.59 | 0.17 |
| 22 | L4S07 | Normal epithelium | 1.0 | 1.8 | 12.8 | 1.10 | 0.80 |
| 23 | L4S08 | Normal epithelium + mucosecretion | 2.0 | 2.2 | 14.8 | 1.37 | 0.16 |
| 24 | L4S09 | Normal epithelium | 1.0 | 0.4 | 19.0 | 1.70 | –0.59 |
| 25 | L4S13 | Normal epithelium + mucosecretion | 2.0 | 3.4 | 17.0 | 1.05 | 1.42 |
| 26 | L4S15 | Maplighian metaplasia | 5.5 | 4.5 | 13.0 | 0.90 | –1.03 |
| Mean of RMSE | Mean of StDev | Mean of bias | |||||
| 12.3 | 0.78 | 0.16 | |||||
RMSE: Root Mean Square Error.
List of all used calibration samples with the lame number, spot position (allowing us to present sample images in Fig. 3e) and pathologist expertise. Reference aggressiveness is presented next to the predicted values of PLS internal cross validation. The 3 last columns show the prediction error, standard deviation and bias for each sample and the last row shows the mean of these 3 statistical values reached with internal cross validation.
Fig. 3Results of the various chemometric steps required in calibration. (a) Representation of the mean (dotted line) and of min/max variability (grey zone) of the data matrix of calibration obtained after EMSC pre-processing. (b) Principal Component Analysis (PCA) projection of the calibration data matrix on the first 3 components. The percentage of variance for each PC is also indicated. Spectra are coloured according to the reference aggressiveness scale, based on histology expertise. The colours follow the rainbow order with blue associated with normal tissues (score of 1) and red-brown for NSCLC invasive tissues (score of 9). (c) Cross-validation results of the Partial Least Squares Discriminant Analysis (PLS-DA) cascade model. Inputs and outputs correspond to K-means clusters and sub-clusters highlighted by ASK. The histograms refer to confusion matrixes obtained at each of the two levels of the model cascade. Clusters and sub-clusters of interest correspond to clusters containing epithelial cells, whatever the aggressiveness phenotype appraised. Only clusters identified as of interest by the level 1 PLS-DA model were used to develop the level 2 model. (d) Root Mean Square Error (RMSE) of the aggressiveness scoring model (based on the PLS algorithm). The blue and red curves correspond to the RMSE of calibration and internal validation (image by image cross validation) as a function of H respectively, with H the iteration number (or also the number of computed dimensions of the PLS vectorial space). (e) Predicted aggressiveness score based on the IR model as a function of the reference aggressiveness score based on histology for each image of the calibration set. The prediction was realized at the pixel level. For each image, the prediction was visualised by an ellipse, with the ellipse centre corresponding to the mean predicted score and the ellipse vertical axis corresponding to pixel score standard deviation.
Fig. 4Prediction of the aggressiveness for independent samples of the test set. Analysis of 4 representative samples containing cells of different aggressiveness phenotypes. For each sample, the HE-stained image, PLS color-coded image and the associated histogram are obtained.
Results of the external validation on independent samples
| Sample image number | Pathologist expertise attribution (external validation images were prior chosen for their tissue heterogeneity) | Predicted aggressiveness (mean of pixel scores for one image) | Standard deviation (at the pixel level) |
| 27 | Normal epithelium + malpighian metaplasia | 2.5 | 2.40 |
| 28 | Normal epithelium + invasive SCC | 3.8 | 0.81 |
| 29 |
| 6.6 | 0.76 |
| 30 | Invasive SCC | 7.2 | 1.20 |
| 31 | Normal epithelium | 2.7 | 0.74 |
| 32 | Normal epithelium + malpighian metaplasia | 3.5 | 0.86 |
| 33 |
| 6.4 | 1.30 |
| 34 | Invasive SCC | 7.5 | 0.51 |
For the first four samples (#27 to #30), the results of the IR scoring correspond to the PLS color-coded images shown in Fig. 4.
Fig. 1Schematic processing organigram used to process infrared multivariate data. The chemometric chain follows 8 steps applied to different sets of images. The 3 black lines concern the considered sample set (all, calibration or external validation samples). EMSC (Extended Multiplicative Signal Correction), ASK (Automatic Serial K-means), PLS-DA (Partial Least Squares Discriminant Analysis) and PLS (Partial Least Squares) algorithms were employed sequentially as described.