| Literature DB >> 29500362 |
Daniel Xia1,2,3,4, Ruben Casanova5, Devayani Machiraju5, Trevor D McKee6, Walter Weder7, Andrew H Beck8, Alex Soltermann9.
Abstract
The goal of this study is to use computational pathology to help guide the development of human-based prognostic H&E biomarker(s) suitable for research and potential clinical use in lung squamous cell carcinoma (SCC). We started with high-throughput computational image analysis with tissue microarrays (TMAs) to screen for histologic features associated with patient overall survival, and found that features related to stromal inflammation were the most strongly prognostic. Based on this, we developed an H&E stromal inflammation (SI) score. The prognostic value of the SI score was validated by two blinded human observers on two large cohorts from a single institution. The SI score was found to be reproducible on TMAs (Spearman rho = 0.88 between the two observers), and highly prognostic (e.g. hazard ratio = 0.32; 95% confidence interval: 0.19-0.54; p-value = 2.5 × 10-5 in multivariate analyses), particularly in comparison to established histologic biomarkers. Guided by downstream molecular/biomarker correlation studies starting with TCGA cases, we investigated the hypothesis that epithelial PD-L1 expression modified the prognostic value of SI. Our research demonstrates that computational pathology can be an efficient hypothesis generator for human pathology research, and support the histologic evaluation of SI as a prognostic biomarker in lung SCCs.Entities:
Mesh:
Substances:
Year: 2018 PMID: 29500362 PMCID: PMC5834457 DOI: 10.1038/s41598-018-22254-4
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
A comparison of the relative strengths and weaknesses of human pathology and computational histologic image analysis. The following is intended to illustrate the rational basis for the current project (summarized in Fig. 1). Considerations 1–2 favor the application of computational pathology as a screening tool for hypothesis generation. Considerations 4–6 outline some of the reasons for why human pathologists are arguably more suited for current clinical practice. Consideration 3 is a relative advantage of computer-based image analysis, but the concordance between and within observers for human-based histologic biomarkers vary widely. In the case of the SI score, the intra- and inter-observer concordance was found to be high, at least on tissue microarrays (see Results and Supplementary Figure 1).
| Computer-based histologic image analysis | Human-based histology | |
|---|---|---|
| 1. Number of features assessed at one time | Strength: Can be a relatively large number (e.g. > 1000) | Weakness: Usually a relatively small number (e.g. < 10) |
| 2. Bias in selecting features | Strength: No inherent bias in favor of either neoplastic or non-neoplastic tissue | Weakness: Potential bias towards epithelial biomarkers, since the classification of neoplastic tissue is a traditional focus of oncological pathology |
| 3. Intra- and inter-observer variability | Strength: The same algorithm analyzing the same digital image should give the same result every time | Weakness: Reproducibility can be an issue for human pathologists, but this depends greatly on the feature assessed |
| 4. Use in routine clinical practice | Weakness: Not currently | Strength: Human-based histopathology is the current gold-standard in clinical practice |
| 5. Versatility across different clinical and pathologic settings | Weakness: Algorithms developed for a specific application may not work well in other settings, e.g. an algorithm trained only on breast cancer examples may not interpret foci of adjacent benign breast lobules appropriately, if the system has not been trained to identify normal breast tissue. | Strength: Human pathologists are currently more versatile than computer algorithms trained for specific applications. This represents a distinct advantage in the practice of general surgical pathology, which depends heavily on a very wide breadth of knowledge of different tissue types and pathological processes. |
| 6. Availability of method | Weakness: Currently not widely available; requires specialized software and hardware | Strength: Currently widely available; can be used by people in any research or clinical laboratory |
Figure 1Study Overview. Note: the image of the human brain is in the Public Domain and was obtained from the Wikimedia Commons (https://upload.wikimedia.org/wikipedia/commons/8/88/PSM_V46_D167_Outer_surface_of_the_human_brain.jpg). The image of the human eye was modified from the original image (“A blue iris. A human eye.” created by user 8thstar at the English language Wikipedia) obtained from the Wikimedia Commons (https://upload.wikimedia.org/ wikipedia/commons/8/84/A_blue_eye.jpg) under the Creative Commons (CC) BY-SA 3.0 license (https://creativecommons.org/licenses/by-sa/3.0/). According to the terms of this license, this particular Fig. 1 is also distributed under CC BY-SA 3.0 terms with no additional restrictions. Abbreviations: tissue microarray (TMA), The Cancer Genome Atlas (TCGA).
Figure 2Computationally-guided histologic hypothesis generation. (A) Left: unprocessed H&E tissue microarray (TMA) image of lung squamous cell carcinoma. Middle-left: using labeled examples of tumor stroma and epithelia provided by one author, the software learned to divide regions of all TMA images into epithelia (orange) or stroma (blue). The computational classification was correct in many instances, but did have trouble distinguishing inflamed epithelium from inflamed stroma (typically calling all such areas stroma). Middle-right: epithelial objects (yellow = small; orange = medium; brown = large objects); while nearly every tumor nuclei is correctly accounted for, the cytoplasmic borders of some tumors were not appropriately captured, thereby underestimating the extent of at least some tumor cells (i.e. in solid tumor nests [orange areas], there should few to no gaps between epithelial objects). Right: stromal objects. 768 epithelial and 768 stromal features were quantified by the software for each image. After combining with clinical data, four computationally-measured stromal (i.e. from the blue regions) features were found to be significantly associated with overall survival at a cut-off false discovery rate of <0.05. No features from the epithelia (orange region) was significant at this cutoff. (B) The four significant features and representative images from the highest and lowest ranked cases (for illustration, only one of the two images for each case) is shown. Three of the four significant features were associated with the amount of stromal lymphoplasmacytic inflammation by visual review, where more inflammation was associated with better prognosis. A more complete manual review of the four features is available in the Supplementary Data (for Features 1–4).
The stromal inflammation (SI) score is significantly associated with overall survival in the univariate Cox proportional hazards model for the full TMA dataset (n = 437). This list of variables is ordered by p-values (most significant listed first). The SI score is highlighted. The values for observers 1 and 2 are separated by “/.”
| Variable | Type of variable | Hazard ratio (95% CI) | P-value |
|---|---|---|---|
| Overall stage | Clinical | 1.64 (1.42–1.90) | 1.5 × 10−11 |
| Vascular invasion | Histologic | 2.27 (1.79–2.89) | 1.7 × 10−11 |
| T | Clinical | 1.53 (1.33–1.77) | 3.2 × 10−9 |
| Age (years) | Clinical | 1.03 (1.02–1.05) | 5.5 × 10−7 |
| M | Clinical | 3.46 (2.10–5.71) | 1.1 × 10−6 |
| N | Clinical | 1.40 (1.21–1.63) | 7.5 × 10−6 |
|
| |||
| Tumor size (cm) | Clinical | 1.10 (1.05–1.16) | 1.1 × 10−4 |
| Pleural invasion | Histologic | 1.48 (1.17–1.90) | 0.0013 |
| Tumor grade | Histologic | 1.14 (0.91–1.42) | 0.23 |
| Gender (M/F) | Clinical | 0.84 (0.64–1.12) | 0.25 |
| Pack years | Clinical | 1.00 (0.99–1.00) | 0.71 |
The stromal inflammation (SI) score is significantly associated with overall survival in multivariate Cox proportional hazard (CPH) analyses for the full TMA dataset (n = 423 complete cases). The variables in the table are ordered by p-values (most significant listed first). Only the variables listed are part of the CPH analyses. The SI score is highlighted. The results for observers 1 (this table) and 2 (see Supplementary Table 3) were similar.
| Variable | Hazard ratio (95% CI) | P-value |
|---|---|---|
| Overall stage | 1.59 (1.36–1.85) | 2.7 × 10−9 |
| Age (years) | 1.04 (1.02–1.05) | 6.3 × 10−8 |
| Vascular invasion | 1.84 (1.44–2.37) | 1.3 × 10−6 |
|
|
|
|
| Pleural invasion | 1.16 (0.90–1.50) | 0.26 |
| Tumor grade | 1.11 (0.88–1.39) | 0.39 |
Figure 3Human validation of the histologic hypothesis as a prognostic biomarker. (A) Example of manual scoring of SI from observers (Obs) 1 and 2. Patient 1 was deceased and had a low SI score; patient 2 was living and had a high SI score. (B) Kaplan Meier survival analysis for the full TMA dataset. Cases were divided into high SI (red; >median SI score; n = 195) and low SI (black; ≤median SI score; n = 229) groups. Survival was significantly better for the high SI group in comparison to the low group (median survival of 65.0 vs 33.3 months, respectively; log rank p-value = 4.6 × 10−5). The results for observers 1 (this Figure) and 2 (see Supplementary Figure 2) were similar.
Figure 4The relationship between SI, PD-L1 expression, and overall survival. (A and B) PD-L1 expression was not strongly associated with inflammation. (A) By gene expression profiling, CD274 (PD-L1) RNA levels did not correlate strongly with RNA levels of genes expressed by immune cells in lung SCC TCGA cases. The numbers in each box are the Spearman rho values for the expression levels of the gene-pair combinations (red = high correlation; white = low correlation). (B) PD-L1 protein expression by immunohistochemistry did not correlate strongly with histologic SI scores in lung SCC TMA cases (Spearman rho = 0.20). (C and D) The prognostic value of the SI score is modified by epithelial PD-L1 expression. Cases from the TMA cohorts were separated by PD-L1 expression (high versus low). (C) SI and survival when PD-L1 expression was low. (D) SI and survival when PD-L1 expression was high. The interaction term for this Cox proportional hazards model was trending towards significance (interaction p-values = 0.056). The results from observer 1 (this Figure) and observer 2 (Supplementary Figure 4) were similar.