| Literature DB >> 35626070 |
Victor Garcia1, Katherine Elfer1,2, Dieter J E Peeters3,4,5, Anna Ehinger6, Bruce Werness7,8, Amy Ly9, Xiaoxian Li10, Matthew G Hanna11, Kim R M Blenman12,13, Roberto Salgado14,15, Brandon D Gallas1.
Abstract
The High Throughput Truthing project aims to develop a dataset for validating artificial intelligence and machine learning models (AI/ML) fit for regulatory purposes. The context of this AI/ML validation dataset is the reporting of stromal tumor-infiltrating lymphocytes (sTILs) density evaluations in hematoxylin and eosin-stained invasive breast cancer biopsy specimens. After completing the pilot study, we found notable variability in the sTILs estimates as well as inconsistencies and gaps in the provided training to pathologists. Using the pilot study data and an expert panel, we created custom training materials to improve pathologist annotation quality for the pivotal study. We categorized regions of interest (ROIs) based on their mean sTILs density and selected ROIs with the highest and lowest sTILs variability. In a series of eight one-hour sessions, the expert panel reviewed each ROI and provided verbal density estimates and comments on features that confounded the sTILs evaluation. We aggregated and shaped the comments to identify pitfalls and instructions to improve our training materials. From these selected ROIs, we created a training set and proficiency test set to improve pathologist training with the goal to improve data collection for the pivotal study. We are not exploring AI/ML performance in this paper. Instead, we are creating materials that will train crowd-sourced pathologists to be the reference standard in a pivotal study to create an AI/ML model validation dataset. The issues discussed here are also important for clinicians to understand about the evaluation of sTILs in clinical practice and can provide insight to developers of AI/ML models.Entities:
Keywords: biomarker; expert panel; pathologist training/education; tumor-infiltrating lymphocytes; validation dataset
Year: 2022 PMID: 35626070 PMCID: PMC9139395 DOI: 10.3390/cancers14102467
Source DB: PubMed Journal: Cancers (Basel) ISSN: 2072-6694 Impact factor: 6.575
Figure 1Plot of the stromal tumor-infiltrating lymphocytes (sTILs) variance vs. mean density for the pilot study. Plotted data include all data collected on the digital platform caMicroscope (All ROIs, circles) and the same data restricted to regions of interest (ROIs) selected for further evaluation by the expert panel (Select ROIs, solid triangles). ROIs were selected to be distributed equally across low, moderate, and high infiltration levels based on the clinically recommended thresholds of 10% and 40%, represented by the vertical dashed lines. Within each density bin, ROIs with the highest and lowest variance and entropy were selected for expert panel review. nAll and nSelect are the respective counts of All ROIs and Select ROIs within each bin: “≤ 10%” nAll = 385, nSelect = 27; “10% < % ≤ 40%” nAll = 152, nSelect = 25; “> 40%” nAll = 33, nSelect = 20.
Figure 2Example regions of interest (ROIs) selected by pathologist variance of stromal tumor-infiltrating lymphocytes (sTILs). Each ROI is 500 µm × 500 µm and has a 100 µm bar for scale. The collected annotations for these ROIs are listed in Table 1 and Table 2. (A) High sTILs variance, mean sTILs density ≤ 10% (LE10). (B) Low sTILs variance, mean sTILs density ≤ 10% (LE10). (C) High sTILs variance, mean sTILs density > 40% (GT40). (D) Low sTILs variance, mean sTILs density > 40% (GT40).
Figure 3Example regions of interest (ROIs) selected by pathologist entropy of the ROI label. Each ROI is 500 µm × 500 µm and has a 100 µm bar for scale. The collected annotations for these ROIs are listed in Table 1 and Table 2. (A) High ROI label entropy, mean sTILs density ≤ 10% (LE10). (B) Low ROI label entropy, mean sTILs density ≤ 10% (LE10). (C) High ROI label entropy, mean sTILs density > 40% (GT40). (D) Low ROI label entropy, mean sTILs density > 40% (GT40).
Summary statistics of collected annotations from crowd pathologists for the example regions of interest (ROIs) in Figure 2 and Figure 3 within the stromal tumor-infiltrating lymphocytes (sTILs) density bins of “less than or equal to 10%” (LE10) and “greater than 40%” (GT40). For the High Entropy LE10 (Figure 3A) case, there is a tie for the Majority Label; the multiple labels are separated by “*AND*”.
| Figure | Figure Description | Mean sTILs | Variance | Majority Label | Entropy |
|---|---|---|---|---|---|
| 2A | High Variance LE10 | 10 | 400 | Intra-Tumoral Stroma | 1.01 |
| 2B | Low Variance LE10 | 0 | 0 | Other Regions | 0.64 |
| 2C | High Variance GT40 | 64.2 | 1008.2 | Intra-Tumoral Stroma | 0.45 |
| 2D | Low Variance GT40 | 79.83 | 58.97 | Intra-Tumoral Stroma | 0.64 |
| 3A | High Entropy LE10 | 3.5 | 9.67 | Intra-Tumoral Stroma *AND* | 1.1 |
| 3B | Low Entropy LE10 | 9.75 | 70.79 | Intra-Tumoral Stroma | 0 |
| 3C | High Entropy GT40 | 69.08 | 775.9 | Intra-Tumoral Stroma | 0.86 |
| 3D | Low Entropy GT40 | 66.83 | 212.17 | Intra-Tumoral Stroma | 0 |
Frequency of region of interest (ROI) labels of collected annotations from crowd pathologists for the example ROIs in Figure 2 and Figure 3 within the stromal tumor-infiltrating lymphocytes (sTILs) density bins of “less than or equal to 10%” (LE10) and “greater than 40%” (GT40).
| Figure | Figure | Invasive Margin | Intra-Tumoral Stroma | Tumor with No Intervening Stroma | Other Regions |
|---|---|---|---|---|---|
| 2A | High Variance LE10 | 1 | 3 | 2 | 0 |
| 2B | Low Variance LE10 | 0 | 2 | 0 | 4 |
| 2C | High Variance GT40 | 0 | 5 | 1 | 0 |
| 2D | Low Variance GT40 | 2 | 4 | 0 | 0 |
| 3A | High Entropy LE10 | 2 | 2 | 2 | 0 |
| 3B | Low Entropy LE10 | 0 | 8 | 0 | 0 |
| 3C | High Entropy GT40 | 2 | 10 | 3 | 0 |
| 3D | Low Entropy GT40 | 0 | 6 | 0 | 0 |
Figure 4Plot of the variance vs. mean stromal tumor-infiltrating lymphocytes (sTILs) density for the regions of interest (ROIs) selected for the expert panel. ROIs are matched on their unique case identifiers and plotted with the mean sTILs density as determined by the crowd pathologists. Variances belonging to the same ROI are connected by straight lines. Cases in which there was no calculable variance from the experts’ assessment are represented by an open circle (Crowd) without a connected blue triangle (Experts).
Variance summary statistics of the crowd annotations from all regions of interest (ROIs) scored using the caMicroscope modality (Crowd-All), the crowd annotations of the selected ROIs (Crowd-Select), and the expert panel’s annotations of the selected ROIs. Data are grouped by the mean stromal tumor-infiltrating lymphocytes (sTILs) density bin and reported as the median variance and (interquartile range).
| All Densities | ≤10% | 10% < % ≤ 40% | >40% | |
|---|---|---|---|---|
| Crowd-All | 48.10 | 30.70 | 111.50 | 324.55 |
| Crowd-Select | 212.24 | 44.67 | 246.80 | 358.75 |
| Experts | 14.17 | 3.07 | 70.00 | 96.67 |
Figure 5Plot of the region of interest (ROI) label entropy vs. mean stromal tumor-infiltrating lymphocytes (sTILs) density for the ROIs selected for the expert panel. ROIs are matched on their case identifiers and plotted with their mean sTILs score density as determined by the crowd pathologists. ROI label entropies belonging to the same ROI are connected by straight lines.
Entropy summary statistics of the labels from all regions of interest (ROIs) scored using the caMicroscope modality (Crowd-All), the crowd labels of the selected ROIs (Crowd–Select), and the expert panel’s labels of the selected ROIs. Data are grouped by the mean stromal tumor-infiltrating lymphocytes (sTILs) density bin and reported as the median entropy and (interquartile range).
| All Densities | ≤10% | 10% < % ≤ 40% | >40% | |
|---|---|---|---|---|
| Crowd-All | 0.23 (0.00–0.45) | 0.23 (0.00–0.41) | 0.24 (0.00- 0.50) | 0.00 (0.00–0.45) |
| Crowd-Select | 0.56 (0.00–0.86) | 0.64 (0.45–0.99) | 0.64 (0.24–0.92) | 0.45 (0.00–0.52) |
| Experts | 0.00 (0.00–0.45) | 0.00 (0.00–0.45) | 0.00 (0.00–0.45) | 0.00 (0.00–0.50) |
Frequency of the calculated majority region of interest (ROI) labels. Counts are grouped as all ROIs scored using caMicroscope (Crowd–All), selected ROIs annotated by the crowd pathologists (Crowd–Select), and the selected ROIs annotated by the experts (Experts).
| Majority Label | Crowd-All | Crowd-Select | Experts |
|---|---|---|---|
| Intra-Tumoral Stroma | 525 (82.03%) | 54 (75%) | 56 (77.78%) |
| Intra-Tumoral Stroma *AND* Invasive Margin | 10 (1.56%) | 1 (1.39%) | 1 (1.39%) |
| Intra-Tumoral Stroma *AND* Invasive Margin | 1 (0.16%) | 1 (1.39%) | 0 (0%) |
| Intra-Tumoral Stroma *AND* Other Regions | 2 (0.31%) | 0 (0%) | 2 (2.78%) |
| Intra-Tumoral Stroma *AND* Tumor with No Intervening Stroma | 4 (0.62%) | 1 (1.39%) | 0 (0%) |
| Invasive Margin | 8 (1.25%) | 2 (2.78%) | 1 (1.39%) |
| Invasive Margin *AND* Other Regions | 1 (0.16%) | 1 (1.39%) | 0 (0%) |
| Other Regions | 80 (12.5%) | 7 (9.72%) | 12 (16.67%) |
| Tumor with No Intervening Stroma | 9 (1.41%) | 5 (6.94%) | 0 (0%) |
Summary of pitfalls encountered during the stromal tumor-infiltrating lymphocytes (sTILs) assessment grouped by pitfall type. Region of interest is abbreviated as “ROI”.
| Pitfall Type | Pitfall Summary |
|---|---|
| Percent of Tumor-Associated Stroma | Exclude thick-walled vessels, benign glandular elements, adipocytes, carcinoma in situ, and necrosis from the area of tumor-associated stroma |
| Calculate with respect to the entire ROI area | |
| Variations in tumor cell morphology can make it difficult to distinguish stroma from tumor | |
| sTILs Density Score | Cells with small/pyknotic nuclei and/or perinuclear clearing can be difficult to categorize |
| Non-lymphoid cells may be confused for lymphocytes | |
| Error in the percent tumor-associated stroma can affect the sTILs density | |
| Sparsely distributed tumor cells may be more challenging to quantitate |