| Literature DB >> 31581201 |
Katherine Tian1,2, Christopher A Rubadue1, Douglas I Lin1, Mitko Veta3, Michael E Pyle1, Humayun Irshad1, Yujing J Heng1,4.
Abstract
We developed an automated 2-tiered Fuhrman's grading system for clear cell renal cell carcinoma (ccRCC). Whole slide images (WSI) and clinical data were retrieved for 395 The Cancer Genome Atlas (TCGA) ccRCC cases. Pathologist 1 reviewed and selected regions of interests (ROIs). Nuclear segmentation was performed. Quantitative morphological, intensity, and texture features (n = 72) were extracted. Features associated with grade were identified by constructing a Lasso model using data from cases with concordant 2-tiered Fuhrman's grades between TCGA and Pathologist 1 (training set n = 235; held-out test set n = 42). Discordant cases (n = 118) were additionally reviewed by Pathologist 2. Cox proportional hazard model evaluated the prognostic efficacy of the predicted grades in an extended test set which was created by combining the test set and discordant cases (n = 160). The Lasso model consisted of 26 features and predicted grade with 84.6% sensitivity and 81.3% specificity in the test set. In the extended test set, predicted grade was significantly associated with overall survival after adjusting for age and gender (Hazard Ratio 2.05; 95% CI 1.21-3.47); manual grades were not prognostic. Future work can adapt our computational system to predict WHO/ISUP grades, and validating this system on other ccRCC cohorts.Entities:
Year: 2019 PMID: 31581201 PMCID: PMC6776313 DOI: 10.1371/journal.pone.0222641
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Fig 1Schematic diagram showing how regions of interest (ROIs) were identified by Pathologist 1.
Pathologist 1 identified ROIs and assigned a Fuhrman grade for each ROI. The highest grade among all ROIs was the “Grade by Pathologist 1”. Each case also had a “TCGA grade” retrieved from the TCGA database.
Fig 2From whole slide image to patches for image processing and nuclei segmentation.
Fig 3Examples of nuclei detection and segmentation in low and high grade clear cell renal cell carcinoma.
The rightmost column shows computer-generated segmentation mask where cell nuclei are labelled white against a black background. The middle column shows the overlay of segmented nuclei (green spots) over each hematoxylin and eosin (H&E) patch.
Fig 4Data summarization and the selection of the representative region of interest (ROI).
Fig 5A summary of the workflow used to develop the 2-tiered clear cell renal cell carcinoma (ccRCC) grade classification.
Seven machine learning classification methods were evaluated to determine the optimal method to develop a robust classification model for ccRCC using cases from the Training Set (A). Lasso regression produced an average area under the receiver operator characteristic curve of 0.84 and identified nuclei histomics features associated with ccRCC grade. The Test Set was used to evaluate the performance of the final model; and grades were predicted in the Extended Test Set (B).
Demographic table of the 395 The Cancer Genome Atlas (TCGA) clear cell renal cell carcinoma cases with 2-tiered histological grade (low and high).
Note that the TCGA grade for each patient in the discordant set is the opposite grade assigned by Pathologist 1.
| Concordant | Discordant cases | Discordant cases | |||||
|---|---|---|---|---|---|---|---|
| Total | Low | High | Low | High | Low | High | |
| Cases, | 395 (100) | 162 (58.5) | 115 (41.5) | 28 (23.7) | 90 (76.3) | 90 (76.3) | 28 (23.7) |
| Age group, | |||||||
| <50 | 80 (20.3) | 36 (22.2) | 22 (19.1) | 4 (14.3) | 18 (20.0) | 18 (20.0) | 4 (14.3) |
| 50–59 | 106 (26.8) | 50 (30.9) | 29 (25.2) | 6 (21.4) | 21 (23.3) | 21 (23.3) | 6 (21.4) |
| 60–69 | 109 (27.6) | 37 (22.8) | 33 (28.7) | 10 (35.7) | 29 (32.2) | 29 (32.2) | 10 (35.7) |
| 70–79 | 82 (20.8) | 31 (19.1) | 26 (22.6) | 8 (28.6) | 17 (18.9) | 17 (18.9) | 8 (28.6) |
| >80 | 18 (4.6) | 8 (4.9) | 5 (4.3) | 0 (0.0) | 5 (5.6) | 5 (5.6) | 0 (0.0) |
| Gender, | |||||||
| Female | 130 (32.9) | 67 (41.4) | 28 (24.3) | 10 (35.7) | 25 (27.8) | 25 (27.8) | 10 (35.7) |
| Male | 265 (67.1) | 95 (58.6) | 87 (75.7) | 18 (64.3) | 65 (72.2) | 65 (72.2) | 18 (64.3) |
| Race, | |||||||
| Asian | 7 (1.8) | 3 (1.9) | 2 (1.7) | 0 (0.0) | 2 (2.2) | 2 (2.2) | 0 (0.0) |
| Black | 33 (8.4) | 13 (8.0) | 10 (8.7) | 2 (7.1) | 8 (8.9) | 8 (8.9) | 2 (7.1) |
| White | 349 (88.4) | 142 (87.7) | 102 (88.7) | 25 (89.3) | 80 (88.9) | 80 (88.9) | 25 (89.3) |
| Not reported | 6 (1.5) | 4 (2.5) | 1 (0.9) | 1 (3.6) | 0 (0.0) | 0 (0.0) | 1 (3.6) |
| Stage, | |||||||
| Stage I | 207 (52.4) | 121 (74.7) | 35 (30.4) | 13 (46.4) | 38 (42.2) | 38 (42.2) | 13 (46.4) |
| Stage II | 44 (11.1) | 17 (10.5) | 15 (13.0) | 5 (17.9) | 7 (7.8) | 7 (7.8) | 5 (17.9) |
| Stage III | 92 (23.3) | 18 (11.1) | 36 (31.3) | 8 (28.6) | 30 (33.3) | 30 (33.3) | 8 (28.6) |
| Stage IV | 52 (13.2) | 6 (3.7) | 29 (25.2) | 2 (7.1) | 15 (16.7) | 15 (16.7) | 2 (7.1) |
| Type of Treatment, | |||||||
| Chemotherapy | 7 (1.8) | 3 (1.9) | 4 (3.5) | 0 (0.0) | 0 (0.0) | 0 (0.0) | 0 (0.0) |
| Immunotherapy | 6 (1.5) | 2 (1.2) | 2 (1.7) | 0 (0.0) | 2 (2.2) | 2 (2.2) | 0 (0.0) |
| Molecular therapy | 79 (20.0) | 31 (19.1) | 24 (20.9) | 5 (17.9) | 19 (21.1) | 19 (21.1) | 5 (17.9) |
| Radiation | 10 (2.5) | 2 (1.2) | 4 (3.5) | 2 (7.1) | 2 (2.2) | 2 (2.2) | 2 (7.1) |
| Mixed therapy | 21 (5.3) | 4 (2.5) | 10 (8.7) | 2 (7.1) | 5 (5.6) | 5 (5.6) | 2 (7.1) |
| Unknown | 272 (68.9) | 120 (74.1) | 71 (61.7) | 19 (67.9) | 62 (68.9) | 62 (68.9) | 19 (67.9) |
Nuclear histomics features associated with 2-tiered ccRCC grade selected in the final Lasso classification model (18 unique features; 26 total features).
| Feature | Type | Biological | Color | Summary | Coefficient | |
|---|---|---|---|---|---|---|
| Elongation | Morphology | Nuclear pleomorphism, | - | MAD | -1.51E-01 | |
| Minor axis of the Ellipse Fit | Morphology | Nuclear pleomorphism, | - | Median | -1.25E+00 | |
| Flatness | Morphology | Nuclear shape (irregular) | - | MAD | -4.20E-16 | |
| Kurtosis | Intensity | Uneven distribution of | HSV | Median | 5.38E-03 | |
| Skewness | Intensity | Uneven distribution of | H&E | MAD | -2.22E-01 | -2.22E-01 |
| HSV | Median | -2.87E-01 | ||||
| Lab | MAD | -1.10E-01 | ||||
| Correlation | Texture | Granularity of chromatin ( | HSV | MAD | -2.48E-02 | |
| Haralick Correlation | Texture | Granularity of chromatin ( | H&E | Median | -2.24E-01 | |
| Energy | Texture | Granularity of chromatin ( | Lab | Median | -9.36E-01 | |
| MAD | -3.74E-01 | |||||
| Inverse difference moment | Texture | Granularity of chromatin ( | H&E | Median | -4.81E-01 | |
| MAD | -4.19E-02 | |||||
| HSV | Median | 1.67E-01 | ||||
| MAD | -1.19E-01 | |||||
| Inertia | Texture | Granularity of chromatin ( | H&E | Median | -1.77E-01 | |
| Lab | Median | -9.72E-02 | ||||
| Entropy | Texture | Granularity of chromatin ( | HSV | Median | 8.97E-03 | |
| Low gray-level run emphasis | Texture | Granularity of chromatin ( | H&E | MAD | -7.52E-02 | |
| Long run high gray-level emphasis | Texture | Granularity of chromatin ( | HSV | MAD | 1.50E-01 | |
| Long run low gray-level emphasis | Texture | Granularity of chromatin ( | H&E | MAD | -7.71E-16 | |
| Short run high gray-level emphasis | Texture | Granularity of chromatin ( | HSV | MAD | 4.02E-02 | |
| Short run low gray-level emphasis | Texture | Granularity of chromatin ( | H&E | MAD | -1.28E-15 | |
| Gray level non-uniformity | Texture | Granularity of chromatin ( | HSV | Median | -4.26E-01 | |
| Lab | MAD | 2.45E+00 | ||||
| High gray-level run emphasis | Texture | Granularity of chromatin ( | HSV | MAD | 6.21E-03 |
a) Correlation is a co-occurrence based texture feature, describing roughness and repeated direction inside the nuclei.
b) Co-occurrence based texture feature, describing roughness inside the nuclei.
c) Run-length matrix based texture feature, describing randomness of gray-level distribution.
d) Run-length matrix based texture feature, describing coarseness inside nuclei.
MAD: median absolute deviation; Lab: Lab color space; HSV: hue-saturation-value color space; H&E: Hematoxylin and Eosin color space. Median and MAD were used to summarize the data extracted at the patch level to the region of interest (ROI) level.
Fig 6Prognostic efficacy of predicted grades.
Cases predicted as high grade have significantly poorer overall survival rates compared to cases predicted as low grade in the extended test set (hazard ratio 2.07, 95% confidence interval of 1.25–3.43, p<0.01; 65 death events among 160 cases). The shaded areas reflect the 95% confidence interval for high or low grade.
Fig 7Kaplan-Meier curves comparing manual and predicted grades with overall survival in concordant and discordant cases.
Grades assigned by TCGA/Pathologist 1 were significantly associated with overall survival within the concordant cases (A). In the discordant set, neither grades assigned by TCGA (B) nor Pathologist 1 (C) were associated with overall survival while predicted grade remained significantly prognostic (D). Please refer to Table 3 for hazard ratios and 95% confidence intervals for each analysis. The shaded areas reflect the 95% confidence interval for high or low grade.
The association of manual or computer predicted 2-tiered grade with overall survival in the concordant and discordant cases.
| Manual Grade | Computer Predicted Grade | |||||
|---|---|---|---|---|---|---|
| Hazard | (95% CI) | Hazard | (95% CI) | |||
| Model A: Crude | 3.12 | (2.00, 4.86) | NA | NA | NA | |
| Model B: Adjusted for Age and Gender | 3.00 | (1.91, 4.71) | NA | NA | NA | |
| Model C: Adjusted for Age, Gender, and Stage | 1.59 | (0.99, 2.57) | 0.06 | NA | NA | NA |
| Model A: Crude | 1.16 | (0.59, 2.25) | 0.67 | 2.01 | (1.14, 3.54) | |
| Model B: Adjusted for Age and Gender | 1.09 | (0.56, 2.13) | 0.80 | 2.31 | (1.26, 4.24) | |
| Model C: Adjusted for Age, Gender, and Stage | 1.08 | (0.55, 2.11) | 0.83 | 1.83 | (0.98, 3.41) | 0.06 |
| Model A: Crude | 0.86 | (0.44, 1.68) | 0.67 | NA | NA | NA |
| Model B: Adjusted for Age and Gender | 0.92 | (0.47, 1.79) | 0.80 | NA | NA | NA |
| Model C: Adjusted for Age, Gender, and Stage | 0.93 | (0.47, 1.82) | 0.83 | NA | NA | NA |
| Model A: Crude | 1.09 | (0.63, 1.89) | 0.75 | NA | NA | NA |
| Model B: Adjusted for Age and Gender | 1.21 | (0.70, 2.10) | 0.50 | NA | NA | NA |
| Model C: Adjusted for Age, Gender, and Stage | 1.15 | (0.66, 2.00) | 0.62 | NA | NA | NA |
Confidence Interval, CI
The association of manual or computer predicted 2-tiered grade with overall survival in 118 discordant cases.
| Manual Grade | Computer Predicted Grade | |||||
|---|---|---|---|---|---|---|
| Hazard | (95% CI) | Hazard | (95% CI) | |||
| Model A: Crude | 0.99 | (0.44, 2.23) | 0.99 | 2.05 | (1.00, 4.21) | |
| Model B: Adjusted for Age and Gender | 1.04 | (0.46, 2.34) | 0.93 | 2.42 | (1.13, 5.20) | |
| Model C: Adjusted for Age, Gender, and Stage | 1.18 | (0.52, 2.68) | 0.69 | 1.89 | (0.87, 4.12) | 0.11 |
| Model A: Crude | 0.64 | (0.19, 2.20) | 0.48 | 2.49 | (0.83, 7.45) | 0.10 |
| Model B: Adjusted for Age and Gender | 0.63 | (0.18, 2.17) | 0.46 | 2.49 | (0.72, 7.28) | 0.16 |
| Model C: Adjusted for Age, Gender, and Stage | 0.59 | (0.17, 2.03) | 0.40 | 2.03 | (0.62, 6.66) | 0.24 |
| Model A: Crude | 1.56 | (0.46, 5.31) | 0.48 | NA | NA | NA |
| Model B: Adjusted for Age and Gender | 1.58 | (0.46, 5.44) | 0.46 | NA | NA | NA |
| Model C: Adjusted for Age, Gender, and Stage | 1.70 | (0.49, 5.89) | 0.40 | NA | NA | NA |
Confidence Interval, CI