| Literature DB >> 34885094 |
Michelle Hershman1, Bardia Yousefi1,2, Lacey Serletti3, Maya Galperin-Aizenberg1, Leonid Roshkovan1, José Marcio Luna1,2, Jeffrey C Thompson4, Charu Aggarwal5, Erica L Carpenter5, Despina Kontos1,2, Sharyn I Katz1.
Abstract
This study tackles interobserver variability with respect to specialty training in manual segmentation of non-small cell lung cancer (NSCLC). Four readers included for segmentation are: a data scientist (BY), a medical student (LS), a radiology trainee (MH), and a specialty-trained radiologist (SK) for a total of 293 patients from two publicly available databases. Sørensen-Dice (SD) coefficients and low rank Pearson correlation coefficients (CC) of 429 radiomics were calculated to assess interobserver variability. Cox proportional hazard (CPH) models and Kaplan-Meier (KM) curves of overall survival (OS) prediction for each dataset were also generated. SD and CC for segmentations demonstrated high similarities, yielding, SD: 0.79 and CC: 0.92 (BY-SK), SD: 0.81 and CC: 0.83 (LS-SK), and SD: 0.84 and CC: 0.91 (MH-SK) in average for both databases, respectively. OS through the maximal CPH model for the two datasets yielded c-statistics of 0.7 (95% CI) and 0.69 (95% CI), while adding radiomic and clinical variables (sex, stage/morphological status, and histology) together. KM curves also showed significant discrimination between high- and low-risk patients (p-value < 0.005). This supports that readers' level of training and clinical experience may not significantly influence the ability to extract accurate radiomic features for NSCLC on CT. This potentially allows flexibility in the training required to produce robust prognostic imaging biomarkers for potential clinical translation.Entities:
Keywords: computed tomography (CT); interobserver variability; non-small cell lung cancer; radiomics
Year: 2021 PMID: 34885094 PMCID: PMC8657389 DOI: 10.3390/cancers13235985
Source DB: PubMed Journal: Cancers (Basel) ISSN: 2072-6694 Impact factor: 6.575
Figure 1Workflow of the approach. The NSCLC tumor is segmented from the original CT images by four segmenters (n = 4) with different backgrounds, yielding radiomics features and tumor masks as inputs. Next, PCA categorizes features based on their maximum variance in radiomics. For every group, three principal components of feature sets are selected and used for correlative analysis and prediction of survival.
Clinical and demographic data including gender, type of NSCLC, and stage of cancer for collected patients in NSCLC-Radiomics-Genomics (Harvard) lung dataset is presented.
| NSCLC-Radiomics-Genomics | ||
|---|---|---|
| Gender | Male | 61 (68.5%) |
| Clinical combined stage curated | Stage I | 39 (43.8%) |
| Non-small cell lung cancer (NSCLC) | Adenocarcinoma, | 42 (47.2%) |
| Event | Recurrence or death | 46 (51.7%) |
Figure 2Number of patients included in study. Two publicly available datasets were analyzed in the study, the NSCLC-Radiomics-Genomics-Lung3 (Harvard) dataset and the NSCLC-Radiogenomics (Stanford dataset). Eighty-nine patients and 211 patients are part of the Harvard and Stanford datasets, respectively. A total of 3 patients were excluded from the Harvard dataset and 4 patients were excluded from the Stanford dataset due to lack of available data. Tumor types consisted of adenocarcinoma (Adeno), squamous cell carcinoma (SCC), and other types of NSCLC. A total of 293 patients were segmented as part of the study.
Clinical and demographic data including age, race, type of NSCLC, EGFR, and KRAS receptor status, and smoking status for collected patients NSCLC-Radiogenomics (Stanford) is presented.
| NSCLC-Radiogenomics | ||
|---|---|---|
| Age | Median (±IQR) | 69 (43,87) |
| Gender | Male | 133 (64.2%) |
| Race | Caucasian | 120 (57.4%) |
| Smoking Status | Non-smoking | 47 (22.7%) |
| EGFR-Mutation Status | Wildtype | 128 (61.8%) |
| KRAS Mutation Status | Wildtype | 130 (62.8%) |
| Histology | Adenocarcinoma | 170 (82.1%) |
| Solid-Subsolid | Solid | 134 (64.7%) |
| Event | Recurrence or death | 41(21.1%) |
Similarity of the radiomic signatures using multiple scoring methods among different segmenters are presented.
| NSCLC Dataset | Similarity among Segmenters | ||||||
|---|---|---|---|---|---|---|---|
| Segmenters ID | Correlation Score | Dice Score | Precision(%) | Recall (%) | Boundary Distance | Volume Difference | |
|
| BY | 0.92 | 0.89 (±0.25) | 81.8 (±21.8) | 86.1 (±24.5) | 1.2 (±2.7) | 1.1 (±0.5) |
| LS | 0.94 | 0.82 (±0.14) | 81.2 (±2.7) | 69.6 (±24.5) | 6.5 (±26.4) | 2.3 (±21.1) | |
| MH | 0.95 | 0.84 (±0.20) | 72.3 (±22.4) | 88.7 (±18.9) | 4.2 (±15.1) | 0.6 (±1.9) | |
|
| BY | 0.93 | 0.69 (±0.28) | 77.8 (±25.1) | 87.3 (±25.2) | 2.92 (±10.7) | 0.3 (±0.8) |
| LS | 0.72 | 0.80 (±0.27) | 84.2 (±31.5) | 47.8 (±29.9) | 16.6 (±52.6) | 0.3 (±1.2) | |
| MH | 0.87 | 0.83 (±0.23) | 80 (±24.3) | 77.1 (±24.7) | 6.2 (±26.1) | 1.4 (±16.9) | |
Figure 3Two visual comparisons of low-rank radiomics representation with their boxplots relation for labels provided by BY, LS, MH, and SK for two different NSCLC Radiogenomics datasets.
Figure 43D tumor volume. 3D tumor volumes for four segmentation cases and two different NSCLC Radiogenomics datasets.
Overall survival, Cox regression. Using the low-rank representation of the radiomic signatures survival prediction is measured for each segmenter.
| Prediction Survival | |||||||||
|---|---|---|---|---|---|---|---|---|---|
| NSCLC Datasets | Modeling Covariates | BY | LS | MH | SK-RS | ||||
| c-Statistic (95% CI) | c-Statistic (95% CI) | c-Statistic (95% CI) | c-Statistic(95% CI) | ||||||
|
| clinical and demographic 2 | 0.64 | 0.2 | ||||||
| Three PC radiomic signatures | 0.6 | 0.5 | 0.62 | 0.08 | 0.59 | 0.2 | 0.65 | 0.03 | |
| Radiomic signatures, clinical and demographic | 0.65 | 0.3 | 0.68 | 0.04 | 0.66 | 0.2 | 0.7 | 0.03 | |
|
| clinical and demographic 3 | 0.6 | 0.007 | ||||||
| Three PC radiomic signatures | 0.65 | 0.001 | 0.64 | 0.04 | 0.67 | 0.003 | 0.65 | 0.003 | |
| Radiomic signatures, clinical and demographic | 0.71 | <0.005 | 0.68 | 0.003 | 0.71 | <0.005 | 0.69 | <0.005 | |
CI: confidence interval. 1 p-value by likelihood ratio test versus the hypothesis that the model is no better than the null model. 2 Clinical and demographic covariates for LUNG3-NSCLC-Radiomics-Genomics Harvard Dataset: sex, stage status, and histology. 3 Clinical and demographic covariates for NSCLC-Radiogenomics Stanford Dataset: sex, morphological status, and histology.
Figure 5Kaplan-Meier curves for multivariate models of overall survival using low-rank radiomics show significant differences between high- and low-risk patients for each segmenter and NSCLC dataset using median risk score in the model.