| Literature DB >> 28139704 |
Olivier Gevaert1, Sebastian Echegaray2, Amanda Khuong3, Chuong D Hoang3, Joseph B Shrager3, Kirstin C Jensen4,5, Gerald J Berry4, H Henry Guo2, Charles Lau6, Sylvia K Plevritis2, Daniel L Rubin2, Sandy Napel2, Ann N Leung2.
Abstract
Molecular analysis of the mutation status for EGFR and KRAS are now routine in the management of non-small cell lung cancer. Radiogenomics, the linking of medical images with the genomic properties of human tumors, provides exciting opportunities for non-invasive diagnostics and prognostics. We investigated whether EGFR and KRAS mutation status can be predicted using imaging data. To accomplish this, we studied 186 cases of NSCLC with preoperative thin-slice CT scans. A thoracic radiologist annotated 89 semantic image features of each patient's tumor. Next, we built a decision tree to predict the presence of EGFR and KRAS mutations. We found a statistically significant model for predicting EGFR but not for KRAS mutations. The test set area under the ROC curve for predicting EGFR mutation status was 0.89. The final decision tree used four variables: emphysema, airway abnormality, the percentage of ground glass component and the type of tumor margin. The presence of either of the first two features predicts a wild type status for EGFR while the presence of any ground glass component indicates EGFR mutations. These results show the potential of quantitative imaging to predict molecular properties in a non-invasive manner, as CT imaging is more readily available than biopsies.Entities:
Mesh:
Substances:
Year: 2017 PMID: 28139704 PMCID: PMC5282551 DOI: 10.1038/srep41674
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Clinical data for the NSCLC cohort (N = 186).
| Number | Percentage | |
|---|---|---|
| Male | 120 | 65% |
| Female | 66 | 35% |
| AdenoCarcinoma | 153 | 82% |
| AdenoCarcinoma (BAC) | 1 | 1% |
| Squamous cell carcinoma | 29 | 16% |
| NSCLC | 3 | 2% |
| non smoker | 42 | 23% |
| former smoker | 113 | 61% |
| current smoker | 31 | 17% |
| Academic center | 113 | 61% |
| VA | 73 | 39% |
| positive | 40 | 22% |
| negative | 110 | 59% |
| missing | 36 | 19% |
| positive | 32 | 17% |
| negative | 118 | 63% |
| missing | 36 | 19% |
Univariate correlation of EGFR mutation status with semantic image features.
| Semantic feature | Test | P-value | Q-value |
|---|---|---|---|
| Emphysema: Presence | Fisher exact test | 6.26E-09 | 4.02E-07 |
| Primary Emphysema Laterality: Both | Fisher exact test | 1.09E-08 | 3.50E-07 |
| Overall Emphysema Severity: Multi-class with increasing % of emphysema | Spearman rho | 1.98E-08 | 4.23E-07 |
| Ground glass category: Multi-class with increasing % of GGO | Spearman rho | 2.20E-08 | 3.53E-07 |
| Primary Distribution: Upper predominant | Fisher exact test | 8.84E-08 | 1.14E-06 |
| Lung Parenchyma Features: Presence of airway abnormality | Fisher exact test | 3.76E-07 | 4.02E-06 |
| Nodule Internal Features: Presence of reticulation | Fisher exact test | 1.96E-05 | 0.00017956 |
| Overall Emphysema Severity: Low severity (1–25%) vs. rest | Fisher exact test | 2.75E-05 | 0.00022074 |
| Nodule Attenuation: Solid | Fisher exact test | 4.99E-05 | 0.00035601 |
| Nodule Periphery: Normal | Fisher exact test | 0.00010845 | 0.00069696 |
| Primary Emphysema Pattern: Centrilobular | Fisher exact test | 0.00011886 | 0.00069446 |
| Nodule Attenuation: Solidness More Than 5 mm | Fisher exact test | 0.00069605 | 0.0037278 |
| Nodule Periphery: Presence of emphysema | Fisher exact test | 0.00082816 | 0.0040941 |
| Nodule Associated Findings: Presence of entering airway | Fisher exact test | 0.0011145 | 0.0051163 |
| Nodule Margins: Primary Pattern poorly defined | Fisher exact test | 0.0018075 | 0.0077444 |
| Nodule Margins: Multi-categorical Primary Pattern | Spearman rho | 0.0018343 | 0.0073679 |
Figure 1Demonstration of some of the semantic features applied to tumors in our cohort.
Note some features (e.g. airway abnormalities, emphysema) are not always depicted on the cross-sections showing the tumor. (A) Solid, lobulated squamous cell carcinoma with emphysema, (B) part solid, smooth adenocarcinoma, (C) ground glass poorly defined adenocarcinoma with airway abnormality, (D) part solid, lobulated adenocarcinoma, (E) part solid, poorly defined adenocarcinoma, (F) part solid, poorly defined adenocarcinoma.
Figure 2ROC curve showing sensitivity/specificity tradeoff for predicting EGFR mutation status using 5 semantic features.
Top five features for the two analyses; using all non-small cell lung cancers (NSCLC), and focusing only on adenocarcinoma.
| Image feature | Percentage selected in N = 100 iterations |
|---|---|
| Emphysema: presence | 98% |
| Lung Parenchyma Features: presence of airway abnormality | 96% |
| Nodule Margins: Multi-categorical Primary Pattern | 94% |
| Nodule Attenuation: Multi-class with increasing size of solid component | 58% |
| Nodule Attenuation: Solid | 37% |
| Emphysema: presence | 93% |
| Lung Parenchyma Features: presence of airway abnormality | 92% |
| Nodule Margins: Multi-categorical Primary Pattern | 92% |
| Nodule Attenuation: Multi-class with increasing size of solid component | 47% |
| Nodule Attenuation: Solid | 44% |
Figure 3Decision tree for predicting EGFR mutation status using a combination of five semantic image features.
Inter-reader variability of the features in the final model for predicting EGFR mutation status.
| Image feature | Cohen’s kappa |
|---|---|
| Emphysema | 0.85 |
| Airway abnormality | 0.30 |
| Nodule attenuation | 0.16 |
| Nodule margin | 0.46 |