Guixue Liu1, Zhihan Xu2, Yingqian Ge2, Beibei Jiang1, Harry Groen3, Rozemarijn Vliegenthart4, Xueqian Xie1. 1. Department of Radiology, Shanghai General Hospital, Shanghai Jiao Tong University School of Medicine, Shanghai, China. 2. Siemens Healthineers Ltd, Shanghai, China. 3. Department of Lung Diseases, University Medical Center Groningen, University of Groningen, Hanzeplein 1, 9700RB Groningen, The Netherlands. 4. Department of Radiology, University Medical Center Groningen, University of Groningen, Hanzeplein 1, 9700RB Groningen, The Netherlands.
Abstract
BACKGROUND: To establish a radiomic approach to identify epidermal growth factor receptor (EGFR) mutation status in lung adenocarcinoma patients based on CT images, and to distinguish exon-19 deletion and exon-21 L858R mutation. METHODS: Two hundred sixty-three patients who underwent pre-surgical contrast-enhanced CT and molecular testing were included, and randomly divided into the training (80%) and test (20%) cohort. Tumor images were three-dimensionally segmented to extract 1,672 radiomic features. Clinical features (age, gender, and smoking history) were added to build classification models together with radiomic features. Subsequently, the top-10 most relevant features were used to establish classifiers. For the classifying tasks including EGFR mutation, exon-19 deletion, and exon-21 L858R mutation, four logistic regression models were established for each task. RESULTS: The training and test cohort consisted of 210 and 53 patients, respectively. Among the established models, the highest accuracy and sensitivity among the four models were 75.5% (61.7-86.2%) and 92.9% (76.5-99.1%) to classify EGFR mutation, respectively. The highest specificity values were 86.7% (69.3-96.2%) and 70.4% (49.8-86.3%) to classify exon-19 deletion and exon-21 L858R mutation, respectively. CONCLUSIONS: CT radiomics can sensitively identify the presence of EGFR mutation, and increase the certainty of distinguishing exon-19 deletion and exon-21 L858R mutation in lung adenocarcinoma patients. CT radiomics may become a helpful non-invasive biomarker to select EGFR mutation patients for invasive sampling. 2020 Translational Lung Cancer Research. All rights reserved.
BACKGROUND: To establish a radiomic approach to identify epidermal growth factor receptor (EGFR) mutation status in lung adenocarcinoma patients based on CT images, and to distinguish exon-19 deletion and exon-21 L858R mutation. METHODS: Two hundred sixty-three patients who underwent pre-surgical contrast-enhanced CT and molecular testing were included, and randomly divided into the training (80%) and test (20%) cohort. Tumor images were three-dimensionally segmented to extract 1,672 radiomic features. Clinical features (age, gender, and smoking history) were added to build classification models together with radiomic features. Subsequently, the top-10 most relevant features were used to establish classifiers. For the classifying tasks including EGFR mutation, exon-19 deletion, and exon-21 L858R mutation, four logistic regression models were established for each task. RESULTS: The training and test cohort consisted of 210 and 53 patients, respectively. Among the established models, the highest accuracy and sensitivity among the four models were 75.5% (61.7-86.2%) and 92.9% (76.5-99.1%) to classify EGFR mutation, respectively. The highest specificity values were 86.7% (69.3-96.2%) and 70.4% (49.8-86.3%) to classify exon-19 deletion and exon-21 L858R mutation, respectively. CONCLUSIONS: CT radiomics can sensitively identify the presence of EGFR mutation, and increase the certainty of distinguishing exon-19 deletion and exon-21 L858R mutation in lung adenocarcinoma patients. CT radiomics may become a helpful non-invasive biomarker to select EGFR mutation patients for invasive sampling. 2020 Translational Lung Cancer Research. All rights reserved.
Lung cancer is the most common cause of mortality among malignant tumors. With the application of lung cancer CT screening and advancement of treatment methods, the 5-year survival rate has increased from 10–20% to 15–30% (1). Lung adenocarcinoma is the most common subtype of lung cancer, accounting for 40% (2). The core mechanism of carcinogenesis of lung adenocarcinoma is the somatic mutation of epidermal growth factor receptor (EGFR) that leads to persistent activation of EGFR tyrosine kinase domain in tumor tissue. Structural changes of the EGFR gene due to activating mutations are associated with the proliferation of adenocarcinoma cells. The therapeutic effect of tyrosine kinase inhibitors (TKIs) depends on the mutation type of the EGFR gene (3,4). Tumor response percentage of patients with activating mutant EGFR is 68% and the progression-free survival time in these patients is 12.1 months compared with only 11% and 3.4 months in patients with wild type EGFR (5,6). The guideline from the College of American Pathologists, the International Association for the Study of Lung Cancer, and the Association for Molecular Pathology recommends routine testing for EGFR mutations to guide molecular targeted therapy in adenocarcinoma patients (7).EGFR mutation sites mostly locate in exons 18, 19, 20, and/or 21. The two most common activating mutations are exon-19 deletion (Del19) and exon-21 mutation (L858R) (8,9). Patients who have tumors harboring one of these two mutations exhibit a greater objective response rate to afatinib, erlotinib, and gefitinib (10). There is differential prognosis based on EGFR mutation: the overall survival time for patients with exon-19 and -21 mutations with targeted therapy was found to be 41 to 44 months, longer than those with exon-18 mutation (19 months) (11). Therefore, detection of specific EGFR mutations may be of interest in guiding the application of targeted therapy in lung adenocarcinoma patients.The detection of EGFR mutation status relies on gene sequencing or amplification refractory mutation systems. These detection techniques require invasive sampling methods such as surgical excision or tissue biopsy (12), associated with costs and patient discomfort. More importantly, the intra-tumor heterogeneity leads to heterogenous biopsy sampling results. The implementation of these molecular tests is still moderate to poor. A non-invasive approach would be helpful to screen for EGFR mutation to guide further treatment before the clinical intervention. Because imaging phenotype seems to reflect the genotype of lung cancer (13,14), imaging morphology, features, and patterns, such as texture and intensity distribution are often used to derive information regarding diagnosis, tumor heterogeneity, and prognosis. In comparison with a traditional visual inspection, radiomics transforms medical images into minable data to develop models to guide clinical decision (15,16). Zhou et al. found no difference in CT morphology in 346 patients with and without EGFR mutation (17). Rios Velazquez et al. analyzed 763 adenocarcinoma cases and observed an AUC of 0.69 to distinguish mutant EGFR based on radiological features (18). However, there have been improved results on the association between CT radiomic features and EGFR mutation. Yip et al. revealed that radiomic features could classify the mutation status of EGFR (19). To our knowledge, no study has comprehensively investigated whether the different activating EGFR mutations can be estimated from imaging features. We performed a comprehensive CT-derived radiomics analysis on lung adenocarcinoma patients to classify the presence of the EGFR mutation status, and whether we are able to classify exon-19 deletion and exon-21 L858R mutation.
Methods
Study population
From July 2017 to December 2018, an exhaustive search in the electronic health records was performed for lung cancer patients who had molecular testing for EGFR mutation in our hospital. The inclusion criteria were: (I) tumor sample tissue obtained by surgery; (II) pathologically diagnosed adenocarcinoma based on hematoxylin-eosin and immunohistochemistry staining; (III) pre-surgery thin-slice (<1 mm) contrast-enhanced CT scan; (IV) time interval less than 2 months between CT scan and surgery. Patients were excluded if the tumor edge was indistinguishable with the naked eyes, and thus impossible to accurately segment on the CT images, caused by factors such as unclear demarcation between the tumor and surrounding consolidation or pleural effusion, etc. This retrospective study was approved by the Institutional Review Committee and waived the requirement for informed consent.The collected data were patient characteristics, smoking status including non-smoker or smoker (former or current), histological subtype of adenocarcinoma and molecular outcome. The study flow diagram is shown in .
Figure 1
The study workflow diagram. (I) Tumor segmentation: the tumor was three-dimensionally semi-automatically segmented in chest CT images. (II) Feature extraction: 1,672 radiomic features were automatically extracted from the segmented tumor volume. (III) Feature selection: radiomic features were selected using the minimum redundancy maximum relevance algorithm. (IV) Logistic regression: models were trained and tested to determine their classification performance
The study workflow diagram. (I) Tumor segmentation: the tumor was three-dimensionally semi-automatically segmented in chest CT images. (II) Feature extraction: 1,672 radiomic features were automatically extracted from the segmented tumor volume. (III) Feature selection: radiomic features were selected using the minimum redundancy maximum relevance algorithm. (IV) Logistic regression: models were trained and tested to determine their classification performance
CT acquisition
CT examinations were performed on three modern CT systems (Somatom Force, Siemens Healthineers; Revolution and HD750, GE Healthcare). All included patients underwent contrast-enhanced chest CT scanning after injection of 60–80 mL contrast-media (Iopamiro 300, Bracco) into the antecubital vein at 3 mL/sec. The reconstructed slice thickness was 0.6 or 0.625 mm. The detailed acquisition protocol and reconstruction parameters are listed in .
Table S1
CT acquisition protocols and image reconstruction parameters
Parameter
Siemens Somatom Force
GE Revolution
GE HD750
Acquisition mode
Helical
Helical
Helical
Tube voltage, kV
100
120
120
Tube current, mA
130 (quality reference)
100–400 (smart mode)
100–400 (smart mode)
Collimation, mm
192×0.6
256×0.625
64×0.625
Pitch
1.2
0.984
1.39
Rotation time, ms
500
500
500
Reconstruction kernel
Br44
Standard
Standard
Field of view, mm
325
325
325
Slice thickness, mm
0.6
0.625
0.625
Slice increment, mm
0.6
0.625
0.625
Radiation dose, mGy
≈4.52
≈5.20
≈10.78
Histologic evaluation and molecular testing
The criteria for the histological definition of adenocarcinoma were according to the 2015 World Health Organization classification of lung cancer (20). Testing for EGFR mutations including exon expression of 18, 19, 20, and 21 was performed by using a human EGFR gene mutation detection kit (Aide Biomedical Technology).
Tumor segmentation and radiomic feature extraction
We used a radiomics analysis package (Radiomics 1.0.9a, Siemens Healthineers) on a research platform (SyngoVia VB10b, Research Frontier, Siemens Healthineers) to three-dimensionally segment the tumor on CT images as a volume of interest and extracted radiomic features (21). Two radiologists with experiences of at least five years in diagnostic chest CT semi-automatically segmented these tumors, by finding the lesion and click on it. Then the software automatically segmented the lesion.A total of 1,672 features were extracted from each tumor (https://cdn.amegroups.cn/static/application/8373764accc56018fb4582edf66e4fe4/tlcr-20-122-1.pdf). The extracted computational features were classified into original features, filtered (9 types) and wavelet transformed (8 types). The composition and classification of features were listed in . The algorithmic principles are explained in the Supplementary File.
Figure S1
Schematic diagram of the composition and classification of 1,672 radiomic features.
Feature selection
Features with a consistent correlation coefficient (CCC) greater than 0.70 (22) were selected for further analysis. Subsequently, these candidate features were characterized by the minimum redundancy maximum relevance (mRMR) algorithm, which allows for further feature selection by maximizing the correlation between features and label while minimizing the interaction information among features (23).All candidate radiomic features were ranked according to their explanatory power with regards to the presence of EGFR mutation. The most relevant features were additionally selected and clustered. The thresholds of significance and effect size were set to 0.05 and 0.1 for feature relevance, respectively. All features satisfying these relevance criteria were considered for subsequent decorrelation mRMR procedure. In this second part of the mRMR algorithm process, redundant features are removed. For decorrelation, classic mRMR was chosen in this study. The maximum number of features was defined as 10 and a fast mRMR algorithm was applied to select the proper number of features.
Model establishment and validation
The enrolled patients were randomly divided into training (80%) and test (20%) cohort for model establishment and validation, respectively. Five-fold cross-validation was performed for the data in the training cohort, to make sure that the stability of the established models was sufficient. A logistic regression algorithm was used as a machine learning-based classification model built with the features selected from the univariate statistics on the training cohort. Logistic regression, a traditional method in radiomic research, has the advantages of understandability and interpretability, and can incorporate both discrete and continuous variables (24,25).Six logistic regression models were established by Radiomics software (Radiomics 1.0.9a, Siemens Healthineer). Three logistic regression models based on radiomic features were built to detect EGFR mutation, and to identify exon-19 deletion and exon-21 mutations, respectively (Model EGFR_A, Exon19A, and Exon21A). In addition, three multivariate logistic regression models were generated based on radiomic features combined with clinical factors (age, gender, and smoking history) (Model EGFR_B, Exon19B, and Exon21B). The whole training cohort was used to establish each model.The most relevant model was determined by the best subset forward selection method. This method starts with an empty model and adds in first 10 features one by one. In each forward step, the one feature that gives the best improvement to the model is added. The most relevant model was finally selected by the model selection criterion. For each model, the most relevant feature subset was chosen by Akaike information criterion (AIC) and Bayesian information criterion (BIC). When fitting model, it is possible to lead to overfitting if increasing the likelihood by adding parameters, AIC and BIC are designed to handle this problem by introducing a penalty term for the number of parameters in the model. This penalty term is smaller in AIC than in BIC. 95% confidence level was reported when assessing the best subset’s model.
Statistics
The normality of data was assessed by the Kolmogorov-Smirnov test. An independent-sample t-test or Mann-Whitney U test was used for continuous variables and chi-square or Fisher’s exact test was applied for comparing the categorical variables. Two observers independently evaluated 50 randomly selected patients to access inter-observer agreement of feature extraction expressed as an intraclass correlation coefficient (ICC). High ICC indicates high inter-observer agreement. The created logistic regression models were used to evaluate the classification of EGFR mutation in the independent test cohort. The model performance was assessed by using receiver operating characteristic (ROC) curve and area under the ROC curve (AUC), sensitivity, specificity, and confusion matrix. Pairwise comparisons were performed among AUCs by DeLong’s test (26). The model validation and statistical analysis part were performed by two software packages (Matlab R2019a, Mathworks; MedCalc 19.0.4, MedCalc Software). A P value <0.05 was deemed to indicate statistical significance.
Results
Demographics
Among 472 Chinese candidate patients, 263 (mean age 62.5±9.4 years) were finally included (). The training and test cohort consisted of 210 and 53 patients, respectively. The inclusion flowchart is shown in . The tumors of all patients were histologically confirmed lung adenocarcinoma, including 215 (81.7%) invasive adenocarcinoma, 30 (11.4%) microinvasive adenocarcinoma, and 18 (6.8%) other types. In terms of CT lesion density, 153 (58.2%) were solid, and 110 (41.8%) were sub-solid. With regards to molecular testing, 84 (31.9%) of cancers were wild type, and 179 (68.1%) showed EGFR mutation, including 73 (27.8%) exon-19 deletion, and 99 (37.6%) exon-21 L858R mutation. Seven (2.7%) had a mutation of either exon-18 G719X, exon-20 20-INS, exon-20 S768I, exon-20 T790M, exon-21 L859R, exon-21 L860R, or exon-21 L861R. shows the relationship between tumor location, age, and maximum tumor diameter. shows several representative cases.
Table 1
Patient characteristics
Characteristics
Training cohort (n=210)
Test cohort (n=53)
EGFR mutant (n=141)
EGFR wild type (n=69)
P value
EGFR mutant (n=38)
EGFR wild type (n=15)
P value
Age (mean ± SD)
61.4±9.7
63.0±8.8
0.365
63.4±7.4
68.9±10.7
0.066
Gender
0.002*
0.005*
Male
65 (45.8%)
48 (33.8%)
16 (11.3%)
13 (9.2%)
Female
76 (62.8%)
21 (17.4%)
22 (18.2%)
2 (1.7%)
Smoking status
0.067
0.090
Smoker
12 (38.7%)
12 (38.7%)
3 (9.7%)
4 (12.%9)
Non-smoker
129 (55.6%)
57 (24.6%)
35 (15.1%)
11 (4.7%)
Lesion density
0.001*
0.127
Subsolid
67 (60.9%)
19 (17.3%)
20 (18.2%)
4 (3.6%)
Solid
74 (48.4%)
50 (32.7%)
18 (11.8%)
11 (7.2%)
Maximum 3D diameter
0.304
0.556
≥3 cm
66 (51.2%)
38 (29.5%)
19 (14.7%)
6 (4.7%)
<3 cm
75 (56.0%)
31 (23.1%)
19 (14.2%)
9 (6.7%)
Lesion location
0.447
0.115
Right upper lobe
45 (51.7%)
23 (26.4%)
11 (12.6%)
8 (9.2%)
Right middle lobe
13 (65.0%)
2 (10.0%)
4 (20.0%)
1 (5.0%)
Right lower lobe
25 (55.6%)
10 (22.2%)
10 (22.2%)
0 (0%)
Left upper lobe
38 (53.5%)
21 (29.6%)
9 (12.7%)
3 (4.2%)
Left lower lobe
20 (50%)
13 (32.5%)
4 (10.5%)
3 (7.5%)
Histologic subtype
0.669
0.524
Invasive adenocarcinoma
113 (52.6%)
57 (26.5%)
33 (15.3%)
12 (5.6%)
Microinvasive adenocarcinoma
18 (60.0%)
6 (20.0%)
4 (13.3%)
2 (6.7%)
Other subtypes
10 (55.6%)
6 (33.3%)
1 (5.6%)
1 (5.6%)
Data are represented as mean ± standard deviation or number (percentage). P value stands for the comparison between EGFR mutant and wildtype. * indicates significantly correlated. Other subtypes include 6 adenocarcinomas in situ, 4 low differentiated adenocarcinomas, 1 poor differentiated adenocarcinoma, 3 unclassified adenocarcinomas, 1 mixed invasive adenocarcinoma and sarcomatoid carcinoma, 1 micropapillary adenocarcinoma, 1 solid adenocarcinoma with the mucus secretion, 1 mixed infiltrating mucinous and non-mucinous adenocarcinoma. EGFR, epidermal growth factor receptor; SD, standard deviation.
Figure 2
The patient selection workflow of the study cohort. EGFR, epidermal growth factor receptor.
Figure 3
The bubble chart of lung cancers about the relationship between location, age, and the maximum diameter of tumors. Each bubble represents the maximum three-dimensional diameter of the tumor. This figure shows the distribution of the variable size of lung adenocarcinomas among five pulmonary lobes.
Figure 4
Representative CT images with tumor segmentation by the radiomics analysis platform. (I) Axial views show a lobulated lung tumor for image segmentation. (II) Axial view, (III) coronal view, and (IV) sagittal view showed segmented lung tumors indicated by a yellow overlay. (V) Three-dimensional views of the segmented tumors. (A) (I-V) A 61-year-old male non-smoker, with EGFR wild-type lung adenocarcinoma. CT mediastinal window showed a solid mass in the upper lobe of the left lung with lobulation sign and a maximum diameter of about 20 mm. (B) (I-V) A 58-year-old male smoker, with lung adenocarcinoma of EGFR exon-19 deletion. CT mediastinal window showed a solid mass in the upper lobe of the right lung with a maximum diameter of about 35 mm, rough margin, and lobulated sign. (C) (I-V) A 48-year-old female non-smoker, with lung adenocarcinoma of EGFR exon-21 L858R mutation. CT mediastinal window showed a solid mass in the lower lobe of the left lung with a maximum diameter of about 36 mm, burrs, and lobulation. EGFR, epidermal growth factor receptor.
Data are represented as mean ± standard deviation or number (percentage). P value stands for the comparison between EGFR mutant and wildtype. * indicates significantly correlated. Other subtypes include 6 adenocarcinomas in situ, 4 low differentiated adenocarcinomas, 1 poor differentiated adenocarcinoma, 3 unclassified adenocarcinomas, 1 mixed invasive adenocarcinoma and sarcomatoid carcinoma, 1 micropapillary adenocarcinoma, 1 solid adenocarcinoma with the mucus secretion, 1 mixed infiltrating mucinous and non-mucinous adenocarcinoma. EGFR, epidermal growth factor receptor; SD, standard deviation.The patient selection workflow of the study cohort. EGFR, epidermal growth factor receptor.The bubble chart of lung cancers about the relationship between location, age, and the maximum diameter of tumors. Each bubble represents the maximum three-dimensional diameter of the tumor. This figure shows the distribution of the variable size of lung adenocarcinomas among five pulmonary lobes.Representative CT images with tumor segmentation by the radiomics analysis platform. (I) Axial views show a lobulated lung tumor for image segmentation. (II) Axial view, (III) coronal view, and (IV) sagittal view showed segmented lung tumors indicated by a yellow overlay. (V) Three-dimensional views of the segmented tumors. (A) (I-V) A 61-year-old male non-smoker, with EGFR wild-type lung adenocarcinoma. CT mediastinal window showed a solid mass in the upper lobe of the left lung with lobulation sign and a maximum diameter of about 20 mm. (B) (I-V) A 58-year-old male smoker, with lung adenocarcinoma of EGFR exon-19 deletion. CT mediastinal window showed a solid mass in the upper lobe of the right lung with a maximum diameter of about 35 mm, rough margin, and lobulated sign. (C) (I-V) A 48-year-old female non-smoker, with lung adenocarcinoma of EGFR exon-21 L858R mutation. CT mediastinal window showed a solid mass in the lower lobe of the left lung with a maximum diameter of about 36 mm, burrs, and lobulation. EGFR, epidermal growth factor receptor.In univariate analysis, female gender and subsolid density showed a significant positive correlation with EGFR mutation (P<0.01). EGFR exon-19 deletion associated with higher age (P=0.014), but was not associated with other clinical factors (P>0.05). Exon-21 mutation was more common in men (P=0.016) and in solid lesions (P=0.005).
Classification of EGFR mutation status
A total of 1,672 radiomic features were extracted from each tumor. The ICC between two assessments for all extracted features was 0.937±0.063, indicating an optimal interobserver agreement. Summary of diagnostic metrics to detect EGFR mutation, exon-19 deletion, and exon-21 L858R mutation is shown in .
Table 2
Summary of diagnostic metrics to detect EGFR mutation, exon-19 deletion, and exon-21 L858R mutation in the training and test cohort
Mutation
Clinical information
Information criterion
Training (n=210)
Testing (n=53)
AUC
Cut-off value
AUC
Sensitivity
Specificity
Accuracy
EGFR
No
Akaike
0.73 (0.66–0.79)
0.71
0.70 (0.56–0.82)
82.8% (64.2–94.2%)
41.7% (22.1–63.4%)
64.2% (49.8–76.9%)
Bayesian
0.71 (0.65–0.77)
0.67
0.65 (0.51–0.78)
84.0% (63.9–95.5%)
39.3% (21.5–59.4%)
60.4% (46.0–73.6%)
Yes
Akaike
0.78 (0.72–0.83)
0.68
0.76 (0.63–0.87)
90.3% (74.3–98.0%)
54.6% (32.2–75.6%)
75.5% (61.7–86.2%)
Bayesian
0.77 (0.70–0.82)
0.63
0.76 (0.62–0.84)
92.9% (76.5–99.1%)
52.0% (31.3–72.2%)
73.6% (59.7–84.7%)
Exon-19 del
No
Akaike
0.75 (0.69–0.81)
0.40
0.70 (0.56–0.82)
50.0% (24.7–75.4%)
83.8% (68.0–93.8%)
73.6% (59.7–84.7%)
Bayesian
0.66 (0.59–0.73)
0.24
0.65 (0.50–0.77)
38.1% (18.1–61.6%)
81.3% (63.6–92.8%)
64.2% (49.8–76.9%)
Yes
Akaike
0.74 (0.68–0.80)
0.30
0.69 (0.55–0.81)
43.5% (23.2–65.5%)
86.7% (69.3–96.2%)
67.9% (53.7–80.1%)
Bayesian
0.65 (0.58–0.71)
0.27
0.64 (0.49–0.77)
36.0% (18.0– 57.5%)
82.1% (63.1–93.9%)
60.4% (46.0–73.6%)
Exon-21 L858R
No
Akaike
0.78 (0.71–0.83)
0.26
0.65 (0.50–0.77)
48.2% (28.7–68.1%)
65.4% (44.3–82.8%)
56.6% (42.3–70.2%)
Bayesian
0.75 (0.69–0.81)
0.41
0.65 (0.50–0.77)
52.0% (31.3–72.2%)
67.9% (47.7–84.1%)
60.4% (46.0–73.6%)
Yes
Akaike
0.77 (0.70–0.82)
0.29
0.66 (0.52–0.79)
53.9% (33.4–73.4%)
70.4% (49.8–86.3%)
62.3% (47.9–75.2%)
Bayesian
0.75 (0.69–0.81)
0.30
0.65 (0.50–0.77)
52.0% (31.3–72.2%)
67.9% (47.7–84.1%)
60.4% (46.0–73.6%)
To establish a multivariate logistic regression model for only radiomic features (Model EGFR_A), all features with CCC >0.7 were first selected before proceeding feature processing. Then 172 features that were both statistically significantly correlated with EGFR mutation and non-redundant were finally selected by the mRMR algorithm. Subsequently, Model EGFR_A was established by selecting the top 10 most relevant features, which were all image texture features (). When clinical factors were integrated (Model EGFR_B), the resulting top 10 features changed to one clinical character (gender), three first-order, and six texture features ().
Table S2
The top 10 most relevant features (not including clinical factors) for the detection of EGFR mutation (Model EGFR_A)
AUC, area under the receiver operating characteristic curve; MI, mutual information, represents redundancy; P value is the significance of the minimum redundancy maximum relevance (mRMR).
Table S3
The top 10 most relevant features (including clinical factors) for the detection of EGFR mutation (Model EGFR_B)
Feature
AUC
MI
R2
P
wavelet-HLH_gldm_DependenceVariance
0.665
0.041
0.077
4.16E-05
custom_PatientSex
0.619
0.039
0.050
1.06E-03
logarithm_glcm_InverseVariance
0.627
0.035
0.047
1.52E-03
square_glcm_Correlation
0.633
0.032
0.040
3.50E-03
wavelet-HLL_firstorder_Kurtosis
0.599
0.037
0.031
1.07E-02
wavelet-LHL_glrlm_LongRunLowGrayLevelEmphasis
0.601
0.054
0.030
1.21E-02
wavelet-HLL_firstorder_Median
0.630
0.039
0.030
1.22E-02
original_glszm_SizeZoneNonUniformityNormalized
0.606
0.023
0.026
1.91E-02
exponential_firstorder_Skewness
0.676
0.020
0.018
4.88E-02
wavelet-LLH_glcm_ClusterShade
0.510
0.031
0.018
4.99E-02
AUC, area under the receiver operating characteristic curve; MI, mutual information, represents redundancy; P value is the significance of the minimum redundancy maximum relevance (mRMR).
Adding clinical factors to the regression models improved the performance of the model. In the training cohort, the AUC increased from 0.73 (95% CI: 0.66–0.79) to 0.78 (0.72–0.83) (DeLong’s P=0.136) with AIC selection criteria, and from 0.71 (0.65–0.77) to 0.77 (0.70–0.82) (P=0.110) with BIC (), respectively. In the test cohort, the AUC increased from 0.70 (0.56–0.82) to 0.76 (0.63–0.87) (P=0.114) and significantly increased from 0.65 (0.51–0.78) to 0.76 (0.62–0.84) (P=0.011), respectively. Subsequently, the accuracy in the test cohort increased to 75.5% (61.7–86.2%) and 73.6% (59.7–84.7%) with AIC and BIC selection criteria, respectively (). Among these methods, Model EGFR_B using BIC selection criteria exhibited a sensitivity of 92.9% (76.5–99.1%).
Figure 5
Diagnostic performance of the radiomic features in training and test dataset for EGFR mutation status (Model EGFR). (A) (I) ROC curve in the training dataset by AIC selection criteria; (II) ROC curve by BIC selection criteria; (III) Heatmap of the top 10 most relevant features. The training was performed using image features without clinical factors. (B) (I) ROC curve in the training dataset by AIC selection criteria; (II) ROC curve by BIC selection criteria; (III) Heatmap of the top 10 most relevant features. The training was performed using image features and clinical factors. (C) (I) ROC curve in the test dataset using radiomic features and clinical factors by AIC selection criteria; (II) ROC curve using radiomic features and clinical factors by BIC selection criteria. ROC, receiver operating characteristic; AIC, Akaike information criterion; BIC, Bayesian information criterion.
Table S4
Results of the logistic regression models to detect EGFR mutation in the test cohort
Label
Predicted label
Akaike information criterion
Bayesian information criterion
Radiomic model (Model EGFR_A)
Clinical information integrated mode (Model EGFR_B)
Radiomic model (Model EGFR_A)
Clinical information integrated mode (Model EGFR_B)
Mutant
Wildtype
Mutant
Wildtype
Mutant
Wildtype
Mutant
Wildtype
Actual label
Mutant
24
5
28
3
21
4
26
2
Wildtype
14
10
10
12
17
11
12
13
Sensitivity
82.8% (64.2–94.2%)
90.3% (74.3–98.0%)
84.0% (63.9–95.5%)
92.9% (76.5–99.1%)
Specificity
41.7% (22.1–63.4%)
54.6% (32.2–75.6%)
39.3% (21.5–59.4%)
52.0% (31.3–72.2%)
Accuracy
64.2% (49.8–76.9%)
75.5% (61.7–86.2%)
60.4% (46.0–73.6%)
73.6% (59.7–84.7%)
Diagnostic performance of the radiomic features in training and test dataset for EGFR mutation status (Model EGFR). (A) (I) ROC curve in the training dataset by AIC selection criteria; (II) ROC curve by BIC selection criteria; (III) Heatmap of the top 10 most relevant features. The training was performed using image features without clinical factors. (B) (I) ROC curve in the training dataset by AIC selection criteria; (II) ROC curve by BIC selection criteria; (III) Heatmap of the top 10 most relevant features. The training was performed using image features and clinical factors. (C) (I) ROC curve in the test dataset using radiomic features and clinical factors by AIC selection criteria; (II) ROC curve using radiomic features and clinical factors by BIC selection criteria. ROC, receiver operating characteristic; AIC, Akaike information criterion; BIC, Bayesian information criterion.
Classification of EGFR exon-19 deletion and exon-21 L858R mutation
For exon-19 deletion, 71 features were finally selected by mRMR and the top 10 most relevant features were all texture types (). Adding clinical factors changed the top 10 features to one clinical feature (age), three first-order, and six texture features (). When training Model Exon19, integration of clinical factors significantly improved AUC with AIC selection criteria (DeLong’s P=0.019) but not improved with BIC (P=0.835) (). Additionally, in both training and test cohort of Model Exon19, the AUC of AIC selected models was larger than that of BIC selected models. The highest accuracy among the two models in the test cohort was 73.6% (59.7–84.7%) with AIC selection criteria without the integration of clinical factors (). Model Exon19B using AIC selection criteria exhibited a specificity of 86.7% (69.3–96.2%).
Table S5
The top 10 most relevant features (not including clinical factors) for the detection of exon-19 deletion (Model Exon19_A)
AUC, area under the receiver operating characteristic curve; MI, mutual information, represents redundancy; P value is the significance of the minimum redundancy maximum relevance (mRMR).
Table S6
The top 10 most relevant features (including clinical factors) for the detection of exon-19 deletion (Model Exon19_B)
AUC, area under the receiver operating characteristic curve; MI, mutual information, represents redundancy; P value is the significance of the minimum redundancy maximum relevance (mRMR).
Figure S2
Diagnostic performance of the radiomic features in the training and test dataset for exon-19 deletion (Model Exon19). (A) (I) ROC curve in the training dataset by AIC selection criteria; (II) ROC curve by BIC selection criteria; (III) Heatmap of top 10 most relevant features. The training was performed using image features without clinical factors. (B) (I) ROC curve in the training dataset by AIC selection criteria; (II) ROC curve by BIC selection criteria; (III) Heatmap of top 10 most relevant features. The training was performed using image features and clinical factors. (C) (I) ROC curve in the test dataset using radiomic features and clinical factors by AIC selection criteria; (II) ROC curve using radiomic features and clinical factors by BIC selection criteria. ROC, receiver operating characteristic; AIC, Akaike information criterion; BIC, Bayesian information criterion.
Table S7
Results of the logistic regression models to detect exon-19 deletion in the test cohort
Label
Predicted label
Akaike information criterion
Bayesian information criterion
Radiomic model (Model Exon19_A)
Clinical information integrated mode (Model Exon19_B)
Radiomic model (Model Exon19_A)
Clinical information integrated mode (Model Exon19_B)
Mutant
Wildtype
Mutant
Wildtype
Mutant
Wildtype
Mutant
Wildtype
Actual label
Mutant
8
8
10
13
8
13
9
16
Wildtype
6
31
4
26
6
26
5
23
Sensitivity
50.0% (24.7–75.4%)
43.5% (23.2–65.5%)
38.1% (18.1–61.6%)
36.0% (18.0–57.5%)
Specificity
83.8% (68.0–93.8%)
86.7% (69.3–96.2%)
81.3% (63.6–92.8%)
82.1% (63.1–93.9%)
Accuracy
73.6% (59.7–84.7%)
67.9% (53.7–80.1%)
64.2% (49.8–76.9%)
60.4% (46.0–73.6%)
Regarding exon-21 L858R mutation, more features were selected. The top 10 most relevant features are shown in . When training Model Exon21, integration of clinical factors did not improve AUC with AIC selection criteria (P=0.413), but significantly improved with BIC (P=0.043) (). Using Model Exon21, the highest accuracy among the two models in the test cohort was 62.3% (47.9–75.2%) with AIC selection criteria with the integration of clinical factors (). Model Exon21B using AIC selection criteria exhibited a specificity of 70.4% (49.8–86.3%).
Table S8
The top 10 most relevant features (not including clinical factors) for the detection of exon-21 L858R mutation (Model Exon21_A)
Feature
AUC
MI
R2
P
logarithm_glcm_InverseVariance
0.675
0.061
0.097
4.09E-06
original_firstorder_Skewness
0.654
0.06
0.068
1.33E-04
wavelet-LHL_firstorder_Mean
0.643
0.064
0.052
8.75E-04
wavelet-HLL_firstorder_Median
0.666
0.033
0.045
2.02E-03
original_glszm_SizeZoneNonUniformityNormalized
0.614
0.032
0.040
3.34E-03
logarithm_ngtdm_Contrast
0.616
0.086
0.040
3.74E-03
wavelet-HHH_firstorder_Mean
0.601
0.063
0.036
5.71E-03
wavelet-LHL_glcm_ClusterShade
0.597
0.041
0.029
1.38E-02
wavelet-HHH_glszm_GrayLevelVariance
0.591
0.052
0.025
2.10E-02
wavelet-LLL_glszm_ZoneVariance
0.570
0.058
0.020
3.84E-02
AUC, area under the receiver operating characteristic curve; MI, mutual information, represents redundancy; P value is the significance of the minimum redundancy maximum relevance (mRMR).
Table S9
The top 10 most relevant features (including clinical factors) for the detection of exon-21 L858R mutation (Model Exon21_B)
Feature
AUC
MI
R2
P
logarithm_glcm_InverseVariance
0.675
0.061
0.097
4.09E-06
original_firstorder_Skewness
0.654
0.06
0.068
1.33E-04
wavelet-HLL_firstorder_Median
0.666
0.033
0.045
2.02E-03
original_glszm_SizeZoneNonUniformityNormalized
0.614
0.032
0.040
3.34E-03
logarithm_ngtdm_Contrast
0.616
0.086
0.040
3.74E-03
wavelet-HHH_firstorder_Mean
0.601
0.063
0.036
5.71E-03
wavelet-LHL_glcm_ClusterShade
0.597
0.041
0.029
1.38E-02
wavelet-HHH_glszm_GrayLevelVariance
0.591
0.052
0.025
2.10E-02
custom_PatientSex
0.579
0.017
0.023
2.63E-02
wavelet-LLL_glszm_ZoneVariance
0.570
0.058
0.020
3.84E-02
AUC, area under the receiver operating characteristic curve; MI, mutual information, represents redundancy; P value is the significance of the minimum redundancy maximum relevance (mRMR).
Figure S3
Diagnostic performance of the radiomic features in training and test dataset for exon-21 L858R mutation (Model Exon21). (A) (I) ROC curve in the training dataset by AIC selection criteria; (II) ROC curve by BIC selection criteria; (III) Heatmap of top 10 most relevant features. The training was performed using image features without clinical factors. (B) (I) ROC curve in the training dataset by AIC selection criteria; (II) ROC curve by BIC selection criteria; (III) Heatmap of top 10 most relevant features. The training was performed using image features and clinical factors. (C) (I) ROC curve in the test dataset using radiomic features and clinical factors by AIC selection criteria; (II) ROC curve using radiomic features and clinical factors by BIC selection criteria. ROC, receiver operating characteristic; AIC, Akaike information criterion; BIC, Bayesian information criterion.
Table S10
Results of the logistic regression models to detect exon-21 L858R mutation in the test cohort
Label
Predicted label
Akaike information criterion
Bayesian information criterion
Radiomics model (Model Exon21_A)
Clinical information integrated mode (Model Exon21_B)
Radiomics model (Model Exon21_A)
Clinical information integrated mode (Model Exon21_B)
Mutant
Wildtype
Mutant
Wildtype
Mutant
Wildtype
Mutant
Wildtype
Actual label
Mutant
13
14
14
12
13
12
13
12
Wildtype
9
17
8
19
9
19
9
19
Sensitivity
48.2% (28.7–68.1%)
53.9% (33.4–73.4%)
52.0% (31.3–72.2%)
52.0% (31.3–72.2%)
Specificity
65.4% (44.3–82.8%)
70.4% (49.8–86.3%)
67.9% (47.7–84.1%)
67.9% (47.7–84.1%)
Accuracy
56.6% (42.3–70.2%)
62.3% (47.9–75.2%)
60.4% (46.0–73.6%)
60.4% (46.0–73.6%)
Discussion
In this study, we established six logistic regression models based on radiomic features and clinical factors to identify and differentiate EGFR mutations. The highest sensitivity among the two models to identify EGFR mutation was 92.9%, indicating a very low false-negative rate. This suggests that a radiomics based model can be helpful to select EGFR mutation patients for further invasive procedures. The highest specificities among two exon-19 deletion classifying models and among two exon-21 L858R mutation models were 86.7% and 70.4%, respectively, indicating a low false-positive rate. Our study reveals the possibility to screen for EGFR mutation by radiomic features on CT images.In this study, a multivariate model combining radiomic features as well as patient characteristics improved the diagnostic performance to detect EGFR mutation in lung adenocarcinoma patients and reached an AUC of 0.78 and 0.76 in the training and test cohort, respectively. Liu et al. also showed that CT imaging features of adenocarcinoma combined with clinical variables could better classify EGFR mutation status than only using clinical variables (27). Yip et al. reported an AUC of 0.67 in predicting EGFR mutation status (19). Zhang et al. reported an AUC of 0.86 and 0.87 for the training and validation cohort, respectively, based on 140 patients, including 68 adenocarcinomas, 54 squamous cell carcinoma, and 18 others (28). Our results were based on 263 patients with histologically confirmed lung adenocarcinoma, which might be closer to a clinical setting, in view of the experience that the therapeutic effect of tyrosine kinase inhibitors is more significant in lung adenocarcinoma. Importantly, we found a sensitivity to detect EGFR mutation of 92.9%, which indicates a very low false-negative rate.In addition, we found that lung cancer was located predominantly in the upper lobes, especially on the right side (), that is consistent with the prior study (29). One well-accepted explanation is that hazardous particles deposit more readily in the upper lobes because of the gravity and ventilation, and they may persist longer there due to less relative ventilation or less efficient lymphatic clearance (30). Another reason might be the higher tissue PO2 levels in the upper lobes that help with tumor initiation, and neovascularization.The major strength of this study is a comprehensive analysis of the two most common activating EGFR mutations, exon-19 deletion, and exon-21 L858R mutation. Rosell et al. found that the probability of mutation sites of EGFR gene expression was higher in exon-19 than in exon-21 (31). Therapeutic effects of TKIs in EGFR mutation-positive lung cancer patients differ due to specific activating EGFR mutation (32-34). To the best of our knowledge, only a few studies investigated imaging factors associated with exon expression. Li et al. showed an AUC of approximately 0.79 for the detection of exon-19 deletion and exon-21 mutations using radiomics, but did not apply the model to an independent test cohort (35). Our results reveal that radiomic features have a high specificity to identify exon-19 deletion and exon-21 L858R mutation.3D segmentation algorithm was applied to extract a large number of candidate radiomic features for establishing the predicting model. Most radiomics-related studies only extracted relatively few features for analysis (35-38). In our study, 1,672 features covering three different categories and various filters and translations were extracted in the feature extraction stage, which provides a broad base for features involving in the predicting models. These features can maximize the potential information hidden behind the images, thus improving the capacity of established models.Radiomics has received extensive attention in recent years (21,22,24). The intrinsic association between image features and EGFR gene expression in lung adenocarcinoma could be explored further by data mining for diagnosis, prognosis, and clinical decision-making (36). Zhang et al. combined clinical data and 485 radiation features extracted from CT images to classify EGFR mutation status, finally reached an accuracy of 75.6% (28). There are some radiomics studies on lung adenocarcinoma that reported similar contributing features as our study. For the EGFR mutation detection models, the radiomic features entered the model all reflect texture-related information. Park et al. (39) and Jiang et al. (40) included glszm_ZoneEntropy feature, representing the tumor heterogeneity in the texture pattern, in the radiomic models to predict the subtype and spread through air space in lung adenocarcinoma patients. Hong et al. (41) and Sun et al. (42) showed that InverseVariance was associated with EGFR mutation and invasion in patients with advanced lung adenocarcinoma. In addition, skewness, mean and median that belong to the first-order feature type contributed to the radiomic models in predicting lesion invasion (42), ALK mutation (43), spread through air space (40) and EGFR mutation (41).There are limitations to this study. First, this evaluation was a retrospective study performed in a single center. Ideally, a multicenter prospective study would enhance the conclusion of this study, as well as test the effect of the implementation of such a radiomics-based model in clinical practice. Second, although we included 263 patients, increasing the sample size would further strengthen the accuracy of the algorithm model in this study. Third, one single algorithm was used for feature selection and building regression model, more algorithms could increase the robustness by horizontal comparison.
Conclusions
CT radiomics can sensitively identify the presence of EGFR mutation with a low false-negative rate, and increase the certainty of identification of EGFR exon-19 deletion and exon-21 L858R mutation in lung adenocarcinoma patients but the discrimination between both DNA aberrations show less specificity. CT radiomics may become a non-invasive alternative biomarker to select EGFR mutation patients for invasive sampling.Schematic diagram of the composition and classification of 1,672 radiomic features.Diagnostic performance of the radiomic features in the training and test dataset for exon-19 deletion (Model Exon19). (A) (I) ROC curve in the training dataset by AIC selection criteria; (II) ROC curve by BIC selection criteria; (III) Heatmap of top 10 most relevant features. The training was performed using image features without clinical factors. (B) (I) ROC curve in the training dataset by AIC selection criteria; (II) ROC curve by BIC selection criteria; (III) Heatmap of top 10 most relevant features. The training was performed using image features and clinical factors. (C) (I) ROC curve in the test dataset using radiomic features and clinical factors by AIC selection criteria; (II) ROC curve using radiomic features and clinical factors by BIC selection criteria. ROC, receiver operating characteristic; AIC, Akaike information criterion; BIC, Bayesian information criterion.Diagnostic performance of the radiomic features in training and test dataset for exon-21 L858R mutation (Model Exon21). (A) (I) ROC curve in the training dataset by AIC selection criteria; (II) ROC curve by BIC selection criteria; (III) Heatmap of top 10 most relevant features. The training was performed using image features without clinical factors. (B) (I) ROC curve in the training dataset by AIC selection criteria; (II) ROC curve by BIC selection criteria; (III) Heatmap of top 10 most relevant features. The training was performed using image features and clinical factors. (C) (I) ROC curve in the test dataset using radiomic features and clinical factors by AIC selection criteria; (II) ROC curve using radiomic features and clinical factors by BIC selection criteria. ROC, receiver operating characteristic; AIC, Akaike information criterion; BIC, Bayesian information criterion.AUC, area under the receiver operating characteristic curve; MI, mutual information, represents redundancy; P value is the significance of the minimum redundancy maximum relevance (mRMR).AUC, area under the receiver operating characteristic curve; MI, mutual information, represents redundancy; P value is the significance of the minimum redundancy maximum relevance (mRMR).AUC, area under the receiver operating characteristic curve; MI, mutual information, represents redundancy; P value is the significance of the minimum redundancy maximum relevance (mRMR).AUC, area under the receiver operating characteristic curve; MI, mutual information, represents redundancy; P value is the significance of the minimum redundancy maximum relevance (mRMR).AUC, area under the receiver operating characteristic curve; MI, mutual information, represents redundancy; P value is the significance of the minimum redundancy maximum relevance (mRMR).AUC, area under the receiver operating characteristic curve; MI, mutual information, represents redundancy; P value is the significance of the minimum redundancy maximum relevance (mRMR).The article’s supplementary files as
Authors: David M Jackman; Beow Y Yeap; Lecia V Sequist; Neal Lindeman; Alison J Holmes; Victoria A Joshi; Daphne W Bell; Mark S Huberman; Balazs Halmos; Michael S Rabin; Daniel A Haber; Thomas J Lynch; Matthew Meyerson; Bruce E Johnson; Pasi A Jänne Journal: Clin Cancer Res Date: 2006-07-01 Impact factor: 12.531
Authors: Nick Thatcher; Alex Chang; Purvish Parikh; José Rodrigues Pereira; Tudor Ciuleanu; Joachim von Pawel; Sumitra Thongprasert; Eng Huat Tan; Kristine Pemberton; Venice Archer; Kevin Carroll Journal: Lancet Date: 2005 Oct 29-Nov 4 Impact factor: 79.321
Authors: J Y Zhou; J Zheng; Z F Yu; W B Xiao; J Zhao; K Sun; B Wang; X Chen; L N Jiang; W Ding; J Y Zhou Journal: Eur Radiol Date: 2015-01-11 Impact factor: 5.315
Authors: Claudia Allemani; Tomohiro Matsuda; Veronica Di Carlo; Rhea Harewood; Melissa Matz; Maja Nikšić; Audrey Bonaventure; Mikhail Valkov; Christopher J Johnson; Jacques Estève; Olufemi J Ogunbiyi; Gulnar Azevedo E Silva; Wan-Qing Chen; Sultan Eser; Gerda Engholm; Charles A Stiller; Alain Monnereau; Ryan R Woods; Otto Visser; Gek Hsiang Lim; Joanne Aitken; Hannah K Weir; Michel P Coleman Journal: Lancet Date: 2018-01-31 Impact factor: 79.321
Authors: Rafael Rosell; Teresa Moran; Cristina Queralt; Rut Porta; Felipe Cardenal; Carlos Camps; Margarita Majem; Guillermo Lopez-Vivanco; Dolores Isla; Mariano Provencio; Amelia Insa; Bartomeu Massuti; Jose Luis Gonzalez-Larriba; Luis Paz-Ares; Isabel Bover; Rosario Garcia-Campelo; Miguel Angel Moreno; Silvia Catot; Christian Rolfo; Noemi Reguart; Ramon Palmero; José Miguel Sánchez; Roman Bastus; Clara Mayo; Jordi Bertran-Alamillo; Miguel Angel Molina; Jose Javier Sanchez; Miquel Taron Journal: N Engl J Med Date: 2009-08-19 Impact factor: 91.245
Authors: Erica L Carpenter; Despina Kontos; Bardia Yousefi; Michael J LaRiviere; Eric A Cohen; Thomas H Buckingham; Stephanie S Yee; Taylor A Black; Austin L Chien; Peter Noël; Wei-Ting Hwang; Sharyn I Katz; Charu Aggarwal; Jeffrey C Thompson Journal: Sci Rep Date: 2021-05-11 Impact factor: 4.379