| Literature DB >> 34359623 |
Xiaoyang Liu1,2,3, Farhad Maleki4, Nikesh Muthukrishnan4, Katie Ovens4, Shao Hui Huang1,5, Almudena Pérez-Lara6, Griselda Romero-Sanchez6, Sahir Rai Bhatnagar4,7, Avishek Chatterjee8, Marc Philippe Pusztaszeri9, Alan Spatz9, Gerald Batist6, Seyedmehdi Payabvash10, Stefan P Haider10, Amit Mahajan10, Caroline Reinhold4, Behzad Forghani4,6, Brian O'Sullivan1,5, Eugene Yu1,3, Reza Forghani4,6.
Abstract
Current radiomic studies of head and neck squamous cell carcinomas (HNSCC) are typically based on datasets combining tumors from different locations, assuming that the radiomic features are similar based on histopathologic characteristics. However, molecular pathogenesis and treatment in HNSCC substantially vary across different tumor sites. It is not known if a statistical difference exists between radiomic features from different tumor sites and how they affect machine learning model performance in endpoint prediction. To answer these questions, we extracted radiomic features from contrast-enhanced neck computed tomography scans (CTs) of 605 patients with HNSCC originating from the oral cavity, oropharynx, and hypopharynx/larynx. The difference in radiomic features of tumors from these sites was assessed using statistical analyses and Random Forest classifiers on the radiomic features with 10-fold cross-validation to predict tumor sites, nodal metastasis, and HPV status. We found statistically significant differences (p-value ≤ 0.05) between the radiomic features of HNSCC depending on tumor location. We also observed that differences in quantitative features among HNSCC from different locations impact the performance of machine learning models. This suggests that radiomic features may reveal biologic heterogeneity complementary to current gold standard histopathologic evaluation. We recommend considering tumor site in radiomic studies of HNSCC.Entities:
Keywords: classification; head and neck squamous cell carcinomas; human papilloma virus; machine learning; metastasis; radiomics
Year: 2021 PMID: 34359623 PMCID: PMC8345201 DOI: 10.3390/cancers13153723
Source DB: PubMed Journal: Cancers (Basel) ISSN: 2072-6694 Impact factor: 6.639
Patient demographics and class distribution of lymph node metastasis (LN) and human papilloma virus (HPV) status across tumor sites—oral cavity (OC), oropharynx (OP), and larynx or hypopharynx (LHP). Note that for LN and HPV status, there are missing values for some patients.
| OC | OP | LHP | Total | |
|---|---|---|---|---|
| Number of cases | 164 | 200 | 241 | 605 |
| Age | 64 (24–90) | 61 (33–87) | 65 (27–88) | 64 (24–90) |
| Sex (Male:Female) | 111:53 | 161:39 | 201:40 | 473:132 |
| LN (+/−) | 93:70 | 175:25 | 78:163 | 346:258 |
| HPV (+/−) | 0:9 | 134:63 | 11:55 | 145:127 |
Figure 1Examples of tumor segmentation using manually placed ROI contours in three different primary tumor locations: larynx/hypopharynx (left), oropharynx (middle), and oral cavity (right).
Figure 2Panel (A) illustrates the model building process using samples from a site. Although panel A shows this for the LHP (larynx or hypopharynx) site, this process is also applicable to OC (oral cavity) or OP (oropharynx). Panel (B) illustrates the model building process using samples combined from all sites.
Conditional probabilities of lymph node metastasis given the tumor site (LHP: larynx or hypopharynx; OC: oral cavity); OP: oropharynx).
| Sites | Nonmetastatic | Metastatic |
|---|---|---|
| LHP | 0.676 | 0.324 |
| OC | 0.429 | 0.571 |
| OP | 0.125 | 0.875 |
Conditional probability of HPV (human papilloma virus) status given the tumor site (LHP: larynx or hypopharynx; OC: oral cavity); OP: oropharynx).
| Sites | Negative | Positive |
|---|---|---|
| LHP | 0.833 | 0.167 |
| OC | 1.00 | 0.00 |
| OP | 0.320 | 0.680 |
Figure 3Box plots illustrating the distributions of quantitative features based on a gray-level intensity histogram including the mean, standard deviation, mean of positive pixels, entropy, skewness, and kurtosis. Each box plot represents the distributions of one feature for a different spatial scale filter (SSF 0, 2, 3, 4, 5, 6). LHP: larynx or hypopharynx; OC: oral cavity); OP: oropharynx.
Figure 4Clustering of texture features from three different anatomical locations (LHP: larynx or hypopharynx; OC: oral cavity); OP: oropharynx) using t-SNE with a perplexity value of 30.
Adjusted p-values for post hoc analysis of variance (ANOVA) among HNSCCs from three different sites.
| Feature | Adjusted |
|---|---|
| SSF0_entropy | 1.36 × 10−6 |
| SSF0_kurtosis | 4.02 × 10−2 |
| SSF0_mean | 7.99 × 10−11 |
| SSF0_mpp | 2.92 × 10−11 |
| SSF0_sd | 1.06 × 10−3 |
| SSF0_skewness | 1.97 × 10−4 |
| SSF2_entropy | 2.52 × 10−1 |
| SSF2_kurtosis | 3.01 × 10−14 |
| SSF2_mean | 2.64 × 10−14 |
| SSF2_mpp | 2.55 × 10−21 |
| SSF2_sd | 2.04 × 10−23 |
| SSF2_skewness | 9.41 × 10−23 |
| SSF3_entropy | 5.65 × 10−2 |
| SSF3_kurtosis | 2.57 × 10−23 |
| SSF3_mean | 8.38 × 10−12 |
| SSF3_mpp | 5.76 × 10−18 |
| SSF3_sd | 3.65 × 10−16 |
| SSF3_skewness | 6.04 × 10−33 |
| SSF4_entropy | 1.97 × 10−3 |
| SSF4_kurtosis | 6.90 × 10−19 |
| SSF4_mean | 4.38 × 10−9 |
| SSF4_mpp | 5.65 × 10−9 |
| SSF4_sd | 3.10 × 10−13 |
| SSF4_skewness | 1.30 × 10−29 |
| SSF5_entropy | 3.28 × 10−6 |
| SSF5_kurtosis | 9.52 × 10−13 |
| SSF5_mean | 1.08 × 10−08 |
| SSF5_mpp | 9.34 × 10−02 |
| SSF5_sd | 1.07 × 10−13 |
| SSF5_skewness | 8.98 × 10−23 |
| SSF6_entropy | 1.01 × 10−9 |
| SSF6_kurtosis | 1.17 × 10−6 |
| SSF6_mean | 9.33 × 10−10 |
| SSF6_mpp | 1.00 |
| SSF6_sd | 4.04 × 10−14 |
| SSF6_skewness | 1.15 × 10−12 |
The results of Wilcoxon rank-sum tests for comparing the performance of RF (Random Forest) models when all tumor sites were combined and used for model development (scenario 2) versus when only samples from one tumor site were used for model development (scenario 1). LN-LHP: lymph node metastasis prediction using features from larynx or hypopharynx sites; LN-OC: lymph node metastasis prediction using features from oropharynx site; HPV-OP: human papilloma virus status prediction using features from oropharynx site).
| Endpoint-Site | Statistics | Accuracy | Precision | Recall | F1 | AUC |
|---|---|---|---|---|---|---|
| statistic | 8.34 | 12.22 | 10.17 | 12.09 | 9.71 | |
| <0.001 | <0.001 | <0.001 | <0.001 | <0.001 | ||
| statistic | 11.54 | 9.96 | 7.13 | 8.40 | 11.97 | |
| <0.001 | <0.001 | <0.001 | <0.001 | <0.001 | ||
| statistic | 11.12 | 0.32 | 7.73 | 4.06 | 11.67 | |
| <0.001 | <0.001 | <0.001 | <0.001 | <0.001 |