| Literature DB >> 31263186 |
Hina Shakir1,2, Yiming Deng3, Haroon Rasheed2, Tariq Mairaj Rasool Khan4.
Abstract
Radiomic features based classifiers and neural networks have shown promising results in tumor classification. The classification performance can be further improved greatly by exploring and incorporating the discriminative features towards cancer into mathematical models. In this research work, we have developed two radiomics driven likelihood models in Computed Tomography(CT) images to classify lung, colon, head and neck cancer. Initially, two diagnostic radiomic signatures were derived by extracting 105 3-D features from 200 lung nodules and by selecting the features with higher average scores from several supervised as well as unsupervised feature ranking algorithms. The signatures obtained from both the ranking approaches were integrated into two mathematical likelihood functions for tumor classification. Validation of the likelihood functions was performed on 265 public data sets of lung, colon, head and neck cancer with high classification rate. The achieved results show robustness of the models and suggest that diagnostic mathematical functions using general tumor phenotype can be successfully developed for cancer diagnosis.Entities:
Mesh:
Year: 2019 PMID: 31263186 PMCID: PMC6603029 DOI: 10.1038/s41598-019-45053-x
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Figure 1Work flow of the proposed method for nodule classification.
Distribution summary of employed databases, nodules sizes and their classes.
| Training database (no. of sets) | Validation Database (no. of sets) | Min. Diameter (mm) | Median Diameter (mm) | Max. Diameter (mm) | |
|---|---|---|---|---|---|
| Malignant Nodules | Lung1(165), RIDER(10) | Lung1(155), CTC(30), HNSCC(35), LIDC(10) | 7.015665 | 66.06505 | 215.9035 |
| Benign Nodules | LUNGx(35) | LIDC(35) | 5.738865 | 41.58367 | 221.052 |
Description of computed radiomic features.
| Features Class | No. of computed features (n = 105) |
|---|---|
| Shape | 13 |
| Gray level Difference Method (GLDM) | 14 |
| Gray-Level Co-Occurrence Matrix (GLCM) | 23 |
| Neighborhood Gray-Tone Difference Matrix (NGTDM) | 5 |
| First order statistics | 18 |
| Gray Level Size Zone Matrix (GLSZM) | 16 |
| Gray Level Run Length Matrix (GLRLM) | 16 |
Figure 2Distribution of top 25 ranked radiomic features with respect to the feature classes.
Nodule classification results.
| Patient | Test = Positive | Test = Negative |
|---|---|---|
|
| ||
| Cancer | 155(TP) | 10(FN) |
| No Cancer | 7(FP) | 28(TN) |
|
| ||
| Cancer | 161(TP) | 4(FN) |
| No Cancer | 2(FP) | 33(TN) |
Figure 3ROC Curves for proposed likelihood functions MLF I and MLF II.
Figure 4Visualization of (a) stable and reliable radiomics features using PCA transformation in image space (b) radiomic signature (surface volume ratio, sum entropy) from MLF II in image space.
Figure 5Quantitative classification results of MLF I and II for malignant and benign tumors from LIDC, CTC and HNSCC databases.
Comparison of performance metrics of MLF I and MLF II with other state of the art classification models.
| Prediction Model | # of Data sets (Tumor site, Database) | Accuracy | Sensitivity | Specificity | AUC | |
|---|---|---|---|---|---|---|
| Wu | Random forest classifier | 152(Lung, LIDC) | 55.0% | 80.0% | 72.0% | — |
| Chen | SFS, SVM | 75(Lung, LIDC) | 84.0% | 92.85% | 72.73% | — |
| Choi | SVM-LASSO | 72(Lung, LIDC) | 84.6% | 87.2% | 81.2% | 89% |
| Liu | Multi-view convolutional neural networks | 172(Lung, LIDC) | 94.59% | — | — | 98.1% |
| Kumar | Deep convolutional neural network | 97(Lung, LIDC) | 75.1% | 83.35% | 61.0% | — |
| Pallamar | Linear Discriminant analysis, k nearest neighbor | 27(Head & Neck, Private) | 81.48% 1.5T 92.59% 3T | — | — | — |
| Huang | Gene expression | 462(Colon, Private) | — | — | — | — |
| Proposed MLF I | Curve fitting using non-linear regression | 200(Lung, LIDC & Lung1) 35(Colon, CTC) 30(Head & Neck, HNSCC) | 91.5% 74.28% 83.33% | 95.68% | 73.68% | 92.68% |
| Proposed MLF II | Curve fitting using non-linear regression | 200(Lung, LIDC & Lung1) 35(Colon, CTC) 30(Head & Neck, HNSCC) | 97% 85.71% 90% | 98.77% | 89.19% | 98.81% |