| Literature DB >> 32454885 |
Jin Deng1, Weiming Zeng1, Yuhu Shi1, Wei Kong1, Shunjie Guo1.
Abstract
Extracting massive features from images to quantify tumors provides a new insight to solve the problem that tumor heterogeneity is difficult to assess quantitatively. However, quantification of tumors by single-mode methods often has defects such as difficulty in features extraction and high computational complexity. The multimodal approach has shown effective application prospects in solving these problems. In this paper, we propose a feature fusion method based on positron emission tomography (PET) images and clinical information, which is used to obtain features for lung metastasis prediction of soft tissue sarcomas (STSs). Random forest method was adopted to select effective features by eliminating irrelevant or redundant features, and then they were used for the prediction of the lung metastasis combined with back propagation (BP) neural network. The results show that the prediction ability of the proposed model using fusion features is better than that of the model using an image or clinical feature alone. Furthermore, a good performance can be obtained using 3 standard uptake value (SUV) features of PET image and 7 clinical features, and its average accuracy, sensitivity, and specificity on all the sets can reach 92%, 91%, and 92%, respectively. Therefore, the fusing features have the potential to predict lung metastasis for STSs.Entities:
Year: 2020 PMID: 32454885 PMCID: PMC7222598 DOI: 10.1155/2020/8153295
Source DB: PubMed Journal: Comput Math Methods Med ISSN: 1748-670X Impact factor: 2.238
Patient and tumor characteristics.
| Clinical parameters | |
|---|---|
| Age, years (mean ± SD) | 54.8 ± 16.0 |
| Gender, | |
| Male | 24 (47.1) |
| Female | 27 (52.9) |
| Histology, | |
| Malignant fibrous histiocytomas | 17 (33.3) |
| Liposarcoma | 11 (21.6) |
| Leiomyosarcoma | 10 (19.6) |
| Synovial sarcoma | 5 (9.8) |
| Extraskeletal bone sarcoma | 4 (7.8) |
| Fibrosarcoma | 1 (2.0) |
| Other | 3 (5.9) |
| Grade, | |
| High | 28 (54.9) |
| Intermediate | 15 (29.4) |
| Low | 5 (9.8) |
| Unknown | 3 (5.9) |
| Metastases, | |
| Lung | 19 (37.3) |
| Other | 5 (9.8) |
| None | 27 (52.9) |
| Time, days (mean ± SD) | |
| Diagnosis to outcome | 285.7 ± 252.3 |
| Diagnosis to last follow-up | 849 ± 447.4 |
| Status, | |
| No evidence of disease | 26 (51.0) |
| Alive with disease | 9 (17.6) |
| Dead | 15 (29.4) |
Note: SD: standard deviation; n: number; diagnosis to outcome: days elapsed between the date of diagnosis of primary STS (biopsy) and the date of diagnosis of recurrence or metastases; diagnosis to last follow-up: days elapsed between the date of diagnosis of primary STS (biopsy) and the date of last-follow-up (or death, if applicable).
SUV metrics features and Clinical features used in this study.
| Type | Name | Description |
|---|---|---|
| SUV metrics | SUVmax | Maximum SUV of the tumour region |
| SUVpeak | Average of the voxel with maximum SUV within the tumour region and its 26 connected neighbors | |
| SUVmean | Average SUV value of the tumour region | |
| aucCSH | Area under the curve of the cumulative SUV volume histogram describing the percentage of total tumour volume above a percentage threshold of maximum SUV | |
| PercentInactive | Percentage of the tumour region that is inactive. A threshold of 0.005 × (SUVmax)2 followed by closing and opening morphological operations were used to differentiate active and inactive regions on FDG-PET scans | |
|
| ||
| Textures | Global | Variance |
| Skewness | ||
| Kurtosis | ||
| GLCM | Energy | |
| Contrast | ||
| Entropy | ||
| Homogeneity | ||
| Correlation | ||
| SumAverage | ||
| Variance | ||
| Dissimilarity | ||
| AutoCorrelation | ||
|
| ||
| Textures | GLRLM | SRE (short run emphasis) |
| LRE (long run emphasis) | ||
| GLN (gray-level nonuniformity) | ||
| RLN (run-length nonuniformity) | ||
| RP (run percentage) | ||
| LGRE (low gray-level run emphasis) | ||
| HGRE (high gray-level run emphasis) | ||
| SRLGE (short run low gray-level emphasis) | ||
| SRHGE (short run high gray-level emphasis) | ||
| LRLGE (long run low gray-level emphasis) | ||
| LRHGE (long run high gray-level emphasis) | ||
| GLV (gray-level variance) | ||
| RLV (run-length variance) | ||
| GLSZM | SZE (small-zone emphasis) | |
| LZE (large-zone emphasis) | ||
| GLN (gray-level nonuniformity) | ||
| ZSN (zone-size nonuniformity) | ||
| ZP (zone percentage) | ||
| LGZE (low gray-level zone emphasis) | ||
| HGZE (high gray-level zone emphasis) | ||
| SZLGE (small-zone low gray-level emphasis) | ||
| SZHGE (small-zone high gray-level emphasis) | ||
| LZLGE (large-zone low gray-level emphasis) | ||
| LZHGE (large-zone high gray-level emphasis) | ||
| GLV (gray-level variance) | ||
| ZSV (zone-size variance) | ||
| NGTDM | Coarseness | |
| Contrast | ||
| Busyness | ||
| Complexity | ||
| Strength | ||
|
| ||
| Clinical | Age | Age |
| Sex | Male | |
| Female | ||
| Treatment | Radiotherapy + surgery + chemotherapy | |
| Radiotherapy + surgery | ||
| Surgery + chemotherapy | ||
| Status | Alive | |
| Alive with disease | ||
| Dead | ||
| Grade | High | |
| Intermediate | ||
| Low | ||
| MSKCC type | Liposarcoma | |
| Leiomyosarcoma | ||
| Synovial sarcoma | ||
| Malignant fibrous histiocytomas | ||
| Extraskeletal bone sarcoma | ||
| Fibrosarcoma | ||
| Other | ||
Figure 1The 25 significant features based on random forest and T test. The transverse axis represents different characteristics, the main longitudinal axis represents the ranking of the feature, and the secondary longitudinal axis represents the significance level of the feature.
Figure 2Neural network model. The numbers in the figure represent the number of neurons in each layer.
Figure 3(a) The performance of model including accuracy, sensitivity, and specificity from different data sets. (b) Best validation performance of the neural network.
Figure 4Performance comparison of feature selection. (a) represents the overall performance from the perspective of all data sets. (b–d) denote the performance from the three types of data sets including training, validation, and test set, respectively. Feature_selection represents 24 features selected by random forest and T test methods. Original denotes 67 features including 48 image features and 16 clinical features. Only Radiomics represents 48 image features, and Only Clinical denotes 16 clinical features.
Figure 5(a) The 10 top features of the contribution degree to prediction model. (b) Comparison of model performance based on different features.
Figure 6(a) Correlation between features and label. (b) Effect of lung metastasis on patient survival. The blue and red lines represent the survival probability of patients with LungMets and NoLungMets, respectively. The x-axis shows survival time of the patients, and the y-axis shows the survival probability of patients.