| Literature DB >> 34300667 |
Mohamed Shehata1, Ahmed Alksas1, Rasha T Abouelkheir2, Ahmed Elmahdy2, Ahmed Shaffie1, Ahmed Soliman1, Mohammed Ghazal3, Hadil Abu Khalifeh3, Reem Salim3, Ahmed Abdel Khalek Abdel Razek4, Norah Saleh Alghamdi5, Ayman El-Baz1.
Abstract
Renal cell carcinoma (RCC) is the most common and a highly aggressive type of malignant renal tumor. In this manuscript, we aim to identify and integrate the optimal discriminating morphological, textural, and functional features that best describe the malignancy status of a given renal tumor. The integrated discriminating features may lead to the development of a novel comprehensive renal cancer computer-assisted diagnosis (RC-CAD) system with the ability to discriminate between benign and malignant renal tumors and specify the malignancy subtypes for optimal medical management. Informed consent was obtained from a total of 140 biopsy-proven patients to participate in the study (male = 72 and female = 68, age range = 15 to 87 years). There were 70 patients who had RCC (40 clear cell RCC (ccRCC), 30 nonclear cell RCC (nccRCC)), while the other 70 had benign angiomyolipoma tumors. Contrast-enhanced computed tomography (CE-CT) images were acquired, and renal tumors were segmented for all patients to allow the extraction of discriminating imaging features. The RC-CAD system incorporates the following major steps: (i) applying a new parametric spherical harmonic technique to estimate the morphological features, (ii) modeling a novel angular invariant gray-level co-occurrence matrix to estimate the textural features, and (iii) constructing wash-in/wash-out slopes to estimate the functional features by quantifying enhancement variations across different CE-CT phases. These features were subsequently combined and processed using a two-stage multilayer perceptron artificial neural network (MLP-ANN) classifier to classify the renal tumor as benign or malignant and identify the malignancy subtype as well. Using the combined features and a leave-one-subject-out cross-validation approach, the developed RC-CAD system achieved a sensitivity of 95.3%±2.0%, a specificity of 99.9%±0.4%, and Dice similarity coefficient of 0.98±0.01 in differentiating malignant from benign tumors, as well as an overall accuracy of 89.6%±5.0% in discriminating ccRCC from nccRCC. The diagnostic abilities of the developed RC-CAD system were further validated using a randomly stratified 10-fold cross-validation approach. The obtained results using the proposed MLP-ANN classification model outperformed other machine learning classifiers (e.g., support vector machine, random forests, relational functional gradient boosting, etc.). Hence, integrating morphological, textural, and functional features enhances the diagnostic performance, making the proposal a reliable noninvasive diagnostic tool for renal tumors.Entities:
Keywords: CE-CT; RC-CAD; functionality; morphology; renal cell carcinoma; texture
Mesh:
Year: 2021 PMID: 34300667 PMCID: PMC8309718 DOI: 10.3390/s21144928
Source DB: PubMed Journal: Sensors (Basel) ISSN: 1424-8220 Impact factor: 3.576
Figure 1The proposed renal cancer computer-assisted diagnosis (RC-CAD) system.
Figure 2Visualization of the segmentation process to obtain 3D renal tumors.
Figure 3Visualizing 3D surface complexity differences between different renal tumors (benign are shown in blue, while malignant are shown in red).
Figure 4Renal tumors’ reconstruction meshes showing the morphological differences among malignant ccRCC, malignant nccRCC, and benign AML tumors.
Figure 5An illustrative example showing differences in texture between various renal tumor types.
Figure 6A visualization of the average normalized histogram curves for all benign subjects (blue) vs. malignant (red).
Figure 7Visualization of the rotation-invariant neighborhood calculation system used to construct the grey-level co-occurrence matrix (GLCM). The GLCM can be constructed by counting the occurrence frequency of different grey-level pairs in-plane and in adjacent planes accounting for the 26-neighbor voxels (blue) of the central voxel (red).
Definition of first- and second-order textural features.
| Textural Feature | Definition |
|---|---|
|
| |
| Mean | The average grey value of voxels within the tumor. |
| Variance | Second central moment of gray values. |
| Standard deviation | Square root of variance. |
| Skewness (Skew) | Asymmetry of the distribution of gray values about the mean. If Skew < 0, that means the grey level spreads out more to the left of the mean than to the right, and if Skew > 0, that means the grey level spreads out more to the right of the mean than to the left. Skew will equal zero in the case of normal distributions. |
| Kurtosis (Kurt) | Measures the tail weight, or tendency to extreme values, of the object grey-level distribution. The normal distribution has Kurt = 3; distributions with heavier tails have Kurt > 3; distributions with less weight in the tails have Kurt < 3. |
| Entropy | A measure of randomness of grey values within an input image. |
| CDFs | A distribution function that accumulates voxel-wise grey values from the whole tumor object with minimum value = 0 and maximum value = 1. |
| Percentiles | Grey values percentiles corresponding to the CDFs (from 10% to 100%) |
|
| |
| Contrast | Measures the disparity in grey-level values between neighbors. |
| Dissimilarity | Finds to what extent voxels are different from their neighbors. |
| Homogeneity | Expresses the inverse difference moment among neighbors. |
| Angular second moment (ASM) | Determines the gray levels’ local uniformity (orderliness). |
| Energy | The square root of the ASM. |
| Correlation | Determines the grey-level linear dependency in neighborhood blocks. |
Texture features formulas.
| Feature | Formula |
|---|---|
|
| |
| Mean ( |
|
| Variance ( |
|
| Entropy (Ent) |
|
| Skewness (Skew) |
|
| Kurtosis (Kurt) |
|
|
| |
| Contrast |
|
| Dissimilarity |
|
| Homogeneity |
|
| ASM |
|
| Energy |
|
| Correlation |
|
Figure 8Example of the wash-in and wash-out slopes construction process for various types of renal tumors. When compared to nccRCC (green) and AML (blue), ccRCC tumors exhibit higher and faster wash-in/-out slopes (red).
Details of the extracted feature sets used in the two-stage renal tumor classification.
|
| |
| Feature Set 1: First-order (histogram features) | 6 features |
| Feature Set 2: First-order (percentiles) | 10 features |
| Feature Set 3: Second-order (GLCM) | 6 features |
|
| |
| Feature Set 4: Spherical harmonic reconstruction errors | 70 features |
|
| |
| Feature Set 5: Wash-in/out slopes | 2 features |
|
| |
| Feature Sets 1, 2, 3, 4, and 5 | 94 features |
Diagnostic performance results of the first stage classification (RCC vs. AML) using different individual feature sets along with multilayer perceptron artificial neural network (MLP-ANN) classification models. The RC-CAD system diagnostic performance using the combined features outperformed the diagnostic abilities using individual feature sets. Sens: sensitivity, Spec: specificity, DSC: Dice coefficient of similarity, : size of hidden layer n.
| RCC vs. AML Classification Performance (Mean ± SD ≈) | ||||
|---|---|---|---|---|
|
|
|
|
|
|
| Set 1 | 94.1 ± 1.5 | 97.9 ± 1.5 | 0.96 ± 0.01 | hl |
| Set 2 | 92.4 ± 2.9 | 95.1 ± 3.5 | 0.94 ± 0.02 | hl |
| Set 3 | 94.9 ± 2.2 | 95.3 ± 2.5 | 0.95 ± 0.02 | hl |
| Set 4 | 92.0 ± 2.4 | 96.6 ± 2.0 | 0.94 ± 0.02 | hl |
| Set 5 | 82.7 ± 4.1 | 91.7 ± 2.0 | 0.87 ± 0.02 | hl |
|
|
|
|
|
|
Hyperparameters: MLP-ANN (optimization function: trainlm, max epochs = 500, goal = 0, max validation failure = 6, min gradient = , training gain (): initial = 0.001, decrease factor = 0.1, increase factor = 10, max = 1e).
Results from the second stage classification (ccRCC vs. nccRCC) using individual feature sets (1, 2, 3, 4, and 5) along with the multilayer perceptron artificial neural network (MLP-ANN) classification models. The RC-CAD system diagnostic performance using the combined features outperformed the diagnostic abilities using individual feature sets. Acc: accuracy, hl: size of hidden layer n.
| ccRCC vs. nccRCC Classification Performance (Mean ± SD ≈) | ||
|---|---|---|
|
|
|
|
| Set 1 | 76.8 ± 2.6 | hl |
| Set 2 | 75.7 ± 3.8 | hl |
| Set 3 | 83.3 ± 5.6 | hl |
| Set 4 | 81.4 ± 5.1 | hl |
| Set 5 | 76.2 ± 2.33 | hl |
|
|
|
|
Hyperparameters: MLP-ANN (optimization function: trainlm, max epochs = 500, goal = 0, max validation failure = 6, min gradient = , training gain (): initial = 0.001, decrease factor = 0.1, increase factor = 10, max = 1e).
Figure 9A difficult case presentation showing the textural differences, wash-in and wash-out slope differences, and shape differences between two ccRCC, two nccRCC, and two AML renal tumors.
Diagnostic performance comparison for both classification stages between the developed RC-CAD system and other classification approaches (e.g., random forest (RF) and support vector machine (SVM)). Using leave-one-subject-out (LOSO) and a randomly stratified 10-fold cross-validation approach, the diagnostic abilities of the RC-CAD outperformed the others. Let Sens: sensitivity, Spec: Specificity, DSC: Dice similarity coefficient, and Acc: Accuracy.
|
| ||||
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| |
| RFs | LOSO | 89.0 ± 1.7 | 92.7 ± 2.7 | 0.91 ± 0.02 |
| 10-fold | 88.4 ± 1.0 | 90.7 ± 3.0 | 0.89 ± 0.01 | |
| SVM | LOSO | 82.9 ± 0.0 | 88.6 ± 0.0 | 0.85 ± 0.00 |
| 10-fold | 81.9 ± 2.2 | 87.7 ± 2.5 | 0.84 ± 0.02 | |
|
| ||||
|
|
|
| ||
|
|
|
| ||
|
|
| |||
| RFs | LOSO | 53.7 ± 3.7 | ||
| 10-fold | 51.9 ± 2.6 | |||
| SVM | LOSO | 52.9 ± 0.0 | ||
| 10-fold | 54.3 ± 3.0 | |||
Hyperparameters: MLP-ANN (optimization function: trainlm, max epochs = 500, hidden layers: hl = 50 nodes, hl = 25 nodes, goal = 0, max validation failure = 6, min gradient = , training gain (): initial = 0.001, decrease factor = 0.1, increase factor = 10, max = 1e); RF (method: Bag, number of learning cycles = 30); SVM (kernel function: quadratic, box constraint = 1).
Diagnostic performance comparison for both classification stages between the developed RC-CAD system and the state-of-the-art approaches by [27,32,37]. The diagnostic abilities of the RC-CAD outperformed all other methods in both classification stages. Let Sens: sensitivity, Spec: Specificity, DSC: Dice similarity coefficient, and Acc: Accuracy.
|
| ||||
|
|
|
|
| |
|
|
|
|
| |
| Kunapuli [ | 81.4 ± 0.0 | 95.7 ± 0.0 | 0.88 ± 0.00 | |
| Oberai [ | 88.9 ± 1.7 | 87.4 ± 1.4 | 0.91 ± 0.01 | |
| Lee [ | AlexNet | 84.0 ± 1.7 | 93.4 ± 1.9 | 0.88 ± 0.02 |
| GoogleNet | 88.3 ± 1.7 | 95.1 ± 1.9 | 0.91 ± 0.01 | |
| ResNet | 88.0 ± 3.5 | 95.7 ± 0.9 | 0.91 ± 0.02 | |
| VGGNet | 86.9 ± 0.6 | 91.4 ± 2.4 | 0.89 ± 0.01 | |
|
| ||||
|
|
|
|
| |
|
|
|
|
| |
| Kunapuli [ | 60.6 ± 2.7 | 28 ± 1 | 15 ± 1 | |
| Oberai [ | 84.3 ± 3.1 | 34 ± 1 | 25 ± 2 | |
| Lee [ | AlexNet | 71.7 ± 1.9 | 31 ± 2 | 19 ± 2 |
| GoogleNet | 68.0 ± 1.5 | 32 ± 1 | 15 ± 1 | |
| ResNet | 70.3 ± 2.5 | 32 ± 0 | 17 ± 2 | |
| VGGNet | 72.6 ± 2.3 | 33 ± 1 | 18 ± 1 | |
Hyperparameters: MLP-ANN (optimization function: trainlm, max epochs = 500, hidden layers: hl = 50 nodes, hl = 25 nodes, goal = 0, max validation failure = 6, min gradient = , training gain (): initial = 0.001, decrease factor = 0.1, increase factor = 10, max = 1e).