| Literature DB >> 34162893 |
Ahmed Alksas1, Mohamed Shehata1, Gehad A Saleh2, Ahmed Shaffie1, Ahmed Soliman1, Mohammed Ghazal3, Adel Khelifi4, Hadil Abu Khalifeh3, Ahmed Abdel Razek2, Guruprasad A Giridharan1, Ayman El-Baz5.
Abstract
Liver cancer is a major cause of morbidity and mortality in the world. The primary goals of this manuscript are the identification of novel imaging markers (morphological, functional, and anatomical/textural), and development of a computer-aided diagnostic (CAD) system to accurately detect and grade liver tumors non-invasively. A total of 95 patients with liver tumors (M = 65, F = 30, age range = 34-82 years) were enrolled in the study after consents were obtained. 38 patients had benign tumors (LR1 = 19 and LR2 = 19), 19 patients had intermediate tumors (LR3), and 38 patients had hepatocellular carcinoma (HCC) malignant tumors (LR4 = 19 and LR5 = 19). A multi-phase contrast-enhanced magnetic resonance imaging (CE-MRI) was collected to extract the imaging markers. A comprehensive CAD system was developed, which includes the following main steps: i) estimation of morphological markers using a new parametric spherical harmonic model, ii) estimation of textural markers using a novel rotation invariant gray-level co-occurrence matrix (GLCM) and gray-level run-length matrix (GLRLM) models, and iii) calculation of the functional markers by estimating the wash-in/wash-out slopes, which enable quantification of the enhancement characteristics across different CE-MR phases. These markers were subsequently processed using a two-stages random forest-based classifier to classify the liver tumor as benign, intermediate, or malignant and determine the corresponding grade (LR1, LR2, LR3, LR4, or LR5). The overall CAD system using all the identified imaging markers achieved a sensitivity of 91.8%±0.9%, specificity of 91.2%±1.9%, and F[Formula: see text] score of 0.91±0.01, using the leave-one-subject-out (LOSO) cross-validation approach. Importantly, the CAD system achieved overall accuracies of [Formula: see text], 85%±2%, 78%±3%, 83%±4%, and 79%±3% in grading liver tumors into LR1, LR2, LR3, LR4, and LR5, respectively. In addition to LOSO, the developed CAD system was tested using randomly stratified 10-fold and 5-fold cross-validation approaches. Alternative classification algorithms, including support vector machine, naive Bayes classifier, k-nearest neighbors, and linear discriminant analysis all produced inferior results compared to the proposed two stage random forest classification model. These experiments demonstrate the feasibility of the proposed CAD system as a novel tool to objectively assess liver tumors based on the new comprehensive imaging markers. The identified imaging markers and CAD system can be used as a non-invasive diagnostic tool for early and accurate detection and grading of liver cancer.Entities:
Mesh:
Year: 2021 PMID: 34162893 PMCID: PMC8222341 DOI: 10.1038/s41598-021-91634-0
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Figure 1The proposed computer-aided diagnosis (CAD) system for detecting and grading liver cancer tumors.
Acquisition parameters of CE-MRI sequences.
| CE-MRI Acquisition Parameters | ||||||
|---|---|---|---|---|---|---|
| TR (ms) | TE (ms) | FOV (mm) | Slice size (pixels) | Slice thickness (mm) | Slice gap (mm) | Flip angle |
| 7.3 | 3.1 | 500 | 256 | 3 | 1 | |
TR: repetition time; TE: echo time; FOV: field of view.
Figure 2Liver tumors segmentation and 3D objects construction.
Figure 33D visualization of the surface complexity differences between benign tumors (blue) and malignant ones (red).
Figure 4Morphology approximation of two benign (LR1-2), intermediate (LR3), and two malignant (LR4-5) tumors.
Figure 5Texture differences visualization between three different tumors, (benign, intermediate, and malignant) at the four different phases.
Figure 6Probability density function visualization of the normalized gray-level intensity histogram for three different liver tumors: benign (blue), intermediate (orange), and malignant (green).
First and second order textural markers.
| Textural marker | Definition |
|---|---|
| First order | |
| Mean ( | Represents the gray-level values balance point of each object. It is calculated simply by getting the average gray-level value for each object. |
| Variance | Describes the gray-level distribution around our computed Mean. |
| Skewness | Expresses how the gray-level values are asymmetrically distributed around the Mean of the object. |
| Kurtosis | Measures to what extent the gray-level values are concentrated towards the tails of the distribution. |
| Entropy | Expresses the amount of randomness within each structure gray-level values. |
| CDFs | Return the cumulative distribution function of the histogram density values. This is calculated along the whole object and getting the cumulative sum of the gray-level values (Normalized to [0 to 1] at multiple positions (from 0 to 100% of the object with a 10% step). |
| Percentiles | Calculate the percentiles of gray-level values for the corresponding CDFs. |
| Second order | |
| Contrast | Measures the disparity in gray-level values between neighbors. |
| Dissimilarity | Finds to what extent voxels are different from their neighbors. |
| Homogeneity | Expresses the inverse difference moment among neighbors. |
| Angular second moment (ASM) | Determines the gray-levels local uniformity (orderliness). |
| Energy | The square root of the ASM. |
| Correlation | Determines the gray-level linear dependency between center voxel and its neighbors. |
| Gray-level non-uniformity (GLN) | Describes the dissimilarity of gray-level values within the object. |
| High gray-level run emphasis (HGLRE) | Measures the concentration of high gray-level values in the structure. |
| Long run emphasis (LRE) | Determines how long run lengths are distributed in the object indicating the coarseness of the texture. |
| Long run high gray-level emphasis (LRHGLE) | Measures how long runs of high gray-level values are distributed in the object. |
| Long run low gray-level emphasis (LRLGLE) | Measures how long runs of low gray-level values are distributed in the object. |
| Low gray-level run emphasis (LGLRE) | Measures the concentration of low gray-level values in the structure. |
| Run entropy (RE) | Indicates the amount of randomness in gray-level runs in the structure. |
| Run length non-uniformity (RLN) | Expresses the inhomogeneity among run lengths in the object. |
| Run percentage (RP) | Is calculated by the division of the overall count of runs by the total number of pixels. |
| Short run emphasis (SRE) | Measures the concentration of short run lengths in the object indicating how fine the texture is. |
| Short run high gray-level emphasis (SRHGLE) | Measures the concentration of high gray-level values short runs in the object. |
| Short run low gray-level emphasis (SRLGLE) | Measures the concentration of low gray-level values short runs in the object. |
Figure 7Visualization of rotation invariant neighborhood system of the center voxel (blue) to construct GLCM.
Figure 8Typical wash-in/out slopes for three different tumors, (benign shown in blue, intermediate in orange, and malignant in green).
Illustration of different categories and their associated number of extracted markers for each subject.
| Morphological markers | |
|---|---|
| Spherical Harmonics | 70 markers |
| Textural markers | |
| First order (Histogram markers) | 104 markers (26/phase) |
| Second order (GLCM) | 24 markers (6/phase) |
| Second order (GLRLM) | 48 markers (12/phase) |
| Functional markers | |
| Wash-In/Out | 3 markers |
| Integrated markers | |
| Combined | 249 markers |
Comparison of the first stage classification performance using the individual markers namely, SHs morphological markers, First order textural markers, second order GLCM textural markers, second order GLRLM textural markers, and wash-in/out slopes functional markers of the developed CAD system: Benign (LR1-2) vs. Intermediate (LR3) vs. Malignant (LR4-5) using RFs classifier. Note that: Sens and Spec denote Sensitivity and Specificity, respectively.
| Markers | Sens% | Spec% | |
|---|---|---|---|
| Spherical Harmonics | 73.39±3.13 | 84.43±1.98 | 0.78±0.02 |
| First Order (Histogram) | 79.72±3.81 | 85.91±2.39 | 0.82±0.03 |
| Second Order (GLCM) | 86.45±2.17 | 87.93±1.76 | 0.86±0.01 |
| Second Order (GLRLM) | 81.94±2.30 | 81.96±2.38 | 0.81±0.01 |
| Wash-In/Out | 81.11±2.71 | 84.37±1.85 | 0.82±0.02 |
| Combined | 91.81±0.88 | 91.17±1.90 | 0.91±0.01 |
Comparison of the first stage classification performance Using the integrated markers of the developed CAD system: Benign (LR1-2) vs. Intermediate (LR3) vs. Malignant (LR4-5) using different machine learning classifiers and three validation approaches for each classifier (i.e. LOSO, 10-Fold, and 5-Fold). Let Sens, Spec, RFs, KNN, SVM, NB, and LDA denote sensitivity, specificity, random forests, k-nearest neighbor, support vector machine, naive Bayes, and linear discriminant analysis, respectively.
| Classifier | Approach | Sens% | Spec% | F |
|---|---|---|---|---|
| RFs | ||||
| kNN | LOSO | 86.24±0.00 | 91.89±0.00 | 0.89±0.00 |
| 10-Fold | 84.69±1.88 | 91.91±1.43 | 0.88±0.01 | |
| 5-Fold | 83.41±2.29 | 90.56±2.79 | 0.87±0.02 | |
| LOSO | 84.62±0.00 | 86.84±0.00 | 0.86±0.00 | |
| 10-Fold | 83.90±2.71 | 85.23±1.54 | 0.84±0.02 | |
| 5-Fold | 83.99±2.89 | 85.23±1.54 | 0.84±0.02 | |
| NB | LOSO | 82.86±0.00 | 86.84±0.00 | 0.84±0.00 |
| 10-Fold | 79.72±3.66 | 88.69±3.45 | 0.83±0.02 | |
| 5-Fold | 79.36±4.07 | 88.78±3.43 | 0.83±0.03 | |
| LDA | LOSO | 86.49±0.00 | 87.50±0.00 | 0.86±0.00 |
| 10-Fold | 82.77±4.62 | 85.96±1.84 | 0.84±0.03 | |
| 5-Fold | 80.85±5.31 | 81.88±3.49 | 0.80±0.03 |
Diagnostic Performance of the developed CAD system in the second stage classification: LR1 vs. LR2 and LR4 vs. LR5 using RFs classifier utilizing the combined markers.
| Approach | Accuracy% | ||
|---|---|---|---|
| Benign | |||
| LR1 | LR2 | Overall | |
| LOSO | 88.95±4.37 | 90.00±1.58 | 89.47±2.35 |
| 10-Fold | 86.32±4.82 | 88.42±3.16 | 87.37±3.29 |
| 5-Fold | 84.74±6.84 | 85.79±4.74 | 85.26±5.02 |
| Malignant | |||
| LR4 | LR5 | Overall | |
| LOSO | 92.63±2.58 | 85.26±2.11 | 88.95±1.58 |
| 10-Fold | 90.00±4.37 | 80.00±2.11 | 85.00±2.06 |
| 5-Fold | 87.37±6.74 | 74.74±2.11 | 81.05±3.07 |
Figure 9The overall confusion matrix obtained for the developed CAD system using LOSO approach utilizing the integrated markers for grading the tumors into (LR1, LR2, LR3, LR4, and LR5) using (a) a two-stage RFs classifier (proposed classification approach) compared to (b) a one-stage RFs classifier.
Comparison of the two-stage diagnostic performance using the developed CAD system (combined markers) with the performance of six different features/markers selection scenarios. Let m and ST denote the number of the used markers and significance threshold, respectively.
| Markers selection approach | m | Accuracy% | ||||||
|---|---|---|---|---|---|---|---|---|
| LR1 | LR2 | LR3 | LR4 | LR5 | Overall | |||
| Proposed CAD system (combined) | 249 | 85±2 | 83±4 | |||||
| Wrapper Approach | forward (ST=0.05) | 19 | 81±4 | 84±3 | 67±5 | 86±3 | 67±4 | 77±2 |
| forward (ST=0.10) | 196 | 75±3 | 88±0 | 65±5 | 85±3 | 64±4 | 75±2 | |
| bi-directional (ST=0.05) | 13 | 76±5 | 86±3 | 72±7 | 85±3 | 66±5 | 77±2 | |
| bi-directional (ST=0.10) | 16 | 76±4 | 73±3 | 84±4 | 64±6 | 78±2 | ||
| Gini-Impurity-based | combined selection | 134 | 75±3 | 76±0 | 73±4 | 87±2 | 65±5 | 75±2 |
| separate selection | 109 | 74±3 | 82±0 | 70±3 | 71±3 | 77±1 | ||
Figure 10Confusion matrices comparing the final diagnostic performance obtained by using the proposed two-stage RFs classification model along with a LOSO approach for grading the tumors into (LR1, LR2, LR3, LR4, and LR5) after applying six different features/markers reduction techniques as follows: (a) wrapper approach based on bi-directional elimination (step-wise selection) using a significance threshold (ST) of 0.1, (b) wrapper approach based on bi-directional elimination (step-wise selection) using an ST of 0.05, (c) wrapper approach based on forward selection using an ST of 0.1, (d) wrapper approach based on forward selection using an ST of 0.05, (e) Gini impurity-based approach using combined selection, and (f) Gini impurity-based approach using separate selection.
The final diagnostic performance for grading the tumors into (LR1, LR2, LR3, LR4, and LR5) by using (a) the proposed CAD system, (b) approach by Stocker et al.[18], and (c) approach by Wu et al.[21].
| Model | Accuracy% | |||||
|---|---|---|---|---|---|---|
| LR1 | LR2 | LR3 | LR4 | LR5 | Overall | |
| Proposed CAD System | 88±5 | 78±3 | 79±3 | |||
| Stocker[ | 71±0 | 82±0 | 71±0 | 53±0 | 70±0 | 69±0 |
| Wu[ | 58±9 | 60±8 | 76±4 | |||
Figure 11Confusion matrices comparing the final diagnostic performance for grading the tumors into (LR1, LR2, LR3, LR4, and LR5) by using (a) the proposed CAD sytem, (b) approach by Stocker et al.[18], and (c) approach by Wu et al.[21].