| Literature DB >> 31217445 |
Asuka Oyama1, Yasuaki Hiraoka2,3, Ippei Obayashi3, Yusuke Saikawa1, Shigeru Furui1,4, Kenshiro Shiraishi4, Shinobu Kumagai5, Tatsuya Hayashi1, Jun'ichi Kotoku6,7.
Abstract
The purpose of this study is to evaluate the accuracy for classification of hepatic tumors by characterization of T1-weighted magnetic resonance (MR) images using two radiomics approaches with machine learning models: texture analysis and topological data analysis using persistent homology. This study assessed non-contrast-enhanced fat-suppressed three-dimensional (3D) T1-weighted images of 150 hepatic tumors. The lesions included 50 hepatocellular carcinomas (HCCs), 50 metastatic tumors (MTs), and 50 hepatic hemangiomas (HHs) found respectively in 37, 23, and 33 patients. For classification, texture features were calculated, and also persistence images of three types (degree 0, degree 1 and degree 2) were obtained for each lesion from the 3D MR imaging data. We used three classification models. In the classification of HCC and MT (resp. HCC and HH, HH and MT), we obtained accuracy of 92% (resp. 90%, 73%) by texture analysis, and the highest accuracy of 85% (resp. 84%, 74%) when degree 1 (resp. degree 1, degree 2) persistence images were used. Our methods using texture analysis or topological data analysis allow for classification of the three hepatic tumors with considerable accuracy, and thus might be useful when applied for computer-aided diagnosis with MR images.Entities:
Mesh:
Substances:
Year: 2019 PMID: 31217445 PMCID: PMC6584736 DOI: 10.1038/s41598-019-45283-z
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Classification results for hepatocellular carcinoma (HCC), metastatic tumor (MT) and hepatic hemangioma (HH) using texture features and linear discriminant analysis (LDA) with an elastic net penalty.
| Subjects | Accuracy | Sensitivity | Specificity | AUC# |
|---|---|---|---|---|
| HCC and MT | 0.92 (92/100) | 1.00 (50/50) | 0.84 (42/50) | 0.95 |
| HCC and HH | 0.90 (90/100) | 0.96 (48/50) | 0.84 (42/50) | 0.95 |
| MT and HH | 0.73 (73/100) | 0.72 (36/50) | 0.74 (37/50) | 0.75 |
#AUC = area under the curve.
Classification results for hepatocellular carcinoma (HCC), metastatic tumor (MT) and hepatic hemangioma (HH) from persistence images of three types (degree 0, degree 1, and degree 2) using logistic classifier with an elastic net penalty and XGBoost.
| Subjects | MLM* | Degree | Accuracy | Sensitivity | Specificity | AUC# |
|---|---|---|---|---|---|---|
| HCC and MT | Logistic | 0 | 0.69 (69/100) | 0.48 (24/50) | 0.90 (45/50) | 0.69 |
| 1 | 0.75 (75/100) | 0.66 (33/50) | 0.84 (42/50) | 0.78 | ||
| 2 | 0.70 (70/100) | 0.58 (29/50) | 0.82 (41/50) | 0.69 | ||
| XGBoost | 0 | 0.82 (82/100) | 0.82 (41/50) | 0.82 (41/50) | 0.85 | |
| 1 | 0.85 (85/100) | 0.86 (43/50) | 0.84 (42/50) | 0.85 | ||
| 2 | 0.68 (68/100) | 0.80 (40/50) | 0.56 (28/50) | 0.68 | ||
| HCC and HH | Logistic | 0 | 0.74 (74/100) | 0.60 (30/50) | 0.88 (44/50) | 0.73 |
| 1 | 0.73 (73/100) | 0.56 (28/50) | 0.90 (45/50) | 0.73 | ||
| 2 | 0.69 (69/100) | 0.56 (28/50) | 0.82 (41/50) | 0.69 | ||
| XGBoost | 0 | 0.79 (79/100) | 0.66 (33/50) | 0.92 (42/50) | 0.83 | |
| 1 | 0.84 (84/100) | 0.86 (43/50) | 0.82 (41/50) | 0.89 | ||
| 2 | 0.72 (72/100) | 0.64 (32/50) | 0.80 (40/50) | 0.71 | ||
| MT and HH | Logistic | 0 | 0.62 (62/100) | 0.64 (32/50) | 0.60 (30/50) | 0.61 |
| 1 | 0.56 (56/100) | 0.18 (9/50) | 0.94 (47/50) | 0.49 | ||
| 2 | 0.52 (52/100) | 0.28 (14/50) | 0.76 (38/50) | 0.45 | ||
| XGBoost | 0 | 0.64 (64/100) | 0.74 (37/50) | 0.54 (27/50) | 0.60 | |
| 1 | 0.60 (60/100) | 0.90 (45/50) | 0.30 (15/50) | 0.57 | ||
| 2 | 0.74 (74/100) | 0.68 (34/50) | 0.80 (40/50) | 0.71 |
*MLM = machine learning models.
#AUC = area under the curve.
Figure 1ROC curves obtained from texture analysis (method 1, dotted line) and topological data analysis using XGBoost (method 2, solid line). (a) Classification between HCC and MT. Method 2 uses degree 1 persistence images. The areas under the curve (AUC) are 0.95 and 0.85, respectively, for method 1 and method 2. (b) Classification between HCC and HH. Method 2 uses degree 1 persistence images. The AUCs are 0.95, and 0.89, respectively, for method 1 and method 2. (c) Classification between MT and HH. Method 2 uses degree 2 persistence images. The AUCs are 0.75 and 0.71, respectively, for method 1 and method 2.
Texture features and texture extraction parameters used for this study.
| Texture features | ||
|---|---|---|
| Feature index | Texture type | Texture name |
| 1–3 | Global | Variance, Skewness, Kurtosis |
| 4–12 | GLCM | Energy, Contrast, Correlation, Homogeneity, Variance, Sum Average, Entropy, Dissimilarity, Auto Correlation |
| 13–25 | GLRLM | Short Run Emphasis, Long Run Emphasis, Gray-Level (GL) Non-uniformity, Run-Length Non-uniformity, Run Percentage, Low GL Run Emphasis, High GL Run Emphasis, Short Run Low GL Emphasis, Short Run High GL Emphasis, Long Run Low GL Emphasis, Long Run High GL Emphasis, GL Variance, Run-Length Variance |
| 26–38 | GLSZM | Small Zone Emphasis, Large Zone Emphasis, GL Level Non-uniformity, Zone-Size Non-uniformity, Zone Percentage, Low GL Zone Emphasis, High GL Zone Emphasis, Small Zone Low GL Emphasis, Small Zone High GL Emphasis, Large Zone Low GL Emphasis, Large Zone High GL Emphasis, GL Variance, Zone-Size Variance |
| 39–43 | NGTDM | Coarseness, Contrast, Busyness, Complexity, Strength |
|
| ||
| Wavelet band-pass filtering | 1/2, 2/3, 1, 3/2, 2 | |
| Isotropic voxel size at resampling | initial in-plane resolution, 1, 2, 3, 4, 5 | |
| Number of gray levels at quantization | 8, 16, 32, 64 | |
| Quantization algorithm | ‘Equal-probability’, ‘Lloyd–Max’ | |
GLCM: Gray-level co-occurrence matrix.
GLRLM: Gray-level run-length matrix.
GLSZM: Gray-level size zone matrix.
NGTDM: Neighborhood gray-tone difference matrix.
Figure 2Gray-scale images of rectangular parallelepiped ROIs: axial views around the mid-section: five images each of the ROIs that contain HCC (top), MT (middle), and HH (bottom). We generated persistence diagrams of the ROIs from these voxel values.
Figure 3(a) A gray-scale image. (b) Filtration of binary images (h, threshold gray-scale value). A sequence of binary images is obtained by changing the threshold. The area surrounded by the blue line represents an example of the birth of a connected component. The area surrounded by the red line represents the death of the connected component. (c) Persistence diagram (degree 0). The point in the circle corresponds to the birth–death described above a pair of the connected component.
Figure 4(a) Persistence diagram (degree 1) of an ROI containing HCC. Numerous dots denote birth–death pairs of cavities above the line of y = x. The dot colors reflect the number of birth–death pairs created at the points. (b) Persistence images were obtained from this persistence diagram. The dot color density is determined by the importance level in data characterization.
Figure 5Persistence images of ROIs including tumors: four persistence images (degree 1) each of the ROIs that include HCC (top), MT (middle) and HH (bottom). The dot color density is determined by the importance level in data characterization.