| Literature DB >> 35885564 |
Mardhiyati Mohd Yunus1,2, Ahmad Khairuddin Mohamed Yusof3, Muhd Zaidi Ab Rahman3, Xue Jing Koh1, Akmal Sabarudin1, Puteri N E Nohuddin4,5, Kwan Hoong Ng6,7, Mohd Mustafa Awang Kechik8, Muhammad Khalis Abdul Karim8.
Abstract
Radiomics is the process of extracting useful quantitative features of high-dimensional data that allows for automated disease classification, including atherosclerotic disease. Hence, this study aimed to quantify and extract the radiomic features from Coronary Computed Tomography Angiography (CCTA) images and to evaluate the performance of automated machine learning (AutoML) model in classifying the atherosclerotic plaques. In total, 202 patients who underwent CCTA examination at Institut Jantung Negara (IJN) between September 2020 and May 2021 were selected as they met the inclusion criteria. Three primary coronary arteries were segmented on axial sectional images, yielding a total of 606 volume of interest (VOI). Subsequently, the first order, second order, and shape order of radiomic characteristics were extracted for each VOI. Model 1, Model 2, Model 3, and Model 4 were constructed using AutoML-based Tree-Pipeline Optimization Tools (TPOT). The heatmap confusion matrix, recall (sensitivity), precision (PPV), F1 score, accuracy, receiver operating characteristic (ROC), and area under the curve (AUC) were analysed. Notably, Model 1 with the first-order features showed superior performance in classifying the normal coronary arteries (F1 score: 0.88; Inverse F1 score: 0.94), as well as in classifying the calcified (F1 score: 0.78; Inverse F1 score: 0.91) and mixed plaques (F1 score: 0.76; Inverse F1 score: 0.86). Moreover, Model 2 consisting of second-order features was proved useful, specifically in classifying the non-calcified plaques (F1 score: 0.63; Inverse F1 score: 0.92) which are a key point for prediction of cardiac events. Nevertheless, Model 3 comprising the shape-based features did not contribute to the classification of atherosclerotic plaques. Overall, TPOT shown promising capabilities in terms of finding the best pipeline and tailoring the model using CCTA-based radiomic datasets.Entities:
Keywords: AutoML; CCTA; TPOT; atherosclerotic plaques; radiomic features; supervised
Year: 2022 PMID: 35885564 PMCID: PMC9318450 DOI: 10.3390/diagnostics12071660
Source DB: PubMed Journal: Diagnostics (Basel) ISSN: 2075-4418
Figure 1Tree-based pipeline from TPOT. Reprinted with permission from Ref. [37]. 2019, Oxford University Press.
Figure 2Overall flow of patient selection.
Figure 3Overall research workflow.
Demographic data of the study.
| Characteristics | Total ( |
|---|---|
| | |
| Gender | |
| Male, | 138 (68.3%) |
| Female, | 64 (31.7%) |
| Ethnicity | |
| Malay, | 104 (51.5%) |
| Chinese, | 46 (22.8%) |
| Indian, | 51 (25.2%) |
| Others, | 1 (0.5%) |
| Age (years ± SD) | 58.84 ± 9.497 |
| Body Mass Index (kg/m2 ± SD) | 26.81 ± 3.746 |
| | |
| CAD-RADS 0, | 6 (3.0%) |
| CAD-RADS 1, | 9 (4.5%) |
| CAD-RADS 2, | 54 (26.7%) |
| CAD-RADS 3, | 36 (17.8%) |
| CAD-RADS 4, | 95 (47.0%) |
| CAD-RADS 5, | 2 (1.0%) |
| | |
| Total DLP (mGy × cm ± SD) | 322.00 ± 167.926 |
| Heart rate (bpm ± SD) | 70.20 ± 10.522 |
| Contrast medium (mL ± SD) | 59.57 ± 2.347 |
Figure 4(a) Before segmentation of proximal LAD and (b) after segmentation of non-calcified lesion on proximal LAD using semi-automated (growth from seed) type of segmentation which was colored into yellow colour.
Figure 5LIFEx software is used to perform semi-automated segmentation on RCA, LAD, and LCX. (a1) Mid RCA with a mixed calcified atherosclerotic plaque seen. (a2) The mixed calcified plaque was enclosed by the VOI placement (pink colour) on the mid RCA. (b1) Proximal LAD with a non-calcified atherosclerotic plaque was seen. (b2) The non-calcified plaque was surrounded by the VOI placement (yellow colour) on the proximal LAD. (c1) Proximal LCX with a calcified atherosclerotic plaque was observed. (c2) The calcified atherosclerotic plaque was surrounded by the VOI placement (blue colour) on the proximal LCX.
First order features (n = 29), second order features (n = 31), and shape order features (n = 5) are extensive descriptions of radiomic features collected from each divided region.
| First Order Features ( | Second Order Features ( | Shape Order Features ( |
|---|---|---|
Figure 6Pipeline search by TPOT. Initially, raw data was split into input and output variables.
The diagnostic performance of each ML model in classifying the atherosclerotic plaques.
| Atherosclerotic Plaques (Output) | ML | Recall | Precision (PPV) | F1- | Inverse Recall | Inverse | Inverse F1- | Accuracy |
|---|---|---|---|---|---|---|---|---|
| 1 | 0.97 | 0.80 | 0.88 * | 0.90 | 0.99 | 0.94 * | 0.92 * | |
| Normal | 2 | 0.83 | 0.73 | 0.77 | 0.88 | 0.93 | 0.90 | 0.86 |
| 3 | 0.24 | 0.33 | 0.28 | 0.79 | 0.70 | 0.74 | 0.62 | |
| 4 | 1.00 | 0.72 | 0.84 | 0.85 | 1.00 | 0.92 | 0.89 | |
| 1 | 0.71 | 0.86 | 0.78 * | 0.95 | 0.87 | 0.91 * | 0.87 * | |
| Calcified | 2 | 0.66 | 0.82 | 0.73 | 0.93 | 0.85 | 0.89 | 0.84 |
| 3 | 0.63 | 0.59 | 0.61 | 0.71 | 0.74 | 0.72 | 0.68 | |
| 4 | 0.74 | 0.74 | 0.74 | 0.88 | 0.88 | 0.88 | 0.83 | |
| 1 | 0.53 | 0.67 | 0.59 | 0.94 | 0.90 | 0.92 * | 0.87 * | |
| Non-calcified | 2 | 0.58 | 0.69 | 0.63 * | 0.94 | 0.91 | 0.92 * | 0.87 * |
| 3 | 0.16 | 0.75 | 0.26 | 0.98 | 0.78 | 0.87 | 0.78 | |
| 4 | 0.37 | 0.78 | 0.50 | 0.98 | 0.87 | 0.92 * | 0.87 * | |
| 1 | 0.79 | 0.72 | 0.76 * | 0.84 | 0.89 | 0.86 * | 0.82 * | |
| Mixed | 2 | 0.79 | 0.69 | 0.74 | 0.81 | 0.88 | 0.84 | 0.80 |
| 3 | 0.69 | 0.45 | 0.55 | 0.49 | 0.73 | 0.59 | 0.57 | |
| 4 | 0.72 | 0.74 | 0.73 | 0.86 | 0.85 | 0.85 | 0.81 |
The highest result of F1 score, Inverse F1 score, and accuracy was marked as (*).
Figure 7Heatmap confusion matrix for (a) Model 1, (b) Model 2, (c) Model 3 and (d) Model 4. Each column of the matrix represents the occurrence in a predicted class, whereas each row represents the occurrence in an actual class.
Figure 8ROC curve for (a) Model 1, (b) Model 2, (c) Model 3 and (d) Model 4.