| Literature DB >> 34737340 |
Jiyeon Ha1, Taeyong Park2, Hong-Kyu Kim3, Youngbin Shin4, Yousun Ko4, Dong Wook Kim1, Yu Sub Sung5,6, Jiwoo Lee4, Su Jung Ham1, Seungwoo Khang7, Heeryeol Jeong7, Kyoyeong Koo7, Jeongjin Lee7, Kyung Won Kim8.
Abstract
As sarcopenia research has been gaining emphasis, the need for quantification of abdominal muscle on computed tomography (CT) is increasing. Thus, a fully automated system to select L3 slice and segment muscle in an end-to-end manner is demanded. We aimed to develop a deep learning model (DLM) to select the L3 slice with consideration of anatomic variations and to segment cross-sectional areas (CSAs) of abdominal muscle and fat. Our DLM, named L3SEG-net, was composed of a YOLOv3-based algorithm for selecting the L3 slice and a fully convolutional network (FCN)-based algorithm for segmentation. The YOLOv3-based algorithm was developed via supervised learning using a training dataset (n = 922), and the FCN-based algorithm was transferred from prior work. Our L3SEG-net was validated with internal (n = 496) and external validation (n = 586) datasets. Ground truth L3 level CT slice and anatomic variation were identified by a board-certified radiologist. L3 slice selection accuracy was evaluated by the distance difference between ground truths and DLM-derived results. Technical success for L3 slice selection was defined when the distance difference was < 10 mm. Overall segmentation accuracy was evaluated by CSA error and DSC value. The influence of anatomic variations on DLM performance was evaluated. In the internal and external validation datasets, the accuracy of automatic L3 slice selection was high, with mean distance differences of 3.7 ± 8.4 mm and 4.1 ± 8.3 mm, respectively, and with technical success rates of 93.1% and 92.3%, respectively. However, in the subgroup analysis of anatomic variations, the L3 slice selection accuracy decreased, with distance differences of 12.4 ± 15.4 mm and 12.1 ± 14.6 mm, respectively, and with technical success rates of 67.2% and 67.9%, respectively. The overall segmentation accuracy of abdominal muscle areas was excellent regardless of anatomic variation, with CSA errors of 1.38-3.10 cm2. A fully automatic system was developed for the selection of an exact axial CT slice at the L3 vertebral level and the segmentation of abdominal muscle areas.Entities:
Mesh:
Year: 2021 PMID: 34737340 PMCID: PMC8568923 DOI: 10.1038/s41598-021-00161-5
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Figure 1An overview of dataset composition.
Subject characteristics of internal and external validation cohorts.
| Characteristics | Development dataset | Internal validation dataset | External validation dataset |
|---|---|---|---|
| Number of subjects | 922 | 496 | 586 |
| Age (years) | 54.4 ± 14.0 | 53.7 ± 8.7 | 58.5 ± 12.3 |
| Female (%, female:male) | 39.3% (362:560) | 39.3% (195:301) | 40.8% (239:347) |
| Normal anatomy group | 807 (87.5%) | 438 (88.3%) | 505 (86.2%) |
| Anatomic variants group | 115 (12.5%) | 58 (11.7%) | 81 (13.8%) |
| Thoracolumbar variant | 48 (5.2%) | 20 (4.0%) | 26 (4.4%) |
| Lumbosacral variant | 43 (4.7%) | 29 (5.8%) | 43 (7.3%) |
| Numeric variant | 12 (1.3%) | 4 (1.4%) | 7 (1.2%) |
| Combined variant | 12 (1.3%) | 5 (1.7%) | 5 (0.9%) |
| Institution | AMC | AMC | UUH, KHUH, AUH |
| None (healthy) | 87 | 496 | 586 |
| Gastric cancer | 436 | 0 | 0 |
| Sepsis | 245 | 0 | 0 |
| Pancreatic cancer | 154 | 0 | 0 |
AMC Asan Medical Center, AUH Ajou University Hospital, KHUH Kyung Hee University Hospital, UUH Ulsan University Hospital.
Figure 2Anatomic lumbar spine variants. Examples of normal, thoracolumbar, lumbosacral, numeric, and combined variations are presented.
Figure 3Example of multiple bounding boxes for the training of the YOLOv3-based model and architecture of our YOLOv3-based network. Multiple bounding boxes were generated in the maximum intensity projection images based on the following prerequisites as illustrated in (A): (1) the L4 vertebra was located at the iliac crest level, (2) the L3 vertebra was located superiorly to the L4 vertebra, (3) the morphologies of the lumbar vertebrae were the same. The YOLOv3-based model used an objectness score for each bounding box obtained from logistic regression to predict the width and height of the box as well as its location relative to grid cell. The sum of the squared error loss was used to train the model for minimizing differences between the ground-truth object and the bounding box. Any error between the bounding box over the ground-truth object was incurred for both classification and detection loss. Our model extracted features of the bounding boxes using the network architecture illustrated in (B). Our network architecture used successive 3 × 3 and 1 × 1 convolution layers and a set of residual blocks with shortcut connections. A total of 53 convolutional layers were formed like Darknet-53. YOLOv3 predicted boxes at three different scales to support detection on varying scales.
Figure 4Example of L3 level selection and body composition analysis using L3SEG-net. APCT was converted to a MIP image. L3 level was selected by YOLOv3 based selection algorithm and selected CT slice number was transferred to FCN based segmentation algorithm. The final output of L3SEG-net was areas of each composition element including subcutaneous fat area, skeletal muscle area, and visceral fat area. APCT abdominopelvic computed tomography, MIP maximal intensity projection.
Figure 5Box plots of distance difference between ground truth and deep learning model (DLM) derived results in (A) internal validation dataset and (B) external validation cohorts. The mean differences between the ground truth and the DLM-derived results were 3.7 mm ± 8.4 and 4.1 mm ± 8.3 for the internal, and external validation cohorts, respectively.
Cross-sectional area segmentation using the ground truth-derived and DLM-derived levels.
| Parameter | Internal validation dataset | External validation dataset | ||||
|---|---|---|---|---|---|---|
| SMA | Sfat | Vfat | SMA | Sfat | Vfat | |
| CSA from GT (cm2) | 140.88 ± 34.53 | 140.90 ± 56.71 | 114.53 ± 65.05 | 132.76 ± 31.25 | 133.15 ± 62.16 | 110.59 ± 64.29 |
| CSA from DLM (cm2) | 140.53 ± 34.20 | 141.98 ± 56.60 | 115.93 ± 65.40 | 130.07 ± 31.07 | 135.54 ± 62.64 | 110.72 ± 65.19 |
| p value* | 0.874 | 0.764 | 0.736 | 0.139 | 0.492 | 0.973 |
| CSA error (%) | 1.38 ± 1.46 | 3.51 ± 5.41 | 4.00 ± 6.35 | 3.10 ± 2.85 | 4.54 ± 6.34 | 4.26 ± 6.47 |
| CSA from GT (cm2) | 141.20 ± 34.46 | 138.85 ± 55.86 | 112.42 ± 64.73 | 132.75 ± 31.15 | 133.99 ± 62.82 | 110.88 ± 64.18 |
| CSA from DLM (cm2) | 140.87 ± 34.06 | 140.47 ± 55.72 | 114.11 ± 64.95 | 130.14 ± 31.00 | 136.73 ± 63.15 | 111.13 ± 65.06 |
| p value* | 0.883 | 0.659 | 0.692 | 0.167 | 0.474 | 0.950 |
| CSA error (%) | 1.22 ± 1.08 | 2.31 ± 2.21 | 2.97 ± 3.21 | 2.86 ± 2.57 | 3.39 ± 2.78 | 3.36 ± 4.68 |
| CSA from GT (cm2) | 136.33 ± 35.18 | 169.78 ± 60.54 | 144.23 ± 62.27 | 132.97 ± 32.3 | 123.03 ± 52.30 | 107.10 ± 65.61 |
| CSA from DLM (cm2) | 135.77 ± 35.80 | 163.24 ± 64.41 | 141.53 ± 66.37 | 129.21 ± 31.93 | 122.66 ± 54.42 | 105.78 ± 66.50 |
| p value* | 0.949 | 0.672 | 0.865 | 0.579 | 0.974 | 0.924 |
| CSA error (%) | 3.68 ± 3.19 | 20.42 ± 8.14 | 18.37 ± 15.68 | 6.01 ± 4.18 | 18.28 ± 15.16 | 15.06 ± 12.56 |
| p value§ | < 0.01 | < 0.01 | < 0.01 | < 0.01 | < 0.01 | < 0.01 |
Data are presented as mean ± standard deviation.
*The p-value is calculated from Student t-test comparing the GT CSA and the CSA determined using the DLM.
§The p-value is calculated from Student t-test comparing CSA errors between subjects with technical success and subjects with technical failure.
CSA cross-sectional area, DLM deep learning model, GT ground truth, Sfat subcutaneous fat area, SMA skeletal muscle area, Vfat visceral fat area.
Figure 6Bland Altman plots to evaluate agreement of SMA between the GT and DLM. (A) In subjects with technical success in the internal validation cohort, (B) in subjects with technical failure in the internal validation cohort, (C) in subjects with technical success in the external validation cohort, (D) in subjects with technical failure in the external validation cohort.
Subgroup analysis according to spine anatomy.
| Subgroup | Distance difference (mm) | Technical success (%) | CSA error (%) | Bland–Altman (mean ± limits of agreement) | ||||
|---|---|---|---|---|---|---|---|---|
| SMA | Sfat | Vfat | SMA | Sfat | Vfat | |||
| Normal anatomy (n = 943) | 2.6 ± 6.0 | 96.5 | 2.22 ± 2.46 | 3.46 ± 4.78 | 3.57 ± 5.58 | 1.68 ± 7.22 | -2.29 ± 13.13 | -0.84 ± 10.03 |
| Thoracolumbar variation (n = 46) | 7.4 ± 11.9 | 82.6 | 2.73 ± 2.24 | 5.83 ± 8.79 | 5.87 ± 7.04 | 2.23 ± 7.90 | 2.41 ± 34.33 | -2.69 ± 24.10 |
| Lumbosacral variation (n = 72) | 13.4 ± 15.2 | 63.9 | 3.04 ± 2.49 | 8.72 ± 10.63 | 7.94 ± 9.23 | 1.40 ± 10.56 | 2.17 ± 35.93 | 0.86 ± 24.84 |
| Numeric variation (n = 11) | 16.5 ± 16.1 | 54.5 | 2.37 ± 2.11 | 10.87 ± 7.62 | 10.36 ± 10.19 | -0.22 ± 7.10 | -3.93 ± 46.02 | -1.82 ± 26.79 |
| Combined variation (n = 10) | 21.4 ± 17.0 | 40 | 4.06 ± 2.92 | 11.86 ± 12.66 | 14.95 ± 17.03 | -2.53 ± 14.78 | -7.15 ± 58.06 | 10.82 ± 67.40 |
CSA cross-sectional area, Sfat subcutaneous fat area, SMA skeletal muscle area, Vfat visceral fat area.