| Literature DB >> 35894017 |
Arman Sharbatdaran1, Dominick Romano1, Kurt Teichman1, Hreedi Dev1, Syed I Raza1, Akshay Goel1, Mina C Moghadam1, Jon D Blumenfeld2, James M Chevalier2, Daniil Shimonov2, George Shih1, Yi Wang3, Martin R Prince1,4.
Abstract
Organ volume measurements are a key metric for managing ADPKD (the most common inherited renal disease). However, measuring organ volumes is tedious and involves manually contouring organ outlines on multiple cross-sectional MRI or CT images. The automation of kidney contouring using deep learning has been proposed, as it has small errors compared to manual contouring. Here, a deployed open-source deep learning ADPKD kidney segmentation pipeline is extended to also measure liver and spleen volumes, which are also important. This 2D U-net deep learning approach was developed with radiologist labeled T2-weighted images from 215 ADPKD subjects (70% training = 151, 30% validation = 64). Additional ADPKD subjects were utilized for prospective (n = 30) and external (n = 30) validations for a total of 275 subjects. Image cropping previously optimized for kidneys was included in training but removed for the validation and inference to accommodate the liver which is closer to the image border. An effective algorithm was developed to adjudicate overlap voxels that are labeled as more than one organ. Left kidney, right kidney, liver and spleen labels had average errors of 3%, 7%, 3%, and 1%, respectively, on external validation and 5%, 6%, 5%, and 1% on prospective validation. Dice scores also showed that the deep learning model was close to the radiologist contouring, measuring 0.98, 0.96, 0.97 and 0.96 on external validation and 0.96, 0.96, 0.96 and 0.95 on prospective validation for left kidney, right kidney, liver and spleen, respectively. The time required for manual correction of deep learning segmentation errors was only 19:17 min compared to 33:04 min for manual segmentations, a 42% time saving (p = 0.004). Standard deviation of model assisted segmentations was reduced to 7, 5, 11, 5 mL for right kidney, left kidney, liver and spleen respectively from 14, 10, 55 and 14 mL for manual segmentations. Thus, deep learning reduces the radiologist time required to perform multiorgan segmentations in ADPKD and reduces measurement variability.Entities:
Keywords: ADPKD; artificial intelligence; interobserver variability; kidney volume; liver volume; machine learning; spleen volume
Mesh:
Year: 2022 PMID: 35894017 PMCID: PMC9326744 DOI: 10.3390/tomography8040152
Source DB: PubMed Journal: Tomography ISSN: 2379-1381
Literature on deep learning methods for organ volume measurements in ADPKD.
| First | Modality | Year | ADPKD subjects | Segmentation | Dice Score | Other Metrics | Organ |
|---|---|---|---|---|---|---|---|
| Sharma [ | CT | 2017 | 125 | 2D VGG-16 FCN | 0.86 | 7.8% | Kidney |
| Keshwani [ | CT | 2018 | 203 CT scans ** | Multi-task 3D FCN | 0.95 | 3.86% | Kidney |
| Shin [ | CT | 2020 | 214 | 3D V-net | 0.961 * | 95% within 3% | Kidney + Liver |
| Onthoni [ | CT | 2020 | 97 | 2D SSD Inception Network V2 | - | Images: mAP: 94% | Kidney |
| Hsiao [ | CT | 2022 | 210 | FPN + EfficientNet | 0.969 | - | Kidney |
| Jagtap [ | US | 2022 | 22 | 2D U-Net | 0.80 | 4.12% | Kidney |
| Kim [ | MRI | 2016 | 60 | SPPM + PSC | 0.88 | MCC: 0.97 | Kidney |
| Kline [ | MRI | 2017 | 2000 scans ** | 2D U-Net + | 0.97 | 0.68% | Kidney |
| Guangrui [ | MRI | 2019 | 305 | 3D VB-Net *** | RK-0.958 | - | Kidney |
| Van Gastel [ | MRI | 2019 | 145 | 2D U-Net | - | LK: 0.96 | Kidney + Liver |
| Kline [ | MRI | 2020 | 60 | 2D U-Net + | 1st Reader: 0.86 | 1st Reader: 3.9% | Kidney cysts |
| Goel [ | MRI | 2022 | 173 | 2D U-Net + | External: 0.98 | External: 2.6% | Kidney |
| Raj [ | MRI | 2022 | 100 | 2D Attention U-Net | 0.922 | MSSD: 0.922 and 1.09 mm | Kidney |
| Taylor [ | MRI | 2022 | 227 Scans | 3D U-Net | 0.96 | LK:1.8% | Kidney |
FPN = Feature Pyramid Network; FCN = Fully Convolutional Network; SSD = Single Shot Detector; MSSD = Mean Symmetric Surface Distance; RK= right kidney; LK = left kidney; Cor = Coronal; MAPE= Mean absolute percentage error; mAP= mean Average Precision; MCC = mean correlation coefficient. * DSC corresponds to combination of TKV and liver volume. ** number of subjects is unknown. *** customization of V-Net.
Figure 1DICOMs of prospective and external cases are pushed to a root input directory from PACS. Checkpoints of each organ are then used to perform organ-specific inference. Once all inferences are complete, the algorithm combines the organs and adjudicates the model overlaps to create a multi-organ ensemble.
Figure 2Axial T2 weighted image of a 67-year-old male with ADPKD showing: (A) pink voxels between liver (yellow) and right kidney (red) because they met the 2D model criteria for both kidney and liver. (B) same image as (A) but with increased transparency of labels to show that the pink overlap voxels correspond to a right renal cyst. (C) corrected image with overlap voxels now assigned to red corresponding to the right kidney.
Figure 3Axial T2 weighted image from a 59-year-old male with ADPKD showing: (A) model correctly labeling right kidney (red), left kidney (green), spleen (blue) and liver (yellow) except that inference cropping causes an incorrectly straight liver label border (white arrow). (B) with partial label transparency, the underlying organs are visualized with numerous cysts in both right and left kidneys (red and green), including the incorrectly straight liver label border (white arrow). (C) after removing the cropping step from the inference input, the liver border is correctly labeled without cropping (white arrow).
Demographic data.
| Parameter | Training/Validation | External | Prospective |
|---|---|---|---|
| Number of Patients | 215 | 30 | 30 |
| Number of MR exams | 260 | 30 | 30 |
| DICOM images | 9540 | 1368 | 2137 |
| Male:Female (%male) | 98:117 (46%) | 17:13 (57%) | 11:19 (37%) |
| Age at scan (years) | 49 ± 14 | 49 ± 16 | 46 ± 15 |
| eGFR (mL/min/1.73 m2) | 68 ± 28 | 85 ± 30 | 72 ± 34 |
| Total Kidney Volume (mL) * | 1287 (669–2213) | 1334 (693–2376) | 1444 (885–2020) |
| Ht-TKV (mL/m) * | 757 (415–1275) | 777 (393–1297) | 837 (550–1234) |
| A | 29 (13%) | 4 (13%) | 1 (3%) |
| B | 58 (27%) | 8 (27%) | 6 (20%) |
| C | 70 (33%) | 7 (24%) | 13 (44%) |
| D | 34 (16%) | 10 (33%) | 7 (23%) |
| E | 24 (11%) | 1 (3%) | 3 (10%) |
| Asian | 10 (5%) | 1 (3%) | 4 (13%) |
| White | 148 (69%) | 23 (77%) | 16 (53%) |
| Black | 14 (6%) | 1 (3%) | 2 (67%) |
| Unknown | 43 (20%) | 5 (17%) | 8 (27%) |
* median (interquartile range), ** Calculated based upon the first exam for subjects with multiple MRIs. Ht-TKV: height-adjusted total kidney volume.
(A) Model accuracy on external validation (n = 30), median (interquartile range). (B) Model accuracy on prospective validation (n = 30), median (IQR).
| (A) | ||||
|---|---|---|---|---|
| External Test Set | Right Kidney | Left Kidney | Liver | Spleen |
| Model volume (mL) | 617 (327–1009) | 582 (416–1289) | 1706 (1292–2087) | 220 (145–274) |
| DSC | 0.96 | 0.98 | 0.97 | 0.96 |
| Concordance Coefficient | >0.99 | >0.99 | 0.98 | 0.99 |
| RMS error (mL) | 42 | 39 | 258 | 17 |
| Average % error | 7% | 3% | 3% | 1% |
| Number with zero error | 5 (17%) | 6 (20%) | 1 (3%) | 7 (23%) |
|
| ||||
|
|
|
|
|
|
| Model volume (mL) | 625 (370–1000) | 729 (481–1039) | 1711 (1489–2065) | 244 (177–315) |
| DSC | 0.96 | 0.96 | 0.96 | 0.95 |
| Concordance Coefficient | >0.99 | >0.99 | >0.99 | 0.98 |
| RMS error (mL) | 112 | 65 | 112 | 37 |
| Average % error | 6% | 5% | 5% | 1% |
| # with zero error | 2 (7%) | 3 (20%) | 2 (7%) | 2 (7%) |
Figure 4Bland-Altman plots of external validation comparing the inference volume with radiologist corrected volume for (A) left kidney, (B) right kidney, (C) spleen and (D) liver.
Figure 5Bland-Altman plots of prospective validation comparing the inference volume with radiologist corrected volume for (A) left kidney, (B) right kidney, (C) spleen and (D) liver.
Figure 6Examples of model error on axial T2. (A) Axial T2 weighted image from a 51-year-old male with ADPKD showing an unlabeled cystic region (red arrow) between liver (yellow) and right kidney (red) reflecting the challenge of finding the boundary between the right kidney from the liver, with both being very cystic. Also note that the fluid filled stomach (white arrow) is incorrectly labelled as left kidney (green). (B) Axial T2 weighted image from a 59-year-old male with ADPKD demonstrating correct labeling of the left kidney (green), incorrectly labelled abdominal wall mistaken for liver (yellow arrow), and an unlabeled anterior right renal cyst (red arrow) which should be red for right kidney but is in a location which can be mistaken as gallbladder. (C) Axial T2 weighted image from a 51-year-old male with ADPKD demonstrating (A) correct labeling of the spleen (blue) but incomplete labeling (yellow arrow) of the left edge of a massively enlarged liver which does not commonly extend this far to the left side of the patient. (D) Axial T2 weighted image from a 62-year-old male with ADPKD showing near perfect labeling of the spleen (blue). The gallbladder (yellow arrow) is partially incorrectly labeled as liver (yellow).
Mean time (minutes) for manual organ segmentation and model assisted segmentations for the first 10 prospective cases averaged over four trained observers.
| Manual Segmentation | Model Assisted Segmentation | Time Savings | ||
|---|---|---|---|---|
| Right Kidney | 7:39 ± 2:26 | 4:31 ± 1:34 | 3:08 (41%) | 0.004 |
| Left Kidney | 7:34 ± 3:44 | 4:16 ± 1:35 | 3:19 (44%) | 0.01 |
| Liver | 12:49 ± 6:10 | 8:49 ± 3:52 | 3:59 (31%) | 0.007 |
| Spleen | 4:13 ± 0:48 | 2:04 ± 0:59 | 2:09 (51%) | 0.0003 |
| Total | 33:04 ± 8:05 | 19:17 ± 7:19 | 13:47 (42%) | 0.001 |
Standard deviations of organ volumes measured by three trained observers for segmentations performed manually or with model assistance.
| Volume Measurement Standard Deviations | |||
|---|---|---|---|
| Manual Segmentation (mL) | Model Assisted Segmentation (mL) | ||
| Right Kidney | 14 | 7 | 0.02 |
| Liver | 55 | 11 | 0.001 |
| Spleen | 14 | 5 | 0.001 |
Figure 7Coronal T2 weighted image from a 61-year-old male with ADPKD showing correctly labeled liver (yellow) and left kidney (green). Right kidney (red) is mostly correct but has a thin vertical sliver (red arrow) that crossed the midline and became labeled as left kidney (green).