| Literature DB >> 36010288 |
Quirin David Strotzer1, Hinrich Winther2, Kirsten Utpatel3, Alexander Scheiter3, Claudia Fellner1, Michael Christian Doppler4, Kristina Imeen Ringe2, Florian Raab1, Michael Haimerl1, Wibke Uller4, Christian Stroszczynski1, Lukas Luerken1, Niklas Verloh4.
Abstract
We aimed to evaluate whether U-shaped convolutional neuronal networks can be used to segment liver parenchyma and indicate the degree of liver fibrosis/cirrhosis at the voxel level using contrast-enhanced magnetic resonance imaging. This retrospective study included 112 examinations with histologically determined liver fibrosis/cirrhosis grade (Ishak score) as the ground truth. The T1-weighted volume-interpolated breath-hold examination sequences of native, arterial, late arterial, portal venous, and hepatobiliary phases were semi-automatically segmented and co-registered. The segmentations were assigned the corresponding Ishak score. In a nested cross-validation procedure, five models of a convolutional neural network with U-Net architecture (nnU-Net) were trained, with the dataset being divided into stratified training/validation (n = 89/90) and holdout test datasets (n = 23/22). The trained models precisely segmented the test data (mean dice similarity coefficient = 0.938) and assigned separate fibrosis scores to each voxel, allowing localization-dependent determination of the degree of fibrosis. The per voxel results were evaluated by the histologically determined fibrosis score. The micro-average area under the receiver operating characteristic curve of this seven-class classification problem (Ishak score 0 to 6) was 0.752 for the test data. The top-three-accuracy-score was 0.750. We conclude that determining fibrosis grade or cirrhosis based on multiphase Gd-EOB-DTPA-enhanced liver MRI seems feasible using a 2D U-Net. Prospective studies with localized biopsies are needed to evaluate the reliability of this model in a clinical setting.Entities:
Keywords: Artificial Intelligence; U-Net; cirrhosis; convolutional neural network; liver fibrosis; segmentation
Year: 2022 PMID: 36010288 PMCID: PMC9406317 DOI: 10.3390/diagnostics12081938
Source DB: PubMed Journal: Diagnostics (Basel) ISSN: 2075-4418
Figure 1Label distribution. The full dataset’s label distribution (Ishak scores) (n = 112).
Figure 2Nested Cross-Validation Procedure. First, the dataset was split into five combinations of 80% training/validation and 20% test cases using stratified sampling to ensure equal label distribution in the whole dataset and all subsets. Second, five models were trained based on the train/validation partitions (outer loop). Per default, nnU-Net employs a second 5-fold cross-validation, for which the train/validation partitions are split into 80% train and 20% test cases (inner loop). Finally, nnU-Net automatically selects the optimal configuration based on the validation partitions. The final model performance is estimated based on the overall prediction scores on the outer loop test sets. Each block represents 20% of the respective data partition.
Figure 3Model Architecture. 2D U-Net Architecture generated by nnU-Net based on the dataset fingerprint [30]. The input consists of the five dynamic T1-weighted MRI sequences. The input patch size is 320 × 320. Yellow planes each represent a sequence composed of plain convolutions (conv), instance normalization (norm), and leaky rectified linear units (ReLU). Conv kernel size is [3, 3] (except for a kernel size of [1, 1] and stride [1, 1] for the segmentation output and auxiliary segmentation output layers). Resolution is reduced after each two of these blocks by strided convolutions (stride is depicted in the right-sided square brackets). Red planes represent transposed convolutions with kernel size [2, 2] and stride [2, 2]. Feature map sizes are displayed for the encoder part (left) and mirrored by the decoder (right). The illustration in the right upper corner represents an exemplary output segmentation of the model, where the colored pixels depict different Ishak scores and black represents the background value ‘0′. The figure was generated with PlotNeuralNet (version 1.0.0, github.com/HarisIqbal88/PlotNeuralNet, accessed on 21 October 2021).
Binary segmentation results by Ishak score. This table lists the mean DICE and HD95 for the binarized segmentation results separated by the Ishak score.
| Ishak 0 | Ishak 1 | Ishak 2 | Ishak 3 | Ishak 4 | Ishak 5 | Ishak 6 | |
|---|---|---|---|---|---|---|---|
| DICE | 0.951 | 0.935 | 0.962 | 0.957 | 0.951 | 0.948 | 0.890 |
| HD95 | 4.225 | 5.837 | 3.015 | 4.444 | 4.152 | 10.004 | 9.856 |
Figure 4Example segmentation result of fibrosis distribution. (A) 3D rendering superimposed on corresponding MRI slices in portal venous phase. (B) 2D segmentation mask of the same example. The predictions, in this case, are green = Ishak 0, yellow = Ishak 2.
Figure 5Exemplary dynamic MRI scans and segmentation results. Randomly selected cases. For each Ishak score (as indicated by the numbers on the left), an axial section through the dynamic co-registered and z-score normalized MRI scans in arterial, late arterial, portal venous, hepatobiliary, and native phases as well as the segmentation result superimposed on the portal venous scan is depicted. Colors: orange = Ishak 0, yellow = Ishak 1, green = Ishak 2, light blue = Ishak 3, red = Ishak 4, blue = Ishak 5, purple = Ishak 6. The MRI scans were automatically windowed to display the intensity values between the 0.1 and 99.9 percentiles. It can be seen that the network mainly assigns the values 0, 2, and 6. Value 4 was not assigned, possibly because it was omitted due to its infrequent occurrence during training and optimization.
Figure 6Confusion Matrix. Voxel-level prediction results (proportion) aggregated over all test cases. The number of examples per class is depicted in parentheses.
Figure 7ROC curves. Receiver operating characteristic curves for grouped Ishak scores as well as averaged results (micro-average). Cases were grouped as shown in the plot titles.
Comparison of different non-invasive methods to predict liver fibrosis. Comparison of different non-invasive methods to predict liver fibrosis taken from the literature as well as results for the proposed model.
|
| Compared to | AUC (95% CI) | |
|---|---|---|---|
| Magnetic Resonance Elastography | |||
| Bohte et al., 2014 [ | METAVIR | F ≥ 2, 0.909 (0.840, 0.977); | |
| Huwart et al., 2008 [ | METAVIR | F ≥ 1, 0.962 (0.929, 0.995); | |
| Ultrasound-based Transient Elastography | |||
| Bohte et al., 2014 [ | METAVIR | F ≥ 2, 0.914 (0.857, 0.972); | |
| Huwart et al., 2008 [ | METAVIR | F ≥ 1, 0.803 (0.701, 0.904); | |
| Uptake of Gd-EOB-DTPA in the HBP | |||
| Verloh et al., 2015 [ | Ishak | Ishak ≥ 1, 0.94 (0.90, 1.00); | |
| Haimerl et al., 2017 [ | METAVIR | F ≥ 1, 1.00 (1.00, 1.00); | |
| 2D U-Net | |||
| Proposed (test data) | Ishak | Ishak ≥ 1, 0.729 (0.59, 0.86); | |