| Literature DB >> 35732795 |
Joshua R Astley1,2, Alberto M Biancardi2, Paul J C Hughes2, Helen Marshall2, Laurie J Smith2, Guilhem J Collier2, James A Eaden2, Nicholas D Weatherley2, Matthew Q Hatton1, Jim M Wild2,3, Bilal A Tahir4,5,6.
Abstract
Respiratory diseases are leading causes of mortality and morbidity worldwide. Pulmonary imaging is an essential component of the diagnosis, treatment planning, monitoring, and treatment assessment of respiratory diseases. Insights into numerous pulmonary pathologies can be gleaned from functional lung MRI techniques. These include hyperpolarized gas ventilation MRI, which enables visualization and quantification of regional lung ventilation with high spatial resolution. Segmentation of the ventilated lung is required to calculate clinically relevant biomarkers. Recent research in deep learning (DL) has shown promising results for numerous segmentation problems. Here, we evaluate several 3D convolutional neural networks to segment ventilated lung regions on hyperpolarized gas MRI scans. The dataset consists of 759 helium-3 (3He) or xenon-129 (129Xe) volumetric scans and corresponding expert segmentations from 341 healthy subjects and patients with a wide range of pathologies. We evaluated segmentation performance for several DL experimental methods via overlap, distance and error metrics and compared them to conventional segmentation methods, namely, spatial fuzzy c-means (SFCM) and K-means clustering. We observed that training on combined 3He and 129Xe MRI scans using a 3D nn-UNet outperformed other DL methods, achieving a mean ± SD Dice coefficient of 0.963 ± 0.018, average boundary Hausdorff distance of 1.505 ± 0.969 mm, Hausdorff 95th percentile of 5.754 ± 6.621 mm and relative error of 0.075 ± 0.039. Moreover, limited differences in performance were observed between 129Xe and 3He scans in the testing set. Combined training on 129Xe and 3He yielded statistically significant improvements over the conventional methods (p < 0.0001). In addition, we observed very strong correlation and agreement between DL and expert segmentations, with Pearson correlation of 0.99 (p < 0.0001) and Bland-Altman bias of - 0.8%. The DL approach evaluated provides accurate, robust and rapid segmentations of ventilated lung regions and successfully excludes non-lung regions such as the airways and artefacts. This approach is expected to eliminate the need for, or significantly reduce, subsequent time-consuming manual editing.Entities:
Mesh:
Year: 2022 PMID: 35732795 PMCID: PMC9217976 DOI: 10.1038/s41598-022-14672-2
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.996
Summary of demographics, clinical characteristics and image dataset information stratified by disease.
| Disease | Total number of scans | Number of patients | Number of HP gas scans | Sex* | Median (range) age* | Mean ± SD ventilated lung volume (liters)* | ||
|---|---|---|---|---|---|---|---|---|
| 3He | 129Xe | Male | Female | |||||
| Healthy | 43 | 33 | 1 | 42 | 15 | 13 | 12 (9, 76) | 3.78 ± 1.18 |
| Asthma | 169 | 81 | 4 | 165 | 28 | 52 | 50 (13, 73) | 4.23 ± 1.03 |
| Asthma/COPD overlap | 11 | 5 | 0 | 11 | 0 | 5 | 56 (45, 67) | 4.13 ± 0.68 |
| Bronchiectasis | 3 | 3 | 1 | 2 | 1 | 1 | 15 (9, 29) | 3.76 ± 1.00 |
| CF | 247 | 58 | 134 | 113 | 29 | 28 | 16 (6, 48) | 3.65 ± 1.05 |
| COPD | 62 | 23 | 56 | 6 | 4 | 5 | 64 (52, 80) | 4.43 ± 0.71 |
| Non-IPF ILD** | 77 | 41 | 0 | 77 | 25 | 16 | 69 (39, 83) | 3.78 ± 0.80 |
| Investigation for possible airways disease | 38 | 21 | 5 | 33 | 2 | 16 | 49 (36, 69) | 3.89 ± 1.05 |
| IPF | 46 | 20 | 45 | 1 | 17 | 3 | 72 (52, 80) | 3.87 ± 0.71 |
| Lung cancer | 22 | 16 | 14 | 8 | 10 | 6 | 69 (34, 85) | 4.12 ± 0.86 |
| Preterm birth | 41 | 40 | 4 | 37 | 15 | 25 | 12 (9, 14) | 2.75 ± 0.55 |
HP hyperpolarized, CF cystic fibrosis, COPD chronic obstructive pulmonary disease, ILD interstitial lung disease, IPF idiopathic pulmonary fibrosis, SD standard deviation.
*Data for 25 patients was unavailable.
**Contains connective tissue disease-associated ILD (CTD-ILD), hypersensitivity pneumonitis and drug-induced ILD (DI-ILD).
Figure 1Visual representation of the modified 3D nn-UNet network used in this work. The deconvolution side of the network is omitted as it follows the same structure as the convolutional path, however, with the addition of a 1 × 1 × 1 SoftMax layer.
Figure 2Example coronal slices for a healthy subject and four cases with different pathologies for each DL experimental method. Individual, and median (range), DSC values are displayed.
Comparison of segmentation performance for the five DL training methods for all scans in the testing set.
| Experimental DL methods | Evaluation metrics: median (range) | |||
|---|---|---|---|---|
| DSC | Avg HD (mm) | HD95 (mm) | XOR | |
| Train on 3He | 0.961 (0.765, 0.981) | 2.335 (35.91, 0.644) | 10.00 (140.9, 1.934) | 0.079 (0.613, 0.037) |
| Train on 129Xe | 0.964 (0.886, 0.983) | 1.341 (3.911, 0.675) | 4.809 (15.90, 1.875) | 0.072 (0.253, 0.035) |
| Train on 3He, fine-tuned on 129Xe | 0.963 (0.892, 0.983) | 1.384 (4.628, 0.636) | 4.971 (29.80, 1.934) | 0.075 (0.238, 0.034) |
| Train on 129Xe, fine-tuned on 3He | 0.968 (0.842, 0.983) | 1.483 (10.84, 0.596) | 4.935 (67.85, 1.563) | 0.066 (0.372, 0.034 |
| Combined 3He and 129Xe training | ||||
Medians (ranges) are given; the best result for each metric is in bold.
Figure 3Comparison of segmentation performance on 86 testing scans for five DL experimental methods using the DSC, Avg HD, HD95 and XOR metrics (left to right). P-values are displayed for Friedman tests with Bonferroni correction for multiple comparisons, comparing the combined 3He and 129Xe DL method to the other DL methods. Mean and standard deviation values are marked by a bold line and whiskers, respectively.
Figure 4Comparison of DSC (top) and Avg HD (bottom) values for 129Xe and 3He testing scans for five DL methods. P-values between 129Xe and 3He using Mann–Whitney tests are shown. Mean and standard deviation values are marked by a bold line and whiskers, respectively.
Figure 5Pearson correlation and Bland–Altman analysis of lung volumes for 86 testing set cases compared to volumes derived from expert segmentations for the combined 3He and 129Xe DL model.
Figure 6Comparison of performance on testing scans between the combined 129Xe and 3He DL method and conventional segmentation methods (SFCM and K-means) with P-values for Friedman tests with Bonferroni correction for multiple comparisons. Mean and standard deviation values are marked by a bold line and whiskers, respectively. Individual DSC and Avg HD values for each method are displayed for three cases.