| Literature DB >> 31138878 |
Kenneth A Weber1, Andrew C Smith2, Marie Wasielewski3, Kamran Eghtesad4, Pranav A Upadhyayula4, Max Wintermark5, Trevor J Hastie6, Todd B Parrish7, Sean Mackey4, James M Elliott3,8,9.
Abstract
Muscle fat infiltration (MFI) of the deep cervical spine extensors has been observed in cervical spine conditions using time-consuming and rater-dependent manual techniques. Deep learning convolutional neural network (CNN) models have demonstrated state-of-the-art performance in segmentation tasks. Here, we train and test a CNN for muscle segmentation and automatic MFI calculation using high-resolution fat-water images from 39 participants (26 female, average = 31.7 ± 9.3 years) 3 months post whiplash injury. First, we demonstrate high test reliability and accuracy of the CNN compared to manual segmentation. Then we explore the relationships between CNN muscle volume, CNN MFI, and clinical measures of pain and neck-related disability. Across all participants, we demonstrate that CNN muscle volume was negatively correlated to pain (R = -0.415, p = 0.006) and disability (R = -0.286, p = 0.045), while CNN MFI tended to be positively correlated to disability (R = 0.214, p = 0.105). Additionally, CNN MFI was higher in participants with persisting pain and disability (p = 0.049). Overall, CNN's may improve the efficiency and objectivity of muscle measures allowing for the quantitative monitoring of muscle properties in disorders of and beyond the cervical spine.Entities:
Mesh:
Substances:
Year: 2019 PMID: 31138878 PMCID: PMC6538618 DOI: 10.1038/s41598-019-44416-8
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Figure 1Convolutional neural network (CNN) segmentation results of the deep cervical extensors. CNN segmentation masks of the left (green) and right (magenta) deep cervical extensors (i.e., multifidus and semispinalis cervicis) are shown from five randomly selected testing datasets. Example axial images at the C3 to C7 vertebral levels were selected to show changes in the deep extensor muscle morphometry across the cervical spine. For comparison, the segmentation masks from each rater are also shown (rows 2–4). The bottom two rows show the water-only and fat-only images for reference. For each example, the average DICE between the CNN and each rater is reported for the left and right masks. The C3 vertebral level is from the inferior portion of the C3 vertebra. L = left, R = right.
Segmentation Performance Metrics of the CNN on the Testing Dataset (n = 14).
| Performance Metric | Left | Right |
|---|---|---|
| Sørensen–Dice Index | 0.862 ± 0.017 | 0.871 ± 0.016 |
| Jaccard Index | 0.758 ± 0.026 | 0.772 ± 0.026 |
| Conformity Coefficient | 0.678 ± 0.046 | 0.703 ± 0.044 |
| True Positive Rate | 0.904 ± 0.021 | 0.908 ± 0.017 |
| True Negative Rate | 0.999 ± < 0.001 | 0.999 ± < 0.001 |
| Positive Predictive Value | 0.829 ± 0.031 | 0.843 ± 0.032 |
| Volume Ratio | 1.100 ± 0.058 | 1.087 ± 0.058 |
Metrics shown = average ± 1 standard deviation.
Figure 2Reliability and accuracy of the convolutional neural network (CNN) segmentation on the testing dataset (n = 14). Correlation and Bland-Altman plots are shown for the left and right deep cervical extensor muscle volumes and muscle fat infiltration (MFI). The average measures of the three raters were used as the ground truth (GT). (A,C) The dashed black line represents the best fit line. The linear regression coefficient (β) of GT on CNN (intercept = 0) is also provided, which can be used to correct the CNN measurement bias. (B,D) The dashed black and gray lines indicate the average difference (i.e., bias) ± 1.96 × standard deviation (i.e., 95% limits of agreement).
Figure 3Associations between convolutional neural network (CNN) muscle volume and muscle fat infiltration (MFI) and the clinical measures of pain and neck-related disability. Pain and neck-related disability were assessed using the numerical pain rating scale (NPRS) and the neck disability index (NDI), respectively. (A) Muscle volume was significantly negatively correlated with both pain and neck-related disability. A non-significant positive correlation between MFI and neck-related disability was present but not between MFI and pain. (B) The dataset was then split into groups of recovered (NDI ≤ 28, n = 19) versus persisting (NDI > 28, n = 20) whiplash using the NDI at 3 months post motor vehicle collision. The persisting group had significantly higher pain and neck-related disability compared to the recovered group. MFI was significantly higher in the persisting group compared to the recovered group. Muscle volume and MFI were corrected for age, gender, and body mass index. Error bars = 1 standard deviation. *p < 0.05, ***p < 0.001.
CNN Muscle and Clinical Measures for Recovered and Persisting Whiplash.
| Measure | Recovered (n = 19) | Persisting (n = 20) | P-value |
|---|---|---|---|
| NPRS | 1.7 ± 2.1 | 4.2 ± 2.5 | < 0.001 |
| NDI | 6.9 ± 7.9 | 30.6 ± 15.2 | < 0.001 |
| Muscle Volume (ml) | 37.9 ± 4.7 | 36.3 ± 5.3 | 0.154 |
| MFI (%) | 19.1 ± 3.9 | 21.2 ± 3.8 | 0.049 |
Recovery from whiplash was defined as an NDI ≤ 28 at 3 months post motor vehicle collision. NPRS = numerical pain rating scale, NDI = neck disability index. Metrics shown = average ± 1 standard deviation. Muscle volume and muscle fat infiltration (MFI) were corrected for age, gender, and body mass index. P-value based on one-tailed independent samples t-tests.
Segmentation Performance Metrics.
| Metric | Equation | Range | Meaning |
|---|---|---|---|
Sørensen-Dice Index (DICE) |
| 0–1 | Spatial overlap between masks |
| Jaccard Index |
| 0–1 | Spatial overlap between masks |
| Conformity Coefficient |
| ≤ 1 | Ratio of incorrectly and correctly segmented voxels |
True Positive Rate (TPR) |
| 0–1 | Sensitivity |
True Negative Rate (TNR) |
| 0–1 | Specificity |
Positive Predictive Value (PPV) |
| 0–1 | Precision |
| Volume Ratio |
| ≥ 0 | Ratio of mask volumes |
SM = segmentation mask; GT = ground truth mask; TP = true positive, voxels correctly segmented as deep cervical extensor muscle; TN = true negative, voxels correctly segmented as background; FP = false positive, voxels incorrectly segmented as deep cervical extensor muscle; FN = false negative, voxels incorrectly segmented as background. The masks from each of the three raters were used as the GT for the performance metrics.