| Literature DB >> 36071092 |
Scott D Tagliaferri1, Tim Wilkin2, Maia Angelova2, Bernadette M Fitzgibbon3,4, Patrick J Owen5, Clint T Miller5, Daniel L Belavy5,6.
Abstract
Chronic back pain (CBP) is heterogenous and identifying sub-groups could improve clinical decision making. Machine learning can build upon prior sub-grouping approaches by using a data-driven approach to overcome clinician subjectivity, however, only binary classification of pain versus no-pain has been attempted to date. In our cross-sectional study, age- and sex-matched participants with CBP (n = 4156) and pain-free controls (n = 14,927) from the UkBioBank were included. We included variables of body mass index, depression, loneliness/social isolation, grip strength, brain grey matter volumes and functional connectivity. We used fuzzy c-means clustering to derive CBP sub-groups and Support Vector Machine (SVM), Naïve Bayes, k-Nearest Neighbour (kNN) and Random Forest classifiers to determine classification accuracy. We showed that two variables (loneliness/social isolation and depression) and five clusters were optimal for creating sub-groups of CBP individuals. Classification accuracy was greater than 95% for when CBP sub-groups were assessed only, while misclassification in CBP sub-groups increased to 35-53% across classifiers when pain-free controls were added. We showed that individuals with CBP could sub-grouped and accurately classified. Future research should optimise variables by including specific spinal, psychosocial and nervous system measures associated with CBP to create more robust sub-groups that are discernible from pain-free controls.Entities:
Mesh:
Year: 2022 PMID: 36071092 PMCID: PMC9452567 DOI: 10.1038/s41598-022-19542-5
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.996
Demographics.
| Pain-free controls (n = 15,284) | Chronic back pain (n = 5068) | |
|---|---|---|
| Mean (SD) | Mean (SD) | |
| Age | 63.5 (7.6) | 63.5 (7.7) |
| Sex (male/female—n) | 6772/8493 | 2258/2834 |
| Height (cm) | 169.8 (9.3) | 169.8 (9.6) |
| Weight (kg) | 74.1 (14.3) | 78.2 (16.1) |
| College/University | 8038 (53) | 2138 (42) |
| A/AS | 1925 (13) | 580 (11) |
| O/GCSE | 2586 (17) | 997 (20) |
| CSE | 464 (3) | 264 (5) |
| NVQ or HND or HNC | 755 (5) | 351 (7) |
| Other qualification | 728 (5) | 324 (6) |
| None of the above/did not answer | 788 (5) | 442 (9) |
| Paid employment or self-employed | 6126 (40) | 1920 (38) |
| Retired | 8655 (57) | 2889 (57) |
| Looking after home or family | 214 (1) | 95 (2) |
| Unable to work from sickness or disability | 22 (0.1) | 88 (2) |
| Unemployed | 60 (0.4) | 33 (1) |
| Unpaid or voluntary work | 115 (0.8) | 38 (1) |
| Student | 19 (0.1) | 5 (0.1) |
| None of the above/did not answer | 73 (0.5) | 28 (0.6) |
Data are presented as mean (SD) unless specified.
Figure 1Flow diagram of participant selection for this UKBioBank sub-study.
Figure 2(a) Heat Map and (b) Scatter plot of the distribution of sub-groups of individuals with CBP based on symptoms of depression and loneliness/social isolation. Data are at discrete intervals due to outcomes being questionnaire based. Data is presented on (a) normal item range and (b) normalised scale of 0–1. Higher scores indicate greater symptoms of depression or loneliness/social isolation. Based on the centroids of fuzzy c-means clustering, classes and colours are (1; blue) low social isolation and loneliness and moderate depressive symptoms (n = 776; 18.7%), (2; red) low loneliness/social isolation and depressive symptoms (n = 2296; 55.3%), (3; yellow) high loneliness/social isolation and moderate depressive symptoms (n = 185; 4.5%), (4; green) moderate loneliness/social isolation and high depressive symptoms (n = 297; 7.2%) and (5; pink) moderate loneliness/social isolation and low depressive symptoms (n = 602; 14.5%). Black squares on the heat map indicate no class was available at those values. The X value on the scatter plot indicates the centroid of that cluster.
Figure 3Confusion matrix of classifiers on chronic back pain classes (no pain-free controls included) with (a) Support Vector Machine, (b) Naïve Bayes, (c) k-Nearest Neighbour and (d) Random Forest classifiers. The x-axis is the predicted class while the y-axis is the true class. Blue squares indicate the number in the class that was accurately classified, while the oranges squares show the number of misclassifications. The boxes on the right of the matrix show the percentage of classification (blue) and misclassification (orange) for the class. Classes are (1) low social isolation and loneliness and moderate depressive symptoms (n = 776; 18.7%), (2) low loneliness/social isolation and depressive symptoms (n = 2296; 55.3%), (3) high loneliness/social isolation and moderate depressive symptoms (n = 185; 4.5%), (4) moderate loneliness/social isolation and high depressive symptoms (n = 297; 7.2%) and (5) moderate loneliness/social isolation and low depressive symptoms (n = 602; 14.5%).
Figure 4Confusion matrix of classifiers on chronic back pain and pain-free classes with (a) Support Vector Machine, (b) Naïve Bayes, (c) k-Nearest Neighbour and (d) Random Forest classifiers. The x-axis is the predicted class while the y-axis is the true class. Blue squares indicate the number in the class that was accurately classified, while the oranges squares show the number of misclassifications. The boxes on the right of the matrix show the percentage of classification (blue) and misclassification (orange) for the class. Classes are (0) pain-free individuals (n = 14,927; 78.2%), (1) low social isolation/loneliness and moderate depressive symptoms (n = 776; 4.1%), (2) low loneliness/social isolation and depressive symptoms (n = 2296; 12.0%), (3) high loneliness/social isolation and moderate depressive symptoms (n = 185; 1.0%), (4) moderate loneliness/social isolation and high depressive symptoms (n = 297; 1.6%) and (5) moderate loneliness/social isolation and low depressive symptoms (n = 602; 3.2%).
Results of the Analysis of Covariance (ANCOVA) for sub-groups derived from psychosocial variables.
| Pain-free controlsa | CBP sub-group 1b | CBP sub-group 2c | CBP sub-group 3d | CBP sub-group 4e | CBP sub-group 5f | ||||||
|---|---|---|---|---|---|---|---|---|---|---|---|
| Mean (SE) | Mean (SE) | Mean (SE) | Mean (SE) | Mean (SE) | Mean (SE) | ||||||
| Depressive symptoms (4–16) | 4.9 (0.01) | 7.2 (0.05) | 4.6 (0.03) | 8.0 (0.10) | 11.3 (0.08) | 5.6 (0.05) | |||||
| Versus PF | 2.3 (2.2, 2.5) 1.87 (1.79, 1.94) | − 0.2 (− 0.3, − 0.1) − 0.24 (− 0.28, − 0.20) | 3.2 (2.9, 3.5) 2.52 (2.39, 2.68) | 6.4 (6.2, 6.6) 5.22 (5.10, 5.35) | 0.7 (0.6, 0.9) 0.57 (0.49, 0.65) | ||||||
| Loneliness/isolation (0–5) | 0.72 (0.01) | 0.65 (0.03) | 0.37 (0.02) | 3.4 (0.06) | 1.6 (0.05) | 2.2 (0.03) | |||||
| Versus PF | − 0.06 (− 0.14, 0.02) − 0.06 (− 0.13, 0.01) | 0.300 | − 0.34 (− 0.39, − 0.29) − 0.29 (− 0.34, − 0.25) | 2.73 (2.56, 2.90) 2.20 (2.05, 2.35) | 0.90 (0.76, 1.03) 0.72 (0.61, 0.84) | 1.44 (1.34, 1.53) 1.23 (1.14, 1.31) | |||||
Data are presented as adjusted mean (standard error). Adjusted mean differences and 95% confidence interval (95 CI) are reported. Effect sizes and 95% confidence intervals were calculated using standard formulae. All data were adjusted for age and sex. p value s were adjusted for multiple comparisons using Tukey HSD method.
PF pain-free, 95 CI 95% confidence interval, ES effect size.
Significant values are in [bold].
aPain-free individuals (n = 14,927; 78.2%).
bCBP sub-group 1: low social isolation/loneliness and moderate depressive symptoms (n = 776; 4.1%).
cCBP sub-group 2: low loneliness/social isolation and depressive symptoms (n = 2296; 12.0%).
dCBP sub-group 3: high loneliness/social isolation and moderate depressive symptoms (n = 185; 1.0%).
eCBP sub-group 4: moderate loneliness/social isolation and high depressive symptoms (n = 297; 1.6%).
fCBP sub-group 5: moderate loneliness/social isolation and low depressive symptoms (n = 603; 3.2%).
Results of the Analysis of Covariance (ANCOVA) sub-groups derived from physical variables of body mass index and grip strength.
| Pain-free controlsa | CBP sub-group 1b | CBP sub-group 2c | CBP sub-group 3d | ||||
|---|---|---|---|---|---|---|---|
| Mean (SE) | Mean (SE) | Mean (SE) | Mean (SE) | ||||
| Body mass index (kg/m2) | 25.9 (0.03) | 34.4 (0.1) | 24.8 (0.1) | 26.6 (0.1) | |||
| Versus PF | 8.5 (8.1, 8.8) 2.34 (2.27, 2.41) | − 1.1 (− 1.3, − 0.8) − 0.29 (− 0.34, − 0.25) | 0.7 (0.4, 1.0) 0.19 (0.14, 0.25) | ||||
| Grip strength (kg) | 31.9 (0.1) | 29.0 (0.2) | 28.9 (0.2) | 36.8 (0.2) | |||
| Versus PF | − 2.9 (− 3.5, − 2.3) − 0.24 (− 0.31, − 0.17) | − 3.0 (− 3.4, − 2.6) − 0.25 (− 0.30, − 0.21) | 4.8 (4.3, 5.3) 0.41 (0.36, 0.47) | ||||
Data are presented as adjusted mean (standard error). Adjusted mean differences and 95% confidence interval (95 CI) are reported. Effect sizes and 95% confidence intervals were calculated using standard formulae. All data were adjusted for age and sex. p value s were adjusted for multiple comparisons using Tukey HSD method.
PF pain-free, 95 CI 95% confidence interval, ES effect size.
Significant values are in [bold].
aPain-free individuals (n = 14,927; 78.2%).
bCBP sub-group 1: higher body mass index and low grip strength (n = 870; 4.6%).
cCBP sub-group 2: lower body mass index and grip strength (n = 1933; 10.1%);
dCBP sub-group 3: lower body mass index and higher grip strength (n = 1353; 7.1%).
Results of the Analysis of Covariance (ANCOVA) sub-groups derived from brain function variables.
| Pain-free controlsa | CBP sub-group 1b | CBP sub-group 2c | CBP sub-group 3d | ||||
|---|---|---|---|---|---|---|---|
| Mean (SE) | Mean (SE) | Mean (SE) | Mean (SE) | ||||
| FPN-DMN connectivity (z) | 1.33 (0.01) | 2.20 (0.02) | 0.91 (0.02) | 0.81 (0.02) | |||
| Versus PF | 0.87 (0.81, 0.93) 0.73 (0.66, 0.80) | − 0.43 (− 0.49, − 0.37) − 0.35 (− 0.40, − 0.31) | − 0.53 (− 0.59, − 0.47) − 0.44 (− 0.49, − 0.38) | ||||
| FPN-SMN connectivity (z) | 0.70 (0.01) | 0.56 (0.02) | 1.44 (0.02) | − 0.04 (0.02) | |||
| Versus PF | − 0.13 (− 0.19, − 0.08) − 0.12 (− 0.19, − 0.05) | 0.74 (0.68, 0.80) 0.62 (0.58, 0.67) | − 0.74 (− 0.79, − 0.68) − 0.62 (− 0.68, − 0.57) | ||||
| VN-DMN connectivity (z) | 0.40 (0.01) | 0.49 (0.02) | 0.41 (0.02) | 0.49 (0.02) | |||
| Versus PF | 0.09 (0.03, 0.16) 0.08 (0.02, 0.13) | 0.02 (− 0.05, 0.08) 0.01 (− 0.15, 0.06) | 0.927 | 0.09 (0.03, 0.15) 0.08 (0.02, 0.13) | |||
Data are presented as adjusted mean (standard error). Adjusted mean differences and 95% confidence interval (95 CI) are reported. Effect sizes and 95% confidence intervals were calculated using standard formulae. All data were adjusted for age and sex. p value s were adjusted for multiple comparisons using Tukey HSD method.
PF pain-free, 95 CI 95% confidence interval, ES effect size, FPN frontoparietal network, DMN default mode network, SMN sensorimotor network, VN visual network.
Significant values are in [bold].
aPain-free individuals (n = 14,927; 78.2%).
bCBP sub-group 1: high FPN-DMN connectivity (n = 1347; 7.1%).
cCBP sub-group 2: high FPN-SMN connectivity (n = 1335; 7.0%);
dCBP sub-group 3: low FPN-DMN and FPN-SMN network connectivity (n = 1474; 7.7%).
Results of the Analysis of Covariance (ANCOVA) sub-groups derived from brain structure variables.
| Pain-free controlsa | CBP sub-group 1b | CBP sub-group 2c | |||
|---|---|---|---|---|---|
| Mean (SE) | Mean (SE) | Mean (SE) | |||
| Fronto-orbital cortex volume (mm3) | 12,635.6 (10.4) | 13,400.9 (29.5) | 11,836.2 (27.2) | ||
| Versus PF | 765.3 (692.0, 838.7) 0.60 (0.55, 0.65) | − 799.4 (− 867.7, − 731.2) − 0.63 (− 0.67, − 0.58) | |||
| Primary motor cortex volume (mm3) | 27,515.4 (23.3) | 29,001.5 (65.9) | 25,813.4 (60.7) | ||
| Versus PF | 1486.1 (1322.3, 1649.8) 0.52 (0.47, 0.57) | − 3188.0 (− 3400.5, − 2975.6) − 0.60 (− 0.64, − 0.55) | |||
| Primary somatosensory cortex volume (mm3) | 21,706.0 (20.2) | 22,449.1 (57.2) | 20,551.8 (52.7) | ||
| Versus PF | 743.1 (601.0, 885.2) 0.30 (0.25, 0.35) | − 1154.3 (− 1286.5, − 1022.0) − 0.47 (− 0.51, − 0.42) | |||
Data are presented as adjusted mean (standard error). Adjusted mean differences and 95% confidence interval (95 CI) are reported. Effect sizes and 95% confidence intervals were calculated using standard formulae. All data were adjusted for age and sex. p value s were adjusted for multiple comparisons using Tukey HSD method.
PF pain-free, 95 CI 95% confidence interval, ES effect size.
Significant values are in [bold].
aPain-free individuals (n = 14,927; 78.2%).
bCBP sub-group 1: high fronto-orbital and primary motor cortex volumes (n = 1917; 10.1%);
cCBP sub-group 2: low fronto-orbital and primary motor cortex volumes (n = 2239; 11.7%).