Literature DB >> 34588829

Unsupervised Machine Learning-Based Analysis of Clinical Features, Bone Mineral Density Features and Medical Care Costs of Rotator Cuff Tears.

Tong-Fu Wang1, De-Sheng Chen1, Jia-Wang Zhu1, Bo Zhu1, Zeng-Liang Wang1, Jian-Gang Cao1, Cai-Hong Feng1, Jun-Wei Zhao1.   

Abstract

PURPOSE: We aim to present unsupervised machine learning-based analysis of clinical features, bone mineral density (BMD) features, and medical care costs of Rotator cuff tears (RCT). PATIENTS AND METHODS: Fifty-three patients with RCT were reviewed, the clinical features, BMD features, and medical care costs were collected and analyzed by descriptive statistics. Furtherly, unsupervised machine learning (UML) algorithm was used for dimensionality reduction and cluster analysis of the RCT data.
RESULTS: There were 26 males and 27 females. The patients were divided into four subgroups using the UML algorithm. There were significant differences among four subgroups regarding trauma exposure, full-thickness supraspinatus tendon tears, infraspinatus tendon tear, subscapularis tendon tear, BMD distribution, medial row anchors, lateral row anchors, total medical care costs, and consumables costs. We observed the highest frequency of trauma exposure, infraspinatus tendon tear, subscapularis tendon tear, osteoporosis, the highest number of medial row anchors, lateral row anchors, total medical care costs, and consumables costs in subgroup II.
CONCLUSION: The unsupervised machine learning-based analysis of RCT can provide clinically meaningful classification, which shows good interpretability and contribute to a better understanding of RCT. The significance of the results is limited due to the small number of samples, a larger follow-up study is needed to confirm the encouraging results.
© 2021 Wang et al.

Entities:  

Keywords:  bone mineral density; clinical features; medical care costs; rotator cuff tears; unsupervised machine learning

Year:  2021        PMID: 34588829      PMCID: PMC8472212          DOI: 10.2147/RMHP.S330555

Source DB:  PubMed          Journal:  Risk Manag Healthc Policy        ISSN: 1179-1594


Introduction

Rotator cuff tears (RCT) are a common cause of shoulder pain.1–3 The prevalence of RCT is 34% in the general population and 54% in people over 60 years old.4 There are more than 4.5 million hospital visits for RCT in the United States each year.5 The demographic and professional features of RCT patients have been investigated.6–10 In a study conducted by Yamamoto et al, shoulder ultrasound in 683 patients was analyzed, they found that the incidence of RCT increased with age, the risk factors for RCT included age, history of trauma, and the dominant arm.6 In the study conducted by Josserand et al, 254 patients with RCT were reviewed, they found that 69% of patients had right-side RCT, 6.1% patients involved in non-manual work, 25.5% patients involved in manual labor, and 68.3% patients involved in heavy manual labor.7 Although remarkable achievements in RCT have been obtained, some questions remain to be answered. Rotator cuff tendon-bone healing is an essential component for RCT therapy, however, few studies have investigated the relationship between bone mineral density (BMD) and RCT.9–11 Meanwhile, there are few studies that involved medical care costs of RCT.12,13 Furtherly, in the era of big data, the requirements for efficient and intelligent processing of massive RCT data are continuously increasing.14,15 An approach for big data analysis is machine learning (ML). ML is commonly divided into supervised machine learning and unsupervised machine learning (UML).16–18 In supervised machine learning algorithm, training datasets are labeled, and the algorithm learns from the labeled dataset and predicts the unlabeled dataset.18–20 In contrast, the UML algorithm does not require a labeled dataset and tries to work on its own to find information.18,21,22 Currently, more and more studies focus on the application of the UML algorithm in healthcare big data analysis.23–26 In this study, we aim to present conventional descriptive analysis and novel unsupervised machine learning-based analysis of clinical features, BMD features, and medical care costs of RCT.

Materials and Methods

Participants

53 patients with RCT were reviewed. The inclusion criteria were: (1) diagnosis was confirmed by arthroscopy; (2) first surgery for RCT; (3) retrospective study. The exclusion criteria were: (1) simultaneous bilateral shoulder surgery; (2) revision surgery; (3) follow-up data missing. The study was approved by the ethical committee of Tianjin Hospital of Tianjin University and followed the Declaration of Helsinki, the informed consent was obtained from all patients.

Data Collection

16 clinical features including gender, age, marriage, trauma exposure were collected. 18 BMD features including BMD values, T-score, and Z-score of the lumbar spine were collected. 6 medical care costs indicators including total medical care costs, consumables costs, and surgical costs were collected for statistical analysis.

UML Algorithm

The UML algorithm was performed on preprocessed input dataset employing principal component analysis (PCA). The input dataset was defined as a collection of clinical features, BMD features, and medical care costs indicators. The PCA was performed to project the input dataset into a two-dimensional data space. Then, the UML clustering algorithm was applied to identify subgroups, in which the K-means clustering algorithm was used and the best value of k was determined automatically. First, we use the UML algorithm which has been validated and extensively tested in other diseases to convert the input dataset into a compact representation space.25,27,28

Statistical Analysis

The UML algorithms and the statistical analysis were implemented in Python (Python Software Foundation). Descriptive analysis was reported as mean ± SD and frequency. Comparisons among sub-groups were analyzed by chi-square test or Fisher exact test for categorical variables, and ANOVA for continuous variables. A two-sided P value < 0.05 was considered significant.

Results

Descriptive Analysis of RCT

There were 26 males and 27 females with a mean age of 55.5 ± 8.6 years. The clinical features of RCT are shown in Table 1 and Figure 1.
Table 1

5 Clinical Features of Patients

Clinical FeaturesValue
Age (years)55.47 ± 8.6
Duration of symptoms (months)9.86 ± 22.82
Medial row anchors2.13 ± 0.81
Lateral row anchors1.51 ± 0.54
Length of hospitalization (days)7.13 ± 1.69
Figure 1

11 clinical features and 1 BMD feature of patients.

5 Clinical Features of Patients 11 clinical features and 1 BMD feature of patients. The mean BMD values of the lumbar spine were 1.01 ± 0.19 g/cm2, T-score was −0.59 ± 1.69, and Z-score was 0.30 ± 1.61. The T-score of the proximal femur was −0.59 ± 1.69 and the hip T-score was −0.41 ± 0.96. There were 23 patients (43.40%) with osteopenia and 4 patients (7.55%) with osteoporosis. The BMD features of RCT were shown in –figure 3.
Figure 2

The BMD value and Z-score of patients.

Figure 3

The T-score of patients.

The BMD value and Z-score of patients. The T-score of patients. The total medical care costs were 58,612.60 ± 9,823.01 yuan, the consumables costs were 38,557.87 ± 9,253.44 yuan (52.69%), the surgical costs were 7,278.49 ± 227.09 yuan (9.95%), the drug costs were 3,990.47 ± 812.69 yuan (5.45%), the costs of medical service were 2,330.49 ± 386.74 yuan (3.19%), and other costs were 21,012.25 ± 1,024.48 yuan (28.72%). The medical care costs of RCT are shown in Figure 4.
Figure 4

The distribution of medical care costs. The consumables costs contributed the highest proportion (52.69%).

The distribution of medical care costs. The consumables costs contributed the highest proportion (52.69%).

Clustering results of UML Algorithm

The input dataset was divided into four subgroups via the UML algorithm. As shown in Figure 5, there were 13 patients in subgroup I, 8 patients in subgroup II, 24 patients in subgroup III, and 8 patients in subgroup IV.
Figure 5

The input dataset was divided into four subgroups. (A) Dimensionality reduction of the input dataset into a two-dimensional data space using principal component analysis. (B) The k-means clustering algorithm was used to divide the dataset into 4 subgroups, with red dots representing subgroup I (n=13), blue dots representing subgroup II (n=8), green dots representing subgroup III (n=24), and yellow dots representing subgroup IV (n=8).

The input dataset was divided into four subgroups. (A) Dimensionality reduction of the input dataset into a two-dimensional data space using principal component analysis. (B) The k-means clustering algorithm was used to divide the dataset into 4 subgroups, with red dots representing subgroup I (n=13), blue dots representing subgroup II (n=8), green dots representing subgroup III (n=24), and yellow dots representing subgroup IV (n=8).

Comparisons Among Subgroups

The mean age was 52.0 ± 8.3 years in subgroup I, 55.8 ± 9.8 years in subgroup II, 56.8 ± 9.1years in subgroup III, and 57.0 ± 5.2 years in subgroup IV. There was no significant difference in age among the four subgroups (P = 0.412). There were 4 males (30.8%) in subgroup I, 7 males (87.5%) in subgroup II, 12 males (50.0%) in subgroup III, 3 males (37.5%) in subgroup IV. There was no significant difference in gender among four subgroups (P = 0.074). As shown in Figure 6, there were significant differences among four subgroups regarding trauma exposure (P = 0.035), full-thickness (FT) supraspinatus (SSP) tendon tears (P = 0.015), infraspinatus (ISP) tendon tear (P < 0.001), and subscapularis (SCP) tendon tears (P = 0.049).
Figure 6

There were significant differences among four subgroups regarding trauma exposure, FT-SSP, ISP, SCP.

There were significant differences among four subgroups regarding trauma exposure, FT-SSP, ISP, SCP. As shown in Figure 7, there were significant differences among four subgroups in BMD distribution classified as normal bone mass, osteopenia, and osteoporosis (P = 0.044).
Figure 7

There were significant differences among four subgroups in BMD distribution.

There were significant differences among four subgroups in BMD distribution. As shown in Table 2, there were significant differences among four subgroups regarding medial row anchors (P < 0.001), lateral row anchors (P < 0.001), total medical care costs (P < 0.001), and consumables costs (P < 0.001). There were no significant differences among sub-groups for other indicators.
Table 2

Comparisons Among Subgroups Regarding Medial Row Anchors, Lateral Row Anchors, Total Medical Care Costs, Consumables Costs

GroupMedial Row AnchorsLateral Row AnchorsTotal Medical Care Costs (Yuan)Consumables Costs (Yuan)
Subgroup I1.92 ±0.281.08±0.285.16±1.513.22±0.10
Subgroup II3.25 ±1.042.13±0.357.63±0.525.53 ±0.39
Subgroup III2.25 ±0.441.71±0.466.09±0.244.07 ±0.22
Subgroup IV1.00 ±0.001.00±0.004.53±0.142.57±0.17
F value26.7320.52212.02273.64
P value< 0.001< 0.001< 0.001< 0.001
Comparisons Among Subgroups Regarding Medial Row Anchors, Lateral Row Anchors, Total Medical Care Costs, Consumables Costs

Characteristics of Subgroups

We observed the highest frequency of trauma exposure, ISP ten-don tear, SCP tendon tear, osteoporosis in subgroup II, as well as, we observed the highest number of medial row anchors, lateral row anchors, total medical care costs, and consumables costs in subgroup II. Correspondingly, subgroup IV showed the lowest frequency of trauma exposure, FT-SSP tendon tears, the lowest number of medial row anchors, lateral row anchors, total medical care costs, consumables costs, and the highest frequency of osteopenia. There were no ISP ten-don tear, SCP tendon tear, and osteoporosis in subgroup I and sub-group IV (Figure 8).
Figure 8

Summary of clinical features, BMD features, and medical care costs among four subgroups.

Summary of clinical features, BMD features, and medical care costs among four subgroups.

Discussion

Rotator cuff tears (RCT) are a common cause of shoulder pain1–3. Yamaguchi et al pointed out the mean age was 58.7 years for unilateral RCT and 67.8 years for bilateral RCT8. In a study conducted by Andrew et al, 6653 individuals were reviewed, hypertension was involved in 48.3% of patients, diabetes mellitus was involved in 20.1% of patients, and dominant shoulder was involved in 50.4% of patients.29 In our study, the mean age of RCT patients was 55.5 ± 8.6 years, hypertension was involved in 32.08% of patients, diabetes mellitus was involved in 5.66% of patients, and dominant shoulder was involved in 58.49% of patients. RCT occurs as a result of a broken tendon-bone interface, which suggests that rotator cuff tendon-bone healing is an essential component for RCT therapy.30,31 BMD is an important factor linked with rotator cuff tendon-bone healing, recent studies have shown that the failure rate of rotator cuff healing was 7.25 times in patients with osteoporosis than patients with normal bone mass9. In our study, osteoporosis was involved in 7.55% of patients and all patients with osteoporosis were accompanied by FT-SSP tendon tears. In the costs analysis performed by Savoie et al, 50 patients with successful rotator cuff repair surgery were reviewed, the mean medical care costs were $50,302.25 if the patients were referred to a specialist immediately following the diagnosis of RCT, the mean medical care costs were $25,870.6412. In our study, the total medical care costs were 58,612.60 ± 9,823.01 yuan, the consumables costs were 38,557.87 ± 9,253.44 yuan with the highest proportion. Recent advances in UML have shown that it can help physicians to analyze massive information and is crucial in medical practice optimization.32–35 In the study of Baud et al, UML can provide a powerful approach for the automatic identification of recurrent pathological neurophysiological signals, which is important for the accurate, quantitative, and individualized evaluation of focal epilepsy.36 In a study conducted by Yang et al, the UML algorithm was used to reveal phenotypically distinct cell populations and determines whether these populations stratify patient survival with glioblastoma.33 In the study conducted by Cikes et al, 1106 patients with heart failure were reviewed, UML can provide a clinically meaningful classification and may help optimize response rates to specific treatments.25 In our study, 53 patients were reviewed and up to 40 indicators per patient were collected. Finally, the input dataset was divided into four subgroups. Subgroup II had the highest frequency of osteoporosis, ISP tendon tear, and SCP tendon tear, which indicated that decreased BMD downregulated the union strength of tendon-bone interface and decreased the threshold of ISP tendon tear and SCP tendon tear. Meanwhile, subgroup II had the highest number of medial row anchors and lateral row anchors, which could be due to ISP tendon tear and SCP tendon tear increased the area requiring repair, or due to more anchors were required to maintain the union strength of tendon-bone interface which downregulated by decreased BMD. Interestingly, Subgroup IV had the highest frequency of osteopenia, but there were no ISP tendon tear and SCP tendon tear in all patients, which indicated that the decreased BMD taking osteopenia as the lower limit did not reach the threshold of ISP and SCP tendon tear. Meanwhile, subgroup IV had the lowest number of medial row anchors and lateral row anchors, which indicated that the decreased BMD taking osteopenia as the lower limit did not downregulate the union strength of the tendon-bone interface. This study has some limitations: (1) It is a small retrospectively study; (2) There are few indicators in this study, furthermore indicators were needed for a systematic and comprehensive assessment of RCT; (3) The research is a single-center study, the results may be biased and further larger, multicenter research was needed to confirm our findings.

Conclusion

The unsupervised machine learning-based analysis of RCT can provide clinically meaningful classification, which shows good interpretability and contribute to a better understanding of RCT. The significance of the results is limited due to the small number of samples, a larger follow-up study is needed to confirm the encouraging results.
  36 in total

1.  The Association of Osteoporosis and Bisphosphonate Use With Revision Shoulder Surgery After Rotator Cuff Repair.

Authors:  Jourdan M Cancienne; Stephen F Brockmeier; Michelle E Kew; Matthew J Deasey; Brian C Werner
Journal:  Arthroscopy       Date:  2019-06-21       Impact factor: 4.772

2.  Automated detection of shadow artifacts in optical coherence tomography angiography.

Authors:  Acner Camino; Yali Jia; Jeffrey Yu; Jie Wang; Liang Liu; David Huang
Journal:  Biomed Opt Express       Date:  2019-02-28       Impact factor: 3.732

3.  Bioabsorbable tricalcium phosphate bone cement strengthens fixation of suture anchors.

Authors:  Rayshad Oshtory; Derek P Lindsey; Nicholas J Giori; Faisal M Mirza
Journal:  Clin Orthop Relat Res       Date:  2010-06-03       Impact factor: 4.176

4.  Unsupervised machine learning of radiomic features for predicting treatment response and overall survival of early stage non-small cell lung cancer patients treated with stereotactic body radiation therapy.

Authors:  Hongming Li; Maya Galperin-Aizenberg; Daniel Pryma; Charles B Simone; Yong Fan
Journal:  Radiother Oncol       Date:  2018-07-04       Impact factor: 6.280

5.  Unsupervised Learning of Spatiotemporal Interictal Discharges in Focal Epilepsy.

Authors:  Maxime O Baud; Jonathan K Kleen; Gopala K Anumanchipalli; Liberty S Hamilton; Yee-Leng Tan; Robert Knowlton; Edward F Chang
Journal:  Neurosurgery       Date:  2018-10-01       Impact factor: 4.654

6.  Open versus two forms of arthroscopic rotator cuff repair.

Authors:  Neal L Millar; Xiao Wu; Robyn Tantau; Elizabeth Silverstone; George A C Murrell
Journal:  Clin Orthop Relat Res       Date:  2009-01-30       Impact factor: 4.176

7.  A New Machine Learning-Based Framework for Mapping Uncertainty Analysis in RNA-Seq Read Alignment and Gene Expression Estimation.

Authors:  Adam McDermaid; Xin Chen; Yiran Zhang; Cankun Wang; Shaopeng Gu; Juan Xie; Qin Ma
Journal:  Front Genet       Date:  2018-08-14       Impact factor: 4.599

8.  Digging deep into Golgi phenotypic diversity with unsupervised machine learning.

Authors:  Shaista Hussain; Xavier Le Guezennec; Wang Yi; Huang Dong; Joanne Chia; Ke Yiping; Lee Kee Khoon; Frédéric Bard
Journal:  Mol Biol Cell       Date:  2017-10-11       Impact factor: 4.138

9.  Machine Learning Analysis for Quantitative Discrimination of Dried Blood Droplets.

Authors:  Lama Hamadeh; Samia Imran; Martin Bencsik; Graham R Sharpe; Michael A Johnson; David J Fairhurst
Journal:  Sci Rep       Date:  2020-02-24       Impact factor: 4.379

10.  Unsupervised machine-learning classification of electrophysiologically active electrodes during human cognitive task performance.

Authors:  Krishnakant V Saboo; Yogatheesan Varatharajah; Brent M Berry; Vaclav Kremen; Michael R Sperling; Kathryn A Davis; Barbara C Jobst; Robert E Gross; Bradley Lega; Sameer A Sheth; Gregory A Worrell; Ravishankar K Iyer; Michal T Kucewicz
Journal:  Sci Rep       Date:  2019-11-22       Impact factor: 4.379

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.