Literature DB >> 29952982

Cross-cultural adaptation and validation of the Chinese version of the shoulder pain and disability index in patients with symptomatic shoulder pain: A prospective case series.

Wei Wang¹, Zhen-Yu Jia, Jiao Liu, Qing-Yun Xie, Jin Cui, Wei Zheng, Wei-Dong Xu.

Abstract

The aim of this study is to cross-culturally adapt and translate the original version of SPADI into Chinese (C-SPADI), and to test the reliability, validity, and responsiveness of the C-SPADI.This research was a test of previously developed diagnostic criteria in a series of consecutive patients with universally applied gold standard. The original version was translated into Chinese according to international recognized standards. Patients who were diagnosed with a shoulder disorder and underwent shoulder arthroscopic treatments from 2014 to 2015 were enrolled in our study. Each participant was asked to finish the C-SPADI, the Short-Form 36 (SF-36), and the Oxford Shoulder score (OSS) at first visit. The C-SPADI was completed a second time with an interval of 7 days. Six months after arthroscopic treatments, the C-SPADI was completed a third time for responsiveness evaluation. The Cronbach alpha, intraclass correlation coefficient (ICC), standard error of measurement (SEM), minimally detectable change (MDC), Pearson correlation coefficient (r), effect size (ES), and standardized response mean (SRM) were calculated to evaluate the reliability, validity, and responsiveness of C-SPADI respectively.The original version of the SPADI was well adapted and translated into Chinese. The Cronbach alpha ranged from 0.812 to 0.912 in all subscales and total scale of the C-SPADI, indicating good or excellent internal consistency. The test-retest reliability (ICC = 0.887-0.915, SEM = 5.47, MDC = 15.16) was proved to be good or excellent. Moderate or good correlations (r = 0.556-0.672) were obtained between the C-SPADI and the OSS, physical subscales of SF-36; and poor, fair, or moderate correlations (r = 0.038-0.492) were obtained between the C-SPADI and mental subscales of SF-36, which, adequately illustrated good discriminant validity in the C-SPADI. Additionally, the responsiveness was considered good in the C-SPADI (SRM = 1.58-2.44, ES = 1.79-2.17).The C-SPADI was documented to be a reliable, valid, and responsible instrument for self-assessment of patients with shoulder disorders in China. LEVEL OF EVIDENCE: Level II.

Entities: Chemical Disease Gene Species

Mesh：

Year: 2018 PMID： 29952982 PMCID： PMC6039594 DOI： 10.1097/MD.0000000000011227

Source DB: PubMed Journal: Medicine (Baltimore) ISSN： 0025-7974 Impact factor: 1.889

Introduction

Shoulder pain is the third most common musculoskeletal pain with lifetime prevalence in the general population of 6.7% to 66.7%.[ Although shoulder pain is not a life-threatening disease, it is still a major problem in China because pain and stiffness in the shoulder may lead to labor disability, work absenteeism, and large medical costs. As reported, the cross-sectional prevalence of neck and shoulder pain is 41.1% in Chinese adolescents.[ Since the 1980s, the evaluation of health-related quality of life (HRQOL) was developed to specifically ascertain the patients’ needs and problems of everyday life and to enable patients themselves provide important information about their physical and psychological status.[ Disease-specific scales are used to assess specific patient groups, such as the most commonly used instruments for shoulder disorders: Shoulder Pain and Disability Index (SPADI), Disability of the Arm, Shoulder and Hand Questionnaire (DASH), and the Oxford Shoulder Score (OSS).[ Because of the increasing number of large multicenter international studies and the requirement for globally meaningful epidemiologic and/or therapeutic study results, there is a need for cross-cultural adaptation and validation of HRQOL scales.[ To avoid the evaluation error caused by cultural differences, it is necessary to test the psychometric properties of the scale rather than simply translating the content.[ The SPADI is a simple disease-specific scale to evaluate the impact of shoulder disorders on the general quality of life.[ It has been well adapted in 12 languages and widely proved to be a valid and reliable instrument.[ However, instruments for shoulder are still in shortage in China. Now there is no one Chinese version of SPADI even though China has the largest population of patients with shoulder disorders. As applied in this study, psychometric properties including reliability, validity, and responsiveness were tested for validation according to the advice and guidelines of the Consensus-based Standards for the Selection of Health Status Measurement Instruments study.[ Therefore, the aim of this study was to translate and adapt the SPADI into a Chinese version (C-SPADI) and evaluated the reliability, validity, and responsiveness of the C-SPADI in a cohort of native Chinese-speaking patients with shoulder disorders.

Patients and methods

Translation and cross-cultural adaptation

The procedure of translation and cross-cultural adaptation was performed following the guidelines of the American Association of Orthopaedic Surgeons (AAOS) Outcome Committee.[ There are 5 steps needed to perform. Step 1—Forward translation: respective translation by 2 bilingual translators whose first language was Chinese (a professor of our department and an English professor); Step 2—Synthesis of 2 translations: a discussion was held to integrate the 2 independent forward translation drafts, later we obtained the primary Chinese version of SPADI; Step 3—Back translation: back-translated into English by 2 bilingual translators whose first language was English, and the 2 translators have no knowledge about the original SPADI; Step 4—Creation of prefinal version: a prefinal version of C-SPADI created by an expert committee after the discrepancies, ambiguities, and other language expression issues existed in the Chinese version were solved; Step 5—Test of prefinal version: 20 patients with shoulder disorders completed the prefinal C-SPADI for assessment, and feedbacks were collected. A discussion was held for final adjustments according to these feedbacks, and the final version of C-SPADI was obtained.

Participants and data collection

This research was a test of previously developed diagnostic criteria in a series of consecutive patients with universally applied gold standard. Patients enrolled in this study were mainly consecutively recruited from those with symptomatic shoulder disorders admitted in Shanghai Changhai Hospital and Chengdu Military General Hospital between March 2014 and March 2015. Inclusion criteria were as follows: patients with shoulder disorders and scheduled for shoulder arthroscopic treatment; able to read Chinese and fill in the scales independently. Patients who were under 18 years old and had other disorders that could affect shoulder were excluded. Our study met the quality criteria proposed by Terwee et al[ for measurement properties of health status scales, which required the results from at least 100 patients to perform internal consistency analysis and from at least 50 patients for floor or ceiling effects, reliability, and validity analyses. Informed written consent was obtained from each participant. The study was approved by the clinical research ethics committee of our hospital (No. CHEC 2014-071, Date: 2014/01/10). All patients were asked to provide demographic information on the first day of enrollment, and to independently complete 3 scales included the C-SPADI, OSS, and SF-36 in a quiet meeting room. One week after the first filling-in, also the day before the arthroscopic surgery, they completed the C-SPADI for the second time to evaluate the test–retest reliability of the scale. Six months after the surgery, when the patients came to our hospital for regularly check, they completed the C-SPADI for the third time to help evaluate the responsiveness.

Scales

The SPADI was designed to measure the impact of shoulder pathology in terms of pain and disability.[ It is a self-administered questionnaire including pain subscale (5 items) and disability subscale (8 items). Each item is rated on a visual analogue scale from 0 to 11, with 0 representing no pain/no difficulty and 11 indicating worst pain/most difficulty. The score of total scale consists of the score of 2 subscales and all scores are converted to a 0 to 100 scale. The SF-36 was a widely used instrument that measured 8 subscales of health including physical functioning (PF), role-physical (RP), bodily pain (BP), general health (GH), vitality (VT), social functioning (SF), role-emotional (RE), and mental health (MH). Each subscale score was transformed into a 0 to 100 scale; a lower score indicated worse quality of life and severer disability.[ The OSS was a shoulder pain-specific questionnaire for the evaluation of pain perception and daily function in patients suffering from shoulder pain using 12 items. Each item scored from 0 to 4 and the total score ranged from 0 (worst) to 48 (best). A lower score indicated a worse status of shoulder joint.[ Chinese versions of the above 2 scales were available, which had been proved reliable, valid, and responsive.[

Psychometric assessments and statistical analysis

Statistical package for the Social Sciences, version 20.0 (SPSS, Chicago, IL) was used for statistical analysis. P value of .05 or less was considered statistically significant. Test–retest reliability was evaluated to assess the reliability of C-SPADI using intraclass correlation coefficient (ICC) and the Bland–Altman plot. The outcomes were classified into 5 grades (r = 0.81–1.0 excellent; 0.61–0.80 very good; 0.41–0.60 good; 0.21–0.40 fair; and 0.00–0.20 poor).[ The Bland–Altman plot could be used to measure within-subject variation and limits of agreement.[ Measurement error was assessed by calculating the standard error of measurement (SEM) and the minimally detectable change (MDC).[ SEM was calculated as the square root of the within-subject total variance of an ANOVA analysis. MDC95 was calculated as .[ Internal consistency was assessed using Cronbach alpha. The value of Cronbach alpha between 0.70 and 0.95 indicated a good internal consistency.[ The validity of the C-SPADI was assessed by evaluating both of the content validity and construct validity. The content validity consists of the assessments of comprehensiveness and relevance of items.[ The item response rate, ceiling/floor effects, and patient feedback were the 3 indexes for comprehensiveness assessment. If the response rate for each item in the scale was >95%, the ceiling/floor effect of each subscale was <15%, and no difficulties in understanding the items were fed back from the participants, the scale was considered to have good comprehensiveness.[ In addition, a rehabilitation medicine expert and 3 orthopedics specialists were invited to help us to judge whether the items were relevant for the construct to be measured and for the patients with symptomatic shoulder disorders population. Because there is no gold standard for evaluating the validity of C-SPADI, the hypotheses testing method was employed to evaluate the construct validity.[ In this study, we selected the OSS and SF-36 as the control scales for the C-SPADI. Based on the content of each scale, we hypothesized that the subscales and overall scale of C-SPADI should be well correlated with the OSS and the physical subscales of SF-36 (PF, RP, BP, GH), but poorly with the mental subscales of SF-36 (VT, SF, RE, MH). In addition, because the OSS is specifically designed to assess the patient's function in shoulder area, and the SF-36 is only a generic scale, we hypothesized that the correlation between the C-SPADI and the OSS should be better than that with any subscales of SF-36. Based on the above hypotheses, the Pearson correlation coefficient (r) of the C-SPADI with the OSS and SF-36 were calculated. Correlations were categorized as follows: poor (r=0–0.20), fair (r=0.21–0.40), moderate (r = 0.41–0.60), very good (r = 0.61–0.80), or excellent (r = 0.8–1.0).[ The standardized response mean (SRM) and the effect size (ES) were calculated to assess the responsiveness by comparing the results of 1st and 3rd tests of C-SPADI. SRM was defined as the mean change between these time points divided by the SD of this change. ES was defined as the mean change between preoperative results and 6-month postoperative results divided by the SD of the preoperative C-SPADI score.[ ES and SRM were considered large if >0.80, moderate between 0.51 and 0.80 and small if lower than 0.50.[

Results

Cross-cultural adaptation

During forward and back-translation of SPADI, there were no major problems or large language difficulties existed. Small revision was made in disability subscale to adapt to the cultural background. Because the Chinese do not use “pounds” as weight units, we changed the “10 pounds” in item 7 into “10 jin.” The formula for unit conversion was as follows. No patient raised the problem of understanding in our pretest and formal study. 1 pound= 0.907 jin.

Patients

Altogether 126 (48 female/78 male) patients with shoulder disorders were recruited in this study. All of them completed the first 2 rounds scales. However, 24 patients failed to finish the scale for the third time (6 months after their shoulder arthroscopic surgery). As a result, the results of 126 subjects were included in C-SPADI's reliability and validity tests, while only 102 subjects’ results were assessed in the scale's responsiveness test. Patients’ detailed demographic information was listed in Table 1.

Table 1

Demographic and clinical characteristics of participants.

Reliability

The ICCs for the subscales and overall scale of C-SPADI were all more than 0.80 (0.887–0.915) as shown in Table 2, which indicated excellent test–retest reliability. The Bland–Altman plots revealed that the test–retest differences for the C-SPADI were centered around zero (Fig. 1), which indicated there was no systemic error in the data obtained from the first 2 rounds of completed scales and confirmed the good test–retest agreement of C-SPADI. The SEM was 5.47 on the basis of repeated measurements for test–retest. The MDC on the basis of the SEM for test–retest was 15.16 (Table 2). The Cronbach alpha of all subscales and overall scale was from 0.826 to 0.912 (Table 3), which proved the internal consistency of C-SPADI was good.

Table 2

Test–retest reliability and responsiveness of the C-SPADI.

Figure 1

These are Bland–Altman plots of test–retest reliability of the C-SPADI. Each data point indicates how the difference between the 2 test sessions for an individual patient compares to the mean of the 2 sessions for scores of each C-SPADI. The interval of 2 sessions was 1 week. The dashed line shows the 95% (± 1.96 SD) limits of agreement. C-SPADI = the Chinese Version of the Shoulder Pain and Disability Index.

Table 3

The internal consistency and score distribution of the C-SPADI.

Test–retest reliability and responsiveness of the C-SPADI. These are Bland–Altman plots of test–retest reliability of the C-SPADI. Each data point indicates how the difference between the 2 test sessions for an individual patient compares to the mean of the 2 sessions for scores of each C-SPADI. The interval of 2 sessions was 1 week. The dashed line shows the 95% (± 1.96 SD) limits of agreement. C-SPADI = the Chinese Version of the Shoulder Pain and Disability Index. The internal consistency and score distribution of the C-SPADI.

Validity

The score distribution showed that there was no ceiling effect (0.8%) or floor effect (0%) in overall C-SPADI scale and its subscales. No doubt and misunderstanding were reported. After the invited experts’ assessment, the data acquired from C-SPADI was adequate to evaluate the HRQOL of patients with shoulder disorders. In summary, we concluded C-SPADI had a good content validity. Construct validity assessment data are listed in Table 4. The subscales and overall scale of C-SPADI were moderate or good correlated with the OSS and physical subscales of SF-36 (r = 0.556–0.680), and poor, fair, or moderate correlated with the mental subscales of SF-36 (r = 0.038–0.492). In addition, the subscales and overall scale of C-SPADI were stronger correlated with OSS (r = 0.663–0.680) than that with subscales of SF-36 (r = 0.038–0.613). The above results were consistent with our hypothesis, indicating a good construct validity of C-SPADI.

Table 4

Construct validity of the C-SPADI.

Responsiveness

The average scores of the overall scale and subscales of C-SPADI had all decreased after the arthroscopic treatment. And the values of ES and SRM were ranged from 1.79 to 2.17 and 1.58 to 2.44, respectively (Table 2). The results indicated good responsiveness of C-SPADI.

Discussion

The most important finding of this study was that the C-SPADI was demonstrated to have a good score distribution, high internal consistency, excellent test–retest reliability, and notable content and construct validity. Several instruments were designed to evaluate quality of life (QoL) for patients with shoulder disorders and the most commonly used were DASH, OSS, SPADI, and the Constant-Murley shoulder outcome score (CMSOS).[ The OSS is by far the only shoulder scale that has been translated and adapted into Chinese, and the scales for shoulder are still in shortage in China. DASH was used to evaluate patients with upper extremity musculoskeletal conditions but not a shoulder-specific instrument. CMSOS could be affected by surgeon bias and 2 components of CMSOS were tested by the surgeon.[ SPADI is a widely used scale to evaluate the functioning status of patients with shoulder disorders, and it has been translated into multiple languages and its reliability, validity, and responsiveness have been well established.[ It is meaningful to translate SPADI into Chinese to serve the largest number of population in the world. The value of ICC was ranged from 0.887 to 0.915 indicating excellent reproducibility of C-SPADI, which was similar to values of German, Slovene, Brazilian, Persian, Thai, and Arabic versions (ICC = 0.84–0.95).[ The values of MDC and SEM were similar as previous studies.[ However, compared with the original version, the ICCs of the subscales and overall scale in C-SPADI were higher than that in the original version. It might be some real changes occurred in the original study over a 48-hour period and most of the patients (91%) were placed on analgesic medication and 50% of them received a periarticular corticosteroid injection to decrease pain and inflammation,[ while all patients finished the C-SPADI for the second time before any treatments in our study. The Cronbach alpha was 0.912 indicating excellent internal consistency of C-SPADI and the value was close to other language versions proving SPADI was stable in different cultures. However, the Cronbach alpha was higher for the total score than 2 subscales, which indicated the 2 subscales do not reflect separate dimensions clearly. No ceiling effect and floor effect was observed in the overall scale of C-SPADI and all the subscales. Experts assessments also confirmed that the items of C-SPADI are good relevant for the construct to be measured and for the patients with symptomatic shoulder disorders. And there was no one item not answered in the formal research. Integrating these results, we considered that the C-SPADI has good content validity. Correlations between the C-SPADI and the subscales of SF-36 and the OSS were generally consistent with our hypotheses, suggesting that it has good construct validity, and these results also accorded with relevant conclusions from other studies.[ The correlation between the C-SPADI and the OSS is the strongest, despite that the OSS is not specifically developed for patients with symptomatic shoulder disorders. But the OSS has also focused on the region of shoulder, just as the SPADI does. Although the physical subscales of SF-36 were strongly associated with the C-SPADI, it was still lower than that between the C-SPADI and the OSS. This is because that the accuracy SF-36, as a generic scale, in the function status assessment of specific types of patients is lower than that of other specific scales.[ Furthermore, correlations between the mental subscales of SF-36 and C-SPADI were poor, and this result was logical and consistent with that of other studies.[ Scale's responsiveness is an important factor in determining whether it can be used in prospective clinical research. Our study found that the overall scale and subscales of C-SPADI had good responsiveness. It was sensitive to functioning status changes of patients who had received shoulder arthroscopic surgery. The values of ES and SRM were slightly larger than those in other study.[ This is possibly explained by the fact that the treatment our patients received was arthroscopic surgery, and surgical operation as well as conservative treatment was included in other studies, resulting in the different improvement in functional status. The process of translation and cross-cultural adaptation was successful. There was only 1 tiny modification was made in item 7. The weight was accustomed to measure in unit called jin in China while it was always measured in unit called pound in Western countries. Hence, “pound” was substituted by “jin” and the corresponding conversion was made. Strengths of this study are comprehensive assessments of various measurement indexes of the C-SPADI and the longitudinal design of our study. Furthermore, compared with other studies related to other versions of SPADI, the participants were from 2 medical center which could reduce bias during test. However, this study has 3 major limitations. First, the sample was limited in size and not calculated accurately. Second, our translation version of C-SPADI is in simplified Chinese. While mandarin is the official language in China, the many minority groups living here have their own languages. The diversity of ethnicity and cultures has to be taken into consideration in further use. Third, no effect was assessed in the C-SPADI for the patients with shoulder disorders who had received the conservative treatment. This should be dealt with in further research.

Conclusion

In summary, the SPADI was translated to Chinese successfully and was confirmed to have good reliability, validity, and responsiveness. It may be used to assess the functioning status of Chinese patients with symptomatic shoulder disorders in clinical studies or treatments, helping doctors, or researchers to collect required data.

Acknowledgments

We appreciate for the cooperation all the volunteered patients and the stuff working in the outpatient center and in-patient department had given us. We also show a sincere gratitude to the 3 experts Prof Zi-ming Wang, Zhi-wei Wang, and An-ren Zhang who helped to assess the C-SPADI, and statistician Mr. Guang-wen Cao who helped for statistical consultation. Special thanks are dedicated to the translators Miss Rui-li Liu, Mr Olivier Assayas, and Mr Jacques Rivette, without whom our research would not be accomplished.

Author contributions

Conceptualization: Wei Wang, Wei Zheng, Wei-dong Xu. Data curation: Wei Wang, Zhen-yu Jia, Jiao Liu. Formal analysis: Wei Wang, Jiao Liu, Jin Cui. Investigation: Wei Wang, Qing-yun Xie. Methodology: Wei Wang, Zhen-yu Jia, Jiao Liu, Qing-yun Xie. Supervision: Wei Zheng, Wei-dong Xu. Validation: Zhen-yu Jia, Qing-yun Xie, Jin Cui, Wei-dong Xu. Visualization: Wei Zheng, Wei-dong Xu. Writing – original draft: Wei Wang, Zhen-yu Jia, Wei-dong Xu. Writing – review & editing: Zhen-yu Jia, Qing-yun Xie.

36 in total

Review 1. Guidelines for the process of cross-cultural adaptation of self-report measures.

Authors: D E Beaton; C Bombardier; F Guillemin; M B Ferraz
Journal: Spine (Phila Pa 1976) Date: 2000-12-15 Impact factor: 3.468

Review 2. Methods for assessing responsiveness: a critical review and recommendations.

Authors: J A Husted; R J Cook; V T Farewell; D D Gladman
Journal: J Clin Epidemiol Date: 2000-05 Impact factor: 6.437

Review 3. Generic and disease-specific measures in assessing health status and quality of life.

Authors: D L Patrick; R A Deyo
Journal: Med Care Date: 1989-03 Impact factor: 2.983

4. The reliability and validity of the Turkish version of the Western Ontario Rotator Cuff Index.

Authors: Ozlem El; Cigdem Bircan; Selmin Gulbahar; Yucel Demiral; Ebru Sahin; Meltem Baydar; Ramazan Kizil; Sharon Griffin; Elif Akalin
Journal: Rheumatol Int Date: 2006-06-24 Impact factor: 2.631

5. Quality criteria were proposed for measurement properties of health status questionnaires.

Authors: Caroline B Terwee; Sandra D M Bot; Michael R de Boer; Daniëlle A W M van der Windt; Dirk L Knol; Joost Dekker; Lex M Bouter; Henrica C W de Vet
Journal: J Clin Epidemiol Date: 2006-08-24 Impact factor: 6.437

6. Cross-cultural adaptation, validation, and reliability testing of the Shoulder Pain and Disability Index in the Persian population with shoulder problems.

Authors: Mohammad H Ebrahimzadeh; Ali Birjandinejad; Farideh Golhasani; Ali Moradi; Ehsan Vahedi; Amir R Kachooei
Journal: Int J Rehabil Res Date: 2015-03 Impact factor: 1.479

Review 7. Cross-cultural adaptation of health-related quality of life measures: literature review and proposed guidelines.

Authors: F Guillemin; C Bombardier; D Beaton
Journal: J Clin Epidemiol Date: 1993-12 Impact factor: 6.437

8. Cross-cultural adaption and measurement properties of the Danish version of the Shoulder Pain and Disability Index.

Authors: David H Christiansen; Johan H Andersen; Jens Peder Haahr
Journal: Clin Rehabil Date: 2012-08-09 Impact factor: 3.477

9. Shoulder pain and disability index: a validation study in Turkish women.

Authors: A Bicer; H Ankarali
Journal: Singapore Med J Date: 2010-11 Impact factor: 1.858

10. Cross-cultural adaptation and psychometric properties of an Arabic version of the Shoulder Pain and Disability Index.

Authors: Hisham A Alsanawi; Ahmad Alghadir; Shahnawaz Anwer; Kathryn E Roach; Alia Alawaji
Journal: Int J Rehabil Res Date: 2015-09 Impact factor: 1.479

3 in total

1. Functional outcome of single stage capsular release and rotator cuff repair for cuff tear in periarthritic shoulder.

Authors: Roshan Wade; Easwar Elango; Deven Kuruwa
Journal: J Orthop Date: 2020-08-29

2. Development and Validation of a Simplified Chinese Version and a Face-Scale Version of the Oxford Shoulder Score: A 2-Center Prospective Study.

Authors: Wu Xu; Kailun Wu; Stephen Roche; Weili Fu; Lixin Huang; Michael F G Held; Jiong Jiong Guo
Journal: Orthop J Sports Med Date: 2021-09-21

Review 3. How to Assess Shoulder Functionality: A Systematic Review of Existing Validated Outcome Measures.

Authors: Rocio Aldon-Villegas; Carmen Ridao-Fernández; Dolores Torres-Enamorado; Gema Chamorro-Moriana
Journal: Diagnostics (Basel) Date: 2021-05-08

3 in total