Literature DB >> 35583050

Reliability and Validity of a Chinese Version of the Lysholm Score and Tegner Activity Scale for Knee Arthroplasty.

Hongshi Huang1, Si Zhang1, Yi Wang1, Yelverton Tegner2, Yongjian Wang1, Yanfang Jiang1, Yingfang Ao3.   

Abstract

OBJECTIVE: To verify the reliability and validity of Chinese versions of the Lysholm score and the Tegner activity scale for knee arthroplasty.
METHODS: Sixty-four patients undergoing total knee arthroplasty and 28 healthy volunteers were included in this study. Participants were divided into 4 groups: a pre-operation group; a 3 months post-operation group; a 1-year post-operation group; and a control group of healthy volunteers. Participants completed the Lysholm score and Tegner activity scale twice over a period of 3-7 days.
RESULTS: The intraclass correlation coefficients of the Lysholm score and Tegner scale were both relatively high, at 0.99 and 0.97, respectively. Moreover, the Cronbach's alpha of the Lysholm score was 0.71. The items "locking" and "instability" differed slightly between groups (Kruskal-Wallis: for locking, χ2 (p) = 13.48, p = 0.0037; for instability, χ2 (p) = 4.32, p = 0.2292).
CONCLUSION: The simplified-Chinese versions of the Lysholm score and the Tegner scale are applicable for use with patients undergoing total knee arthroplasty, and have relatively high validity and reliability. The items "locking" and "instability" should be combined with clinical data to make the Lysholm score more suitable for assessment of total knee arthroplasty.

Entities:  

Mesh:

Year:  2022        PMID: 35583050      PMCID: PMC9340814          DOI: 10.2340/jrm.v54.2304

Source DB:  PubMed          Journal:  J Rehabil Med        ISSN: 1650-1977            Impact factor:   3.959


Arthropathy is a major disease that affects many people worldwide, with the knee regarded as the most affected joint (1). Degenerative osteoarthritis significantly affects patients’ quality of life. Total knee replacement (TKR) is the one of optimal way to reduce pain and recover function in severely damaged knee joints (2). As the overall burden to society of knee arthritis is increasing, the use of TKRs is also increasing (3). Thus, it is important that clinicians and patients have a good understanding of the clinical effects of TKR, with standard outcome measures, including measurements of pain, functioning and level of activity. Most previous studies have used the Western Ontario and McMaster Universities Osteoarthritis Index (WOMAC) to evaluate postoperative function and recovery of patients undergoing TKR. The WOMAC questionnaire comprises 3 dimensions of evaluation, but includes many questions (4–6). Although there are many rating scales used to evaluate knee condition after surgery, there is no consensus on a single, effective evaluation system, due to the importance of knee functioning and the complexity of the knee joint anatomy (7). The evaluation systems cannot be used universally, due to cultural and language differences. Thus, convenient, simple and effective Chinese versions of the assessment scales are needed. In a meta-analysis, Waleed Alrawashdeh’s team (4) found that the WOMAC is currently routinely used to assess postoperative functioning and recovery in patients undergoing TKR. The Lysholm score and the Tegner activity scale were initially designed for evaluation of cruciate ligament injuries, especially injuries of the anterior cruciate ligament (ACL) (8–10). The Tegner activity scale score ranges from 0 (sick leave or disability pension due to knee problems) to 10 (competitive sports at a relatively high level) on a numerical scale (11). The Lysholm Knee Score, was proposed by Yelverton Tegner and Jack Lysholm, and was published in the Journal of Clinical Orthopaedics and Related Research in 1985 (10). The Lysholm score characterizes the way the knee is used in work, sports and leisure activities to assess the functioning of knee joint; moreover, the scale is used as a supplement to the knee functional rating system (10). Verification of the Lysholm and Tegner scales could provide better ACLR (Anterior Cruciate Ligament Reconstruction) data and support for clinicians and researchers. Osteoarthrosis of the knee joint is a highly prevalent condition; therefore, if assessment with the Lysholm score and Tegner scale are suitable for patients with TKA, then they may also be useful for assessment of patients with knee osteoarthrosis. The International Knee Documentation Committee (IKDC) Subjective Knee Evaluation Form (SKEF) was published in 1993 and modified in 1994; however, the most widely used version was published in 2001 (12). Moreover, the Chinese version of the IKDC-SKEF was translated and verified in 2010 (13). The IKDC system is currently one of the most popular knee assessment scale systems. Many international rating scales are used in China; however, most of the scales have not undergone verification for rehabilitation and validity. Currently, the only Chinese versions that have undergone a verification process are the IKDC 2000, Knee Injury and Osteoarthritis Outcome Score (KOOS), the Lysholm Knee Score and the Tegner activity scale. The simplified-Chinese version of the Lysholm Knee Score has recently been proven to be reliable and effective in an ACLR group (14); however, the reliability and effectiveness of the simplified-Chinese version of the Lysholm Score has not been verified in a TKA group (14).

METHODS

Translation procedure

The Lysholm Knee Score was translated from English to simplified-Chinese by 2 bilingual translators whose mother tongue is simplified-Chinese. The project and score settings were completely in accordance with the original English settings. The translator translated and calibrated the items according to the Oxford English Dictionary and Chinese customs. The 2 translation results were integrated into a single version, which was accepted by both translators. Another 2 bilingual translators, who spoke English as their native language, independently translated the single version back to English. The items in the back-translated English version were the same as in the original scale. The Tegner activity scale has been successfully translated into a simplified-Chinese version in previous studies, and its reliability and validity have been verified (15). The translation procedure was consistent with that of the Lysholm Knee Score (15).

Participants

Sixty-four patients undergoing total knee arthroplasty and 28 healthy volunteers were included in this study. Participants were divided into 4 groups: pre-operation, 3 months post-operation, 1-year post-operation, and a control group comprising healthy volunteers. There were 22 patients in the pre-operative group, 21 in the 3-months post-operation group and 21 in the 1-year post-operation group. The participants had to be able to read and communicate with others. All of the patients were undergoing TKA surgery at the Institute of Sports Medicine at Peking University Third Hospital within the year 2019. Health volunteers recruited a total of 28 people. Criteria for inclusion were unconscious cognitive disorders, absence of any symptoms of lower limb joint and confirmed disease, and age 40 years or older. All volunteers got in touch with the team through the recruiter’s advertisement and enrolled in the group.

Validation process

Demographic and clinical features were obtained upon screening of participants. All participants completed the rating scale under the instructions of study team members. The instructions provided by the research assistant were identical to those in the questionnaire (11). In addition, the patients completed the IKDC. The participants then completed the same Lysholm and Tegner scales again 3–7 days later. If the participants could not attend the hospital to complete the questionnaires, the study team members contacted them to complete the questionnaires by telephone.

Statistical analysis

The baseline characteristics are reported as means ± standard deviations (SD). Mann–Whitney U test was used to determine if there were any differences among measures, sexes, etc. Intraclass correlation coefficient (ICC) and 95% confidence interval (95% CI) were used to calculate relative reliability, and the result was considered acceptable when ICC was greater than 0.70. To obtain internal consistency within the Lysholm score, Cronbach’s alpha was calculated; however, because of the particularity of the Tegner scale, which consists of only 1 question, Cronbach’s alpha could not be used (16). The standard error of measurement (SEM) and the smallest detectable change (SDC) are usually used to assess absolute reliability. As for the differences among the pre-operation, 3 months post-operation, 1-year post-operation, and control groups, the Kruskal–Wallis test was used for each item. Spearman’s coefficient values were used to indicate the relationship between 2 items that were extremely good (> 0.9), good (0.7–0.9), moderate (0.5–0.69), fair (0.2–0.5), or minimal to absent (0.0–0.2). Furthermore, floor and ceiling effects were also calculated, and were considered to be nonexistent (10). All statistical analyses were performed using STATA SE 13.0 software (Statacorp, College Station, TX, USA). The significance level alpha was 0.05 and for post-hoc analysis was set at 0.05/4 = 0.0125.

RESULTS

According to the forward and backward translations, no major modifications were made by the translators, except for the terms “limp”, “instability”, “swelling” and “squatting”, which were modified slightly by the translators for clarity. However, no other problems were found according to the response in the process of scales application. A total of 92 volunteers participated in this study, 27 men and 65 women, mean age 65 ± 8 years (range 40–85 years). Moreover, all participants returned to the hospital to complete the follow-up assessments. There were 20 participants in the pre-operation group, all of whom completed all of the questions. There were 28 participants in the control group (age 62 ± 13 years), 20 in the pre-operation group (66 ± 7 years), 24 in the 3-month post-operation group (66 ± 4 years) and 20 in the 1-year post-operation group (66 ± 6 years). All of the baseline characteristics for the participants are shown in Table I. Most participants were in the older age range due to the particular nature of the TKA surgery. The overall sex ratio was approximately 1:2 (men:women), and the Mann–Whitney U test showed no significant differences between the 2 measurements (p > 0.05). However, it proved that there are differences between the sexes with regards to the Tegner scale (p = 0.0051, p < 0.01), for which the scores for men were higher than those for women.
Table I

Baseline participant characteristics and the mean for all groups.

Control group n = 28Pre-operation group n = 203 months post-operation group n = 24One-year post-operation group n = 20
Male/female13/152/188/164/16
Age, years (SD)62 (13)66 (7)66 (4)66 (6)
Range, years40–8552–7860–7554–76
Side, n Left/right/both4/16/012/9/37/11/2
Tegner (SD)4 (2)2 (1)2 (1)2 (1)
Lysholm (SD)94 (10)53 (13)64 (18)68 (20)

SD: standard deviation.

Baseline participant characteristics and the mean for all groups. SD: standard deviation. The mean and median results are shown in Table II. According to the ICC, there was good reliability for both measurements, with results of 0.99 for the Lysholm score and 0.97 for the Tegner scale in all participants. At the same time, Cronbach’s alpha was 0.71 for the Lysholm score; because the Tegner scale only had 1 question, the alpha value was not calculated. SEM and SDC values were also calculated. For the Lysholm score, the SEM was 1.95 and the SDC was 5.41, and, for the Tegner scale, the SEM was 0.24 and the SDC was 0.68. The Kruskal–Wallis test is used to show if there is an overall significant difference among the groups; the results showed that differences exist among the 4 groups (p < 0.001). The Mann–Whitney U post-hoc test was used to calculate the relationship between groups. For the Tegner scale, the control group was significantly different from the pre-operation group, the 3 months post-operation group and the 1-year post-operation group (p < 0.05). Compared with the Tegner scale, not only the control group, but also the pre-operation group has significant differences compared with other groups (p < 0.05).
Table II

Parameters for test-retest reliability related to the Lysholm and Tegner for all patients

Lysholm (maximum 100 points)Tegner (maximum 10 points)
1st measurement, mean (SD)71.48 (22.14)2.42 (1.53)
1st measurement, median722
2nd measurement, mean (SD)71.67 (21.83)2.48 (1.52)
2nd measurement, median722
ICC (95% CI)0.99 (0.99; 0.99)0.97 (0.96; 0.98)
SEM1.950.24
SDC5.410.68

SD: standard deviation; 95% CI: 95% confidence interval; ICC: intraclass correlation coefficient; SEM: standard error of measurement; SDC: smallest detectable change.

Parameters for test-retest reliability related to the Lysholm and Tegner for all patients SD: standard deviation; 95% CI: 95% confidence interval; ICC: intraclass correlation coefficient; SEM: standard error of measurement; SDC: smallest detectable change. Spearman’s ρ coefficient values are used to show the relationship between 2 items; moderate correlations were found between the IKDC and Lysholm scores (ρ = 0.74, p < 0.001), the IKDC and Tegner scales (ρ = 0.76, p < 0.001), and the Tegner scales and Lysholm scores (ρ = 0.68, p < 0.001). None of the groups, except for the control group, displayed floor and ceiling effects. Specific results are shown in Table III.
Table III

Parameters for test-retest reliability related to the Lysholm items in all patients

Item 1: limp (maximum 5 points)Item 2: support (maximum 5 points)Item 3: locking (maximum 15 points)Item 4: instability (maximum 25 points)Item 5: pain (maximum 25 points)Item 6: swelling (maximum 10 points)Item 7: stair-climbing (maximum 10 points)Item 8: squatting (maximum 5 points)
1st measurement, mean (SD)3.32 (1.92)4.28 (1.29)14.23 (2.95)22.88 (5.36)13.26 (10.26)6.59 (4.23)4.35 (3.88)2.58 (1.96)
1st measurement, median351525101022
2nd measurement, mean (SD)3.40 (1.86)4.32 (1.27)14.07 (2.76)23.04 (5.29)13.37 (10.19)6.59 (4.19)4.33 (3.78)2.57 (1.93)
2nd measurement, median351525101022
ICC (95% CI)0.97(0.95/0.98)0.97(0.96/0.98)0.85(0.78/0.90)0.98(0.96/0.98)0.99(0.99/0.99)0.99(0.99/0.99)0.97(0.96/0.98)0.98(0.97/0.99)
SEM0.340.221.100.820.900.420.610.27
SDC0.960.613.062.282.501.151.680.73
Kruskal–Wallis χ2(p)60.17(p < 0.001)16.33(p = 0.001)13.48(p = 0.0037)4.32(p = 0.2292)78.09(p < 0.001)32.44(p < 0.001)43.07(p < 0.001)82.85(p < 0.001)

SD: standard deviation; 95% CI: 95% confidence interval; ICC: intraclass correlation coefficient; SEM: standard error of measurement; SDC: smallest detectable change.

Parameters for test-retest reliability related to the Lysholm items in all patients SD: standard deviation; 95% CI: 95% confidence interval; ICC: intraclass correlation coefficient; SEM: standard error of measurement; SDC: smallest detectable change. Both the Lysholm and the Tegner scales seemed to have a constant variance between 2 measurements, as shown in Fig. 1. Correlations of the 2 rating scales’ with the IKDC are shown in Tables IV and V. Most items appeared to have good correlation with the IKDC.
Fig. 1

Bland–Altman plot for the reliability of test-retest related to the Lysholm scale (patients with total knee arthroplasty (TKA) only).

Table IV

Parameters for test-retest reliability related to the Lysholm scores in different groups

Control group n = 28Pre-operation group n = 203 months post-operation group n = 24One-year post-operation group n = 20
IKDC_overall score0.600.650.510.78
IKDC_Q1 score0.650.210.530.54
IKDC_Q2 score–0.60–0.41–0.48–0.35
IKDC_Q3 score–0.60–0.76–0.50–0.24
IKDC_Q4 score0.380.550.310.43
IKDC_Q5 score0.650.840.540.41
IKDC_Q6 score–0.350.50–0.18–0.06
IKDC_Q7 score0.520.350.360.53
IKDC_Q8 score0.410.590.750.61
IKDC_Q9a score0.810.560.590.74
IKDC_Q9b score0.810.670.590.72
IKDC_Q9c score0.840.690.400.85
IKDC_Q9d score0.770.140.470.68
IKDC_Q9e score0.230.460.590.43
IKDC_Q9f score0.530.450.590.68
IKDC_Q9g score0.680.330.350.52
IKDC_Q9h score0.690.030.130.62
IKDC_Q9i score0.340.780.140.48
IKDC_Q10 score0.790.740.680.56

IKDC: The International Knee Documentation Committee

Q1: question 1 (The following are all the same. The number is the question number.)

Table V

Parameters for test-retest reliability related to the Tegner scales in different groups

Control group n = 28Pre-operation group n = 203 months post-operation group n = 24One-year post-operation group n = 20
IKDC_overall score0.890.650.580.41
IKDC_Q1 score0.940.030.650.34
IKDC_Q2 score–0.35–0.58–0.28–0.09
IKDC_Q3 score–0.35–0.59–0.36–0.12
IKDC_Q4 score0.340.280.350.18
IKDC_Q5 score0.940.660.640.56
IKDC_Q6 score–0.230.39–0.040.21
IKDC_Q7 score0.840.250.320.22
IKDC_Q8 score0.780.350.710.44
IKDC_Q9a score0.760.610.700.44
IKDC_Q9b score0.790.720.710.34
IKDC_Q9c score0.680.620.450.45
IKDC_Q9d score0.670.270.570.57
IKDC_Q9e score0.110.310.560.42
IKDC_Q9f score0.390.470.540.16
IKDC_Q9g score0.750.260.410.14
IKDC_Q9h score0.700.03–0.120.15
IKDC_Q9i score0.470.660.280.18
IKDC_Q10 score0.740.770.540.34

IKDC: The International Knee Documentation Committee

Q1: question 1(The following are all the same. The number is the question number.

Bland–Altman plot for the reliability of test-retest related to the Lysholm scale (patients with total knee arthroplasty (TKA) only). Parameters for test-retest reliability related to the Lysholm scores in different groups IKDC: The International Knee Documentation Committee Q1: question 1 (The following are all the same. The number is the question number.) Parameters for test-retest reliability related to the Tegner scales in different groups IKDC: The International Knee Documentation Committee Q1: question 1(The following are all the same. The number is the question number.

DISCUSSION

This study is the first to validate the Lysholm score and Tegner activity scale for Chinese patients undergoing TKA, for quantification of the functional level of the knee in sports activities, as well as in daily life. The Lysholm score and Tegner activity scale are valid and reliable, thus both scales can enable the assessment of knee function and activities of patients undergoing TKA. Clinicians and the researchers could use the Lysholm score in combination with the Tegner scale as an easy assessment tool to evaluate the results of TKA. Due to cultural differences, although all participants were willing to cooperate with the study and could individually complete the Lysholm-C and Tegner-C assessments, some of them responded that they could not choose their answers by themselves because of the ambiguous options. In contrast to previous research, an ICC value of approximately 0.98 was satisfactory in this study, except for the case of the “locking” item. Furthermore, Cronbach’s alpha for the Lysholm score, which is used to assess homogeneity, was 0.71. The alpha value was similar to a Chinese ACL group; however, the ICC from this study was a little better than that group (14). Thus, it can be inferred that although every condition is different and exhibits diverse symptoms, the participants’ scores remained similar. The SDC of the Lysholm score (5 points) for TKA participants in this study was slightly lower than for patients with meniscal (9 points) or ACL injuries (11–14.6 points) (8, 14, 17–19). It can be concluded that the condition of participants is similar. Furthermore, although the SDC of the Tegner scale for participants in this study was 0.68 points, which is slightly lower than the SDC (1 point) in ACL injury populations (8, 14, 17–19), it can be ensured that a measured change is not caused by errors in measurement. In conclusion, the Lysholm and Tegner scales are valid for use in clinical practice. Bland–Altman plot for reliability of test-retest related to the Tegner scale (patients with total knee arthroplasty (TKA) only). For the Tegner activity scale, the overall score of participants in this study appears to be a slightly lower compared with other studies (8, 11, 15, 17, 19). The Tegner scale is a unique questionnaire scaling system, because it consists of only 1 item. The objective of this study was to evaluate the functioning and structure of TKA patients who were at least 60 years old. Moreover, the demands and expectations for elderly patients are different from those for young people. Most elderly people do not play sports, and their only activity is housework; thus, they do not need as much function and structure as young people (20). In this study, the IKDC scale was regarded as the gold standard; hence the Lysholm and Tegner scales were compared with the IKDC (14). The results show that both the Lysholm and Tegner scales correlated with the IKDC. This conclusion is similar to that found for ACL patients (14, 15). In general, the best scale for measuring patients’ function and ability depends on evaluation of the scale dimensions. In the current study only 1 or 2 participants in each of the 3 experimental groups attained the highest possible score; therefore, there was no ceiling or floor effect for the Lysholm score. Ceiling and floor effects exist if the effects are observed for more than 30% of the group; the control group was excluded from this analysis. Thus, the absence of these effects indicates that the items were able to distinguish well between the different groups. As a result, most of the patients have different scores. The Karen K. (8) found that, in ACL patients in Germany, the items “support” and “locking” had high ceiling effects, and the item “squatting” had high floor effects (8). In contrast, the current data imply that the Lysholm and Tegner scales are more suitable for assessment of Chinese TKA patients. This study has a number of limitations. The limited time and patient availability resulted in a small sample size, which limits the conclusions that can be drawn. Further research, using groups with longer times post-operation and larger sample sizes, is needed to assess the sensitivity and validation of these tests. In addition, this study followed up only on patients who underwent surgery within 1 year of participation. Future studies should follow up patients for a longer time after surgery. Moreover, each group was comprised of different participants; further studies should follow the same individuals and evaluate them prior to surgery and at 3 months and 1 year or more after surgery. In addition, more detailed standards should be developed, so that the patients can better understand the questions despite differences in language and culture. Further studies, aimed specifically at TKA patients, are needed to verify more rating scales that perform better than the Lysholm and Tegner assessments, for development of a gold standard for assessment of these patients.

CONCLUSION

Overall, the Chinese version of the Lysholm score and the Tegner activity scale displayed acceptable reliability and validity for use in assessing patients with TKA.
  19 in total

1.  Translation and validation of Chinese version of International Knee Documentation Committee Subjective Knee Form.

Authors:  Siu-Ngor Fu; Yat-Hing Chan
Journal:  Disabil Rehabil       Date:  2010-10-22       Impact factor: 3.033

2.  [Quality of life and physical activity among elderly and old people.]

Authors:  A S Bashkireva; D Y Bogdanova; A Y Bilyk; A V Shishko; E Y Kachan; V A Arutyunov
Journal:  Adv Gerontol       Date:  2018

3.  Function and activity in patients with knee arthroplasty: validity and reliability of a German version of the Lysholm Score and the Tegner Activity Scale.

Authors:  Jaap Swanenburg; Peter P Koch; Nadja Meier; Brigitte Wirth
Journal:  Swiss Med Wkly       Date:  2014-06-12       Impact factor: 2.193

4.  [Development and evaluation of a German version of the Tegner activity scale for measuring outcome after anterior cruciate ligament injury].

Authors:  B Wirth; N Meier; P P Koch; J Swanenburg
Journal:  Sportverletz Sportschaden       Date:  2013-02-14       Impact factor: 1.077

5.  Validation of the Spanish Version of the Knee Injury and Osteoarthritis Outcome Score (KOOS) for Elderly Patients With Total Knee Replacement.

Authors:  Alejandro Lizaur-Utilla; Francisco A Miralles-Muñoz; Santiago Gonzalez-Parreño; Fernando A Lopez-Prats
Journal:  J Orthop Res       Date:  2019-06-21       Impact factor: 3.494

6.  Is there a role for rehabilitation streaming following total knee arthroplasty? Preliminary insights from a randomized controlled trial.

Authors:  Justine M Naylor; Jack Crosbie; Victoria Ko
Journal:  J Rehabil Med       Date:  2015-03       Impact factor: 2.912

7.  Development of a patient-reported measure of function of the knee.

Authors:  J J Irrgang; L Snyder-Mackler; R S Wainner; F H Fu; C D Harner
Journal:  J Bone Joint Surg Am       Date:  1998-08       Impact factor: 5.284

8.  The reliability and validity of knee-specific and general health instruments in assessing acute patellar dislocation outcomes.

Authors:  Elizabeth W Paxton; Donald C Fithian; Mary Lou Stone; Patricia Silva
Journal:  Am J Sports Med       Date:  2003 Jul-Aug       Impact factor: 6.202

9.  The reliability, validity, and responsiveness of the Lysholm score and Tegner activity scale for anterior cruciate ligament injuries of the knee: 25 years later.

Authors:  Karen K Briggs; Jack Lysholm; Yelverton Tegner; William G Rodkey; Mininder S Kocher; J Richard Steadman
Journal:  Am J Sports Med       Date:  2009-03-04       Impact factor: 6.202

Review 10.  Measures of knee function: International Knee Documentation Committee (IKDC) Subjective Knee Evaluation Form, Knee Injury and Osteoarthritis Outcome Score (KOOS), Knee Injury and Osteoarthritis Outcome Score Physical Function Short Form (KOOS-PS), Knee Outcome Survey Activities of Daily Living Scale (KOS-ADL), Lysholm Knee Scoring Scale, Oxford Knee Score (OKS), Western Ontario and McMaster Universities Osteoarthritis Index (WOMAC), Activity Rating Scale (ARS), and Tegner Activity Score (TAS).

Authors:  Natalie J Collins; Devyani Misra; David T Felson; Kay M Crossley; Ewa M Roos
Journal:  Arthritis Care Res (Hoboken)       Date:  2011-11       Impact factor: 4.794

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.