| Literature DB >> 33786328 |
Lauren N Bockhorn1, Angelina M Vera1, David Dong1, Domenica A Delgado1, Kevin E Varner1, Joshua D Harris1.
Abstract
BACKGROUND: The Beighton score is commonly used to assess the degree of hypermobility in patients with hypermobility spectrum disorder. Since proper diagnosis and treatment in this challenging patient population require valid, reliable, and responsive clinical assessments such as the Beighton score, studies must properly evaluate efficacy and effectiveness.Entities:
Keywords: Beighton score; hypermobility; interrater; intrarater; systematic review
Year: 2021 PMID: 33786328 PMCID: PMC7960900 DOI: 10.1177/2325967120968099
Source DB: PubMed Journal: Orthop J Sports Med ISSN: 2325-9671
Reliability
| Very Good | Adequate | Doubtful | Inadequate | Not applicable | ||
|---|---|---|---|---|---|---|
| Design requirements | ||||||
| 1 | Were patients stable in the interim period on the construct to be measured? | Evidence provided that patients were stable | Assumable that patients were stable | Unclear if patients were stable | Patients were NOT stable | |
| 2 | Was the time interval appropriate? | Time interval appropriate | Doubtful whether time interval was appropriate or time interval was not stated | Time interval NOT appropriate | ||
| 3 | Were the test conditions similar for the measurements (eg type of administration, environment, instructions)? | Test conditions were similar (evidence provided) | Assumable that test conditions were similar | Unclear if test conditions were similar | Test conditions were NOT similar | |
| Statistical methods | ||||||
| 4 | For continuous scores: Was an intraclass correlation coefficient (ICC) calculated? | ICC calculated and model or formula of the ICC is described | ICC calculated but model or formula of the ICC not described or not optimal. Pearson or Spearman correlation coefficient calculated with evidence provided that no systematic change has occurred | Pearson or Spearman correlation coefficient calculated WITHOUT evidence provided that no systematic change has occurred or WITH evidence that systematic change has occurred | No ICC or Pearson or Spearman correlations calculated | Not applicable |
| 5 | For dichotomous/nominal/ordinal scores: Was kappa calculated? | Kappa calculated | No kappa calculated | Not applicable | ||
| 6 | For ordinal scores: Was a weighted kappa calculated? | Weighted Kappa calculated | Unweighted Kappa calculated or not described | Not applicable | ||
| 7 | For ordinal scores: Was the weighting scheme described? eg linear, quadratic | Weighting scheme described | Weighting scheme NOT described | Not applicable | ||
| Other | ||||||
| 8 | Were there any other important flaws in the design or statistical methods of the study? | No other important methodological flaws | Other minor methodological flaws | Other important methodological flaws | ||
From Mokkink LB, de Vet HCW, Prinsen CAC, et al. COSMIN risk of bias checklist for systematic reviews of patient-reported outcome measures. Qual Life Res. 2018;27(5):1171-1179.[44] Material distributed under the terms of the Creative Commons Attribution 4.0 International License (https://creativecommons.org/licenses/by/4.0/).
Extracted Data
| Population Description | Test Conditions | Whether Test Conditions Were Similar for the Measurements |
|---|---|---|
| Number of participants | Beighton score modifications | Participant sequence generation |
| Age | Examination setting | Whether sequence of participants was concealed |
| Sex | Number of raters | Blinding of raters |
| Diagnostic criteria | Rater professions | Key conclusions of study authors |
| Inclusion criteria | Experience | Statistical tests |
| Exclusion criteria | Training | COSMIN criteria |
| Time between measurements | Whether patients were stable in the interim |
COSMIN, Consensus-Based Standards for the Selection of Health Measurement Instruments.
Strength of Agreement for the Kappa Coefficient and Intraclass Correlations[14,39,40,5] [6]
| Kappa Coefficient | Agreement | Intraclass Correlation | Reliability |
|---|---|---|---|
| ≤0 | Poor | 0.5 | Poor |
| 0.01-0.20 | Slight | >0.5-0.75 | Moderate |
| 0.21-0.40 | Fair | >0.75-0.9 | Good |
| 0.41-0.60 | Moderate | >0.90 | Excellent |
| 0.61-0.80 | Substantial | ||
| 0.81-1.00 | Almost perfect |
Figure 1.Flow diagram summarizing the literature search, screening, and review using the PRISMA (Preferred Reporting Items for Systematic Meta-Analyses) guidelines.
Population Characteristics, Time Interval, Study Design, and Associated COSMIN Scores
| Study (Year) | Population Characteristics | Time Interval | Study Design | ||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|
| Sample, Age (y), Female Sex (%), DOP | 6.1 | Interrater | Intrarater | 6.2 | Test Condition | No. of Raters | Rater Profession | Combined Rater Experience | Rater Training | 6.3 | |
| Aartun (2014)[ | 111, 12-14, 46.8, middle school students | VG | <4 d | 1-4 h | VG | 5 item | 2 | Chiropractors | 18 y | Standardization session | VG |
| Aslan (2006)[ | 72, 20.36 ± 1.24 (18-25), 40.20, undergraduate PT students | VG | <24 h | 12.84 ± 7.41 d | VG | 5 item + goniometer | 2 | PTs | 21 y | 2 h practice together | VG |
| Baumhauer (1995)[ | 21, 18-23, 57, intercollegiate athletes | VG | 12-16 wk | NA | VG | 5 item | 2 | NS | NS | NS | A |
| Boyle (2003)[ | 42, 25.4 ± 4.2 (15-45), 100, noninjured HS athletes and PT students | VG | 15-60 min | 6 ± 4 d | VG | 5 item + goniometer | 2 | PTs | 17 y | CME, trained with index | VG |
| Bulbena (1992)[ | 173, 43.98 | VG | Consecutive | NA | D | 5 item | 2 | Rheumatologists | Experienced | NS | A |
| Cooper (2018)[ | 50, 49 (22-60), 56, community members | VG | NS | 1 wk | VG | 5 item + goniometer | 1 | NS | NS | NS | A |
| Erdogan (2012)[ | 15, 31.8 (16-50), 59.15, treated for ingrown nails | VG | NS | NS | D | 5 item + goniometer | 2 | Rheumatologists | NS | NS | A |
| Erkula (2005)[ | 50, 10.4 ± 1.2 (8-15) | VG | 2 wk | NA | VG | 5 item | 2 | Orthopaedic surgeons | NS | NS | A |
| Evans (2012)[ | 30, 10.6 ± 2.3 (7-15), 65, asymptomatic podiatry clinic patients | VG | >2 h | >2 h | VG | 5 item | 2 | Podiatrists | 21 y | NS | A |
| Fritz (2005)[ | 38, 39.2 ± 11 | VG | 5 min | NA | VG | 5 item | 2 | PTs | NS | NS | A |
| Glasoe (2002)[ | 30, 14-24, 100, athletes | VG | NS | NA | VG | 5 item | 2 | NS | >6 y | NS | A |
| Hansen (2002)[ | 100, 9-13, NS, asymptomatic competitive athletes | VG | NS | NA | D | 4/5, no fifth finger | 4 | 2 rheumatologists 1 untrained physician | NS | Guided by illustrations | A |
| Hicks (2003)[ | 63, 36 (20-66), 60.30, patients with lower back pain | VG | >15 min | NA | VG | 5 item | 4 | 3 PT, 1 PT and chiropractor | 20 y | Group review, 1 h practice | VG |
| Hirsch (2007)[ | 50, 38.3 ± 11.3 (20-60), 56, asymptomatic | VG | NS | 24.6 d | VG | 5 item + goniometer | 2 | Dentists | NS | Instructions, directed by orthopaedic surgeon | VG |
| Junge (2013)[ | 103, 7-8 and 10-12, 44 | VG | <30 min | NA | VG | 4 | PT students | NS | Trained | VG | |
| Juul-Kristensen (2007)[ | 40, 42.27 (18-71) | VG | NS | NA | D | 5 item | 2 | NS | NS | Trained per protocol | VG |
| Karim (2011)[ | 30, 24 (18-32), 100, contemporary professional dancers | VG | NS | NA | VG | 5 item | 4 | 1 PT, 3 PT students | 30 y | PT trained students | VG |
| Naal (2014)[ | 55, 28.5 ± 4.1, 32.70, symptomatic FAI cases | VG | NS | NA | D | 5 item | 2 | Clinicians | NS | NS | A |
| Pitetti (2015)[ | 25, 13.3 ± 2.9, 44, intellectually disabled | VG | 3-4 wk | NA | VG | 5 item + goniometer | 2 | DPT students | None | Peer supportive learning | VG |
| Smith (2012)[ | 5, 27, 100, patellar instability patients | VG | <1 d | 30 min | VG | 5 item | 5 | Orthopaedic surgeons | 125 y | Familiarized | VG |
| Tarara (2014)[ | 19, 20.3 ± 1.2 (male), 19.8 ± 1.0 (female), 57.89, club athletes | VG | <2.5 h | 4-7 d | VG | 5 item | 3 | 1 clinician and 2 novice students | 22 y | Prior reading, 1 h training and questions | VG |
| Vaishya (2013)[ | 300, 24.6 ± 0.9, 36.67, postoperative ACL reconstruction and controls | VG | NS | NA | D | 5 item | 2 | NS | NS | NS | A |
| Vallis (2015)[ | 36, 22.7 (18-32), 75, asymptomatic PT and OT students | VG | <1 d, 1 wk | NA | VG | 5 item + goniometer | 2 | Researchers | NS | Teaching session | VG |
| van der Giessen (2001)[ | 48, 4-12, 48.9 | VG | NS | NA | D | 5 item | 2 | PT students | 1 mo | Professional PT trained students | VG |
ACL, anterior cruciate ligament; BJHS, benign joint hypermobility syndrome; CME, continuing medical education; DOP, description of participants; DPT, doctorate of physical therapy; EDS, Ehlers-Danlos syndrome; FAI, femoroacetabular impingement; HS, high school; JHS, joint hypermobility syndrome; NA, not available/applicable; NS, not specified; OT, occupational therapy; PT, physical therapy.
Age reported as mean ± SD or range.
Calculated.
COSMIN criterion (Consensus-Based Standards for the Selection of Health Measurement Instruments; see Appendix 2 for details). Scoring: VG = very good, A = adequate, D = doubtful, I = inadequate.
Weighted average of groups or 2-phase studies.
Demographics of larger sample, of which reliability population is a subgroup.
Inter- and Intrarater Reliability and Associated COSMIN Scores
| Reliability, Mean (95% CI) | COSMIN Item | ||||||
|---|---|---|---|---|---|---|---|
| Study (Year) | Cutoff Score | Interrater | Intrarater | 6.4 | 6.5 | 6.6 | 6.7 |
| Aartun (2014)[ | ≥4/9 | κ = 0.65 (0.33 to 0.97) | κ = 0.66-1 (0.03 to 1) | NA | VG | D | A |
| ≥5/9 | κ = 0.56 (0.11 to 1.00) | κ = 1 | |||||
| Aslan (2006)[ | Composite | ICC = 0.82 | ICC = 0.92 | A | NA | NA | NA |
| Baumhauer (1995)[ | Composite | ρ = 1 | NA | I | D | A | |
| Boyle (2003)[ | Composite | ρ = 0.87 | ρ = 0.86 | D | NA | NA | NA |
| Bulbena (1992)[ | Each item | κ = 0.79-0.93 | D | VG | D | NA | |
| Cooper (2018)[ | ≥4/9 | κ = 0.96 | κ = 1 | NA | VG | D | A |
| Erdogan (2012)[ | Each item | κ = 0.71-1.0 | κ = 0.81-1.0 | NA | VG | D | A |
| Erkula (2005)[ | ρ = 0.86 | ρ = 0.62 | D | NA | NA | NA | |
| Evans (2012)[ | Composite | ICC = 0.73 | ICC = 0.96-0.98 | VG | NA | NA | NA |
| Fritz (2005)[ | Composite | ICC = 0.72 (0.50 to 0.85) | VG | NA | NA | NA | |
| Glasoe (2002)[ | Composite | κ = 0.7 | NA | VG | D | A | |
| Hansen (2002)[ | ≥4/9 | κ = 0.44-0.82 | D | VG | D | A | |
| Hicks (2003)[ | Composite | ICC = 0.79 (0.68 to 0.87) | VG | NA | NA | NA | |
| Hirsch (2007)[ | ≥4/9 | ICC >0.84 | ICC > 0.89 | A | NA | NA | NA |
| Junge (2013)[ | Each item | κ = 0.49-0.94, 0.30-0.84 | NA | VG | D | A | |
| ≥5/9 | κ = 0.64, 0.59 | ||||||
| Juul-Kristensen (2007)[ | Composite | ICC = 0.91 | VG | VG | D | A | |
| ≥5/9 | κ = 0.66 (0.30 to 1.02) | ||||||
| Karim (2011)[ | NS | κ = 0.6 | NA | VG | D | NA | |
| Naal (2014)[ | Composite | κ = 0.82 | NA | VG | VG | VG | |
| Pitetti (2015)[ | Composite | ICC = 0.88 | A | VG | D | A | |
| Each item | κ = 0.45-0.80 | ||||||
| Smith (2012)[ | Composite | κ = 0.00 (−0.16 to 0.17) | κ = 0.25 (0.03 to 0.51) | NA | VG | VG | A |
| Tarara (2014)[ | Modified composite | κ = 0.64-0.69 | Expert: κ = 0.69 (0.46 to 0.92) | NA | VG | VG | A |
| Vaishya (2013)[ | ≥4/9 | κ = 0.7 | NA | VG | D | A | |
| Vallis (2015)[ | Composite | ICC = 0.72-0.80 ([0.51-0.84] to [0.64-0.89]) | A | VG | VG | A | |
| van der Giessen (2001)[ | Composite | κ = 0.81 | NA | VG | D | A | |
A, adequate; COSMIN, Consensus-Based Standards for the Selection of Health Measurement Instruments; D, doubtful; I, inadequate; ICC, intraclass correlation; NA, not available/applicable; VG, very good.
Observer-participant reliability.
Percentage agreement omitted.
For 2 distinct methods of performing Beighton score.
Modified composite scale: 0 = pain with test, 1 = 8-9 points, 2 = 6-7 points, 3 = 4-5 points, 4 = 2-3 points, 5 = 0-1 points.
Expert-novice rater reliability.
Novice-novice rater reliability.