Literature DB >> 22021736

Inter-examiner reproducibility of clinical tests and criteria used to identify subacromial impingement syndrome.

Mikkel Vind1, Søren Bie Bogh, Camilla Marie Larsen, Hans Kromann Knudsen, Karen Søgaard, Birgit Juul-Kristensen.   

Abstract

Introduction A specific algorithm has been proposed for classifying impingement related shoulder pain in athletes with overhead activity. Data on the inter-examiner reproducibility of the suggested clinical tests and criteria and their mutual dependencies for identifying subacromial impingement symptoms (SIS) are not available. Objective To test the inter-examiner reproducibility of selected tests and criteria suggested for classifying SIS and the mutual dependencies of each of the individual tests and SIS. Method A standardised three-phase protocol for clinical reproducibility studies was followed, consisting of a training, an overall agreement and a study phase. To proceed to the study phase, an overall agreement of 0.80 was required. In total 10, 20 and 44 subjects were included in the three phases, respectively. The case prevalence in the study phase was 50%. The inclusion criterion for cases was ≥3, and for controls ≤1 positive test out of four. Cohen's κ statistics were used for calculating agreement. Results In the overall agreement phase, an agreement of 0.90 was obtained, while in the study phase it was 0.98 with a κ of 0.95 for SIS. κ Values for the individual tests varied between 0.60 and 0.95. Mutual dependencies between each test and SIS showed Neer's test with anterior pain to be most often used to determine SIS. Conclusions Inter-examiner reproducibility was moderate to almost perfect for the selected tests and criteria for SIS. The next challenge will be to establish reproducibility in clinical practice, as well as the validity of the tests and criteria for SIS.

Entities:  

Year:  2011        PMID: 22021736      PMCID: PMC3191397          DOI: 10.1136/bmjopen-2010-000042

Source DB:  PubMed          Journal:  BMJ Open        ISSN: 2044-6055            Impact factor:   2.692


Introduction

Fifty per cent of the general population experience shoulder pain every year.1 Subacromial impingement syndrome (SIS) is the most common shoulder disorder in the population, representing 44–65% of all registered shoulder complaints in the clinical setting.2 Its prevalence is especially high in sports with overhead activity, such as swimming, volleyball, handball and badminton. These overhead athletes have a high demand for optimal shoulder performance and dynamic stability is required in order to prevent injury.3–7 Until now, historical information and clinical signs have been included in the clinical examination as the most common tools for diagnosing shoulder impingement, even though radiological examinations have improved the accuracy of the diagnosis.8 MRI has recently been recommended as the best method to identify shoulder lesions, while ultrasonography is also recognised as highly accurate in detecting SIS.9 10 However, neither method is able to identify subtle instability and scapula dyskinesis, while MRI findings may be unable to differentiate between symptomatic and asymptomatic shoulders or to provide a structural diagnosis.1 Structural changes can be present without pain, thus giving a high number of false positive and negative results.1 In the clinical setting, manual diagnostic tests are fast and easy to perform compared with imaging techniques, which are both time-consuming and expensive. A recent systematic review concluded that several clinical shoulder tests have sufficient sensitivity, but inadequate specificity.11 Neer's and Hawkins' impingement tests, in particular, have been found useful in confirming SIS, but poor at ruling out pathology.9 12–14 Furthermore, Jobe's test and passive abduction had a relatively high sensitivity of 74% compared with MRI, and were described as good diagnostic tools.9 A systematic review found Neer's and Hawkins' tests to have a sensitivity and specificity of 79% and 53%, and 79% and 59%, respectively, and states that no shoulder test can by itself confirm impingement.11 It is further suggested that more large, prospective, well-designed studies should be carried out.11 A recent prospective study of patients admitted for surgery found Neer's test, the Painful Arc test and the External Rotation Resistance test to be excellent screening tools to rule out SIS.15 Furthermore, Painful Arc, the External Rotation Resistance test in the neutral position and Jobe's test had the highest diagnostic accuracy compared with surgical findings, and were best at confirming/ruling in SIS. Inter-examiner reliability showed κ values ranging from 0.39 to 0.67, but the protocol lacked an overall agreement phase, and this may have influenced κ values negatively.15 A stronger design is therefore needed to establish more accurate κ values for inter-examiner reliability. Recently, a clinical reasoning algorithm based on clinical tests has been developed as a screening tool for the detection of different shoulder pathologies in overhead athletes (figure 1).16 The type of impingement is decided upon, and then the underlying pathologies are examined by a battery of commonly accepted shoulder tests. For classification of SIS and ‘internal posterio-superior glenoid impingement’, a group of four tests is performed (Jobe's test, Neer's test, Hawkins' test and the Apprehension test). The use of a group of tests as a diagnostic tool is supported by the results in a recent study, where three or more positive tests out of five confirmed SIS, and less than three positive tests out of five ruled out SIS.15
Figure 1

Algorithm for clinical reasoning in the examination of impingement related shoulder pain (from Cools et al16). IR, internal rotation; GIRD, glenohumeral internal rotation deficit; ROM, range of motion; SLAP, superior labrum from anterior to posterior tear; SAT, scapular assistance test; SRT, scapular retraction test.

Algorithm for clinical reasoning in the examination of impingement related shoulder pain (from Cools et al16). IR, internal rotation; GIRD, glenohumeral internal rotation deficit; ROM, range of motion; SLAP, superior labrum from anterior to posterior tear; SAT, scapular assistance test; SRT, scapular retraction test. However, the algorithm has not been tested for reliability or validity, and there are no guidelines regarding how many positive test responses are required in order to differentiate between SIS and internal posterio-superior glenoid impingement. In order to validate this algorithm, it is necessary to establish the reproducibility. Therefore, the aim of this study was to determine the inter-examiner reproducibility of selected tests and criteria for classifying SIS based on the algorithm, and second, to test the mutual dependency of the individual tests and SIS.

Methods

Study design

The study was carried out using the protocol for diagnostic procedures in reproducibility studies presented by the International Federation of Manual/Musculoskeletal Medicine (FIMM).17 The protocol for a reproducibility study is a three-phase study consisting of a training phase, an overall agreement phase (>80%) and an actual study phase with a case prevalence of 50%. Two examiners (examiners A and B), with a maximum of 6 months' clinical experience, blinded to their mutual test results, performed all tests. The subjects were instructed not to give information about results from previous tests. In the training phase, the test manual was refined on the basis of feedback, and the performance and interpretation of the tests were evaluated in order to remove potential errors. The four tests (Jobe, Neer, Hawkins and Apprehension) were used as a battery of tests to detect SIS. In the overall agreement phase, examiner A tested one sample first (11 players) and examiner B tested another sample first (nine players), and then the examiners switched samples and agreements were calculated. In the study phase, the 0.50 prevalence method was applied in order to obtain an accurate κ coefficient. In this phase, one examiner selected and tested a minimum of 20 subjects (10 cases and 10 controls) and sent them to the other examiner, and vice versa. The criterion for inclusion of subjects in the study phase was therefore either SIS (≥3 positive tests) for cases or no-SIS (≤1 positive tests out of four) for controls. A prevalence of 50% was obtained through this method (figure 2). This method is recommended when one diagnostic test is performed, and therefore all four tests were reduced to one diagnostic criterion classification, either SIS or no-SIS.
Figure 2

Flow diagram of the 0.50 prevalence method (study phase). Examiners A and B perform the battery of tests on their own groups of subjects. Both examiners sent the same number of positive and negative tests to each other (minimum of 10 positive and 10 negative).

Flow diagram of the 0.50 prevalence method (study phase). Examiners A and B perform the battery of tests on their own groups of subjects. Both examiners sent the same number of positive and negative tests to each other (minimum of 10 positive and 10 negative).

Materials

According to the study protocol, the training phase was carried out on 10 subjects (three male and seven female physiotherapy students with a mean±SD age of 25±4.1 years), and the overall agreement phase was carried out on 20 subjects (active male handball players with a mean±SD age of 19.9±3.7 years) with the dominant shoulder being tested, comprising 15 right and five left shoulders. In total, 30 males and 14 females with a mean±SD age of 19.6±5.4 years, comprising 35 right and nine left shoulders, respectively (n=44), met the criteria for inclusion in the study phase. To include these 44 subjects, 134 shoulders of active overhead athletes were tested (figure 3). The study was approved by the Ethics Committee in Science of the Region of Southern Denmark.
Figure 3

Flow diagram of the subjects included in the study phase.

Flow diagram of the subjects included in the study phase.

Clinical tests

The tests were performed according to the recommendations of Cools et al16 (table 1).
Table 1

Performance of the clinical tests for SIS (from Cools et al16), described as if performed on the right shoulder

Introduction

I am going to perform four shoulder tests on you.

For each different test I will ask if you experience any pain, and you must answer either yes or no.

Sometimes I will ask if the pain is in the front or the back of your shoulder, and you use your judgement. Pain down to the insertion of the deltoid muscle is accepted as a positive response (show!).

PreparationFor Jobe's test a line is marked on the floor, on which the subject is asked to stand. On this line an angle of 40° is marked, which is used to point out the plane of the scapula.

Results are noted on the score sheet by the examiner.

SIS, subacromial impingement syndrome.

Performance of the clinical tests for SIS (from Cools et al16), described as if performed on the right shoulder I am going to perform four shoulder tests on you. For each different test I will ask if you experience any pain, and you must answer either yes or no. Sometimes I will ask if the pain is in the front or the back of your shoulder, and you use your judgement. Pain down to the insertion of the deltoid muscle is accepted as a positive response (show!). Does this hurt your shoulder? Yes/no Does this hurt your shoulder? Yes/no Does it hurt in the front of the shoulder or in the back? Does this hurt your shoulder? Yes/no Does this hurt your shoulder? Yes/no Does it hurt in the front of the shoulder or in the back? Results are noted on the score sheet by the examiner. SIS, subacromial impingement syndrome.

Jobe's test/empty can test

Jobe's test/empty can test was performed with the subject standing, and the arm to be examined in 90° elevation in the scapular plane in maximal internal rotation. The angle of the scapular plane was set to 40° as mentioned elsewhere in the literature.18 19 This angle was marked on the floor to guide subject movement. Manual resistance prevented further elevation of the arm. The test was positive if the subject experienced pain.16 Jobe's test, as a single test for impingement, has been found to have almost perfect levels of agreement.12

Neer's test

Neer's test was performed with the subject sitting on a chair. The examiner performed a maximal forward flexion of the subject's arm, while elevation of the scapula was prevented. The test was positive when pain was located in the anterior aspect of the shoulder (Neerant). Pain to the posterior aspect was also recorded. Combined with reports of anterior pain, these reports were called ‘Neer general’ (Neergen).

Hawkins' test

Hawkins' test was performed with the subject sitting on the examination table with feet on the ground. The arm was held in 90° forward flexion, elbow flexed to 90°. Passive internal rotation of the shoulder was performed. The test was positive if pain was reported. Hawkins' test was found to have almost perfect levels of agreement,12 and the test has been recommended for use in screening for impingement.11

The Apprehension test

The Apprehension test was carried out with the subject lying supine on the examination table. The shoulder (90° abduction, the elbow in 90° flexion) was passively placed in maximal external rotation. The test was considered positive if the subject reported pain from the anterior aspect of the shoulder.16

Statistics

For the dichotomous data, Cohen's κ statistics were used to calculate levels of agreement between the two examiners (inter-examiner reproducibility) with 95% CIs, and the relationship between the individual tests and the classification of SIS (mutual dependency). Furthermore, values for observed agreement, prevalence and expected agreement were calculated. Data were analysed in Microsoft Office Excel 2007, using 2×2 contingency tables. The κ values were interpreted at five levels according to Landis and Koch.20

Results

The groups of cases and controls were comparable on age with mean (±SD) ages of 19.1±5.8 years and 20.0±5.1 years, respectively. The overall agreement phase was completed with an observed agreement of 90% (table 2), meaning that the criterion of a minimum of agreement of 80% was met.
Table 2

Agreement regarding number (%) of positive and negative tests in the overall agreement and study phases for SIS, with calculations of observed agreement and κ values

SISTotal agreement in positive tests (n, %)Total agreement in negative tests (n, %)Total agreement (n, %)κ Value
Overall agreement phase (n=20)6 (30%)12 (60%)18/20 (90%)
Study phase (n=44)22 (50%)21 (48%)43/44 (98%)0.95

SIS, subacromial impingement syndrome.

Agreement regarding number (%) of positive and negative tests in the overall agreement and study phases for SIS, with calculations of observed agreement and κ values SIS, subacromial impingement syndrome. In the study phase, an ‘almost perfect’ κ value of 0.95 was calculated for SIS (table 3), while κ values for the specific tests varied between 0.60 and 0.95. κ for Neer's test was calculated as 0.86 for anterior pain (Neerant) and as 0.95 for pain in general (Neergen).
Table 3

κ Statistics and 95% CIs for SIS and each of the specific tests in the study phase

κ Value (95% CI)Prevalence of the index conditionObserved agreementExpected agreement by chance
SIS0.95 (0.87 to 1.00)0.510.980.50
Test
 Jobe's0.90 (0.76 to 1.00)0.340.950.55
 Neer's (ant pain)0.86 (0.71 to 1.00)0.530.930.50
 Neer's (general pain)0.95 (0.86 to 1.00)0.580.580.51
 Hawkins'0.60 (0.34 to 0.85)0.660.820.55
 Apprehension (ant pain)0.71 (0.59 to 0.98)0.390.860.53

ant, anterior; SIS, subacromial impingement syndrome.

κ Statistics and 95% CIs for SIS and each of the specific tests in the study phase ant, anterior; SIS, subacromial impingement syndrome. The κ values for mutual dependency indicate that Neer's test with anterior pain (Neerant) was the most frequently used test for classifying SIS, with Jobe's test being the least frequently used (table 4). The highest κ values were obtained on the Neerant with κ values for examiner A and B of 0.82 and 0.91, respectively, followed by Apprehensionant with κ=0.77 and κ=0.73, Hawkins' with κ=0.73 and κ=0.68 and Jobe's with κ=0.64 and κ=0.60.
Table 4

κ Statistics for mutual dependency of the individual tests and the diagnosis of SIS

Testκ ValuePrevalence of the index conditionObserved agreementExpected agreement by chance
Jobe's
 Examiner A0.640.410.820.50
 Examiner B0.600.440.800.49
Neer's (anterior pain)
 Examiner A0.820.500.910.50
 Examiner B0.910.540.950.50
Hawkins'
 Examiner A0.730.570.860.50
 Examiner B0.680.600.840.50
Apprehension (anterior pain)
 Examiner A0.770.440.890.50
 Examiner B0.730.450.860.50

SIS, subacromial impingement syndrome.

κ Statistics for mutual dependency of the individual tests and the diagnosis of SIS SIS, subacromial impingement syndrome.

Discussion

The results from this study showed an almost perfect κ value for SIS (κ=0.95) and for the individual tests (κ values between 0.60 and 0.90). Calculations of mutual dependencies found the highest κ values for the Neerant test followed by the Apprehensionant test, Hawkins' test and Jobe's test. Our study included an overall agreement phase and a 50% case prevalence, which may explain the higher strength/validity of the present κ values compared to those found in earlier studies examining the reproducibility of clinical shoulder tests.12 15 21 Without including an overall agreement phase, the risk of a systematic bias cannot be excluded. Only one study has examined the reproducibility of several tests as a group. Michener et al found that three positive tests out of five were most likely to confirm SIS.15 The conflicting results in the three studies mentioned above, however, together with the almost perfect levels of agreement in the current study, indicate a need for methodologically stronger study designs in order to minimise errors due to systematic bias. Implementation of a training phase, an overall agreement phase (≥80%) and a study phase including the 0.50 prevalence method in future studies is therefore recommended. In the current study, Jobe's test produced the highest κ value (κ=0.90) of all the individual tests (table 3), even though the prevalence of 0.34 weakens the credibility of the κ value compared to a prevalence of 0.50. For Neer's test for anterior pain (Neerant) and Neer's test for pain in general (Neergen), where pain is located either anteriorly and/or posteriorly in the shoulder, the κ values were 0.86 and 0.95, respectively, with a prevalence close to 0.50 (table 3). These results show that a high level of agreement was still obtained even when differentiating between anterior and posterior pain. Dromerick et al examined Neer's test and presented a substantial κ value of 0.78.21 Johansson et al12 examined, among others, Jobe's test, Neer's test and Hawkins' test. High κ values were obtained for all three tests (Jobe's κ=0.94, Neer's test κ=1.0, Hawkins' test κ=0.91), but only Jobe's test was performed with a prevalence close to 0.50.12 Michener et al also examined Jobe's, Neer's and Hawkins' tests for reproducibility and found κ values ranging from 0.39 to 0.47, with a prevalence of SIS of 0.29,15 which may explain the relatively low κ. The Apprehension test obtained a κ value of 0.71, with a prevalence of 0.39 making the result slightly weaker compared with a prevalence of 0.50. The Apprehension test was carried out in accordance with the classical protocol described by Cools et al (90° abduction and maximal external rotation).16 However, disagreement was found, since Cools et al16 refer to Meister et al22 where the test is described and performed differently (with the shoulder abducted 90–110° and in slight extension). Meister et al conclude that posterior pain is a sign of posterio-superior glenoid impingement.22 However, there are no references in the literature suggesting that the Apprehension test with anterior pain can be used for the detection of SIS. The validity of this modified test, therefore, needs to be verified, that is, the use of a modification of this test, originally designed for internal posterio-superior glenoid impingement, for the detection of SIS, which is regarded as an external anterio-superior glenoid impingement syndrome. According to our results, there is clear coherence between a positive Apprehensionant test and SIS, shown by the substantial κ values for mutual dependency (κ=0.77 for examiner A, κ=0.73 for examiner B). The high κ values could be attributed to an actual relationship between anterior pain/instability and impingement symptoms, as argued by Meister et al.22 This κ value should therefore be considered with some caution, since this was not a study on the validity of the tests. Whether there is a clear relationship between pain from the anterior aspect of the shoulder in the Apprehension test and SIS, can only be verified by imaging techniques. Also Neerant and Hawkins' tests had a relatively high mutual dependency, showing these tests to be closely related to SIS, in contrast to Jobe's test. However, we have no further explanation for this difference in mutual dependency between tests. The current mutual dependencies could have been influenced by our inclusion criteria, since subjects with two out of four positive test responses were excluded, meaning that our results for mutual dependency are deceptively high. However, the Neerant and the Apprehension tests have the highest values of mutual dependency, which indicates that these two tests should carry more weight than Jobe's test and Hawkins's test. No prior studies were found that addressed the mutual dependencies of these tests. The limitation of the study is that the current mutual dependencies may have been deceptively high because of a possible bias due to the inclusion criteria used in this algorithm, since subjects with two out of four positive test responses are not included. However, this limitation has not biased the high reproducibility of the tests and criteria for SIS. Another limitation is the unknown reproducibility of the current tests carried out by untrained clinicians, which of course might be different from the reproducibility of trained examiners. This means that reproducibility also needs to be tested in a normal clinical environment. In case inexperienced examiners have a low inter-examiner test reproducibility, our study has shown that it should be possible through education and training to obtain high enough skills to perform the tests in a reproducible way. Reproducibility is only the first step on the path to establishing the diagnostic value of these tests. Therefore, the validity (ie, concurrent, discriminative, predictive, prescriptive) and the diagnostic accuracy (ie, sensitivity and specificity) of this clinical reasoning algorithm must be determined. With respect to internal impingement, exactly the same procedures (tests for reliability and validity) need to be performed. The strength of the study lies in its design, which followed a standardised protocol, originally presented by FIMM.17 The two examiners undertook a training phase in order to minimise the risk of bias in the performance of the tests, and an overall agreement of 90% was achieved before carrying out the study phase. The study phase was performed using the 0.50 prevalence method, which strengthens the κ values. Finally, the study was carried out in overhead athletes, for whom the algorithm is intended, making the results relevant for screening purposes within this group.

Conclusions

The current study on overhead athletes showed almost perfect inter-examiner reproducibility of the tests and criteria for SIS, based on a clinical reasoning algorithm. Also each of the selected tests (Jobe's, Neer's, Hawkins' and Apprehension) presented high levels of agreement and reproducibility. Based on the mutual dependency of each of the individual tests and SIS, Neer's test with anterior pain (Neerant) had the highest level of agreement, and is therefore important in the classification of SIS. Although showing excellent levels of agreement, it is recommended that further research be carried out in order to establish reproducibility in clinical practice, and also the validity of this clinical algorithm.
  20 in total

Review 1.  Secondary impingement in the shoulder. An improved terminology in impingement.

Authors:  A K Belling Sørensen; U Jørgensen
Journal:  Scand J Med Sci Sports       Date:  2000-10       Impact factor: 4.221

Review 2.  A review of the special tests associated with shoulder examination. Part I: the rotator cuff tests.

Authors:  T Duncan Tennent; William R Beach; John F Meyers
Journal:  Am J Sports Med       Date:  2003 Jan-Feb       Impact factor: 6.202

3.  Scapular muscle recruitment patterns: trapezius muscle latency with and without impingement symptoms.

Authors:  Ann M Cools; Erik E Witvrouw; Geert A Declercq; Lieven A Danneels; Dirk C Cambier
Journal:  Am J Sports Med       Date:  2003 Jul-Aug       Impact factor: 6.202

Review 4.  Shoulder impingement syndrome. A critical review.

Authors:  F H Fu; C D Harner; A H Klein
Journal:  Clin Orthop Relat Res       Date:  1991-08       Impact factor: 4.176

5.  Accuracy of physical examination in subacromial impingement syndrome.

Authors:  L Silva; J L Andréu; P Muñoz; M Pastrana; I Millán; J Sanz; C Barbadillo; M Fernández-Castro
Journal:  Rheumatology (Oxford)       Date:  2008-03-27       Impact factor: 7.580

6.  Scapular kinematics during humeral elevation in adults and children.

Authors:  Sudarshan Dayanidhi; Margo Orlin; Scott Kozin; Susan Duff; Andrew Karduna
Journal:  Clin Biomech (Bristol, Avon)       Date:  2005-07       Impact factor: 2.063

7.  Intra- and interexaminer reliability of four manual shoulder maneuvers used to identify subacromial pain.

Authors:  Kajsa Johansson; Sören Ivarson
Journal:  Man Ther       Date:  2008-05-05

8.  Reliability and diagnostic accuracy of 5 physical examination tests and combination of tests for subacromial impingement.

Authors:  Lori A Michener; Matthew K Walsworth; William C Doukas; Kevin P Murphy
Journal:  Arch Phys Med Rehabil       Date:  2009-11       Impact factor: 3.966

Review 9.  Anatomical and biomechanical mechanisms of subacromial impingement syndrome.

Authors:  Lori A Michener; Philip W McClure; Andrew R Karduna
Journal:  Clin Biomech (Bristol, Avon)       Date:  2003-06       Impact factor: 2.063

10.  Rotator cuff tendinopathy/subacromial impingement syndrome: is it time for a new method of assessment?

Authors:  J S Lewis
Journal:  Br J Sports Med       Date:  2008-10-06       Impact factor: 13.800

View more
  4 in total

1.  Scapular muscle-activation ratios in patients with shoulder injuries during functional shoulder exercises.

Authors:  Chad R Moeller; Kellie C Huxel Bliven; Alison R Snyder Valier
Journal:  J Athl Train       Date:  2014-05-19       Impact factor: 2.860

2.  Communication breakdown: clinicians disagree on subacromial impingement.

Authors:  Pieter Bas de Witte; Jurriaan H de Groot; Erik W van Zwet; Paula M Ludewig; Jochem Nagels; Rob G H H Nelissen; Jon P Braman
Journal:  Med Biol Eng Comput       Date:  2013-04-25       Impact factor: 2.602

3.  A neuromuscular exercise programme versus standard care for patients with traumatic anterior shoulder instability: study protocol for a randomised controlled trial (the SINEX study).

Authors:  Henrik Eshoj; Sten Rasmussen; Lars Henrik Frich; Inge Hvass; Robin Christensen; Steen Lund Jensen; Jens Søndergaard; Karen Søgaard; Birgit Juul-Kristensen
Journal:  Trials       Date:  2017-02-28       Impact factor: 2.279

4.  Comprehensive supervised heavy training program versus home training regimen in patients with subacromial impingement syndrome: a randomized trial.

Authors:  Pierre Schydlowsky; Marcin Szkudlarek; Ole Rintek Madsen
Journal:  BMC Musculoskelet Disord       Date:  2022-01-15       Impact factor: 2.362

  4 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.