Literature DB >> 21622769

Diagnostic performance of the ACR/EULAR 2010 criteria for rheumatoid arthritis and two diagnostic algorithms in an early arthritis clinic (REACH).

Celina Alves1, Jolanda Jacoba Luime, Derkjen van Zeben, Anne-Margriet Huisman, Angelique Elisabeth Adriana Maria Weel, Pieternella Johanna Barendregt, Johanna Maria Wilhelmina Hazes.   

Abstract

INTRODUCTION: An ACR/EULAR task force released new criteria to classify rheumatoid arthritis at an early stage. This study evaluates the diagnostic performance of these criteria and algorithms by van der Helm and Visser in REACH.
METHODS: Patients with symptoms ≤12 months from REACH were used. Algorithms were tested on discrimination, calibration and diagnostic accuracy of proposed cut-points. Two patient sets were defined to test robustness; undifferentiated arthritis (UA) (n=231) and all patients including those without synovitis (n=513). The outcomes evaluated were methotrexate use and persistent disease at 12 months.
RESULTS: In UA patients all algorithms had good areas under the curve 0.79, 95% CI 0.73 to 0.83 for the ACR/EULAR criteria, 0.80, 95% CI 0.74 to 0.87 for van der Helm and 0.83, 95% CI 0.77 to 0.88 for Visser. All calibrated well. Sensitivity and specificity were 0.74 and 0.66 for the ACR/EULAR criteria, 0.1 and 1.0 for van der Helm and 0.59 and 0.93 for Visser. Similar results were found in all patients indicating robustness.
CONCLUSION: The ACR/EULAR 2010 criteria showed good diagnostic properties in an early arthritis cohort reflecting daily practice, as did the van der Helm and Visser algorithms. All were robust. To promote uniformity and comparability the ACR/EULAR 2010 criteria should be used in future diagnostic studies.

Entities:  

Mesh:

Substances:

Year:  2011        PMID: 21622769      PMCID: PMC3147228          DOI: 10.1136/ard.2010.142299

Source DB:  PubMed          Journal:  Ann Rheum Dis        ISSN: 0003-4967            Impact factor:   19.103


Recently an American College of Rheumatology/European League Against Rheumatism (ACR/EULAR) task force released new classification criteria for rheumatoid arthritis (RA) at an early stage.1 These criteria might also have diagnostic value early in the disease process although this has not yet been evaluated. Early diagnosis is important to improve patient outcome by early treatment to prevent joint damage and functional impairment.2 The previous classification criteria for RA (the 1987 ACR criteria) proved inadequate in the early stages of disease.3 4 This led to the development of other diagnostic algorithms.5 6 These algorithms showed good diagnostic performance and identified patients at an early stage of the disease.7–9 Diagnostic algorithms tend to be overoptimistic in their capabilities when only tested in the population they were derived from.10 For instance, if a high erythrocyte sedimentation rate (ESR) is an important predictor for RA but in the derivation cohort by chance only a few patients had a high ESR, the data-driven way in which these algorithms are build will not identify this predictor. Therefore before use in practice the discriminative abilities of such algorithms should be tested in another cohorts with similar patients (similar incidence rate). In addition, the robustness of algorithms to variation of incidence rates can be tested in cohorts with different previous disease probabilities.11–13 We aim to evaluate the diagnostic performance of the ACR/EULAR 2010 criteria and two diagnostic algorithms simultaneously to predict methotrexate use or persistent disease in the Rotterdam Early Arthritis Cohort (REACH). In addition, we will test robustness after defining two patient sets in the same cohort resulting in different previous probabilities of developing RA.

Methods

Diagnostic algorithms

Three diagnostic algorithms were evaluated. The first is the new ACR/EULAR 2010 criteria set.1 The other two, the algorithms by van der Helm and the one by Visser, are existing, well-known models.5 6

Validation cohort

Clinical data used were from REACH. This ongoing, prospective, inception cohort study was set up in the greater Rotterdam area in July 2004. Patients were recruited either via their general practitioner, or via the outpatient rheumatology clinic of three hospitals at first consultation. Patients were included in case of one or more swollen joint or, in the absence of joint swelling, if they had two or more joints with pain or loss of movement with two or more of the following criteria: morning stiffness for more than 1 h; unable to clench a fist in the morning; pain when shaking someone's hand; pins and needles in the fingers; difficulties wearing rings or shoes; a family history of RA; unexplained fatigue for less than 1 year. Patients were excluded if their symptoms resulted from trauma or overexertion, were for over 12 months, or if they were younger than 16 years. A trained research nurse at the REACH clinic took a standardised history and conducted a physical examination at baseline, 6 and 12 months, including blood and urine samples. For the current analysis data from baseline and 1 year were used. Physical examination included the measurement of tender and swollen joints, using a 44 joint count. Laboratory variables included IgM-rheumatoid factor (ELISA), anti-cyclic citrullinated peptide (Elia CCP on immunoCAP 250; Phadia Freiburg, Germany),C-reactive protein (local standards) and ESR (local standards). x-Rays of hands and feet were assessed for bony erosions at baseline. For a detailed description of REACH, see Geuskens et al.14

Statistical analyses

To asses overall performance the prediction algorithms were tested on discrimination and calibration.15 Discrimination is the ability of an algorithm to differentiate correctly between patients with and without the disease. Calibration reveals the ability to estimate the probability of the diagnosis for individuals correctly by comparing the probability predicted by the algorithm and the observed probability. To evaluate discrimination receiver operating characteristic curves, including corresponding areas under the curve (AUC), were calculated. Calibration was evaluated using calibration plots and the Hosmer–Lemeshow test.15 The latter indicates good calibration if a non-significant result appears. To assess diagnostic performance of the algorithms sensitivity, specificity, positive predictive values (PPV) and negative predictive values (NPV) were estimated at cut-points proposed for treatment initiation among patients at risk of RA. For the ACR/EULAR 2010 criteria and Visser algorithm a score of 6 or more1 16 was used and for van der Helm a score of 8 or more5 was used. To test robustness this analysis was repeated among all patients included in REACH. This group had a lower previous disease probability by a case-mix of synovitis and inflammatory joint complaints without synovitis. Synovitis was defined as joint swelling. As a classifier for correct diagnosis two outcomes were evaluated at 1 year: the use of methotrexate and persistent disease, defined as synovitis present at physical examination after 1 year, or the use of disease-modifying antirheumatic drugs (DMARD) including biological agents. Patients with a definite alternative diagnosis such as gout were not classified as persistent disease. A complete case analysis was done.

Results

Up to 31 October 2008, 875 patients were referred to REACH and had 1-year follow-up. One hundred and 13 patients did not fulfil the inclusion criteria and 31 patients were lost to follow-up at baseline (see supplementary figure S1, available online only). Patients used in the development of the ACR/EULAR 2010 criteria were excluded (n=216).1 Table 1 reports baseline characteristics of all patients (n=513). Patients had a mean age of 50 years, 73% were women and the median symptom duration was 106 days (range 1–366 days). At baseline 48% (n=246) presented with synovitis. After 1 year, 148 of 513 used methotrexate, of whom 22 did not have synovitis at baseline, and 231 of 513 patients had persistent disease, of whom 59 did not have synovitis at baseline.
Table 1

Patient characteristics for each patient set

UA (n=231)All patients (n=513)
Women (%)6873
Age, years (mean, SD)53 (14)50 (14)
SJC (median, range)4 (1–38)0 (0–38)
TJC (median, range)7 (0–42)6 (0–42) mv=2
RF positive (%)35%26%
Anti-CCP positive (%)28% mv=619% mv=10
ESR, mm/h (median, range)18 (1–103) mv=714 (0–103) mv=15
CRP, mg/l (median, range)6 (1–180) mv=165 (1–180) mv=40
Erosions (%)9% mv=44% mv=9
RA, according to 1987 ACR criteria29% mv=314% mv=5
RA, according to 2010 ACR/EULAR criteria45% mv=1258% mv=6
Persistent arthritis at 1 year45% mv=971% mv=3

ACR, American College of Rheumatology; CCP, cyclic citrullinated protein; CRP, C-reactive protein; ESR, erythrocyte sedimentation rate; EULAR, European League Against Rheumatism; mv, missing values; RA, rheumatoid arthritis; RF, rheumatoid factor; SJC, swollen joint count; TJC, tender joint count; UA, undifferentiated arthritis.

Patient characteristics for each patient set ACR, American College of Rheumatology; CCP, cyclic citrullinated protein; CRP, C-reactive protein; ESR, erythrocyte sedimentation rate; EULAR, European League Against Rheumatism; mv, missing values; RA, rheumatoid arthritis; RF, rheumatoid factor; SJC, swollen joint count; TJC, tender joint count; UA, undifferentiated arthritis.

Discrimination

Table 2 shows AUC of each diagnostic algorithm for both outcomes. In undifferentiated arthritis (UA) patients (n=231) the AUC for methotrexate use were comparable, with overlapping 95% CI, 0.79 (95% CI 0.73 to 0.83) for the ACR/EULAR 2010 criteria, 0.80 (95% CI 0.74 to 0.87) for the van der Helm algorithm and 0.83 (95% CI 0.77 to 0.88) for the Visser algorithm. For persistent disease the AUC were 0.77 (95% CI 0.71 to 0.85) for the ACR/EULAR 2010 criteria, 0.78 (95% CI 0.71 to 0.85) for the van der Helm algorithm and 0.77 (95% CI 0.71 to 0.83) for the Visser algorithm. In all patients (n=513) the AUC were comparable for both outcomes, with slightly better performance of the van der Helm algorithm; 0.88 (95% CI 0.84 to 0.91) and 0.83 (95% CI 0.79 to 0.87).
Table 2

Area under the receiver operating characteristic curves with 95% CI for each algorithm and each patient set

ACR/EULAR 2010Van der HelmVisser
Outcome methotrexate use
 UA patients0.79 (0.73 to 0.85)0.80 (0.74 to 0.87)0.83 (0.77 to 0.88)
 All patients0.79 (0.75 to 0.83)0.88 (0.84 to 0.91)0.80 (0.76 to 0.85)
Outcome persistent disease
 UA patients0.77 (0.71 to 0.85)0.78 (0.71 to 0.85)0.77 (0.71 to 0.83)
 All patients0.74 (0.70 to 0.78)0.83 (0.79 to 0.87)0.74 (0.70 to 0.79)

UA, undifferentiated arthritis.

Area under the receiver operating characteristic curves with 95% CI for each algorithm and each patient set UA, undifferentiated arthritis.

Calibration

Calibration plots of all diagnostic algorithms are shown in figure S2 (see supplementary figure S2, available online only). In UA patients (n=513) calibration was worse than in all patients, although the Hosmer–Lemeshow test was not significant for any of the calibration plots. All algorithms calibrated well in all patients (n=513), confirmed by the Hosmer–Lemeshow test.

Evaluating diagnostic performance using proposed cut-points

To identify patients in need of treatment proposed cut-points were tested in UA patients. The ACR/EULAR criteria showed a sensitivity of 0.74 (95% CI 0.65 to 0.82) and a specificity of 0.66 (95% CI 0.54 to 0.76), with the cut-point of 6 or higher using methotrexate as a classifier for correct diagnosis (table 3). The Visser algorithm and the van der Helm algorithm had a lower sensitivity, 0.47 and 0.59 for the Visser algorithm for both outcomes and 0.08 and 0.10 for the van der Helm algorithm. Specificity was higher: 0.93 for the Visser algorithm and 1.0 for the van der Helm algorithm.
Table 3

Sensitivity, specificity, and PPV and NPV at the proposed cut‑points in UA patients

Sensitivity (95% CI)Specificity (95% CI)PPV (95% CI)NPV (95% CI)
methotrexate use
 ACR/EULAR 2010 criteria0.74 (0.65 to 0.82)0.66 (0.54 to 0.76)0.76 (0.67 to 0.83)0.63 (0.52 to 0.73)
 van der Helm algorithm0.10 (0.05 to 0.17)1.0 (0.95 to 1.0)1.0 (0.74 to 1.0)0.43 (0.35 to 0.50)
 Visser algorithm0.59 (0.50 to 0.68)0.93 (0.85 to 0.97)0.92 (0.84 to 0.97)0.62 (0.53 to 0.71)
Persistent disease
 ACR/EULAR 2010 criteria0.69 (0.61 to 0.76)0.72 (0.59 to 0.83)0.87 (0.80 to 0.92)0.46 (0.35 to 0.56)
 van der Helm algorithm0.08 (0.05 to 0.14)1.0 (0.93 to 1.0)1.0 (0.75 to 1.0)0.27 (0.21 to 0.34)
 Visser algorithm0.47 (0.39 to 0.55)0.93 (0.84 to 0.98)0.95 (0.87 to 0.99)0.40 (0.31 to 0.48)

ACR, American College of Rheumatology; EULAR, European League Against Rheumatism; NPV, negative predictive value; PPV, positive predictive value, UA, undifferentiated arthritis.

Sensitivity, specificity, and PPV and NPV at the proposed cut‑points in UA patients ACR, American College of Rheumatology; EULAR, European League Against Rheumatism; NPV, negative predictive value; PPV, positive predictive value, UA, undifferentiated arthritis. The PPV is the probability that a patient has the disease if the test is positive. The van der Helm algorithm had the highest PPV; 1.0. The NPV is the opposite probability and was highest for the ACR/EULAR criteria with 0.63 for methotrexate use and 0.46 for persistent disease, slightly higher than the Visser algorithm.

Discussion

The results of our study show that the new ACR/EULAR 2010 criteria could aid diagnostics in early arthritis patients. They had good overall performance, with a sufficiently high AUC and good performance of the proposed cut-point of 6 for persistent disease, which could be considered RA. The other algorithms performed well when tested for discriminatory properties (AUC and calibration), although the van der Helm algorithm failed to detect cases at the proposed cut-point. To promote uniformity and comparability of studies we would suggest using the ACR/EULAR 2010 criteria in future diagnostic studies. The cut-point of 6 in the ACR/EULAR 2010 criteria was well chosen and showed good diagnostic performance, even though it was not intended for diagnostic purposes.1 Choosing a cut-point is a trade-off between harm of treatment in non-cases (overtreatment) and harm of no treatment in cases (undertreatment).13 17 Ideally a cut-point has a high sensitivity to prevent undertreatment and a high specificity to prevent overtreatment. However, a high specificity is often accompanied by a low to moderate sensitivity and vice versa. For the ACR/EULAR 2010 criteria both sensitivity and specificity were approximately 70%. Using this cut-point of 6 to start treatment, in this study 30% of persistent patients would not be treated, whereas 30% of the non-persistent patients would have been. Lowering the cut-point to 4 increases sensitivity to 0.92 at the cost of specificity (0.33). Increasing it to 7 had a sensitivity of 0.53 and a specificity of 0.85. Perhaps creating a low, intermediate and high-risk group for disease using dual cut-points would enable treatment with different intensities. The cut-point of 6 was chosen using the AUC of three cohorts, including our own. In this study the AUC for methotrexate use, 0.79, was similar to that in the derivation article (0.66–0.82), indicating consistency. It was also similar (0.77) for persistent disease. It could be argued that this is a direct result of the use of our data in the derivation cohort. However, the decision to use 6 as cut-point was based on expert opinion and two other cohorts. Furthermore, patients included in the derivation of the ACR/EULAR 2010 criteria were removed from analyses. The strengths of our study include the heterogeneity of patients’ subsets to test robustness of the algorithms and simultaneous evaluation of three diagnostic algorithms in one study sample. We showed that the ACR/EULAR criteria and both algorithms were robust in a case-mix of synovitis and non-synovitis patients. Calibration was good for all algorithms, but not perfect. Calibration and robustness have not been evaluated before by others, but discrimination was. The van der Helm algorithm showed AUC of 0.82–0.88 and the Visser algorithm an AUC of 0.70, both similar to the AUC in the present study.7–9 This study should be interpreted in the light of current developments in diagnostic research in RA. Current diagnostic studies within RA are faced with defining a suitable outcome. We defined two outcomes; methotrexate use similar to the definition of the ACR/EULAR 2010 and persistent disease (either synovitis or DMARD use at 12 months).1 This may have led to misclassification in two ways. Patients could be classified as true positive because they were still using methotrexate or other DMARD at 12 months, whereas in fact some patients may not need treatment. Likewise, patients may have had episodes of arthritis with no episode or DMARD use at 12 months, while later on they developed persistent arthritis. In conclusion, the new ACR/EULAR 2010 criteria showed good diagnostic properties in an early arthritis cohort reflecting daily clinical practice, as did the van der Helm and Visser algorithms. All were robust. To promote uniformity and comparability we would suggest using the ACR/EULAR 2010 criteria in future diagnostic studies.
  14 in total

1.  2010 rheumatoid arthritis classification criteria: an American College of Rheumatology/European League Against Rheumatism collaborative initiative.

Authors:  Daniel Aletaha; Tuhina Neogi; Alan J Silman; Julia Funovits; David T Felson; Clifton O Bingham; Neal S Birnbaum; Gerd R Burmester; Vivian P Bykerk; Marc D Cohen; Bernard Combe; Karen H Costenbader; Maxime Dougados; Paul Emery; Gianfranco Ferraccioli; Johanna M W Hazes; Kathryn Hobbs; Tom W J Huizinga; Arthur Kavanaugh; Jonathan Kay; Tore K Kvien; Timothy Laing; Philip Mease; Henri A Ménard; Larry W Moreland; Raymond L Naden; Theodore Pincus; Josef S Smolen; Ewa Stanislawska-Biernat; Deborah Symmons; Paul P Tak; Katherine S Upchurch; Jirí Vencovsky; Frederick Wolfe; Gillian Hawker
Journal:  Ann Rheum Dis       Date:  2010-09       Impact factor: 19.103

Review 2.  Sensitivity and specificity of the American College of Rheumatology 1987 criteria for the diagnosis of rheumatoid arthritis according to disease duration: a systematic literature review and meta-analysis.

Authors:  F Banal; M Dougados; C Combescure; L Gossec
Journal:  Ann Rheum Dis       Date:  2008-08-26       Impact factor: 19.103

Review 3.  Multivariable prognostic models: issues in developing models, evaluating assumptions and adequacy, and measuring and reducing errors.

Authors:  F E Harrell; K L Lee; D B Mark
Journal:  Stat Med       Date:  1996-02-28       Impact factor: 2.373

4.  How to diagnose rheumatoid arthritis early: a prediction model for persistent (erosive) arthritis.

Authors:  Henk Visser; Saskia le Cessie; Koen Vos; Ferdinand C Breedveld; Johanna M W Hazes
Journal:  Arthritis Rheum       Date:  2002-02

5.  The American Rheumatism Association 1987 revised criteria for the classification of rheumatoid arthritis.

Authors:  F C Arnett; S M Edworthy; D A Bloch; D J McShane; J F Fries; N S Cooper; L A Healey; S R Kaplan; M H Liang; H S Luthra
Journal:  Arthritis Rheum       Date:  1988-03

6.  Long-term impact of early treatment on radiographic progression in rheumatoid arthritis: A meta-analysis.

Authors:  Axel Finckh; Matthew H Liang; Carmen Mugica van Herckenrode; Paola de Pablo
Journal:  Arthritis Rheum       Date:  2006-12-15

7.  Assessing the generalizability of prognostic information.

Authors:  A C Justice; K E Covinsky; J A Berlin
Journal:  Ann Intern Med       Date:  1999-03-16       Impact factor: 25.391

8.  What do we mean by validating a prognostic model?

Authors:  D G Altman; P Royston
Journal:  Stat Med       Date:  2000-02-29       Impact factor: 2.373

9.  Validation of a prediction rule for disease outcome in patients with recent-onset undifferentiated arthritis: moving toward individualized treatment decision-making.

Authors:  Annette H M van der Helm-van Mil; Jacqueline Detert; Saskia le Cessie; Andrew Filer; Hans Bastian; Gerd R Burmester; Tom W J Huizinga; Karim Raza
Journal:  Arthritis Rheum       Date:  2008-08

10.  Work and sick leave among patients with early inflammatory joint conditions.

Authors:  Goedele A Geuskens; Johanna M W Hazes; Pieternella J Barendregt; Alex Burdorf
Journal:  Arthritis Rheum       Date:  2008-10-15
View more
  16 in total

1.  The new 2010 ACR/EULAR criteria as predictor of clinical and radiographic response in patients with early arthritis.

Authors:  R B Mueller; M Schiff; T Kaegi; A Finckh; S R Haile; H Schulze-Koops; J von Kempis
Journal:  Clin Rheumatol       Date:  2014-07-15       Impact factor: 2.980

Review 2.  The epidemiology of early inflammatory arthritis.

Authors:  Johanna M W Hazes; Jolanda J Luime
Journal:  Nat Rev Rheumatol       Date:  2011-06-14       Impact factor: 20.543

3.  The 2010 ACR/EULAR classification criteria for rheumatoid arthritis in the Heinola inception cohort--diagnoses confirmed by long-term follow-up.

Authors:  Kalevi Kaarela; Jenni E Kauppi; Markku J Kauppi
Journal:  Clin Rheumatol       Date:  2011-11-04       Impact factor: 2.980

4.  [Prevalence, comorbidity and interdisciplinary treatment of rheumatoid arthritis - Insurance data on outpatient and inpatient care in Baden-Württemberg].

Authors:  A Strahl; O Schneider; J Frankenhauser-Mannuß; S Knapstein; C Hermann; B Lembeck; H-M Lorenz; W Rüther; J Flechtenmacher
Journal:  Z Rheumatol       Date:  2018-03       Impact factor: 1.372

5.  Circulating miRNA Correlates with Lipid Profile and Disease Activity in Psoriatic Arthritis, Rheumatoid Arthritis, and Ankylosing Spondylitis Patients.

Authors:  Krzysztof Bonek; Ewa Kuca Warnawin; Anna Kornatka; Magdalena Plebańczyk; Tomasz Burakowski; Włodzimierz Maśliński; Małgorzata Wisłowska; Piotr Głuszko; Marzena Ciechomska
Journal:  Biomedicines       Date:  2022-04-13

6.  The performance of a point of care test for detection of anti-mutated citrullinated vimentin and rheumatoid factor in early rheumatoid arthritis.

Authors:  Preeda Rojanasantikul; Prapa Pattrapornpisut; Kulvara Anuruckparadorn; Wanruchada Katchamart
Journal:  Clin Rheumatol       Date:  2014-02-28       Impact factor: 2.980

7.  Performance of the 2010 classification criteria for rheumatoid arthritis: a systematic literature review and a meta-analysis.

Authors:  Garifallia Sakellariou; Carlo Alberto Scirè; Antonella Zambon; Roberto Caporali; Carlomaurizio Montecucco
Journal:  PLoS One       Date:  2013-02-20       Impact factor: 3.240

Review 8.  Computer-based diagnostic expert systems in rheumatology: where do we stand in 2014?

Authors:  Hannes Alder; Beat A Michel; Christian Marx; Giorgio Tamborrini; Thomas Langenegger; Pius Bruehlmann; Johann Steurer; Lukas M Wildi
Journal:  Int J Rheumatol       Date:  2014-07-08

9.  The incidence of rheumatoid arthritis in the UK: comparisons using the 2010 ACR/EULAR classification criteria and the 1987 ACR classification criteria. Results from the Norfolk Arthritis Register.

Authors:  Jennifer H Humphreys; Suzanne M M Verstappen; Kimme L Hyrich; Jacqueline R Chipping; Tarnya Marshall; Deborah P M Symmons
Journal:  Ann Rheum Dis       Date:  2012-09-03       Impact factor: 19.103

10.  Application of the 2010 ACR/EULAR classification criteria in patients with very early inflammatory arthritis: analysis of sensitivity, specificity and predictive values in the SAVE study cohort.

Authors:  Iuliia Biliavska; Tanja A Stamm; Jose Martinez-Avila; Thomas W J Huizinga; Robert B M Landewé; Günter Steiner; Daniel Aletaha; Josef S Smolen; Klaus P Machold
Journal:  Ann Rheum Dis       Date:  2012-09-15       Impact factor: 19.103

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.