| Literature DB >> 17605816 |
Désirée van der Heijde1, Robert Landewé, Annelies Boonen, Steve Einstein, Gertraud Herborn, Rolf Rau, Siegfried Wassenberg, Barbara N Weissman, Carl S Winalski, John T Sharp.
Abstract
The objective of the present study was to test the hypothesis that experts recognize repair of erosions and, if so, to determine which, if any, morphologic features permitted them to recognize the repair. We also tested whether scoring by a standard method detected repair. Seven experienced readers of radiographs in rheumatoid arthritis were presented with 64 sets of single joints-of-interest at two time points, randomized and blinded for the correct sequence. The readers assessed which joint was better, and recorded whether any of six specific features were seen. Two independent readers, experienced in scoring by the van der Heijde-modified Sharp method who were not on the expert panel, then scored the complete films that included the joint-of-interest. The panel agreed very well on which of two joints was better, and, even though they did not know the true sequence, the panel accurately assigned a sequence slightly better than chance alone (58%) but worse than their agreement on which image was 'better or worse' (78%). The readers therefore indirectly assigned repair by choosing the second film as the best. Putative repair features were seen in cases of both repair and progression, and were not discriminatory. Similar results were obtained when the experts were presented with the entire hand or foot containing the joint-of-interest. In the third repair exercise, two independent readers who scored whole hands and feet using a standard method found a mean negative score in 22/60 joints-of-interest. All 22 joints were also scored as repair by the panel. Repair was detected reliably by a majority of the panel on viewing paired images based on a better/worse decision and assigning sequence in a set of images that were blinded for sequence by an independent project manager. In this test set of images, repair was manifested by a reduction in the size of erosion in many cases. Size was one feature that aided the experts to detect repair but cannot be the only one; the experts had to find other features to determine whether a smaller erosion was the first in a sequence of radiographs in a patient with progressive damage or was the second film in a patient exhibiting repair. The change in size of erosion was also picked up by independent readers applying the van der Heijde-modified Sharp scoring method and was reflected in their scores.Entities:
Mesh:
Substances:
Year: 2007 PMID: 17605816 PMCID: PMC2206368 DOI: 10.1186/ar2220
Source DB: PubMed Journal: Arthritis Res Ther ISSN: 1478-6354 Impact factor: 5.156
Study decision tree
| Reader judged image A | Combined with true sequence of image Ab | Conclusion in analysisb |
| Bettera | First time point | Progression |
| Better | Second time point | Repair |
| Better and first time point | First time point | Reader recognized progression |
| Better and first time point | Second time point | Reader failed to recognize repair |
| Better and second time point | Second time point | Reader recognized repair |
| Better and second time point | First time point | Reader failed to recognize progression |
The true sequence was unknown to the reader.
aA similar decision tree constructed for a reader judging image A as worse exchanges repair and progression in the conclusion column.
bIn Exercises I and II, when the reader made a judgement as to whether a pair of images represented progression or repair that decision was called direct assignment. In the exercises when the analyst interpreted a reader's combined responses on better/worse and the sequence of images, this is called an inferred or an indirect assignment.
Results of specific repair features in Exercise Ia
| Single repair feature | First film is betterb (progression) ( | Second film is better (repair) ( | Odds ratio to detect repair | True positive rate (sensitivity) | False positive rate (1 – specificity) | Positive likelihood ratio | Negative likelihood ratio |
| Filling-in of erosions | 112 (33%)* | 225 (67%)** | 2.2 | 0.89 | 0.78 | 1.1 | 0.50 |
| Cortication | 84 (30%) | 192 (70%) | 2.2 | 0.76 | 0.59 | 1.3 | 0.58 |
| Sclerosis | 68 (31%) | 149 (69%) | 1.6 | 0.59 | 0.47 | 1.1 | 0.78 |
| Remodelling | 32 (24%) | 97 (76%) | 2.1 | 0.38 | 0.22 | 1.7 | 0.79 |
| Trabeculation | 41 (37%) | 78 (63%) | 1.1 | 0.20 | 0.21 | 1.0 | 1.0 |
| Reconstitution of a normal joint | 41 (53%) | 37 (47%) | 0.43 | 0.15 | 0.29 | 0.5 | 1.2 |
| Any of the above features of repair | 130 (36%) | 234 (64%) | 1.2 | 0.92 | 0.91 | 1.0 | 0.89 |
aTest performance in Exercise I of putative features of repair in relation to progression and repair as indicated by inferred assignment (first film is better versus second film is better). Of the total number of 397 observations, 143 (36%) were judged as showing progression and 254 (64%) as showing repair without taking into account specific features of repair. Adding information on features of repair only marginally influences the discrimination between progression and repair.
bDesignation of first or second film based on the true sequence. Numbers indicate the numbers of observations in which a given single repair feature was recorded as present. Percentages indicate the positive predictive value of a specific repair feature for a progression* or repair** classification. For example, 'filling-in' was observed 325 times: 103 times (32%) in cases with progression and 222 times (68%) in cases of repair. The positive likelihood ratio is the quotient of the true positive rate divided by the false positive rate (for example, 'filling-in' in cases of repair) and the false positive rate (for example, 'filling-in' in cases of progression). The negative likelihood ratio is the quotient of the false negative rate (no 'filling-in' in cases of repair) divided by the true negative rate (no 'filling-in' in cases of progression). In order to be of diagnostic value, the positive likelihood ratio should be high (for example, > 4) and the negative likelihood ratio should be low (for example, < 0.3).
Assignment of progression or repair based on direct assignment versus inferred assignment in Exercise IIa
| Whole hand direct assignment of progression/repair | Single joint inferred assignment of progression/repair | ||||
| Total | First film is better (progression) | Second film is better (repair) | Both films are similar (no change) | No majority agreement obtained | |
| Progression | 28 | 18 | 8 | 0 | 2 |
| Repair | 9 | 0 | 9 | 0 | 0 |
| No change | 5 | 0 | 0 | 4 | 1 |
| No majority agreement obtained | 22 | 3 | 11 | 0 | 8 |
| Total | 64 | 21 | 28 | 4 | 11 |
aBased on agreement by five of seven readers.
Typical example of the direct versus the inferred assignment of one of the readers
| Reader's direct judgement (direct assignment) | Reader's better/worse interpretation in combination with true time order of X-rays (inferred assignment): | Totals | |
| Compatible with progression or no change | Compatible with repair | ||
| Progression/no change | 21 | 15 | 36 |
| Repair | 1 | 27 | 28 |
| Totals | 22 | 42 | 64 |
The inferred assignment is considered the gold standard. Direct assignment underestimates the true prevalence of repair. Percentage correctly classified, 75%; false direct assignment of repair, 1.5%; false direct assignment of progression, 23%; positive predictive value, 96%; negative predictive value, 58%.
Figure 1Agreement between negative van der Heijde-modified Sharp scores and the expert panel judgement on repair. Comparison of the number of cases with a negative mean score of the van der Heijde-modified Sharp score by two readers for the joint-of-interest (total n = 22) with the numbers of experts (total n = 7) assessing the joint as showing repair.