| Literature DB >> 32532218 |
Richard A Parker1, Charles Scott2, Vanda Inácio3, Nathaniel T Stevens4.
Abstract
BACKGROUND: Studies of agreement examine the distance between readings made by different devices or observers measuring the same quantity. If the values generated by each device are close together most of the time then we conclude that the devices agree. Several different agreement methods have been described in the literature, in the linear mixed modelling framework, for use when there are time-matched repeated measurements within subjects.Entities:
Keywords: Agreement; Concordance correlation coefficient; Limits of agreement; Method comparison studies; Repeated measures
Mesh:
Year: 2020 PMID: 32532218 PMCID: PMC7291585 DOI: 10.1186/s12874-020-01022-x
Source DB: PubMed Journal: BMC Med Res Methodol ISSN: 1471-2288 Impact factor: 4.615
Standard agreement model assumptions (with suggested procedures to check their validity in brackets)
| • Independent subjects | |
| • Normally distributed random effects (diagnosed by Q-Q plots) | |
| • Normally distributed error terms (diagnosed by Q-Q plots) | |
| • Fixed mean bias across the range of measurement (plots of standardized residuals against fitted values) | |
| • Constant between-subject and within-subject variabilities across the range of measurements (plots of residuals against fitted values) |
Fig. 1Scatterplot comparing the chest-band and gold standard measurement devices. Each point corresponds to individual measurements made on a subject. The solid diagonal line is the line of equality
Fig. 2Bland-Altman plot showing the paired difference between the devices against the average of the pairs of devices. Points shown correspond to individual pairs of observations rather than individual patients. Dashed line shows the mean bias (red) and limits of agreement (blue). Dotted lines are 95% bootstrap confidence intervals
Summary of the different statistical approaches
| Statistical Approach | Advantages/Strengths | Disadvantages | Key summary results (COPD study example) |
|---|---|---|---|
| Concordance correlation coefficient | - A widespread and frequently used method. - Can still be used in cases where defining an appropriate CAD is either very difficult or impossible. | - Heavily influenced by the degree of between-subject and between-activity variability and the range of the data. - Can be very difficult to determine if the CCC is large enough to constitute acceptable agreement. - Can be very difficult to interpret clinically: interpretation not in terms of original measurement unit. | CCC 0.68 (95% CI 0.60 to 0.72) |
| Limits of agreement | - Simplicity of application: relatively straightforward to compute limits. - Clinical interpretation is based on the original measurement scale. - Estimate of mean bias. - Easy to understand and interpret. | - Standard approach is highly dependent on the normality assumption for validity. - High variability in residual errors may mask the fact that a device could measure the true value more precisely than the gold-standard. - Easy to apply method incorrectly without explicitly specifying a clinically acceptable difference. | Mean bias −1.60 95% LoA − 11.57 to 8.38 |
| TDI | - Easy to compute. - Easy to interpret. - Clinical interpretation is based on the original measurement scale. | - Can be difficult to determine if the TDI is large enough to constitute acceptable agreement. - Does not explicitly calculate the mean bias. | TDI 10.9 (95% CI 9.4 to 12.7) |
| CP | - Easy to interpret. - Easy to compute. - Method cannot be used without explicitly specifying a clinically acceptable difference, which is based on the original measurement scale. | - Does not explicitly calculate the mean bias. | CP of 0.63 (95% CI 0.56 to 0.70) for boundary of ± 5 |
| CIA | - Directly compares the disagreement between devices against the disagreement within devices and within subjects. - Much less dependent on the between-subject and between-activity variability compared to the CCC. - Can still be used in cases where defining an appropriate CAD is either very difficult or impossible. | - Depends heavily on the within-subject within-device variance. - Relies on data which has acceptable replication error. | CIA 0.68 (95% CI 0.57 to 0.75) |
Fig. 3Bland-Altman style plot corresponding to the calculation of the coverage probability showing the raw data, raw mean bias (red), and clinically acceptable difference of 5 (green)