| Literature DB >> 32456091 |
Oke Gerke1,2.
Abstract
The Bland-Altman Limits of Agreement is a popular and widespread means of analyzing the agreement of two methods, instruments, or raters in quantitative outcomes. An agreement analysis could be reported as a stand-alone research article but it is more often conducted as a minor quality assurance project in a subgroup of patients, as a part of a larger diagnostic accuracy study, clinical trial, or epidemiological survey. Consequently, such an analysis is often limited to brief descriptions in the main report. Therefore, in several medical fields, it has been recommended to report specific items related to the Bland-Altman analysis. The present study aimed to identify the most comprehensive and appropriate list of items for such an analysis. Seven proposals were identified from a MEDLINE/PubMed search, three of which were derived by reviewing anesthesia journals. Broad consensus was seen for the a priori establishment of acceptability benchmarks, estimation of repeatability of measurements, description of the data structure, visual assessment of the normality and homogeneity assumption, and plotting and numerically reporting both bias and the Bland-Altman Limits of Agreement, including respective 95% confidence intervals. Abu-Arafeh et al. provided the most comprehensive and prudent list, identifying 13 key items for reporting (Br. J. Anaesth. 2016, 117, 569-575). An exemplification with interrater data from a local study accentuated the straightforwardness of transparent reporting of the Bland-Altman analysis. The 13 key items should be applied by researchers, journal editors, and reviewers in the future, to increase the quality of reporting Bland-Altman agreement analyses.Entities:
Keywords: Bland–Altman plot; Limits of Agreement; Tukey mean-difference plot; agreement; confidence interval; interrater; method comparison; repeatability; reporting; reproducibility
Year: 2020 PMID: 32456091 PMCID: PMC7278016 DOI: 10.3390/diagnostics10050334
Source DB: PubMed Journal: Diagnostics (Basel) ISSN: 2075-4418
Figure 1Preferred Reporting Items for Systematic Reviews and Meta-Analysis (PRISMA) flow diagram. GRRAS: Guidelines for Reporting Reliability and Agreement Studies [19].
Characteristics of studies included.
| Publication | Field/Area | Search Approach or Target Journals | Time Frame | Evidence Base |
|---|---|---|---|---|
| Flegal (2019) [ | Self-reported vs. measured weight and height | Unrestricted; reference lists of systematic reviews, repetition of 2 PubMed searches of these, “related articles” in PubMed | 1986–May 2019 | N = 394 published articles |
| Abu-Arafeh (2016) [ | Anesthesiology | Anaesthesia, Anesthesiology, Anesthesia & Analgesia, British Journal of Anaesthesia, Canadian Journal of Anesthesia | 2013–2014 | N = 111 papers |
| Montenij (2016) [ | Cardiac output monitors | N/A | N/A | Expert opinion |
| Olofsen (2015) [ | Unrestricted | N/A | N/A | Narrative literature review and Monte Carlo simulations |
| Chhapola (2015) [ | Laboratory analytes | PubMed and Google Scholar | 2012 and later | N = 50 clinical studies |
| Berthelsen (2006) [ | Anesthesiology | Acta Anaesthesiologica Scandinavica | 1989–2005 | N = 50 |
| Mantha (2000) [ | Anesthesiology | Seven anesthesia journals | 1996–1998 | N = 44 |
N/A: not applicable.
Comparison of proposed reporting items for a Bland–Altman analysis across included studies.
| Reporting Item | Flegal (2019) [ | Abu-Arafeh (2017) [ | Montenij (2016) [ | Olofsen (2015) [ | Chhapola (2015) [ | Berthelsen (2006) [ | Mantha (2000) [ |
|---|---|---|---|---|---|---|---|
| (1) Pre-established acceptable limit of agreement | X | X | X | X | X | X | |
| (2) Description of the data structure (e.g., no. of raters, replicates, block design) | X | X | X | ||||
| (3) Estimation of repeatability of measurements if possible (mean of differences between replicates and respective standard deviations) | X | X | X | X | X | ||
| (4) Plot of the data, and visual inspection for normality, absence of trend, and constant variance across the measurement range (e.g., histogram, scatter plot) | X | X | X | X | X | X | X |
| (5) Transformation of the data (e.g., ratio, log) according to 4), if necessary | X | X | |||||
| (6) Plotting and numerically reporting the mean of the differences (bias) | X | X | X | X | |||
| (7) Estimation of the precision, i.e., standard deviation of the differences or 95% confidence interval for the mean difference | X | X | X | X | X | ||
| (8) Plotting and numerically reporting the BA LoA | X | X | X | X | X | ||
| (9) Estimation of the precision of the BA LoA by means of 95% confidence intervals | X | X | X | X | X | X | X |
| (10) Indication of whether the measurement range is sufficiently wide (e.g., apply the Preiss-Fisher procedure [ | X | ||||||
| (11) Between- and within-subject variance or stating that the confidence intervals of the BA LoA were derived by taking the data structure into account | X | X | X | ||||
| (12) Software package or computing processes used | X | X | |||||
| (13) Distributional assumptions made (e.g., normal distribution of the differences) | X | X | X | ||||
| (14) Sample size considerations | X | ||||||
| (15) Correct representation of the | X | X | |||||
| (16) Upfront declaration of conflicts of interest | X |
BA LoA: Bland–Altman Limits of Agreement.
Figure 2Histogram with approximating normal distribution of interrater differences (n = 140).
Figure 3Bland–Altman plot for interrater agreement analysis (n = 140). Limits of Agreement are shown as solid, black lines with 95% confidence intervals (light blue areas), bias (as dotted black line) with 95% confidence interval (olive-teal area), and regression fit of the differences on the means (as solid red line).
Figure 4Distribution of the standard deviation from 1000 random mispairings according to the Preiss-Fisher procedure [34]. The observed standard deviation in the interrater sample (1.21) is clearly smaller than the minimum of the standard deviations from 1000 random mispairings (7.12).