| Literature DB >> 34084888 |
Yue Ma1,2, Derek L Norton2,3, Carol A Van Hulle1,2, Richard J Chappell3,4, Karen K Lazar1,2,5, Erin M Jonaitis1,6, Rebecca L Koscik1,6, Lindsay R Clark1,2,5,6, Rachel Krause1,2, Ulf Andreasson7,8, Nathaniel A Chin1,2, Barbara B Bendlin1,2,6, Sanjay Asthana1,2,5, Ozioma C Okonkwo1,2,6, Carey E Gleason1,2,5,6, Sterling C Johnson1,2,5,6, Henrik Zetterberg7,8,9,10, Kaj Blennow7,8, Cynthia M Carlsson1,2,5,6.
Abstract
INTRODUCTION: Batch differences in cerebrospinal fluid (CSF) biomarker measurement can introduce bias into analyses for Alzheimer's disease studies. We evaluated and adjusted for batch differences using statistical methods.Entities:
Keywords: Alzheimer's disease; batch difference; biomarker; cerebrospinal fluid; conversion; generalized linear model
Year: 2021 PMID: 34084888 PMCID: PMC8144935 DOI: 10.1002/dad2.12194
Source DB: PubMed Journal: Alzheimers Dement (Amst) ISSN: 2352-8729
FIGURE 1Distribution of cerebrospinal fluid (CSF) samples between measurement batches. CSF samples included in this analysis were from participants of six National Institutes of Health (NIH)–funded Alzheimer's disease (AD) studies: the Wisconsin Alzheimer's Disease Research Center (ADRC) Clinical Core, the Wisconsin Registry for Alzheimer's Prevention (WRAP), the Longitudinal Early Alzheimer's Detection (LEAD) study, The Longitudinal Course of Neural Function and Amyloid in People At Risk for Alzheimer's Disease (PREDICT) study, the PIB Imaging in People at Risk for Alzheimer's Disease (PIPR) study, and the Statins in Healthy, At Risk Adults: Impact on Amyloid and Regional Perfusion (SHARP) study. A total of 792 CSF samples collected from 528 participants were assayed in three batches, with a subset of samples re‐assayed, which resulted in 977 batch measures. Groups {1}, {2}, {3} represent the CSF samples assayed in a single batch 1, 2, and 3, respectively. Group {1‐2} represents the CSF samples assayed in both batches 1 and 2, and group {1‐3} represents the CSF samples assayed in both batches 1 and 3. The groups with CSF samples assayed in two batches had smaller sample sizes than the groups with CSF samples assayed in a single batch. The mean age for each group ranged from 55.5 to 65.2 years. The clinical diagnoses associated with the CSF samples were all cognitively unimpaired in groups {1‐2}, {1‐3}, {3}, and were mostly cognitively unimpaired in groups {1}, {2}. Types and counts (percentages) of impaired diagnoses for these two groups are summarized in Table S1
FIGURE 2Bland‐Altman plot for evaluating between‐batch differences. We present this example to illustrate how to evaluate and interpret between‐batch differences using the Bland‐Altman plot. The differences between the two batch values of the biomarker sAPP‐α for the same cerebrospinal fluid (CSF) sample (Batch 2–Batch 1) are plotted against the average of the two batch values. The dashed line represents the linear regression fit of the difference on the average. Mean ± 2 SD are limits of agreement (LOA).The mean of the differences is above zero, which indicates that batch 2 on average yielded higher values than batch 1. The negative slope of the linear fit indicates less variability in batch 2, and its crossing over the zero line indicates that batch 2 measurements exceeded batch 1 at the lower end of the range, but at higher levels, batch 1 exceeded batch 2. The variance of these batch differences was non‐constant at different biomarker levels, with the largest variance observed in the middle of the biomarker level range. Under ideal circumstances, one can evaluate the significance of batch differences by comparing the LOA against an a priori threshold for a clinically significant difference. However, in this case we had no such a priori thresholds. Furthermore, the observed non‐constant variance of the differences and its marked trend complicate such an interpretation. Consequently we focus on these descriptive aspects in our report
FIGURE 3AβX‐42 converting from batch 2 to batch 1 and checking out of range values. Only samples assayed in batch 2 but not batch 1 (group {2}) were converted. Conversion was done by applying the mean structure from the final generalized linear model (GLM): [converted AβX‐42 batch 1 value, represented with red dot] = 171 + 1.82 × [raw AβX‐42 batch 2 value, represented with blue dot]. The final GLM model was developed using the raw batch 1 and raw batch 2 values of the group {1‐2} (represented with black dot), with 5% outliers excluded (represented with triangle). The final GLM line deviated from the identity line Y = X, which indicated that raw batch 1 and batch 2 values were not comparable and conversion was important. We use x to represent the cerebrospinal fluid (CSF) value being checked, and use min and max to represent the minimum and maximum values of the range being compared to, that is, the range of the raw batch values used to fit the final GLM. POR indicates the extent a value is out of the range, calculated as a proportion of that range's width. A negative value represents below the range, a positive value represents above the range, and zero represents in the range. A larger absolute value indicates being further beyond the range. Thus,
if
For each CSF value being converted, there are two sets of POR values, one for the raw value being converted and the other for the converted value. The two horizontal dotted lines parallel to the x‐axis represent the minimum (Min) and maximum (Max) of the raw batch 1 values of the group {1‐2}. If a converted value of the group {2} (represented with red dot) is between these two lines, it is in the range of the raw batch 1 values of the group {1‐2}, Converted POR value = 0. If a converted value is below the Min line, it is below the range, Converted POR value < 0. If a converted value is above the Max line, it is above the range, Converted POR value > 0. The two vertical dotted lines parallel to the y‐axis represent the Min and Max of the raw batch 2 values of the group {1‐2}. If a raw value of the group {2} (represented with blue dot) is between these two lines, it is in the range of the raw batch 2 values of the group {1‐2}, Raw POR = 0. If a raw value is below the Min line, it is below the range, Raw POR < 0. If a raw value is above the Max line, it is above the range, Raw POR > 0
Between‐batch comparison of the assayed CSF values for the same CSF sample group
| CSF variable |
| Bland‐Altman plot | Difference in means | Ratio of variances | Linear regression analyses for testing y = x | |||||
|---|---|---|---|---|---|---|---|---|---|---|
| Batch 1 is y | Batch 1 is x | Model | No. of methods | |||||||
| Intercept | Slope | Intercept | Slope | |||||||
| Comparison between batch 1 versus batch 2 for the CSF sample group {1‐2} | ||||||||||
| sAPP‐α | 95 | ! | * | * | + | *+ | *+ | 0.52 | 4 | |
| sAPP‐β | 95 | ! | * | *+ | *+ | 0.45 | 3 | |||
| AβX‐38 | 96 | ! | * | * | *+ | *+ | *+ | 0.77 | 4 | |
| AβX‐40 | 96 | ! | * | * | + | *+ | *+ | *+ | 0.70 | 4 |
| AβX‐42 | 96 | ! | * | * | *+ | *+ | + | *+ | 0.66 | 4 |
| MCP‐1 | 96 | ! | * | * | *+ | *+ | *+ | 0.70 | 4 | |
| YKL‐40 | 96 | * | + | + | 0.92 | 2 | ||||
| NfL | 95 | 0.95 | 0 | |||||||
| Aβ1‐42 | 96 | ! | *+ | *+ | 0.76 | 2 | ||||
| T‐tau | 95 | ! | * | * | *+ | *+ | *+ | 0.96 | 4 | |
| P‐tau | 94 | * | + | + | *+ | *+ | 0.64 | 3 | ||
| Ng | 77 | ! | * | * | + | + | + | *+ | 0.66 | 4 |
| AβX‐42/AβX‐40 | 96 | ! | * | *+ | + | + | 0.71 | 3 | ||
| T‐tau/Aβ1‐42 | 95 | ! | * | * | *+ | *+ | 0.97 | 4 | ||
| P‐tau/Aβ1‐42 | 94 | ! | * | *+ | *+ | + | 0.74 | 3 | ||
| Comparison between batch 1 versus batch 3 for the CSF sample group {1‐3} | ||||||||||
| sAPP‐α | 89 | ! | * | + | *+ | *+ | + | 0.51 | 3 | |
| sAPP‐β | 89 | ! | * | + | + | *+ | *+ | 0.49 | 3 | |
| AβX‐38 | 89 | ! | * | * | *+ | + | *+ | 0.88 | 4 | |
| AβX‐40 | 89 | ! | * | * | *+ | *+ | *+ | 0.77 | 4 | |
| AβX‐42 | 89 | ! | * | * | *+ | *+ | + | *+ | 0.72 | 4 |
| MCP‐1 | 89 | ! | * | * | *+ | *+ | *+ | 0.71 | 4 | |
| YKL‐40 | 89 | ! | *+ | *+ | 0.84 | 2 | ||||
| NfL | 89 | ! | * | * | *+ | + | 0.94 | 4 | ||
| Aβ1‐42 | 89 | ! | * | * | + | *+ | *+ | *+ | 0.53 | 4 |
| T‐tau | 89 | ! | * | * | *+ | *+ | 0.93 | 4 | ||
| P‐tau | 88 | ! | * | *+ | *+ | *+ | *+ | 0.16 | 3 | |
| AβX‐42/AβX‐40 | 89 | ! | * | *+ | *+ | + | + | 0.54 | 3 | |
| T‐tau/Aβ1‐42 | 89 | ! | * | * | *+ | *+ | *+ | 0.88 | 4 | |
| P‐tau/Aβ1‐42 | 88 | ! | * | * | *+ | *+ | + | 0.44 | 4 | |
NOTE Difference in means was tested using paired t‐test with the null hypothesis M2 ‐ M1 = 0. Ratio of variances was tested using the Pitman‐Morgan test with the null hypothesis V2/V1 = 1. Model R 2 is the same between the two regression models switching between x and y. Ng was not available for comparison between batch 1 versus batch 3 because it was not assayed in batch 3. No. of methods indicates the number of methods that have found batch differences.
!Batch differences were found in Bland‐Altman plots.
*Statistical significance, that is., P < (.05/29) for testing difference in means and ratio of variances, and P < (.05/58) for linear regression analyses testing intercept = 0 or slope = 1. Bonferroni correction was applied to adjust for the inflation of type I error rate due to multiple testing. The critical P‐value was calculated by dividing the conventional P‐value .05 by the number of tests.
+Effect size significance for linear regression analyses, that is, |intercept–0| > 5% range of y, or |slope–1| > 5% of 1. The statistical significance of a test depends on sample size, and the sample sizes were small. It was possible that a large deviation from 0 for the intercept or from 1 for the slope did not yield a significant P‐value. Thus, the effect sizes of these deviations were also evaluated, using 5% as the cut‐off value for significance, analogous to using 0.05 for the conventional cut off P‐value, that is, 5% type I error rate. For the intercept, because the value range (ie., the difference between the minimum and maximum values) of y varies a lot between different biomarker variables, the same absolute amount of deviation can represent different extents of deviation for different biomarkers. Therefore, the intercept was compared against 5% of the value range of y.
Generalized linear models for between‐batch conversion of CSF values
| CSF variable |
|
| Distribution | Model | Distribution | Model |
|---|---|---|---|---|---|---|
| CSF sample group {1‐2} | From batch 2 to batch 1 | From batch 1 to batch 2 | ||||
| sAPP‐α | 95 | 0.72 | Gamma | 0.50 | Normal | 0.62 |
| sAPP‐β | 95 | 0.67 | Gamma | 0.43 | Gamma | 0.50 |
| AβX‐38 | 96 | 0.88 | Gamma | 0.86 | Gamma | 0.82 |
| AβX‐40 | 96 | 0.84 | Gamma | 0.73 | Normal | 0.78 |
| AβX‐42 | 96 | 0.81 | Gamma | 0.69 | Normal | 0.77 |
| MCP‐1 | 96 | 0.84 | VPM | 0.74 | Gamma | 0.79 |
| YKL‐40 | 96 | 0.96 | Normal | 0.95 | Gamma | 0.95 |
| NfL | 95 | 0.98 | Normal | 0.96 | Normal | 0.96 |
| Aβ1‐42 | 96 | 0.87 | Gamma | 0.74 | Normal | 0.81 |
| T‐tau | 95 | 0.98 | Normal | 0.97 | Gamma | 0.96 |
| P‐tau | 94 | 0.80 | Gamma | 0.66 | Gamma | 0.71 |
| Ng | 77 | 0.81 | Gamma | 0.70 | VPM | 0.75 |
| AβX‐42/AβX‐40 | 96 | 0.85 | Gamma | 0.77 | Gamma | 0.77 |
| T‐tau/Aβ1‐42 | 95 | 0.99 | Normal | 0.98 | Normal | 0.98 |
| P‐tau/Aβ1‐42 | 94 | 0.86 | Normal | 0.70 | Normal | 0.75 |
| CSF sample group {1‐3} | From batch 3 to batch 1 | From batch 1 to batch 3 | ||||
| sAPP‐α | 89 | 0.71 | VPM | 0.57 | Normal | 0.56 |
| sAPP‐β | 89 | 0.70 | VPM | 0.59 | Normal | 0.57 |
| AβX‐38 | 89 | 0.94 | Normal | 0.91 | Gamma | 0.88 |
| AβX‐40 | 89 | 0.88 | Gamma | 0.81 | Normal | 0.82 |
| AβX‐42 | 89 | 0.85 | Normal | 0.79 | Normal | 0.79 |
| MCP‐1 | 89 | 0.84 | Normal | 0.78 | Normal | 0.81 |
| YKL‐40 | 89 | 0.91 | Normal | 0.94 | Normal | 0.93 |
| NfL | 89 | 0.97 | Gamma | 0.96 | Normal | 0.96 |
| Aβ1‐42 | 89 | 0.73 | Normal | 0.57 | Gamma | 0.56 |
| T‐tau | 89 | 0.97 | Normal | 0.95 | Gamma | 0.94 |
| P‐tau | 88 | 0.40 | Normal | 0.25 | Normal | 0.17 |
| AβX‐42/AβX‐40 | 89 | 0.74 | Normal | 0.64 | Normal | 0.65 |
| T‐tau/Aβ1‐42 | 89 | 0.94 | Normal | 0.86 | Normal | 0.91 |
| P‐tau/Aβ1‐42 | 88 | 0.66 | Normal | 0.52 | Normal | 0.35 |
NOTE. The conversion models were not developed for Ng for CSF sample {1‐3} because it was not assayed in batch 3. r is Pearson correlation. All rs are statistically significant at P < (.05/29).
Abbreviations: VPM, variance proportional to mean.
Counts and percentages of the raw and converted batch values that were out of range for the CSF samples being converted
| CSF variable |
| Raw | Converted |
| Raw | Converted |
|---|---|---|---|---|---|---|
| Conversion from batch 2 to batch 1 | Conversion from batch 1 to batch 2 | |||||
| sAPP‐α | 202 | 9 (4.5) | 1 (0.5) | 332 | 3 (0.9) | 0 (0.0) |
| sAPP‐β | 202 | 7 (3.5) | 0 (0.0) | 332 | 5 (1.5) | 1 (0.3) |
| AβX‐38 | 201 | 12 (6.0) | 7 (3.5) | 332 | 27 (8.1) | 18 (5.4) |
| AβX‐40 | 201 | 9 (4.5) | 8 (4.0) | 332 | 14 (4.2) | 12 (3.6) |
| AβX‐42 | 201 | 10 (5.0) | 9 (4.5) | 332 | 35 (10.5) | 7 (2.1) |
| MCP‐1 | 200 | 4 (2.0) | 0 (0.0) | 332 | 2 (0.6) | 1 (0.3) |
| YKL‐40 | 202 | 5 (2.5) | 4 (2.0) | 332 | 9 (2.7) | 10 (3.0) |
| NfL | 199 | 10 (5.0) | 4 (2.0) | 330 | 15 (4.5) | 20 (6.1) |
| Aβ1‐42 | 202 | 7 (3.5) | 0 (0.0) | 330 | 12 (3.6) | 7 (2.1) |
| T‐tau | 202 | 21 (10.4) | 21 (10.4) | 330 | 39 (11.8) | 38 (11.5) |
| P‐tau | 202 | 23 (11.4) | 23 (11.4) | 330 | 43 (13.0) | 23 (7.0) |
| Ng | 202 | 37 (18.3) | 5 (2.5) | 326 | 37 (11.3) | 4 (1.2) |
| AβX‐42/AβX‐40 | 201 | 3 (1.5) | 0 (0.0) | 332 | 13 (3.9) | 13 (3.9) |
| T‐tau/Aβ1‐42 | 202 | 14 (6.9) | 14 (6.9) | 330 | 38 (11.5) | 33 (10.0) |
| P‐tau/Aβ1‐42 | 202 | 21 (10.4) | 19 (9.4) | 330 | 32 (9.7) | 25 (7.6) |
| Conversion from batch 3 to batch 1 | Conversion from batch 1 to batch 3 | |||||
| sAPP‐α | 162 | 3 (1.9) | 0 (0.0) | 338 | 23 (6.8) | 0 (0.0) |
| sAPP‐β | 162 | 1 (0.6) | 1 (0.6) | 338 | 19 (5.6) | 6 (1.8) |
| AβX‐38 | 162 | 3 (1.9) | 3 (1.9) | 339 | 14 (4.1) | 11 (3.2) |
| AβX‐40 | 162 | 1 (0.6) | 1 (0.6) | 339 | 12 (3.5) | 13 (3.8) |
| AβX‐42 | 162 | 3 (1.9) | 0 (0.0) | 339 | 40 (11.8) | 28 (8.3) |
| MCP‐1 | 162 | 9 (5.6) | 6 (3.7) | 339 | 7 (2.1) | 7 (2.1) |
| YKL‐40 | 162 | 0 (0.0) | 0 (0.0) | 339 | 20 (5.9) | 20 (5.9) |
| NfL | 162 | 4 (2.5) | 3 (1.9) | 337 | 48 (14.2) | 52 (15.4) |
| Aβ1‐42 | 162 | 4 (2.5) | 5 (3.1) | 337 | 55 (16.3) | 6 (1.8) |
| T‐tau | 162 | 2 (1.2) | 2 (1.2) | 337 | 39 (11.6) | 35 (10.4) |
| P‐tau | 162 | 1 (0.6) | 1 (0.6) | 337 | 28 (8.3) | 7 (2.1) |
| AβX‐42/AβX‐40 | 162 | 2 (1.2) | 0 (0.0) | 339 | 33 (9.7) | 2 (0.6) |
| T‐tau/Aβ1‐42 | 162 | 7 (4.3) | 4 (2.5) | 337 | 41 (12.2) | 41 (12.2) |
| P‐tau/Aβ1‐42 | 162 | 3 (1.9) | 0 (0.0) | 337 | 55 (16.3) | 40 (11.9) |
NOTE. The conversions from and to batch 3 were not performed for Ng, because it was not assayed in batch 3 and thus no conversion models were developed.