| Literature DB >> 29945689 |
J Asselineau1, A Paye1, E Bessède2, P Perez1, C Proust-Lima1.
Abstract
In the absence of perfect reference standard, classical techniques result in biased diagnostic accuracy and prevalence estimates. By statistically defining the true disease status, latent class models (LCM) constitute a promising alternative. However, LCM is a complex method which relies on parametric assumptions, including usually a conditional independence between tests and might suffer from data sparseness. We carefully applied LCMs to assess new campylobacter infection detection tests for which bacteriological culture is an imperfect reference standard. Five diagnostic tests (culture, polymerase chain reaction and three immunoenzymatic tests) of campylobacter infection were collected in 623 patients from Bordeaux and Lyon Hospitals, France. Their diagnostic accuracy were estimated with standard and extended LCMs with a thorough examination of models goodness-of-fit. The model including a residual dependence specific to the immunoenzymatic tests best complied with LCM assumptions. Asymptotic results of goodness-of-fit statistics were substantially impaired by data sparseness and empirical distributions were preferred. Results confirmed moderate sensitivity of the culture and high performances of immunoenzymatic tests. LCMs can be used to estimate diagnostic tests accuracy in the absence of perfect reference standard. However, their implementation and assessment require specific attention due to data sparseness and limitations of existing software.Entities:
Keywords: Campylobacter; diagnostic accuracy; imperfect gold standard; latent class model; sparseness
Mesh:
Year: 2018 PMID: 29945689 PMCID: PMC6090718 DOI: 10.1017/S0950268818001723
Source DB: PubMed Journal: Epidemiol Infect ISSN: 0950-2688 Impact factor: 2.451
Fig. 1.Diagram (left panel) and corresponding profile probability (right panel) for three latent class models assuming different dependence structures, CampyLCA study, France, 2016. LCM CI, latent class model under conditional independence; LCM CD, latent class model with a residual dependence common to all tests; LCM SD, latent class model with a residual dependence specific to the three immunoenzymatic tests. Ovals and rectangles indicate latent quantities and observed quantities, respectively: D = 0/1: unobserved presence/absence of campylobacter infection; T1: Culture Karmali; T2: Real-time PCR; T3: Ridascreen®; T4: Premier®Campy®; T5: ImmunoCardStat!®Campy; u: random residual dependence which follows a standard Gaussian distribution. In the equations, t+ and t− indicate a positive and negative result for test T, respectively; Φ is the standard cumulative Gaussian distribution function; parameters to estimate are (akd)k = 1,…,K, d = 0, 1 for the probit transformations of sensitivities and specificities, μ for the logit transformation of the prevalence and σ for the intensity of the individual random deviation.
Test results profiles: observed and predicted (by the Latent Class Models) number of patients for each combination of test results, CampyLCA Study, France, 2016
| Culture Karmali | Real-time PCR | Rida-screen® | Premier® Campy | ImmunoCard Stat!® Campy | Observed patients (%) | Predicted patients | |||
|---|---|---|---|---|---|---|---|---|---|
| LCM CI | LCM CD | LCM SD | |||||||
| − | − | − | − | − | 522 | (83.8) | 519.6 | 521.2 | 522.1 |
| − | − | − | − | + | 15 | (2.4) | 16.9 | 16.4 | 15.6 |
| − | − | − | + | + | 4 | (0.6) | 0.2 | 0.7 | 1.4 |
| − | − | + | − | − | 7 | (1.1) | 9.6 | 7.6 | 5.2 |
| − | − | + | − | + | 3 | (0.5) | 0.4 | 1.8 | 4.1 |
| − | − | + | + | + | 2 | (0.3) | 2.6 | 2.4 | 3.0 |
| − | + | − | − | − | 3 | (0.5) | 3.4 | 2.4 | 3.0 |
| − | + | + | − | − | 1 | (0.2) | 0.1 | 0.5 | 0.2 |
| − | + | + | + | − | 1 | (0.2) | 1.7 | 1.9 | 1.2 |
| − | + | + | + | + | 9 | (1.4) | 10.8 | 7.1 | 8.7 |
| + | − | − | − | − | 2 | (0.3) | 1.9 | 1.5 | 2.0 |
| + | − | − | + | − | 1 | (0.2) | 0.1 | 0.2 | 0.0 |
| + | − | + | + | − | 1 | (0.2) | 1.2 | 1.2 | 0.8 |
| + | − | + | + | + | 5 | (0.8) | 8.1 | 4.3 | 5.4 |
| + | + | + | − | − | 1 | (0.2) | 0.2 | 0.2 | 0.8 |
| + | + | + | + | − | 5 | (0.8) | 5.2 | 3.5 | 5.7 |
| + | + | + | + | + | 41 | (6.6) | 34.2 | 42.4 | 40.2 |
LCM CD, latent class model with a residual dependence common to all tests; LCM CI, latent class model under conditional independence; LCM SD, latent class model with a residual dependence specific to the three immunoenzymatic tests.
Akaike information criterion and goodness-of-fit statistics for each model, CampyLCA Study, France, 2016
| LCM CI | LCM CD | LCM SD | |
|---|---|---|---|
| Akaike information criterion | 1041.5 | 1023.9 | 1011.9 |
| Pearson statistics | |||
| Asymptotic χ2 distribution | <0.001 | 0.012 | 0.021 |
| Empirical distribution | <0.001 | 0.022 | 0.052 |
| Likelihood ratio statistics | |||
| Asymptotic χ2 distribution | <0.001 | 0.052 | 0.52 |
| Empirical distribution | <0.001 | 0.004 | 0.086 |
| Power divergence statistics | |||
| Asymptotic χ2 distribution | <0.001 | 0.039 | 0.20 |
| Empirical distribution | <0.001 | 0.012 | 0.054 |
LCM CD, latent class model with a residual dependence common to all tests; LCM CI, latent class model under conditional independence; LCM SD, latent class model with a residual dependence specific to the three immunoenzymatic tests.
The P-value when using the empirical distribution was calculated as one minus the percentile of the statistic in 500 samples generated under the null assumption.
Fig. 2.Evaluation of local independence hypothesis by residual correlations and their 95% confidence interval, as well as by P-values of bivariate statistics, CampyLCA study, France, 2016. (a) Residual correlations for latent class model under conditional independence; (b) Residual correlations for latent class model with a residual dependence common to all tests; (c) Residual correlations for latent class model with a residual dependence specific to the three immunoenzymatic tests. T1: Culture Karmali; T2: Real-time PCR; T3: Ridascreen®; T4: Premier®Campy; T5: ImmunoCard Stat!®Campy. Residual correlations presented with dots (point estimates) and bars (95% confidence intervals). P-values of bivariate statistics are provided above each pair of tests described on the horizontal axis.
Diagnostic accuracy of medical tests according to LCM models, CampyLCA Study, France, 2016
| Reference standard | LCM CI | LCM CD | LCM SD | |||||
|---|---|---|---|---|---|---|---|---|
| Sensitivity, 95% CI | ||||||||
| Culture Karmali | 100.0 | 76.0 | 64.7–84.8 | 78.4 | 64.8–88.4 | 82.1 | 70.7–90.0 | |
| Real-time PCR | 83.9 | 74.3–93.5 | 80.9 | 70.5–89.0 | 83.9 | 67.8–93.3 | 88.2 | 78.1–94.6 |
| Ridascreen® | 94.6 | 88.7–100.0 | 93.0 | 85.3–97.2 | 96.7 | 68.7–99.9 | 98.5 | 91.1–99.9 |
| Premier®Campy | 94.6 | 88.7–100.0 | 97.1 | 87.0–99.7 | 96.9 | 83.4–99.7 | 97.2 | 89.3–99.5 |
| ImmunoCard Stat!®Campy | 82.1 | 72.1–92.2 | 86.7 | 76.7–93.2 | 85.8 | 74.7–93.1 | 85.2 | 73.4–92.1 |
| Specificity, 95% CI | ||||||||
| Culture Karmali | 100.0 | 99.6 | 98.7–99.9 | 99.5 | 98.3–99.9 | 99.6 | 98.7–99.9 | |
| Real-time PCR | 97.5 | 96.3–98.8 | 99.4 | 98.1–99.8 | 99.3 | 98.0–99.8 | 99.5 | 98.4–99.8 |
| Ridascreen® | 95.9 | 94.3–97.6 | 98.2 | 96.7–99.1 | 98.2 | 96.6–99.1 | 97.9 | 96.3–98.9 |
| Premier®Campy | 97.2 | 95.8–98.5 | 100.0 | NE | 99.5 | 95.4–100.0 | 99.1 | 97.8–99.7 |
| ImmunoCard Stat!®Campy | 94.2 | 92.3–96.1 | 96.9 | 95.2–98.1 | 96.4 | 94.1–97.8 | 95.8 | 93.7–97.3 |
| Negative predictive value, 95% CI | ||||||||
| Culture Karmali | 100.0 | 97.0 | 95.2–98.2 | 97.4 | 95.2–98.7 | 97.9 | 96.4–98.9 | |
| Real-time PCR | 98.4 | 97.4–99.4 | 97.6 | 96.0–98.7 | 98.1 | 95.6–99.3 | 98.6 | 97.3–99.4 |
| Ridascreen® | 99.5 | 98.8–100.0 | 99.1 | 98.0–99.6 | 99.6 | 95.8–100.0 | 99.8 | 98.9–100.0 |
| Premier®Campy | 99.5 | 98.8–100.0 | 99.6 | 0.0–99.9 | 99.6 | 97.9–100.0 | 99.7 | 98.7–99.9 |
| ImmunoCard Stat!®Campy | 98.2 | 97.0–99.3 | 98.3 | 96.8–99.2 | 98.2 | 96.6–99.2 | 98.2 | 96.7–99.1 |
| Positive predictive value, 95% CI | ||||||||
| Culture Karmali | 100.0 | 96.4 | 88.2–99.2 | 95.4 | 84.5–98.9 | 96.3 | 88.0–99.2 | |
| Real-time PCR | 77.0 | 66.5–87.6 | 94.2 | 84.2–98.1 | 93.6 | 83.0–97.9 | 94.9 | 86.2–98.5 |
| Ridascreen® | 69.7 | 59.4–80.1 | 86.9 | 77.0–93.3 | 86.5 | 75.0–92.8 | 84.6 | 74.4–91.8 |
| Premier®Campy | 76.8 | 66.9–86.8 | 100.0 | 9.1–100.0 | 95.8 | 69.4–99.8 | 92.4 | 83.1–97.3 |
| ImmunoCard Stat!®Campy | 58.2 | 47.4–69.1 | 78.0 | 67.9–85.9 | 74.4 | 59.9–84.5 | 70.7 | 59.4–79.9 |
| 9.0 | 6.9–11.5 | 11.4 | 9.2–14.2 | 10.9 | 8.3–14.1 | 10.5 | 8.4–13.3 | |
| 0.9 | 0.5–1.2 | 1.7 | 1.0–2.4 | |||||
95% CI, two-sided 95% confidence interval.
LCM CD, latent class model with a residual dependence common to all tests; LCM CI, latent class model under conditional independence; LCM SD, latent class model with a residual dependence specific to the three immunoenzymatic tests.
NE, not estimated because of estimate on the boundary.
p, prevalence of campylobacter infection.
σ, random effect.
Fig. 3.Diagnostic accuracy estimates (point estimate and 95% confidence interval) of campylobacter infection tests according to the LCM SD model and to culture as the reference standard, CampyLCA study, France, 2016. LCM SD, latent class model with a residual dependence specific to the three immunoenzymatic tests; Ref Std: culture Karmali.
Diagnostic accuracy of medical tests according to leave-one-test-out analyses for LCM SD model, CampyLCA study, France, 2016
| without Ridascreen® | without Premier®Campy | without ImmunoCard Stat!®Campy | ||||
|---|---|---|---|---|---|---|
| Sensitivity (95% CI) | ||||||
| Culture Karmali | 82.2 | (69.8–90.2) | 81.1 | (70.0–89.7) | 80.8 | (70.5–89.0) |
| Real-time PCR | 87.3 | (76.9–93.9) | 88.8 | (78.5–95.0) | 86.8 | (77.0–93.2) |
| Ridascreen® | 100.0 | – | 98.7 | (57.1–98.6) | ||
| Premier®Campy | 98.0 | (2.4–99.9) | 97.2 | (55.9–97.6) | ||
| ImmunoCard Stat!®Campy | 85.8 | (14.5–95.3) | 86.1 | (73.8–92.9) | ||
| Specificity (95% CI) | ||||||
| Culture Karmali | 99.7 | (98.7–99.9) | 99.5 | (98.5–99.8) | 99.6 | (98.7–99.9) |
| Real-time PCR | 99.3 | (98.2–99.8) | 99.5 | (98.5–99.8) | 99.4 | (98.4–99.8) |
| Ridascreen® | 98.1 | (95.9–99.0) | 98.2 | (56.7–98.0) | ||
| Premier®Campy | 99.1 | (4.0–99.8) | 99.2 | (57.8–99.2) | ||
| ImmunoCard Stat!®Campy | 95.8 | (9.5–97.5) | 95.9 | (93.0–97.3) | ||
| 10.6 | (8.4–13.2) | 10.5 | (8.3–13.0) | 10.7 | (8.6–13.3) | |
| 4.7 | (−2.1 to –11.6) | 1.2 | (0.4–2.0) | 0.0 | (−10.3 to −10.6) | |
LCM SD, latent class model with a residual dependence specific to the three immunoenzymatic tests.
95% CI, two-sided 95% confidence interval.
p, prevalence of campylobacter infection.
σ, random effect.
Statistical power of goodness-of-fit statistics (in %) using empirical distribution to detect violation of the conditional independence hypothesis when applying LCM CI model, CampyLCA study, France, 2016
| True model | Pearson statistics | Likelihood ratio statistics | Power divergence statistics |
|---|---|---|---|
| LCM CD | 86.6 | 92.4 | 91.8 |
| LCM SD | 93.4 | 95.0 | 95.0 |
LCM CI, latent class model under conditional independence; LCM CD, latent class model with a residual dependence common to all tests; LCM SD, latent class model with a residual dependence specific to the three immunoenzymatic tests.
Statistical power is defined as the percentage of times the test concludes that observations and predictions significantly differ (at a 5% significance level) when they actually do.
Type-I error rates of goodness-of-fit statistics (in %) using asymptotic distribution for each model, CampyLCA study, France, 2016
| Models | Pearson statistics | Likelihood ratio statistics | Power divergence statistics |
|---|---|---|---|
| LCM CI | 11.6 | 0.2 | 2.4 |
| LCM CD | 6.2 | 1.4 | 1.8 |
| LCM SD | 8.0 | 0.2 | 3.6 |
LCM CI, latent class model under conditional independence; LCM CD, latent class model with a residual dependence common to all tests; LCM SD, latent class model with a residual dependence specific to the three immunoenzymatic tests.
Type I error rate is defined as the percentage of times the test concludes that observations and predictions significantly differ (at a 5% significance level) while they actually do not. The nominal value of type I error rate is 5%.