| Literature DB >> 23548755 |
Artemis Koukounari, Irini Moustaki, Nicholas C Grassly, Isobel M Blake, María-Gloria Basáñez, Manoj Gambhir, David C W Mabey, Robin L Bailey, Matthew J Burton, Anthony W Solomon, Christl A Donnelly.
Abstract
In disease control or elimination programs, diagnostics are essential for assessing the impact of interventions, refining treatment strategies, and minimizing the waste of scarce resources. Although high-performance tests are desirable, increased accuracy is frequently accompanied by a requirement for more elaborate infrastructure, which is often not feasible in the developing world. These challenges are pertinent to mapping, impact monitoring, and surveillance in trachoma elimination programs. To help inform rational design of diagnostics for trachoma elimination, we outline a nonparametric multilevel latent Markov modeling approach and apply it to 2 longitudinal cohort studies of trachoma-endemic communities in Tanzania (2000-2002) and The Gambia (2001-2002) to provide simultaneous inferences about the true population prevalence of Chlamydia trachomatis infection and disease and the sensitivity, specificity, and predictive values of 3 diagnostic tests for C. trachomatis infection. Estimates were obtained by using data collected before and after mass azithromycin administration. Such estimates are particularly important for trachoma because of the absence of a true "gold standard" diagnostic test for C. trachomatis. Estimated transition probabilities provide useful insights into key epidemiologic questions about the persistence of disease and the clearance of infection as well as the required frequency of surveillance in the post-elimination setting.Entities:
Keywords: diagnosis; latent Markov model; multilevel; nonparametric model; trachoma
Mesh:
Substances:
Year: 2013 PMID: 23548755 PMCID: PMC3639724 DOI: 10.1093/aje/kws345
Source DB: PubMed Journal: Am J Epidemiol ISSN: 0002-9262 Impact factor: 4.897
Figure 1.Descriptive (proportion positive) results according to the 3 diagnostic tests under investigation (DNA-PCR for Chlamydia trachomatis infection and TF and TI for signs of active trachoma disease) for children aged <10 years prior to and after mass azithromycin administration in A) Tanzania (2000–2002) and B) The Gambia (2001–2002). Numbers of children participating at each time point of the surveys are indicated in parentheses. PCR, polymerase chain reaction; TF, trachomatous inflammation, follicular; TI, trachomatous inflammation, intense.
Figure 2.Latent Markov modeling path diagram. The variables in boxes represent the 3 observed categorical indicators of the latent categorical variables at each time point, C. The 4 arrows between the circled variables indicate the regression model for the latent categorical variable at time point t on the latent categorical variable at time point t − 1. PCR, polymerase chain reaction; TF, trachomatous inflammation, follicular; TI, trachomatous inflammation, intense.
Latent Markov Model Notation
| Symbol | Definition |
|---|---|
| Categorical latent variable at the individual level representing health (level 1) | |
| η | Proportion of individuals belonging in a specific category of |
| γ | Coefficient for latent variable |
| τ | Transition probability |
| Total number of latent health states | |
| Index for latent health states of | |
| Cba | New categorical latent variable at the household level (level 2) capturing the cluster variability in the distribution of the level-1 latent health states proportions |
| Observed indicator (i.e., diagnostic test) of | |
| ρ | Item response probability |
| Index for individual | |
| Total number of observed time points varies across individuals | |
| Index for time | |
| Index for diagnostic test | |
| Total number of observed indicators | |
| Sample size | |
| δ | Coefficient relating |
a See section, “Maximum likelihood estimation and numerical challenges” and Web Appendix for further technical details.
Figure 3.Nonparametric multilevel latent Markov modeling path diagram with J latent health states (J = 4). The within-household model (level 1) is similar to the model in Figure 2 with 3 observed categorical indicators and 4 measurement time points; the 3 single filled circles at the bottom of each of the 4 C latent categorical variables represent the random means for the within-household latent health states (there are J − 1 random means). These random means are referred to as C1 # 1 …. C4 # 3 in the between-household model (level 2), and they vary across level 2 between households (cluster level) latent classes (labeled Cb in the above diagram). PCR, polymerase chain reaction; TF, trachomatous inflammation, follicular; TI, trachomatous inflammation, intense.
Item Response Probabilitiesa ρ of Yielding Negative Results From a Nonparametric Multilevel Latent Markov Model for Each Diagnostic Test and Latent Health State in Tanzania (2000–2002) and The Gambia (2001–2002)
| Latent Health State | Tanzania | The Gambia | ||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| PCR | 95% CI | TF | 95% CI | TI | 95% CI | PCR | 95% CI | TF | 95% CI | TI | 95% CI | |
| I − /D − | 0.990 | 0.972, 0.997 | 1.000a | 1.000, 1.000 | 0.999 | 0.210, 1.000 | 0.992 | 0.982, 0.996 | 0.993 | 0.837, 1.000 | 0.999 | 0.992, 1.000 |
| I + /D − | 0.172 | 0.056, 0.421 | 0.812 | 0.655, 0.908 | 0.974 | 0.883, 0.995 | ||||||
| I − /D + | 0.983 | 0.818, 0.999 | 0.402 | 0.305, 0.508 | 0.707 | 0.634, 0.771 | 1.000 | 1.000, 1.000 | 0.083 | 0.000, 0.982 | 0.929 | 0.846, 0.969 |
| I + /D + | 0.222 | 0.055, 0.585 | 0.133 | 0.032, 0.415 | 0.357 | 0.182, 0.582 | 0.276 | 0.054, 0.717 | 0.000 | 0.000, 0.000 | 0.707 | 0.445, 0.879 |
Abbreviations: CI, confidence interval; I − /D − , not infected and not diseased; I + /D − , infected and not diseased; I − /D + , not infected and diseased; I + /D + , infected and diseased; PCR, polymerase chain reaction; TF, trachomatous inflammation, follicular; TI, trachomatous inflammation, intense.
a These parameters were estimated close to 1 or 0, so to avoid numerical instability in the estimation algorithm, MPlus software (Muthén & Muthén, Los Angeles, California) fixed them automatically to 1 or 0, respectively. For this same reason, standard errors and confidence intervals are not provided.
Figure 4.Estimates of prevalences η of latent health states over time as calculated from the nonparametric multilevel latent Markov model for A) Tanzania (2000–2002) and B) The Gambia (2001–2002). I − /D − , not infected and not diseased; I + /D − , infected and not diseased; I − /D + , not infected and diseased; I + /D + , infected and diseased.
Transition Probability Results τ From the Nonparametric Multilevel Latent Markov Model for Tanzania, 2000–2002
| Time, | Treatment Interval (0–2 months), Time | Nontreatment Intervals (2–6 months and 6–12 months), Time | ||||
|---|---|---|---|---|---|---|
| I − /D − | I − /D + | I + /D + | I − /D − | I − /D + | I + /D + | |
| I − /D − | 1.000 | 0.000 | 0.000 | 0.768 | 0.131 | 0.101 |
| I − /D + | 0.316 | 0.684 | 0.000 | 0.000 | 1.000 | 0.000 |
| I + /D + | 0.260 | 0.541 | 0.198 | 0.000 | 0.001 | 0.999 |
Abbreviations: I − /D − , not infected and not diseased; I + /D − , infected and not diseased; I − /D + , not infected and diseased; I + /D + , infected and diseased.
a Time t – 1 represents the index for the immediately previous time point in the study.
Transition Probability Results τ From the Nonparametric Multilevel Latent Markov Model for The Gambia, 2001–2002
| Time | Treatment Interval (0–2 months), Time | Nontreatment Intervals (2–6 months and 6–12 months), Time | ||||||
|---|---|---|---|---|---|---|---|---|
| I − /D − | I + /D − | I − /D+ | I + /D + | I − /D − | I + /D − | I − /D + | I + /D + | |
| I − /D − | 0.980 | 0.000 | 0.016 | 0.004 | 0.985 | 0.000 | 0.015 | 0.000 |
| I + /D − | 0.721 | 0.000 | 0.279 | 0.000 | 0.000 | 1.000 | 0.000 | 0.000 |
| I − /D + | 0.557 | 0.000 | 0.443 | 0.000 | 0.790 | 0.040 | 0.170 | 0.000 |
| I + /D + | 0.000 | 0.000 | 1.000 | 0.000 | 0.434 | 0.000 | 0.566 | 0.000 |
Abbreviations: I − /D − , not infected and not diseased; I + /D − , infected and not diseased; I − /D + , not infected and diseased; I + /D + , infected and diseased.
a Time t – 1 represents the index for the immediately previous time point in the study.
Evaluation of Diagnostics for Detection of Infection From the Nonparametric Multilevel Latent Markov Modela in Tanzania (2000–2002) and The Gambia (2001–2002)
| Tanzania | The Gambia | |||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| PCR, % | 95% CI | TF, % | 95% CI | TI, % | 95% CI | PCR, % | 95% CI | TF, % | 95% CI | TI, % | 95% CI | |
| Sensitivity | 77.8 | 41.5, 94.5b | 86.7 | 58.5, 96.8b | 64.3 | 41.8, 81.8b | 79.0 | 58.6, 99.4 | 48.6 | 6.3, 90.8 | 12.4 | 0, 27c |
| Specificity | 98.8 | 98.1, 99.6 | 85.8 | 73.6, 96.3 | 92.5 | 87.0, 98.1 | 99.2 | 98.9, 99.5 | 93.0 | 79.7, 100 | 99.4 | 98.2, 100 |
| Positive predictive value | 76.8 | 47.5, 100c | 27.3 | 0, 64.6c | 35.4 | 0, 78.6c | 84.8 | 65.4, 100 | 31.3 | 0, 73.7c | 55.4 | 0, 100d |
| Negative predictive value | 98.3 | 96.3, 100 | 98.7 | 97.2, 100 | 97.1 | 94.4, 99.7 | 98.5 | 96.0, 100 | 95.8 | 85.4, 100c | 93.8 | 82.4, 100c |
Abbreviations: CI, confidence interval; PCR, polymerase chain reaction; TF, trachomatous inflammation, follicular; TI, trachomatous inflammation, intense.
a The δ method has been used to approximate the standard errors for these functions of parameters as explained in the Web Appendix.
b Because in the Tanzania analysis there is an I + /D + latent health state but no I + /D − latent health state, the sensitivity estimates and their confidence limits are obtained directly from the item response probabilities and their confidence limits displayed in Table 2 by using the equation that sensitivity equals 1 minus the item response probability for I + /D + .
c Because the normal approximation (i.e., estimate plus or minus 1.96 times the estimated standard error) does not take into account intrinsic constraints on parameter values, confidence intervals estimated in this way could extend beyond 0% or 100%. In these cases, we have used 0% as the minimum or 100% as the maximum.
d In this case, the approximate standard error was so large that the normal approximation to the 95% confidence interval spanned the range from 0% to 100%. Clearly, there is little information in the data set about this positive predictive value.