| Literature DB >> 22912804 |
Camille Szmaragd1, Laura E Green, Graham F Medley, William J Browne.
Abstract
BACKGROUND: Imperfect diagnostic testing reduces the power to detect significant predictors in classical cross-sectional studies. Assuming that the misclassification in diagnosis is random this can be dealt with by increasing the sample size of a study. However, the effects of imperfect tests in longitudinal data analyses are not as straightforward to anticipate, especially if the outcome of the test influences behaviour. The aim of this paper is to investigate the impact of imperfect test sensitivity on the determination of predictor variables in a longitudinal study. METHODOLOGY/PRINCIPALEntities:
Mesh:
Year: 2012 PMID: 22912804 PMCID: PMC3418251 DOI: 10.1371/journal.pone.0043116
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
List of the 12 possible patterns resulting from a 6-year pattern of herd status.
| Pattern No | Pattern | Prior Probability | Response | No years since last event (years at risk) |
| 1 | 100210 | ½ (1– | 100110 | 112311 |
| 2 | 101210 | ½ (1– | 101.10 | 112.11 |
| 3 | 100211 | ½ | 10011. | 11231. |
| 4 | 101211 | ½ | 101.1. | 112.1. |
| 5 | 100110 | ½ | 1001.0 | 1123.1 |
| 6 | 101110 | ½ | 101.0 | 112.1 |
| 7 | 100111 | ½ | 1001. | 1123. |
| 8 | 101111 | ½ | 101… | 112… |
| 9 | 111210 |
| 1…10 | 1…11 |
| 10 | 111211 |
| 1…1. | 1…1. |
| 11 | 111110 |
| 1….0 | 1….1 |
| 12 | 111111 |
| 1…. | 1…. |
The “.” indicates years where the herd was not at risk of a herd breakdown because it was already under restrictions. Those years are virtually ignored by the model fitting algorithm (Appendix S1).
Figure 1Diagram of the data preparation process.
Example based on a hypothetical herd. *The herd might not exist for the whole duration of the study period. Missing year at the start and/or end of the study period are ignored, as the dataset does not to be balanced.
The deterministic rules used for filling in missing test values.
| After | 0 | 1 | 2 |
| Before | |||
| 0 | 0 | 0 or 1 | 0 or 1 |
| 1 | 0 or 1 or 2 | 1 | 1 |
| 2 | 0 | 0 or 1 | 0 or 1 |
The “Before” rows and “After” columns give the states of the herd in the years prior and after the missing test.
Values of p used as prior probability of each pattern, for a range of possible sensitivity values.
| Se | 0.5 | 0.60 | 0.75 | 0.8 | 0.85 | 0.95 |
| Prior probability value (p) | 0.153 | 0.102 | 0.0508 | 0.038 | 0.027 | 0.008 |
Probability p = (NHr/(NHt-NHr))*((1–Se)/Se), with NHr and NHt being respectively the number of herds classified as reactors and the total number of herds. These numbers are overall averages computing from the statistics available from Defra for the British counties in the RBCT area between 1998 and 2005.
Figure 2Diagram of the model fitting process.
*cubic function of the time at risk variable was initially used. At the end of the first univariable iteration, the significance of each term will be assessed as part of the predictor selection step. None of the three terms were found significant and the time at risk variable was thus removed from the model §A predictor is considered to be significant if its z-score (|posterior mean|/(posterior standard deviation)) is larger than 1.96 The models are fitted using the MCMC algorithm detailed in Appendix S1.
List of predictor variables appearing in the models.
| Abbreviation | Full Variable name - Description | Statistics |
| LSoldPY | Loge(#animals sold, previous year) | 1.71;1.84 |
| LCalfPY | Loge(#Calves born, previous year) | 2.61;1.86 |
| LCalfPY2 | Loge(#Calves born, two years previously) | 2.30;1.94 |
| LAT | Loge(#animals tested) | 3.57;2.00 |
| LAvHSY | Loge(mean herd size, in that year) | 4.35;1.24 |
| ReactPY2 | #reactors found two years previously | 0.56;2.74 |
| CumReacPY4 | Cumulative #of reactors found in the previous four years | 2.00;5.63 |
| NeighPosY | #neighbour herds tested positive, same year | 0.71;0.90 |
| NeighPY | #neighbour herds, in the previous year | 4.93;3.50 |
| NeighNegPY | #neighbour herds tested negative, in the previous year | 3.03;3.03 |
| NeighPosPY | #neighbour herds tested positive, in the previous year | 0.63;0.89 |
| AvLHSNeighNegY | Mean loge(herd size of neighbour herds tested negative, same year) | 0.61;1.14 |
| AvHSNeighNotTPY | Mean(herd size of neighbour herds not tested, previous year) | 14.22;36.45 |
| FMD1 | FMD Indicator 1; = 1 post February 2001 | NA |
| FMD2 | FMD Indicator 2; = 1 post February 2002 | NA |
| FMD3 | FMD Indicator 3; = Year indicator (1–3), before February 2001 | NA |
| FMD4 | FMD Indicator 4; = Year indicator (4–8), post February 2001 | NA |
| BeefFarm | Beef-only enterprise (baseline category Dairy only) | 57.8 |
| MixedFarm | Mixed enterprise (baseline Dairy only) | 30.7 |
| Dep1 | Depopulation indicator = 1 if herd depopulated in the past | 51.7 |
| LDirYSNegPast | Loge #animals bought directly in the test-year from a farm, which always tested negative for TBbefore the move | 0.60;1.12 |
| LDirPYSPosPY | Loge #animals bought directly in the previous test-year from a farm, which tested positive for TBin the 12 months before the move | 0.03;0.23 |
| DirYSNotTYRBCT | #animals bought directly in the test-year from a farm in the RBCT, which was not TB tested the12 months before the move | 0.99;7.96 |
| LMarkYSNotTFolY | Loge #animals bought through market in the test-year from a farm, which was nottested for TB the 12 months following the move | 0.31;0.78 |
| LMarkYSNegPYFq12 | Loge #animals bought through market in the test-year from a farm in high TB risk area, which wastested negative for TB the 24–12 months before the move | 0.07;0.33 |
| LMarkYSNotTPYRBCT | Loge #animals bought through market in the test-year from a farm in the RBCT, which was nottested for TB the 24–12 months before the move | 0.16;0.55 |
| LMarkYSPosPY2RBCT | Loge #animals bought through market in the test-year from a farm in the RBCT, which was testedpositive for TB in the 36–24 months before the move | 0.17;0.57 |
| LDirPYSNegPast | Loge #animals bought directly in the previous test-year from a farm, which always tested negativefor TB before the move but after the 1st of July 1996 when cattle passports were first implemented | 0.08;0.35 |
| LDirYSNegPYRBCT | Loge #animals bought directly in the test-year from a farm in the RBCT, which was tested negativefor TB the 24–12 months before the move | 0.18;0.60 |
| LDirYSPosPastRBCT | Loge #animals bought directly in the test-year from a farm in the RBCT, which was tested positivefor TB at some point before the move | 0.47;0.99 |
| LDirYSPosPYFq34 | Loge #animals bought directly in the test-year from a farm in low TB risk area, which was testedpositive for TB the 24–12 months before the move | 0.04;0.28 |
| MarkYSPosFolY | #animals bought through market in the test-year from a farm, which was tested positive for TB the12 months following the move | 2.20;11.97 |
| MarkYSPosPY | #animals bought through market in the test-year from a farm, which was tested positive for TBthe 24–12 months before the move | 7.23;43.14 |
| MarkYSNegPY2 | #animals bought through market in the test-year from a farm, which was tested negative for TBin the 36–24 months before the move | 5.54;28.43 |
| LMarkYSPosFolYFq34 | Loge #animals bought through market in the test-year from a farm in low TB risk area, which wastested positive for TB the 12 months following the move | 0.02;0.18 |
| MarkYSNotTYFq34 | #animals bought through market in the test-year from a farm in low TB risk area, which was nottested for TB the 12 months before the move | 0.33;2.55 |
Table of abbreviations for the predictor variables found in the models, with full description and key statistics, being mean and standard deviation for continuous variables (expressed as per herd/per year) and frequencies for categorical variables over the number of herds present in the study area.
Best fit models obtained for each sensitivity value tested.
| Se = 1 | Se = 0.95 | Se = 0.85 | Se = 0.80 | Se = 0.75 | Se = 0.60 | Se = 0.50 | |
| Parameters | Est (sd) | Est (sd) | Est (sd) | Est (sd) | Est (sd) | Est (sd) | Est (sd) |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| LCalfPY | 0.58 (0.09) | 0.79 (0.11) | |||||
| LCalfPY2 | 0.35 (0.07) | 0.38 (0.07) | 0.41 (0.08) | 0.43 (0.09) | 0.75 (0.14) | ||
| LAT | 0.81 (0.12) | 1.06 (0.13) | 0.77 (0.14) | 0.90 (0.15) | |||
| LAvHSY | 1.12 (0.12) | 1.25 (0.16) | 1.31 (0.18) | ||||
| ReactPY2 | 0.12 (0.04) | 0.10 (0.04) | 0.09 (0.05) | 0.10 (0.05) | 0.11 (0.05) | 0.15 (0.06) | |
| CumReacPY4 | 0.11(0.05) | ||||||
| NeighPosY | 0.29 (0.11) | 0.44 (0.10) | 0.37 (0.12) | 0.57 (0.12) | 0.62 (0.13) | 0.39 (0.15) | 0.56 (0.19) |
| NeighPY | 0.41 (0.11) | ||||||
| NeighNegPY |
|
|
| ||||
| NeighPosPY |
| ||||||
| AvLHSNeighNegY |
|
|
| ||||
|
|
|
|
|
| |||
| FMD1 | 1.19 (0.28) | ||||||
| FMD2 | 1.68 (0.29) | 1.18 (0.25) | 0.96 (0.24) | 1.13 (0.27) | 1.38 (0.28) | 0.08 (0.01) | |
| FMD3 | 0.26 (0.12) | ||||||
| FMD4 | 0.16 (0.05) | ||||||
| BeefFarm |
| 0.15 (0.30) | |||||
| MixedFarm | 0.66 (0.30) | 0.64 (0.31) | |||||
| Dep1 | 1.34 (0.38) | ||||||
|
|
|
|
|
| |||
|
|
|
|
|
|
|
|
|
|
|
|
|
| ||||
|
|
|
| |||||
|
|
|
| |||||
|
|
|
|
| ||||
|
|
| ||||||
| LDirPYSNegPast | 0.81 (0.28) | 0.80 (0.27) | 0.90 (0.30) | 0.92 (0.31) | 0.99 (0.37) | 1.33 (0.44) | |
| LDirYSNegPYRBCT | 0.40 (0.17) | ||||||
| LDirYSPosPastRBCT | 0.42 (0.15) | ||||||
| LDirYSPosPYFq34 | 1.76 (0.60) | 1.49 (0.46) | 1.87 (0.61) | 1.60 (0.50) | 1.39 (0.55) | ||
| MarkYSPosFolY | 0.09 (0.02) | 0.10 (0.02) | 0.11 (0.03) | 0.11 (0.02) | |||
| MarkYSPosPY | 0.06 (0.01) | ||||||
| MarkYSNegPY2 | 0.04 (0.01) | ||||||
| LMarkYSPosFolYFq34 | 1.49 (0.65) | ||||||
| MarkYSNotTYFq34 | 0.38 (0.13) | ||||||
| Sigma2u | 0.05 (0.08) | 0.11 (0.13) | 0.06 (0.10) | 0.15 (0.17) | 0.28 (0.36) | 0.07 (0.11) | 0.08 (0.16) |
The values are given as posterior mean estimate plus posterior standard deviation (sd). The different predictors are grouped according to the type of predictor. The predictors in italic indicate protective factors and the predictors in bold indicate non-significant differences for categorical predictors.
Parameter estimates by test sensitivities (Se), based on the best fit model under the assumption of a perfect test.
| Se = 1 | Se = 0.95 | Se = 0.85 | Se = 0.80 | Se = 0.75 | Se = 0.60 | Se = 0.50 | |
| Parameter | Est (sd) | Est (sd) | Est (sd) | Est (sd) | Est (sd) | Est (sd) | Est (sd) |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| LCalfPY | 0.58 (0.09) | 0.59 (0.09) | 0.59 (0.1) | 0.61 (0.1) | 0.61 (0.1) | 0.63 (0.11) | 0.66 (0.13) |
| LAT | 0.81 (0.12) | 0.84 (0.12) | 0.87 (0.13) | 0.88 (0.14) | 0.90 (0.13) | 0.93 (0.15) | 0.99 (0.16) |
| ReactPY2 | 0.12 (0.04) | 0.13 (0.05) | 0.13 (0.05) | 0.13 (0.05) | 0.14 (0.05) | 0.15 (0.06) | 0.17 (0.07) |
| NeighPosY | 0.29 (0.11) | 0.31 (0.12) | 0.35 (0.13) | 0.37 (0.13) | 0.39 (0.14) | 0.44 (0.16) | 0.48 (0.17) |
|
|
|
|
|
|
|
|
|
| FMD2 | 1.68 (0.29) | 1.69 (0.29) | 1.68 (0.30) | 1.66 (0.31) | 1.66 (0.31) | 1.53 (0.35) | 1.36 (0.38) |
| FMD3 | 0.26 (0.12) |
|
|
|
|
|
|
| BeefFarm | 0.26 (0.3) | 0.25 (0.3) | 0.26 (0.32) | 0.26 (0.32) | 0.26 (0.33) | 0.23 (0.36) | 0.27 (0.4) |
| MixedFarm | 0.66 (0.3) | 0.66 (0.31) | 0.66 (0.33) | 0.66 (0.33) | 0.68 (0.34) | 0.67 (0.37) | 0.72 (0.41) |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| LDirPYSNegPast | 0.81 (0.28) | 0.83 (0.29) | 0.89 (0.3) | 0.90 (0.31) | 0.93 (0.32) | 1.00 (0.36) | 1.10 (0.42) |
| LDirYSNegPYRBCT | 0.4 (0.17) | 0.41 (0.18) | 0.42 (0.19) | 0.43 (0.19) | 0.45 (0.2) | 0.48 (0.23) | 0.53 (0.26) |
| LDirYSPosPYFq34 | 1.76 (0.6) | 1.85 (0.64) | 2.02 (0.72) | 2.11 (0.77) | 2.21 (0.80) | 2.33 (0.97) |
|
| MarkYSPosFolY | 0.09 (0.02) | 0.09 (0.02) | 0.1 (0.02) | 0.10 (0.02) | 0.10 (0.02) | 0.11 (0.03) | 0.11 (0.03) |
| LMarkYSPosFolYFq34 | 1.49 (0.65) | 1.64 (0.81) |
|
|
|
|
|
| Sigma2u | 0.05 (0.08) | 0.05 (0.08) | 0.05 (0.08) | 0.05 (0.09) | 0.05 (0.09) | 0.04 (0.08) | 0.06 (0.09) |
Posterior mean estimates (posterior standard deviations) obtained by running 2 independent MCMC chains of 50,000 iterations after 5,000 burnin. In italic are highlighted predictors which have a protective effect. Non significant estimates are highlighted in bold.
Problem with convergence of the two chains encountered.