| Literature DB >> 28575370 |
Maria Deloria Knoll1, Wei Fu1,2, Qiyuan Shi1, Christine Prosperi1, Zhenke Wu3,4, Laura L Hammitt1,5, Daniel R Feikin1,6, Henry C Baggett7,8, Stephen R C Howie9,10,11, J Anthony G Scott5,12, David R Murdoch13,14, Shabir A Madhi15,16, Donald M Thea17, W Abdullah Brooks18,19, Karen L Kotloff20, Mengying Li1,21, Daniel E Park1,22, Wenyi Lin3, Orin S Levine1,23, Katherine L O'Brien1, Scott L Zeger3.
Abstract
In pneumonia, specimens are rarely obtained directly from the infection site, the lung, so the pathogen causing infection is determined indirectly from multiple tests on peripheral clinical specimens, which may have imperfect and uncertain sensitivity and specificity, so inference about the cause is complex. Analytic approaches have included expert review of case-only results, case-control logistic regression, latent class analysis, and attributable fraction, but each has serious limitations and none naturally integrate multiple test results. The Pneumonia Etiology Research for Child Health (PERCH) study required an analytic solution appropriate for a case-control design that could incorporate evidence from multiple specimens from cases and controls and that accounted for measurement error. We describe a Bayesian integrated approach we developed that combined and extended elements of attributable fraction and latent class analyses to meet some of these challenges and illustrate the advantage it confers regarding the challenges identified for other methods.Entities:
Keywords: .; Bayes theorem; epidemiologic methods; etiologic estimations; pneumonia; statistical models
Mesh:
Year: 2017 PMID: 28575370 PMCID: PMC5447849 DOI: 10.1093/cid/cix144
Source DB: PubMed Journal: Clin Infect Dis ISSN: 1058-4838 Impact factor: 9.079
Figure 1.Alternative analytic approaches used for determining pneumonia etiology. Abbreviations: BCX+, positive blood culture; Cor, coronavirus; Hinf, Haemophilus influenzae; HMPV, human metapneumovirus A/B; NP/OP, nasopharyngeal/oropharyngeal; PCR, polymerase chain reaction; Rhino, rhinovirus; RSV, respiratory syncytial virus; S. aur, Staphylococcus aureus; Spn, Streptococcus pneumoniae.
Figure 2.Estimating the etiologic fraction from a study with 2 types of measurements, 1 with control data, and accounting for imperfect sensitivity of both measurements: the PERCH integrated analysis method. The PERCH integrated analysis can combine multiple specimens (shown here for 2 but can integrate more specimen/test measurements, such as whole-blood polymerase chain reaction [PCR] and lung aspirate culture and PCR) and adjust each measurement for pathogen-specific sensitivity to estimate the etiologic fraction using all available evidence. Abbreviations: BCx, blood culture; NP/OP PCR, nasopharyngeal/oropharyngeal polymerase chain reaction; PIA, PERCH integrated analysis.
Description of the Key Parameters Requiring Specification Used in the PERCH Integrated Analysis Approach
| Input parameter |
|
|
|
|---|---|---|---|
|
| All pathogens included in the analysis will be included in the final etiologic distribution estimate; although some estimates may get near 0%, none is ruled out as definitely not a causative pathogen. | Determined on the basis of biological relevance to the disease, implying included pathogens are believed to be causative organisms. | Attributable fraction: Organisms with measurements lacking association with case status are assigned 0% etiologic fraction (ie, ruled as definitely not a causative pathogen), even though the lack of association may be due to small sample size or true pathogens that are commonly observed in controls. |
|
| Knowledge about the likely infection status of the cases before any data are available (prior probabilities or etiology fraction priors). Evidence in the observed data shifts the etiologic fractions away from the prior values to the final estimates. | Determined by 3 characteristics: informativeness, uncertainty, and distribution. If the goal is to summarize the current study evidence with as little influence from external evidence or opinion as possible, then all pathogens would be considered equally likely, the uncertainty would be wide, and the distribution would be flat. | Clinical: Similar to how a clinician approaches a pneumonia patient with a set of prior beliefs regarding their etiology. |
|
| Pathogen-noninformative etiologic priors assign each organism the same etiologic fraction distribution at the outset. | Choice depends on how results are to be interpreted: | Clinical: Knowledge of seasonality of pathogens or etiology of previous patients may sway beliefs to be more in favor of some pathogens over others (informative) rather than equipoise (noninformative). |
|
| Refers to the probability distribution around the etiologic prior value for each pathogen. | In general, the impact of the etiology fraction prior for a given pathogen is related to both the level of uncertainty and the strength of evidence in the data, whether for that organism or against that organism due to strong evidence for other organisms. When the probability distribution is wide (ie, uncertain), less evidence in the data is required to shift the etiologic fraction from the prior value. | Clinical: A decision to treat without testing or to test in spite of an ongoing epidemic of a particular pathogen are expressions of uncertainty in the clinical setting. |
|
| A Dirichelet prior is used to describe the distribution of the etiologic fractions of the pathogens (Supplementary Figure 2) | A minimally informative Dirichelet prior is one where each pathogen has an equally likely chance of being the cause for a given child. | Clinical: During an epidemic, a physician may assume that a case is more likely caused by the epidemic pathogen than any other pathogen. |
|
| Sensitivity priors are set separately for each silver- and bronze-standard measurement for each specimen/test/pathogen combination. | Because the primary analysis goal is to determine etiology, it is important to use as much information about sensitivity as is available (ie, informative priors) because the model estimates both etiology and sensitivity simultaneously and cannot estimate etiology without a sensitivity distribution. | All analytic methods require assumptions about the sensitivity of the measurements to estimate etiologic fraction |
| Uncertainty range | Each sensitivity prior is specified as a range of plausible values rather than as a single point estimate. The range represents 95% of the prior’s distribution (ie, not min-max) and allows posterior estimates outside of this range to occur. | A plausibility range of values is selected to reflect our inherent uncertainty about the exact value of sensitivity. Wider ranges in the measurement’s sensitivity prior result in more uncertainty in the etiology estimate. | Other analytic methods that have accounted for sensitivity use only a point estimate and therefore do not incorporate its uncertainty. |
| Informative vs noninformative | Noninformative priors are those at the full range of 0–100% that give equal probability to every sensitivity value (ie, uniform distribution). | Noninformative sensitivity priors introduce considerable uncertainty in the analysis, which, in turn, produces wide confidence intervals around the etiology fraction estimates. Where there is a general consensus, making even modest assumptions about the sensitivity of the measurements can improve the ability to estimate etiology. Sensitivity priors at the individual level may also be set to account for case-specific influential factors, such as antibiotic preexposure and specimen volume. | All other methods are informative because they select a single value, such as 100% sensitivity. |
| Distribution | A beta distribution is used with parameters that produce a unimodal distribution with more of its probability in the center of the uncertainty range (ie, as opposed to a uniform distribution). | Other analytic methods that have accounted for sensitivity use only a point estimate and therefore do not incorporate its probability across a range of values. |
Figure 3.Analysis of 500 simulated datasets for a study that had only 1 specimen (A) and resulting etiologic fraction estimates using attributable fraction and PERCH integrated analysis methods (B). A, Analyses performed on measurements from 600 cases and 600 controls for each of 500 simulated datasets. *Prevalence and odds ratios estimated by averaging across the 500 datasets that were created based on the true etiology, sensitivity, and specificity values. B, Description of boxplots: Bold black line, mean of the true value across the 500 datasets; Diamond, average etiologic estimate across the 500 datasets; Vertical line through diamond, confidence interval around the average etiologic estimate; Boxplot, distribution of the etiologic estimates across the 500 datasets. Numbers above boxplots indicate numeric value of the diamond. Abbreviations: NA, not applicable; NoA, none-of-the-above; PIA, PERCH integrated analysis.
Figure 4.Etiologic fraction estimates using PERCH integrated analysis: distribution of results from analysis of 500 simulated datasets containing cases with known etiology (A) and results from 1 randomly selected dataset (B). A, Pathogens A through D represent true pneumonia-causing pathogens that were tested for, pathogen E represents a pathogen that was tested for but does not cause pneumonia, and NoA represents pathogens that cause pneumonia but were not tested for. Slashes in table indicate not applicable for the pathogen. Description of boxplots: Bold black line, mean of the true value across the 500 datasets; Boxplots display the distribution of etiologic fraction point estimates from 500 simulated datasets: Diamond, average etiologic estimate across the 500 datasets; Vertical line through diamond, confidence interval around the average etiologic estimate; Numbers above boxplots indicate the numeric value of the diamond; whiskers denote the 5th and 95th percentiles of the etiologic fraction point estimates. B, Bubble plot presenting the PERCH integrated analysis results from 1 randomly selected dataset. The area of the bubble is proportional to the estimated etiologic fraction (number above the bubbles) divided by its standard error (ie, the larger the bubble, the greater the degree of confidence in the estimate). Abbreviations: BrS, bronze-standard data (imperfect sensitivity and imperfect specificity; eg, nasopharyngeal polymerase chain reaction); NoA, none-of-the-above pathogens; PIA, PERCH integrated analysis; SS, silver-standard data (imperfect sensitivity and perfect specificity; eg, blood culture).
Figure 5.Impact of inaccurate or imprecise sensitivity priors on etiologic estimation from the PERCH integrated analysis using 500 simulated datasets: a sensitivity analysis compared with the base scenario presented in Figure 4. A, Impact of changing true bronze-standard (BrS) sensitivity for pathogen A from 75% (white) to 90% (gray). B, Impact of changing true bronze-standard (BrS) sensitivity for pathogen B1, B2, and B3 from 75% (white) to 90% (gray). C, Impact of changing true silver-standard (SS) sensitivity for pathogens B1, B3, B4, C1, C3, D1, and D3 from 15% (white) to 5% (gray) and changing SS sensitivity prior from 5%–25% (white) to 10%–20% (gray). D, Impact of increasing width of SS sensitivity prior from 5%–25% (white) to 1%–50% (gray). Description of boxplots: Bold black line, mean of the true value across the 500 datasets; Boxplots display the distribution of etiologic fraction point estimates from 500 simulated datasets: Diamond, average etiologic estimate across the 500 datasets; Vertical line through diamond, confidence interval around the average etiologic estimate; Numbers above boxplots indicate the numeric value of the diamond; whiskers denote the 5th and 95th percentiles of the etiologic fraction point estimates. Abbreviation: BrS, bronze-standard data (imperfect sensitivity and imperfect specificity; eg, nasopharyngeal polymerase chain reaction); NoA, none-of-the-above (ie, pathogens not tested for); SS, silver-standard data (imperfect sensitivity and perfect specificity; eg, blood culture).