| Literature DB >> 26918859 |
Alwin Schierenberg1, Margaretha C Minnaard1, Rogier M Hopstaken2, Alma C van de Pol1, Berna D L Broekhuizen1, Niek J de Wit1, Johannes B Reitsma1, Saskia F van Vugt1, Aleida W Graffelman3, Hasse Melbye4, Timothy H Rainer5,6, Johann Steurer7, Anette Holm8, Ralph Gonzales9, Geert-Jan Dinant10, Joris A H de Groot1, Theo J M Verheij1.
Abstract
BACKGROUND: Pneumonia remains difficult to diagnose in primary care. Prediction models based on signs and symptoms (S&S) serve to minimize the diagnostic uncertainty. External validation of these models is essential before implementation into routine practice. In this study all published S&S models for prediction of pneumonia in primary care were externally validated in the individual patient data (IPD) of previously performed diagnostic studies. METHODS ANDEntities:
Mesh:
Year: 2016 PMID: 26918859 PMCID: PMC4769284 DOI: 10.1371/journal.pone.0149895
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Overview of included prediction models to diagnose pneumonia in a primary care setting and their incorporated predictors.
| Model | Total | Diehr et al. [ | Singal et al. [ | Heckerling et al. [ | Melbye et al. [ | Hopstaken et al. [ | van Vugt et al. [ |
|---|---|---|---|---|---|---|---|
| Total predictors in model | 6 | 3 | 5 | 6 | 3 | 6 | |
| Absence of asthma | 1 | ||||||
| Duration of illness | 1 | ||||||
| Chest pain | 1 | ||||||
| Coryza (absence) | 3 | ||||||
| Cough (dry) | 2 | ||||||
| Diarrhea | 1 | ||||||
| Dyspnea | 2 | ||||||
| Fever | 1 | ||||||
| Myalgia | 1 | ||||||
| Phlegm | 1 | ||||||
| Sore throat | 1 | ||||||
| Sweats (night) | 1 | ||||||
| Crackles | 4 | ||||||
| Diminished breath sounds | 2 | ||||||
| Fever | 5 | ||||||
| Tachycardia | 2 | ||||||
| Tachypnea | 1 |
• = predictor present
*combined predictor.
Fig 1PRISMA flow diagram of the selection process of IPD used for external validation of prediction models [39].
Baseline characteristics of included individual patient datasets used in the external validation of prediction models for pneumonia in primary care setting (numbers are percentages [%] per dataset or specified otherwise).
| Characteristic | Validation dataset | ||||||||
|---|---|---|---|---|---|---|---|---|---|
| Melbye et al. [ | Hopstaken et al. [ | Flanders et al. [ | Graffelman et al. [ | Holm et al. [ | Rainer et al. [ | Steurer et al. [ | van Vugt et al. [ | All datasets | |
| Setting | OHD | GP | ED/AC | GP | GP | ED | GP, ED | GP | AC/ED/GP/OHD |
| Number of patients | 402 | 243 | 168 | 129 | 364 | 561 | 621 | 2820 | 5308 |
| Pneumonia | 5% | 13% | 12% | 20% | 13% | 43% | 21% | 5% | 12% |
| Age, mean (SD) | 33 (14) | 52 (16) | 40 (16) | 50 (14) | 50 (16) | 53 (22) | 47 (16) | 50 (17) | 49 (18) |
| Gender, Male | 41% | 47% | 41% | 47% | 49% | 53% | 50%1 | 40% | 44% |
| Duration illness in days, mean (SD) | 10 (14) | Categorized | 7 (5) | 9 (6) | -- | 17 (9) | 7 (10) | 10 (10) | 8,4 (10) |
| Smoker | 56% | 33% | 11% | 36% | 45% | 17% | 29% | 28% | 30% |
| Asthma | 10% | 19% | 11% | 6% | 8% | -- | -- | 10% | 10% |
| Cough | 91% | 92% | 100% | 98% | 98% | 88% | 97% | 100% | 97% |
| Chest pain (lateral) | 53% | 60% | 40% | 23% | 64% | 40% | 29% | 46% | 45% |
| Coryza | 80% | 38% | 69% | 59% | -- | 50% | -- | 71% | 67% |
| Diarrhea | -- | 8% | 14% | 24% | -- | 9% | -- | 7% | 8% |
| (Daily) Fever, subjective | 31% | 35% | 59% | 85% | 42% | 83% | 56% | 35% | 47% |
| Dyspnea | 69% | 77% | 51% | 76% | 72% | 56% | 36% | 57% | 57% |
| Myalgia | 54% | 62% | 55% | 59% | -- | 50% | -- | 50% | 52% |
| Sore throat | 73% | 39% | 65% | 39% | -- | 50% | -- | -- | 55% |
| Phlegm | 88% | 55% | 55% | 79% | 81% | 77% | 49% | 79% | 75% |
| (Night) Sweats | 84% | 61% | 58% | -- | -- | 42% | -- | -- | 60% |
| Crackles | 11% | 21% | 9% | 60% | -- | -- | 20% | 9% | 573 |
| Diminished breath sounds | 5% | -- | 17% | 12% | -- | -- | 12% | 13% | 13% |
| Heart rate, p.m. (SD) | 79 (13) | -- | 85 (19) | 82 (11) | 81 (15) | 98 (18) | -- | 77 (12) | 81 (15) |
| Respiratory rate, p.m. (SD) | -- | Categorized | 18 (4) | 21 (4) | 19 (4) | 19 (3) | 17 (6) | 17 (4) | 18 (4) |
| Temperature, C° (SD) | 37.3 (0.7) | 37.5 (0.8) | 37.3 (0.8) | 37.9 (0.7) | 37.4 (0.6) | 37.8 (1.1) | 37.4 (1) | 36.7 (0.6) | 37.1 (1) |
OHD = Out of Hours Department, GP = General Practitioner, ED = Emergency Department, AC = Ambulatory Clinic
1Data from original publication
2 Categorized as ≤2, 3–7, 8–28 days
3Categorized as >20 p.m.
"--" = Variable missing
Discriminative performance of pneumonia prediction models per dataset, measured as Area Under the ROC Curve (AUC) and as pooled AUC in all suited individual patient data (IPD).
| Model | Validation dataset | Development AUC (95% CI) | Pooled AUC (95% CI) | Patients in IPD /development (N =) | |||||||
|---|---|---|---|---|---|---|---|---|---|---|---|
| Melbye et al. [ | Hopstaken et al. [ | Flanders et al. [ | Graffelman et al. [ | Holm et al. [ | Rainer et al. [ | Steurer et al. [ | van Vugt et al. [ | ||||
| Van Vugt et al. [ | 0.78 | X | 0.89 | 0.60 | X | X | X | D | 0.70 (0.65–0.75) | 0.79 (0.74–0.85) | 699/2820 |
| Heckerling et al. [ | 0.69 | X | 0.89 | 0.62 | X | X | X | 0.66 | 0.82 (0.78–0.86) | 0.72 (0.68–0.76) | 3519/1134 |
| Diehr et al. [ | X | 0.57 | 0.76 | X | X | 0.64 | X | X | NA | 0.65 (0.61–0.68) | 972/474 |
| Singal et al. [ | 0.68 | 0.62 | 0.81 | 0.63 | 0.62 | X | 0.61 | 0.64 | 0.73 (0.69–0.77) | 0.64 (0.61–0.67) | 4747/255 |
| Melbye et al. [ | D | 0.57 | 0.62 | 0.49 | X | X | X | X | 0.75 (0.66–0.84) | 0.56 (0.49–0.63) | 540/402 |
| Hopstaken et al. [ | X | D | 0.58 | 0.61 | X | 0.52 | X | 0.56 | 0.70 (0.59–0.80 | 0.53 (0.50–0.56) | 3678/243 |
X = Model not validated in dataset due to missing predictors, D = Development dataset (AUCs shown under “Development”), NA = Not available (none reported in development study)
* 95% CI not available in original study report (recalculated in original dataset)
†AUC of Development dataset (“D”) not included.
Fig 2Graphic representation of model performance relative to dataset average AUC, measured as delta AUC.
Each point represents the performance of an individual model relative to the average performance of all models per dataset (deltaAUC, calculated as individual model AUC minus [–] the mean AUC of dataset). The figure shows how the discriminative performance per model, in the datasets in which it could be validated, is compared to the discriminative performance of the other models in that same dataset. For example, we see that the model by van Vugt et al. performs above average in all datasets in which it could be validated (i.e. Graffelman et al., Melbye et al, and Flanders et al). Furthermore, by studying the figure more closely, we can see the order of what model performed best in what dataset. For example, the models by van Vugt et al. and Heckerling et al. perform best in the dataset by Flanders et al., followed by the models by Singal et al., Diehr et al., Melbye et al. and Hopstaken et al.
Fig 3Calibration plots of prediction models clustered per risk group with low (0–10%), intermediate (10–30%) and high (30–100%) predicted probabilities.
Calibration results are presented for each validation dataset where the model could be validated. Plots show how well the predicted probabilities (x-axis) agree with observed probabilities (y-axis). For perfect agreement, the calibration curve falls on the ideal diagonal line (optimal calibration). Two vertical cut-off lines for 10% and 30% risk of pneumonia are depicted. (A) Calibration plot of the model by van Vugt et al. (B) Calibration plot of the model by Singal et al. (C) Calibration plot of the model by Hopstaken et al. (D) Calibration plot of the model by Heckerling et al. (E) Calibration plot of the model by Diehr et al. (F) Calibration plot of the model by Melbye et al.