| Literature DB >> 33978306 |
Dipika Neupane1,2, Brooke Levis1,2,3, Parash M Bhandari1,2, Brett D Thombs1,2,4,5,6,7,8, Andrea Benedetti2,5,9.
Abstract
OBJECTIVES: Selectively reported results from only well-performing cutoffs in diagnostic accuracy studies may bias estimates in meta-analyses. We investigated cutoff reporting patterns for the Patient Health Questionnaire-9 (PHQ-9; standard cutoff 10) and Edinburgh Postnatal Depression Scale (EPDS; no standard cutoff, commonly used 10-13) and compared accuracy estimates based on published cutoffs versus all cutoffs.Entities:
Keywords: diagnostic test accuracy; individual participant data meta-analysis; meta-analysis; publication bias; selective cutoff reporting
Mesh:
Year: 2021 PMID: 33978306 PMCID: PMC8412225 DOI: 10.1002/mpr.1873
Source DB: PubMed Journal: Int J Methods Psychiatr Res ISSN: 1049-8931 Impact factor: 4.182
Comparison of accuracy results from IPDMA of PHQ‐9 and EPDS with the published dataset only versus the full dataset
| PHQ‐9 | ||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|
|
|
| |||||||||||
| Cutoff | No. of studies | No. of participants | No of MD cases | Sensitivity | 95% CI | Specificity | 95% CI | Sensitivity | 95% CI | Specificity | 95% CI | |
| 5 | 5 | 1663 | 367 | 0.91 | 0.86, 0.94 | 0.68 | 0.55, 0.79 | 0.97 | 0.94, 0.98 | 0.54 | 0.48, 0.60 | |
| 6 | 6 | 2193 | 377 | 0.87 | 0.77, 0.93 | 0.72 | 0.61, 0.82 | 0.96 | 0.92, 0.97 | 0.62 | 0.56, 0.68 | |
| 7 | 6 | 2050 | 438 | 0.87 | 0.75, 0.93 | 0.72 | 0.60, 0.81 | 0.94 | 0.90, 0.97 | 0.69 | 0.63, 0.74 | |
| 8 | 12 | 5798 | 720 | 0.87 | 0.78, 0.92 | 0.77 | 0.70, 0.82 | 0.92 | 0.87, 0.95 | 0.75 | 0.70, 0.79 | |
| 9 | 14 | 5283 | 766 | 0.85 | 0.76, 0.91 | 0.81 | 0.75, 0.85 | 0.87 | 0.81, 0.91 | 0.80 | 0.76, 0.84 | |
| 10 | 26 | 10 593 | 1378 | 0.82 | 0.74, 0.88 | 0.86 | 0.83, 0.89 | 0.83 | 0.76, 0.88 | 0.85 | 0.81, 0.88 | |
| 11 | 15 | 5292 | 767 | 0.83 | 0.72, 0.91 | 0.88 | 0.83, 0.92 | 0.76 | 0.69, 0.82 | 0.88 | 0.85, 0.91 | |
| 12 | 16 | 6188 | 832 | 0.73 | 0.63, 0.81 | 0.91 | 0.87, 0.94 | 0.69 | 0.62, 0.75 | 0.91 | 0.88, 0.93 | |
| 13 | 9 | 2104 | 455 | 0.70 | 0.59, 0.79 | 0.95 | 0.87, 0.98 | 0.60 | 0.54, 0.67 | 0.93 | 0.91, 0.95 | |
| 14 | 5 | 1231 | 277 | 0.63 | 0.47, 0.76 | 0.96 | 0.89, 0.99 | 0.54 | 0.47, 0.61 | 0.95 | 0.93, 0.96 | |
| 15 | 6 | 3546 | 374 | 0.47 | 0.37, 0.59 | 0.97 | 0.97, 0.98 | 0.47 | 0.40, 0.54 | 0.96 | 0.95, 0.97 | |
Abbreviations: CI, Confidence Interval; EPDS, Edinburgh Postnatal Depression Scale; IPDMA, Individual Participant Data Meta‐analysis; MD, Major Depression.
For these cutoffs, one sample proportion test with continuity correction was used to estimate sensitivity and specificity and confidence intervals.
FIGURE 1Receiver operating characteristic (ROC) curves plot for the diagnostic accuracy of Patient Health Questionnaire‐9 (PHQ‐9). The points in the ROC curves indicate each of the PHQ‐9 cutoffs between 5 (right) and 15 (left)
FIGURE 2Receiver operating characteristic (ROC) curves plot for the diagnostic accuracy of Edinburgh Postnatal Depression Scale (EPDS). The points in the ROC curves indicate each of the EPDS cutoffs between 5 (right) and 18 (left)
Differences in estimated sensitivity and specificity using the published dataset only versus the full dataset for PHQ‐9 and EPDS
| PHQ‐9 | ||||||
|---|---|---|---|---|---|---|
|
|
| |||||
|
|
|
|
|
| ||
|
|
|
|
| |||
| 5 | 14 | 23 | −0.06 | −0.13, 0.00 | 0.14 | 0.02, 0.26 |
| 6 | 19 | 24 | −0.09 | −0.18, −0.01 | 0.10 | 0.00, 0.20 |
| 7 | 17 | 28 | −0.07 | −0.20, 0.00 | 0.03 | −0.09, 0.15 |
| 8 | 49 | 45 | −0.05 | −0.14, 0.02 | 0.02 | −0.03, 0.08 |
| 9 | 45 | 48 | −0.02 | −0.11, 0.05 | 0.01 | −0.04, 0.05 |
| 10 | 90 | 87 | −0.01 | −0.05, 0.01 | 0.01 | 0.00, 0.04 |
| 11 | 45 | 48 | 0.07 | 0.00, 0.13 | 0.00 | −0.03, 0.03 |
| 12 | 53 | 52 | 0.04 | −0.03, 0.09 | 0.00 | −0.02, 0.03 |
| 13 | 18 | 29 | 0.10 | −0.02, 0.20 | 0.02 | −0.04, 0.05 |
| 14 | 10 | 17 | 0.09 | −0.07, 0.23 | 0.01 | −0.04, 0.04 |
| 15 | 30 | 24 | 0.00 | −0.12, 0.13 | 0.01 | 0.00, 0.03 |
Note: For PHQ‐9, 15 iterations (1.5%) that did not produce difference estimates were removed prior to determining the bootstrap CI.
For EPDS, 284 iterations (28.4%) for cutoffs 5‐6, 32 iterations (3.2%) for cutoffs 7‐15 and 275 iterations (27.5%) for cutoff 16 that did not produce difference estimates were removed prior to determining bootstrap CIs. Only 1 study published EPDS cutoffs 17 and 18, so only participant level resampling was done for published dataset.
Abbreviations: CI, Confidence Interval, EPDS: Edinburgh Postnatal Depression Scale, PHQ‐9, Patient Health Questionnaire‐9.
FIGURE 3Pattern of cutoff reporting for PHQ‐9 studies. Cells shaded in gray represent cutoff points for which diagnostic accuracy results are reported in the primary studies. “O” represents the optimal cutoff for PHQ‐9 explicitly stated in the studies except for Inagaki et al. (2013), Pence et al. (2012), Arroll (2010), Cholera (2014), Amoozegar (2017), which did not identify an optimal cutoff. For those, Youden's J optimal was calculated from published accuracies. For Gjerdingen (2009) and Vöhringer (2013), only one cutoff was reported without stating whether it was optimal or not. van Steenbergen‐Weijenburg 2010 reported 10 and 12 as optimal cutoffs. Studies that reported accuracies for cutoffs beyond presented in the table: Inagaki et al. (2013) reported the accuracy for cutoffs 4–13, Thombs (2008) reported the accuracy for cutoffs 1–10, Lambert et al. (2015) reported the accuracy for cutoffs 5, 9, 10, 15, 20, Hyphantis (2011) reported the accuracy for cutoffs 4–16, Osorio (2009) reported the accuracy for cutoffs 10–21. All the reported cutoffs were included while calculating the mean of reported cutoffs though they are not shown in the figure
FIGURE 4Pattern of cutoff reporting for EPDS studies. Cells shaded in gray represent cutoff points for which diagnostic accuracy results are reported in the primary studies. “O” represents the optimal cutoff for EPDS explicitly stated in the studies except for Philips (2009), which did not identify an optimal cutoff. For Philips 2009, Youden's J optimal was calculated from published accuracies. For Bakare et al. (2014), Pawlby et al. (2008), Beck 2001 only one cutoff was reported without stating whether it was optimal or not. Studies that reported accuracies for cutoffs beyond presented in the table: Khalifa et al. (2015) reported accuracy for cutoffs 1–15, Vega‐Dienstmaier et al. (2002) reported the accuracy for cutoffs 1–26. All the reported cutoffs were included while calculating the mean of reported cutoffs though they are not shown in the figure