| Literature DB >> 26116616 |
Yemisi Takwoingi1, Boliang Guo2, Richard D Riley3, Jonathan J Deeks1.
Abstract
Hierarchical models such as the bivariate and hierarchical summary receiver operating characteristic (HSROC) models are recommended for meta-analysis of test accuracy studies. These models are challenging to fit when there are few studies and/or sparse data (for example zero cells in contingency tables due to studies reporting 100% sensitivity or specificity); the models may not converge, or give unreliable parameter estimates. Using simulation, we investigated the performance of seven hierarchical models incorporating increasing simplifications in scenarios designed to replicate realistic situations for meta-analysis of test accuracy studies. Performance of the models was assessed in terms of estimability (percentage of meta-analyses that successfully converged and percentage where the between study correlation was estimable), bias, mean square error and coverage of the 95% confidence intervals. Our results indicate that simpler hierarchical models are valid in situations with few studies or sparse data. For synthesis of sensitivity and specificity, univariate random effects logistic regression models are appropriate when a bivariate model cannot be fitted. Alternatively, an HSROC model that assumes a symmetric SROC curve (by excluding the shape parameter) can be used if the HSROC model is the chosen meta-analytic approach. In the absence of heterogeneity, fixed effect equivalent of the models can be applied.Entities:
Keywords: Diagnostic accuracy; HSROC model; bivariate model; diagnostic odds ratio; hierarchical models; meta-analysis; random effects; sensitivity; sparse data; specificity
Mesh:
Year: 2015 PMID: 26116616 PMCID: PMC5564999 DOI: 10.1177/0962280215592269
Source DB: PubMed Journal: Stat Methods Med Res ISSN: 0962-2802 Impact factor: 3.021
Figure 1.Forest plot of sensitivity and specificity estimates from studies included in the two motivating examples. FN: false negative; FP: false positive; TN: true negative; TP: true positive.
Summary accuracy measures obtained from different meta-analytic models applied to the two motivating examples.
| Meta-analytic model | Sensitivity (95% CI) | Specificity (95% CI) | LR + (95% CI) | LR − (95% CI) | DOR (95% CI) |
|---|---|---|---|---|---|
| Non-contrast CT for appendicitis ( | |||||
| Univariate fixed effect logistic regression | 93.8 (90.6, 96.0) | 96.9 (95.1, 98.0) | 30 (19, 48) | 0.06 (0.04, 0.10) | 471 (244, 907) |
| Univariate random effects logistic regression | 93.8 (90.8, 95.9) | 97.4 (93.7, 99.0) | 37 (14, 93) | 0.06 (0.04, 0.10) | 579 (205, 1635) |
| Bivariate random effects model | NE | NE | NE | NE | NE |
| Complete HSROC | 94.1 (88.5, 97.0) | 97.8 (92.6, 99.4) | 42 (12, 148) | 0.06 (0.03, 0.12) | 700 (130, 3771) |
| Symmetric HSROC | 94.1 (88.2, 97.2) | 97.5 (94.1, 99.0) | 38 (15, 93) | 0.06 (0.03, 0.13) | 628 (149, 2657) |
| Fixed accuracy | 93.8 (90.1, 96.2) | 96.9 (94.7, 98.2) | 30 (18, 51) | 0.06 (0.04, 0.10) | 471 (223, 995) |
| Fixed threshold | 94.1 (88.9, 96.9) | 97.8 (93.0, 99.3) | 42 (13, 140) | 0.06 (0.03, 0.12) | 701 (141, 3485) |
| Fixed accuracy and threshold | 93.8 (90.6, 96.0) | 96.9 (95.1, 98.0) | 30 (19, 48) | 0.06 (0.04, 0.10) | 471 (244, 907) |
| Symmetric fixed accuracy and threshold | 93.8 (90.6, 96.0) | 96.9 (95.1, 98.0) | 30 (19, 48) | 0.06 (0.04, 0.10) | 471 (244, 907) |
| CT for scaphoid fractures ( | |||||
| Univariate fixed effect logistic regression | 93.2 (78.8, 98.1) | 100 | 2.26E + 07 (NE) | 0.07 (0.02, 0.23) | 3.31E + 08 (NE) |
| Univariate random effects logistic regression | 99.0 (3.7, 100) | 100 | 3.25E + 07 (NE) | 0.01 (3.94E − 06, 24) | 3.34E + 09 (NE) |
| Bivariate random effects | NE | NE | NE | NE | NE |
| Complete HSROC | 99.1 (2.2, 100) | 100 | 2.07E + 09 (NE) | 0.01 (2.14E − 06, 41) | 2.21E + 11 (NE) |
| Symmetric HSROC | 98.6 (12.9, 100) | 100 (0, 100) | 54,762 (1.29E − 05, 2.33E + 14) | 0.01 (3.36E − 05, 6.11) | 3,818,334 (0.0001, 1.33E + 17) |
| Fixed accuracy | 99.0 (6.4, 100) | 100 | 1.59E + 11 (NE) | 0.01 (7.03E − 06, 13) | 1.64E + 13 (NE) |
| Fixed threshold | 99.0 (6.4, 100) | 100 | 2.37E + 09 (NE) | 0.01 (7.03E − 06, 13) | 2.43E + 11 (NE) |
| Fixed accuracy and threshold | 93.2 (78.8, 98.1) | 100 | 7.80E + 09 (NE) | 0.07 (0.02, 0.23) | 1.14E + 11 (NE) |
| Symmetric fixed accuracy and threshold | 93.2 (78.8, 98.1) | 100 | 2.27E + 07 (NE) | 0.07 (0.02, 0.23) | 3.34E + 08 (NE) |
n: number of studies in the meta-analysis, NE: not estimable.
Figure 2.Profile log-likelihood function of the covariance parameter in the bivariate model applied to the appendicitis example.
Scenarios evaluated in the simulation.[a]
| Scenario | Prevalence (%) | DOR | Heterogeneity in accuracy and threshold | Asymmetry in SROC curve |
|---|---|---|---|---|
| 1–3 | 5 | 38 | No | No |
| 4–6 | 25 | 38 | No | No |
| 7–9 | 50 | 38 | No | No |
| 10–12 | 5 | 38 | Yes | No |
| 13–15 | 25 | 38 | Yes | No |
| 16–18 | 50 | 38 | Yes | No |
| 19–21 | 5 | 231 | No | No |
| 22–24 | 25 | 231 | No | No |
| 25–27 | 50 | 231 | No | No |
| 28–30 | 5 | 231 | Yes | No |
| 31–33 | 25 | 231 | Yes | No |
| 34–36 | 50 | 231 | Yes | No |
| 37–39 | 5 | 15 | Yes | Yes |
| 40–42 | 25 | 15 | Yes | Yes |
| 43–45 | 50 | 15 | Yes | Yes |
| 46–48 | 5 | 59 | Yes | Yes |
| 49–51 | 25 | 59 | Yes | Yes |
| 52–54 | 50 | 59 | Yes | Yes |
Each subset of 3 scenarios corresponds to 5, 10 and 20 studies.
Convergence and estimability of the complete HSROC model applied to 10,000 datasets in 36 different scenarios.[a]
| DOR |
| Prevalence (%) | No heterogeneity in accuracy and threshold | Heterogeneity in accuracy and threshold | ||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Meta- analyses with a zero cell[ | Successful model fit (positive definite) (%) | % | % | % | Meta- analyses with a zero cell[ | Successful model fit (positive definite) (%) | % | % | % | |||
| 38 | 5 | 5 | 48 | 18 | 14 | 2.6 | 1.8 | 50 | 36 | 21 | 0.6 | 14 |
| 38 | 5 | 25 | 50 | 18 | 15 | 1.7 | 1.6 | 51 | 54 | 31 | 0.2 | 22 |
| 38 | 5 | 50 | 52 | 18 | 15 | 1.4 | 2.0 | 53 | 60 | 34 | 0.2 | 26 |
| 38 | 10 | 5 | 60 | 25 | 17 | 3.7 | 4.4 | 60 | 52 | 21 | 0.2 | 31 |
| 38 | 10 | 25 | 65 | 24 | 18 | 2.4 | 3.8 | 67 | 77 | 24 | 0.0 | 54 |
| 38 | 10 | 50 | 72 | 25 | 18 | 2.6 | 4.1 | 73 | 85 | 24 | 0.0 | 61 |
| 38 | 20 | 5 | 75 | 32 | 20 | 5.8 | 6.4 | 77 | 70 | 20 | 0.0 | 50 |
| 38 | 20 | 25 | 77 | 30 | 20 | 3.7 | 6.3 | 78 | 93 | 12 | 0.0 | 82 |
| 38 | 20 | 50 | 82 | 28 | 20 | 3.1 | 5.8 | 84 | 97 | 10 | 0.0 | 88 |
| 231 | 5 | 5 | 96 | 18 | 11 | 5.7 | 1.4 | 97 | 30 | 18 | 2.4 | 10 |
| 231 | 5 | 25 | 97 | 21 | 14 | 5.0 | 1.6 | 98 | 43 | 29 | 2.1 | 13 |
| 231 | 5 | 50 | 99 | 21 | 15 | 4.7 | 1.9 | 99 | 48 | 31 | 2.3 | 14 |
| 231 | 10 | 5 | 99 | 23 | 13 | 7.7 | 3.0 | 99 | 41 | 21 | 1.3 | 19 |
| 231 | 10 | 25 | 99 | 29 | 17 | 7.6 | 4.1 | 100 | 62 | 29 | 0.9 | 33 |
| 231 | 10 | 50 | 100 | 29 | 18 | 6.7 | 4.1 | 100 | 71 | 32 | 0.9 | 39 |
| 231 | 20 | 5 | 100 | 29 | 15 | 9.7 | 4.8 | 100 | 54 | 20 | 0.3 | 34 |
| 231 | 20 | 25 | 100 | 35 | 20 | 9.2 | 6.2 | 100 | 80 | 23 | 0.2 | 57 |
| 231 | 20 | 50 | 100 | 35 | 20 | 9.2 | 6.5 | 100 | 88 | 22 | 0.1 | 66 |
: estimated between study correlation; DOR: diagnostic odds ratio; N: number of studies.
All results are presented as percentages and are based on 10,000 meta-analysis datasets.
The percentage of meta-analyses out of 10,000 where at least one study included a zero cell.
Performance of all meta-analytic models in estimating the log DOR for scenarios with a DOR of 231.
| Studies | Heterogeneity[ | Meta-analytic model | 5% prevalence | 25% prevalence | 50% prevalence | |||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
|
| Bias (%)[ | MSE | Coverage (%) |
| Bias (%) | MSE | Coverage (%) |
| Bias (%) | MSE | Coverage (%) | |||
| 5 | No | Complete HSROC | 1767 | 13.5 | 6.24 | 98.9 | 2086 | 4.20 | 0.41 | 98.8 | 2114 | 3.27 | 0.23 | 97.7 |
| Symmetric HSROC | 1276 | 4.32 | 4.26 | 95.5 | 2753 | 2.28 | 0.26 | 97.1 | 4179 | 2.58 | 0.20 | 97.4 | ||
| FA | 1932 | 36.2 | 31.9 | 98.7 | 2228 | 4.86 | 0.59 | 98.5 | 2232 | 3.64 | 0.26 | 97.4 | ||
| FT | 1792 | 37.8 | 32.6 | 98.9 | 2174 | 4.77 | 0.63 | 99.3 | 2288 | 3.26 | 0.27 | 98.9 | ||
| FAT | 9798 | 30.1 | 28.0 | 97.0 | 9556 | 1.82 | 0.37 | 95.9 | 4722 | 1.24 | 0.17 | 94.5 | ||
| SFAT | 10,000 | 37.0 | 40.7 | 88.5 | 10,000 | 1.69 | 0.42 | 95.8 | 10,000 | 1.19 | 0.15 | 95.5 | ||
| UREM | 3173 | 11.2 | 5.67 | 98.5 | 2869 | 3.69 | 0.40 | 97.5 | 2883 | 2.71 | 0.22 | 97.4 | ||
| 5 | Yes | Complete HSROC | 3020 | 40.2 | 41.6 | 97.6 | 4339 | 2.43 | 0.57 | 95.8 | 4772 | 1.26 | 0.30 | 94.4 |
| Symmetric HSROC | 3442 | 20.6 | 20.6 | 94.2 | 6331 | 0.76 | 0.33 | 93.6 | 7594 | 0.58 | 0.25 | 93.3 | ||
| FA | 5490 | 40.6 | 42.6 | 97.1 | 6222 | 2.97 | 0.82 | 95.4 | 6331 | 1.57 | 0.31 | 93.9 | ||
| FT | 2976 | 51.9 | 51.4 | 98.2 | 2266 | 3.66 | 1.12 | 97.7 | 2171 | 2.26 | 0.36 | 97.9 | ||
| FAT | 9691 | 22.3 | 22.9 | 91.8 | 9288 | −2.16 | 0.66 | 85.1 | 4833 | −3.18 | 0.30 | 79.4 | ||
| SFAT | 10,000 | 28.3 | 33.0 | 84.2 | 10,000 | −2.38 | 0.77 | 84.6 | 10,000 | −3.23 | 0.30 | 80.7 | ||
| UREM | 5833 | 11.8 | 6.53 | 97.8 | 6311 | 2.01 | 0.43 | 96.8 | 6573 | 1.10 | 0.30 | 96.4 | ||
| 10 | No | Complete HSROC | 2325 | 7.70 | 0.94 | 99.0 | 2903 | 2.42 | 0.15 | 97.2 | 2862 | 1.90 | 0.10 | 97.0 |
| Symmetric HSROC | 1924 | 1.70 | 0.37 | 97.3 | 3581 | 1.60 | 0.11 | 96.4 | 5383 | 1.50 | 0.09 | 96.9 | ||
| FA | 2175 | 11.0 | 4.52 | 98.7 | 2569 | 2.58 | 0.15 | 97.3 | 2719 | 1.96 | 0.10 | 96.4 | ||
| FT | 2177 | 10.6 | 4.55 | 98.9 | 2670 | 2.44 | 0.15 | 98.3 | 2666 | 1.72 | 0.10 | 98.2 | ||
| FAT | 9881 | 5.55 | 3.45 | 96.9 | 9594 | 0.71 | 0.09 | 95.5 | 4596 | 0.55 | 0.07 | 95.5 | ||
| SFAT | 10,000 | 6.40 | 4.96 | 95.9 | 10,000 | 0.62 | 0.09 | 95.4 | 10,000 | 0.51 | 0.07 | 95.4 | ||
| UREM | 5612 | 6.59 | 1.02 | 98.4 | 5417 | 1.71 | 0.12 | 97.0 | 5311 | 1.30 | 0.08 | 96.7 | ||
| 10 | Yes | Complete HSROC | 4129 | 9.55 | 4.58 | 98.1 | 6248 | 0.93 | 0.16 | 95.2 | 7136 | 0.61 | 0.13 | 94.5 |
| Symmetric HSROC | 5895 | 2.72 | 1.89 | 95.8 | 8488 | 0.20 | 0.14 | 93.7 | 9387 | 0.35 | 0.12 | 93.4 | ||
| FA | 6772 | 9.14 | 5.04 | 96.9 | 7776 | 0.45 | 0.15 | 93.6 | 8088 | 0.30 | 0.12 | 92.1 | ||
| FT | 2759 | 10.8 | 6.67 | 98.0 | 1840 | 0.60 | 0.17 | 96.9 | 1579 | 0.27 | 0.13 | 97.5 | ||
| FAT | 9775 | −0.31 | 2.42 | 88.0 | 9301 | −4.19 | 0.23 | 77.3 | 4765 | −4.62 | 0.24 | 71.4 | ||
| SFAT | 10,000 | 0.15 | 3.34 | 87.0 | 10,000 | −4.38 | 0.24 | 76.7 | 10,000 | −4.45 | 0.23 | 72.2 | ||
| UREM | 8302 | 5.30 | 1.42 | 97.4 | 8942 | 0.46 | 0.16 | 96.6 | 9237 | 0.36 | 0.12 | 97.1 | ||
| 20 | No | Complete HSROC | 2915 | 4.87 | 0.40 | 99.4 | 3513 | 1.62 | 0.07 | 97.0 | 3513 | 1.28 | 0.05 | 95.9 |
| Symmetric HSROC | 2654 | 1.16 | 0.16 | 96.1 | 4359 | 1.01 | 0.05 | 96.1 | 6149 | 1.04 | 0.04 | 96.3 | ||
| FA | 2425 | 4.77 | 0.44 | 98.5 | 2969 | 1.60 | 0.06 | 96.5 | 3076 | 1.19 | 0.04 | 95.7 | ||
| FT | 2439 | 4.47 | 0.39 | 99.1 | 2888 | 1.53 | 0.07 | 97.6 | 2996 | 1.23 | 0.05 | 97.1 | ||
| FAT | 9917 | 1.25 | 0.17 | 95.7 | 9615 | 0.37 | 0.04 | 95.2 | 4528 | 0.34 | 0.03 | 94.7 | ||
| SFAT | 10,000 | 1.21 | 0.17 | 95.7 | 10,000 | 0.34 | 0.04 | 95.1 | 10,000 | 0.29 | 0.03 | 94.9 | ||
| UREM | 8094 | 3.64 | 0.33 | 97.1 | 7930 | 1.03 | 0.05 | 96.5 | 7963 | 0.83 | 0.04 | 96.1 | ||
| 20 | Yes | Complete HSROC | 5406 | 3.36 | 0.39 | 97.2 | 8011 | 0.43 | 0.07 | 95.5 | 8843 | 0.14 | 0.06 | 94.3 |
| Symmetric HSROC | 8040 | 0.51 | 0.17 | 95.0 | 9679 | 0.14 | 0.07 | 94.1 | 9905 | 0.06 | 0.06 | 93.7 | ||
| FA | 7767 | 1.38 | 0.33 | 95.3 | 8930 | −0.34 | 0.07 | 92.4 | 9179 | −0.39 | 0.06 | 90.7 | ||
| FT | 2054 | 0.95 | 0.24 | 95.9 | 992 | −0.68 | 0.07 | 95.8 | 781 | −0.95 | 0.06 | 95.6 | ||
| FAT | 9758 | −4.37 | 0.30 | 81.7 | 9293 | −4.88 | 0.20 | 66.0 | 4559 | −5.19 | 0.21 | 58.3 | ||
| SFAT | 10,000 | −4.46 | 0.30 | 81.4 | 10,000 | −5.01 | 0.21 | 64.4 | 10,000 | −5.12 | 0.21 | 57.6 | ||
| UREM | 9455 | 1.64 | 0.29 | 96.4 | 9869 | 0.11 | 0.07 | 97.4 | 9951 | 0.04 | 0.06 | 97.2 | ||
DOR: diagnostic odds ratio; FA: fixed accuracy HSROC model; FAT: fixed accuracy and threshold HSROC model; FT: fixed threshold HSROC model; MSE: mean square error; N: number of meta-analyses out of 10,000 where hierarchical models successfully converged; SFAT: symmetric fixed accuracy and threshold HSROC model; UREM: univariate random effects logistic regression model.
Heterogeneity in accuracy and threshold.
Bias is presented as a percentage of the true value of the log diagnostic odds ratio.
Recommendations for selecting alternative models when bivariate or HSROC models fail.[a]
|
| ||
| Visual inspection of forest plots and SROC plots may help to identify whether heterogeneity exists. For example, one may observe complete or near complete lack of variability between estimates of sensitivity and/or specificity, indicating no heterogeneity in one or both parameters (sensitivity and/or specificity equal to 100%), or conversely wide variability in observed estimates (e.g. non-overlapping confidence intervals) indicating large heterogeneity. | ||
|
| ||
| Select a simpler hierarchical fixed effect or random effects model based on inference of interest (summary points or SROC curve), observation from the data plot, and previous output from the failed bivariate or HSROC model | ||
| Note: when prevalence is very low and the number of studies is very small, there is potential for bias and the results of the meta-analysis should be interpreted with caution. | ||
| Focus of inference | ||
| Heterogeneity | Summary point (summary sensitivity and specificity) | SROC curve |
| Variability in sensitivity and/or specificity between studies observed on the plot | Univariate random effects logistic regression models | Symmetric HSROC model |
| Minimal or no variability in sensitivity and/or specificity between studies observed on the plot | Univariate fixed effect logistic regression models[ | Symmetric fixed accuracy and threshold model |
| A symmetric SROC curve can be described using the diagnostic odds ratio (exponent of the value of the accuracy parameter). | ||
| Section 4.1.3 contains suggestions for facilitating convergence of hierarchical models. | ||
Bivariate or HSROC models either failed to converge or converged (i.e. met the convergence criterion) but gave unreliable estimates (e.g. with no standard errors, or dependent on starting values).
The symmetric fixed accuracy threshold model is equivalent to simultaneously fitting two univariate fixed effect logistic regression models for sensitivity and specificity.