| Literature DB >> 31476256 |
Orestis Efthimiou1, Ian R White2.
Abstract
Standard models for network meta-analysis simultaneously estimate multiple relative treatment effects. In practice, after estimation, these multiple estimates usually pass through a formal or informal selection procedure, eg, when researchers draw conclusions about the effects of the best performing treatment in the network. In this paper, we present theoretical arguments as well as results from simulations to illustrate how such practices might lead to exaggerated and overconfident statements regarding relative treatment effects. We discuss how the issue can be addressed via multilevel Bayesian modelling, where treatment effects are modelled exchangeably, and hence estimates are shrunk away from large values. We present a set of alternative models for network meta-analysis, and we show in simulations that in several scenarios, such models perform better than the usual network meta-analysis model.Entities:
Keywords: hierarchical modelling; mixed treatment comparison; multiple testing; multiple treatments
Mesh:
Substances:
Year: 2019 PMID: 31476256 PMCID: PMC7003789 DOI: 10.1002/jrsm.1377
Source DB: PubMed Journal: Res Synth Methods ISSN: 1759-2879 Impact factor: 5.273
Figure 1Network of antidepressants (left panel) and neuroleptic drugs (right panel)
Figure 2Network structures explored in simulation studies. Star network (left) and fully connected network (right)
Overview of the data‐generating mechanisms for all scenarios we explored in our simulations
| Name | Description of Simulated Data | Type of Network | Random Effects | Studies per Comparison | Treatment Effect in Study |
|---|---|---|---|---|---|
| SIMULATION STUDY 1: zero treatment effects | |||||
| A.1 | 10 treatments. No treatment effects. | Star | ✘ | 1 |
|
| A.2 | Fully connected | ||||
| B.1 | 10 treatments. No treatment effects. | Star | ✓ | 3 |
|
| B.2 | Fully connected | ||||
| C | 2 treatments. No treatment effects. | – | ✘ | 5 |
|
| D | 3 treatments. No treatment effects. | Fully connected | ✘ | 1 |
|
| SIMULATION STUDY 2: nonzero treatment effects | |||||
| E.1 | 10 treatments. Equal effects of all active treatments vs a control (treatment 1). | Star (vs treatment 1) | ✘ | 1 |
|
| E.2 | Fully connected | ||||
| F.1 |
10 treatments. Treatments 1‐5 were worthless treatments (group 1). Treatments 6‐10 were effective treatments (group 2). | Star (vs treatment 1) | ✘ | 1 |
|
| F.2 | Fully connected | ||||
| G.1 | 10 treatments. All treatment effects in the network are nonzero. | Star (vs treatment 1) | ✘ | 1 |
For study
|
| G.2 | Fully connected | ||||
| H.1 | 10 treatments. 2 classes of treatments, 2‐5 (class 1) and 6‐10 (class 2). Treatments within each class had equal effects vs treatment 1. Treatment effects vs treatment 1 were set equal to 1(2) for class 1(2), respectively. | Star (vs treatment 1) | ✘ | 1 |
For study
|
| H.2 | Fully connected | ||||
Overview of results for simulation study 1 (zero true treatment effects in the network). For all scenarios except B.1 and B.2 we used the fixed effects version of all models
| # | Model | Mean Absolute Error (Basic Parameters) | Mean Bias (Basic Parameters) | Mean Estimate For Best Vs Worst Treatment ± SD | Mean Estimate Best Treatment In the Network Vs Reference ± SD | % of NMAs Showing with Confidence Nonzero Treatment Effects |
|---|---|---|---|---|---|---|
| A.1 | Model I (flat priors) | 0.57 | 0.01 | 2.16 ± 1.12 | 1.10 | 61.9% |
| Model I (informative priors) | 0.36 | 0.00 | 1.30 ± 0.78 | 0.66 | 35.1% | |
| Model II | 0.24 | 0.01 | 0.63 ± 0.58 | 0.32 | 8.2% | |
| Model III | 0.16 | 0.01 | 0.45 ± 0.51 | 0.23 | 2.4% | |
| A.2 | Model I (flat priors) | 0.22 | −0.02 | 0.59 ± 0.29 | 0.28 ± 0.25 | 61.6% |
| Model I (informative priors) | 0.19 | −0.01 | 0.56 ± 0.28 | 0.27 ± 0.24 | 55.8% | |
| Model II | 0.17 | −0.01 | 0.28 ± 0.23 | 0.13 ± 0.16 | 9.0% | |
| Model III | 0.07 | 0.00 | 0.17 ± 0.19 | 0.08 ± 0.14 | 2.3% | |
| B.1 | Model I (flat priors) | 0.37 | −0.01 | 1.32 ± 0.74 | 0.65 ± 0.52 | 42.6% |
| Model I (informative priors) | 0.28 | −0.01 | 1.01 ± 0.61 | 0.49 ± 0.43 | 32.9% | |
| Model II | 0.17 | −0.01 | 0.40 ± 0.39 | 0.19 ± 0.23 | 4.3% | |
| Model III | 0.11 | 0.00 | 0.29 ± 0.36 | 0.14 ± 0.22 | 0.7% | |
| B.2 | Model I (flat priors) | 0.17 | −0.01 | 0.43 ± 0.21 | 0.22 ± 0.18 | 62.6% |
| Model I (informative priors) | 0.15 | 0.00 | 0.41 ± 0.20 | 0.20 ± 0.18 | 60.2% | |
| Model II | 0.13 | 0.00 | 0.20 ± 0.17 | 0.10 ± 0.12 | 11.0% | |
| Model III | 0.05 | 0.00 | 0.13 ± 0.14 | 0.07 ± 0.11 | 1.9% | |
| C | Model I (flat priors) | 0.23 | −0.02 | 0.23 ± 0.28 | – | 5.8% |
| Model I (informative priors) | 0.21 | −0.02 | 0.21 ± 0.27 | – | 4.5% | |
| Model II | 0.22 | −0.02 | 0.22 ± 0.28 | – | 5.5% | |
| Model III | 0.16 | −0.02 | 0.16 ± 0.24 | – | 0.0% | |
| D | Model I (flat priors) | 0.48 | 0.00 | 0.45 ± 0.54 | 0.23 ± 0.27 | 14.5% |
| Model I (informative priors) | 0.32 | 0.00 | 0.32 ± 0.46 | 0.16 ± 0.23 | 8.5% | |
| Model II | 0.40 | 0.00 | 0.40 ± 0.51 | 0.20 ± 0.25 | 10.0% | |
| Model III | 0.24 | 0.00 | 0.24 ± 0.40 | 0.12 ± 0.20 | 2.3% |
Overview of results for simulation study 2 (nonzero true treatment effects in the network)
| # | Model | Mean Absolute Error (Basic Parameters) | Mean Bias (Basic Parameters) | Best vs Worst Treatment | Best vs Reference Treatment | % of NMAs Showing with Confidence Nonzero Treatment Effects Among Equally Effective Treatments | Power | ||
|---|---|---|---|---|---|---|---|---|---|
| Mean Estimate ± SD | True Value | Mean Estimate ± SD | True Value | ||||||
| E.1 | Model I | 0.57 | 0.01 | 2.37 ± 0.92 | 1.00 | 2.14 ± 0.77 | 1.00 | 58.9% | 36.7% |
| Model II | 0.24 | 0.01 | 1.32 ± 0.45 | 1.32 ± 0.45 | 3.0% | 69.7% | |||
| Model III | 0.32 | −0.11 | 1.37 ± 0.52 | 1.37 ± 0.52 | 13.0% | 46.4% | |||
| Model IV | 0.32 | 0.00 | 1.55 ± 0.56 | 1.53 ± 0.54 | 17.0% | 54.0% | |||
| Model V | 0.35 | −0.08 | 1.50 ± 0.56 | 1.49 ± 0.55 | 54.8% | 22.5% | |||
| E.2 | Model I | 0.22 | −0.02 | 1.28 ± 0.28 | 1.00 | 1.28 ± 0.28 | 1.00 | 55.6% | 93.2% |
| Model II | 0.17 | −0.02 | 1.08 ± 0.24 | 1.08 ± 0.24 | 2.4% | 98.8% | |||
| Model III | 0.32 | −0.27 | 0.92 ± 0.30 | 0.92 ± 0.30 | 28.5% | 76.1% | |||
| Model IV | 0.19 | −0.03 | 1.14 ± 0.56 | 1.14 ± 0.56 | 17.3% | 97.3% | |||
| Model V | 0.24 | −0.15 | 1.06 ± 0.28 | 1.06 ± 0.28 | 28.9% | 87.7% | |||
| F.1 | Model I | 0.68 | 0.01 | 2.69 ± 1.04 | 1.00 | 1.90 ± 0.74 | 1.00 | 51.5% | 22.7% |
| Model II | 0.46 | 0.00 | 1.30 ± 0.62 | 1.13 ± 0.51 | 26.4% | 11.7% | |||
| Model III | 0.39 | −0.08 | 1.20 ± 0.62 | 1.04 ± 0.51 | 14.8% | 9.2% | |||
| Model IV | 0.47 | 0.00 | 1.68 ± 0.69 | 1.33 ± 0.51 | 11.3% | 31.2% | |||
| Model V | 0.43 | −0.01 | 1.53 ± 0.63 | 1.30 ± 0.51 | 2.9% | 36.6% | |||
| F.2 | Model I | 0.60 | 0.00 | 1.46 ± 0.28 | 1.00 | 1.15 ± 0.28 | 1.00 | 48.0% | 94.0% |
| Model II | 0.57 | 0.01 | 1.32 ± 0.28 | 1.09 ± 0.27 | 39.9% | 92.4% | |||
| Model III | 0.21 | −0.06 | 1.28 ± 0.28 | 1.10 ± 0.25 | 37.6% | 92.2% | |||
| Model IV | 0.58 | 0.00 | 1.25 ± 0.24 | 1.10 ± 0.22 | 12.8% | 99.0% | |||
| Model V | 0.55 | 0.00 | 1.20 ± 0.22 | 1.15 ± 0.28 | 5.7% | 99.5% | |||
| G.1 | Model I | 0.61 | 0.01 | 2.73 ± 0.84 | 1.80 | 2.61 ± 0.76 | 1.80 | – | 16.5% |
| Model II | 0.32 | 0.00 | 1.79 ± 0.47 | 1.79 ± 0.47 | – | 17.9% | |||
| Model III | 0.36 | −0.10 | 1.93 ± 0.54 | 1.92 ± 0.53 | – | 16.0% | |||
| Model IV | 0.38 | 0.00 | 2.00 ± 0.53 | 1.99 ± 0.53 | – | 18.0% | |||
| Model V | 0.42 | −0.08 | 2.00 ± 0.54 | 1.99 ± 0.54 | – | 18.7% | |||
| G.2 | Model I | 0.30 | −0.01 | 1.88 ± 0.28 | 1.80 | 1.88 ± 0.28 | 1.80 | – | 40.5% |
| Model II | 0.27 | 0.01 | 1.74 ± 0.27 | 1.74 ± 0.27 | – | 31.5% | |||
| Model III | 0.26 | −0.17 | 1.67 ± 0.29 | 1.67 ± 0.29 | – | 38.0% | |||
| Model IV | 0.27 | −0.03 | 1.74 ± 0.26 | 1.74 ± 0.26 | – | 43.0% | |||
| Model V | 0.31 | −0.11 | 1.67 ± 0.29 | 1.67 ± 0.29 | – | 43.1% | |||
| H.1 | Model I | 0.68 | 0.01 | 3.02 ± 0.82 | 2.00 | 2.90 ± 0.74 | 2.00 | 46.5% | 32.4% |
| Model II | 0.47 | 0.00 | 2.13 ± 0.51 | 2.13 ± 0.51 | 8.5% | 29.3% | |||
| Model III | 0.41 | −0.09 | 2.25 ± 0.55 | 2.25 ± 0.54 | 17.8% | 29.5% | |||
| Model IV | 0.47 | −0.01 | 2.33 ± 0.51 | 2.32 ± 0.51 | 5.7% | 41.6% | |||
| Model V | 0.51 | −0.08 | 2.34 ± 0.52 | 2.32 ± 0.51 | 7.5% | 44.9% | |||
| H.2 | Model I | 0.40 | −0.02 | 2.21 ± 0.28 | 2.00 | 2.21 ± 0.28 | 2.00 | 43.0% | 95.0% |
| Model II | 0.38 | −0.02 | 2.13 ± 0.28 | 2.13 ± 0.28 | 35.2% | 93.9% | |||
| Model III | 0.24 | −0.12 | 2.06 ± 0.28 | 2.06 ± 0.28 | 38.6% | 94.2% | |||
| Model IV | 0.37 | −0.03 | 2.07 ± 0.25 | 2.07 ± 0.25 | 6.5% | 99.2% | |||
| Model V | 0.42 | −0.15 | 1.96 ± 0.26 | 1.96 ± 0.26 | 16.7% | 97.2% | |||
Results for the antidepressants network. All relative treatment effects are OR for response (95% credible intervals). Lower triangle: estimates from the usual NMA model I. OR > 1 favours the column‐defining treatment. Upper triangle: estimates from NMA model III. OR > 1 favours the row‐defining treatment
| Bupropion | 0.96[0.79; 1.18] | 1.05[0.84; 1.33] | 0.83[0.69; 1.00] | 1.05[0.90; 1.25] | 1.05[0.84; 1.35] | 1.02[0.79; 1.33] | 0.83[0.66; 1.02] | 1.04[0.87; 1.27] | 1.32[1.01; 1.79] | 0.88[0.74; 1.05] | 0.86[0.72; 1.01] |
| 0.98[0.79; 1.23] | Citalopram | 1.10[0.88; 1.37] | 0.86[0.73; 1.02] | 1.10[0.93; 1.28] | 1.10[0.88; 1.39] | 1.05[0.82; 1.37] | 0.85[0.69; 1.04] | 1.09[0.91; 1.28] | 1.37[1.08; 1.82] | 0.91[0.75; 1.09] | 0.89[0.74; 1.06] |
| 1.10[0.84; 1.43] | 1.11[0.87; 1.43] | Duloxetine | 0.79[0.65; 0.95] | 1.00[0.82; 1.22] | 1.00[0.78; 1.28] | 0.96[0.74; 1.27] | 0.78[0.61; 0.98] | 0.99[0.82; 1.19] | 1.25[0.95; 1.69] | 0.83[0.67; 1.03] | 0.81[0.65; 1.00] |
| 0.82[0.67; 1.00] | 0.83[0.70; 0.99] | 0.75[0.61; 0.92] | Escitalopram | 1.27[1.10; 1.47] | 1.27[1.02; 1.61] | 1.22[0.96; 1.59] | 0.99[0.81; 1.20] | 1.25[1.08; 1.47] | 1.59[1.20; 2.13] | 1.05[0.89; 1.25] | 1.03[0.88; 1.20] |
| 1.09[0.91; 1.30] | 1.10[0.93; 1.32] | 1.00[0.79; 1.23] | 1.33[1.12; 1.56] | Fluoxetine | 1.00[0.82; 1.23] | 0.96[0.77; 1.20] | 0.78[0.65; 0.93] | 0.99[0.87; 1.12] | 1.25[0.99; 1.61] | 0.83[0.72; 0.96] | 0.81[0.71; 0.93] |
| 1.11[0.83; 1.45] | 1.12[0.87; 1.45] | 1.01[0.75; 1.37] | 1.35[1.04; 1.75] | 1.02[0.81; 1.28] | Fluvoxamine | 0.96[0.73; 1.27] | 0.78[0.61; 0.97] | 0.99[0.80; 1.20] | 1.25[0.95; 1.69] | 0.83[0.66; 1.03] | 0.81[0.65; 1.00] |
| 1.06[0.77; 1.49] | 1.08[0.79; 1.52] | 0.97[0.69; 1.41] | 1.3[0.94; 1.82] | 0.98[0.74; 1.32] | 0.96[0.69; 1.37] | Milnacipran | 0.81[0.61; 1.05] | 1.03[0.81; 1.28] | 1.30[0.98; 1.79] | 0.86[0.67; 1.11] | 0.85[0.65; 1.08] |
| 0.79[0.62; 1.01] | 0.80[0.63; 1.01] | 0.72[0.54; 0.94] | 0.96[0.76; 1.20] | 0.72[0.6; 0.88] | 0.71[0.55; 0.93] | 0.74[0.53; 1.02] | Mirtazapine | 1.27[1.05; 1.54] | 1.61[1.19; 2.22] | 1.06[0.88; 1.30] | 1.04[0.87; 1.25] |
| 1.06[0.87; 1.30] | 1.09[0.90; 1.30] | 0.98[0.79; 1.20] | 1.30[1.10; 1.52] | 0.98[0.85; 1.12] | 0.96[0.76; 1.22] | 1.00[0.74; 1.33] | 1.35[1.11; 1.64] | Paroxetine | 1.27[1.00; 1.67] | 0.84[0.71; 0.99] | 0.82[0.70; 0.96] |
| 1.61[1.20; 2.17] | 1.64[1.25; 2.13] | 1.47[1.06; 2.04] | 1.96[1.49; 2.63] | 1.49[1.16; 1.89] | 1.45[1.05; 2.04] | 1.52[1.03; 2.17] | 2.04[1.52; 2.78] | 1.52[1.16; 2.00] | Reboxetine | 0.66[0.50; 0.87] | 0.65[0.49; 0.85] |
| 0.87[0.71; 1.05] | 0.88[0.72; 1.08] | 0.79[0.62; 1.01] | 1.06[0.88; 1.27] | 0.80[0.69; 0.93] | 0.79[0.61; 1.01] | 0.81[0.59; 1.10] | 1.10[0.88; 1.37] | 0.81[0.69; 0.96] | 0.53[0.41; 0.71] | Sertraline | 0.98[0.83; 1.15] |
| 0.85[0.7; 1.01] | 0.86[0.71; 1.04] | 0.78[0.61; 0.98] | 1.04[0.86; 1.23] | 0.78[0.68; 0.89] | 0.77[0.6; 0.98] | 0.80[0.58; 1.08] | 1.08[0.88; 1.32] | 0.79[0.68; 0.93] | 0.52[0.4; 0.69] | 0.98[0.83; 1.16] | Venlafaxine |
Ranking of antidepressants using SUCRA values
| Drug | Model I | Model III |
|---|---|---|
| Mirtazapine | 0.91 | 0.89 |
| Escitalopram | 0.87 | 0.88 |
| Venlafaxine | 0.82 | 0.82 |
| Sertaline | 0.77 | 0.77 |
| Citalopram | 0.52 | 0.54 |
| Bupropion | 0.48 | 0.43 |
| Milnacipran | 0.36 | 0.40 |
| Paroxetine | 0.35 | 0.32 |
| Fluvoxamine | 0.31 | 0.32 |
| Duloxetine | 0.31 | 0.31 |
| Fluoxetine | 0.30 | 0.30 |
| Reboxetine | 0.00 | 0.02 |
Results for the neuroleptic drugs network. All relative treatment effects are ORs for response (and 95% credible intervals). Lower triangle: estimates from the standard NMA model I. OR > 1 favours the column‐defining treatment. Upper triangle: estimates from model II. OR > 1 favours the row‐defining treatment
| Aripiprazole | 0.73[0.47; 1.09] | 1.18[0.86; 1.61] | 0.78[0.56; 1.05] | 2.13[1.59; 2.86] | 0.77[0.56; 1.04] |
| 0.63[0.40; 1.00] | Clozapine | 1.64[1.05; 2.44] | 1.06[0.78; 1.49] | 2.94[2.00; 4.35] | 1.05[0.76; 1.49] |
| 1.20[0.83; 1.69] | 1.89[1.27; 2.78] | Haloperidol | 0.65[0.51; 0.92] | 1.79[1.33; 2.50] | 0.64[0.49; 0.93] |
| 0.73[0.52; 1.02] | 1.15[0.80; 1.67] | 0.60[0.48; 0.79] | Olanzapine | 2.78[2.04; 3.70] | 0.99[0.79; 1.23] |
| 2.08[1.54; 2.86] | 3.33[2.17; 5.00] | 1.75[1.30; 2.38] | 2.86[2.17; 3.85] | Placebo | 0.36[0.27; 0.48] |
| 0.72[0.51; 1.00] | 1.14[0.78; 1.64] | 0.60[0.46; 0.79] | 0.99[0.78; 1.23] | 0.34[0.26; 0.45] | Risperidone |
Ranking of neuroleptic drugs using SUCRA values
| Drug | Model I | Model II |
|---|---|---|
| Clozapine | 0.90 | 0.84 |
| Risperidone | 0.76 | 0.77 |
| Olanzapine | 0.73 | 0.75 |
| Aripiprazole | 0.39 | 0.41 |
| Haloperidol | 0.23 | 0.23 |
| Placebo | 0.00 | 0.00 |