| Literature DB >> 28571585 |
Diogo Mendes1,2, Carlos Alves3,4, Francisco Batel-Marques3,4.
Abstract
BACKGROUND: The number needed to treat (NNT) is an absolute effect measure that has been used to assess beneficial and harmful effects of medical interventions. Several methods can be used to calculate NNTs, and they should be applied depending on the different study characteristics, such as the design and type of variable used to measure outcomes. Whether or not the most recommended methods have been applied to calculate NNTs in studies published in the medical literature is yet to be determined. The aim of this study is to assess whether the methods used to calculate NNTs in studies published in medical journals are in line with basic methodological recommendations.Entities:
Keywords: Case–control studies; Cohort studies; Data interpretation; Epidemiologic methods; Evidence-based medicine; Meta-analysis; Numbers needed to treat; Randomized controlled trial; Statistical
Mesh:
Year: 2017 PMID: 28571585 PMCID: PMC5455127 DOI: 10.1186/s12916-017-0875-8
Source DB: PubMed Journal: BMC Med ISSN: 1741-7015 Impact factor: 8.775
Fig. 1Flow of studies through the review process
Characteristics of the included studies and of the number needed to treat (NNT)
| Characteristics | Meta-analysis ( | RCT ( | Cohort ( | Nested case–control ( | Overall ( | |||||
|---|---|---|---|---|---|---|---|---|---|---|
| Journal | ||||||||||
|
| 9 | (39.1%) | 4 | (23.5%) | 2 | (22.2%) | 2 | (100.0%) | 17 | (33.3%) |
|
| 6 | (26.1%) | 7 | (41.2%) | 1 | (11.1%) | 0 | (0.0%) | 14 | (27.5%) |
|
| 2 | (8.7%) | 0 | (0.0%) | 2 | (22.2%) | 0 | (0.0%) | 4 | (7.8%) |
| Other | 6 | (26.1%) | 6 | (35.3%) | 4 | (44.4%) | 0 | (0.0%) | 16 | (31.4%) |
| Country | ||||||||||
| USA | 13 | (56.5%) | 2 | (11.8%) | 6 | (66.7%) | 0 | (0.0%) | 21 | (41.2%) |
| UK | 4 | (17.4%) | 2 | (11.8%) | 0 | (0.0%) | 0 | (0.0%) | 6 | (11.8%) |
| Canada | 1 | (4.3%) | 2 | (11.8%) | 1 | (11.1%) | 2 | (100.0%) | 6 | (11.8%) |
| Other | 5 | (21.7%) | 11 | (64.7%) | 2 | (22.2%) | 0 | (0.0%) | 18 | (35.3%) |
| Disease/condition | ||||||||||
| Infections and infestations | 4 | (17.4%) | 2 | (11.8%) | 1 | (11.1%) | 0 | (0.0%) | 7 | (13.7%) |
| Cardiac disorders | 3 | (13.0%) | 3 | (17.6%) | 1 | (11.1%) | 0 | (0.0%) | 7 | (13.7%) |
| Psychiatric disorders | 4 | (17.4%) | 3 | (17.6%) | 0 | (0.0%) | 0 | (0.0%) | 7 | (13.7%) |
| Other | 12 | (52.2%) | 9 | (52.9%) | 7 | (77.8%) | 2 | (100.0%) | 30 | (58.8%) |
| Primary outcome of study | ||||||||||
| Efficacy | 12 | (52.2%) | 16 | (94.1%) | 2 | (22.2%) | 0 | (0.0%) | 30 | (58.8%) |
| Safety | 2 | (8.7%) | 1 | (5.9%) | 6 | (66.7%) | 2 | (100.0%) | 11 | (21.6%) |
| Efficacy and safety | 9 | (39.1%) | 0 | (0.0%) | 1 | (11.1%) | 0 | (0.0%) | 10 | (19.6%) |
| Type of variable (primary outcome) | ||||||||||
| Binary | 22 | (95.7%)a | 13 | (76.5%) | 5 | (55.6%) | 1 | (50.0%) | 41 | (80.4%) |
| Time to event | 1 | (4.3%) | 4 | (23.5%) | 4 | (44.4%) | 1 | (50.0%) | 10 | (19.6%) |
| Relative effect measure | ||||||||||
| Yes | ||||||||||
| Relative risk | 11 | (47.8%)b | 5 | (29.4%) | 2 | (22.2%) | 0 | (0.0%) | 18 | (35.3%)a |
| Odds ratio | 9 | (39.1%)b | 4 | (23.5%) | 2 | (22.2%) | 1 | (50.0%) | 16 | (31.4%)a |
| Hazard ratio | 1 | (4.3%) | 3 | (17.6%) | 3 | (33.3%) | 0 | (0.0%) | 7 | (13.7%) |
| Rate ratio | 0 | (0.0%) | 0 | (0.0%) | 1 | (11.1%) | 1 | (50.0%) | 2 | (3.9%) |
| No | 3 | (13.0%) | 5 | (29.4%) | 1 | (11.1%) | 0 | (0.0%) | 9 | (17.6%) |
| Outcome expressed with NNT | ||||||||||
| Primary outcome | 6 | (26.1%) | 14 | (82.4%) | 7 | (77.8%) | 1 | (50.0%) | 28 | (54.9%) |
| Secondary outcome | 0 | (0.0%) | 2 | (11.8%) | 0 | (0.0%) | 0 | (0.0%) | 2 | (3.9%) |
| Primary and secondary outcomes | 17 | (73.9%) | 1 | (5.9%) | 2 | (22.2%) | 1 | (50.0%) | 21 | (41.2%) |
| NNT for benefit or harm? | ||||||||||
| Benefit | 8 | (34.8%) | 15 | (88.2%) | 3 | (33.3%) | 0 | (0.0%) | 26 | (51.0%) |
| Harm | 2 | (8.7%) | 1 | (5.9%) | 6 | (66.7%) | 2 | (100.0%) | 11 | (21.6%) |
| Benefit and harm | 13 | (56.5%) | 1 | (5.9%) | 0 | (0.0%) | 0 | (0.0%) | 14 | (27.5%) |
| Type of NNT calculated in the study | ||||||||||
| Person-based NNT | 21 | (91.3%)a | 13 | (76.5%) | 5 | (55.6%) | 1 | (50.0%) | 40 | (78.4%) |
| Person-time-based NNT | 2 | (8.7%) | 4 | (23.5%) | 4 | (44.4%) | 1 | (50.0%) | 10 | (21.6%) |
| Completeness of NNT estimate | ||||||||||
| Control event rate | ||||||||||
| Yes | 13 | (56.5%) | 17 | (100.0%) | 6 | (66.7%) | 1 | (50.0%) | 37 | (72.5%) |
| No | 10 | (43.5%) | 0 | (0.0%) | 3 | (33.3%) | 1 | (50.0%) | 14 | (27.5%) |
| Time horizon | ||||||||||
| Yes | 10 | (43.5%) | 17 | (100.0%) | 9 | (100.0%) | 2 | (100.0%) | 37 | (72.5%) |
| No | 13 | (56.5%) | 0 | (0.0%) | 0 | (0.0%) | 0 | (0.0%) | 14 | (27.5%) |
| Confidence intervals | ||||||||||
| Yes | 16 | (65.2%)c | 8 | (47.1%) | 8 | (88.9%) | 1 | (50.0%) | 32 | (62.7%) |
| No | 8 | (34.8%) | 9 | (52.9%) | 1 | (11.1%) | 1 | (50.0%) | 19 | (37.3%) |
aThe variable for the primary outcome of one meta-analysis is binary, and pooled OR (95% CI) was calculated. However, a person-time-based NNT was calculated by taking the reciprocal of RD between pooled event rates per 1000 patient-years (Preiss 2011)
bOne study reported relative risk (RR) and odds ratio (OR) (Maher et al. 2011)
cConfidence interval was provided with NNT only for the primary outcome in a study reporting NNT for several outcomes (Green et al. 2007)
Assessment of methodology used to calculate number needed to treat (NNT) in included studies
| Meta-analysis ( | RCT ( | Cohort ( | Nested case–control ( | Overall ( | ||||||
|---|---|---|---|---|---|---|---|---|---|---|
| Methodology used to calculate NNT is defined in the methods section of the study | ||||||||||
| Yes | 19 | 82.6%) | 0 | (0.0%) | 7 | (77.8%) | 2 | (100.0%) | 28 | (54.9%) |
| No | 4 | 17.4%) | 17 | (100.0%) | 2 | (22.2%) | 0 | (0.0%) | 23 | (45.1%) |
| General characteristics of the methodology used to calculate NNT in the study | ||||||||||
| Reciprocal of risk difference | ||||||||||
| Simple proportions | 1 | (4.3%) | 14 | (82.4%) | 2 | (22.2%) | 0 | (0.0%) | 17 | (33.3%) |
| Cumulative IR | 0 | (0.0%) | 3 | (17.6%) | 3 | (33.3%) | 0 | (0.0%) | 6 | (11.8%) |
| Pooled RD | 12 | (52.2%) | 0 | (0.0%) | 0 | (0.0%) | 0 | (0.0%) | 12 | (23.15) |
| Average RD | 0 | (0.0%) | 0 | (0.0%) | 4 | (44.4%) | 0 | (0.0%) | 4 | (7.8%) |
| Relative effect measure | 10 | (43.5%) | 0 | (0.0%) | 0 | (0.0%) | 2 | (100.0%) | 12 | (23.5%) |
| Methodology used to calculate NNT is in line with basic recommendations (overall) | ||||||||||
| Yes | 10 | (43.5%) | 16 | (94.1%) | 8 | (88.9%) | 2 | (100.0%) | 37 | (70.6%) |
| No | 13 | (56.5%) | 1 | (5.9%) | 1 | (11.1%) | 0 | (0.0%) | 15 | (29.4%) |
| Methodology used to calculate NNT is in line with basic recommendations (detailed) | ||||||||||
| Binary variables | ||||||||||
| Yes | 9 | (39.1%) | 13 | (76.5%) | 5 | (55.6%) | 1 | (50.0%) | 28 | (54.9%) |
| Reciprocal of risk difference | ||||||||||
| Simple proportions | 0 | (0.0%) | 13 | (76.5%) | 1 | (11.1%) | 0 | (0.0%) | 14 | (27.5%) |
| Cumulative IR | 0 | (0.0%) | 0 | (0.0%) | 0 | (0.0%) | 0 | (0.0%) | 0 | (0.0%) |
| Pooled RD | 0 | (0.0%) | 0 | (0.0%) | 0 | (0.0%) | 0 | (0.0%) | 0 | (0.0%) |
| Average RD | 0 | (0.0%) | 0 | (0.0%) | 4 | (44.4%) | 0 | (0.0%) | 4 | (7.8%) |
| Relative effect measure | 9 | (39.1%) | 0 | (0.0%) | 0 | (0.0%) | 1 | (50.0%) | 10 | (19.6%) |
| No | 13 | (56.5%) | 0 | (0.0%) | 0 | (0.0%) | 0 | (0.0%) | 13 | (25.5%) |
| Reciprocal of risk difference | ||||||||||
| Simple proportions | 1 | (4.3%) | 0 | (0.0%) | 0 | (0.0%) | 0 | (0.0%) | 1 | (2.0%) |
| Cumulative IR | 0 | (0.0%) | 0 | (0.0%) | 0 | (0.0%) | 0 | (0.0%) | 0 | (0.0%) |
| Pooled RD | 12 | (52.2%) | 0 | (0.0%) | 0 | (0.0%) | 0 | (0.0%) | 12 | (23.5%) |
| Average RD | 0 | (0.0%) | 0 | (0.0%) | 0 | (0.0%) | 0 | (0.0%) | 0 | (0.0%) |
| Relative effect measure | 0 | (0.0%) | 0 | (0.0%) | 0 | (0.0%) | 0 | (0.0%) | 0 | (0.0%) |
| Time-to-event variables | ||||||||||
| Yes | 1 | (4.3%) | 3 | (17.6%) | 3 | (33.3%) | 1 | (50.0%) | 8 | (15.7%) |
| Reciprocal of risk difference | ||||||||||
| Simple proportions | 0 | (0.0%) | 0 | (0.0%) | 0 | (0.0%) | 0 | (0.0%) | 0 | (0.0%) |
| Cumulative IR | 0 | (0.0%) | 3 | (17.6%) | 3 | (33.3%) | 0 | (0.0%) | 6 | (11.8%) |
| Pooled RD | 0 | (0.0%) | 0 | (0.0%) | 0 | (0.0%) | 0 | (0.0%) | 0 | (0.0%) |
| Average RD | 0 | (0.0%) | 0 | (0.0%) | 0 | (0.0%) | 0 | (0.0%) | 0 | (0.0%) |
| Relative effect measure | 1 | (4.3%) | 0 | (0.0%) | 0 | (0.0%) | 1 | (50.0%) | 2 | (3.9%) |
| No | 0 | (0.0%) | 1 | (5.9%) | 1 | (11.1%) | 0 | (0.0%) | 2 | (3.9%) |
| Reciprocal of risk difference | ||||||||||
| Simple proportions | 0 | (0.0%) | 1 | (5.9%) | 1 | (11.1%) | 0 | (0.0%) | 2 | (3.9%) |
| Cumulative IR | 0 | (0.0%) | 0 | (0.0%) | 0 | (0.0%) | 0 | (0.0%) | 0 | (0.0%) |
| Pooled RD | 0 | (0.0%) | 0 | (0.0%) | 0 | (0.0%) | 0 | (0.0%) | 0 | (0.0%) |
| Average RD | 0 | (0.0%) | 0 | (0.0%) | 0 | (0.0%) | 0 | (0.0%) | 0 | (0.0%) |
| Relative effect measure | 0 | (0.0%) | 0 | (0.0%) | 0 | (0.0%) | 0 | (0.0%) | 0 | (0.0%) |
IR incidence rate, RCT randomized controlled trial, RD risk difference
Characteristics of the included studies in which basic recommendations were not followed to calculate the number needed to treat (NNT)
| Study | Variable | Baseline risk | Time horizon | Confidence interval | Methodology used to compute NNT defined in methods section | Method used to compute NNT | Source of data used to compute NNT | Comments |
|---|---|---|---|---|---|---|---|---|
| Systematic review and meta-analysis | ||||||||
| Jonas 2014 | Binary | No | No | Yes | Yes | NNT = 1/RD | Pooled RD | A pooled RD was calculated for two outcomes. Duration of included trials ranged from 12 to 52 weeks for the outcome any drinking, and from 12 to 24 weeks for heaving drinking |
| Hempel 2012 | Binary | No | No | Yes | Yes | NNT = 1/RD | Pooled RD | The pooled RD (obtained from meta-analysis) led to a loss of follow-up time. Most trials either did not specify the follow-up period, or the assessment was explicitly limited to the time of antibiotics treatment |
| Leucht 2012 | Binary | Yes | Yes | Yes | Yes | NNT = 1/RD | Pooled RD | The outcome is assessed between 7 and 12 months of follow-up; a mean study duration is indicated for each outcome with NNT calculated from absolute RD pooled from the meta-analysis |
| Shah 2012 | Binary | No | No | Yes | Yes | NNT = 1/RD | Pooled RD | The study comprehends the calculation and comparison of NNT for several treatments. However, NNTs are not comparable because they were calculated from pooled RDs and times of follow-up vary considerably across studies included in the meta-analysis (10 days to 48 weeks) |
| Preiss 2011 | Binary | Yes | Yes | No | No | NNT = 1/RD | Pooled RD | The variable for the primary outcome of the study is binary, and pooled OR (95% CI) was calculated. However, NNT was calculated by taking the reciprocal of RD between pooled event rates per 1000 patient-years. Person-time-based NNT was presented and interpreted as the number of persons needed to treat over 1 year |
| Shamliyan 2011 | Binary | Yes | No | Yes | Yes | NNT = 1/RD | Pooled RD | Several antiviral treatments were compared based on estimates of NNT. However, studies with different times of follow-up for antiviral treatments were used to pool absolute RD. The time horizon factor is lost |
| Coker 2010 | Binary | Yes | Yes | Yes | No | NNT = 1/RD | Pooled RD | The pooled RD was obtained for a 14 day follow-up duration in all studies included in the meta-analysis. However, RD varies considerably across the studies included in the meta-analysis (ranging from −8% to 27%) |
| Testa 2008 | Binary | No | No | Yes | Yes | NNT = 1/RD | Pooled RD | Pooled RD was used to calculate NNT. The follow-up of included studies ranged from ”in hospital” to 6 months |
| Bridge 2007 | Binary | Yes | No | Yes | Yes | NNT = 1/RD | Pooled RD | DerSimonian and Laird random-effects model was used to obtain a pooled estimate of the RD (95% CI). NNT was calculated as the reciprocal of RD. The duration of follow-up and the baseline risk varied considerably across included studies |
| Dentali 2007 | Binary | Yes | No | No | Yes | NNT = 1/RD | Simple proportions | Raw totals of patients from each study were added together to estimate proportions and calculate RD, i.e., treating data as if all were from one study (Simpson’s paradox). Further, the baseline risk ranged considerably across included studies (e.g., 0.2–4.0% for pulmonary embolism) |
| Rovers 2006 | Binary | Yes | Yes | No | No | NNT = 1/RD | Pooled RD | Although it is not clearly stated in the methods section, the discussion of the study suggests that the authors calculated pooled RD by means of the meta-analysis |
| Bongartz 2006 | Binary | No | Yes | Yes | Yes | NNT = 1/RD | Pooled RD | NNT calculated for treatment periods of 6–12 months and 3–12 months, using Mantel-Haenszel fixed estimate of absolute RD in cases in which an OR of at least 1.5 was detected |
| Spiegel 2006 | Binary | No | No | No | Yes | NNT = 1/RD | Pooled RD | A pooled RD was calculated for two comparisons. Duration of included trials ranged from 6 to 78 weeks for one comparison and from 12 to 24 weeks for another comparison |
| Randomized controlled trial | ||||||||
| Shepherd 2008 | Time to event | Yes | Yes | No | No | NNT = 1/RD | Simple proportions | NNT calculated as 1/RD using final rates of event and citing a median time of follow-up of 4.8 years (NNT = 14 in patients with diabetes and chronic kidney disease). However, a Kaplan-Meier curve is provided in the study, which should have been used (since the median follow-up is lower than the 5-years objective, at least some patients did not complete the follow-up). From the Kaplan-Meier curve, we would have 20.3% and 14.0% patients with the outcome in the atorvastatin 10 mg and 80 mg/day, respectively, at 4.8 years of follow-up and an NNT = 15.8 |
| Retrospective cohort study | ||||||||
| Graham 2010 | Time to event | Yes | Yes | Yes | Yes | NNT = 1/RD | Simple proportions | NNT was calculated using RD between unadjusted incidence rates. Adjusted incidence rates from the Kaplan-Meier curves should have been used. For example, at 1 year of follow-up, NNT for the composite endpoint would be 92 from Kaplan-Meier curves, rather than 60 person-years from unadjusted incidence rates. The authors interpreted person-years as number of persons treated over 1 year, which is not exactly the same |
NNT number needed to treat, OR odds ratio, RD risk difference