| Literature DB >> 24959719 |
Kerry Dwan1, Douglas G Altman2, Mike Clarke3, Carrol Gamble1, Julian P T Higgins4, Jonathan A C Sterne5, Paula R Williamson1, Jamie J Kirkham1.
Abstract
BACKGROUND: Most publications about selective reporting in clinical trials have focussed on outcomes. However, selective reporting of analyses for a given outcome may also affect the validity of findings. If analyses are selected on the basis of the results, reporting bias may occur. The aims of this study were to review and summarise the evidence from empirical cohort studies that assessed discrepant or selective reporting of analyses in randomised controlled trials (RCTs). METHODS ANDEntities:
Mesh:
Year: 2014 PMID: 24959719 PMCID: PMC4068996 DOI: 10.1371/journal.pmed.1001666
Source DB: PubMed Journal: PLoS Med ISSN: 1549-1277 Impact factor: 11.069
Figure 1PRISMA flow diagram.
Characteristics of included studies comparing protocols or trial registry entries to full publications.
| Study | Objective | Cohort of Studies | Dates | Information Sources Compared | Included Study Designs and Number of Studies/Total Number of Studies | Funding Source for All Studies | Conclusions |
|
| “To examine whether selective reporting of outcomes and subgroup analyses was present” | Trial protocols that had been peer reviewed and accepted for publication in | Trial reports published by June 2007 | Protocols to publications | 71/75 RCTs; author obtained permission to use 64; 37 trials that had been published (50 reports) were assessed | Not stated | “The solution to the problem of selective reporting requires further discussion, the current system is clearly inadequate.” |
|
| “To evaluate how often sample size calculations and methods of statistical analysis are pre-specified or changed in randomised trials” | Protocols and journal publications of published randomised parallel group trials approved by the scientific ethics committees of Copenhagen and Frederiksberg, Denmark | Trials approved 1994–1995 | Protocols to publications | 70 RCTs | 64% (45/70) industry, 16% (11/70) industry and non-industry, 14.3% (10/70) non-industry, 4.3% (3/70) none, 1.4% (1/70) unclear | “When reported in publications, sample size calculations and statistical methods were often explicitly discrepant with the protocol or not pre-specified. Such amendments were rarely acknowledged in the trial publication. The reliability of trial reports cannot be assessed without having access to the full protocols.” |
|
| “To compare the outcomes, analysis and sample size proposed in the original approved study protocol with the results presented in the subsequent study report” | Approved protocols from a single local research ethics committee | Protocols approved in 1994 | Protocols to publications | 15/37 publications obtained; 2/15 RCTs | Not stated | “Our pilot study has shown that within-study selective reporting may be examined qualitatively by comparing the study report with the study protocol. Our results suggest that it [selective reporting] might well be substantial; however, the bias can only be broadly identified as protocols are not sufficiently precise.” |
|
| “To critically review the use of two types of secondary analyses, covariate adjustment and subgroup analysis, which are common in traumatic brain injury trials” | Published RCTs of TBI with >50 participants in each arm | Trial reports published 1966–2004 | Protocols to publications for 6/18 RCTs; otherwise | 18 RCTs | Not stated | “The reported covariate adjustment and subgroup analyses from TBI trials had several methodological shortcomings. Appropriate performance and reporting of covariate adjustment and subgroup analysis should be considerably improved in future TBI trials because interpretation of treatment benefits may be misleading otherwise.” |
|
| “To compare subgroup analyses as outlined in grant applications and their related publications” | Grants awarded by the Netherlands Organisation for Health Research and Development from 2001 onward that were finalised before March 1, 2010 | Grants awarded 2001 to March 1, 2010 | Grant proposals to publications and published protocols | 47/79 RCTs, 13/79 cohort, 10/79 modelling, 9/79 other | 100% Netherlands Organisation for Health Research and Development | “There is a large discrepancy between grant applications and final publications regarding subgroup analyses. Both non-reporting pre-specified subgroup analyses and reporting post-hoc subgroup analyses are common. More guidance is clearly needed.” |
|
| “To determine whether poor reporting of methods in randomised controlled trials reflects on poor methods” | RCTs conducted by the Radiation Therapy Oncology Group since its establishment in 1968 | Terminated RCTs since 1968 | Protocols to publication | 59 RCTs | Not stated | “The reporting of methodological aspects of randomised controlled trials does not necessarily reflect the conduct of the trial. Reviewing research protocols and contacting trialists for more information may improve quality assessment.” |
|
| “To evaluate discrepancies between trial registry entries and final reports of randomized controlled trials published in major general surgical journals” | RCTs published during 2010 in the | Trial reports published in 2010 | Trial registry to publication | 51 RCTs | 37% (19/51) industry, 22% (11/51) no information, 41% (21/51) non-industry | “When interpreting the results of surgical RCTs, the possibility of selective reporting, and thus outcome reporting bias, has to be kept in mind. For future trials, prospective registration should be strictly respected with the ultimate goal to increase transparency and contribute to high-level evidence reports for optimal patient care in surgery.” |
|
| “To assess adjustment practices for primary outcomes of randomized controlled trials and their impact on the results” | RCTs that reported primary outcome, published in print in 2009 in the 25 biomedical journals with the highest impact factor according to | Trial reports published in 2009 | Trial registry, protocol, and design papers to publication | 199 RCTs | Not stated | “There is large diversity on whether and how analyses of primary outcomes are adjusted in randomized controlled trials and these choices can sometimes change the nominal significance of the results. Registered protocols should explicitly specify adjustments plans for main outcomes and analysis should follow these plans.” |
|
| “To compare the timing and completeness of results publicly posted at ClinicalTrials.gov and in published articles for trials of drug interventions” | RCTs of drugs with results posted on ClinicalTrials.gov and published | Results published on ClinicalTrials.gov by March 2012 | Trial registry (results database) to publication | 600 RCTs with results; 297 RCTs randomly sampled; 202 RCTs with posted and published results | 85% (173/202) industry, 10% (20/202) academic, 5% (9/202) academic and industry | “The results highlight the need to assess trial results systematically from both ClinicalTrials.gov and the published article when available. Based on our results, searching Clinical Trials.gov is necessary for all published and unpublished trials to obtain more complete data and to identify inconsistencies or discrepancies between the publicly posted results and the publication.” |
TBI, traumatic brain injury.
Characteristics of included studies comparing other documentation to full publications.
| Study | Objective | Cohort of Studies | Dates | Information Sources Compared | Included Study Designs and Number of Studies/Total Number of Studies | Funding Source for All Studies | Conclusions |
|
| “To investigate the relative impact on publication bias caused by multiple publication, selective publication, and selective reporting (ITT versus PP) in studies sponsored by pharmaceutical companies” | Placebo-controlled studies of five selective serotonin reuptake inhibitors submitted to the Swedish drug regulatory authority as a basis for marketing approval for treating major depression | Submitted between 1989 and 1994 | Study reports in marketing approval compared to publications | 42 RCTs | 100% industry | “The degree of multiple publication, selective publication, and selective reporting differed between products. Thus, any attempt to recommend a specific selective serotonin reuptake inhibitor from the publicly available data only is likely to be based on biased evidence.” |
|
| “To determine the publication rate of efficacy trials submitted to the Food and Drug Administration (FDA) in approved New Drug Applications (NDAs) and to compare the trial characteristics as reported by the FDA with those reported in publications” | All efficacy trials found in approved NDAs for new molecular entities and all published clinical trials corresponding to the trials within the NDAs | From 2001 to 2002 | NDA to publication | 164 efficacy trials found in 33 NDAs | 77% (97/126) at least in part by industry | “Many trials were still not published 5 years after FDA approval. Discrepancies between the trial information reviewed by the FDA and information found in published trials tended to lead to more favourable presentations of the NDA drugs in the publications. Thus, the information that is readily available in the scientific literature to health care professionals is incomplete and potentially biased.” |
|
| “To determine how accurately the published literature conveys data on drug efficacy to the medical community by comparing drug efficacy inferred from the published literature with drug efficacy according to FDA reviews” | Phase 2 and 3 clinical trial programs for 12 antidepressant agents approved by the FDA | Approved by the FDA between 1987 and 2004 | FDA review to publication | 74 FDA-registered studies | 100% industry | “We cannot determine whether the bias observed resulted from a failure to submit manuscripts on the part of authors and sponsors, from decisions by journal editors and reviewers not to publish, or both. Selective reporting of clinical trial results may have adverse consequences for researchers, study participants, healthcare professionals, and patients.” |
|
| “To compare the description of types of analyses and criteria for including participants in the publication (i.e., what was reported) with descriptions in the corresponding internal company documents (i.e., what was planned and what was done)” | Trials of gabapentin funded by Pfizer and Warner-Lambert's subsidiary Parke-Davis for off-label indications | Published 1987–2008 | Internal company documents (statistical analysis plan, protocol, and research report) to trial report | 11/21 published studies | 100% industry | “Descriptions of analyses conducted did not agree between internal company documents and what was publicly reported. Internal company documents provide extensive documentation of methods planned and used, and trial findings, and should be publicly accessible. Reporting standards for randomized controlled trials should recommend transparent descriptions and definitions of analyses performed and which study participants are excluded.” |
FDA, US Food and Drug Administration; ITT, intention to treat; NDA, new drug application; PP, per protocol.
Methodological quality.
| Study | Independent Data Extraction by Two People | Positive and Negative Findings Clearly Defined | Selective Reporting | |||
|
| Uncertain quality | Not stated | High quality | Statistically significant/non-significant | High quality | All results reported on stated objectives |
|
| Uncertain quality | Not stated | NA | Not considered | High quality | All results reported on stated objectives |
|
| High quality | Two reviewers used electronic forms to independently extract data; resolved disagreements by discussion | NA | Not considered for sample size or statistical analysis although considered when looking at outcomes | High quality | All results reported on stated objectives |
|
| High quality | Two independent observers extracted the data using a standardised data sheet, and two other observers, blinded to the results, selected the most important component | NA | Not considered | High quality | All results reported on stated objectives |
|
| High quality | One author extracted all data, and additional coders were trained to double-code portions of the data determined to be potentially subjective in both the FDA reviews and the publications; disagreements were resolved by consensus | High quality | Favourable: statistically significant in favour of the drug; not favourable: not statistically significant or significant in favour of the comparator | High quality | All results reported on stated objectives |
|
| High quality | Double data extraction was used; any discrepancies were resolved by consensus | High quality | Positive: statistical superiority ( | High quality | All results reported on stated objectives |
|
| Methodological concerns | Two authors extracted trial characteristics, but only one extracted outcome data | High quality | Statistically significant/not significant | High quality | All results reported on stated objectives |
|
| High quality | Teams of two reviewers independently abstracted data using standardised pilot-tested extraction forms; discrepancies resolved by consensus | High quality | Statistically significant/not significant—assessed for primary outcome and for association with reporting of subgroup analyses | High quality | All results reported on stated objectives |
|
| High quality | One author extracted data and another verified, with a third person verifying discordant items | High quality |
| High quality | All results reported on stated objectives |
|
| Methodological concerns | All three authors assessed three studies to check for consistency, and then the first author assessed the other studies | NA | NA | High quality | All results reported on stated objectives |
|
| Uncertain quality | Not stated | High quality | Significant: finding a nominally significant difference among treatment groups in a superiority trial or declaring non-inferiority/equivalence in a non-inferiority/equivalence trial | High quality | All results reported on stated objectives |
|
| High quality | Data extraction completed independently by two authors | High quality | Referred to as negative/positive; negative defined as non-significant overall result | Methodological concerns | Not all secondary analyses reported—only that a difference was observed (between positive and negative trials) |
|
| High quality | Data extraction appears to have been completed independently by two authors | NA | NA | High quality | All results reported on stated objectives |
|
| High quality | Data extraction was completed independently by three authors | NA | NA | High quality | All results reported on stated objectives |
|
| Methodological concerns | Data extraction performed by one author | Methodological concerns | Positive and negative used but not defined | High quality | All results reported on stated objectives |
|
| Methodological concerns | Data extraction performed by one author | NA | NA | High quality | All results reported on stated objectives |
|
| Uncertain quality | Not stated | High quality | A positive outcome means that the treatment was beneficial and the treatment effect achieved nominal statistical significance; a negative/equivocal outcome means that a significant benefit was not demonstrated | High quality | All results reported on stated objectives |
|
| Uncertain quality | Not stated | High quality | A subgroup analysis was defined as significant when the researchers reported a significant effect by either (1) providing a significant | High quality | All results reported on stated objectives |
|
| Uncertain quality | Not stated | NA | NA | High quality | All results reported on stated objectives |
|
| Methodological concerns | An internal pilot study was conducted to test the data extraction spreadsheet for feasibility, completeness, and accuracy, and as a quality control check, data entries concerning these RCTs were cross-checked by the second reviewer | NA | NA | High quality | All results reported on stated objectives |
|
| High quality | Two researchers independently extracted data and discussed discrepancies to reach a consensus, otherwise the senior investigator was consulted | High quality | Nominal statistical significance, based on 95% confidence intervals being entirely on one side of the null, | High quality | All results reported on stated objectives |
|
| High quality | All data were extracted in duplicate by two eviewers in data collection forms; all disagreements were resolved by discussion to reach a consensus, including intervention of a third reviewer in case of discrepancies | NA | NA | High quality | All results reported on stated objectives |
FDA, US Food and Drug Administration; NA, not applicable.
Statistical analyses.
| Study (Maximum Number of RCTs) | Quality | Results | Percent of RCTs (Number) |
|
| H–H |
| 1% (1/70) |
|
| 64% (25/39) | ||
|
| 7% (3/42) | ||
|
| 62% (8/13) | ||
|
| 29% (2/7) | ||
|
| X–H |
| 88% (7/8) |
|
| HHH |
| 22% (11/51) |
|
| 16% (4/51) | ||
|
| ?HH |
| 92% (23/25) |
|
| HHH |
| 67% (2/3) |
|
| HHH |
| 19% (24/126) |
|
| 18% (23/126) | ||
|
| ?–H |
| 17% (8/48) |
|
| X–H |
| 96% (49/51) |
|
| 10% (5/51) | ||
|
| 14% (7/51) |
Based on summary assessments for three domains from Table 4: H = high quality; X = methodological concerns; ? = uncertain quality; – = not applicable.
FDA, US Food and Drug Administration; ITT, intention to treat; NDA, new drug application; PP, per protocol.
Composite outcomes.
| Study (Maximum Number of RCTs) | Quality | Results | Percent of RCTs (Number) |
|
| H–H |
| 60% (24/40) |
|
| 33% (13/40) | ||
|
| 75% (3/4) | ||
|
| 69% (11/16) |
Based on summary assessments for three domains from Table 4: H = high quality; – = not applicable.
Handling missing data.
| Study (Maximum Number of RCTs) | Quality | Results | Percent of RCTs (Number) |
|
| H–H |
| 80% (39/49) |
|
| 78% (38/49) | ||
|
| HHH |
| 8% (10/126) |
|
| 9% (11/126) | ||
|
| 12% (15/126) | ||
|
| 13% (16/126) | ||
|
| HHH |
| 12% (6/51) |
Based on summary assessments for three domains from Table 4: H = high quality; – = not applicable.
FDA, US Food and Drug Administration; LOCF, last observation carried forward; NDA, new drug application.
Unadjusted versus adjusted analyses.
| Study (Maximum Number of RCTs) | Quality | Results | Percent of RCTs (Number) |
|
| H–H |
| 82% (23/28) |
|
| 67% (12/18) | ||
|
| 67% (12/18) | ||
|
| XHH |
| 46% (36/79) |
|
| 6% (2/36) | ||
|
| 30% (24/79) | ||
|
| 17% (19/109) | ||
|
| 16% (3/19) | ||
|
| HHH |
| 47% (28/60) |
|
| 75% (21/28) | ||
|
| 25% (7/28) |
Based on summary assessments for three domains from Table 4: H = high quality; X = methodological concerns; – = not applicable.
Continuous/binary data.
| Study (Maximum Number of RCTs) | Quality | Results | Percent of RCTs (Number) |
|
| HHH |
| 1% (1/74) |
|
| X–H |
| 29% (12/42) |
|
| 3% (1/32) | ||
|
| 41% (13/32) | ||
|
| H–H |
| 20% (9/45) |
Based on summary assessments for three domains from Table 4: H = high quality; X = methodological concerns; – = not applicable.
Subgroup analyses.
| Study (Maximum Number of RCTs) | Quality | Results | Percent of RCTs (Number) |
|
| H–H | Discrepancy between protocol and publication (25 trials described subgroup analyses in the protocol or publication) | 100% (25/25) |
| Discrepancy between protocol and publication (pre-specified in protocol and only some or none reported in publication) (13 trials described subgroup analyses in the protocol) | 92% (12/13) | ||
| Discrepancy (information absent) between protocol and publication (included in the publication, but at least one not pre-specified in the protocol) (20 trials described subgroup analyses in the publication) | 95% (19/20) | ||
|
| ?–H | Publication includes at least one unreported or new subgroup analysis not mentioned in the protocol (18 trials pre-specified subgroup analyses in the protocol, 28 trials included subgroup analyses in the publication) | 61% (11/18) |
| Reason for subgroup selection reported in the protocol | 3% (1/37) | ||
| Subgroup analyses reported in the publication but not pre-specified in the protocol (information absent) | 58% (11/19) | ||
| Reason for these subgroup analyses reported in the publication | 0% (0/11) | ||
|
| HHH | Not pre-specified (information absent) (64 trials claimed a subgroup effect for the primary outcome) | 59% (38/64) |
| At least one subgroup analysis not pre-specified | 65% (135/207) | ||
|
| ?HH | Number of subgroup analyses undertaken was unclear | 15% (9/59) |
| Unclear whether any of the subgroup analyses were pre-specified or post hoc | 68% (40/59) | ||
| Unclear whether some subgroup analyses were pre-specified | 5% (3/59) | ||
|
| HHX | Included pre-specified and non-pre-specified subgroups in the publication | 10% (4/39) |
| Subgroup analyses reported in the publication without a rationale (39 trials included subgroup analyses in the publication) | 54% (21/39) | ||
|
| H–H | Subgroup analyses not pre-specified (information absent) (54 subgroup analyses in 27 trials) | 91% (49/54) |
|
| ?HH | Subgroup analyses post hoc | 14% (8/58) |
| partially pre-specified and partially post hoc | 10% (6/58) | ||
| Subgroup analyses failed to indicate whether they were pre-specified or not (58 trials reported subgroup analyses) | 35% (20/58) | ||
|
| XXH | Pre-specified subgroups not reported in the publication (information absent) | 27% (3/11) |
| Partially pre-specified subgroups were reported in the publication (11 trials reported subgroup analyses) | 46% (5/11) | ||
| Discrepancy (information absent) between protocol and publication (subgroup analyses not pre-specified in the protocol but reported in publication) | 50% (3/6) | ||
| Discrepancy between publication and protocol (subgroup analyses pre-specified in the protocol but not reported in publication as planned) (protocols were available for six trials) | 33% (2/6) | ||
|
| X–H | Subgroup analyses unclear | 6% (1/17) |
| Subgroup analyses omitted from the publication (information absent) (17 trials had at least one subgroup analysis) | 53% (9/17) | ||
| Subgroups defined after randomisation (14 trials reported when subgroups were defined in the publication) | 82% (4/14) | ||
|
| ?HH | Discrepancy between the grant proposal and publication (47 were RCTs only) | 77% (36/47) |
| Discrepancy between the grant proposal and publication | 75% (59/79) | ||
| Discrepancy in pre-specified subgroup analyses between the grant proposal and publication | 90% (44/49) | ||
| Discrepancy (information absent) in pre-specified subgroup analyses between the grant proposal and publication (pre-specified subgroups not reported) | 22% (11/49) | ||
| Discrepancy in pre-specified subgroup analyses between the grant proposal and publication (added/omitted subgroups from publications) (49 studies mentioned subgroups in their grant application) | 67% (33/49) | ||
| Subgroups included in the publication (30 did not pre-specify subgroups) | 50% (15/30) | ||
| No discrepancy between protocol and grant proposal | 62% (13/21) | ||
| No discrepancy between protocol and publication (a protocol was available for 21 trials) | 38% (8/21) | ||
| Subgroup effects were reported only for significant interaction tests (four publications included interaction tests) | 75% (3/4) | ||
|
| X–H | Discrepancy between trial registry and publication | 0% (0/7) |
| Subgroup analyses not included in trial registry but included in publication (information absent) (seven trials included subgroup analyses in trial registry or publication) | 86% (6/7) | ||
|
| H–H | It was commonly difficult to determine whether subgroup analyses were pre-defined or post hoc | Not applicable |
Based on summary assessments for three domains from Table 4: H = high quality; X = methodological concerns; ? = uncertain quality; – = not applicable.
Characteristics of included studies comparing information within a trial report.
| Study | Objective | Cohort of Studies | Dates | Information Sources Compared | Included Study Designs and Number of Studies/Total Number of Studies | Funding Source for All Studies | Conclusions |
|
| “To study how composite outcomes, which have combined several components into a single measure, are defined, reported, and interpreted” | Parallel group RCTs published in 2008 reporting a binary composite outcome | Published in 2008 | Abstract to | 40 RCTs | 40% (16/40) industry; 20% (8/40) partly industry, 18% (7/40) no industry, 23% (9/40) unclear | “The use of composite outcomes in trials is problematic. Components are often unreasonably combined, inconsistently defined, and inadequately reported. These problems will leave many readers confused, often with an exaggerated perception of how well interventions work.” |
|
| “To evaluate the use and reporting of adjusted analysis in randomised controlled trials (RCTs) and compare the quality of reporting before and after the revision of the CONSORT Statement in 2001” | Journal articles indexed in PubMed | Indexed on PubMed in December 2000 and December 2006 |
| 1,135 retrieved; 776 included; 197 reported adjusted analyses; 188 had a statistical | Not stated | “The analyses specified in the |
|
| “The first is to describe the reporting of subgroup analyses and claim of subgroup effects. The second is to assess study characteristics associated with reporting of subgroup analyses, and with claims of subgroup effects. The third objective is to examine the analysis and interpretation of subgroup effects conducted for the primary outcome.” | Articles published in 118 core clinical journals (defined by the National Library of Medicine), randomly selected, stratified in a 1∶1 ratio by higher impact versus lower impact journals | Published in 2007 |
| 207/469 reported subgroup analyses | 48% (99/207) industry, 52% (108/207) other | “Industry funded randomised controlled trials, in the absence of statistically significant primary outcomes, are more likely to report subgroup analyses than non-industry funded trials. Industry funded trials less frequently pre-specify subgroup hypotheses and less frequently test for interaction than non-industry funded trials. Subgroup analyses from industry funded trials with negative results for the primary outcome should be viewed with caution.” |
|
| “To assess the completeness and quality of subgroup analyses reported in | Original articles published in | Published July 1, 2005, through June30, 2006 |
| 97 RCTs in 95 articles; 59/97 reported subgroup analyses and were assessed further | Not stated | “When properly planned, reported, and interpreted, subgroup analyses can provide valuable information. With the availability of Web supplements, the opportunity exists to present more detailed information about the results of a trial. The purpose of the guidelines is to encourage more clear and complete reporting of subgroup analyses.” |
|
| “To review the appropriateness of reporting of subgroup analysis in RCTs recently published in major cardiology and internal medicine journals” | Phase 3 therapeutic cardiovascular RCTs with at least 100 patients, from journals with a high impact factor in the Institute for Scientific Information | Published September 1 to November 30, 2002, and May 1 to July 31, 2004 |
| 63 RCTs | Not stated | “Subgroup analyses in recent cardiovascular RCTs were reported with several shortcomings, including a lack of pre-specification and testing of a large number of subgroups without the use of the statistically appropriate test for interaction. Reporting of subgroup analysis needs to be substantially improved because emphasis on these secondary results may mislead treatment decisions.” |
|
| “To assess the extent and quality of subgroup analyses in major clinical trial reports” | A sample of 50 consecutive clinical trial reports were obtained from four major medical journals ( | Published July to September 1997 |
| 50 RCTs | Not stated | “Clinical trials need a predefined statistical analysis plan for uses of baseline data, especially covariate-adjusted analyses and subgroup analyses. Investigators and journals need to adopt improved standards of statistical reporting, and exercise caution when drawing conclusions from subgroup findings.” |
|
| “To evaluate the current use of baseline comparability tests and subgroup analyses in surgical randomized controlled trials” | Published surgical RCTs in four medical journals ( | Published January 2000 to April 2003 |
| 72 RCTs | Not funded 19.4% (14/72), not stated 1.4% (1/72), funded 79.2% (57/72) Funded by: not reported, 2; industry, 24; peer reviewed agency, 36; charity, 3; internal, 10 | “We identified important problems with the reporting of randomization, baseline comparisons, and subgroup analyses. The most concerning was the misuse of subgroup analyses. One in three studies did a subgroup analysis, of which most found subgroup differences using inappropriate statistical tests. The presentation of these subgroup findings in the conclusions only exaggerates the perceived significance of such exploratory analyses.” |
|
| “To show how often inaccurate or incomplete reports of subgroup analysis occur” | Published RCTs in four leading journals, i.e., | Published after July 1, 1998 |
| 32 RCTs | Not stated | “Current reporting of subgroup analysis in RCT is incomplete and inaccurate. The results of such subgroup analysis may have harmful effects on treatment recommendations if accepted without judicious scrutiny.” |
|
| “To determine how subgroup analyses are performed in large randomized trials of cardiovascular pharmacotherapy” | Double-blind RCTs of pharmacotherapy with clinical outcomes as primary end points, published between 1980 and 1997; trials with <1,000 patients were excluded to ensure a sizeable number of patients in the selected subgroups | Published between 1980 and 1997 |
| 67 RCTs | Not stated | “Trial subgroups should ideally be defined a priori on two bases: single-factor subgroups with a strong rationale for biological response modification and multifactorial prognostic subgroups defined from baseline risks. However, single-factor subgroup analyses are often reported without a supporting rationale or formal statistical tests for interactions. We suggest that clinicians should interpret published subgroup-specific variations in treatment effects sceptically unless there is a pre-specified rationale and a significant treatment-subgroup interaction.” |