| Literature DB >> 18769481 |
Kerry Dwan1, Douglas G Altman, Juan A Arnaiz, Jill Bloom, An-Wen Chan, Eugenia Cronin, Evelyne Decullier, Philippa J Easterbrook, Erik Von Elm, Carrol Gamble, Davina Ghersi, John P A Ioannidis, John Simes, Paula R Williamson.
Abstract
BACKGROUND: The increased use of meta-analysis in systematic reviews of healthcare interventions has highlighted several types of bias that can arise during the completion of a randomised controlled trial. Study publication bias has been recognised as a potential threat to the validity of meta-analysis and can make the readily available evidence unreliable for decision making. Until recently, outcome reporting bias has received less attention. METHODOLOGY/PRINCIPALEntities:
Mesh:
Year: 2008 PMID: 18769481 PMCID: PMC2518111 DOI: 10.1371/journal.pone.0003081
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Figure 1Study flow diagram.
Figure 2QUOROM flow diagram.
Study characteristics for inception cohorts.
| Study | Objective | Committee approving protocols (country) | Period protocols approved | Date of follow up | Included study designs; Number of studies/Total number of studies (percentage of studies included) | Funding source for all studies | Conclusions |
| Easterbrook, 1991 |
| Central Oxford research Ethics committee (UK) | 1984–1987 | 1990 | Analysed | 17% unfunded, 20% NHS or department, 13% Government, 38% Pharmaceutical industry, 12% private/charity | Studies with statistically significant results were more likely to be published, also more likely to lead to a greater number of publications and presentations and to be published in journals with a high citation impact factor. |
| Dickersin, 1992 |
| Institutional Review Boards that serve The John Hopkins Health Institutions (USA) | 1980 | 1988 | Completed | 45% NIH, 12% other government, 8% Drug industry, 63% Other, 4% Internal, 18% None. | There is a statistically significant association between significant results and publication. |
| Dickersin, 1993 |
| National Institutes of Health (USA) | 1979 | 1988 | RCTs 310/310 (100%) | 50% Grant, 30% Contract, 20% Intramural. | Publication bias is a significant problem |
| Stern, 1997 |
| Approved Royal Prince Alfred hospital ethics committee application (Australia) | 1979–1988 | 1992 | Total: RCTs 418/748 (56%), observational 165/748 (22%), non trial experiment 165/748 (22%) | 117/321 Internal, 206/321 External | Confirms the evidence of publication bias found in other studies. Identifies delay in publication as an additional important factor |
| Completed questionnaires: RCTs 277/520 (53%), observational 129/520 (25%), non trial experiment 114/520 (22%) | |||||||
| Analysed | |||||||
| Cooper, 1997 |
| Department of Psychology Human Subjects Committee or Institutional Review Board, Midwestern, research oriented, state university, USA | 1986–1988 | NI | NI | NI | Significant findings were more likely than non-significant findings to be submitted for meeting presentation or publication. |
| Wormald, 1997 |
| Trials processed through the pharmacy of Moorfields Eye Hospital (UK) | 1963–1995 | 1997 | RCTs 61/61 (100%) | NI | There was limited evidence of publication bias |
| Ioannidis, 1998 |
| Efficacy clinical trials conducted by AIDS Clinical Trials Group and Terry Beirn Community Programs for Clinical Research on AIDS (USA) | 1986–1996 | 1996 | RCTs 109/109 (100%) | Data managed by: 10% Pharmaceutical industry, 90% Other federally sponsored. | There is a time lag in the publication of negative findings that occurs mostly after the completion of the trial follow up. |
| Pich, 2003 |
| Hospital Clinic Ethics Committee (Spain) | 1997 | 2001 | RCTs 158/158 (100%) | 89% Pharmaceutical industry, 11% Other. | Only 64% of trials that started were finally implemented and finished in accordance with the original protocol. Only 31% of closed clinical trials were published or in-press in peer reviewed journals. |
| Cronin, 2004 |
| R&D projects funded by the NHS and commissioned by the North Thames Regional Office (UK) | 1993–1998 | 1995–1998 | NI | 100% government | Funders should consider the significant number of studies that did not result in publication and the higher rate of publication in peer reviewed journals from some programs |
| Decullier, 2005 |
| French Research Ethics Committees (France) | 1994 | 2000–2002 | Total: RCTs 345/649 (53%), descriptive/observational 91/649 (14%), non-randomised 213/649 (33%) | 8% No funding, 73% Private funding, 13% Public, 6% Mixed. | Too many studies are not completed and too many are not published. |
| Completed: RCTs 269/501 (54%), descriptive/observational 66/501 (13%), non-randomised 166/501 (33%) | |||||||
| Decullier, 2006 |
| Greater Lyon regional scientific committee (France) | 1997 | 2003 | RCTs 20/142 (14%), experimental 15/142 (10%), descriptive 45/142 (32%), analytical 27/142 (19%), not clinical 28/142 (20%), not available 7/142 (5%) | 38% committee funded | Some protocols submitted for funding were initiated and completed without any funding declared. To our understanding this means that not all protocols submitted really needed funding and also that health care facilities are unaware that they implicitly financially support and pay for biomedical research. |
| Hahn, 2002 |
| Local Research Ethics Committee (UK) | 1994 | 1999 | Of 15 published: RCTs 2/15 (13%), non RCT 2 (13%), uncontrolled trial 2 (13%), case control 1 (7%), survey 2 (13%), cohort and case control 1 (7%), method evaluation study 5 (34%) | Not recorded | Within-study selective reporting may be examined qualitatively by comparing the study report with the protocol. The results suggest that it might well be substantial; the bias could only be broadly identified as protocols were not sufficiently precise. |
| Chan, 2004a |
| Canadian Institutes of Health | 1990–1998 | 2002/2003 | RCTs 108/108 (100%) | 42% jointly funded by industry and CIHR/MRC, 58% no industry funding. | Selective reporting of outcomes frequently occurs in publications of high-quality government-funded trials. |
| Research (Canada) | |||||||
| Chan, 2004b |
| Scientific-Ethical Committees for Copenhagen and Frederiksberge, Denmark | 1994–1995 | 2003 | RCTs 304/304 (100%) | 55% Full industry, 17% Partial industry, 22% Non-industry, 7% non declared. | Reporting of trials is frequently incomplete, biased and inconsistent with protocols. |
| Ghersi, 2006 |
| CSAHS Ethics review committee (Australia) | 1992–1996 | NI | RCTs 318/318 (100%) | 37% commercial funding, 63% no commercial funding. | NI |
| Von Elm, 2008 |
| University of Berne/CH ethics committee (Switzerland) | 1988–1998 | 2006 | Total: RCTs 451/1698 (27%) In depth analyses: 451/451 (100%) | 81% industry, 10% other | About half of drug trials are not published. A high prevalence of pre-specified outcomes are not reported and discrepancies includes primary outcomes. Completeness of reporting of an outcome is associated with statistical significance. |
Easterbrook et al assumed that only studies that had been analysed had the potential for being written up and published, so tests for study publication bias were restricted to these.
Studies for which there was a full interview by the researchers of the cohort study and for which information on the nature of results and publication was provided.
Of the 520 studies with completed questionnaires, 321 had analysis undertaken with results available and were included in further analysis of the association between study outcome and time to publication.
Both groups are not mutually exclusive. 4% had a statement of both sources of funding. In the remainder, the protocols did not include information on how study was funded.
NI No information available.
Methodological Quality Assessment.
| Quality criteria | Inception cohort | Complete follow up of all trials | Publication ascertained through personal contact with investigators | Definition of positive and negative findings clearly defined | Comparison of protocol to publication |
| Easterbrook, 1991 | Y | N (25% lost to follow up) | Y | Y (positive: p<0.05/striking, negative: p≥0.05/definite but not striking, null: no difference observed between the groups/null findings.) | NA |
| Dickersin, 1992 | Y | Y | Y | Y (positive: p<0.05/statistically significant, negative: suggestive trend but not statistically significant, null: no trend or difference. In terms of importance when statistical tests were not performed: great, moderate or little.) | NA |
| Dickersin, 1993 | Y | N (14% refused to participate) | Y | Y (positive: p<0.05 significant/of great importance, negative: showing a trend in either direction but not statistically significant/moderate importance/no difference/little importance.), | NA |
| Stern, 1997 | Y | N (only 70% of questionnaires were completed) | Y | Y (positive: p<0.05 significant/striking/important/definite, negative: non-significant trend 0.05≤p<0.10 or non-significant or null p≥0.10/unimportant and negative) | NA |
| Cooper, 1997 | Y | Y | Y | N (significant and non-significant) | NA |
| Wormald, 1997 | Y | Y | Y | Y (positive: p<0.05, negative: p≥0.05) | NA |
| Ioannidis, 1998 | Y | Y | Y | Y (positive: p<0.05 significant and in favour of experimental therapy arm or any arm when there is no distinct control, negative: nonstatistically significant findings or favouring the control arm ) | NA |
| Pich, 2003 | Y | Y | Y | NA | NA |
| Cronin, 2004 | Y | Y | Y | U (study showed effect) | NA |
| Decullier, 2005 | Y | N (only 69% of questionnaires were completed) | Y | Y (confirmatory/inconclusive/invalidating) | NA |
| Decullier, 2006 | Y | N (only 80% of questionnaires were completed) | Y | Y (scale from 1 to 10 for not important to very important) | NA |
| Hahn, 2002 | Y | Y | Y | NA | Y |
| Chan, 2004a | Y | Y | Y | Y (positive: p<0.05, negative: p≥0.05) | Y |
| Chan, 2004b | Y | Y | Y | Y (positive: p<0.05, negative: p≥0.05) | Y |
| Ghersi, 2006 | Y | Y | Y | Y (positive: p≤0.05, negative: p>0.05) | Y |
| Von Elm, 2008 | Y | Y (for drug trials) | Y | Y (positive: p<0.05) | Y |
Y yes.
N no.
U unclear.
NA Not applicable.
Figure 3Status of approved protocols for Chan 2004b study [15].
Figure 4Status of approved protocols for Easterbrook 1991 study [26].
Figure 5Status of approved protocols for Dickersin 1992 study [27].
Figure 6Status of approved protocols for Dickersin 1993 study [3].
Figure 7Status of approved protocols for Stern 1997 study [4].
Figure 8Status of approved protocols for Cooper 1997 study [32].
Figure 9Status of trials for Wormald 1997 study [21].
Figure 10Status of approved protocols for Ioannidis 1998 study [5].
Figure 11Status of approved protocols for Pich 2003 study [28].
Figure 12Status of approved protocols for Cronin 2004 study [31].
Figure 13Status of approved protocols for Decullier 2005 study [29].
Figure 14Status of approved protocols for Decullier 2006 study [30].
Figure 15Status of approved protocols for Hahn 2002 study [13].
Figure 16Status of approved protocols for Chan 2004a study [14].
Figure 17Status of approved protocols for Ghersi 2006 study [17].
Figure 18Status of approved protocols for von Elm 2008 study [18].
Publication and trial findings.
| Study ID | Total published (percentage) | Positive (percentage) | Negative (percentage) | Null (percentage) |
| Easterbrook, 1991 | 138/285 (48%) | 93/154 (60%) | 12/34 (35%) | 33/97 (34%) |
| Dickersin, 1992 | 390/514 (76%) | 260/314 (83%) | 130/200 (65%) | NI |
| Dickersin, 1993 | 184/198 (93%) | 121/124 (98%) | 63/74 (85%) | NI |
| Stern, 1997 | 189/321 (59%) | 153/232 (66%) | 13/37 (35%) | 23/52 (44%) |
| Cooper, 1997 | 38/121 (status known for 117/121) (31%) | - | - | - |
| Wormald, | 30/61 (status known for 39 completed trials) (49%) | 14/15 (93%) | 15/21 (71%) | NI |
| Ioannidis, 1998 | 36/66 (55%) | 20/27 (74%) | 16/39 (41%) | NI |
| Pich, 2003 | 26/123 (21%) | NI | NI | NI |
| Cronin, 2004 | 28/70 (40%) | NI | NI | NI |
| Decullier, 2005 | 205/649 (32%) (status known for 248 | 129/188 (67%) | 3/16 (19%) | 14/44 (32%) |
| Decullier, 2006 | 48/93 (status known for 47/51 completed trials) (52%) | 26/37 (70%) | 6/10 (60%) | NI |
| Hahn, 2002 | 18/27 (67%) | NI | NI | NI |
| Chan, 2004a | 48/105 (46%) | NI | NI | NI |
| Chan, 2004b | 102/274 (37%) | NI | NI | NI |
| Ghersi, 2006 | 103/226 (46%) | NI | NI | NI |
| Von Elm, 2008 | 233/451 (52%) | NI | NI | NI |
Analysis restricted to 248completed, non confidential, with hypothesis tests and direction of results.
NI No information, this study does not look at this.
- Not able to work out values.
Status implies positive or negative findings.
Aspects of study level publication bias.
| Study | Definition of published article employed by eligible studies | Time since protocol approved | % Not submitted | % Submitted but not accepted | % Studies not published that were not submitted (95% CI) |
| Easterbrook, 1991 | Acceptance by a journal (published or in press), but not book chapters or published meeting abstracts or proceedings. | 3 years | 9 (6, 12) | 6 (3, 9) | 63 (49, 77) |
| Dickersin, 1992 | Reported in one or more journal articles, monographs, books, or chapters in books, were available from medical libraries or were in documents available from a published archive, e.g., the National Technical Information service | 8 years | 23 (19, 27) | 1.5 (0, 3) | 95 (91, 99) |
| Dickersin, 1993 | Abstracts, journal articles, book chapters, proceedings, letter to the editor, or other material. | 9 years | 7 (3, 11) | 0 | 100 |
| Stern, 1997 | As an article in a peer reviewed journal | 4 plus years | 33 (28, 38) | 13 (8,17) | 80 (73, 87) |
| Cooper, 1997 | NI | NI | NI | ||
| Wormald, 1997 | NI | 2 to 34 years | 21 (8,34) | 0 | 100 |
| Ioannidis, 1998 | Peer-reviewed journal articles | 12% completed less than 1 year ago, 20% completed over 1 year ago. | 32 (21, 43) | 20 (8, 32) | 70 (54, 86) |
| Pich, 2003 | Peer reviewed journal articles | 3 years |
| ||
| Cronin, 2004 | NI | 1 to 5 years |
| ||
| Decullier, 2005 | Published as a scientific paper. Grey literature – published in other formats than scientific papers, that is generally not accessible through libraries (internal reports, theses, abstracts, posters) | 6 years (2% not ready) | 58 (54,62) | 2 (0, 4) | 99 (98, 100) |
| Decullier, 2006 | Published as a scientific paper | 6 years | 47 (37,57) | 2 (0, 6) | 98 (95, 100) |
| Hahn, 2002 | Published article or a written study report | 5 years |
| ||
| Chan, 2004a | Journal article | 4 years | 31 (22,40) | 1 (0, 3) | 97 (91, 100) |
| Chan, 2004b | Journal article | 8 years | 56 (50, 62) | 2 (0, 5) | 99 (97, 100) |
| Ghersi, 2006 | NI | NI | NI | ||
| Von Elm, 2008 | Journal article | 8 years |
|
Denominator is the total number of studies with known publication status in the cohort i.e. published plus unpublished plus in preparation plus in press.
Denominator is the number submitted.
Denominator includes ongoing studies.
Question not asked.
NI No information available.
Comparisons of primary outcome stated in protocol and in publication.
| Study | Study level | Outcome level | ||||||||
| Are studies with statistically significant or positive results, more likely to be published than those finding no difference between the study group? | Primary outcome stated in protocol is the same as in the publication | Primary outcome stated in protocol is downgraded to secondary in the publication | Primary outcome stated in the protocol is omitted from the publication | A non primary outcome in the protocol is changed to primary in the publication | A new primary outcome that was not stated in the protocol (as primary or secondary) is included in the publication | The definition of the primary outcome was different in the protocol compared to the publication | Completeness of reporting | Overall | Definition of disagreement/discrepancy | |
| Easterbrook, 1991 | OR 2.32, 95% CI; 1.25, 4.28. |
|
|
|
|
|
|
|
|
|
| Dickersin, 1992 | OR 2.54, 95% CI; 1.63, 3.94 |
|
|
|
|
|
|
|
|
|
| Dickersin, 1993 | OR 12.30, 95% CI; 2.54, 60 |
|
|
|
|
|
|
|
|
|
| Stern, 1997 | HR 2.32, 95% CI; 1.47, 3.66, |
|
|
|
|
|
|
|
|
|
| Cooper, 1997 |
|
|
|
|
|
|
|
|
|
|
| Wormald, 1997 | RR 4, 95% CI; 0.6, 32, |
|
|
|
|
|
|
|
|
|
| Ioannidis, 1998 |
|
|
|
|
|
|
|
|
|
|
| Pich, 2003 |
|
|
|
|
|
|
|
|
|
|
| Cronin, 2004 | OR 0.53, 95% CI; 0.25, 1.1 |
|
|
|
|
|
|
|
|
|
| Decullier, 2005 | OR 4.59, 95% CI; 2.21, 9.54 |
|
|
|
|
|
|
|
|
|
| Decullier, 2006 | OR 1.58, 95% CI; 0.37, 6.71 |
|
|
|
|
|
|
|
|
|
| Hahn, 2002 |
| 40% (6/15) stated which outcome variables were of primary interest and 4 of these (67% ) showed consistency in the reports |
| 17% (1/6) |
| 17% (1/6) |
|
|
| NA |
| Chan, 2004a |
| 67% (32/48) | 23% (11/48) | 13% (6/48) | 9% (4/45) | 18% (8/45) | 36 trials reported a power calculation based on a particular outcome in their publications; 2 used an outcome that differed from the one used in the protocol and another introduced a power calculation that had not been mentioned in the protocol | Primary outcomes were incompletely reported in 7 (16%) of 45 trials that defined such outcomes in their publications | 40% (19/48) of the trials contained major discrepancies in the specification of primary outcomes between the protocols and publications. None of the publications stated that an amendment had been made to the protocol. | The failure to report a pre-specified primary outcome; reporting of a prespecified primary outcome as secondary or as neither primary nor secondary in the publication; the introduction of new primary outcomes in the publication; and changing the outcome specified for the power calculation. |
| Chan, 2004b |
| 47% (36/76) | 34% (26/76) | 26% (20/76) | 19% (12/63) | 17% (11/63) | 38 trials reported a power calculation, but 4 calculations were based on an outcome other than the one used in the protocol. In another 6 cases, there was a power calculation presented in a published article but not in the protocol | Primary outcomes were specified for 63 of the published trials, but for 17 (27%) of these trials at least one primary outcome was incompletely reported. | 62% (51/82) of trials had major discrepancies between primary outcomes specified in protocols and those defined in the published articles (at least 1 primary outcome that was changed, introduced or omitted). | A prespecified primary outcome was reported as secondary or was not labelled as either; a prespecified primary outcome was omitted from the published articles; a new primary outcome was introduced in the published articles and the outcome used in the power calculation was not the same in the protocol and the published articles. |
| Ghersi, 2006 |
| 74% |
| 31% |
| 10% | 60% definition the same, 31% unable to judge and 8% discrepant | The odds of fully reporting a comparison were greater if the result was statistically significant, (OR 2.2, 95% CI; 0.9–5.3, |
| NA |
| Von Elm, 2008 |
|
|
| 26% (24/92) |
| 11% (11/101) |
|
|
| NA |
NA Not Applicable.
The study did not investigate this.
Gives results for trials with discrepancies for ≥1 primary outcome.
Results obtained from 248 of the completed studies.