| Literature DB >> 21212122 |
R M D Smyth1, J J Kirkham, A Jacoby, D G Altman, C Gamble, P R Williamson.
Abstract
OBJECTIVES: To provide information on the frequency and reasons for outcome reporting bias in clinical trials.Entities:
Mesh:
Year: 2011 PMID: 21212122 PMCID: PMC3016816 DOI: 10.1136/bmj.c7153
Source DB: PubMed Journal: BMJ ISSN: 0959-8138

Trialists eligible for interview. *Includes 17 associated with trials suspected of outcome reporting bias, 21 responsible for trials with no outcome bias suspected
Characteristics of trialists who responded to the invitation to participate in the study and those who did not
| All (n=268) | Responded (n=59) | Did not respond (n=209) | χ2 | P value | ||
|---|---|---|---|---|---|---|
| Randomly selected from PubMed | 85 | 21 (25%) | 64 (75%) | |||
| Identified in Cochrane reviews: | ||||||
| Suspected of outcome reporting bias* | 85 | 17 (20%) | 68 (80%) | 0.056 | 0.812 | |
| Not suspected of outcome reporting bias* | 98 | 21 (21%) | 77 (79%) | |||
| <100 | 122 | 28 (23%) | 94 (77%) | 0.627 | 0.731 | |
| 100-999 | 128 | 26 (20%) | 102 (80%) | |||
| >1000 | 18 | 5 (28%) | 13 (72%) | |||
| Non-commercial | 105 | 33 (31%) | 72 (69%) | 10.064 | 0.018 | |
| Industry | 116 | 21 (18%) | 95 (82%) | |||
| Unfunded | 1 | 0 (0%) | 1 (100%) | |||
| Not stated | 46 | 5 (11%) | 41 (89%) | |||
| Statistician involved | 17 | 7 (41%) | 10 (59%) | 5.012 | 0.082 | |
| Statistician involvement unclear | 45 | 12 (27%) | 33 (73%) | |||
| No information provided | 206 | 40 (19%) | 166 (81%) | |||
*In relation to review primary outcome.
Characteristics of trial investigators and trials included in the study
| Trialists from Cochrane review cohort | Trialists from PubMed cohort (n=21) | |||
|---|---|---|---|---|
| Suspected of outcome reporting bias (n=17) | Not suspected of outcome reporting bias (n=21) | |||
| Extensive | 10 (59%) | 7 (33%) | 15 (71%) | |
| Coauthor had experience | 7 (41%) | 10 (48%) | 4 (19%) | |
| No experience in research team | 0 | 4 (19%) | 2 (9%) | |
| <100 | 6 (35%) | 11 (52%) | 11 (52%) | |
| 100-999 | 8 (47%) | 9 (43%) | 9 (43%) | |
| >1000 | 3 (18%) | 1 (5%) | 1 (5%) | |
| Non-commercial | 10 (59%) | 15 (71%) | 11 (52%) | |
| Industry | 7 (41%) | 5 (24%) | 10 (48%) | |
| Unfunded | 0 | 1 (5%) | 0 | |
| Throughout trial | 9 (53%) | 8 (38%) | 13 (62%) | |
| Consulted | 1 (6%) | 5 (24%) | 6 (29%) | |
| No statistician involved | 7 (41%) | 8 (38%) | 2 (10%) | |
Responses from trialists who had analysed data on a prespecified outcome but not reported them by the time of the primary publication (n=16)
| Trialist | Explanation from trialist | Category |
|---|---|---|
| 09 | “It was just uninteresting and we thought it confusing so we left it out. It didn’t change, so it was a result that we . . . you know, kind of not particularly informative let’s say, and was to us distracting and uninteresting.” | Bias |
| 12 | “There was no mortality. No mortality at all” | Bias |
| 18 | “We didn’t bother to report it, because it wasn’t really relevant to the question we were asking. That’s a safety issue thing; there was nothing in it so we didn’t bother to report it. It was to keep ethics committee happy. It is not as if we are using a new drug here, it is actually an established one, just an unusual combination, so if we are using new things we report all that sort of stuff, so it’s not that experimental. We didn’t bother to report it, because it wasn’t really relevant to the question we were asking.” | Bias |
| 22 | “The whole study showed that there was nothing in the [intervention]. So the whole study was actually a negative result, so I don’t think the fact that there was no effect prevented me from putting it into the paper. It was either possibly an oversight or possibly something I thought, ‘well this isn’t relevant.’” | Bias |
| 29 | “When I take a look at the data I see what best advances the story, and if you include too much data the reader doesn’t get the actual important message, so sometimes you get data that is either not significant or doesn’t show anything, and so you, we, just didn’t include that. The fact that something didn’t change doesn’t help to explain the results of the paper.” | Bias |
| 30 | “When we looked at that data, it actually showed an increase in harm amongst those who got the active treatment, and we ditched it because we weren’t expecting it and we were concerned that the presentation of these data would have an impact on people’s understanding of the study findings. It wasn’t a large increase but it was an increase. I did present the findings on harm at two scientific meetings, with lots of caveats, and we discussed could there be something harmful about this intervention, but the overwhelming feedback that we got from people was that there was very unlikely to be anything harmful about this intervention, and it was on that basis that we didn’t present those findings. The feedback from people was, look, we don’t, there doesn’t appear to be a kind of framework or a mechanism for understanding this association and therefore you know people didn’t have faith that this was a valid finding, a valid association, essentially it might be a chance finding. I was kind of keen to present it, but as a group we took the decision not to put it in the paper. The argument was, look, this intervention appears to help people, but if the paper says it may increase harm, that will, it will, be understood differently by, you know, service providers. So we buried it. I think if I was a member of the public I would be saying ‘what you are promoting this intervention you thought it might harm people—why aren’t you telling people that?’” | Bias |
| 32 | “If we had a found a significant difference in the treatment group we would have reported that, and it certainly would have been something we probably would have been waving the flag about. To be honest, it would have come down to a word limit and we really just cannot afford to report those things, even a sentence used, and often you have a sentence about this, and a sentence about that, and so it doesn’t allow you to discuss the more important findings that were positive or were negative as some of our research tends to be, because I guess it’s a priority of relevance” | Bias |
| 34 | “No I think probably, it’s possible, I am looking on the final one, but probably was each time, reduced and reduced from the start to submitting to the journal. It is very limited on numbers, probably we start to . . . it didn’t get accepted so we kind of cut and cut, I believe this is what happened.” (the manuscript went to four journals) | Bias |
| 36 | “It’s as dull as ditchwater, it doesn’t really say anything because the outcome wasn’t different, so of course [trial treatment] is going to be more expensive and no more effective so of course it’s not going to have a health economic benefit. Because you have got two treatments that don’t really differ. I just think, we have got to find a different way of . . . so for example I said well can’t we say something about the costs within the groups of those who relapsed and those who didn’t, just so that people get a ball park, but it’s written and he [coauthor—health economist] wants to put it in as it is, and I don’t have a problem with that, it’s rather a sense of I am not sure what it tells anybody.” | Bias |
| 39 | “We analysed it and there was two patients who had the outcome, you know one in each arm, so we decided the numbers were so small that we didn’t think that adding another row to the table to describe two patients added anything.” | Bias |
| 41 | “Patients in this particular trial turned out to use very low amounts of drugs. So, there was nothing essentially to compare. The use of other drugs was not an important issue in this population. There was nothing to report. There was no reportable data, no interesting story in the secondary outcome data, and our intention was always to focus on the opiate use not on the other drugs. I did look, we do have data on other drug use, we have collected data as we promised, but essentially there is nothing to report in this data. Patients do not use other drugs heavily. We will present again, I have all the intentions, the data is available for analysis and for presentation if one of my students decide to do some work with this and help me out with this, absolutely it will get published, but I have to pick and chose what I am actually working on.” | Bias |
| 44 | “We probably looked at it but again it doesn’t happen by magic. So, I can’t imagine that there would be a difference. Why we didn’t? My guess is that we didn’t look at it because that is something that has to be prospectively collected, and so I would assume that we collected it and there was just absolutely no difference, but I don’t recall. I am pretty sure there would not be differences, it would be related to temperature, but what the results were I don’t remember at this point.” | Bias |
| 49 | “Yes because what happened is, I am left with a study where everything is [non-significant], even though we walked in believing that we would see a difference, and even though we had some preliminary information, you know anecdotal, that there should be a difference, there was no difference. So, it really turned out to be a very negative study. So we did collect that information, and again it’s a non-result, but there are only so many negative results you can put into a paper.” | Bias |
| 50 | “It didn’t add anything else to the data, it changed but it wasn’t anything that was remarkable, it wasn’t a significant change. If it had been something that either added additional strength to the data, or if it was conflicted, if it turned out it went totally against our data, and was counteractive to what we were saying, yes we would have reported.” | Bias |
| 54 | “Yes, we have those data on file, and I am sorry to say that we are writing-up so many papers sometimes we do not know what’s in the other papers. It has been analysed, I know, because what I know right now is that all the measurements which would be performed in this study as well as in two other studies were done, very simply because the outcome is very difficult to measure, well it’s very simple to measure, to get the antibodies is very difficult, and we got it from an organisation that gave us just enough to do the measurements. I know the results, saying from what I have in my head the outcome is going up, we have high levels of it, but I am not sure whether it was a significant increase or whether it was significant compared to the control group or the other group as mentioned in the study.” | Delay in writing up of secondary outcomes beyond primary publication |
| 56 | “I actually disagree that this outcome is important, but that was probably a more pragmatic aspect of making sure that our protocol was funded, because I think some reviewers might have said, ‘wow you are not measuring this outcome!’ That said, there is a vast amount of literature showing that it’s of completely no relevance but it was a practical decision to make sure we got money. Once we conducted the study and reflected on our results more we just didn’t think it had that much validity in telling us very much about the condition. So for the sake of brevity we didn’t report that. I didn’t expect there would be much of a difference, and our results show that there wasn’t much of a difference.” | Bias |
Responses from trialists who had collected data on a prespecified outcome but not analysed them by the time of the primary publication (n=17)
| Trialist | Explanation from trialist | Category |
|---|---|---|
| 05 | “I think, you know, the issue with cost is it would have been relatively easy to get hospital charges, but actual costs are more complicated analysis, and given that there was no difference between the groups we didn’t go onto that.” | Secondary data not analysed because no difference in primary outcome |
| 08 | “Part of the problem is, as our economists keep on telling us, we have been collecting economic data, but we haven’t had enough of them. So for instance, for someone going into treatment, you know, we still don’t have a lot of participants who are receiving treatment. So, one thing that we now have is older participant rates, so the economists are just starting to work on that, saying: ‘OK, now that we have older rates, we can project out what potential benefits in terms of like earnings over a life course, we might expect.’” | Long term outcome data not obtained by the time primary outcome data published |
| 15 | “Unfortunately, the year we were funded the funders levied large across the board cuts to all approved projects. Our recruitment also took longer than expected. As a result, we had neither the time nor the resources to analyse for effects on the several less important secondary or exploratory outcomes we mentioned in our initial proposal.” | Insufficient time, resources, or both to analyse less important outcomes |
| 20 | “The problem was that they weren’t completed as well as all the other measures, it was only just over two thirds of people who consistently completed their diary, so it would have reduced our numbers considerably.” | Not analysed because of the amount of missing data |
| 23 | “It was missing, a lot of people were missing data, and because we were looking at pre- and post- if they were missing at either point then we had to throw out that person, so we just didn’t have enough, and then across three different conditions, so if you have only six or seven people with data in each condition it really wasn’t worth looking at.” | Not analysed because of the amount of missing data |
| 27 | “Samples were taken and one could argue that this is unethical actually because those samples were taken, and stored, and frozen, and the results of analysis have never been published because the analysis was never done. Speaking as someone who is interested in trial ethics, that is probably unethical to take samples and not analyse them. Here is the answer to that question then, it’s really simple issue, which is the reality of doing a trial. This was an utter nightmare. For the analysis we did all the preparation abroad, it was an utter nightmare. I wouldn’t say I still wake up screaming. For a small aspect of a trial, you know, some of the more interesting outcomes and laboratory based things to support whatever the hypothesis is, it grew to assume gargantuan proportions because there was no electricity, it was 40 degrees, there were bugs crawling around everywhere, and this turned into a nightmare. Getting it back to . . . there were no facilities to do these in the country and getting it back to us was a nightmare. And, the reason for the reporting differences for this biochemical stuff is actually I suspect not to do with bias but to do with pragmatics.” | Not analysed because of practical difficulties |
| 31 | “The main reason was limited power, so we had fewer variables up front, because of small sample size. Most of it was driven by the small sample size, you know when you have 54 participants and you are trying to look at outcomes, so we made decisions about what seemed to be the most important variable. I think I tend to be conservative and part of this is just the sample was so small. I have tried to be very focussed in the papers and there is lots of data that is lying dormant essentially because of that.” | Limited analysis undertaken owing to poor recruitment |
| 39 | “No, we haven’t analysed it yet, you know we saw no effect on [primary outcome], which means that the drug is not ever going to get, well not in the foreseeable future, going to get on the market. So I think it’s less important, people aren’t going to have access to it, it’s less important to share, to make that data public given the low likelihood that it’s going to show anything. It’s not going to have any immediate clinical implications for anybody.” | Secondary data not analysed because no difference in primary outcome |
| 40 | “Well, we will be doing it, so what happens is you send someone down to a laboratory, they do the various procedures and then you have to have someone who scores the protocol, which requires various software and code writing, and then someone to spend the time to actually analyse the data. And, it just, it takes a while and so our primary interest was getting out the information, these other measures were more secondary outcome measures. It gets into issues around priorities, staff leaving, new staff having to be trained, it has just taken us a while to get this stuff analysed and as I said it’s, it has been analysed and we are hoping to write the manuscript for this, this fall.” | Delay in obtaining the data |
| 41 | “The cost effectiveness was really not conducted in this study. I am not an expert on cost effectiveness, there is a different team that works with us on cost effectiveness analysis, and this is their own survey now. And also the size, the effect sizes to be expected for the cost effectiveness analysis are much smaller and therefore the sample size did not really afford this type of analysis.” | Not analysed because of poor recruitment |
| 42 | “Well that, it turns out that our funders [industry] kind of went through some internal reorganisation, and as a result there are delays in the analyses of this outcome, and so we are still waiting to receive the final data from that, to publish it. We have got partial data completed on that part of the project but not complete. The data, it’s really held by us. The funders have the samples, they just have that one set of samples, and they send us the data and we own the data and do the analysis, they just did the assay. We are interpreting it and analysing it and everything.” | Delay in obtaining the data |
| 43 | “The economic evaluation, it’s in processes. Well, we were supposed to have it analysed by the end of the month, I can’t give you any preliminary. We know about the utilisation already but I don’t know about the related costs. Our goal is to have it actually submitted this year.” | Long term outcome data not obtained by the time primary outcome data published |
| 44 | “So it was a negative trial, it was not a failed trial, we have statistical power, but it was negative basically, we showed that in the context of our environment the intervention did not increase the primary outcome so basically the outcomes were identical in the two groups. Given this there was absolutely no reason to expect the difference in anything we would measure in the blood. So, we had collected all that, it was sitting in the refrigerator, in the freezer, so we didn’t do the analysis because it was completely obvious that it would be negative.” | Samples not analysed because no difference in primary outcome |
| 45 | “That outcome is a paper still in the making. That data was kept completely aside, you know, as you can see the paper that we are talking about has got so much in it and we did hum and ah whether we were going to break it down or present it separately, or any of that sort of stuff, but when we did come down to it, this outcome was just going to be too much again for that paper and so we chose to leave it and it’s still a paper in the making, it’s still data which is sitting there and hopefully will be a standalone paper in itself. I haven’t got round to analysing it, I have gone out of a PhD and jumped straight into something else, I have still got data and all sorts of stuff that I haven’t even have time to enter. I get the impression from talking to others as well that it’s always common when you collect a lot of data to have one or two variables that never make it to the cutting board. So this outcome is not in my thesis. I would like to think that it will reach publication one day. I think at the moment I am very, I am very rushed with all the new things that I have got on my plate, but there is a number of variables that I breeze back on and think to myself ‘oh I will do something about that one day, I will do something about that one day.’ So yes I would like to think that it wasn’t, it wasn’t a wasted time in collecting it.” | Outcome not analysed because volume of data presented in primary publication |
| 53 | “Right, we did that and just because it takes absolutely for ever to score it, and although I am a trained scorer as a primary investigator I can’t do the scoring, so I had to send one of my team members to get trained and it just takes, it is taking forever to get through them all, so it’s still in the pipeline. I would have had to postpone the writing of the primary outcome paper if I had to wait for this data. Yes, we have almost just about done now and so in a separate publication we will report that.” | Delay in obtaining the data |
| 55 | “Doing the measurements is really hard and I wasn’t sort of specifically trained to do them. I mean the clinicians told me how to do them, but it can be quite difficult to do. Which is a shame because that is a nice sort of, a nice measurement to have. We did it the whole way through but kind of knowing that it was pointless. I think, because we ended up doing it for the trial participants rather than for us, because they expected it to be done. So we just did it and noted it down but never analysed it, because we didn’t believe in it, even if it had shown something brilliant, we wouldn’t have thought it was true.” | Not analysed because of practical difficulties and uncertainty about validity of data |
| 59 | “I think it had just fallen off our radar screen and we were focused on the primary outcome, because the story would have been ‘you reduce the outcome, and a higher proportion of people remain independent.’ In fact, we sort of then began telling another story. It didn’t seem important once we had already frightened ourselves by showing that we caused harm and I suppose the story could have been ‘you cause harm and you cause more people to have to go into care.’ I think it just fell off the radar screen because we then began to worry about why, why this didn’t work.” | Not analysed because harmful effect of intervention on primary outcome |