| Literature DB >> 26873063 |
Segun Bello1,2, Maoling Wei3, Jørgen Hilden4, Asbjørn Hróbjartsson5,6.
Abstract
BACKGROUND: Blinding is a pivotal method to avoid bias in randomised clinical trials. In blinded drug trials, experimental and control interventions are often designed to be matched, i.e. to appear indistinguishable. It is unknown how often matching procedures are inadequate, so we decided to systematically identify and analyse studies of matching quality in drug trials. Our primary objective was to assess the proportion of studies that concluded that the matching was inadequate; our secondary objective was to describe mechanisms for inadequate matching.Entities:
Mesh:
Year: 2016 PMID: 26873063 PMCID: PMC4752749 DOI: 10.1186/s12874-016-0111-9
Source DB: PubMed Journal: BMC Med Res Methodol ISSN: 1471-2288 Impact factor: 4.615
Fig. 1PRISMA Flow of database search for identifying eligible studies
Characteristics of included matching quality studies
| Characteristics | n = 36 (%) |
|---|---|
| Specialty | |
| Psychiatry | 6 (17) |
| Cardiology | 5 (14) |
| Respiratory medicine | 5 (14) |
| Infectious diseases | 5 (14) |
| Neurology | 2 (6) |
| Rheumatology | 2 (6) |
| Dermatology | 2 (6) |
| Complementary-alternative medicine | 2 (6) |
| Gastroenterology | 1 (3) |
| Medicala | 3 (8) |
| No information | 3 (8) |
| Formulation types | |
| Tablet | 17 (47) |
| Capsule | 12 (33) |
| Liquid (injection and oral medicament) | 2 (6) |
| Ointment | 2 (6) |
| Aerosol | 2 (6) |
| Nasal spray | 1 (3) |
| Type of control intervention | |
| Placebo | 33 (92) |
| Active | 3 (8) |
| Producer of the interventions | |
| Pharmaceutical company | 25 (69) |
| Investigators | 1 (3) |
| Not reported | 9 (25) |
| Pharmacy | 1 (3) |
| Method of comparison | |
| Group of assessors | |
| Assessors n = 4 | 26 (72) |
| Assessors n = 32-52 | 8 (22) |
| Pilot trial | 2 (6) |
aSpecific medical sub-specialties unclear
Characteristics of included publications
| Paper | General information | Assessment methods | Drug format (Number trials), sensory quality | Analysis | Basis for deciding inadequate matching |
|---|---|---|---|---|---|
| Hill and colleagues, 1976 | 22 trials (11 parallel group trial and 10 cross-over trial), described as “double-blind”, conducted in the UK over 12 months in 1973–74 and published in four journals (e.g. BMJ and Lancet). | Four assessors (medical and administrative staff) were twice provided with 4 interventions from each trial. The assessors were “without knowledge of the experimental design” | Tablet (11), colour, shape and appearance, smell, taste | For each trial, the proportion of assessments reported as different for identical interventions (control vs. control) (false positive rate) was subtracted from the proportion of assessments reported as different for the matched interventions: | Based on a qualitative assessment the investigators decided which trials were inadequately matched (“obvious differences” were detected by all 4 assessors) |
| Capsule (5), colour, shape and appearance, smell, taste | |||||
| Aerosol and nasal spray (3), the container, smell and taste of aerosol | |||||
| Ointment (2), colour, appearance and consistency, smell | |||||
| Liquid (1), the ampoule, colour and liquid consistency | |||||
| Blumen-thal and colleagues, 1974 | 6 trials (type not reported), described as “double blind”. | Fifty-two assessors (Students, spouses, lab scientists, secretaries) served as assessors. | Tablet (4), Sensory qualities not specified. | For each trial, the proportion of assessments for which assessors correctly separated all experimental intervention from placebo intervention was calculated. | Based on a Chi2 test the authors decided for each trial whether assessors were able to differentiate better than chance between experimental and control interventions. |
| Capsule (1), Sensory qualities not specified. | |||||
| Liquid (1), Sensory qualities not specified. | |||||
| Walter and colleagues, 2005 | 1 trial (parallel group), described as “double-blind”. The study was a pre-trial assessment of blinding integrity. | Four assessors (“study investigators”) were asked to examine independently a package of 10 tablets. The assessors were informed that it contained an equal number of experimental and control interventions. | Tablet (4), Taste and appearance | The number of correctly assigned tablets were recorded for each assessor | Based on a Chi2 test the authors decided whether assessors were able to differentiate better than chance between experimental and control interventions. |
| Fai and colleagues, 2011 | 1 trial (parallel group), described as “double-blind”. The study was a pre-trial assessment of blinding integrity. | 49 “adults” served as assessors. It was not reported how many interventions each assessors examined or what information was provided to assessors. | Capsule (1), The appearance and texture of package and capsule, and smell of capsule content. | Each assessor of packaging and capsules scored each property (exactly identical = 3 very close to unanimous = 2; significant difference = 1; inconsistent 0). | Based on Fischer’s exact test the authors decided whether assessors were able to differentiate better than chance between experimental and control interventions. |
| Dupin-Spriet and colleagues, 1993 | 3 trials (type not reported) | Four assessors (type not reported) were provided with 8 pairs of tablets from each trial. The information, if any, provided to the assessors on the distribution of experimental and control interventions was not reported. | Tablet (3), appearance (shape, colour, surface) | For each trial, the proportion of assessments reported as different for the identical interventions (false positive rate) was subtracted from the proportion of assessments reported as different for the matched interventions: | Based on Fischer’s exact test the authors decided for each trial whether assessors were able to differentiate better than chance between experimental and control interventions. |
| Wen and colleagues, 2004 | 1 trial (parallel group) | 32 assessors (e.g. doctors, nurses, graduate students) were provided with two bottles of experimental and control interventions that they examined independently. | Capsule (1), bottle and capsule appearance, bottle label and notes, size, shape, quality, colour, smell, taste, and capsule content | Proportion of assessments found “not uniform” on specific qualities and overall was calculated. | Based on Fisher’s exact test the authors decided whether assessors were able to differentiate better than chance between experimental and control interventions. |
| Farr and colleagues, 1987 | 2 pilot-trials (parallel group) | Two pilot-trials (low and high dose zinc) randomizing assessors (students and university employees) to experimental or control. Assessors were informed that one of the compared substances had shown possible effect, and that they would be asked to decide whether they were receiving an “active” or an “inactive” compound. | Tablet (2), bitter taste, aftertaste, palatability and guesses as to whether they believed they were receiving active or placebo. | Proportions of each assessment (e.g. bitter taste) was compared between trial groups (i.e. experimental and control groups) | Based on a Chi2 test the authors decided whether proportions of each assessments (experimental and control) differed more between trial groups than expected by than chance. |
Main and supplementary results
| Paper | Main results: | Supplementary results: Proportion of trials with inadequate matching | |
|---|---|---|---|
| High threshold | Low threshold | ||
| Hill and collegues, 1976 | 7 of 22 trials | 6/22 trials | 14/22 trials |
| Blumenthal and colleagues, 1974 | 5 of 6 trials | 5/6 trialsb | 5/6 trials |
| Walter and colleagues, 2005 | 1 of 1 trial | 0/1 trial | 1/1 trial |
| Fai and colleagues, 2011 | 0 of 1 trial | 0/1 trial | 1/1 trialc |
| Dupin-Spriet and colleagues, 1993 | 1 of 3 trials | 0/3 trials | 3/3 trials |
| Wen and colleagues, 2004 | 1 of 1 trial | 1/1 trials | 1/1 trials |
| Farr and colleagues, 1987 | 1 of 2 trials | 0/2 trialsd | 1/2 trialse |
aThe high thresholds are in most cases based on at least 75 % of assessors finding clear differences. The low thresholds are in most cases based on 25 % of assessors finding clear differences
bTwo groups of assessors were involved. In one group (patient-simulated), only in one trial did 75 % or more of assessors find clear difference between experimental and control intervention. In the second group of assessors (experimenter-simulated), over 80 % of assessors found clear differences in five out of six experimental and control drugs
cTaste was excluded from the study because “taste was not adjusted with bitter agent” though the experimental intervention most likely had a more bitter taste
dBased on chance alone, 50 % of assessors would be expected to correctly guess intervention types. We defined high threshold to imply that at least 75 % of assessors randomized to the experimental intervention perceived being on the experimental intervention
eSignificantly more than 50 % of assessors randomized to experimental group perceived to have received the experimental intervention