| Literature DB >> 26017741 |
S H Heywang-Köbrunner1, I Schreer2, A Hacker3, M R Noftz4, A Katalinic5.
Abstract
UNLABELLED: Twenty-five-year follow-up data of the Canadian National Breast Cancer Screening Study (CNBSS) indicated no mortality reduction. What conclusions should be drawn? After conducting a systematic literature search and narrative analysis, we wish to recapitulate important details of this study, which may have been neglected: Sixty-eight percent of all included cancers were palpable, a situation that does not allow testing the value of early detection. Randomisation was performed at the sites after palpation, while blinding was not guaranteed. In the first round, this "randomisation" assigned 19/24 late stage cancers to the mammography group and only five to the control group, supporting the suspicion of severe errors in the randomisation process. The responsible physicist rated mammography quality as "far below state of the art of that time". Radiological advisors resigned during the study due to unacceptable image quality, training, and medical quality assurance. Each described problem may strongly influence the results between study and control groups. Twenty-five years of follow-up cannot heal these fundamental problems. This study is inappropriate for evidence-based conclusions. The technology and quality assurance of the diagnostic chain is shown to be contrary to today's screening programmes, and the results of the CNBSS are not applicable to them. KEY POINTS: • The evidence base of the Canadian study (CNBSS) has to be questioned.• Severe flaws in the randomization process and test methods occurred. • Problems were criticized during and after conclusion of the trial by experts.• The results are not applicable to quality-assured screening programs. • The evidence base of this study must be re-analyzed.Entities:
Keywords: Breast cancer; Canadian National Breast Cancer Screening Study (CNBSS); Image quality; Mammography screening; Randomization
Mesh:
Year: 2015 PMID: 26017741 PMCID: PMC4712234 DOI: 10.1007/s00330-015-3849-2
Source DB: PubMed Journal: Eur Radiol ISSN: 0938-7994 Impact factor: 5.315
Fig. 1Flow diagram of systematic literature search
Fig. 2This chart shows the number of publications which criticized or defended the CNBSS. With very few exceptions, only the main investigators defended the trial. Numerous authors criticized it. Seven of these authors were involved with review or quality assurance of the CNBSS. These authors are mentioned explicitly. Some published several articles concerning the CNBSS. Further publications mentioned some of the issues, but did not comment them. The list of references can be reviewed in the electronic supplementary material (ESM)
Issues concerning design, quality assurance, and evaluation of the CNBSS: all cited literature
| Topics | Arguments | CNBSS | Literature |
|---|---|---|---|
| Randomisation (n = 45) | Randomisation was performed at each site after clinical examination (change or violation of initial protocol); on-site randomisation after palpation could no longer guarantee blinding according to independent external review [ | Critique (n = 25) | [8], [9], [10], [11], [12], [2], [1], [13], [14], [15], [16], [3], [4], [17], [18], [19], [20], [21], [22] [23],[24], [25], [26], [27], [28] |
| The disproportionally large number of first round participants < age 50 years with advanced cancers entering the mammography group is considered as a strong indicator of flawed randomisation. | |||
| A low number of late stage cancers, possibly shifted from the control to the study group, could strongly affect and bias the calculation of mortality reduction or overdiagnosis, while other available variables will probably not yet be affected (due to their low association to BC mortality) | |||
| Defense: “Irrespective of the findings on physical examination….women were independently and blindly assigned randomly” [ | Defense (n = 14) | [29], [26], [30], [31], [32], [33], [34], [35], [36], [37], [38], [39], [40],[6] | |
| They refer to one analysis of a subgroup (Manitoba), which “showed no definitive evidence to support a nonrandom allocation of women with prior breast disease to the mammography arms of the study” [ | |||
| They argue that “>50 other variables showed no statistically significant difference between study and control groups” [ | |||
| Individual randomisation is assumed generally superior to cluster randomisation | |||
| Not commented, but mentioned (n = 7) | [41], [42], [43], [44], 45, [46], [47] | ||
| Inclusion criteria (n = 14) | By including mainly symptomatic women in this study, the value of screening by definition cannot be tested (68 % of cancers of the mammography arm were palpable). | Critique (n = 9) | [8], [48], [13], [3], [4], [49], [25], [24], [23], |
| Consequence: dilution of the measurable effect, study is underpowered for testing true screening effect | Defense (n = 3) | [37], [31], [29] | |
| Defense: issue not directly addressed (reasons for choosing this protocol are explained) | Not commented, but mentioned (n = 2) | [47], [46] | |
| Mammography quality (n = 46) | According to the responsible physicist, outdated equipment was used. The equipment at some centers was quite old, and at many centres it lacked key features such as automatic exposure control and grids [ | Critique (n = 28) | [8], [9], [13], [25], [4], [24], [3], [23], [49], [22], [18], [19], [9], [27], [21], [17], [20], [51], [52], [53], [50], [54], [54], [55], [56], [57], [58], [28] |
| Image quality was rated by external reviewers to be satisfactory in less than 40 % during 1980-1984 (active recruitment until 1985). According to the external reviewers, improvements of the image quality were regularly demanded and two reviewers resigned during the study due to unacceptable quality. Reported problems concerned incomplete inclusion of the breast tissue, unsharp images, low image contrast, over- or underexposed images resulting in too dark or too bright images, no training of readers, high numbers of obviously missed cancers. | |||
| The principal investigators state [ | Defense (n = 10) | [29], [38], [33], [34], [36], [59], [60], [61], [62], [36] | |
| Not commented, but mentioned (n = 10) | [43], [45], [63], [64], [65], [66], [67], [68], [69], [70] | ||
| Reading quality (n = 26) | Radiologists were not trained in reading mammograms. According to the reviewers, the quality of mammography reading was low and an extraordinarily high number of missed cancers occurred within the 1 year intervals. | Critique (n = 16) | [25], [4], [24], [3], [23], [18], [9], [27], [17], [51], [52], [50], [54], [55], [56], [57] |
| Reading quality and training of readers has not been specifically addressed by proponents of the study | Defense (n = 7) | [33], [34], [36], [38], [7], [71], [62] | |
| Not commented, but mentioned (n = 3) | [68], [63], [67] | ||
| Recommended biopsies not performed (n = 3) | “25 % of needle localization were recommended but not performed” [ | Critique (n = 1) | [4] |
| “ …at least one physician refused to do a biopsy on nonpalpable (=mammographically detected) lesions…” [ | |||
| “Study surgeons decided if diagnostic follow-up was recommended.” “Most biopsies were done” [ | Defense (n = 2) | [36], [5] | |
| Not commented, but mentioned (n = 0) | |||
| Contamination (n = 9) | 26 % of women allocated to the control group underwent mammography during the study period. This is a known problem, but may be more pronounced in the trials with individual randomisation. It falsely dilutes the measurable effect | Critique (n = 5) | [24], [48], [13], [72], [21] |
| Baines argues that a problem concerning 26 % of the control group can only exert a “small” effect | Defense (n = 1) | [36] | |
| Not commented, but mentioned (n = 3) | [66], [47], [41] |
This table gives an overview of the main issues discussed concerning the CNBSS. The list of references can be reviewed in the electronic supplementary material (ESM)
Fig. 3Images demonstrating the significant changes of technology between 1984, 1987 (CNBSS-study), and a follow-up mammogram of the same patient of 1993. Even though the later technology is still far from present contrast resolution, it becomes obvious that on the former mammograms almost no structures can be discerned in 80 % of the breast making detection of both masses and microcalcifications almost impossible. Images reproduced courtesy of Dr. Roberta Jong, Sunnybrook Health Sciences Centre, Toronto Canada