| Literature DB >> 27334105 |
Joann G Elmore1, Anna Na Tosteson2, Margaret S Pepe3, Gary M Longton4, Heidi D Nelson5, Berta Geller6, Patricia A Carney7, Tracy Onega8, Kimberly H Allison9, Sara L Jackson10, Donald L Weaver11.
Abstract
OBJECTIVE: To evaluate the potential effect of second opinions on improving the accuracy of diagnostic interpretation of breast histopathology.Entities:
Mesh:
Year: 2016 PMID: 27334105 PMCID: PMC4916777 DOI: 10.1136/bmj.i3069
Source DB: PubMed Journal: BMJ ISSN: 0959-8138

Fig 1 Algorithm for determination of final biopsy interpretation used in the evaluation of different second opinion policy strategies. Up to three pathologists may be needed to obtain a final interpretation. Data are comprised of 5 145 480 observations each involving three independent pathologist interpretations of a single slide from a breast biopsy specimen and are derived from 115 single pathologists interpreting 60 cases each in four test sets
Characteristics of participating pathologists (n=115) by reported weekly volume of breast biopsy specimens
| Characteristics | Total No (%) | No (%) | P value‡ | |
|---|---|---|---|---|
| Reported weekly breast caseload | ||||
| Low volume* | High volume† | |||
| Total sample | 115 (100) | 75 (100) | 40 (100) | |
| Age at survey (years): | ||||
| 33-39 | 16 (14) | 11 (15) | 5 (13) | 0.98 |
| 40-49 | 41 (36) | 27 (36) | 14 (35) | |
| 50-59 | 42 (37) | 25 (33) | 17 (43) | |
| ≥60 | 16 (14) | 12 (16) | 4 (10) | |
| Women | 46 (40) | 28 (37) | 18 (45) | 0.42 |
| Men | 69 (60) | 47 (63) | 22 (55) | |
| Experience of breast pathology | ||||
| Fellowship training in breast pathology: | ||||
| No | 109 (95) | 71 (95) | 38 (95) | 0.94 |
| Yes | 6 (5) | 4 (5) | 2 (5) | |
| Affiliation with academic medical center: | ||||
| No | 87 (76) | 59 (79) | 28 (70) | 0.50 |
| Adjunct or affiliated | 17 (15) | 9 (12) | 8 (20) | |
| Primary appointment | 11 (10) | 7 (9) | 4 (10) | |
| Breast pathology experience (years): | ||||
| 0-4 | 22 (19) | 17 (23) | 5 (13) | 0.45 |
| 5-9 | 23 (20) | 14 (19) | 9 (23) | |
| 10-19 | 34 (30) | 21 (28) | 13 (33) | |
| ≥20 | 36 (31) | 23 (31) | 13 (33) | |
| Breast specimens as proportion of total clinical case load (% of total clinical work interpreting breast tissue): | ||||
| 0-9 | 59 (51) | 51 (68) | 8 (20) | <0.001 |
| 10-24 | 45 (39) | 23 (31) | 22 (55) | |
| 25-49 | 8 (7) | 1 (1) | 7 (18) | |
| ≥50 | 3 (3) | 0 (0) | 3 (8) | |
| Considered an expert in breast pathology by colleagues: | ||||
| No | 90 (78) | 67 (89) | 23 (58) | <0.001 |
| Yes | 25 (22) | 8 (11) | 17 (43) | |
Some column percentages do not sum to 100% owing to rounding.
*<10 breast biopsy specimens weekly.
†≥10 breast biopsy specimens weekly.
‡Based on Wilcoxon rank sum test for difference in age, experience of breast pathology, and composition of breast specimen of total caseload between low volume and high volume caseload groups. Otherwise P values correspond to a Pearson χ2 test for difference between caseload groups.

Fig 2 Percentage of individual case assessments in which a second opinion was desired or would be required by policy in pathologist’s clinical practice, or both, shown by first `epathologists’ diagnosis of test case (n=115 pathologists, n=6900 individual case assessments). DCIS=ductal carcinoma in situ
Over-interpretation, under-interpretation, and misclassification rates of single interpretation compared with reference consensus standard and nine second opinion strategies
| Strategies | Rate (%) | Overall (95% CI) | P value* | |||
|---|---|---|---|---|---|---|
| Reference consensus diagnosis | ||||||
| Benign | Atypia | DCIS | Invasive | |||
| Single interpretation and second opinion applied to all cases | ||||||
| Single interpretation: | ||||||
| % requiring 2nd opinion | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | |
| Over-interpretation | 12.9 | 17.4 | 2.6 | – | 9.9 (9.0 to 10.8) | |
| Under-interpretation | – | 34.7 | 13.3 | 3.9 | 14.8 (13.8 to 15.9) | |
| Misclassification | 12.9 | 52.2 | 15.9 | 3.9 | 24.7 (23.6 to 25.8) | n/a |
| 1. Second opinion with resolution applied to all cases | ||||||
| % requiring 2nd opinion | 100.0 | 100.0 | 100.0 | 100.0 | 100.0 | |
| Requiring 3rd opinion | 19.7 | 55.9 | 21.6 | 3.7 | 29.6 | |
| Over-interpretation | 8.4 | 11.1 | 0.6 | – | 6.0 (4.7 to 7.5) | |
| Under-interpretation | – | 29.9 | 9.3 | 3.5 | 12.1 (10.0 to 14.3) | |
| Misclassification | 8.4 | 40.9 | 9.9 | 3.5 | 18.1 (16.1 to 20.0) | P<0.001 |
| Criterion for obtaining second opinion based on initial diagnosis | ||||||
| 2. Second opinion only for initial interpretations considered atypia or DCIS or invasive: | ||||||
| % requiring 2nd opinion | 12.9 | 65.3 | 93.7 | 99.6 | 61.5 | |
| % requiring 3rd opinion | 10.4 | 36.5 | 17.8 | 3.3 | 19.8 | |
| Over-interpretation | 6.0 | 10.0 | 0.6 | – | 5.0 (3.9 to 6.3) | |
| Under-interpretation | – | 41.8 | 11.4 | 3.9 | 16.4 (14.4 to 18.4) | |
| Misclassification | 6.0 | 51.9 | 12.1 | 3.9 | 21.4 (19.5 to 23.2) | P<0.001 |
| 3. Second opinion only for initial interpretations considered DCIS or invasive: | ||||||
| % requiring 2nd opinion | 3.2 | 17.4 | 86.7 | 99.6 | 42.1 | |
| % requiring 3rd opinion | 2.9 | 13.1 | 12.2 | 3.3 | 8.8 | |
| Over-interpretation | 11.3 | 7.9 | 0.6 | – | 5.9 (4.9 to 7.1) | |
| Under-interpretation | – | 35.9 | 15.2 | 3.9 | 15.8 (13.8 to 17.6) | |
| Misclassification | 11.3 | 43.7 | 15.8 | 3.9 | 21.7 (19.8 to 23.5) | P<0.001 |
| 4. Second opinion only for initial interpretations considered invasive: | ||||||
| % requiring 2nd opinion | 1.0 | 0.4 | 2.6 | 96.1 | 10.4 | |
| % requiring 3rd opinion | 0.8 | 0.3 | 2.4 | 1.9 | 1.2 | |
| Over-interpretation | 12.5 | 17.5 | 0.4 | – | 9.1 (7.7 to 10.6) | |
| Under-interpretation | – | 34.4 | 13.2 | 4.7 | 14.8 (13.0 to 16.5) | |
| Misclassification | 12.5 | 51.9 | 13.6 | 4.7 | 23.9 (22.1 to 25.7) | P=0.25 |
| Second opinion only obtained for cases considered borderline or difficult | ||||||
| 5. Second opinion obtained only for initial interpretations considered borderline: | ||||||
| % requiring 2nd opinion | 19.0 | 45.3 | 21.4 | 3.5 | 26.1 | |
| % requiring 3rd opinion | 7.5 | 25.6 | 9.3 | 0.8 | 12.8 | |
| Over-interpretation | 10.0 | 14.2 | 1.5 | – | 7.7 (6.3 to 9.2) | |
| Under-interpretation | – | 34.6 | 9.9 | 3.8 | 13.8 (11.8 to 15.7) | |
| Misclassification | 10.0 | 48.9 | 11.5 | 3.8 | 21.5 (19.5 to 23.3) | P<0.001 |
| 6. Second opinion obtained only for initial interpretations considered difficult: | ||||||
| % requiring 2nd opinion | 23.2 | 48.2 | 24.8 | 11.1 | 30.0 | |
| % requiring 3rd opinion | 9.0 | 27.3 | 9.8 | 1.5 | 14.0 | |
| Over-interpretation % | 9.2 | 13.8 | 1.6 | – | 7.4 (6.0 to 8.8) | |
| Under-interpretation % | – | 34.3 | 10.1 | 3.3 | 13.7 (11.8 to 15.5) | |
| Misclassification | 9.2 | 48.1 | 11.7 | 3.3 | 21.1 (19.3 to 22.8) | P<0.001 |
| 7. Second opinion only for cases when desired by pathologist: | ||||||
| % requiring 2nd opinion | 26.7 | 55.9 | 30.5 | 15.5 | 35.5 | |
| % requiring 3rd opinion | 10.1 | 31.3 | 11.2 | 1.5 | 16.0 | |
| Over-interpretation | 9.2 | 14.0 | 1.5 | – | 7.4 (5.8 to 9.2) | |
| Under-interpretation | – | 33.7 | 9.9 | 3.6 | 13.4 (11.4 to 15.6) | |
| Misclassification | 9.2 | 47.7 | 11.4 | 3.6 | 20.9 (18.9 to 22.8) | P<0.001 |
| 8. Second opinion only when required by policy: | ||||||
| % requiring 2nd opinion | 33.8 | 40.6 | 55.3 | 59.9 | 44.9 | |
| % requiring 3rd opinion | 7.6 | 23.0 | 10.0 | 2.1 | 12.4 | |
| Over-interpretation % | 10.8 | 13.2 | 1.6 | – | 7.7 (6.3 to 9.1) | |
| Under-interpretation % | – | 33.9 | 12.1 | 4.0 | 14.2 (12.3 to 16.1) | |
| Misclassification | 10.8 | 47.1 | 13.7 | 4.0 | 21.9 (20.0 to 23.7) | P<0.001 |
| 9. Second opinion only when desired or required by policy: | ||||||
| % requiring 2nd opinion | 54.0 | 80.4 | 75.5 | 69.8 | 70.0 | |
| % requiring 3rd opinion | 14.9 | 45.3 | 18.0 | 3.0 | 23.8 | |
| Over-interpretation % | 8.1 | 11.4 | 0.9 | – | 6.1 (4.8 to 7.5) | |
| Under-interpretation % | – | 32.8 | 9.5 | 3.7 | 13.1 (11.1 to 15.2) | |
| Misclassification | 8.1 | 44.3 | 10.3 | 3.7 | 19.2 (17.3 to 21.0) | P<0.001 |
*Based on Wald test for difference in overall misclassification rates between second opinion strategy and single pathologist interpretation. Test statistic uses bootstrap standard error of difference in rates.

Fig 3 Percentage of cases misclassified based on whether initial pathologist indicated case was borderline, difficult, or would have obtained a second opinion (either desired or because of policy at his or her laboratory). Results are shown for single interpretations and after a second opinion strategy is applied to these cases. (A) Indicated the case was borderline between two diagnoses (26% of 6900 single interpretations) compared with not borderline (74% of 6900 interpretations). (B) Indicated case was difficult (30% of 6900 interpretations) compared with not difficult (70% of 6900 interpretations). (C) Policy or desired second opinion (70% of 6900 interpretations) compared with no policy and no desire for a second opinion (30% of 6900 interpretations)
Over-interpretation, under-interpretation, and misclassification rates associated with three second opinion strategies based on case volume of interpreting pathologist
| Strategy | Single interpretation rate (%) | Overall (95% CI) | Second opinion strategy rate (%) | Overall (95% CI) | P value* | ||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Reference consensus diagnosis | Reference consensus diagnosis | ||||||||||||
| Benign | Atypia | DCIS | Invasive | Benign | Atypia | DCIS | Invasive | ||||||
| Strategy 10: | |||||||||||||
| Over-interpretation | 14.4 | 17.9 | 2.8 | – | 10.5 (9.4 to 11.7) | 8.4 | 11.2 | 0.7 | – | 6.1 (4.7 to 6.7) | |||
| Under-interpretation | – | 36.9 | 14.5 | 4.4 | 15.9 (14.5 to 17.4) | – | 30.0 | 9.4 | 3.6 | 12.2 (10.1 to 14.3) | |||
| Misclassification | 14.4 | 54.8 | 17.2 | 4.4 | 26.4 (25.1 to 27.8) | 8.4 | 41.2 | 10.1 | 3.6 | 18.3 (16.2 to 20.1) | P<0.001 | ||
| Strategy 11: | |||||||||||||
| Over-interpretation | 14.4 | 17.9 | 2.8 | – | 10.5 (9.4 to 11.7) | 7.3 | 11.0 | 0.4 | – | 5.6 (4.1 to 7.6) | |||
| Under-interpretation | – | 36.9 | 14.5 | 4.4 | 15.9 (14.5 to 17.4) | – | 26.8 | 7.9 | 3.1 | 10.7 (8.5 to 13.0) | |||
| Misclassification | 14.4 | 54.8 | 17.2 | 4.4 | 26.4 (25.1 to 27.8) | 7.3 | 37.8 | 8.2 | 3.1 | 16.3 (13.9 to 18.7) | P<0.001 | ||
| Strategy 12: | |||||||||||||
| Over-interpretation | 10.1 | 16.5 | 2.2 | – | 8.7 (7.1 to 10.1) | 6.1 | 10.4 | 0.2 | – | 5.0 (3.0 to 8.2) | |||
| Under-interpretation | – | 30.7 | 11.1 | 3.0 | 12.9 (11.3 to 14.3) | – | 23.7 | 6.4 | 2.5 | 9.3 (6.1 to 12.8) | |||
| Misclassification | 10.1 | 47.2 | 13.3 | 3.0 | 21.5 (19.5 to 23.4) | 6.1 | 34.1 | 6.6 | 2.5 | 14.3 (10.9 to 18.0) | P<0.001 | ||
High volume pathologists defined as those who report interpreting an average of 10 or more breast biopsy specimens per week. A lower volume pathologist reports 9 or fewer breast biopsy specimens per week. Study sample comprised 75 lower volume pathologists and 40 high volume pathologists.
*Based on Wald test for difference in overall misclassification rates between second opinion strategy and single pathologist interpretation. Test statistic uses bootstrap standard error of difference in rates