| Literature DB >> 27905893 |
Frank P Y Lin1,2,3, Adrian Pokorny4, Christina Teng4, Rachel Dear4,5, Richard J Epstein4,6,7.
Abstract
BACKGROUND: Multidisciplinary team (MDT) meetings are used to optimise expert decision-making about treatment options, but such expertise is not digitally transferable between centres. To help standardise medical decision-making, we developed a machine learning model designed to predict MDT decisions about adjuvant breast cancer treatments.Entities:
Keywords: Breast cancer; Clinical decision support system; Cytotoxic drug therapy; Decision analysis; Machine learning
Mesh:
Year: 2016 PMID: 27905893 PMCID: PMC5131452 DOI: 10.1186/s12885-016-2972-z
Source DB: PubMed Journal: BMC Cancer ISSN: 1471-2407 Impact factor: 4.430
Fig. 1The analytic approach for comparing performance between machine learning classifiers and NCCN/ESMO guidelines
Fig. 2Flow diagram of the early breast cancer cases screened and included in the data analysis
Baseline characteristics of early breast cancer cases discussed at the index MDT
| Patient characteristics | Group | N | (%) | |
|---|---|---|---|---|
| Demographics | Age group (years) | <45 | 141 | (13) |
| 45-55 | 242 | (23) | ||
| 56-69 | 393 | (37) | ||
| ≥70 | 266 | (25) | ||
| Surgery type | Primary tumour | WLE or partial mastectomy | 608 | (57) |
| Total mastectomy | 452 | (42) | ||
| Lymph node | Sentinel node biopsy only | 703 | (66) | |
| Axillary lymph node dissection | 255 | (24) | ||
| Nodal status | Sentinel lymph node | Involved | 269 | (25) |
| Not involved | 590 | (55) | ||
| Axillary lymph nodes involved | No | 625 | (59) | |
| 1-3 | 240 | (23) | ||
| ≥4 | 108 | (11) | ||
| Extranodal spread | Present | 140 | (13) | |
| Histopathology | Cell type | Invasive ductal carcinoma | 818 | (77) |
| Invasive lobular carcinoma | 135 | (13) | ||
| Tubular carcinoma | 27 | (3) | ||
| Mucinous carcinoma | 12 | (1) | ||
| Medullary carcinoma | 8 | (0.8) | ||
| Mixed type | 8 | (0.8) | ||
| Basal type | 7 | (0.7) | ||
| Metaplastic carcinoma | 7 | (0.7) | ||
| Other malignant tumour | 42 | (4) | ||
| Multifocal | Multifocal | 83 | (9) | |
| Satellite lesions | 66 | (6) | ||
| Primary tumour size (cm) | ≤0.5 (T1a) | 42 | (4) | |
| 0.6-1.0 (T1b) | 149 | (14) | ||
| 1.1-2.0 (T1c) | 423 | (40) | ||
| 2.1-5.0 (T2) | 347 | (33) | ||
| >5.0 (T3) | 79 | (7) | ||
| Histological grade | Grade 1 | 172 | (12) | |
| Grade 2 | 445 | (42) | ||
| Grade 3 | 418 | (39) | ||
| Lymphovascular invasion | Present | 356 | (33) | |
| Absent | 308 | (29) | ||
| Perineural invasion | Present | 40 | (4) | |
| Absent | 111 | (10) | ||
| Oestrogen receptor (ER) status | Positive | 927 | (87) | |
| Negative | 135 | (13) | ||
| Progesterone receptor (PR) status | Positive | 843 | (79) | |
| Negative | 210 | (20) | ||
| HER2 statusa | Positive | 128 | (12) | |
| Negative | 654 | (61) | ||
| Basal Type | Yes | 31 | (3) | |
| Cytokeratin 5/6 | Positive | 59 | (6) | |
| Ki-67 (%) | <5% | 94 | (9) | |
| 5-9% | 130 | (12) | ||
| 10-29% | 231 | (22) | ||
| ≥30% | 153 | (14) | ||
| Associated lesions | Second primary | Present | 31 | (3) |
| Ductal carcinoma | High Grade | 335 | (32) | |
| Intermediate Grade | 196 | (18) | ||
| Low grade | 43 | (4) | ||
| Extensive | 346 | (33) | ||
| Focal | 318 | (30) | ||
| Lobular carcinoma | Focal | 89 | (8) | |
| Extensive | 60 | (6) | ||
| Other benign lesion(s) | Present | 95 | (9) | |
WLE Wide local excision
NB: aHER2 status as determined by in situ hybridisation
Summary of systemic adjuvant treatment recommendations by modality and expertise
| Adjuvant treatment | Recommendation | ||||||||
|---|---|---|---|---|---|---|---|---|---|
| Modality | Expertise | Recorded | Recommended | Not Recommended | For discussion | ||||
| N | (%) | N | (%) | N | (%) | N | (%) | ||
| Chemotherapy | MDT | 975 | (92) | 342 | (35) | 393 | (40) | 240 | (25) |
| ESMO | 1065 | (100) | 321 | (30) | 701 | (66) | 43 | (4) | |
| NCCN | 1065 | (100) | 489 | (46) | 80 | (8) | 496 | (47) | |
| Endocrine therapy | MDT | 1002 | (94) | 794 | (79) | 86 | (9) | 122 | (2) |
| ESMO | 1065 | (100) | 927 | (87) | 138 | (13) | 0 | (0) | |
| NCCN | 1065 | (100) | 874 | (82) | 143 | (13) | 48 | (5) | |
| Trastuzumab | MDT | 456 | (43) | 86 | (19) | 363 | (80) | 7 | (2) |
| ESMO | 1065 | (100) | 142 | (13) | 923 | (87) | 0 | (0) | |
| NCCN | 1065 | (100) | 125 | (12) | 929 | (87) | 11 | (1) | |
MDT multidisciplinary conference, ESMO European society for medical oncology, NCCN national comprehensive cancer network
Fig. 3Performance of machine learning models for predicting the MDT decisions. Each point indicates the mean AUC (from ten cross-validation runs) of a classifier for correctly predicting the outcome of MDT recommendation. The error bars indicate the 95% confidence intervals estimated by the Hanley-McNeil method. The open square indicate the classifiers without bootstrap-aggregation, whereas the solid squares indicate the corresponding classifiers with bootstrap-aggregation. Legend: R: ripple down rule, J J48 classifier. A: multiclass alternating decision tree, Sp support vector machine (SVM) with polynomial kernel, Sr SVM with radial basis function kernel, D decision Table 1: OneR classifier, B naive Bayesian classifier, N nearest neighbour classifier, L Multivariate logistic regression
Multivariate logistic regression model showing the key clinicopathologic factors contributing to the MDT recommendation of chemotherapy in early breast cancer
| Variable | OR | 95% C.I. |
|---|---|---|
| Age (per year) | 0.86 | (0.83–0.89) |
| Number of involved axillary lymph nodes (per node) | 1.42 | (1.22–1.64) |
| Primary tumour size (per mm) | 1.04 | (1.02–1.06) |
| Histology - Grade 3 | 14.5 | (2.59–81.5) |
| Oestrogen receptor (ER) - positive | 0.12 | (0.038–0.37) |
| HER2 (by | 14.2 | (5.37–37.6) |
| Ki-67 (%, per 1% increase) | 1.02 | (1.0–1.04) |
A separate analysis using maximum-likelihood multivariate logistic regression of seven variable first identified by the cfsSubset feature selection algorithm using best-first search strategy [26]
Abbreviation: OR The odds ratio of adjuvant chemotherapy being recommended by the MDT. Results from this table were presented as a scientific poster at the Annual Scientific Meeting of the Medical Oncology Group Australia, Surfers Paradise, Australia, 2-5 August 2016 [27]
Relative performance of machine learning algorithm across all therapy-recommendation combinations
| Algorithm | Median rank |
|---|---|
| Multiclass ADTree (Bagged) | 2.0 |
| J48 decision tree (Bagged) | 3.0 |
| Ripple down rules (Bagged) | 3.0 |
| Multiclass ADTree | 5.0 |
| SVM, polynomial kernel (Bagged) | 6.0 |
| SVM, radial basis function kernel (Bagged) | 7.0 |
| SVM, polynomial kernel | 7.0 |
| SVM, radial basis function kernel | 9.0 |
| Naive Bayes classifier (Bagged) | 9.0 |
| Logistic regression (Bagged) | 10.0 |
| Naive Bayes classifier | 11.0 |
| J48 decision tree | 11.0 |
| Decision table | 12.0 |
| Logistic regression | 14.0 |
| Nearest neighbour classifier | 14.0 |
| Ripple down rules | 16.0 |
| OneR (Bagged) | 17.0 |
| OneR | 17.0 |
Abbreviations: Bagged bootstrap-aggregated, ADTree alternating decision tree, SVM support vector machine
Pairwise comparison of the recommendations from the index MDT versus ESMO and NCCN guidelines
| Treatment Modality | Agreement between the expertise | |||||
|---|---|---|---|---|---|---|
| Strategy | MDT versus ESMO | MDT versus NCCN | ESMO versus NCCN | |||
| N | (%) | N | (%) | N | (%) | |
| Chemotherapy | ||||||
| Overalla | 551/975 | (57) | 462/975 | (47) | 320/1065 | (30) |
| Aggressive | 628/975 | (64) | 616/975 | (63) | 416/1065 | (39) |
| Conservative | 729/975 | (75) | 731/975 | (75) | 721/1065 | (68) |
| Endocrine therapy | ||||||
| Overalla | 853/1002 | (85) | 840/1002 | (84) | 976/1065 | (92) |
| Aggressive | 970/1002 | (97) | 953/1002 | (95) | 1024/1065 | (96) |
| Conservative | 858/1002 | (86) | 861/1002 | (86) | 976/1065 | (92) |
| Trastuzumab | ||||||
| Overalla | 437/456 | (96) | 437/456 | (96) | 1047/1065 | (98) |
| Aggressive | 444/456 | (97) | 446/456 | (98) | 1058/1065 | (99) |
| Conservative | 437/456 | (98) | 440/456 | (97) | 1047/1065 | (98) |
Abbreviations: MDT multidisciplinary team meeting, ESMO European Society for Medical Oncology guideline, NCCN National Comprehensive Cancer Network guideline
aOverall – three-way grouping of “Recommended”, “For discussion”, “Not recommended”
The sensitivity, specificity, and positive likelihood ratio of predicting the index MDT decisions using the best machine learning model versus using ESMO and NCCN guidelines
| MDT recommendation | Accuracy of prediction by | ||||||||
|---|---|---|---|---|---|---|---|---|---|
| Modality | Recommended | Best ML modela | ESMO Guidelines | NCCN Guidelines | |||||
| Strategy | N (%) | Sens/Spec | LR+ | Sens/Spec | LR+ | Pb | Sens/Spec | LR+ | Pb |
| Chemotherapy | |||||||||
| Aggressive | 582 (60) | 0.93/0.89 | 8.8 |
| 2.5 | <0.01 | 0.97/ | 1.1 | <0.01 |
| Conservative | 342 (35) | 0.86/0.95 | 16.7 |
| 3.3 | <0.01 | 0.82/ | 2.9 | <0.01 |
| Endocrine | |||||||||
| Aggressive | 916 (91) | 0.98/0.85 | 6.5 | 0.98/0.81 | 5.2 | 0.68 | 0.97/0.75 | 3.9 | 0.25 |
| Conservative | 794 (79) | 0.97/0.65 | 2.8 | 0.99/ | 1.5 | 0.30 | 0.96/0.50 | 1.9 | 0.37 |
| Trastuzumab | |||||||||
| Aggressive | 93 (20) | 0.98/0.99 | 77.9 | 0.97/0.97 | 33.3 | 0.60 | 0.97/0.98 | 45.2 | 0.73 |
| Conservative | 86 (19) | 0.95/0.99 | 122.9 | 0.97/0.96 | 24.0 | 0.24 | 0.92/0.98 | 37.7 | 0.28 |
The sensitivity (sens), specificity (spec), and the positive likelihood ratio (LR+) when using the best machine learning models or guideline to predict MDT recommendations
Note: aThe best models were ripple down rules for the chemotherapy decisions, polynomial SVM for the aggressive endocrine decisions, and ADTree for the remaining groups
bpairwise comparisons of likelihood ratios using two-sided z-test (i.ebest model vs. guideline)
cthe best model performed better than the guideline