| Literature DB >> 32116674 |
Tine Geldof1,2, Nancy Van Damme3, Isabelle Huys2, Walter Van Dyck1,2.
Abstract
OBJECTIVES: Little research has been done in pharmacoepidemiology on the use of machine learning for exploring medicinal treatment effectiveness in oncology. Therefore, the aim of this study was to explore the added value of machine learning methods to investigate individual treatment responses for glioblastoma patients treated with temozolomide.Entities:
Keywords: decision tree; exploratory study; machine learning; oncology; propensity score modeling; real world evidence
Year: 2020 PMID: 32116674 PMCID: PMC7025482 DOI: 10.3389/fphar.2019.01665
Source DB: PubMed Journal: Front Pharmacol ISSN: 1663-9812 Impact factor: 5.810
Main characteristics of the real-world study population.
| Real-World | ||
|---|---|---|
| Control group (n = 1438) | Treated group (n = 3090) | |
|
| ||
| Range (median) | 0–94 (74) | 5–98 (61) |
| no. (%) < 50 | 98 (42%) | 582 (19%) |
| no. (%) >= 50 | 1,340 (58%) | 2,508 (81%) |
|
| ||
| Male | 814 (57%) | 1,847 (60%) |
| Female | 624 (43%) | 1243 (40%) |
|
| ||
| 0—asymptomatic | 253 (18%) | 415 (13%) |
| 1—symptomatic but completely ambulatory | 850 (59%) | 2265 (73%) |
| 2—symptomatic, up and about >50% walking hours | 197 (14%) | 313 (10%) |
| 3—symptomatic, confined to bed/chair > 50% walking | 84 (6%) | 61 (2%) |
| hours | 54 (4%) | 36 (1%) |
| 4—completely disabled; totally confined to bed/chair | ||
|
| ||
| Class III† | 43 (3%) | 162 (5%) |
| Class IV‡ | 789 (55%) | 2,419 (78%) |
| Class V § | 606 (42%) | 509 (16%) |
|
| ||
| No | 169 (12%) | 23 (1%) |
| Yes | 1,269 (88%) | 3,067 (99%) |
|
| ||
| No | 899 (63%) | 130 (4%) |
| Yes | 539 (37%) | 2,960 (96%) |
|
| ||
| No | 1,342 (93%) | 2,277 (74%) |
| Yes | 96 (7%) | 813 (26%) |
|
| 377.0–256.3 ( | −313.6 to 186.9 ( |
|
| −4.0 to 190.0 ( | −4.3 to 389.7 ( |
Patients were categorized according to recursive partitioning analysis (RPA) classes: †Age < 50 years and World Health Organization (WHO) status 0. ‡ Age < 50 years and WHO status > 0 or age ≥ 50 years and surgical resection. §Age ≥ 50 years and no surgical resection
Figure 1Flowchart of two-step iterative exploratory learning process. The model is iterated until the area under the receiver operating characteristic curve (AUC) is satisfactory, i.e. until the highest achievable AUC in practice is found. Unobserved confounding variables are (unknown) variables currently not captured in real-world situations. (AUC, area under the receiver operating characteristic curve; CT, decision tree).
Figure 2Summary predictive classification tree model after training and validation. Predicted stratification variables for TMZ in glioblastoma include age, RPA class, and chemotherapeutic (Chemo) and radiotherapeutic (RT) patient status. For each stratified patient class a confusion matrix indicates the number (N) and percentage (P) of treated patients from the test set for which the CT predicts treatment response correctly (responders to predicted treatment response and non-responders to predicted non-response) with respect to the labeled SG value. E.g. the CT model predicts the class of patients aged 52 to 61 years and >63 years not receiving concomitant or adjuvant chemotherapy to respond to the treatment with a true positive (TP) probability of 58%. For this class, with patients aged < 63 years 82% are correctly predicted (true negative [TN]) not to respond.
Figure 3Receiver operating characteristic (ROC) curve featuring model performance evaluation as an area under the curve (AUC), sensitivity TPR and FPR or (1-specificity) for (A) the CT prediction model 3 and (B) a logistic regression model of the test data set. The CT model (A) featured an AUC of 0.6650, a sensitivity of 0.6850, and a specificity of 0.5114, The logistic regression model (B) achieved a slightly lower AUC of 0.6357 with a sensitivity of 0.6337 and specificity of 0.5420.