| Literature DB >> 33939284 |
Jagdeep T Podichetty1, Rebecca M Silvola1, Violeta Rodriguez-Romero1, Richard F Bergstrom1, Majid Vakilynejad2, Robert R Bies1,3,4, Robert E Stratford1.
Abstract
Clinical trial efficiency, defined as facilitating patient enrollment, and reducing the time to reach safety and efficacy decision points, is a critical driving factor for making improvements in therapeutic development. The present work evaluated a machine learning (ML) approach to improve phase II or proof-of-concept trials designed to address unmet medical needs in treating schizophrenia. Diagnostic data from the Clinical Antipsychotic Trials of Intervention Effectiveness (CATIE) trial were used to develop a binary classification ML model predicting individual patient response as either "improvement," defined as greater than 20% reduction in total Positive and Negative Syndrome Scale (PANSS) score, or "no improvement," defined as an inadequate treatment response (<20% reduction in total PANSS). A random forest algorithm performed best relative to other tree-based approaches in model ability to classify patients after 6 months of treatment. Although model ability to identify true positives, a measure of model sensitivity, was poor (<0.2), its specificity, true negative rate, was high (0.948). A second model, adapted from the first, was subsequently applied as a proof-of-concept for the ML approach to supplement trial enrollment by identifying patients not expected to improve based on their baseline diagnostic scores. In three virtual trials applying this screening approach, the percentage of patients predicted to improve ranged from 46% to 48%, consistently approximately double the CATIE response rate of 22%. These results show the promising application of ML to improve clinical trial efficiency and, as such, ML models merit further consideration and development.Entities:
Mesh:
Substances:
Year: 2021 PMID: 33939284 PMCID: PMC8504834 DOI: 10.1111/cts.13035
Source DB: PubMed Journal: Clin Transl Sci ISSN: 1752-8054 Impact factor: 4.689
FIGURE 1Five‐step workflow for full machine learning (ML) model analysis to develop a binary classification model to predict treatment response outcome based on clinical measurements taken at baseline, 1 month, and 3 months
FIGURE 2Schematic of workflow to assess patient screening machine learning (ML) model ability to enhance clinical trial population enrollment for the virtual trials
FIGURE 3Patient distribution between “no improvement” and “improvement” categories of the (a) original curated dataset, n = 639; (b) training dataset, n = 447; (c) testing dataset, n = 192; and (d) balanced training dataset, n = 602
Performance of the full ML model during training and testing stages
| RF | ROC | TPR (sensitivity) | TNR (specificity) | CCR | TP | FP | TN | FN |
|---|---|---|---|---|---|---|---|---|
| Training | 0.956 | 0.740 | 0.991 | 0.884 | 191 | 3 | 341 | 67 |
| Testing | 0.700 | 0.194 | 0.936 | 0.800 | 7 | 10 | 146 | 29 |
Abbreviations: CCR, correct classification rate; FN, false negative; FP, false positive; ML, machine learning; RF, random forest; ROC, receiver operator characteristic; TN, true negative; TNR, true negative rate; TP, true positive; TPR, true positive rate.
FIGURE 4Top 10 patient attributes most predictive of treatment response outcome. CALG, Calgary; PANSS, Positive and Negative Syndrome Scale
Performance of patient screening ML model during training and testing stages using only baseline data
| RF | ROC | TPR (sensitivity) | TNR (specificity) | CCR | TP | FP | TN | FN |
|---|---|---|---|---|---|---|---|---|
| Training | 0.956 | 0.714 | 0.985 | 0.869 | 332 | 9 | 611 | 133 |
| Testing | 0.653 | 0.167 | 0.948 | 0.762 | 12 | 12 | 219 | 60 |
Abbreviations: CCR, correct classification rate; FN, false negative; FP, false positive; ML, machine learning; RF, random forest; ROC, receiver operator characteristic; TN, true negative; TNR, true negative rate; TP, true positive; TPR, true positive rate.
Patient screening ML model performance during the virtual clinical trials
| Virtual clinical trials | Number of patients enrolled | Number of patients predicted to improve at 3 months | Percentage of patients predicted to improve at 3 months |
|---|---|---|---|
| 1 | 50 | 24 | 48 |
| 2 | 50 | 23 | 46 |
| 3 | 50 | 24 | 48 |
Abbreviation: ML, machine learning.