| Literature DB >> 26107615 |
Bin Li1, Hyunjin Shin1, Georgy Gulbekyan2, Olga Pustovalova2, Yuri Nikolsky2, Andrew Hope2, Marina Bessarabova2, Matthew Schu1, Elona Kolpakova-Hart2, David Merberg1, Andrew Dorner1, William L Trepicchio1.
Abstract
Development of drug responsive biomarkers from pre-clinical data is a critical step in drug discovery, as it enables patient stratification in clinical trial design. Such translational biomarkers can be validated in early clinical trial phases and utilized as a patient inclusion parameter in later stage trials. Here we present a study on building accurate and selective drug sensitivity models for Erlotinib or Sorafenib from pre-clinical in vitro data, followed by validation of individual models on corresponding treatment arms from patient data generated in the BATTLE clinical trial. A Partial Least Squares Regression (PLSR) based modeling framework was designed and implemented, using a special splitting strategy and canonical pathways to capture robust information for model building. Erlotinib and Sorafenib predictive models could be used to identify a sub-group of patients that respond better to the corresponding treatment, and these models are specific to the corresponding drugs. The model derived signature genes reflect each drug's known mechanism of action. Also, the models predict each drug's potential cancer indications consistent with clinical trial results from a selection of globally normalized GEO expression datasets.Entities:
Mesh:
Substances:
Year: 2015 PMID: 26107615 PMCID: PMC4480971 DOI: 10.1371/journal.pone.0130700
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Fig 1PLSR modeling workflow applied on 183 cancer cell lines on OncoPanel.
(A). Flow chart on the model building and testing steps. (B). A specially designed splitting strategy divides the training dataset into random training, random validation and balance validation subsets. (C). Representative example of random validation and balance validation. Red points were top performing models on 1000 random splits on this balanced split, based on both AUC and correlation measures. (D). AUC and correlation cutoff selection for the core PLSR model.
Fig 2Causal network to depict functional relations between sensitivity-specific and resistance-specific signature genes.
The network was reconstructed from canonical signaling pathways regulated by signature genes and a signature specific direct interaction network. Sensitivity-specific signature genes are highlighted with blue thermometers, resistance-specific genes with red thermometers.
Fig 3PLSR models performance in predicting Erlotinib-treated patient survival in the BATTLE trial.
A. Erlotinib model predicting Erlotinib treated patients; B. Sorafenib model predicting Sorafenib treated patients; C. Erlotinib model predicting Sorafenib treated patients; and D. Sorafenib model predicting Erlotinib treated patients. TP: true positive; FP: false positive; TN: true negative; FN: false negative; PPV: positive predictive value.
Fig 4Survival analysis on biomarker identified treatment sensitive/resistant sub-groups.
A. Using the Erlotinib model to stratify Erlotinib treated patients; B. Using Sorafenib model to stratify Sorafenib treated patients; C. Using Erlotinib model to stratify Sorafenib treated patients; and D. Using Sorafenib model to stratify Erlotinib treated patients.
Predicted percentage of Erlotinib and Sorafenib sensitive samples for some cancer indications from Gene Expression Omnibus datasets.
| Cancer Type | Number of samples | Pred. Erlotinib Sen. percentage | Pred. Sorafenib Sen. percentage |
|---|---|---|---|
|
| |||
| Lung Cancer | 329 | 15.81 | 0.61 |
| Liver Cancer | 85 | 0.00 | 31.76 |
| Kidney Cancer | 218 | 0.46 | 24.77 |
|
| |||
| Head and Neck Cancer | 168 | 94.05 | 12.50 |
| Bladder Cancer | 102 | 41.18 | 3.92 |
| Acute lymphoblastic leukemia | 516 | 4.26 | 64.73 |
| Diffuse Large B-Cell Lymphoma | 816 | 2.45 | 30.51 |
| Acute myeloid leukemia | 1118 | 0.45 | 15.74 |
| Multiple myeloma | 596 | 0 | 27.01 |
| Bone Cancer | 88 | 0.00 | 29.55 |
| Breast Cancer | 1668 | 5.64 | 6.47 |
| Colorectal Cancer | 948 | 0.11 | 0.21 |
| Pancreatic Cancer | 75 | 0.00 | 1.33 |
The predictive models were derived from cell line Oncopanel expression data. Patient data normalization is described in the result section.