| Literature DB >> 36010942 |
Paul Prasse1, Pascal Iversen1, Matthias Lienhard2, Kristina Thedinga2, Ralf Herwig2, Tobias Scheffer1.
Abstract
Large-scale databases that report the inhibitory capacities of many combinations of candidate drug compounds and cultivated cancer cell lines have driven the development of preclinical drug-sensitivity models based on machine learning. However, cultivated cell lines have devolved from human cancer cells over years or even decades under selective pressure in culture conditions. Moreover, models that have been trained on in vitro data cannot account for interactions with other types of cells. Drug-response data that are based on patient-derived cell cultures, xenografts, and organoids, on the other hand, are not available in the quantities that are needed to train high-capacity machine-learning models. We found that pre-training deep neural network models of drug sensitivity on in vitro drug-sensitivity databases before fine-tuning the model parameters on patient-derived data improves the models' accuracy and improves the biological plausibility of the features, compared to training only on patient-derived data. From our experiments, we can conclude that pre-trained models outperform models that have been trained on the target domains in the vast majority of cases.Entities:
Keywords: anti-cancer drugs; deep neural networks; drug-sensitivity prediction
Year: 2022 PMID: 36010942 PMCID: PMC9406038 DOI: 10.3390/cancers14163950
Source DB: PubMed Journal: Cancers (Basel) ISSN: 2072-6694 Impact factor: 6.575
Statistics of used data sets.
| Data Set | Type | # Cell Lines/ | # Drugs | # Samples | Drug-Overlap |
|---|---|---|---|---|---|
| Patients | with GDSC | ||||
| GDSC [ | Cultivated cell lines | 958 | 282 | 250,625 | – |
| CCLE [ | Cultivated cell lines | 472 | 24 | 10,924 | 7 |
| Beat AML [ | Patient-derived cell culture | 213 | 109 | 18,062 | 31 |
| Xenografts [ | Patient-derived xenograft | 19 | 3 | 120 | 2 |
| PDO [ | Patient-derived organoid | 44 | 25 | 1093 | 13 |
Figure 1Overall model architecture (A) and models. Model architectures for the PaccMann model (B), the tDNN model (C), and the Conv NN (D).
Average Pearson correlations for different models and the different data sets. For results marked “*”—the correlation of the pre-trained model is significantly higher () than that of the same model trained from scratch. Results highlighted in bold mark the best performing model regarding the model trained from scratch and its pre-trained version.
| Data | # Train | Conv NN | Conv NN | PaccMann | PaccMann | tDNN | tDNN | |
|---|---|---|---|---|---|---|---|---|
| Scratch | Pre-Trained | Scratch | Pre-Trained | Scratch | Pre-Trained | |||
| CCLE data | precision oncology | 0 | – | 0.615 ± 0.009 | – | 0.575 ± 0.006 | – | 0.614 ± 0.006 |
| 10 | 0.053 ± 0.023 | 0.039 ± 0.05 | 0.004 ± 0.023 | |||||
| 50 | 0.235 ± 0.033 | −0.018 ± 0.054 | 0.451 ± 0.024 | |||||
| 100 | 0.384 ± 0.026 | 0.022 ± 0.049 | 0.586 ± 0.018 | |||||
| 500 | 0.717 ± 0.007 | 0.379 ± 0.042 |
| 0.74 ± 0.009 | ||||
| 1000 | 0.686 ± 0.022 | 0.353 ± 0.049 |
| 0.759 ± 0.005 | ||||
| 5000 | 0.775 ± 0.006 |
|
| 0.729 ± 0.006 |
| 0.78 ± 0.006 | ||
| all | 0.777 ± 0.008 |
|
| 0.745 ± 0.006 |
| 0.781 ± 0.006 | ||
| drug development | 0 | – | 0.155 ± 0.027 | – | 0.072 ± 0.018 | – | 0.209 ± 0.036 | |
| 10 | −0.005 ± 0.021 |
| −0.018 ± 0.011 | 0.051 ± 0.025 |
| |||
| 50 | −0.019 ± 0.02 |
| 0.008 ± 0.018 |
| 0.058 ± 0.02 |
| ||
| 100 | 0.013 ± 0.028 |
| 0.01 ± 0.011 |
| 0.066 ± 0.014 |
| ||
| 500 | 0.043 ± 0.019 | −0.002 ± 0.015 | 0.126 ± 0.013 |
| ||||
| 1000 | 0.063 ± 0.029 | 0.029 ± 0.009 | 0.15 ± 0.015 |
| ||||
| 5000 | 0.282 ± 0.021 |
|
| 0.181 ± 0.016 |
| 0.284 ± 0.024 | ||
| all | 0.29 ± 0.019 |
|
| 0.22 ± 0.011 | 0.328 ± 0.021 |
| ||
| Beat AML data (PDCs) | precision oncology | 0 | – | 0.229 ± 0.011 | – | 0.258 ± 0.012 | – | 0.252 ± 0.01 |
| 10 | −0.014 ± 0.014 | −0.007 ± 0.026 | −0.08 ± 0.02 | |||||
| 50 | 0.015 ± 0.018 | 0.003 ± 0.026 | −0.035 ± 0.018 | |||||
| 100 | 0.049 ± 0.02 | 0.03 ± 0.03 | 0.052 ± 0.036 | |||||
| 500 | 0.26 ± 0.024 | 0.023 ± 0.023 | 0.332 ± 0.07 | |||||
| 1000 | 0.135 ± 0.032 | 0.094 ± 0.018 |
| 0.595 ± 0.013 | ||||
| 5000 |
| 0.674 ± 0.011 | 0.638 ± 0.026 |
|
| 0.67 ± 0.012 | ||
| 10,000 |
| 0.7 ± 0.01 |
| 0.652 ± 0.009 |
| 0.693 ± 0.01 | ||
| all | 0.693 ± 0.01 |
|
| 0.676 ± 0.009 |
| 0.696 ± 0.009 | ||
| drug development | 0 | – | 0.074 ± 0.012 | – | 0.008 ± 0.015 | – | 0.051 ± 0.013 | |
| 10 | −0.005 ± 0.016 |
| −0.004 ± 0.007 |
| 0.02 ± 0.018 |
| ||
| 50 | 0.007 ± 0.014 |
| 0.019 ± 0.005 |
| 0.044 ± 0.017 |
| ||
| 100 | −0.002 ± 0.017 | 0.005 ± 0.009 |
| 0.089 ± 0.015 |
| |||
| 500 | 0.041 ± 0.01 | 0.003 ± 0.007 | 0.182 ± 0.021 |
| ||||
| 1000 | 0.087 ± 0.021 | 0.006 ± 0.01 |
| 0.292 ± 0.016 | ||||
| 5000 | 0.261 ± 0.027 |
| 0.273 ± 0.02 |
| 0.316 ± 0.01 | |||
| 10,000 | 0.376 ± 0.015 |
|
| 0.335 ± 0.015 |
| 0.367 ± 0.017 | ||
| all |
| 0.383 ± 0.012 |
| 0.348 ± 0.018 |
| 0.354 ± 0.013 | ||
| xenograft data (PDXs) | pre. onc. | 0 | – | 0.715 ± 0.128 | – | 0.646 ± 0.171 | – | 0.7 ± 0.14 |
| 10 | 0.243 ± 0.149 |
| 0.196 ± 0.176 |
|
| 0.246 ± 0.149 | ||
| 50 | 0.19 ± 0.18 |
| −0.131 ± 0.144 |
| 0.424 ± 0.156 | |||
| all | 0.502 ± 0.128 |
| 0.334 ± 0.133 |
|
| 0.533 ± 0.124 | ||
| drug dev. | 0 | – | 0.129 ± 0.057 | – | −0.005 ± 0.049 | – | −0.134 ± 0.08 | |
| 10 |
| −0.254 ± 0.093 | −0.133 ± 0.109 |
| −0.389 ± 0.028 |
| ||
| 50 |
| −0.418 ± 0.021 | −0.171 ± 0.168 |
|
| −0.451 ± 0.075 | ||
| all |
| −0.42 ± 0.135 |
| 0.056 ± 0.098 | −0.28 ± 0.112 |
| ||
| Organoid data (PDOs) | pre. onc. | 0 | – | 0.577 ± 0.021 | – | 0.488 ± 0.027 | – | 0.602 ± 0.028 |
| 10 | 0.023 ± 0.036 | −0.009 ± 0.021 | −0.004 ± 0.02 | |||||
| 50 | 0.201 ± 0.036 | 0.066 ± 0.025 | 0.435 ± 0.038 | |||||
| 100 | 0.404 ± 0.047 | 0.077 ± 0.029 | 0.468 ± 0.079 | |||||
| 500 | 0.874 ± 0.015 |
| 0.153 ± 0.028 |
| 0.899 ± 0.008 | |||
| all |
| 0.904 ± 0.006 | 0.503 ± 0.076 |
| 0.897 ± 0.007 | |||
| drug dev. | 0 | – | 0.001 ± 0.037 | – | −0.0 ± 0.041 | – | −0.086 ± 0.033 | |
| 10 |
| −0.06 ± 0.047 | −0.033 ± 0.023 |
| −0.053 ± 0.045 |
| ||
| 50 | 0.002 ± 0.035 |
|
| 0.039 ± 0.026 | 0.108 ± 0.047 |
| ||
| 100 | −0.035 ± 0.057 | −0.028 ± 0.027 |
| 0.181 ± 0.034 |
| |||
| 500 | 0.051 ± 0.046 | 0.003 ± 0.025 | 0.312 ± 0.063 |
| ||||
| all | 0.282 ± 0.052 | 0.186 ± 0.044 |
| 0.422 ± 0.029 |
|
Figure 2Feature importance analysis. (A) Venn diagram of top 200 features for the model trained from scratch (blue) and the pre-trained (red) PaccMann model on the PDO data set. Feature importance values were averaged over all drug–PDO pairs. (B) Box-plots of pathway enrichment scores of the top-ranked 200 genes derived from feature importance values of the pre-trained model and the model trained from scratch for all drugs and specifically for the drug olaparib (x-axis). The y-axis denotes the of the adjusted enrichment p-values. Boxes denote the 10–90% ranges of pathway enrichment scores. Pathway gene sets were taken from the ConsensusPathDB resource and were judged as significantly enriched if Fisher’s exact test resulted in a p-value of below 0.001, and if the pathway shared at least 10 genes with the 200 top-ranked genes. Sizes of samples were: all-drugs-scratch: 62 pathways; all-drugs-pre-trained: 120; olaparib-scratch: 84; olaparib-pre-trained: 126. On top of each model pair, the result of an unpaired, two-sided Mann–Whitney test is displayed (****: ). (C) Venn diagram of top 200 features from the scratch (blue) and pre-trained (red) PaccMann models and the PDO data set. Feature importances were averaged over the olaparib–PDO pairs. (D) Bar plot of selected enrichment pathway scores ( of adjusted enrichment p-value, y-axis) for the 200 top-ranked genes from the scratch (light yellow) and pre-trained (strong yellow) PaccMann models using the feature importance values for all olaparib–PDO pairs; x-axis: pathways related to DNA damage repair and replication.
Figure 3Effect of pre-training for PDO–drug AUC predictions. (A) Absolute differences in predicted and ground truth AUCs (y-axis) for standard therapies with pre-trained and models trained from scratch (x-axis). Boxes show the 10–90% ranges of absolute errors for the different PDOs; the line within each box depicts the median value. Bright colors show the pre-trained models; lighter colors show the trained-from-scratch models. On top of each model pair, the result of an unpaired, two-sided Mann–Whitney test is displayed (****: ; **: ; *: ; ns: not significant). (B) Absolute differences in predicted and ground truth AUCs (y-axis) for the targeted drug olaparib when PDOs are grouped according to basal and classical subtypes. On top of each model pair, the result of an unpaired, two-sided Mann–Whitney test is displayed (****: ; **: ; *: ; ns: not significant). (C) Z-scores (y-axis) of the 5 best drug predictions for selected PDOs (x-axis). Different drugs are indicated by colored bars.