| Literature DB >> 34070793 |
Abhishek Majumdar1, Yueze Liu2, Yaoqin Lu3, Shaofeng Wu1, Lijun Cheng1.
Abstract
BACKGROUND: Cancer cell lines are frequently used in research as in-vitro tumor models. Genomic data and large-scale drug screening have accelerated the right drug selection for cancer patients. Accuracy in drug response prediction is crucial for success. Due to data-type diversity and big data volume, few methods can integrative and efficiently find the principal low-dimensional manifold of the high-dimensional cancer multi-omics data to predict drug response in precision medicine.Entities:
Keywords: cancer cell-lines; drug response prediction; gene expression
Mesh:
Year: 2021 PMID: 34070793 PMCID: PMC8229729 DOI: 10.3390/genes12060844
Source DB: PubMed Journal: Genes (Basel) ISSN: 2073-4425 Impact factor: 4.096
Figure 1Overview of kESVR model.
Figure 2(a) 2-D plot of reduced dataset (b) 2-D representation of prediction error dataset (c) Clustering of dataset (d) 2-D plot of clustered (e) 2-D plot of data-points in with the newly predicted value (f) 2-D plot of a predicted data point and its neighbors.
Figure 3The steps of kESVR model creation for drug zebularine response AUC prediction on 610 cancer cells from CCLE. (a) Dimension reduction of gene expression profiles and mapping of the latent variable PC1 and drug response of zebularine onto 2D space. (b) Seeking local clusters by k-means algorithm for regression. (c) Construction of local SVR regression models after clustering reduced dataset into 8 clusters (d) Multiple prediction candidates from the trained 8 SVRs on the clusters.
Optimal k-value for kESVR model.
| Drug |
|
|---|---|
| zebularine | 8 |
| azacitidine | 7 |
| myricetin | 8 |
| BRDK64610608 | 8 |
| nelarabine | 12 |
| SB743921 | 1 |
| paclitaxel | 8 |
| daporinad | 8 |
| neopeltolide | 1 |
| docetaxel | 1 |
Model comparison for 5 random drugs.
| Drug | Avg. (Training + Testing) MSE | ||||
|---|---|---|---|---|---|
| LR | BPNN | SVR | QRF | kESVR | |
| zebularine | 36.490 | 1.078 | 1.039 | 1.001 | 0.336 |
| azacitidine | 188.773 | 0.983 | 1.028 | 1.001 | 0.307 |
| myricetin | 117.890 | 0.902 | 0.905 | 0.984 | 0.301 |
| BRDK64610608 | 49.670 | 0.987 | 1.018 | 1.078 | 0.350 |
| nelarabine | 42.137 | 1.010 | 1.093 | 1.090 | 0.450 |
Model setup time for 5 random drugs.
| Drug | Model Setup Time (in sec) | ||||
|---|---|---|---|---|---|
| LR | BPNN | SVR | QRF | kESVR | |
| zebularine | 2.390 | 9417.563 | 30.328 | 3342.786 | 10,934.530 |
| azacitidine | 2.419 | 8830.332 | 29.412 | 3603.740 | 7787.4456 |
| myricetin | 2.487 | 15,179.391 | 30.088 | 3375.857 | 8259.927 |
| BRDK64610608 | 2.493 | 13,580.990 | 29.556 | 3329.557 | 8442.622 |
| nelarabine | 2.335 | 14,683.006 | 27.803 | 3608.870 | 7334.934 |
Model comparison for 5 drugs with maximum variance.
| Drug | Avg. (Training + Testing) MSE | ||||
|---|---|---|---|---|---|
| LR | NN | SVR | QRF | kESVR | |
| SB743921 | 674.143 | 3.496 | 3.095 | 3.238 | 3.095 |
| paclitaxel | 98.3038 | 3.442 | 3.067 | 3.070 | 2.472 |
| daporinad | 137.176 | 3.458 | 3.206 | 3.140 | 2.082 |
| neopeltolide | 118.476 | 3.256 | 3.358 | 3.443 | 3.358 |
| docetaxel | 110.085 | 3.360 | 2.856 | 3.074 | 2.856 |
Model setup time for 5 drugs with maximum variance.
| Drug | Model Setup Time (in sec) | ||||
|---|---|---|---|---|---|
| LR | BPNN | SVR | QRF | kESVR | |
| SB743921 | 2.271 | 10,361.546 | 27.447 | 3563.221 | 8918.989 |
| paclitaxel | 2.088 | 9439.555 | 27.139 | 3497.768 | 8575.110 |
| daporinad | 2.316 | 11,495.089 | 26.572 | 2391.248 | 7637.791 |
| neopeltolide | 1.617 | 2132.497 | 7.937 | 570.632 | 2738.013 |
| docetaxel | 1.650 | 2849.202 | 11.419 | 1139.292 | 4266.580 |
Selection of optimal k value for kESVR model development of drug zebularine.
| DRUG Zebularine | |
|---|---|
|
| Avg. (Training + Testing) MSE |
| 1 | 1.039 |
| 2 | 0.815 |
| 3 | 0.749 |
| 4 | 0.568 |
| 5 | 0.621 |
| 6 | 0.483 |
| 7 | 0.423 |
| 8 | 0.336 |
| 9 | 0.357 |
| 10 | 0.351 |
| 11 | 0.405 |
| 12 | 0.579 |
5-fold cross-validation results for kESVR model (k = 8) of drug zebularine.
| DRUG Zebularine | |||
|---|---|---|---|
| Fold | Training Set MSE | Testing Set MSE | Avg. (Training + Testing) MSE |
| 1 | 0.203 | 0.335 | 0.269 |
| 2 | 0.063 | 0.706 | 0.384 |
| 3 | 0.207 | 0.618 | 0.413 |
| 4 | 0.227 | 0.276 | 0.252 |
| 5 | 0.316 | 0.404 | 0.360 |
Model Fitness Comparison using R-squared values.
| Drug | |||||
|---|---|---|---|---|---|
| LR | BPNN | SVR | QRF | kESVR | |
| zebularine | −0.094 | −0.00047 | 0.284 | 0.693 | 0.778 |
| azacitidine | −0.267 | −4.5 × 10−5 | 0.139 | 0.691 | 0.789 |
| myricetin | −0.25 | −0.000681 | 0.082 | 0.698 | 0.772 |
| BRDK64610608 | 0.072 | −0.000809 | 0.093 | 0.698 | 0.802 |
| nelarabine | −0.152 | −4.2 × 10−5 | 0.119 | 0.627 | 0.700 |
| SB743921 | 0.263 | −0.002453 | 0.66 | 0.795 | 0.66 |
| paclitaxel | 0.093 | −6.0 × 10−6 | 0.415 | 0.764 | 0.853 |
| daporinad | −0.262 | −0.000608 | 0.382 | 0.768 | 0.903 |
| neopeltolide | −11789.886 | −0.00368 | 0.385 | 0.781 | 0.385 |
| docetaxel | −301.018 | −0.001727 | 0.644 | 0.788 | 0.644 |
Figure 4Comparison of kESVR with DualNets, KRR, pairwiseMKL and SRMF models in terms of Root Mean Square Error (RMSE) value over 23 drugs. kESVR is the best performing (lowest RMSE) model in 17 out of 23 drugs. For the remaining 6 drugs, kESVR places in the top 3 position among the 5 models.
Figure 5Selection of optimal value of for drug zebularine. (a) Calinski-Karabasz index sets optimal (b) Average Silhouette value specifies optimal (c) Average (Train+Test) MSE value selects optimal .
Figure 6Variation of performance of kESVR with respect to for drug zebularine. (a) r is varied from 0 to 500. (b) Shows the details from (a) when is varied from 0 to 1.