| Literature DB >> 26860193 |
Harinder Singh1, Rahul Kumar2, Sandeep Singh3, Kumardeep Chaudhary4, Ankur Gautam5, Gajendra P S Raghava6.
Abstract
BACKGROUND: In past, numerous quantitative structure-activity relationship (QSAR) based models have been developed for predicting anticancer activity for a specific class of molecules against different cancer drug targets. In contrast, limited attempt have been made to predict the anticancer activity of a diverse class of chemicals against a wide variety of cancer cell lines. In this study, we described a hybrid method developed on thousands of anticancer and non-anticancer molecules tested against National Cancer Institute (NCI) 60 cancer cell lines.Entities:
Mesh:
Substances:
Year: 2016 PMID: 26860193 PMCID: PMC4748564 DOI: 10.1186/s12885-016-2082-y
Source DB: PubMed Journal: BMC Cancer ISSN: 1471-2407 Impact factor: 4.430
Fig. 1Functional groups present in anticancer and non-anticancer molecules along with their mean frequency
Fig. 2Pharmocophore alignment of most active anticancer molecules generated using PharmaGist
Fig. 3Maximum common substructures found in anticancer molecules along with the number of molecules having that particular substructure
The individual performance of best 10 selected fingerprints using MCC based approach
| Best 10 fingerprints | Sensitivity | Specificity | Accuracy | MCC | FPR | AUC |
|---|---|---|---|---|---|---|
| PubchemFP12 | 79.3 | 65.1 | 71.69 | 0.45 | 0.48 | 0.72 |
| ExtFP1013 | 52.5 | 85.7 | 70.19 | 0.41 | 0.65 | 0.69 |
| ExtFP1012 | 78.4 | 61.9 | 69.61 | 0.41 | 0.47 | 0.7 |
| PubchemFP192 | 58.4 | 79.4 | 69.6 | 0.39 | 0.61 | 0.69 |
| GraphFP382 | 73.3 | 63.8 | 68.27 | 0.37 | 0.50 | 0.69 |
| ExtFP1016 | 42 | 88.7 | 66.91 | 0.35 | 0.71 | 0.65 |
| PubchemFP199 | 28.1 | 95.4 | 64.01 | 0.32 | 0.80 | 0.62 |
| ExtFP1015 | 70.7 | 61.5 | 65.77 | 0.32 | 0.50 | 0.66 |
| MACCSFP105 | 70.1 | 60.6 | 64.98 | 0.31 | 0.50 | 0.65 |
| FP799 | 34.7 | 89.6 | 64.01 | 0.29 | 0.75 | 0.62 |
The performance of potency score based method developed using different sets of fingerprints
| Number of fingerprints | Sensitivity | Specificity | Accuracy | MCC | FPR | ROC |
|---|---|---|---|---|---|---|
| 50 | 79.59 | 93.37 | 86.94 | 0.74 | 0.09 | 0.92 |
| 100 | 82.36 | 95.7 | 89.48 | 0.79 | 0.06 | 0.95 |
| 150 | 83.17 | 96.15 | 90.1 | 0.81 | 0.05 | 0.95 |
| 200 | 83.14 | 96.3 | 90.16 | 0.81 | 0.05 | 0.95 |
| 126 | 84.62 | 96.45 | 90.94 | 0.82 | 0.05 | 0.95 |
Comparative performance of models developed using 126 fingerprints at various thresholds has been shown in this table
| SVM | Random Forest | IBK | Naïve Bayes | ||||||
|---|---|---|---|---|---|---|---|---|---|
| Threshold | Accuracy | MCC | Threshold | Accuracy | MCC | Accuracy | MCC | Accuracy | MCC |
| −1 | 72.40 | 0.54 | 0 | 46.63 | 0.00 | 46.63 | 0.00 | 46.63 | 0.00 |
| −0.8 | 80.58 | 0.65 | 0.1 | 67.74 | 0.47 | 80.22 | 0.63 | 75.06 | 0.50 |
| −0.6 | 84.72 | 0.72 | 0.2 | 77.73 | 0.61 | 82.59 | 0.67 | 74.99 | 0.50 |
| −0.4 | 87.52 | 0.76 | 0.3 | 83.71 | 0.69 | 83.98 | 0.68 | 74.98 | 0.50 |
| −0.2 | 89.60 | 0.79 | 0.4 | 86.33 | 0.73 | 85.71 | 0.72 | 74.92 | 0.50 |
| 0 | 90.40 | 0.81 | 0.5 | 87.47 | 0.75 | 85.31 | 0.71 | 74.86 | 0.49 |
| 0.2 | 90.24 | 0.80 | 0.6 | 86.47 | 0.73 | 85.10 | 0.71 | 74.79 | 0.49 |
| 0.4 | 89.00 | 0.79 | 0.7 | 83.81 | 0.69 | 82.73 | 0.67 | 74.79 | 0.49 |
| 0.6 | 86.35 | 0.74 | 0.8 | 79.03 | 0.62 | 81.22 | 0.65 | 74.81 | 0.49 |
| 0.8 | 82.65 | 0.68 | 0.9 | 71.86 | 0.51 | 80.90 | 0.65 | 74.77 | 0.49 |
| 1.0 | 70.49 | 0.48 | 1.0 | 59.30 | 0.28 | 80.87 | 0.65 | 73.37 | 0.48 |
Fig. 4ROC plot of potency score, SVM and hybrid method developed using 126 fingerprints
Comparative performance of CDRUG (existing method) and our models based on potency score, SVM and hybrid approach
| Method | Sensitivity | Specificity | Accuracy | MCC | FPR | AUC |
|---|---|---|---|---|---|---|
| CDRUG | 65 | - | - | - | 0.05 | 0.88 |
| 74 | 0.10 | |||||
| 81 | - | - | - | 0.20 | ||
| Potency Score | 65.8 | 98.9 | 83.5 | 0.7 | 0.02 | 0.95 |
| 74.26 | 98.42 | 87.15 | 0.76 | 0.02 | ||
| 84.62 | 96.45 | 90.94 | 0.82 | 0.05 | ||
| SVM | 65.47 | 98.66 | 83.34 | 0.69 | 0.02 | 0.95 |
| 74.16 | 97.63 | 86.8 | 0.75 | 0.03 | ||
| 89.02 | 91.52 | 90.42 | 0.81 | 0.09 | ||
| Hybrid | 65.57 | 99.63 | 83.75 | 0.71 | 0.01 | 0.98 |
| 74.41 | 99.11 | 87.59 | 0.77 | 0.01 | ||
| 92.38 | 92.55 | 92.47 | 0.85 | 0.08 |
Fig. 5Various modules of CancerIN showing the input format and output display: a The Marvin draw applet for drawing molecules, b The input form for generation of analogs, c The output page of draw molecule module, and d. The result page of scan library showing the list of query molecules and the most similar anticancer molecules