| Literature DB >> 27752528 |
Yum Eryanti1, Adel Zamri1, Neni Frimayanti1, Unang Supratman2, Tati Herlina2.
Abstract
The dataset of curcumin derivatives consists of 45 compounds (Table 1) with their anti cancer biological activity (IC50) against P388 cell line. 45 curcumin derivatives were used in the model development where 30 of these compounds were in the training set and the remaining 15 compounds were in the test set. The development of the QSAR model involved the use of the multiple linear regression analysis (MLRA) method. Based on the method, r2 value, r2(CV) value of 0.81, 0.67 were obtained. The QSAR model was also employed to predict the biological activity of compounds in the test set. Predictive correlation coefficient r2 values of 0.88 were obtained for the test set.Entities:
Keywords: MLRA; Murine leukemia cell line; QSAR
Year: 2016 PMID: 27752528 PMCID: PMC5061127 DOI: 10.1016/j.dib.2016.09.036
Source DB: PubMed Journal: Data Brief ISSN: 2352-3409
Molecular structures of 45 curcumin derivatives, they were synthesized using base or acid catalyzed aldol condensation reaction of the appropriate substituted benzaldehyde and corresponding NH-4-piperidones, N-methyl-4-piperidones and N-benzyl-4-piperidones. The IC50 were determined using MTT assay.
Data set divided into:
Training set (compound nos: 1–30).
Test set (compound nos: 31–45).
Statistical output of QSAR model.
| Non | 0.81 |
| Cross validation | 0.67 |
| 12.23 | |
| 6.01 × 10−6 | |
| Standard error of estimate | 0.33 |
| Residual sum of square ( | 2.81 |
| Predictive sum of square ( | 3.61 |
Fig. 1Plot of actual value vs. predicted value of training set. This plot was generated using Microsoft office Excel.
Fig. 2Plot of residual value vs. predicted value. This plot was generated using Microsoft office Excel.
Calculated IC50 value of compounds in the test set.
| 6.04 | 7.52 | |
| 6.3 | 8.24 | |
| 6.33 | 8.23 | |
| 6.49 | 9.68 | |
| 9.39 | 10.42 | |
| 11.54 | 14.56 | |
| 18.03 | 18.01 | |
| 18.28 | 20.3 | |
| 27.75 | 21.8 | |
| 28.78 | 30.83 | |
| 58.82 | 54.8 | |
| 67.03 | 65.05 | |
| 92.62 | 97.79 | |
| 100 | 90.67 | |
| 100 | 119 |
| Subject area | Computational chemistry |
| More specific subject area | Quantitative structure activity relationship (QSAR) modeling |
| Type of Data | Tables |
| How data was acquired | Statistical modeling |
| Data format | Analyzed |
| Experimental factors | The dataset was divided into training set and predicted set. Good QSAR model will have |
| Experimental features | Range scaling was done to select a set of descriptors which were included to develop MLRA model. Descriptors were used as independent variable and PIC50 was used as dependent variable. |
| Data source location | Organic laboratory Department of Chemistry, Faculty of Mathematics and Natural Sciences, Universitas Riau, Pekan Baru Indonesia |
| Data accessibility | The data is with this article |