| Literature DB >> 23351435 |
Loghman Firoozpour1, Khadijeh Sadatnezhad, Sholeh Dehghani, Eslam Pourbasheer, Alireza Foroumadi, Abbas Shafiee, Massoud Amanlou.
Abstract
BACKGROUND AND PURPOSE OF THE STUDY: Multimodal distribution of descriptors makes it more difficult to fit a single global model to model the entire data set in quantitative structure activity relationship (QSAR) studies.Entities:
Year: 2012 PMID: 23351435 PMCID: PMC3556068 DOI: 10.1186/2008-2231-20-31
Source DB: PubMed Journal: Daru ISSN: 1560-8115 Impact factor: 3.117
Selected variables after 4 feature selection steps
| 1 | N-072 | Atom centered fragments | 1D |
| 2 | C-012 | Atom centered fragments | 1D |
| 3 | O-060 | Atom centered fragments | 1D |
| 4 | nCO | functional groups | 1D |
| 5 | nCrR2 | functional groups | 1D |
| 6 | MLOGP | properties | 1D |
| 7 | MATS4v | 2D autocorrelations | 2D |
| 8 | GATS8p | 2D autocorrelations | 2D |
| 9 | GATS6e | 2D autocorrelations | 2D |
| 10 | ATS5e | 2D autocorrelations | 2D |
| 11 | MATS7e | 2D autocorrelations | 2D |
| 12 | SIC0 | Topological descriptors | 2D |
| 13 | piPC10 | Topological descriptors | 2D |
| 14 | PCD | Topological descriptors | 2D |
| 15 | PJI2 | Topological descriptors | 2D |
| 16 | Mor31m | 3D-MoRSE descriptors | 3D |
| 17 | Mor17e | 3D-MoRSE descriptors | 3D |
| 18 | Mor12p | 3D-MoRSE descriptors | 3D |
| 19 | HOMA | aromaticity indices | 3D |
| 20 | HATS1u | Getway descriptors | 3D |
| 21 | R5e+ | Getway descriptors | 3D |
| 22 | R4e | Getway descriptors | 3D |
| 23 | H8e | Getway descriptors | 3D |
| 24 | RDF070m | RDF descriptors | 3D |
1D, 2D, 3D and others (charge descriptors and molecular properties) correspond to the type of molecular descriptor selected [10].
Selected variables after principal component analysis (Factor analysis)
| 1 | C-012 | Atom centered fragments | 1D |
| 2 | MLOGP | properties | 1D |
| 3 | GATS6e | 2D autocorrelations | 2D |
| 4 | PJI2 | Topological descriptors | 2D |
| 5 | Mor12p | 3D-MoRSE descriptors | 3D |
| 6 | R4e | Getway descriptors | 3D |
| 7 | RDF070m | RDF descriptors | 3D |
Mean square errors of linear and non-linear models
| Subset-1 | 0.6163 | 0.16 | 0.3202 |
| Subset-2 | 0.246 | 0.196 | 0.2509 |
| Subset-3 | 0.2678 | 0.207 | 0.2678 |
| Subset-4 | 0.3259 | 0.192 | 0.2428 |
| Subset-5 | 0.3857 | 0.186 | 0.2464 |
| Subset-6 | 0.1724 | 0.187 | 0.1918 |
| Subset-7 | 0.25 | 0.221 | 0.2979 |
| Subset-8 | 0.2135 | 0.192 | 0.2216 |
| Subset-9 | 0.304 | 0.18 | 0.2085 |
| Subset-10 | 0.326 | 0.185 | 0.3063 |
| Mean | 0.3108 | 0.1907 | 0.2554 |
| Standard deviation | 0.1237 | 0.0163 | 0.0427 |
Figure 1Plot of the mean square errors of prediction versus the 10 subsets by XCSF, ANN and MLR methods.
-values for applied models
| XCSF & ANN | 0.002 | 0.05 | Δ |
| XCSF & MLR | 0.02 | 0.05 | Δ |
| ANN & MLR | 0.124 | 0.05 | - |