| Literature DB >> 15067522 |
Alireza Givehchi1, Gisbert Schneider.
Abstract
The influence of preprocessing of molecular descriptor vectors for solving classification tasks was analyzed for drug/nondrug classification by artificial neural networks. Molecular properties were used to form descriptor vectors. Two types of neural networks were used, supervised multilayer neural nets trained with the back-propagation algorithm, and unsupervised self-organizing maps (Kohonen maps). Data were preprocessed by logistic scaling and histogram equalization. For both types of neural networks, the preprocessing step significantly improved classification compared to nonstandardized data. Classification accuracy was measured as prediction mean square error and Matthews correlation coefficient in the case of supervised learning, and quantization error in the case of unsupervised learning. The results demonstrate that appropriate data preprocessing is an essential step in solving classification tasks.Mesh:
Substances:
Year: 2004 PMID: 15067522 DOI: 10.1007/s00894-004-0186-9
Source DB: PubMed Journal: J Mol Model ISSN: 0948-5023 Impact factor: 1.810