Literature DB >> 10850786

Molecular descriptors for effective classification of biologically active compounds based on principal component analysis identified by a genetic algorithm.

L Xue1, J Bajorath.   

Abstract

We have evaluated combinations of 111 descriptors that were calculated from two-dimensional representations of molecules to classify 455 compounds belonging to seven biological activity classes using a method based on principal component analysis. The analysis was facilitated by application of a genetic algorithm. Using scoring functions that related the number of compounds in pure classes (i.e., compounds with the same biological activity), singletons, and mixed classes, effective descriptor sets were identified. A combination of only four molecular descriptors accounting for aromatic character, hydrogen bond acceptors, estimated polar van der Waals surface area, and a single structural key gave overall best results. At this performance level, approximately 91% of the compounds occurred in pure classes and mixed classes were absent. The results indicate that combinations of only a few critical descriptors are preferred to partition compounds according to their biological activity, at least in the test cases studied here.

Entities:  

Mesh:

Substances:

Year:  2000        PMID: 10850786     DOI: 10.1021/ci000322m

Source DB:  PubMed          Journal:  J Chem Inf Comput Sci        ISSN: 0095-2338


  4 in total

Review 1.  Genetic algorithm optimization in drug design QSAR: Bayesian-regularized genetic neural networks (BRGNN) and genetic algorithm-optimized support vectors machines (GA-SVM).

Authors:  Michael Fernandez; Julio Caballero; Leyden Fernandez; Akinori Sarai
Journal:  Mol Divers       Date:  2010-03-20       Impact factor: 2.943

2.  Drug-drug relationship based on target information: application to drug target identification.

Authors:  Keunwan Park; Dongsup Kim
Journal:  BMC Syst Biol       Date:  2011-12-14

3.  Design of chemical libraries with potentially bioactive molecules applying a maximum common substructure concept.

Authors:  Michael Lisurek; Bernd Rupp; Jörg Wichard; Martin Neuenschwander; Jens Peter von Kries; Ronald Frank; Jörg Rademann; Ronald Kühne
Journal:  Mol Divers       Date:  2009-08-15       Impact factor: 2.943

4.  The Application of CA and PCA to the Evaluation of Lipophilicity and Physicochemical Properties of Tetracyclic Diazaphenothiazine Derivatives.

Authors:  Anna Nycz-Empel; Katarzyna Bober; Mirosław Wyszomirski; Ewa Kisiel; Andrzej Zięba
Journal:  J Anal Methods Chem       Date:  2019-10-20       Impact factor: 2.193

  4 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.