| Literature DB >> 28656625 |
ChangKyoo Yoo1, Mohsen Shahlaei2.
Abstract
Principal component analysis (PCA), as a well-known multivariate data analysis and data reduction technique, is an important and useful algebraic tool in drug design and discovery. PCA, in a typical quantitative structure-activity relationship (QSAR) study, analyzes an original data matrix in which molecules are described by several intercorrelated quantitative dependent variables (molecular descriptors). Although extensively applied, there is disparity in the literature with respect to the applications of PCA in the QSAR studies. This study investigates the different applications of PCA in QSAR studies using a dataset including CCR5 inhibitors. The different types of preprocessing are used to compare the PCA performances. The use of PC plots in the exploratory investigation of matrix of descriptors is described. This work is also proved PCA analysis to be a powerful technique for exploring complex datasets in QSAR studies for identification of outliers. This study shows that PCA is able to easily apply to the pool of calculated structural descriptors and also the extracted information can be used to help decide upon an appropriate harder model for further analysis.Entities:
Keywords: QSAR; data reduction; principal component analysis; rational drug design
Mesh:
Substances:
Year: 2017 PMID: 28656625 DOI: 10.1111/cbdd.13064
Source DB: PubMed Journal: Chem Biol Drug Des ISSN: 1747-0277 Impact factor: 2.817