| Literature DB >> 28936445 |
Zhongheng Zhang1, Adela Castelló2,3.
Abstract
In multivariate analysis, independent variables are usually correlated to each other which can introduce multicollinearity in the regression models. One approach to solve this problem is to apply principal components analysis (PCA) over these variables. This method uses orthogonal transformation to represent sets of potentially correlated variables with principal components (PC) that are linearly uncorrelated. PCs are ordered so that the first PC has the largest possible variance and only some components are selected to represent the correlated variables. As a result, the dimension of the variable space is reduced. This tutorial illustrates how to perform PCA in R environment, the example is a simulated dataset in which two PCs are responsible for the majority of the variance in the data. Furthermore, the visualization of PCA is highlighted.Keywords: Principal component analysis; R; multicollinearity; regression
Year: 2017 PMID: 28936445 PMCID: PMC5599285 DOI: 10.21037/atm.2017.07.12
Source DB: PubMed Journal: Ann Transl Med ISSN: 2305-5839