Warning: Undefined array key "mm" in /www/wwwroot/www.ai-bt.com/si.php on line 10 Deprecated: trim(): Passing null to parameter #1 ($string) of type string is deprecated in /www/wwwroot/www.ai-bt.com/si.php on line 10 Be careful with your principal components.

Literature DB >> 31433858

Be careful with your principal components.

Abstract

Principal components analysis (PCA) is a common method to summarize a larger set of correlated variables into a smaller and more easily interpretable axes of variation. However, the different components need to be distinct from each other to be interpretable otherwise they only represent random directions. This is a fundamental assumption of PCA and, thus, needs to be tested every time. Sample correlation matrices will always result in a pattern of decreasing eigenvalues even if there is no structure. Tests are, therefore, needed to discern real patterns from illusionary ones. Furthermore, the loadings of the vectors need to be larger than expected by random data to be useful in the calculation of PC-scores. PC-scores calculated from nondistinct PC's have very large standard errors and cannot be used for biological interpretations. I give a number of examples to illustrate the potential problems with PCA. Robustness of the PC's increases with increasing sample size but not with the number of traits. I review a few simple test statistics appropriate for testing PC's and use a real-world example to illustrate how this can be done using randomization tests. PCA can be very useful but great care is needed to avoid spurious results.

Keywords: Correlations; principal components analysis; randomization; standard error

Mesh：

Year: 2019 PMID： 31433858 DOI： 10.1111/evo.13835

Source DB: PubMed Journal: Evolution ISSN： 0014-3820 Impact factor: 3.694

Keyword Cloud
Cited

10 in total

1. How general is cognitive ability in non-human animals? A meta-analytical and multi-level reanalysis approach.

Authors: Marc-Antoine Poirier; Dovid Y Kozlovsky; Julie Morand-Ferron; Vincent Careau
Journal: Proc Biol Sci Date: 2020-12-09 Impact factor: 5.349

2. Principal Component Analyses (PCA)-based findings in population genetic studies are highly biased and must be reevaluated.

Authors: Eran Elhaik
Journal: Sci Rep Date: 2022-08-29 Impact factor: 4.996

3. Recent introgression between Taiga Bean Goose and Tundra Bean Goose results in a largely homogeneous landscape of genetic differentiation.

Authors: Jente Ottenburghs; Johanna Honka; Gerard J D M Müskens; Hans Ellegren
Journal: Heredity (Edinb) Date: 2020-05-26 Impact factor: 3.821

4. Identification of a Quinone Derivative as a YAP/TEAD Activity Modulator from a Repurposing Library.

Authors: Angela Lauriola; Elisa Uliassi; Matteo Santucci; Maria Laura Bolognesi; Marco Mor; Laura Scalvini; Gian Marco Elisi; Gaia Gozzi; Lorenzo Tagliazucchi; Gaetano Marverti; Stefania Ferrari; Lorena Losi; Domenico D'Arca; Maria Paola Costi
Journal: Pharmaceutics Date: 2022-02-10 Impact factor: 6.321

5. Determining the attributes that influence students' online learning satisfaction during COVID-19 pandemic.

Authors: Elizabeth Agyeiwaah; Frank Badu Baiden; Emmanuel Gamor; Fu-Chieh Hsu
Journal: J Hosp Leis Sport Tour Educ Date: 2021-11-24

6. Advanced Operationalization Framework for Climate-Resilient Urban Public Health Care Services: Composite Indicators-Based Scenario Assessment of Khon Kaen City, Thailand.

Authors: Wiriya Puntub; Stefan Greiving
Journal: Int J Environ Res Public Health Date: 2022-01-24 Impact factor: 3.390

Be careful with your principal components.

1. How general is cognitive ability in non-human animals? A meta-analytical and multi-level reanalysis approach.

2. Principal Component Analyses (PCA)-based findings in population genetic studies are highly biased and must be reevaluated.

3. Recent introgression between Taiga Bean Goose and Tundra Bean Goose results in a largely homogeneous landscape of genetic differentiation.

4. Identification of a Quinone Derivative as a YAP/TEAD Activity Modulator from a Repurposing Library.

5. Determining the attributes that influence students' online learning satisfaction during COVID-19 pandemic.

6. Advanced Operationalization Framework for Climate-Resilient Urban Public Health Care Services: Composite Indicators-Based Scenario Assessment of Khon Kaen City, Thailand.

7. PCAtest: testing the statistical significance of Principal Component Analysis in R.

8. Abiotic and biotic correlates of the occurrence, extent and cover of invasive aquatic Elodea nuttallii.

9. Pool choice in a vertical landscape: Tadpole-rearing site flexibility in phytotelm-breeding frogs.

10. Exceptional evolutionary lability of flower-like inflorescences (pseudanthia) in Apiaceae subfamily Apioideae.