Literature DB >> 29428276

A random version of principal component analysis in data clustering.

Luigi Leonardo Palese1.   

Abstract

Principal component analysis (PCA) is a widespread technique for data analysis that relies on the covariance/correlation matrix of the analyzed data. However, to properly work with high-dimensional data sets, PCA poses severe mathematical constraints on the minimum number of different replicates, or samples, that must be included in the analysis. Generally, improper sampling is due to a small number of data respect to the number of the degrees of freedom that characterize the ensemble. In the field of life sciences it is often important to have an algorithm that can accept poorly dimensioned data sets, including degenerated ones. Here a new random projection algorithm is proposed, in which a random symmetric matrix surrogates the covariance/correlation matrix of PCA, while maintaining the data clustering capacity. We demonstrate that what is important for clustering efficiency of PCA is not the exact form of the covariance/correlation matrix, but simply its symmetry.
Copyright © 2018 Elsevier Ltd. All rights reserved.

Keywords:  Data clustering; Dimensionality reduction; Principal component analysis; Protein structure; Random projection; Structural bioinformatics

Year:  2018        PMID: 29428276     DOI: 10.1016/j.compbiolchem.2018.01.009

Source DB:  PubMed          Journal:  Comput Biol Chem        ISSN: 1476-9271            Impact factor:   2.877


  7 in total

1.  Evolutionary sequential genetic search technique-based cancer classification using fuzzy rough nearest neighbour classifier.

Authors:  Loganathan Meenachi; Srinivasan Ramakrishnan
Journal:  Healthc Technol Lett       Date:  2018-08-15

2.  A New Look at the Structures of Old Sepsis Actors by Exploratory Data Analysis Tools.

Authors:  Antonio Gnoni; Emanuele De Nitto; Salvatore Scacco; Luigi Santacroce; Luigi Leonardo Palese
Journal:  Antibiotics (Basel)       Date:  2019-11-14

3.  Systematic Search for SARS-CoV-2 Main Protease Inhibitors for Drug Repurposing: Ethacrynic Acid as a Potential Drug.

Authors:  Camilla Isgrò; Anna Maria Sardanelli; Luigi Leonardo Palese
Journal:  Viruses       Date:  2021-01-13       Impact factor: 5.048

4.  Utilization of Time Series Tools in Life-sciences and Neuroscience.

Authors:  Harshit Gujral; Ajay Kumar Kushwaha; Sukant Khurana
Journal:  Neurosci Insights       Date:  2020-12-08

5.  Predicting cell-penetrating peptides using machine learning algorithms and navigating in their chemical space.

Authors:  Ewerton Cristhian Lima de Oliveira; Kauê Santana; Luiz Josino; Anderson Henrique Lima E Lima; Claudomiro de Souza de Sales Júnior
Journal:  Sci Rep       Date:  2021-04-07       Impact factor: 4.379

6.  Human Ovarian Cancer Tissue Exhibits Increase of Mitochondrial Biogenesis and Cristae Remodeling.

Authors:  Anna Signorile; Domenico De Rasmo; Antonella Cormio; Clara Musicco; Roberta Rossi; Francesco Fortarezza; Luigi Leonardo Palese; Vera Loizzi; Leonardo Resta; Giovanni Scillitani; Ettore Cicinelli; Francesca Simonetti; Anna Ferretta; Silvia Russo; Antonio Tufaro; Gennaro Cormio
Journal:  Cancers (Basel)       Date:  2019-09-12       Impact factor: 6.639

7.  SARS-CoV-2 Main Protease Active Site Ligands in the Human Metabolome.

Authors:  Anna Maria Sardanelli; Camilla Isgrò; Luigi Leonardo Palese
Journal:  Molecules       Date:  2021-03-05       Impact factor: 4.411

  7 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.