Warning: Undefined array key "mm" in /www/wwwroot/www.ai-bt.com/si.php on line 10 Deprecated: trim(): Passing null to parameter #1 ($string) of type string is deprecated in /www/wwwroot/www.ai-bt.com/si.php on line 10 Classification for high-throughput data with an optimal subset of principal components.

Literature DB >> 19748831

Classification for high-throughput data with an optimal subset of principal components.

Abstract

High-throughput data have been widely used in biological and medical studies to discover gene and protein functions. Due to the high dimensionality, principal component analysis (PCA) is often involved for data dimension reduction. However, when a few principal components (PCs) are selected for dimension reduction or considered for dimension determination, they are typically ranked by their variances, eigenvalues. However, this approach is not always effective in subsequent multivariate analysis, particularly classification. To maximize information from data with a subset of the components, we apply a different ranking criterion, canonical variate criterion, which considers within- and between-group variance rather than total variance in the classical criterion. Four prevalent classification methods are considered and compared using leave-one-out cross-validation. These methods are illustrated with three real high-throughput data sets, two microarray data sets and a nuclear magnetic resonance spectra data set.

Entities: Disease

Mesh：

Year: 2009 PMID： 19748831 DOI： 10.1016/j.compbiolchem.2009.07.017

Source DB: PubMed Journal: Comput Biol Chem ISSN： 1476-9271 Impact factor: 2.877

Keyword Cloud
Cited

1 in total

1. A New Optical Sensor Based on Laser Speckle and Chemometrics for Precision Agriculture: Application to Sunflower Plant-Breeding.

Authors: Maxime Ryckewaert; Daphné Héran; Emma Faur; Pierre George; Bruno Grèzes-Besset; Frédéric Chazallet; Yannick Abautret; Myriam Zerrad; Claude Amra; Ryad Bendoula
Journal: Sensors (Basel) Date: 2020-08-18 Impact factor: 3.576

1 in total