Local-learning-based feature selection for high-dimensional data analysis.

Yijun Sun, Sinisa Todorovic, Steve Goodison.

Abstract

This paper considers feature selection for data classification in the presence of a huge number of irrelevant features. We propose a new feature-selection algorithm that addresses several major issues with prior work, including problems with algorithm implementation, computational complexity, and solution accuracy. The key idea is to decompose an arbitrarily complex nonlinear problem into a set of locally linear ones through local learning, and then learn feature relevance globally within the large margin framework. The proposed algorithm is based on well-established machine learning and numerical analysis techniques, without making any assumptions about the underlying data distribution. It is capable of processing many thousands of features within minutes on a personal computer while maintaining a very high accuracy that is nearly insensitive to a growing number of irrelevant features. Theoretical analysis suggests that the algorithm's sample complexity is logarithmic in the number of features. Experiments on 11 synthetic and real-world data sets demonstrate the viability of our formulation of the feature-selection problem for supervised learning and the effectiveness of our algorithm.
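
The key idea stated in the abstract (local learning to linearise the problem, then a global large-margin fit of feature relevance) can be illustrated with a rough sketch. This is not the authors' implementation. It assumes a weighted L1 distance, an exponential kernel of width sigma to softly pick each sample's nearest same-class ("hit") and other-class ("miss") neighbours, and a logistic loss with an L1 penalty of strength lam on non-negative weights. The function names, parameters, step size, and iteration counts are all hypothetical choices made for illustration.

```python
import numpy as np


def _masked_row_softmax(scores, mask):
    """Row-wise softmax restricted to entries where mask is True.

    Assumes every row of mask has at least one True entry.
    """
    a = np.where(mask, scores, -np.inf)
    a = a - a.max(axis=1, keepdims=True)  # stabilise the exponentials
    e = np.exp(a)
    return e / e.sum(axis=1, keepdims=True)


def local_margin_feature_weights(X, y, sigma=1.0, lam=1.0,
                                 n_outer=5, n_inner=100, lr=0.01):
    """Illustrative local-margin feature weighting (hypothetical sketch).

    Assumes at least two classes and at least two samples per class.
    Builds an (n, n, d) array, so it is only suitable for small data.
    """
    n, d = X.shape
    w = np.ones(d)
    diff = np.abs(X[:, None, :] - X[None, :, :])   # per-feature |x_i - x_j|
    same = y[:, None] == y[None, :]
    hit_mask = same & ~np.eye(n, dtype=bool)       # same class, not itself
    miss_mask = ~same                              # different class

    for _ in range(n_outer):
        # Local step: soft nearest hit / miss under the current weights.
        scores = -(diff @ w) / sigma
        p_hit = _masked_row_softmax(scores, hit_mask)
        p_miss = _masked_row_softmax(scores, miss_mask)
        # z[i] is the expected per-feature margin vector of sample i.
        z = np.einsum("ij,ijd->id", p_miss - p_hit, diff)

        # Global step: minimise sum_i log(1 + exp(-w.z_i)) + lam * sum(w)
        # over w >= 0, using the substitution w = v**2 and gradient descent.
        v = np.sqrt(w)
        for _ in range(n_inner):
            margins = z @ (v ** 2)
            s = np.exp(-np.logaddexp(0.0, margins))  # sigmoid(-margin)
            grad_w = lam - s @ z
            v -= lr * 2.0 * v * grad_w               # chain rule through v**2
        w = v ** 2
    return w


# Toy data: feature 0 separates the classes, the other 11 are pure noise.
rng = np.random.default_rng(0)
y_demo = np.repeat([0, 1], 30)
X_demo = rng.normal(size=(60, 12))
X_demo[:, 0] += np.where(y_demo == 1, 3.0, -3.0)
weights = local_margin_feature_weights(X_demo, y_demo)
print(np.round(weights, 3))
```

On toy data like this, the informative feature should end up with the largest weight while the noise features are driven toward zero by the L1 penalty. The all-pairs difference array is used here only for brevity and would need to be replaced by something more memory-efficient for the thousands of features mentioned in the abstract.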

Year: 2010 PMID: 20634556 PMCID: PMC3445441 DOI: 10.1109/TPAMI.2009.190

Source DB: PubMed Journal: IEEE Trans Pattern Anal Mach Intell ISSN: 0098-5589 Impact factor: 6.226

9 in total

1. Nonlinear dimensionality reduction by locally linear embedding.

Authors: S T Roweis; L K Saul
Journal: Science Date: 2000-12-22 Impact factor: 47.728

2. Diffuse large B-cell lymphoma outcome prediction by gene-expression profiling and supervised machine learning.

Authors: Margaret A Shipp; Ken N Ross; Pablo Tamayo; Andrew P Weng; Jeffery L Kutok; Ricardo C T Aguiar; Michelle Gaasenbeek; Michael Angelo; Michael Reich; Geraldine S Pinkus; Tane S Ray; Margaret A Koval; Kim W Last; Andrew Norton; T Andrew Lister; Jill Mesirov; Donna S Neuberg; Eric S Lander; Jon C Aster; Todd R Golub
Journal: Nat Med Date: 2002-01 Impact factor: 53.440

3. The generalized LASSO.

Authors: Volker Roth
Journal: IEEE Trans Neural Netw Date: 2004-01

4. Integration of gene expression profiling and clinical variables to predict prostate carcinoma recurrence after radical prostatectomy.

Authors: Andrew J Stephenson; Alex Smith; Michael W Kattan; Jaya Satagopan; Victor E Reuter; Peter T Scardino; William L Gerald
Journal: Cancer Date: 2005-07-15 Impact factor: 6.860

5. Optimally sparse representation in general (nonorthogonal) dictionaries via l1 minimization.

Authors: David L Donoho; Michael Elad
Journal: Proc Natl Acad Sci U S A Date: 2003-02-21 Impact factor: 11.205

6. Improved breast cancer prognosis through the combination of clinical and genetic markers.

Authors: Yijun Sun; Steve Goodison; Jian Li; Li Liu; William Farmerie
Journal: Bioinformatics Date: 2006-11-26 Impact factor: 6.937

Review 7. Approaches to dimensionality reduction in proteomic biomarker studies.

Authors: Melanie Hilario; Alexandros Kalousis
Journal: Brief Bioinform Date: 2008-02-29 Impact factor: 11.622

8. Gene-expression profiles to predict distant metastasis of lymph-node-negative primary breast cancer.

Authors: Yixin Wang; Jan G M Klijn; Yi Zhang; Anieta M Sieuwerts; Maxime P Look; Fei Yang; Dmitri Talantov; Mieke Timmermans; Marion E Meijer-van Gelder; Jack Yu; Tim Jatkoe; Els M J J Berns; David Atkins; John A Foekens
Journal: Lancet Date: 2005 Feb 19-25 Impact factor: 79.321

9. Gene expression profiling predicts clinical outcome of breast cancer.

Authors: Laura J van 't Veer; Hongyue Dai; Marc J van de Vijver; Yudong D He; Augustinus A M Hart; Mao Mao; Hans L Peterse; Karin van der Kooy; Matthew J Marton; Anke T Witteveen; George J Schreiber; Ron M Kerkhoven; Chris Roberts; Peter S Linsley; René Bernards; Stephen H Friend
Journal: Nature Date: 2002-01-31 Impact factor: 49.962

27 in total

1. Disease prediction in the at-risk mental state for psychosis using neuroanatomical biomarkers: results from the FePsy study.

Authors: Nikolaos Koutsouleris; Stefan Borgwardt; Eva M Meisenzahl; Ronald Bottlender; Hans-Jürgen Möller; Anita Riecher-Rössler
Journal: Schizophr Bull Date: 2011-11-10 Impact factor: 9.306

2. Cancer progression modeling using static sample data.

3. Distinguishing prodromal from first-episode psychosis using neuroanatomical single-subject pattern recognition.

4. MORPHOLOGICAL SIGNATURES AND GENOMIC CORRELATES IN GLIOBLASTOMA.

5. Computational approach for deriving cancer progression roadmaps from static sample data.

Review 6. Molecular diagnostic trends in urological cancer: biomarkers for non-invasive diagnosis.

7. Genomic prediction based on data from three layer lines using non-linear regression models.

Review 8. Derivation of cancer diagnostic and prognostic signatures from gene expression data.

Review 9. Relief-based feature selection: Introduction and review.

10. A candidate molecular biomarker panel for the detection of bladder cancer.