Literature DB >> 20634556

Local-learning-based feature selection for high-dimensional data analysis.

Yijun Sun1, Sinisa Todorovic, Steve Goodison.   

Abstract

This paper considers feature selection for data classification in the presence of a huge number of irrelevant features. We propose a new feature-selection algorithm that addresses several major issues with prior work, including problems with algorithm implementation, computational complexity, and solution accuracy. The key idea is to decompose an arbitrarily complex nonlinear problem into a set of locally linear ones through local learning, and then learn feature relevance globally within the large margin framework. The proposed algorithm is based on well-established machine learning and numerical analysis techniques, without making any assumptions about the underlying data distribution. It is capable of processing many thousands of features within minutes on a personal computer while maintaining a very high accuracy that is nearly insensitive to a growing number of irrelevant features. Theoretical analyses of the algorithm's sample complexity suggest that the algorithm has a logarithmical sample complexity with respect to the number of features. Experiments on 11 synthetic and real-world data sets demonstrate the viability of our formulation of the feature-selection problem for supervised learning and the effectiveness of our algorithm.

Entities:  

Mesh:

Year:  2010        PMID: 20634556      PMCID: PMC3445441          DOI: 10.1109/TPAMI.2009.190

Source DB:  PubMed          Journal:  IEEE Trans Pattern Anal Mach Intell        ISSN: 0098-5589            Impact factor:   6.226


  9 in total

1.  Nonlinear dimensionality reduction by locally linear embedding.

Authors:  S T Roweis; L K Saul
Journal:  Science       Date:  2000-12-22       Impact factor: 47.728

2.  Diffuse large B-cell lymphoma outcome prediction by gene-expression profiling and supervised machine learning.

Authors:  Margaret A Shipp; Ken N Ross; Pablo Tamayo; Andrew P Weng; Jeffery L Kutok; Ricardo C T Aguiar; Michelle Gaasenbeek; Michael Angelo; Michael Reich; Geraldine S Pinkus; Tane S Ray; Margaret A Koval; Kim W Last; Andrew Norton; T Andrew Lister; Jill Mesirov; Donna S Neuberg; Eric S Lander; Jon C Aster; Todd R Golub
Journal:  Nat Med       Date:  2002-01       Impact factor: 53.440

3.  The generalized LASSO.

Authors:  Volker Roth
Journal:  IEEE Trans Neural Netw       Date:  2004-01

4.  Integration of gene expression profiling and clinical variables to predict prostate carcinoma recurrence after radical prostatectomy.

Authors:  Andrew J Stephenson; Alex Smith; Michael W Kattan; Jaya Satagopan; Victor E Reuter; Peter T Scardino; William L Gerald
Journal:  Cancer       Date:  2005-07-15       Impact factor: 6.860

5.  Optimally sparse representation in general (nonorthogonal) dictionaries via l minimization.

Authors:  David L Donoho; Michael Elad
Journal:  Proc Natl Acad Sci U S A       Date:  2003-02-21       Impact factor: 11.205

6.  Improved breast cancer prognosis through the combination of clinical and genetic markers.

Authors:  Yijun Sun; Steve Goodison; Jian Li; Li Liu; William Farmerie
Journal:  Bioinformatics       Date:  2006-11-26       Impact factor: 6.937

Review 7.  Approaches to dimensionality reduction in proteomic biomarker studies.

Authors:  Melanie Hilario; Alexandros Kalousis
Journal:  Brief Bioinform       Date:  2008-02-29       Impact factor: 11.622

8.  Gene-expression profiles to predict distant metastasis of lymph-node-negative primary breast cancer.

Authors:  Yixin Wang; Jan G M Klijn; Yi Zhang; Anieta M Sieuwerts; Maxime P Look; Fei Yang; Dmitri Talantov; Mieke Timmermans; Marion E Meijer-van Gelder; Jack Yu; Tim Jatkoe; Els M J J Berns; David Atkins; John A Foekens
Journal:  Lancet       Date:  2005 Feb 19-25       Impact factor: 79.321

9.  Gene expression profiling predicts clinical outcome of breast cancer.

Authors:  Laura J van 't Veer; Hongyue Dai; Marc J van de Vijver; Yudong D He; Augustinus A M Hart; Mao Mao; Hans L Peterse; Karin van der Kooy; Matthew J Marton; Anke T Witteveen; George J Schreiber; Ron M Kerkhoven; Chris Roberts; Peter S Linsley; René Bernards; Stephen H Friend
Journal:  Nature       Date:  2002-01-31       Impact factor: 49.962

  9 in total
  27 in total

1.  Disease prediction in the at-risk mental state for psychosis using neuroanatomical biomarkers: results from the FePsy study.

Authors:  Nikolaos Koutsouleris; Stefan Borgwardt; Eva M Meisenzahl; Ronald Bottlender; Hans-Jürgen Möller; Anita Riecher-Rössler
Journal:  Schizophr Bull       Date:  2011-11-10       Impact factor: 9.306

2.  Cancer progression modeling using static sample data.

Authors:  Yijun Sun; Jin Yao; Norma J Nowak; Steve Goodison
Journal:  Genome Biol       Date:  2014-08-26       Impact factor: 13.583

3.  Distinguishing prodromal from first-episode psychosis using neuroanatomical single-subject pattern recognition.

Authors:  Stefan Borgwardt; Nikolaos Koutsouleris; Jacqueline Aston; Erich Studerus; Renata Smieskova; Anita Riecher-Rössler; Eva M Meisenzahl
Journal:  Schizophr Bull       Date:  2012-09-11       Impact factor: 9.306

4.  MORPHOLOGICAL SIGNATURES AND GENOMIC CORRELATES IN GLIOBLASTOMA.

Authors:  Lee A D Cooper; Jun Kong; Fusheng Wang; Tahsin Kurc; Carlos S Moreno; Daniel J Brat; Joel H Saltz
Journal:  Proc IEEE Int Symp Biomed Imaging       Date:  2011-03-30

5.  Computational approach for deriving cancer progression roadmaps from static sample data.

Authors:  Yijun Sun; Jin Yao; Le Yang; Runpu Chen; Norma J Nowak; Steve Goodison
Journal:  Nucleic Acids Res       Date:  2017-05-19       Impact factor: 16.971

Review 6.  Molecular diagnostic trends in urological cancer: biomarkers for non-invasive diagnosis.

Authors:  V Urquidi; C J Rosser; S Goodison
Journal:  Curr Med Chem       Date:  2012       Impact factor: 4.530

7.  Genomic prediction based on data from three layer lines using non-linear regression models.

Authors:  Heyun Huang; Jack J Windig; Addie Vereijken; Mario P L Calus
Journal:  Genet Sel Evol       Date:  2014-11-06       Impact factor: 4.297

Review 8.  Derivation of cancer diagnostic and prognostic signatures from gene expression data.

Authors:  Steve Goodison; Yijun Sun; Virginia Urquidi
Journal:  Bioanalysis       Date:  2010-05       Impact factor: 2.681

Review 9.  Relief-based feature selection: Introduction and review.

Authors:  Ryan J Urbanowicz; Melissa Meeker; William La Cava; Randal S Olson; Jason H Moore
Journal:  J Biomed Inform       Date:  2018-07-18       Impact factor: 6.317

10.  A candidate molecular biomarker panel for the detection of bladder cancer.

Authors:  Virginia Urquidi; Steve Goodison; Yunpeng Cai; Yijun Sun; Charles J Rosser
Journal:  Cancer Epidemiol Biomarkers Prev       Date:  2012-10-24       Impact factor: 4.254

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.