Literature DB >> 23606924

High-Dimensional Structured Feature Screening Using Binary Markov Random Fields.

Jie Liu1, Peggy Peissig, Chunming Zhang, Elizabeth Burnside, Catherine McCarty, David Page.   

Abstract

Feature screening is a useful feature selection approach for high-dimensional data when the goal is to identify all the features relevant to the response variable. However, common feature screening methods do not take into account the correlation structure of the covariate space. We propose the concept of a feature relevance network, a binary Markov random field to represent the relevance of each individual feature by potentials on the nodes, and represent the correlation structure by potentials on the edges. By performing inference on the feature relevance network, we can accordingly select relevant features. Our algorithm does not yield sparsity, which is different from the particular popular family of feature selection approaches based on penalized least squares or penalized pseudo-likelihood. We give one concrete algorithm under this framework and show its superior performance over common feature selection methods in terms of prediction error and recovery of the truly relevant features on real-world data and synthetic data.

Entities:  

Year:  2012        PMID: 23606924      PMCID: PMC3630518     

Source DB:  PubMed          Journal:  JMLR Workshop Conf Proc        ISSN: 1938-7288


  17 in total

1.  Powerful SNP-set analysis for case-control genome-wide association studies.

Authors:  Michael C Wu; Peter Kraft; Michael P Epstein; Deanne M Taylor; Stephen J Chanock; David J Hunter; Xihong Lin
Journal:  Am J Hum Genet       Date:  2010-06-11       Impact factor: 11.025

2.  Generalized genomic distance-based regression methodology for multilocus association analysis.

Authors:  Jennifer Wessel; Nicholas J Schork
Journal:  Am J Hum Genet       Date:  2006-09-21       Impact factor: 11.025

3.  Genome-wide association analysis by lasso penalized logistic regression.

Authors:  Tong Tong Wu; Yi Fang Chen; Trevor Hastie; Eric Sobel; Kenneth Lange
Journal:  Bioinformatics       Date:  2009-01-28       Impact factor: 6.937

4.  Gene ranking and biomarker discovery under correlation.

Authors:  Verena Zuber; Korbinian Strimmer
Journal:  Bioinformatics       Date:  2009-07-30       Impact factor: 6.937

5.  Polygenes, risk prediction, and targeted prevention of breast cancer.

Authors:  Paul D P Pharoah; Antonis C Antoniou; Douglas F Easton; Bruce A J Ponder
Journal:  N Engl J Med       Date:  2008-06-26       Impact factor: 91.245

6.  Ultrahigh dimensional feature selection: beyond the linear model.

Authors:  Jianqing Fan; Richard Samworth; Yichao Wu
Journal:  J Mach Learn Res       Date:  2009       Impact factor: 3.654

7.  Environmental and heritable factors in the causation of cancer--analyses of cohorts of twins from Sweden, Denmark, and Finland.

Authors:  P Lichtenstein; N V Holm; P K Verkasalo; A Iliadou; J Kaprio; M Koskenvuo; E Pukkala; A Skytthe; K Hemminki
Journal:  N Engl J Med       Date:  2000-07-13       Impact factor: 91.245

8.  The eMERGE Network: a consortium of biorepositories linked to electronic medical records data for conducting genomic studies.

Authors:  Catherine A McCarty; Rex L Chisholm; Christopher G Chute; Iftikhar J Kullo; Gail P Jarvik; Eric B Larson; Rongling Li; Daniel R Masys; Marylyn D Ritchie; Dan M Roden; Jeffery P Struewing; Wendy A Wolf
Journal:  BMC Med Genomics       Date:  2011-01-26       Impact factor: 3.063

9.  HIGH DIMENSIONAL VARIABLE SELECTION.

Authors:  Larry Wasserman; Kathryn Roeder
Journal:  Ann Stat       Date:  2009-01-01       Impact factor: 4.028

10.  From disease association to risk assessment: an optimistic view from genome-wide association studies on type 1 diabetes.

Authors:  Zhi Wei; Kai Wang; Hui-Qi Qu; Haitao Zhang; Jonathan Bradfield; Cecilia Kim; Edward Frackleton; Cuiping Hou; Joseph T Glessner; Rosetta Chiavacci; Charles Stanley; Dimitri Monos; Struan F A Grant; Constantin Polychronakos; Hakon Hakonarson
Journal:  PLoS Genet       Date:  2009-10-09       Impact factor: 5.917

View more
  1 in total

1.  Learning Heterogeneous Hidden Markov Random Fields.

Authors:  Jie Liu; Chunming Zhang; Elizabeth Burnside; David Page
Journal:  JMLR Workshop Conf Proc       Date:  2014
  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.