Literature DB >> 25386043

Interaction Screening for Ultra-High Dimensional Data.

Ning Hao1, Hao Helen Zhang2.   

Abstract

In ultra-high dimensional data analysis, it is extremely challenging to identify important interaction effects, and a top concern in practice is computational feasibility. For a data set with n observations and p predictors, the augmented design matrix including all linear and order-2 terms is of size n × (p2 + 3p)/2. When p is large, say more than tens of hundreds, the number of interactions is enormous and beyond the capacity of standard machines and software tools for storage and analysis. In theory, the interaction selection consistency is hard to achieve in high dimensional settings. Interaction effects have heavier tails and more complex covariance structures than main effects in a random design, making theoretical analysis difficult. In this article, we propose to tackle these issues by forward-selection based procedures called iFOR, which identify interaction effects in a greedy forward fashion while maintaining the natural hierarchical model structure. Two algorithms, iFORT and iFORM, are studied. Computationally, the iFOR procedures are designed to be simple and fast to implement. No complex optimization tools are needed, since only OLS-type calculations are involved; the iFOR algorithms avoid storing and manipulating the whole augmented matrix, so the memory and CPU requirement is minimal; the computational complexity is linear in p for sparse models, hence feasible for p ≫ n. Theoretically, we prove that they possess sure screening property for ultra-high dimensional settings. Numerical examples are used to demonstrate their finite sample performance.

Entities:  

Keywords:  Forward selection; GWAS; Heredity condition; Interaction; Sure Screening

Year:  2014        PMID: 25386043      PMCID: PMC4224119          DOI: 10.1080/01621459.2014.881741

Source DB:  PubMed          Journal:  J Am Stat Assoc        ISSN: 0162-1459            Impact factor:   5.033


  9 in total

1.  Non-Concave Penalized Likelihood with NP-Dimensionality.

Authors:  Jianqing Fan; Jinchi Lv
Journal:  IEEE Trans Inf Theory       Date:  2011-08       Impact factor: 2.501

2.  Genes, environment, health, and disease: facing up to complexity.

Authors:  Teri A Manolio; Francis S Collins
Journal:  Hum Hered       Date:  2007-02-02       Impact factor: 0.444

3.  Increasing the power of identifying gene x gene interactions in genome-wide association studies.

Authors:  Charles Kooperberg; Michael Leblanc
Journal:  Genet Epidemiol       Date:  2008-04       Impact factor: 2.135

4.  Genome-wide association analysis by lasso penalized logistic regression.

Authors:  Tong Tong Wu; Yi Fang Chen; Trevor Hastie; Eric Sobel; Kenneth Lange
Journal:  Bioinformatics       Date:  2009-01-28       Impact factor: 6.937

5.  Discussion of "Sure Independence Screening for Ultra-High Dimensional Feature Space.

Authors:  Hao Helen Zhang
Journal:  J R Stat Soc Series B Stat Methodol       Date:  2008-11       Impact factor: 4.488

6.  Screen and clean: a tool for identifying interactions in genome-wide association studies.

Authors:  Jing Wu; Bernie Devlin; Steven Ringquist; Massimo Trucco; Kathryn Roeder
Journal:  Genet Epidemiol       Date:  2010-04       Impact factor: 2.135

7.  One-step Sparse Estimates in Nonconcave Penalized Likelihood Models.

Authors:  Hui Zou; Runze Li
Journal:  Ann Stat       Date:  2008-08-01       Impact factor: 4.028

Review 8.  Detecting gene-gene interactions that underlie human diseases.

Authors:  Heather J Cordell
Journal:  Nat Rev Genet       Date:  2009-06       Impact factor: 53.242

9.  Two-stage two-locus models in genome-wide association.

Authors:  David M Evans; Jonathan Marchini; Andrew P Morris; Lon R Cardon
Journal:  PLoS Genet       Date:  2006-09-22       Impact factor: 5.917

  9 in total
  12 in total

1.  Sparse and Low-rank Tensor Estimation via Cubic Sketchings.

Authors:  Botao Hao; Anru Zhang; Guang Cheng
Journal:  IEEE Trans Inf Theory       Date:  2020-03-23       Impact factor: 2.501

Review 2.  Gene-Environment Interaction: A Variable Selection Perspective.

Authors:  Fei Zhou; Jie Ren; Xi Lu; Shuangge Ma; Cen Wu
Journal:  Methods Mol Biol       Date:  2021

3.  Convex Modeling of Interactions with Strong Heredity.

Authors:  Asad Haris; Daniela Witten; Noah Simon
Journal:  J Comput Graph Stat       Date:  2015-08-12       Impact factor: 2.302

4.  Selection of nonlinear interactions by a forward stepwise algorithm: Application to identifying environmental chemical mixtures affecting health outcomes.

Authors:  Naveen N Narisetty; Bhramar Mukherjee; Yin-Hsiu Chen; Richard Gonzalez; John D Meeker
Journal:  Stat Med       Date:  2018-12-26       Impact factor: 2.373

5.  Identifying gene-gene interactions using penalized tensor regression.

Authors:  Mengyun Wu; Jian Huang; Shuangge Ma
Journal:  Stat Med       Date:  2017-10-16       Impact factor: 2.373

6.  Gene-gene interaction analysis incorporating network information via a structured Bayesian approach.

Authors:  Xing Qin; Shuangge Ma; Mengyun Wu
Journal:  Stat Med       Date:  2021-09-20       Impact factor: 2.373

7.  A hierarchical integrative group least absolute shrinkage and selection operator for analyzing environmental mixtures.

Authors:  Jonathan Boss; Alexander Rix; Yin-Hsiu Chen; Naveen N Narisetty; Zhenke Wu; Kelly K Ferguson; Thomas F McElrath; John D Meeker; Bhramar Mukherjee
Journal:  Environmetrics       Date:  2021-07-30       Impact factor: 1.527

8.  A selective overview of feature screening for ultrahigh-dimensional data.

Authors:  Liu JingYuan; Zhong Wei; L I RunZe
Journal:  Sci China Math       Date:  2015-08-22       Impact factor: 1.331

9.  An Ultrahigh-Dimensional Mapping Model of High-order Epistatic Networks for Complex Traits.

Authors:  Kirk Gosik; Lidan Sun; Vernon M Chinchilli; Rongling Wu
Journal:  Curr Genomics       Date:  2018-08       Impact factor: 2.236

Review 10.  CHIMGEN: a Chinese imaging genetics cohort to enhance cross-ethnic and cross-geographic brain research.

Authors:  Qiang Xu; Lining Guo; Jingliang Cheng; Meiyun Wang; Zuojun Geng; Wenzhen Zhu; Bing Zhang; Weihua Liao; Shijun Qiu; Hui Zhang; Xiaojun Xu; Yongqiang Yu; Bo Gao; Tong Han; Zhenwei Yao; Guangbin Cui; Feng Liu; Wen Qin; Quan Zhang; Mulin Jun Li; Meng Liang; Feng Chen; Junfang Xian; Jiance Li; Jing Zhang; Xi-Nian Zuo; Dawei Wang; Wen Shen; Yanwei Miao; Fei Yuan; Su Lui; Xiaochu Zhang; Kai Xu; Long Jiang Zhang; Zhaoxiang Ye; Chunshui Yu
Journal:  Mol Psychiatry       Date:  2019-12-11       Impact factor: 15.992

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.