Literature DB >> 35668896

Group Feature Screening via the F Statistic.

Won Chul Song1, Jun Xie2.   

Abstract

Feature screening is crucial in the analysis of ultrahigh dimensional data, where the number of variables (features) is in an exponential order of the number of observations. In various ultrahigh dimensional data, variables are naturally grouped, giving us a good rationale to develop a screening method using joint effect of multiple variables. In this article, we propose a group screening procedure via the F-test statistic. The proposed method is a direct extension of the original sure independence screening procedure, when the group information is known, for example, from prior knowledge. Under certain regularity conditions, we prove that the proposed group screening procedure possesses the sure screening property that selects all effective groups with a probability approaching one at an exponential rate. We use simulations to demonstrate the advantages of the proposed method and show its application in a genome-wide association study. We conclude that the grouping method is very useful in the analysis of ultrahigh dimensional data, as the optimal F-test can detect true signals with desired properties.

Entities:  

Keywords:  Feature screening; Multiple regression; Sure screening property; Ultrahigh dimension

Year:  2019        PMID: 35668896      PMCID: PMC9165430          DOI: 10.1080/03610918.2019.1691223

Source DB:  PubMed          Journal:  Commun Stat Simul Comput        ISSN: 0361-0918            Impact factor:   1.162


  9 in total

1.  Principled sure independence screening for Cox models with ultra-high-dimensional covariates.

Authors:  Sihai Dave Zhao; Yi Li
Journal:  J Multivar Anal       Date:  2012-02-01       Impact factor: 1.473

2.  Nonparametric Independence Screening in Sparse Ultra-High Dimensional Additive Models.

Authors:  Jianqing Fan; Yang Feng; Rui Song
Journal:  J Am Stat Assoc       Date:  2011-06       Impact factor: 5.033

3.  Resampling-based multiple hypothesis testing procedures for genetic case-control association studies.

Authors:  Bingshu E Chen; Lori C Sakoda; Ann W Hsing; Philip S Rosenberg
Journal:  Genet Epidemiol       Date:  2006-09       Impact factor: 2.135

4.  POWERFUL TEST BASED ON CONDITIONAL EFFECTS FOR GENOME-WIDE SCREENING.

Authors:  Yaowu Liu; Jun Xie
Journal:  Ann Appl Stat       Date:  2018-03-09       Impact factor: 2.083

5.  Discussion of "Sure Independence Screening for Ultra-High Dimensional Feature Space.

Authors:  Hao Helen Zhang
Journal:  J R Stat Soc Series B Stat Methodol       Date:  2008-11       Impact factor: 4.488

6.  Rare-variant association testing for sequencing data with the sequence kernel association test.

Authors:  Michael C Wu; Seunggeun Lee; Tianxi Cai; Yun Li; Michael Boehnke; Xihong Lin
Journal:  Am J Hum Genet       Date:  2011-07-07       Impact factor: 11.025

7.  Ultrahigh dimensional feature selection: beyond the linear model.

Authors:  Jianqing Fan; Richard Samworth; Yichao Wu
Journal:  J Mach Learn Res       Date:  2009       Impact factor: 3.654

8.  Feature Screening via Distance Correlation Learning.

Authors:  Runze Li; Wei Zhong; Liping Zhu
Journal:  J Am Stat Assoc       Date:  2012-07-01       Impact factor: 5.033

9.  Genome-Wide Analyses Suggest Mechanisms Involving Early B-Cell Development in Canine IgA Deficiency.

Authors:  Mia Olsson; Katarina Tengvall; Marcel Frankowiack; Marcin Kierczak; Kerstin Bergvall; Erik Axelsson; Linda Tintle; Eliane Marti; Petra Roosje; Tosso Leeb; Åke Hedhammar; Lennart Hammarström; Kerstin Lindblad-Toh
Journal:  PLoS One       Date:  2015-07-30       Impact factor: 3.240

  9 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.