Literature DB >> 25124197

Score test variable screening.

Sihai Dave Zhao1, Yi Li.   

Abstract

Variable screening has emerged as a crucial first step in the analysis of high-throughput data, but existing procedures can be computationally cumbersome, difficult to justify theoretically, or inapplicable to certain types of analyses. Motivated by a high-dimensional censored quantile regression problem in multiple myeloma genomics, this article makes three contributions. First, we establish a score test-based screening framework, which is widely applicable, extremely computationally efficient, and relatively simple to justify. Secondly, we propose a resampling-based procedure for selecting the number of variables to retain after screening according to the principle of reproducibility. Finally, we propose a new iterative score test screening method which is closely related to sparse regression. In simulations we apply our methods to four different regression models and show that they can outperform existing procedures. We also apply score test screening to an analysis of gene expression data from multiple myeloma patients using a censored quantile regression model to identify high-risk genes.
© 2014, The International Biometric Society.

Entities:  

Keywords:  Feature selection; High-dimensional data; Projected subgradient method; Score test; Variable screening

Mesh:

Substances:

Year:  2014        PMID: 25124197      PMCID: PMC4427573          DOI: 10.1111/biom.12209

Source DB:  PubMed          Journal:  Biometrics        ISSN: 0006-341X            Impact factor:   2.571


  17 in total

1.  On corrected score approach for proportional hazards model with covariate measurement error.

Authors:  Xiao Song; Yijian Huang
Journal:  Biometrics       Date:  2005-09       Impact factor: 2.571

2.  Higher criticism thresholding: Optimal feature selection when useful features are rare and weak.

Authors:  David Donoho; Jiashun Jin
Journal:  Proc Natl Acad Sci U S A       Date:  2008-09-24       Impact factor: 11.205

3.  The MicroArray Quality Control (MAQC)-II study of common practices for the development and validation of microarray-based predictive models.

Authors:  Leming Shi; Gregory Campbell; Wendell D Jones; Fabien Campagne; Zhining Wen; Stephen J Walker; Zhenqiang Su; Tzu-Ming Chu; Federico M Goodsaid; Lajos Pusztai; John D Shaughnessy; André Oberthuer; Russell S Thomas; Richard S Paules; Mark Fielden; Bart Barlogie; Weijie Chen; Pan Du; Matthias Fischer; Cesare Furlanello; Brandon D Gallas; Xijin Ge; Dalila B Megherbi; W Fraser Symmans; May D Wang; John Zhang; Hans Bitter; Benedikt Brors; Pierre R Bushel; Max Bylesjo; Minjun Chen; Jie Cheng; Jing Cheng; Jeff Chou; Timothy S Davison; Mauro Delorenzi; Youping Deng; Viswanath Devanarayan; David J Dix; Joaquin Dopazo; Kevin C Dorff; Fathi Elloumi; Jianqing Fan; Shicai Fan; Xiaohui Fan; Hong Fang; Nina Gonzaludo; Kenneth R Hess; Huixiao Hong; Jun Huan; Rafael A Irizarry; Richard Judson; Dilafruz Juraeva; Samir Lababidi; Christophe G Lambert; Li Li; Yanen Li; Zhen Li; Simon M Lin; Guozhen Liu; Edward K Lobenhofer; Jun Luo; Wen Luo; Matthew N McCall; Yuri Nikolsky; Gene A Pennello; Roger G Perkins; Reena Philip; Vlad Popovici; Nathan D Price; Feng Qian; Andreas Scherer; Tieliu Shi; Weiwei Shi; Jaeyun Sung; Danielle Thierry-Mieg; Jean Thierry-Mieg; Venkata Thodima; Johan Trygg; Lakshmi Vishnuvajjala; Sue Jane Wang; Jianping Wu; Yichao Wu; Qian Xie; Waleed A Yousef; Liang Zhang; Xuegong Zhang; Sheng Zhong; Yiming Zhou; Sheng Zhu; Dhivya Arasappan; Wenjun Bao; Anne Bergstrom Lucas; Frank Berthold; Richard J Brennan; Andreas Buness; Jennifer G Catalano; Chang Chang; Rong Chen; Yiyu Cheng; Jian Cui; Wendy Czika; Francesca Demichelis; Xutao Deng; Damir Dosymbekov; Roland Eils; Yang Feng; Jennifer Fostel; Stephanie Fulmer-Smentek; James C Fuscoe; Laurent Gatto; Weigong Ge; Darlene R Goldstein; Li Guo; Donald N Halbert; Jing Han; Stephen C Harris; Christos Hatzis; Damir Herman; Jianping Huang; Roderick V Jensen; Rui Jiang; Charles D Johnson; Giuseppe Jurman; Yvonne Kahlert; Sadik A Khuder; Matthias Kohl; Jianying Li; Li Li; Menglong Li; Quan-Zhen Li; Shao Li; Zhiguang Li; Jie Liu; Ying Liu; Zhichao Liu; Lu Meng; Manuel Madera; Francisco Martinez-Murillo; Ignacio Medina; Joseph Meehan; Kelci Miclaus; Richard A Moffitt; David Montaner; Piali Mukherjee; George J Mulligan; Padraic Neville; Tatiana Nikolskaya; Baitang Ning; Grier P Page; Joel Parker; R Mitchell Parry; Xuejun Peng; Ron L Peterson; John H Phan; Brian Quanz; Yi Ren; Samantha Riccadonna; Alan H Roter; Frank W Samuelson; Martin M Schumacher; Joseph D Shambaugh; Qiang Shi; Richard Shippy; Shengzhu Si; Aaron Smalter; Christos Sotiriou; Mat Soukup; Frank Staedtler; Guido Steiner; Todd H Stokes; Qinglan Sun; Pei-Yi Tan; Rong Tang; Zivana Tezak; Brett Thorn; Marina Tsyganova; Yaron Turpaz; Silvia C Vega; Roberto Visintainer; Juergen von Frese; Charles Wang; Eric Wang; Junwei Wang; Wei Wang; Frank Westermann; James C Willey; Matthew Woods; Shujian Wu; Nianqing Xiao; Joshua Xu; Lei Xu; Lun Yang; Xiao Zeng; Jialu Zhang; Li Zhang; Min Zhang; Chen Zhao; Raj K Puri; Uwe Scherf; Weida Tong; Russell D Wolfinger
Journal:  Nat Biotechnol       Date:  2010-07-30       Impact factor: 54.908

4.  On the C-statistics for evaluating overall adequacy of risk prediction procedures with censored survival data.

Authors:  Hajime Uno; Tianxi Cai; Michael J Pencina; Ralph B D'Agostino; L J Wei
Journal:  Stat Med       Date:  2011-01-13       Impact factor: 2.373

5.  Ultrahigh dimensional feature selection: beyond the linear model.

Authors:  Jianqing Fan; Richard Samworth; Yichao Wu
Journal:  J Mach Learn Res       Date:  2009       Impact factor: 3.654

6.  A validated gene expression model of high-risk multiple myeloma is defined by deregulated expression of genes mapping to chromosome 1.

Authors:  John D Shaughnessy; Fenghuang Zhan; Bart E Burington; Yongsheng Huang; Simona Colla; Ichiro Hanamura; James P Stewart; Bob Kordsmeier; Christopher Randolph; David R Williams; Yan Xiao; Hongwei Xu; Joshua Epstein; Elias Anaissie; Somashekar G Krishna; Michele Cottler-Fox; Klaus Hollmig; Abid Mohiuddin; Mauricio Pineda-Roman; Guido Tricot; Frits van Rhee; Jeffrey Sawyer; Yazan Alsayed; Ronald Walker; Maurizio Zangari; John Crowley; Bart Barlogie
Journal:  Blood       Date:  2006-11-14       Impact factor: 22.113

7.  Regularized estimation for the accelerated failure time model.

Authors:  T Cai; J Huang; L Tian
Journal:  Biometrics       Date:  2009-06       Impact factor: 2.571

8.  Prediction of survival in multiple myeloma based on gene expression profiles reveals cell cycle and chromosomal instability signatures in high-risk patients and hyperdiploid signatures in low-risk patients: a study of the Intergroupe Francophone du Myélome.

Authors:  Olivier Decaux; Laurence Lodé; Florence Magrangeas; Catherine Charbonnel; Wilfried Gouraud; Pascal Jézéquel; Michel Attal; Jean-Luc Harousseau; Philippe Moreau; Régis Bataille; Loïc Campion; Hervé Avet-Loiseau; Stéphane Minvielle
Journal:  J Clin Oncol       Date:  2008-06-30       Impact factor: 44.544

9.  Prognostic significance of copy-number alterations in multiple myeloma.

Authors:  Hervé Avet-Loiseau; Cheng Li; Florence Magrangeas; Wilfried Gouraud; Catherine Charbonnel; Jean-Luc Harousseau; Michel Attal; Gerald Marit; Claire Mathiot; Thierry Facon; Philippe Moreau; Kenneth C Anderson; Loïc Campion; Nikhil C Munshi; Stéphane Minvielle
Journal:  J Clin Oncol       Date:  2009-08-17       Impact factor: 44.544

10.  Feature Screening via Distance Correlation Learning.

Authors:  Runze Li; Wei Zhong; Liping Zhu
Journal:  J Am Stat Assoc       Date:  2012-07-01       Impact factor: 5.033

View more
  5 in total

1.  Penalized full likelihood approach to variable selection for Cox's regression model under nested case-control sampling.

Authors:  Jie-Huei Wang; Chun-Hao Pan; I-Shou Chang; Chao Agnes Hsiung
Journal:  Lifetime Data Anal       Date:  2019-05-07       Impact factor: 1.588

2.  An improved variable selection procedure for adaptive Lasso in high-dimensional survival analysis.

Authors:  Kevin He; Yue Wang; Xiang Zhou; Han Xu; Can Huang
Journal:  Lifetime Data Anal       Date:  2018-11-26       Impact factor: 1.588

3.  Feature selection of ultrahigh-dimensional covariates with survival outcomes: a selective review.

Authors:  Hong Hyokyoung Grace; Yi Li
Journal:  Appl Math       Date:  2017-12-29

4.  A selective overview of feature screening methods with applications to neuroimaging data.

Authors:  Kevin He; Han Xu; Jian Kang
Journal:  Wiley Interdiscip Rev Comput Stat       Date:  2018-09-21

5.  Analyzing biomarker discovery: Estimating the reproducibility of biomarker sets.

Authors:  Amir Forouzandeh; Alex Rutar; Sunil V Kalmady; Russell Greiner
Journal:  PLoS One       Date:  2022-07-28       Impact factor: 3.752

  5 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.