Literature DB >> 25479793

A selective review of robust variable selection with applications in bioinformatics.

Cen Wu, Shuangge Ma.   

Abstract

A drastic amount of data have been and are being generated in bioinformatics studies. In the analysis of such data, the standard modeling approaches can be challenged by the heavy-tailed errors and outliers in response variables, the contamination in predictors (which may be caused by, for instance, technical problems in microarray gene expression studies), model mis-specification and others. Robust methods are needed to tackle these challenges. When there are a large number of predictors, variable selection can be as important as estimation. As a generic variable selection and regularization tool, penalization has been extensively adopted. In this article, we provide a selective review of robust penalized variable selection approaches especially designed for high-dimensional data from bioinformatics and biomedical studies. We discuss the robust loss functions, penalty functions and computational algorithms. The theoretical properties and implementation are also briefly examined. Application examples of the robust penalization approaches in representative bioinformatics and biomedical studies are also illustrated.
© The Author 2014. Published by Oxford University Press. For Permissions, please email: journals.permissions@oup.com.

Entities:  

Keywords:  bioinformatics study; penalization; robust methods; variable selection

Mesh:

Year:  2014        PMID: 25479793      PMCID: PMC4570200          DOI: 10.1093/bib/bbu046

Source DB:  PubMed          Journal:  Brief Bioinform        ISSN: 1467-5463            Impact factor:   13.994


  21 in total

1.  Regression approaches for microarray data analysis.

Authors:  Mark R Segal; Kam D Dahlquist; Bruce R Conklin
Journal:  J Comput Biol       Date:  2003       Impact factor: 1.479

2.  A haplotype map of the human genome.

Authors: 
Journal:  Nature       Date:  2005-10-27       Impact factor: 49.962

3.  Spatial smoothing and hot spot detection for CGH data using the fused lasso.

Authors:  Robert Tibshirani; Pei Wang
Journal:  Biostatistics       Date:  2007-05-18       Impact factor: 5.899

4.  Rank-based estimation in the {ell}1-regularized partly linear model for censored outcomes with application to integrated analyses of clinical predictors and gene expression data.

Authors:  Brent A Johnson
Journal:  Biostatistics       Date:  2009-06-24       Impact factor: 5.899

5.  On the robustness of the adaptive lasso to model misspecification.

Authors:  W Lu; Y Goldberg; J P Fine
Journal:  Biometrika       Date:  2012-07-11       Impact factor: 2.445

6.  Regulation of gene expression in the mammalian eye and its relevance to eye disease.

Authors:  Todd E Scheetz; Kwang-Youn A Kim; Ruth E Swiderski; Alisdair R Philp; Terry A Braun; Kevin L Knudtson; Anne M Dorrance; Gerald F DiBona; Jian Huang; Thomas L Casavant; Val C Sheffield; Edwin M Stone
Journal:  Proc Natl Acad Sci U S A       Date:  2006-09-18       Impact factor: 11.205

7.  Conditional expression of a Gi-coupled receptor causes ventricular conduction delay and a lethal cardiomyopathy.

Authors:  C H Redfern; M Y Degtyarev; A T Kwa; N Salomonis; N Cotte; T Nanevicz; N Fidelman; K Desai; K Vranizan; E K Lee; P Coward; N Shah; J A Warrington; G I Fishman; D Bernstein; A J Baker; B R Conklin
Journal:  Proc Natl Acad Sci U S A       Date:  2000-04-25       Impact factor: 11.205

8.  Weighted Wilcoxon-type smoothly clipped absolute deviation method.

Authors:  Lan Wang; Runze Li
Journal:  Biometrics       Date:  2008-07-18       Impact factor: 2.571

9.  A robust penalized method for the analysis of noisy DNA copy number data.

Authors:  Xiaoli Gao; Jian Huang
Journal:  BMC Genomics       Date:  2010-09-25       Impact factor: 3.969

10.  A penalized robust method for identifying gene-environment interactions.

Authors:  Xingjie Shi; Jin Liu; Jian Huang; Yong Zhou; Yang Xie; Shuangge Ma
Journal:  Genet Epidemiol       Date:  2014-02-24       Impact factor: 2.344

View more
  33 in total

1.  Multikernel linear mixed model with adaptive lasso for complex phenotype prediction.

Authors:  Yalu Wen; Qing Lu
Journal:  Stat Med       Date:  2020-01-27       Impact factor: 2.373

2.  Robust network-based regularization and variable selection for high-dimensional genomic data in cancer prognosis.

Authors:  Jie Ren; Yinhao Du; Shaoyu Li; Shuangge Ma; Yu Jiang; Cen Wu
Journal:  Genet Epidemiol       Date:  2019-02-11       Impact factor: 2.135

Review 3.  Gene-Environment Interaction: A Variable Selection Perspective.

Authors:  Fei Zhou; Jie Ren; Xi Lu; Shuangge Ma; Cen Wu
Journal:  Methods Mol Biol       Date:  2021

4.  Analysis of cancer gene expression data with an assisted robust marker identification approach.

Authors:  Hao Chai; Xingjie Shi; Qingzhao Zhang; Qing Zhao; Yuan Huang; Shuangge Ma
Journal:  Genet Epidemiol       Date:  2017-09-14       Impact factor: 2.135

5.  Multi-kernel linear mixed model with adaptive lasso for prediction analysis on high-dimensional multi-omics data.

Authors:  Jun Li; Qing Lu; Yalu Wen
Journal:  Bioinformatics       Date:  2020-03-01       Impact factor: 6.937

6.  Robust semiparametric gene-environment interaction analysis using sparse boosting.

Authors:  Mengyun Wu; Shuangge Ma
Journal:  Stat Med       Date:  2019-07-29       Impact factor: 2.373

7.  Dissecting gene-environment interactions: A penalized robust approach accounting for hierarchical structures.

Authors:  Cen Wu; Yu Jiang; Jie Ren; Yuehua Cui; Shuangge Ma
Journal:  Stat Med       Date:  2017-10-16       Impact factor: 2.373

8.  Nonlinear variable selection with continuous outcome: a fully nonparametric incremental forward stagewise approach.

Authors:  Tianwei Yu
Journal:  Stat Anal Data Min       Date:  2018-06-19       Impact factor: 1.051

Review 9.  Sensitivity and specificity of information criteria.

Authors:  John J Dziak; Donna L Coffman; Stephanie T Lanza; Runze Li; Lars S Jermiin
Journal:  Brief Bioinform       Date:  2020-03-23       Impact factor: 11.622

10.  A U-statistics for integrative analysis of multilayer omics data.

Authors:  Xiaqiong Wang; Yalu Wen
Journal:  Bioinformatics       Date:  2020-04-15       Impact factor: 6.937

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.