Literature DB >> 31359454

Robust semiparametric gene-environment interaction analysis using sparse boosting.

Mengyun Wu1,2, Shuangge Ma2.   

Abstract

For the pathogenesis of complex diseases, gene-environment (G-E) interactions have been shown to have important implications. G-E interaction analysis can be challenging with the need to jointly analyze a large number of main effects and interactions and to respect the "main effects, interactions" hierarchical constraint. Extensive methodological developments on G-E interaction analysis have been conducted in recent literature. Despite considerable successes, most of the existing studies are still limited as they cannot accommodate long-tailed distributions/data contamination, make the restricted assumption of linear effects, and cannot effectively accommodate missingness in E variables. To directly tackle these problems, a semiparametric model is assumed to accommodate nonlinear effects, and the Huber loss function and Qn estimator are adopted to accommodate long-tailed distributions/data contamination. A regression-based multiple imputation approach is developed to accommodate missingness in E variables. For model estimation and selection of relevant variables, we adopt an effective sparse boosting approach. The proposed approach is practically well motivated, has intuitive formulations, and can be effectively realized. In extensive simulations, it significantly outperforms multiple direct competitors. The analysis of The Cancer Genome Atlas data on stomach adenocarcinoma and cutaneous melanoma shows that the proposed approach makes sensible discoveries with satisfactory prediction and stability.
© 2019 John Wiley & Sons, Ltd.

Entities:  

Keywords:  gene-environment interaction; missingness; robustness; semiparametric modeling; sparse boosting

Year:  2019        PMID: 31359454      PMCID: PMC6736719          DOI: 10.1002/sim.8322

Source DB:  PubMed          Journal:  Stat Med        ISSN: 0277-6715            Impact factor:   2.373


  38 in total

1.  Matrix correlations for high-dimensional data: the modified RV-coefficient.

Authors:  A K Smilde; H A L Kiers; S Bijlsma; C M Rubingh; M J van Erk
Journal:  Bioinformatics       Date:  2008-12-10       Impact factor: 6.937

2.  Promotion of glioma cell survival by acyl-CoA synthetase 5 under extracellular acidosis conditions.

Authors:  T Mashima; S Sato; Y Sugimoto; T Tsuruo; H Seimiya
Journal:  Oncogene       Date:  2008-09-22       Impact factor: 9.867

Review 3.  Review of inverse probability weighting for dealing with missing data.

Authors:  Shaun R Seaman; Ian R White
Journal:  Stat Methods Med Res       Date:  2011-01-10       Impact factor: 3.021

4.  On the C-statistics for evaluating overall adequacy of risk prediction procedures with censored survival data.

Authors:  Hajime Uno; Tianxi Cai; Michael J Pencina; Ralph B D'Agostino; L J Wei
Journal:  Stat Med       Date:  2011-01-13       Impact factor: 2.373

5.  Body mass index at the time of diagnosis and the risk of advanced stages and poorly differentiated cancers of the breast: findings from a case-series study.

Authors:  A Stark; M S Stahl; H L Kirchner; S Krum; J Prichard; J Evans
Journal:  Int J Obes (Lond)       Date:  2010-03-30       Impact factor: 5.095

Review 6.  Gene--environment-wide association studies: emerging approaches.

Authors:  Duncan Thomas
Journal:  Nat Rev Genet       Date:  2010-04       Impact factor: 53.242

7.  A combined proteomics and metabolomics profiling of gastric cardia cancer reveals characteristic dysregulations in glucose metabolism.

Authors:  Zhen Cai; Jiang-Sha Zhao; Jing-Jing Li; Dan-Ni Peng; Xiao-Yan Wang; Tian-Lu Chen; Yun-Ping Qiu; Ping-Ping Chen; Wen-Jie Li; Li-Yan Xu; En-Ming Li; Jason P M Tam; Robert Z Qi; Wei Jia; Dong Xie
Journal:  Mol Cell Proteomics       Date:  2010-08-10       Impact factor: 5.911

8.  Multiple imputation using chained equations: Issues and guidance for practice.

Authors:  Ian R White; Patrick Royston; Angela M Wood
Journal:  Stat Med       Date:  2010-11-30       Impact factor: 2.373

9.  Variable selection in the accelerated failure time model via the bridge method.

Authors:  Jian Huang; Shuangge Ma
Journal:  Lifetime Data Anal       Date:  2009-12-16       Impact factor: 1.588

Review 10.  Targeting protein kinase C (PKC) in physiology and cancer of the gastric cell system.

Authors:  Michael Fährmann
Journal:  Curr Med Chem       Date:  2008       Impact factor: 4.530

View more
  5 in total

Review 1.  Gene-Environment Interaction: A Variable Selection Perspective.

Authors:  Fei Zhou; Jie Ren; Xi Lu; Shuangge Ma; Cen Wu
Journal:  Methods Mol Biol       Date:  2021

2.  Integrative functional linear model for genome-wide association studies with multiple traits.

Authors:  Yang Li; Fan Wang; Mengyun Wu; Shuangge Ma
Journal:  Biostatistics       Date:  2022-04-13       Impact factor: 5.899

3.  Overlapping group screening for detection of gene-environment interactions with application to TCGA high-dimensional survival genomic data.

Authors:  Jie-Huei Wang; Kang-Hsin Wang; Yi-Hau Chen
Journal:  BMC Bioinformatics       Date:  2022-05-30       Impact factor: 3.307

4.  GEInter: an R package for robust gene-environment interaction analysis.

Authors:  Mengyun Wu; Xing Qin; Shuangge Ma
Journal:  Bioinformatics       Date:  2021-05-07       Impact factor: 6.937

5.  Penalized Variable Selection for Lipid-Environment Interactions in a Longitudinal Lipidomics Study.

Authors:  Fei Zhou; Jie Ren; Gengxin Li; Yu Jiang; Xiaoxi Li; Weiqun Wang; Cen Wu
Journal:  Genes (Basel)       Date:  2019-12-03       Impact factor: 4.096

  5 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.