Literature DB >> 25629319

Sparse overlapping group lasso for integrative multi-omics analysis.

Heewon Park1, Atushi Niida, Satoru Miyano, Seiya Imoto.   

Abstract

Gene networks and graphs are crucial tools for understanding a heterogeneous system of cancer, since cancer is a disease that does not involve individual genes but combinations of genes associated with oncogenic process. A goal of genomic data analysis via gene networks is to identify both gene networks and individual genes within the selected networks. Existing methods, however, perform only network selection, and thus all genes in selected networks are included in models. This leads to overfitting when uncovering driver genes, and the results are not biologically interpretable. To accomplish both "groupwise sparsity" and "within group sparsity" for identifying driver genes based on biological knowledge (i.e., predefined overlapping groups of features), we propose a sparse overlapping group lasso via duplicated predictors in extended space. The proposed method effectively identifies driver genes and their interactions using known biological pathway information. Monte Carlo simulations and The Cancer Genome Atlas (TCGA) project data analysis indicate that the proposed method is effective for fitting a regression model (i.e., feature selection and prediction accuracy) constructed with duplicated predictors in overlapping groups. In the TCGA data analysis, we uncover potential cancer driver genes via expression modules and gene networks constructed by multi-omics data and identify that the uncovered genes have strong evidences as a cancer driver gene. The proposed method is a useful tool for identifying cancer driver genes and for integrative multi-omics analysis.

Entities:  

Keywords:  gene networks; graph; group sparse regularization; multi-omics analysis; uncovering driver genes

Mesh:

Substances:

Year:  2015        PMID: 25629319     DOI: 10.1089/cmb.2014.0197

Source DB:  PubMed          Journal:  J Comput Biol        ISSN: 1066-5277            Impact factor:   1.479


  4 in total

1.  Role of Graph Architecture in Controlling Dynamical Networks with Applications to Neural Systems.

Authors:  Jason Z Kim; Jonathan M Soffer; Ari E Kahn; Jean M Vettel; Fabio Pasqualetti; Danielle S Bassett
Journal:  Nat Phys       Date:  2017-09-25       Impact factor: 20.034

2.  Multiple Group Testing Procedures for Analysis of High-Dimensional Genomic Data.

Authors:  Hyoseok Ko; Kipoong Kim; Hokeun Sun
Journal:  Genomics Inform       Date:  2016-12-30

Review 3.  Computational systems biology approaches for Parkinson's disease.

Authors:  Enrico Glaab
Journal:  Cell Tissue Res       Date:  2017-11-29       Impact factor: 5.249

4.  Proteomics analysis to reveal biological pathways and predictive proteins in the survival of high-grade serous ovarian cancer.

Authors:  Hongyu Xie; Wenjie Wang; Fengyu Sun; Kui Deng; Xin Lu; Huijuan Liu; Weiwei Zhao; Yuanyuan Zhang; Xiaohua Zhou; Kang Li; Yan Hou
Journal:  Sci Rep       Date:  2017-08-29       Impact factor: 4.379

  4 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.