
Gene set selection via LASSO penalized regression (SLPR).

H Robert Frost, Christopher I Amos.

Abstract

Gene set testing is an important bioinformatics technique that addresses the challenges of power, interpretation and replication. To better support the analysis of large and highly overlapping gene set collections, researchers have recently developed a number of multiset methods that jointly evaluate all gene sets in a collection to identify a parsimonious group of functionally independent sets. Unfortunately, current multiset methods all use binary indicators for gene and gene set activity and assume that a gene is active if any containing gene set is active. This simplistic model limits performance on many types of genomic data. To address this limitation, we developed gene set Selection via LASSO Penalized Regression (SLPR), a novel mapping of multiset gene set testing to penalized multiple linear regression. The SLPR method assumes a linear relationship between continuous measures of gene activity and the activity of all gene sets in the collection. As we demonstrate via simulation studies and the analysis of TCGA data using MSigDB gene sets, the SLPR method outperforms existing multiset methods when the true biological process is well approximated by continuous activity measures and a linear association between genes and gene sets.
© The Author(s) 2017. Published by Oxford University Press on behalf of Nucleic Acids Research.
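
The regression mapping described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the abstract does not specify the gene-level statistic, the penalty selection, or the solver, so everything below is an assumption. Here a continuous per-gene activity value `y` is regressed on a binary gene-by-gene-set membership matrix `X`, and a plain cyclic coordinate descent solves the LASSO objective. The toy gene sets `A` to `D`, the penalty `lam = 0.05`, and all function names are hypothetical.

```python
import random

def soft_threshold(z, gamma):
    """Soft-thresholding operator used in LASSO coordinate descent."""
    if z > gamma:
        return z - gamma
    if z < -gamma:
        return z + gamma
    return 0.0

def lasso_coordinate_descent(X, y, lam, n_iter=200):
    """Minimise (1/(2n)) * ||y - X b||^2 + lam * ||b||_1 by cyclic coordinate descent.

    X is a list of n rows (genes), each holding p values (gene sets).
    No intercept is fitted (assumption: the toy data has none).
    """
    n, p = len(X), len(X[0])
    beta = [0.0] * p
    residual = list(y)  # equals y - X @ beta while beta is all zeros
    col_sq = [sum(X[i][j] ** 2 for i in range(n)) / n for j in range(p)]
    for _ in range(n_iter):
        for j in range(p):
            if col_sq[j] == 0.0:
                continue  # empty gene set: coefficient stays at zero
            # Covariance of column j with the residual that excludes set j.
            rho = sum(X[i][j] * (residual[i] + X[i][j] * beta[j]) for i in range(n)) / n
            new_beta = soft_threshold(rho, lam) / col_sq[j]
            delta = new_beta - beta[j]
            if delta != 0.0:
                for i in range(n):
                    residual[i] -= X[i][j] * delta
                beta[j] = new_beta
    return beta

# Hypothetical toy collection: 60 genes and four overlapping gene sets.
n_genes = 60
gene_sets = {
    "A": range(0, 20),
    "B": range(10, 30),   # overlaps A
    "C": range(30, 50),
    "D": range(15, 35),   # overlaps A, B and C
}
names = list(gene_sets)
X = [[1.0 if g in gene_sets[s] else 0.0 for s in names] for g in range(n_genes)]

# Assumed ground truth: only sets A and C are active, with continuous effects.
true_activity = {"A": 2.0, "B": 0.0, "C": 1.0, "D": 0.0}
random.seed(0)
y = [
    sum(X[g][j] * true_activity[s] for j, s in enumerate(names)) + random.gauss(0.0, 0.05)
    for g in range(n_genes)
]

beta = lasso_coordinate_descent(X, y, lam=0.05)
selected = [s for s, b in zip(names, beta) if abs(b) > 1e-8]
print({s: round(b, 3) for s, b in zip(names, beta)})
print("selected gene sets:", selected)
```

Gene sets with a nonzero coefficient are the ones the sketch reports as active. The L1 penalty shrinks the estimates toward zero, so the fitted coefficients understate the simulated effects. In practice the penalty would be chosen by a data-driven procedure such as cross-validation rather than fixed by hand.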

Year: 2017; PMID: 28472344; PMCID: PMC5499546; DOI: 10.1093/nar/gkx291

Source DB: PubMed; Journal: Nucleic Acids Res; ISSN: 0305-1048; Impact factor: 16.971


