Literature DB >> 23089821

Weighted lasso with data integration.

Linn Cecilie Bergersen1, Ingrid K Glad, Heidi Lyng.   

Abstract

The lasso is one of the most commonly used methods for high-dimensional regression, but can be unstable and lacks satisfactory asymptotic properties for variable selection. We propose to use weighted lasso with integrated relevant external information on the covariates to guide the selection towards more stable results. Weighting the penalties with external information gives each regression coefficient a covariate specific amount of penalization and can improve upon standard methods that do not use such information by borrowing knowledge from the external material. The method is applied to two cancer data sets, with gene expressions as covariates. We find interesting gene signatures, which we are able to validate. We discuss various ideas on how the weights should be defined and illustrate how different types of investigations can utilize our method exploiting different sources of external data. Through simulations, we show that our method outperforms the lasso and the adaptive lasso when the external information is from relevant to partly relevant, in terms of both variable selection and prediction.

Entities:  

Mesh:

Year:  2011        PMID: 23089821     DOI: 10.2202/1544-6115.1703

Source DB:  PubMed          Journal:  Stat Appl Genet Mol Biol        ISSN: 1544-6115


  17 in total

1.  Structured variable selection with q-values.

Authors:  Tanya P Garcia; Samuel Müller; Raymond J Carroll; Tamara N Dunn; Anthony P Thomas; Sean H Adams; Suresh D Pillai; Rosemary L Walzem
Journal:  Biostatistics       Date:  2013-04-10       Impact factor: 5.899

2.  Statistical Methods in Integrative Genomics.

Authors:  Sylvia Richardson; George C Tseng; Wei Sun
Journal:  Annu Rev Stat Appl       Date:  2016-04-18       Impact factor: 5.810

3.  Integration of mechanistic immunological knowledge into a machine learning pipeline improves predictions.

Authors:  Anthony Culos; Amy S Tsai; Natalie Stanley; Martin Becker; Mohammad S Ghaemi; David R McIlwain; Ramin Fallahzadeh; Athena Tanada; Huda Nassar; Camilo Espinosa; Maria Xenochristou; Edward Ganio; Laura Peterson; Xiaoyuan Han; Ina A Stelzer; Kazuo Ando; Dyani Gaudilliere; Thanaphong Phongpreecha; Ivana Marić; Alan L Chang; Gary M Shaw; David K Stevenson; Sean Bendall; Kara L Davis; Wendy Fantl; Garry P Nolan; Trevor Hastie; Robert Tibshirani; Martin S Angst; Brice Gaudilliere; Nima Aghaeepour
Journal:  Nat Mach Intell       Date:  2020-10-12

4.  COX REGRESSION WITH EXCLUSION FREQUENCY-BASED WEIGHTS TO IDENTIFY NEUROIMAGING MARKERS RELEVANT TO HUNTINGTON'S DISEASE ONSET.

Authors:  Tanya P Garcia; Samuel Müller
Journal:  Ann Appl Stat       Date:  2017-01-05       Impact factor: 2.083

5.  New adaptive lasso approaches for variable selection in automated pharmacovigilance signal detection.

Authors:  Pascale Tubert-Bitter; Ismaïl Ahmed; Émeline Courtois
Journal:  BMC Med Res Methodol       Date:  2021-12-01       Impact factor: 4.615

6.  Quantitative profiling of carbonyl metabolites directly in crude biological extracts using chemoselective tagging and nanoESI-FTMS.

Authors:  Pan Deng; Richard M Higashi; Andrew N Lane; Ronald C Bruntz; Ramon C Sun; Mandapati V Ramakrishnam Raju; Michael H Nantz; Zhen Qi; Teresa W-M Fan
Journal:  Analyst       Date:  2017-12-18       Impact factor: 4.616

7.  Adaptive penalization in high-dimensional regression and classification with external covariates using variational Bayes.

Authors:  Britta Velten; Wolfgang Huber
Journal:  Biostatistics       Date:  2021-04-10       Impact factor: 5.899

8.  A Novel Metric for Developing Easy-to-Use and Accurate Clinical Prediction Models: The Time-cost Information Criterion.

Authors:  Sei J Lee; Alexander K Smith; L Grisell Diaz-Ramirez; Kenneth E Covinsky; Siqi Gan; Catherine L Chen; William J Boscardin
Journal:  Med Care       Date:  2021-05-01       Impact factor: 3.178

9.  A Bayesian approach for structure learning in oscillating regulatory networks.

Authors:  Daniel Trejo Banos; Andrew J Millar; Guido Sanguinetti
Journal:  Bioinformatics       Date:  2015-07-14       Impact factor: 6.937

Review 10.  The common ground of genomics and systems biology.

Authors:  Ana Conesa; Ali Mortazavi
Journal:  BMC Syst Biol       Date:  2014-03-13
View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.