Literature DB >> 27248122

The use of vector bootstrapping to improve variable selection precision in Lasso models.

Charles Laurin, Dorret Boomsma, Gitta Lubke.   

Abstract

The Lasso is a shrinkage regression method that is widely used for variable selection in statistical genetics. Commonly, K-fold cross-validation is used to fit a Lasso model. This is sometimes followed by using bootstrap confidence intervals to improve precision in the resulting variable selections. Nesting cross-validation within bootstrapping could provide further improvements in precision, but this has not been investigated systematically. We performed simulation studies of Lasso variable selection precision (VSP) with and without nesting cross-validation within bootstrapping. Data were simulated to represent genomic data under a polygenic model as well as under a model with effect sizes representative of typical GWAS results. We compared these approaches to each other as well as to software defaults for the Lasso. Nested cross-validation had the most precise variable selection at small effect sizes. At larger effect sizes, there was no advantage to nesting. We illustrated the nested approach with empirical data comprising SNPs and SNP-SNP interactions from the most significant SNPs in a GWAS of borderline personality symptoms. In the empirical example, we found that the default Lasso selected low-reliability SNPs and interactions which were excluded by bootstrapping.

Entities:  

Mesh:

Year:  2016        PMID: 27248122      PMCID: PMC5131926          DOI: 10.1515/sagmb-2015-0043

Source DB:  PubMed          Journal:  Stat Appl Genet Mol Biol        ISSN: 1544-6115


  25 in total

1.  Distribution of allele frequencies and effect sizes and their interrelationships for common genetic susceptibility variants.

Authors:  Ju-Hyun Park; Mitchell H Gail; Clarice R Weinberg; Raymond J Carroll; Charles C Chung; Zhaoming Wang; Stephen J Chanock; Joseph F Fraumeni; Nilanjan Chatterjee
Journal:  Proc Natl Acad Sci U S A       Date:  2011-10-14       Impact factor: 11.205

2.  Netherlands Twin Register: from twins to twin families.

Authors:  Dorret I Boomsma; Eco J C de Geus; Jacqueline M Vink; Janine H Stubbe; Marijn A Distel; Jouke-Jan Hottenga; Danielle Posthuma; Toos C E M van Beijsterveldt; James J Hudziak; Meike Bartels; Gonneke Willemsen
Journal:  Twin Res Hum Genet       Date:  2006-12       Impact factor: 1.587

3.  Regularization Paths for Generalized Linear Models via Coordinate Descent.

Authors:  Jerome Friedman; Trevor Hastie; Rob Tibshirani
Journal:  J Stat Softw       Date:  2010       Impact factor: 6.440

Review 4.  Detecting gene-gene interactions that underlie human diseases.

Authors:  Heather J Cordell
Journal:  Nat Rev Genet       Date:  2009-06       Impact factor: 53.242

Review 5.  Abundant pleiotropy in human complex diseases and traits.

Authors:  Shanya Sivakumaran; Felix Agakov; Evropi Theodoratou; James G Prendergast; Lina Zgaga; Teri Manolio; Igor Rudan; Paul McKeigue; James F Wilson; Harry Campbell
Journal:  Am J Hum Genet       Date:  2011-11-11       Impact factor: 11.025

6.  Reprioritizing genetic associations in hit regions using LASSO-based resample model averaging.

Authors:  William Valdar; Jeremy Sabourin; Andrew Nobel; Christopher C Holmes
Journal:  Genet Epidemiol       Date:  2012-04-30       Impact factor: 2.135

7.  Optimized application of penalized regression methods to diverse genomic data.

Authors:  Levi Waldron; Melania Pintilie; Ming-Sound Tsao; Frances A Shepherd; Curtis Huttenhower; Igor Jurisica
Journal:  Bioinformatics       Date:  2011-12-15       Impact factor: 6.937

8.  LASSO model selection with post-processing for a genome-wide association study data set.

Authors:  Allan J Motyer; Chris McKendry; Sally Galbraith; Susan R Wilson
Journal:  BMC Proc       Date:  2011-11-29

9.  Combining least absolute shrinkage and selection operator (LASSO) and principal-components analysis for detection of gene-gene interactions in genome-wide association studies.

Authors:  Gina M D'Angelo; Dc Rao; C Charles Gu
Journal:  BMC Proc       Date:  2009-12-15

10.  Genome-wide analyses of borderline personality features.

Authors:  G H Lubke; C Laurin; N Amin; J J Hottenga; G Willemsen; G van Grootheest; A Abdellaoui; L C Karssen; B A Oostra; C M van Duijn; B W J H Penninx; D I Boomsma
Journal:  Mol Psychiatry       Date:  2013-08-27       Impact factor: 15.992

View more
  6 in total

1.  Genes, exposures, and interactions on preterm birth risk: an exploratory study in an Argentine population.

Authors:  Dario E Elias; Maria R Santos; Hebe Campaña; Fernando A Poletta; Silvina L Heisecke; Juan A Gili; Julia Ratowiecki; Viviana Cosentino; Rocio Uranga; Diana Rojas Málaga; Alice Brinckmann Oliveira Netto; Ana Carolina Brusius-Facchin; César Saleme; Mónica Rittler; Hugo B Krupitzki; Jorge S Lopez Camelo; Lucas G Gimenez
Journal:  J Community Genet       Date:  2022-08-17

2.  Development and validation of a lifestyle-based model for colorectal cancer risk prediction: the LiFeCRC score.

Authors:  Krasimira Aleksandrova; Robin Reichmann; Rudolf Kaaks; Mazda Jenab; H Bas Bueno-de-Mesquita; Christina C Dahm; Anne Kirstine Eriksen; Anne Tjønneland; Fanny Artaud; Marie-Christine Boutron-Ruault; Gianluca Severi; Anika Hüsing; Antonia Trichopoulou; Anna Karakatsani; Eleni Peppa; Salvatore Panico; Giovanna Masala; Sara Grioni; Carlotta Sacerdote; Rosario Tumino; Sjoerd G Elias; Anne M May; Kristin B Borch; Torkjel M Sandanger; Guri Skeie; Maria-Jose Sánchez; José María Huerta; Núria Sala; Aurelio Barricarte Gurrea; José Ramón Quirós; Pilar Amiano; Jonna Berntsson; Isabel Drake; Bethany van Guelpen; Sophia Harlid; Tim Key; Elisabete Weiderpass; Elom K Aglago; Amanda J Cross; Konstantinos K Tsilidis; Elio Riboli; Marc J Gunter
Journal:  BMC Med       Date:  2021-01-04       Impact factor: 8.775

3.  Leveraging pleiotropic association using sparse group variable selection in genomics data.

Authors:  Matthew Sutton; Pierre-Emmanuel Sugier; Therese Truong; Benoit Liquet
Journal:  BMC Med Res Methodol       Date:  2022-01-07       Impact factor: 4.615

4.  Scoring System Based on RNA Modification Writer-Related Genes to Predict Overall Survival and Therapeutic Response in Bladder Cancer.

Authors:  Pu Zhang; Zijian Liu; Decai Wang; Yunxue Li; Yifei Xing; Yajun Xiao
Journal:  Front Immunol       Date:  2021-08-26       Impact factor: 7.561

5.  The effect of nonpharmaceutical interventions on COVID-19 infections for lower and middle-income countries: A debiased LASSO approach.

Authors:  Akbar Zamanzadeh; Tony Cavoli
Journal:  PLoS One       Date:  2022-07-22       Impact factor: 3.752

6.  Factors Associated With Return to Work After Acute Myocardial Infarction in China.

Authors:  Zihan Jiang; Rachel P Dreyer; John A Spertus; Frederick A Masoudi; Jing Li; Xin Zheng; Xi Li; Chaoqun Wu; Xueke Bai; Shuang Hu; Yun Wang; Harlan M Krumholz; Hong Chen
Journal:  JAMA Netw Open       Date:  2018-11-02
  6 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.