Fast Lasso method for large-scale and ultrahigh-dimensional Cox model with applications to UK Biobank.

Ruilin Li, Christopher Chang, Johanne M Justesen, Yosuke Tanigawa, Junyang Qian, Trevor Hastie, Manuel A Rivas, Robert Tibshirani.

Abstract

We develop a scalable and highly efficient algorithm to fit a Cox proportional hazard model by maximizing the $L^1$-regularized (Lasso) partial likelihood function, based on the Batch Screening Iterative Lasso (BASIL) method developed in Qian and others (2019). Our algorithm is particularly suitable for large-scale and high-dimensional data that do not fit in memory. The output of our algorithm is the full Lasso path, that is, the parameter estimates at all predefined regularization parameters, together with their validation accuracy measured using the concordance index (C-index) or the validation deviance. To demonstrate the effectiveness of our algorithm, we analyze a large genotype-survival time dataset across 306 disease outcomes from the UK Biobank (Sudlow and others, 2015). We provide a publicly available implementation of the proposed approach for genetics data on top of the PLINK2 package and name it snpnet-Cox.
© The Author 2020. Published by Oxford University Press. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.
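
The two quantities named in the abstract, the Lasso-penalized Cox partial likelihood and the C-index, can be illustrated with a small in-memory sketch. This is not the snpnet-Cox implementation and does not include the BASIL screening step; the function names, the Breslow handling of tied times, and the division of the loss by the sample size are assumptions made here for illustration only.

```python
import numpy as np


def cox_lasso_objective(beta, X, time, event, lam):
    """Negative log partial likelihood (Breslow ties) / n + lam * ||beta||_1.

    Minimizing this is equivalent to maximizing the L1-regularized
    partial likelihood. The 1/n scaling is an assumption of this sketch.
    """
    beta = np.asarray(beta, dtype=float)
    time = np.asarray(time, dtype=float)
    event = np.asarray(event, dtype=float)
    eta = np.asarray(X, dtype=float) @ beta  # linear predictor per subject

    # Sort by descending time so that a running log-sum-exp of eta
    # covers the risk set {j : time_j >= time_i}.
    order = np.argsort(-time, kind="stable")
    eta_s, event_s, time_s = eta[order], event[order], time[order]
    running = np.logaddexp.accumulate(eta_s)

    # For tied times, read the running sum at the last member of the tie
    # group so every tied subject sees the same (full) risk set.
    last = np.searchsorted(-time_s, -time_s, side="right") - 1
    log_risk = running[last]

    nll = -np.sum(event_s * (eta_s - log_risk))
    return nll / len(time) + lam * np.sum(np.abs(beta))


def concordance_index(eta, time, event):
    """Harrell's C-index: fraction of comparable pairs ranked correctly.

    A pair (i, j) is comparable when subject i has an observed event and
    time_i < time_j. Ties in the risk score count as half concordant.
    """
    concordant, comparable = 0.0, 0
    n = len(time)
    for i in range(n):
        if not event[i]:
            continue  # censored subjects cannot anchor a comparable pair
        for j in range(n):
            if time[i] < time[j]:
                comparable += 1
                if eta[i] > eta[j]:
                    concordant += 1.0
                elif eta[i] == eta[j]:
                    concordant += 0.5
    return concordant / comparable


# Toy data: three subjects, one feature, the last subject is censored.
X = np.array([[1.0], [0.0], [-1.0]])
time = np.array([1.0, 2.0, 3.0])
event = np.array([1, 1, 0])

obj_at_zero = cox_lasso_objective(np.zeros(1), X, time, event, lam=0.1)
c_index = concordance_index(X @ np.array([1.0]), time, event)
```

The quadratic-time C-index loop is written for readability; on biobank-scale data a sorted or tree-based computation would be needed, and the design matrix would have to be streamed rather than held in memory.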

Keywords:  Concordance index; Cox proportional hazard model; LASSO; Time-to-event data; UK Biobank

Year:  2022        PMID: 32989444      PMCID: PMC9007437          DOI: 10.1093/biostatistics/kxaa038

Source DB:  PubMed          Journal:  Biostatistics        ISSN: 1465-4644            Impact factor:   5.899


  19 in total

Review 1.  Inherited disorders of bilirubin metabolism.

Authors:  Piter Jabik Bosma
Journal:  J Hepatol       Date:  2003-01       Impact factor: 25.083

Review 2.  Human UDP-glucuronosyltransferases: metabolism, expression, and disease.

Authors:  R H Tukey; C P Strassburg
Journal:  Annu Rev Pharmacol Toxicol       Date:  2000       Impact factor: 13.820

Review 3.  Clinical practice. Gout.

Authors:  Robert A Terkeltaub
Journal:  N Engl J Med       Date:  2003-10-23       Impact factor: 91.245

4.  Sequence variants affecting eosinophil numbers associate with asthma and myocardial infarction.

Authors:  Daniel F Gudbjartsson; Unnur S Bjornsdottir; Eva Halapi; Anna Helgadottir; Patrick Sulem; Gudrun M Jonsdottir; Gudmar Thorleifsson; Hafdis Helgadottir; Valgerdur Steinthorsdottir; Hreinn Stefansson; Carolyn Williams; Jennie Hui; John Beilby; Nicole M Warrington; Alan James; Lyle J Palmer; Gerard H Koppelman; Andrea Heinzmann; Marcus Krueger; H Marike Boezen; Amanda Wheatley; Janine Altmuller; Hyoung Doo Shin; Soo-Taek Uh; Hyun Sub Cheong; Brynja Jonsdottir; David Gislason; Choon-Sik Park; Linda M Rasmussen; Celeste Porsbjerg; Jakob W Hansen; Vibeke Backer; Thomas Werge; Christer Janson; Ulla-Britt Jönsson; Maggie C Y Ng; Juliana Chan; Wing Yee So; Ronald Ma; Svati H Shah; Christopher B Granger; Arshed A Quyyumi; Allan I Levey; Viola Vaccarino; Muredach P Reilly; Daniel J Rader; Michael J A Williams; Andre M van Rij; Gregory T Jones; Elisabetta Trabetti; Giovanni Malerba; Pier Franco Pignatti; Attilio Boner; Lydia Pescollderungg; Domenico Girelli; Oliviero Olivieri; Nicola Martinelli; Bjorn R Ludviksson; Dora Ludviksdottir; Gudmundur I Eyjolfsson; David Arnar; Gudmundur Thorgeirsson; Klaus Deichmann; Philip J Thompson; Matthias Wjst; Ian P Hall; Dirkje S Postma; Thorarinn Gislason; Jeffrey Gulcher; Augustine Kong; Ingileif Jonsdottir; Unnur Thorsteinsdottir; Kari Stefansson
Journal:  Nat Genet       Date:  2009-02-08       Impact factor: 38.330

5.  Statistical learning and selective inference.

Authors:  Jonathan Taylor; Robert J Tibshirani
Journal:  Proc Natl Acad Sci U S A       Date:  2015-06-23       Impact factor: 11.205

6.  Covariance analysis of censored survival data.

Authors:  N Breslow
Journal:  Biometrics       Date:  1974-03       Impact factor: 2.571

Review 7.  The role of monogenic disease in children with very early onset inflammatory bowel disease.

Authors:  Judith R Kelsen; Robert N Baldassano
Journal:  Curr Opin Pediatr       Date:  2017-10       Impact factor: 2.856

8.  Evaluating the yield of medical tests.

Authors:  F E Harrell; R M Califf; D B Pryor; K L Lee; R A Rosati
Journal:  JAMA       Date:  1982-05-14       Impact factor: 56.272

9.  Regularization Paths for Generalized Linear Models via Coordinate Descent.

Authors:  Jerome Friedman; Trevor Hastie; Rob Tibshirani
Journal:  J Stat Softw       Date:  2010       Impact factor: 6.440

10.  The UK Biobank resource with deep phenotyping and genomic data.

Authors:  Clare Bycroft; Colin Freeman; Desislava Petkova; Gavin Band; Lloyd T Elliott; Kevin Sharp; Allan Motyer; Damjan Vukcevic; Olivier Delaneau; Jared O'Connell; Adrian Cortes; Samantha Welsh; Alan Young; Mark Effingham; Gil McVean; Stephen Leslie; Naomi Allen; Peter Donnelly; Jonathan Marchini
Journal:  Nature       Date:  2018-10-10       Impact factor: 49.962

  8 in total

1.  LARGE-SCALE MULTIVARIATE SPARSE REGRESSION WITH APPLICATIONS TO UK BIOBANK.

Authors:  Junyang Qian; Yosuke Tanigawa; Ruilin Li; Robert Tibshirani; Manuel A Rivas; Trevor Hastie
Journal:  Ann Appl Stat       Date:  2022-07-19       Impact factor: 1.959

2.  A fast and scalable framework for large-scale and ultrahigh-dimensional sparse regression with application to the UK Biobank.

Authors:  Junyang Qian; Yosuke Tanigawa; Wenfei Du; Matthew Aguirre; Chris Chang; Robert Tibshirani; Manuel A Rivas; Trevor Hastie
Journal:  PLoS Genet       Date:  2020-10-23       Impact factor: 5.917

3.  Computationally scalable regression modeling for ultrahigh-dimensional omics data with ParProx.

Authors:  Seyoon Ko; Ginny X Li; Hyungwon Choi; Joong-Ho Won
Journal:  Brief Bioinform       Date:  2021-11-05       Impact factor: 11.622

4.  Genomic architecture and prediction of censored time-to-event phenotypes with a Bayesian genome-wide analysis.

Authors:  Sven E Ojavee; Athanasios Kousathanas; Daniel Trejo Banos; Etienne J Orliac; Marion Patxot; Kristi Läll; Reedik Mägi; Krista Fischer; Zoltan Kutalik; Matthew R Robinson
Journal:  Nat Commun       Date:  2021-04-20       Impact factor: 14.919

5.  A novel 14-gene signature for overall survival in lung adenocarcinoma based on the Bayesian hierarchical Cox proportional hazards model.

Authors:  Na Sun; Jiadong Chu; Wei Hu; Xuanli Chen; Nengjun Yi; Yueping Shen
Journal:  Sci Rep       Date:  2022-01-07       Impact factor: 4.379

6.  Accounting for age of onset and family history improves power in genome-wide association studies.

Authors:  Emil M Pedersen; Esben Agerbo; Oleguer Plana-Ripoll; Jakob Grove; Julie W Dreier; Katherine L Musliner; Marie Bækvad-Hansen; Georgios Athanasiadis; Andrew Schork; Jonas Bybjerg-Grauholm; David M Hougaard; Thomas Werge; Merete Nordentoft; Ole Mors; Søren Dalsgaard; Jakob Christensen; Anders D Børglum; Preben B Mortensen; John J McGrath; Florian Privé; Bjarni J Vilhjálmsson
Journal:  Am J Hum Genet       Date:  2022-02-08       Impact factor: 11.025

7.  Survival Analysis on Rare Events Using Group-Regularized Multi-Response Cox Regression.

Authors:  Ruilin Li; Yosuke Tanigawa; Johanne M Justesen; Jonathan Taylor; Trevor Hastie; Robert Tibshirani; Manuel A Rivas
Journal:  Bioinformatics       Date:  2021-02-09       Impact factor: 6.937

8.  Significant sparse polygenic risk scores across 813 traits in UK Biobank.

Authors:  Yosuke Tanigawa; Junyang Qian; Guhan Venkataraman; Johanne Marie Justesen; Ruilin Li; Robert Tibshirani; Trevor Hastie; Manuel A Rivas
Journal:  PLoS Genet       Date:  2022-03-24       Impact factor: 6.020
