Literature DB >> 30387139

Log-ratio lasso: Scalable, sparse estimation for log-ratio models.

Stephen Bates1, Robert Tibshirani2.   

Abstract

Positive-valued signal data is common in the biological and medical sciences, due to the prevalence of mass spectrometry other imaging techniques. With such data, only the relative intensities of the raw measurements are meaningful. It is desirable to consider models consisting of the log-ratios of all pairs of the raw features, since log-ratios are the simplest meaningful derived features. In this case, however, the dimensionality of the predictor space becomes large, and computationally efficient estimation procedures are required. In this work, we introduce an embedding of the log-ratio parameter space into a space of much lower dimension and use this representation to develop an efficient penalized fitting procedure. This procedure serves as the foundation for a two-step fitting procedure that combines a convex filtering step with a second non-convex pruning step to yield highly sparse solutions. On a cancer proteomics data set, the proposed method fits a highly sparse model consisting of features of known biological relevance while greatly improving upon the predictive accuracy of less interpretable methods.
© 2018 International Biometric Society.

Entities:  

Keywords:  compositional data; lasso; log-ratio; mass spectrometry; variable selection

Mesh:

Year:  2019        PMID: 30387139      PMCID: PMC9470385          DOI: 10.1111/biom.12995

Source DB:  PubMed          Journal:  Biometrics        ISSN: 0006-341X            Impact factor:   1.701


  3 in total

1.  The lasso method for variable selection in the Cox model.

Authors:  R Tibshirani
Journal:  Stat Med       Date:  1997-02-28       Impact factor: 2.373

2.  Diagnosis of prostate cancer by desorption electrospray ionization mass spectrometric imaging of small metabolites and lipids.

Authors:  Shibdas Banerjee; Richard N Zare; Robert J Tibshirani; Christian A Kunder; Rosalie Nolley; Richard Fan; James D Brooks; Geoffrey A Sonn
Journal:  Proc Natl Acad Sci U S A       Date:  2017-03-14       Impact factor: 11.205

3.  Regularization Paths for Generalized Linear Models via Coordinate Descent.

Authors:  Jerome Friedman; Trevor Hastie; Rob Tibshirani
Journal:  J Stat Softw       Date:  2010       Impact factor: 6.440

  3 in total
  4 in total

1.  Integration of Proteomics and Other Omics Data.

Authors:  Mengyun Wu; Yu Jiang; Shuangge Ma
Journal:  Methods Mol Biol       Date:  2021

2.  Learning Sparse Log-Ratios for High-Throughput Sequencing Data.

Authors:  Elliott Gordon-Rodriguez; Thomas P Quinn; John P Cunningham
Journal:  Bioinformatics       Date:  2021-09-08       Impact factor: 6.937

Review 3.  Network analysis methods for studying microbial communities: A mini review.

Authors:  Monica Steffi Matchado; Michael Lauber; Sandra Reitmeier; Tim Kacprowski; Jan Baumbach; Dirk Haller; Markus List
Journal:  Comput Struct Biotechnol J       Date:  2021-05-04       Impact factor: 7.271

4.  Tumor Purity Coexpressed Genes Related to Immune Microenvironment and Clinical Outcomes of Lung Adenocarcinoma.

Authors:  Ming Bai; Qi Pan; Chen Sun
Journal:  J Oncol       Date:  2021-06-14       Impact factor: 4.375

  4 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.