Literature DB >> 20812908

Lasso logistic regression, GSoft and the cyclic coordinate descent algorithm: application to gene expression data.

Manuel Garcia-Magariños1, Anestis Antoniadis, Ricardo Cao, Wenceslao Gonzãlez-Manteiga.   

Abstract

Statistical methods generating sparse models are of great value in the gene expression field, where the number of covariates (genes) under study moves about the thousands while the sample sizes seldom reach a hundred of individuals. For phenotype classification, we propose different lasso logistic regression approaches with specific penalizations for each gene. These methods are based on a generalized soft-threshold (GSoft) estimator. We also show that a recent algorithm for convex optimization, namely, the cyclic coordinate descent (CCD) algorithm, provides with a way to solve the optimization problem significantly faster than with other competing methods. Viewing GSoft as an iterative thresholding procedure allows us to get the asymptotic properties of the resulting estimates in a straightforward manner. Results are obtained for simulated and real data. The leukemia and colon datasets are commonly used to evaluate new statistical approaches, so they come in useful to establish comparisons with similar methods. Furthermore, biological meaning is extracted from the leukemia results, and compared with previous studies. In summary, the approaches presented here give rise to sparse, interpretable models that are competitive with similar methods developed in the field.

Entities:  

Mesh:

Year:  2010        PMID: 20812908     DOI: 10.2202/1544-6115.1536

Source DB:  PubMed          Journal:  Stat Appl Genet Mol Biol        ISSN: 1544-6115


  2 in total

1.  High-throughput DNA methylation datasets for evaluating false discovery rate methodologies.

Authors:  N Asomaning; K J Archer
Journal:  Comput Stat Data Anal       Date:  2011-10-29       Impact factor: 1.681

2.  Identification of diagnostic gene biomarkers and immune infiltration in patients with diabetic kidney disease using machine learning strategies and bioinformatic analysis.

Authors:  Shaojie Fu; Yanli Cheng; Xueyao Wang; Jingda Huang; Sensen Su; Hao Wu; Jinyu Yu; Zhonggao Xu
Journal:  Front Med (Lausanne)       Date:  2022-09-29
  2 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.