| Literature DB >> 32281490 |
Yang Li1,2,3, Fan Wang2,3, Rong Li2,3, Yifan Sun1,2.
Abstract
In genomic analysis, it is significant though challenging to identify markers associated with cancer outcomes or phenotypes. Based on the biological mechanisms of cancers and the characteristics of datasets, we propose a novel integrative interaction approach under a semiparametric model, in which genetic and environmental factors are included as the parametric and nonparametric components, respectively. The goal of this approach is to identify the genetic factors and gene-gene interactions associated with cancer outcomes, while estimating the nonlinear effects of environmental factors. The proposed approach is based on the threshold gradient-directed regularisation technique. Simulation studies indicate that the proposed approach outperforms alternative methods at identifying the main effects and interactions, and has favourable estimation and prediction accuracy. We analysed non-small-cell lung carcinoma datasets from the Cancer Genome Atlas, and the results demonstrate that the proposed approach can identify markers with important implications and that it performs favourably in terms of prediction accuracy, identification stability, and computation cost.Entities:
Keywords: Heterogeneity; nonlinear model; strong hierarchy; the Cancer Genome Atlas; threshold gradient directed regularisation
Mesh:
Year: 2020 PMID: 32281490 DOI: 10.1177/0962280220909969
Source DB: PubMed Journal: Stat Methods Med Res ISSN: 0962-2802 Impact factor: 3.021