| Literature DB >> 32677168 |
Qiang Hu1, Liang Zhu2, Yanyan Liu3, Jianguo Sun4, Deo Kumar Srivastava5, Leslie L Robison6.
Abstract
For the analysis of ultrahigh-dimensional data, the first step is often to perform screening and feature selection to effectively reduce the dimensionality while retaining all the active or relevant variables with high probability. For this, many methods have been developed under various frameworks but most of them only apply to complete data. In this paper, we consider an incomplete data situation, case II interval-censored failure time data, for which there seems to be no screening procedure. Basing on the idea of cumulative residual, a model-free or nonparametric method is developed and shown to have the sure independent screening property. In particular, the approach is shown to tend to rank the active variables above the inactive ones in terms of their association with the failure time of interest. A simulation study is conducted to demonstrate the usefulness of the proposed method and, in particular, indicates that it works well with general survival models and is capable of capturing the nonlinear covariates with interactions. Also the approach is applied to a childhood cancer survivor study that motivated this investigation.Entities:
Keywords: cumulative residual; generalized Turnbull estimator; interval-censored; model-free screening; sure screening property
Year: 2020 PMID: 32677168 PMCID: PMC7988961 DOI: 10.1002/bimj.201900154
Source DB: PubMed Journal: Biom J ISSN: 0323-3847 Impact factor: 2.207