| Literature DB >> 35400988 |
Jingxiang Chen1, Quoc Tran-Dinh2, Michael R Kosorok3, Yufeng Liu4.
Abstract
Precision medicine is an important area of research with the goal of identifying the optimal treatment for each individual patient. In the literature, various methods are proposed to divide the population into subgroups according to the heterogeneous effects of individuals. In this paper, a new exploratory machine learning tool, named latent supervised clustering, is proposed to identify the heterogeneous subpopulations. In particular, we formulate the problem as a regression problem with subject specific coefficients, and use adaptive fusion to cluster the coefficients into subpopulations. This method has two main advantages. First, it relies on little prior knowledge and weak parametric assumptions on the underlying subpopulation structure. Second, it makes use of the outcome-predictor relationship, and hence can have competitive estimation and prediction accuracy. To estimate the parameters, we design a highly efficient accelerated proximal gradient algorithm which guarantees convergence at a competitive rate. Numerical studies show that the proposed method has competitive estimation and prediction accuracy, and can also produce interpretable clustering results for the underlying heterogeneous effect.Entities:
Keywords: Accelerated Proximal Gradient Algorithm; Clustering Analysis; Convex Clustering; Machine Learning; Precision Medicine; Subpopulation Identification
Year: 2020 PMID: 35400988 PMCID: PMC8993151 DOI: 10.1080/10618600.2020.1763808
Source DB: PubMed Journal: J Comput Graph Stat ISSN: 1061-8600 Impact factor: 1.884