| Literature DB >> 35707024 |
Haruka Yamashita1, Yoshinobu Kawahara2.
Abstract
Analysis with principal points is a useful statistical tool for summarizing large data. In this paper, we propose a subgradient-based algorithm to calculate a set of principal points for multivariate binary data by the formulating it as a p-median problem. This enables us to find a globally optimal set of principal points or an ε-optimal solution in the middle of the calculation by combining an upper bound found using the greedy method. This algorithm is an iterative procedure where each iteration can be calculated in an efficient manner. We investigate the applicability of the proposed framework with questionnaire data and arXiv co-authors data.Entities:
Keywords: Lagrangian relaxation; Statistical data analysis; principal points; supermodular minimization
Year: 2019 PMID: 35707024 PMCID: PMC9041942 DOI: 10.1080/02664763.2019.1675605
Source DB: PubMed Journal: J Appl Stat ISSN: 0266-4763 Impact factor: 1.416