Principal points analysis via p-median problem for binary data

© 2019 Informa UK Limited, trading as Taylor & Francis Group.

Détails bibliographiques
Publié dans:Journal of applied statistics. - 1991. - 47(2020), 7 vom: 17., Seite 1282-1297
Auteur principal: Yamashita, Haruka (Auteur)
Autres auteurs: Kawahara, Yoshinobu
Format: Article en ligne
Langue:English
Publié: 2020
Accès à la collection:Journal of applied statistics
Sujets:Journal Article Lagrangian relaxation Statistical data analysis principal points supermodular minimization
Description
Résumé:© 2019 Informa UK Limited, trading as Taylor & Francis Group.
Analysis with principal points is a useful statistical tool for summarizing large data. In this paper, we propose a subgradient-based algorithm to calculate a set of principal points for multivariate binary data by the formulating it as a p-median problem. This enables us to find a globally optimal set of principal points or an ε-optimal solution in the middle of the calculation by combining an upper bound found using the greedy method. This algorithm is an iterative procedure where each iteration can be calculated in an efficient manner. We investigate the applicability of the proposed framework with questionnaire data and arXiv co-authors data
Description:Date Revised 16.07.2022
published: Electronic-eCollection
Citation Status PubMed-not-MEDLINE
ISSN:0266-4763
DOI:10.1080/02664763.2019.1675605