Identification of outlying observations for large-dimensional data

© 2021 Informa UK Limited, trading as Taylor & Francis Group.

Bibliographische Detailangaben
Veröffentlicht in:Journal of applied statistics. - 1991. - 50(2023), 2 vom: 30., Seite 370-386
1. Verfasser: Wang, Tao (VerfasserIn)
Weitere Verfasser: Yang, Xiaona, Guo, Yunfei, Li, Zhonghua
Format: Online-Aufsatz
Sprache:English
Veröffentlicht: 2023
Zugriff auf das übergeordnete Werk:Journal of applied statistics
Schlagworte:Journal Article Outlier identification asymptotic distribution large-dimension statistics multiple hypothesis testing robust statistics
Beschreibung
Zusammenfassung:© 2021 Informa UK Limited, trading as Taylor & Francis Group.
This work proposes a two-stage procedure for identifying outlying observations in a large-dimensional data set. In the first stage, an outlier identification measure is defined by using a max-normal statistic and a clean subset that contains non-outliers is obtained. The identification of outliers can be deemed as a multiple hypothesis testing problem, then, in the second stage, we explore the asymptotic distribution of the proposed measure, and obtain the threshold of the outlying observations. Furthermore, in order to improve the identification power and better control the misjudgment rate, a one-step refined algorithm is proposed. Simulation results and two real data analysis examples show that, compared with other methods, the proposed procedure has great advantages in identifying outliers in various data situations
Beschreibung:Date Revised 02.02.2023
published: Electronic-eCollection
Citation Status PubMed-not-MEDLINE
ISSN:0266-4763
DOI:10.1080/02664763.2021.1993799