Time to retire F1-binary score for action unit detection

Bibliographic details
Published in: Pattern recognition letters. - 1998. - 182(2024), 1 June, pages 111-117
First author: Hinduja, Saurabh (author)
Other authors: Nourivandi, Tara; Cohn, Jeffrey F; Canavan, Shaun
Format: Online article
Language: English
Published: 2024
Access to the parent work: Pattern recognition letters
Keywords: Journal Article; Action units; Data imbalance; F1 score; Machine learning
Description
Summary: Detecting action units is an important task in face analysis, especially in facial expression recognition. This is due, in part, to the idea that expressions can be decomposed into multiple action units. To evaluate systems that detect action units, F1-binary score is often used as the evaluation metric. In this paper, we argue that F1-binary score does not reliably evaluate these models due largely to class imbalance. Because of this, F1-binary score should be retired and a suitable replacement should be used. We justify this argument through a detailed evaluation of the negative influence of class imbalance on action unit detection. This includes an investigation into the influence of class imbalance in train and test sets and in new data (i.e., generalizability). We empirically show that F1-micro should be used as the replacement for F1-binary.
Description: Date Revised 03.08.2024
published: Print-Electronic
Citation Status PubMed-not-MEDLINE
ISSN: 0167-8655
DOI: 10.1016/j.patrec.2024.04.016
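
The summary contrasts F1-binary with F1-micro under class imbalance. The following is a minimal sketch of how the two scores can differ on the same predictions. It assumes the common scikit-learn-style definitions (F1-binary scores only the positive class; F1-micro pools the counts of both classes), since this record does not spell out the paper's exact computation. The function names and the toy labels are hypothetical and are not taken from the paper.

```python
from typing import Sequence, Tuple


def confusion_counts(
    y_true: Sequence[int], y_pred: Sequence[int], positive: int
) -> Tuple[int, int, int]:
    """Return (TP, FP, FN), treating `positive` as the class of interest."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == positive and p == positive)
    fp = sum(1 for t, p in pairs if t != positive and p == positive)
    fn = sum(1 for t, p in pairs if t == positive and p != positive)
    return tp, fp, fn


def f1_from_counts(tp: int, fp: int, fn: int) -> float:
    """F1 = 2*TP / (2*TP + FP + FN); defined as 0.0 when the denominator is 0."""
    denom = 2 * tp + fp + fn
    return 2 * tp / denom if denom else 0.0


def f1_binary(y_true: Sequence[int], y_pred: Sequence[int]) -> float:
    """F1 of the positive class only (label 1 = action unit active)."""
    return f1_from_counts(*confusion_counts(y_true, y_pred, positive=1))


def f1_micro(y_true: Sequence[int], y_pred: Sequence[int]) -> float:
    """F1 from TP/FP/FN pooled over both classes (0 and 1)."""
    tp = fp = fn = 0
    for label in (0, 1):
        c_tp, c_fp, c_fn = confusion_counts(y_true, y_pred, positive=label)
        tp, fp, fn = tp + c_tp, fp + c_fp, fn + c_fn
    return f1_from_counts(tp, fp, fn)


# Invented toy data: 100 frames, the action unit is active in only 5 of them.
# The hypothetical detector finds 3 of the 5 and raises 4 false alarms.
y_true = [1] * 5 + [0] * 95
y_pred = [1] * 3 + [0] * 2 + [1] * 4 + [0] * 91

print(f"F1-binary: {f1_binary(y_true, y_pred):.2f}")
print(f"F1-micro:  {f1_micro(y_true, y_pred):.2f}")
```

For single-label binary data, pooling the counts over both classes makes F1-micro equal to plain accuracy, so the two scores weigh the rare positive class very differently. Which behaviour is preferable for action unit detection is the paper's argument; the sketch only shows the arithmetic.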