Learning Probabilistic Presence-Absence Evidence for Weakly-Supervised Audio-Visual Event Perception

With only video-level event labels, this paper targets at the task of weakly-supervised audio-visual event perception (WS-AVEP), which aims to temporally localize and categorize events that belong to each modality. Despite the recent progress, most existing approaches either ignore the unsynchronize...

Description complète

Détails bibliographiques
Publié dans:IEEE transactions on pattern analysis and machine intelligence. - 1979. - 47(2025), 6 vom: 03. Mai, Seite 4787-4802
Auteur principal: Gao, Junyu (Auteur)
Autres auteurs: Chen, Mengyuan, Xu, Changsheng
Format: Article en ligne
Langue:English
Publié: 2025
Accès à la collection:IEEE transactions on pattern analysis and machine intelligence
Sujets:Journal Article