End-to-End Temporal Action Detection With Transformer

Temporal action detection (TAD) aims to determine the semantic label and the temporal interval of every action instance in an untrimmed video. It is a fundamental and challenging task in video understanding. Previous methods tackle this task with complicated pipelines. They often need to train multi...

Description complète

Détails bibliographiques
Publié dans:IEEE transactions on image processing : a publication of the IEEE Signal Processing Society. - 1992. - 31(2022) vom: 10., Seite 5427-5441
Auteur principal: Liu, Xiaolong (Auteur)
Autres auteurs: Wang, Qimeng, Hu, Yao, Tang, Xu, Zhang, Shiwei, Bai, Song, Bai, Xiang
Format: Article en ligne
Langue:English
Publié: 2022
Accès à la collection:IEEE transactions on image processing : a publication of the IEEE Signal Processing Society
Sujets:Journal Article