End-to-End Temporal Action Detection With Transformer
Temporal action detection (TAD) aims to determine the semantic label and the temporal interval of every action instance in an untrimmed video. It is a fundamental and challenging task in video understanding. Previous methods tackle this task with complicated pipelines. They often need to train multi...
Description complète
Détails bibliographiques
Publié dans: | IEEE transactions on image processing : a publication of the IEEE Signal Processing Society. - 1992. - 31(2022) vom: 10., Seite 5427-5441
|
Auteur principal: |
Liu, Xiaolong
(Auteur) |
Autres auteurs: |
Wang, Qimeng,
Hu, Yao,
Tang, Xu,
Zhang, Shiwei,
Bai, Song,
Bai, Xiang |
Format: | Article en ligne
|
Langue: | English |
Publié: |
2022
|
Accès à la collection: | IEEE transactions on image processing : a publication of the IEEE Signal Processing Society
|
Sujets: | Journal Article |