TransVOD : End-to-End Video Object Detection With Spatial-Temporal Transformers

Detection Transformer (DETR) and Deformable DETR have been proposed to eliminate the need for many hand-designed components in object detection while demonstrating good performance as previous complex hand-crafted detectors. However, their performance on Video Object Detection (VOD) has not been wel...

Ausführliche Beschreibung

Bibliographische Detailangaben
Veröffentlicht in:IEEE transactions on pattern analysis and machine intelligence. - 1979. - 45(2023), 6 vom: 23. Juni, Seite 7853-7869
1. Verfasser: Zhou, Qianyu (VerfasserIn)
Weitere Verfasser: Li, Xiangtai, He, Lu, Yang, Yibo, Cheng, Guangliang, Tong, Yunhai, Ma, Lizhuang, Tao, Dacheng
Format: Online-Aufsatz
Sprache:English
Veröffentlicht: 2023
Zugriff auf das übergeordnete Werk:IEEE transactions on pattern analysis and machine intelligence
Schlagworte:Journal Article