I2Transformer : Intra- and Inter-Relation Embedding Transformer for TV Show Captioning

TV show captioning aims to generate a linguistic sentence based on the video and its associated subtitle. Compared to purely video-based captioning, the subtitle can provide the captioning model with useful semantic clues such as actors' sentiments and intentions. However, the effective use of...

Ausführliche Beschreibung

Bibliographische Detailangaben
Veröffentlicht in:IEEE transactions on image processing : a publication of the IEEE Signal Processing Society. - 1992. - 31(2022) vom: 25., Seite 3565-3577
1. Verfasser: Tu, Yunbin (VerfasserIn)
Weitere Verfasser: Li, Liang, Su, Li, Gao, Shengxiang, Yan, Chenggang, Zha, Zheng-Jun, Yu, Zhengtao, Huang, Qingming
Format: Online-Aufsatz
Sprache:English
Veröffentlicht: 2022
Zugriff auf das übergeordnete Werk:IEEE transactions on image processing : a publication of the IEEE Signal Processing Society
Schlagworte:Journal Article