Dense Relational Image Captioning via Multi-Task Triple-Stream Networks

We introduce dense relational captioning, a novel image captioning task which aims to generate multiple captions with respect to relational information between objects in a visual scene. Relational captioning provides explicit descriptions for each relationship between object combinations. This fram...

Bibliographic Details
Published in: IEEE transactions on pattern analysis and machine intelligence. - 1979. - 44(2022), 11, 14 Nov., pages 7348-7362
Main Author: Kim, Dong-Jin (Author)
Other Authors: Oh, Tae-Hyun; Choi, Jinsoo; Kweon, In So
Format: Online Article
Language: English
Published: 2022
Access to parent work: IEEE transactions on pattern analysis and machine intelligence
Subjects: Journal Article