Joint Answering and Explanation for Visual Commonsense Reasoning
Visual Commonsense Reasoning (VCR), deemed as one challenging extension of Visual Question Answering (VQA), endeavors to pursue a higher-level visual comprehension. VCR includes two complementary processes: question answering over a given image and rationale inference for answering explanation. Over...
Ausführliche Beschreibung
Bibliographische Detailangaben
Veröffentlicht in: | IEEE transactions on image processing : a publication of the IEEE Signal Processing Society. - 1992. - 32(2023) vom: 06., Seite 3836-3846
|
1. Verfasser: |
Li, Zhenyang
(VerfasserIn) |
Weitere Verfasser: |
Guo, Yangyang,
Wang, Kejie,
Wei, Yinwei,
Nie, Liqiang,
Kankanhalli, Mohan |
Format: | Online-Aufsatz
|
Sprache: | English |
Veröffentlicht: |
2023
|
Zugriff auf das übergeordnete Werk: | IEEE transactions on image processing : a publication of the IEEE Signal Processing Society
|
Schlagworte: | Journal Article |