Knowledge-Augmented Visual Question Answering With Natural Language Explanation
Visual question answering with natural language explanation (VQA-NLE) is a challenging task that requires models to not only generate accurate answers but also to provide explanations that justify the relevant decision-making processes. This task is accomplished by generating natural language senten...
Ausführliche Beschreibung
Bibliographische Detailangaben
Veröffentlicht in: | IEEE transactions on image processing : a publication of the IEEE Signal Processing Society. - 1992. - 33(2024) vom: 28., Seite 2652-2664
|
1. Verfasser: |
Xie, Jiayuan
(VerfasserIn) |
Weitere Verfasser: |
Cai, Yi,
Chen, Jiali,
Xu, Ruohang,
Wang, Jiexin,
Li, Qing |
Format: | Online-Aufsatz
|
Sprache: | English |
Veröffentlicht: |
2024
|
Zugriff auf das übergeordnete Werk: | IEEE transactions on image processing : a publication of the IEEE Signal Processing Society
|
Schlagworte: | Journal Article |