Knowledge-Augmented Visual Question Answering With Natural Language Explanation

Visual question answering with natural language explanation (VQA-NLE) is a challenging task that requires models to not only generate accurate answers but also to provide explanations that justify the relevant decision-making processes. This task is accomplished by generating natural language senten...

Ausführliche Beschreibung

Bibliographische Detailangaben
Veröffentlicht in:IEEE transactions on image processing : a publication of the IEEE Signal Processing Society. - 1992. - 33(2024) vom: 28., Seite 2652-2664
1. Verfasser: Xie, Jiayuan (VerfasserIn)
Weitere Verfasser: Cai, Yi, Chen, Jiali, Xu, Ruohang, Wang, Jiexin, Li, Qing
Format: Online-Aufsatz
Sprache:English
Veröffentlicht: 2024
Zugriff auf das übergeordnete Werk:IEEE transactions on image processing : a publication of the IEEE Signal Processing Society
Schlagworte:Journal Article