VisQA : X-raying Vision and Language Reasoning in Transformers
Visual Question Answering systems target answering open-ended textual questions given input images. They are a testbed for learning high-level reasoning with a primary use in HCI, for instance assistance for the visually impaired. Recent research has shown that state-of-the-art models tend to produc...
Ausführliche Beschreibung
Bibliographische Detailangaben
Veröffentlicht in: | IEEE transactions on visualization and computer graphics. - 1996. - 28(2022), 1 vom: 01. Jan., Seite 976-986
|
1. Verfasser: |
Jaunet, Theo
(VerfasserIn) |
Weitere Verfasser: |
Kervadec, Corentin,
Vuillemot, Romain,
Antipov, Grigory,
Baccouche, Moez,
Wolf, Christian |
Format: | Online-Aufsatz
|
Sprache: | English |
Veröffentlicht: |
2022
|
Zugriff auf das übergeordnete Werk: | IEEE transactions on visualization and computer graphics
|
Schlagworte: | Journal Article |