UNK-VQA : A Dataset and a Probe into the Abstention Ability of Multi-modal Large Models

Teaching Visual Question Answering (VQA) models to refrain from answering unanswerable questions is necessary for building a trustworthy AI system. Existing studies, though have explored various aspects of VQA but somewhat ignored this particular attribute. This paper aims to bridge the research gap...

Ausführliche Beschreibung

Bibliographische Detailangaben
Veröffentlicht in:IEEE transactions on pattern analysis and machine intelligence. - 1979. - PP(2024) vom: 02. Aug.
1. Verfasser: Guo, Yangyang (VerfasserIn)
Weitere Verfasser: Jiao, Fangkai, Shen, Zhiqi, Nie, Liqiang, Kankanhalli, Mohan
Format: Online-Aufsatz
Sprache:English
Veröffentlicht: 2024
Zugriff auf das übergeordnete Werk:IEEE transactions on pattern analysis and machine intelligence
Schlagworte:Journal Article