Loss Re-Scaling VQA : Revisiting the Language Prior Problem From a Class-Imbalance View

Recent studies have pointed out that many well-developed Visual Question Answering (VQA) models are heavily affected by the language prior problem. It refers to making predictions based on the co-occurrence pattern between textual questions and answers instead of reasoning upon visual contents. To t...

Ausführliche Beschreibung

Bibliographische Detailangaben
Veröffentlicht in:IEEE transactions on image processing : a publication of the IEEE Signal Processing Society. - 1992. - 31(2022) vom: 03., Seite 227-238
1. Verfasser: Guo, Yangyang (VerfasserIn)
Weitere Verfasser: Nie, Liqiang, Cheng, Zhiyong, Tian, Qi, Zhang, Min
Format: Online-Aufsatz
Sprache:English
Veröffentlicht: 2022
Zugriff auf das übergeordnete Werk:IEEE transactions on image processing : a publication of the IEEE Signal Processing Society
Schlagworte:Journal Article