Advancing Multimodal Large Language Models in Chart Question Answering with Visualization-Referenced Instruction Tuning
Emerging multimodal large language models (MLLMs) exhibit great potential for chart question answering (CQA). Recent efforts primarily focus on scaling up training datasets (i.e., charts, data tables, and question-answer (QA) pairs) through data collection and synthesis. However, our empirical study...
Veröffentlicht in: | IEEE transactions on visualization and computer graphics. - 1996. - PP(2024) vom: 10. Sept. |
---|---|
1. Verfasser: | |
Weitere Verfasser: | , , |
Format: | Online-Aufsatz |
Sprache: | English |
Veröffentlicht: |
2024
|
Zugriff auf das übergeordnete Werk: | IEEE transactions on visualization and computer graphics |
Schlagworte: | Journal Article |
Online verfügbar |
Volltext |