An Empirical Evaluation of the GPT-4 Multimodal Language Model on Visualization Literacy Tasks

Large Language Models (LLMs) like GPT-4 which support multimodal input (i.e., prompts containing images in addition to text) have immense potential to advance visualization research. However, many questions exist about the visual capabilities of such models, including how well they can read and inte...

Description complète

Détails bibliographiques
Publié dans:IEEE transactions on visualization and computer graphics. - 1996. - 31(2025), 1 vom: 28. Jan., Seite 1105-1115
Auteur principal: Bendeck, Alexander (Auteur)
Autres auteurs: Stasko, John
Format: Article en ligne
Langue:English
Publié: 2025
Accès à la collection:IEEE transactions on visualization and computer graphics
Sujets:Journal Article