LVLM-EHub: A Comprehensive Evaluation Benchmark for Large Vision-Language Models
Large Vision-Language Models (LVLMs) have recently played a dominant role in multimodal vision-language learning. Despite this success, a holistic evaluation of their efficacy is still lacking. This paper presents a comprehensive evaluation of publicly available large multimodal models by building an L...
Bibliographic Details
| Published in: | IEEE transactions on pattern analysis and machine intelligence. - 1979. - PP(2024), dated 27 Nov. |
| Main author: | Xu, Peng (author) |
| Other authors: | Shao, Wenqi; Zhang, Kaipeng; Gao, Peng; Liu, Shuo; Lei, Meng; Meng, Fanqing; Huang, Siyuan; Qiao, Yu; Luo, Ping |
| Format: | Online article |
| Language: | English |
| Published: | 2024 |
| Access to the parent work: | IEEE transactions on pattern analysis and machine intelligence |
| Subjects: | Journal Article |