LVLM-EHub : A Comprehensive Evaluation Benchmark for Large Vision-Language Models
Large Vision-Language Models (LVLMs) have recently played a dominant role in multimodal vision-language learning. Despite the great success, it lacks a holistic evaluation of their efficacy. This paper presents a comprehensive evaluation of publicly available large multimodal models by building an L...
Description complète
Détails bibliographiques
| Publié dans: | IEEE transactions on pattern analysis and machine intelligence. - 1979. - PP(2024) vom: 27. Nov.
|
| Auteur principal: |
Xu, Peng
(Auteur) |
| Autres auteurs: |
Shao, Wenqi,
Zhang, Kaipeng,
Gao, Peng,
Liu, Shuo,
Lei, Meng,
Meng, Fanqing,
Huang, Siyuan,
Qiao, Yu,
Luo, Ping |
| Format: | Article en ligne
|
| Langue: | English |
| Publié: |
2024
|
| Accès à la collection: | IEEE transactions on pattern analysis and machine intelligence
|
| Sujets: | Journal Article |