Tuning Vision-Language Models With Multiple Prototypes Clustering
Benefiting from advances in large-scale pre-training, foundation models have demonstrated remarkable capability in natural language processing, computer vision, and other fields. However, to achieve expert-level performance in specific applications, such models often need to be fine-tuned...
Bibliographic Details
| Published in: | IEEE transactions on pattern analysis and machine intelligence. - 1979. - 46(2024), no. 12, 13 Dec., pp. 11186-11199 |
| Main author: | Guo, Meng-Hao (author) |
| Other authors: | Zhang, Yi; Mu, Tai-Jiang; Huang, Sharon X; Hu, Shi-Min |
| Format: | Online article |
| Language: | English |
| Published: | 2024 |
| Access to parent work: | IEEE transactions on pattern analysis and machine intelligence |
| Subjects: | Journal Article |