Universal Multimodal Representation for Language Understanding
Representation learning is the foundation of natural language processing (NLP). This work presents new methods to employ visual information as assistant signals to general NLP tasks. For each sentence, we first retrieve a flexible number of images either from a light topic-image lookup table extract...
Ausführliche Beschreibung
Bibliographische Detailangaben
Veröffentlicht in: | IEEE transactions on pattern analysis and machine intelligence. - 1979. - 45(2023), 7 vom: 03. Juli, Seite 9169-9185
|
1. Verfasser: |
Zhang, Zhuosheng
(VerfasserIn) |
Weitere Verfasser: |
Chen, Kehai,
Wang, Rui,
Utiyama, Masao,
Sumita, Eiichiro,
Li, Zuchao,
Zhao, Hai |
Format: | Online-Aufsatz
|
Sprache: | English |
Veröffentlicht: |
2023
|
Zugriff auf das übergeordnete Werk: | IEEE transactions on pattern analysis and machine intelligence
|
Schlagworte: | Journal Article |