MemBridge : Video-Language Pre-Training With Memory-Augmented Inter-Modality Bridge
Video-language pre-training has attracted considerable attention recently for its promising performance on various downstream tasks. Most existing methods utilize the modality-specific or modality-joint representation architectures for the cross-modality pre-training. Different from previous methods...
Ausführliche Beschreibung
Bibliographische Detailangaben
Veröffentlicht in: | IEEE transactions on image processing : a publication of the IEEE Signal Processing Society. - 1992. - 32(2023) vom: 20., Seite 4073-4087
|
1. Verfasser: |
Yang, Jiahao
(VerfasserIn) |
Weitere Verfasser: |
Li, Xiangyang,
Zheng, Mao,
Wang, Zihan,
Zhu, Yongqing,
Guo, Xiaoqian,
Yuan, Yuchen,
Chai, Zifeng,
Jiang, Shuqiang |
Format: | Online-Aufsatz
|
Sprache: | English |
Veröffentlicht: |
2023
|
Zugriff auf das übergeordnete Werk: | IEEE transactions on image processing : a publication of the IEEE Signal Processing Society
|
Schlagworte: | Journal Article |