Video Moment Retrieval With Cross-Modal Neural Architecture Search

The task of video moment retrieval (VMR) is to retrieve the specific video moment from an untrimmed video, according to a textual query. It is a challenging task that requires effective modeling of complex cross-modal matching relationship. Recent efforts primarily model the cross-modal interactions...

Description complète

Détails bibliographiques
Publié dans:IEEE transactions on image processing : a publication of the IEEE Signal Processing Society. - 1992. - 31(2022) vom: 11., Seite 1204-1216
Auteur principal: Yang, Xun (Auteur)
Autres auteurs: Wang, Shanshan, Dong, Jian, Dong, Jianfeng, Wang, Meng, Chua, Tat-Seng
Format: Article en ligne
Langue:English
Publié: 2022
Accès à la collection:IEEE transactions on image processing : a publication of the IEEE Signal Processing Society
Sujets:Journal Article