Rethinking and Improving Feature Pyramids for One-Stage Referring Expression Comprehension

Referring Expression Comprehension (REC) is an important task in the vision-and-language community, since it is an essential step for many cross-modal tasks such as VQA, image retrieval and image caption. To obtain a better trade-off between speed and accuracy, existing researches usually follow a o...

Description complète

Détails bibliographiques
Publié dans:IEEE transactions on image processing : a publication of the IEEE Signal Processing Society. - 1992. - 32(2023) vom: 06., Seite 854-864
Auteur principal: Suo, Wei (Auteur)
Autres auteurs: Sun, Mengyang, Wang, Peng, Zhang, Yanning, Wu, Qi
Format: Article en ligne
Langue:English
Publié: 2023
Accès à la collection:IEEE transactions on image processing : a publication of the IEEE Signal Processing Society
Sujets:Journal Article