Unified Static and Dynamic Network : Efficient Temporal Filtering for Video Grounding

Inspired by the activity-silent and persistent activity mechanisms in human visual perception biology, we design a Unified Static and Dynamic Network (UniSDNet), to learn the semantic association between the video and text/audio queries in a cross-modal environment for efficient video grounding. For...

Description complète

Détails bibliographiques
Publié dans:IEEE transactions on pattern analysis and machine intelligence. - 1979. - PP(2025) vom: 08. Apr.
Auteur principal: Hu, Jingjing (Auteur)
Autres auteurs: Guo, Dan, Li, Kun, Si, Zhan, Yang, Xun, Chang, Xiaojun, Wang, Meng
Format: Article en ligne
Langue:English
Publié: 2025
Accès à la collection:IEEE transactions on pattern analysis and machine intelligence
Sujets:Journal Article