Fei, H., Wu, S., Zhang, M., Zhang, M., Chua, T.-S., & Yan, S. (2024). Enhancing Video-Language Representations With Structural Spatio-Temporal Alignment. IEEE Transactions on Pattern Analysis and Machine Intelligence, 46(12), 7701. https://doi.org/10.1109/TPAMI.2024.3393452
Chicago citation style: Fei, Hao, Shengqiong Wu, Meishan Zhang, Min Zhang, Tat-Seng Chua, and Shuicheng Yan. "Enhancing Video-Language Representations With Structural Spatio-Temporal Alignment." IEEE Transactions on Pattern Analysis and Machine Intelligence 46, no. 12 (2024): 7701. https://dx.doi.org/10.1109/TPAMI.2024.3393452.
MLA citation style: Fei, Hao, et al. "Enhancing Video-Language Representations With Structural Spatio-Temporal Alignment." IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 46, no. 12, 2024, p. 7701.