Multilingual Text-to-Image Person Retrieval via Bidirectional Relation Reasoning and Aligning

Text-to-image person retrieval (TIPR) aims to identify the target person using textual descriptions, facing challenge in modality heterogeneity. Prior works have attempted to address it by developing cross-modal global or local alignment strategies. However, global methods typically overlook fine-gr...

Ausführliche Beschreibung

Bibliographische Detailangaben
Veröffentlicht in:IEEE transactions on pattern analysis and machine intelligence. - 1979. - PP(2025) vom: 10. Okt.
1. Verfasser: Cao, Min (VerfasserIn)
Weitere Verfasser: Zhou, Xinyu, Jiang, Ding, Du, Bo, Ye, Mang, Zhang, Min
Format: Online-Aufsatz
Sprache:English
Veröffentlicht: 2025
Zugriff auf das übergeordnete Werk:IEEE transactions on pattern analysis and machine intelligence
Schlagworte:Journal Article