M3D : a Multimodal, Multilingual and Multitask Dataset for Grounded Document-level Information Extraction

Multimodal information extraction (IE) tasks have attracted increasing attention because many studies have shown that multimodal information benefits text information extraction. However, existing multimodal IE datasets mainly focus on sentence-level image-facilitated IE in English text, and pay lit...

Ausführliche Beschreibung

Bibliographische Detailangaben
Veröffentlicht in:IEEE transactions on pattern analysis and machine intelligence. - 1979. - PP(2025) vom: 11. Sept.
1. Verfasser: Liu, Jiang (VerfasserIn)
Weitere Verfasser: Li, Bobo, Yang, Xinran, Yang, Na, Fei, Hao, Zhang, Mingyao, Li, Fei, Ji, Donghong
Format: Online-Aufsatz
Sprache:English
Veröffentlicht: 2025
Zugriff auf das übergeordnete Werk:IEEE transactions on pattern analysis and machine intelligence
Schlagworte:Journal Article