M3D : a Multimodal, Multilingual and Multitask Dataset for Grounded Document-level Information Extraction
Multimodal information extraction (IE) tasks have attracted increasing attention because many studies have shown that multimodal information benefits text information extraction. However, existing multimodal IE datasets mainly focus on sentence-level image-facilitated IE in English text, and pay lit...
Ausführliche Beschreibung
Bibliographische Detailangaben
| Veröffentlicht in: | IEEE transactions on pattern analysis and machine intelligence. - 1979. - PP(2025) vom: 11. Sept.
|
| 1. Verfasser: |
Liu, Jiang
(VerfasserIn) |
| Weitere Verfasser: |
Li, Bobo,
Yang, Xinran,
Yang, Na,
Fei, Hao,
Zhang, Mingyao,
Li, Fei,
Ji, Donghong |
| Format: | Online-Aufsatz
|
| Sprache: | English |
| Veröffentlicht: |
2025
|
| Zugriff auf das übergeordnete Werk: | IEEE transactions on pattern analysis and machine intelligence
|
| Schlagworte: | Journal Article |