|
|
|
|
LEADER |
01000caa a22002652 4500 |
001 |
NLM28636171X |
003 |
DE-627 |
005 |
20240229161827.0 |
007 |
cr uuu---uuuuu |
008 |
231225s2018 xx |||||o 00| ||eng c |
024 |
7 |
|
|a 10.1109/TPAMI.2018.2828437
|2 doi
|
028 |
5 |
2 |
|a pubmed24n1308.xml
|
035 |
|
|
|a (DE-627)NLM28636171X
|
035 |
|
|
|a (NLM)29993628
|
040 |
|
|
|a DE-627
|b ger
|c DE-627
|e rakwb
|
041 |
|
|
|a eng
|
100 |
1 |
|
|a Das, Abhishek
|e verfasserin
|4 aut
|
245 |
1 |
0 |
|a Visual Dialog
|
264 |
|
1 |
|c 2018
|
336 |
|
|
|a Text
|b txt
|2 rdacontent
|
337 |
|
|
|a Computermedien
|b c
|2 rdamedia
|
338 |
|
|
|a Online-Ressource
|b cr
|2 rdacarrier
|
500 |
|
|
|a Date Revised 27.02.2024
|
500 |
|
|
|a published: Print-Electronic
|
500 |
|
|
|a Citation Status Publisher
|
520 |
|
|
|a We introduce the task of Visual Dialog, which requires an AI agent to hold a meaningful dialog with humans in natural, conversational language about visual content. Specifically, given an image, a dialog history, and a question about the image, the agent has to ground the question in the image, infer context from history, and answer the question accurately. Visual Dialog is disentangled enough from a specific downstream task so as to serve as a general test of machine intelligence, while being sufficiently grounded in vision to allow objective evaluation of individual responses and benchmark progress. We develop a novel two-person real-time chat data-collection protocol to curate a large-scale Visual Dialog dataset (VisDial). VisDial v0.9 has been released and consists of dialog question-answer pairs from 10-round, human-human dialogs grounded in images from the COCO dataset.
|
650 |
|
4 |
|a Journal Article
|
700 |
1 |
|
|a Kottur, Satwik
|e verfasserin
|4 aut
|
700 |
1 |
|
|a Gupta, Khushi
|e verfasserin
|4 aut
|
700 |
1 |
|
|a Singh, Avi
|e verfasserin
|4 aut
|
700 |
1 |
|
|a Yadav, Deshraj
|e verfasserin
|4 aut
|
700 |
1 |
|
|a Lee, Stefan
|e verfasserin
|4 aut
|
700 |
1 |
|
|a Moura, Jose
|e verfasserin
|4 aut
|
700 |
1 |
|
|a Parikh, Devi
|e verfasserin
|4 aut
|
700 |
1 |
|
|a Batra, Dhruv
|e verfasserin
|4 aut
|
773 |
0 |
8 |
|i Enthalten in
|t IEEE transactions on pattern analysis and machine intelligence
|d 1979
|g (2018) vom: 19. Apr.
|w (DE-627)NLM098212257
|x 1939-3539
|7 nnns
|
773 |
1 |
8 |
|g year:2018
|g day:19
|g month:04
|
856 |
4 |
0 |
|u http://dx.doi.org/10.1109/TPAMI.2018.2828437
|3 Volltext
|
912 |
|
|
|a GBV_USEFLAG_A
|
912 |
|
|
|a SYSFLAG_A
|
912 |
|
|
|a GBV_NLM
|
912 |
|
|
|a GBV_ILN_350
|
951 |
|
|
|a AR
|
952 |
|
|
|j 2018
|b 19
|c 04
|