Multimodal Compact Bilinear Pooling for Visual Question Answering and Visual Grounding

Akira Fukui, Dong Huk Park, Daylen Yang, Anna Rohrbach, Trevor Darrell, Marcus Rohrbach


Anthology ID:
D16-1044
Volume:
Proceedings of the 2016 Conference on Empirical Methods in Natural Language Processing
Month:
November
Year:
2016
Address:
Austin, Texas
Venue:
EMNLP
SIG:
SIGDAT
Publisher:
Association for Computational Linguistics
Pages:
457–468
URL:
https://www.aclweb.org/anthology/D16-1044
DOI:
10.18653/v1/D16-1044
PDF:
http://aclanthology.lst.uni-saarland.de/D16-1044.pdf