Hadamard product for low-rank bilinear pooling

Cited 0 time in webofscience Cited 0 time in scopus
  • Hit : 188
  • Download : 0
Bilinear models provide rich representations compared with linear models. They have been applied in various visual tasks, such as object recognition, segmentation, and visual question-answering, to get state-of-the-art performances taking advantage of the expanded representations. However, bilinear representations tend to be high-dimensional, limiting the applicability to computationally complex tasks. We propose low-rank bilinear pooling using Hadamard product for an efficient attention mechanism of multimodal learning. We show that our model outperforms compact bilinear pooling in visual question-answering tasks with the state-of-the-art results on the VQA dataset, having a better parsimonious property.
Publisher
International Conference on Learning Representations, ICLR
Issue Date
2017-04
Language
English
Citation

5th International Conference on Learning Representations, ICLR 2017

URI
http://hdl.handle.net/10203/310286
Appears in Collection
RIMS Conference Papers
Files in This Item
There are no files associated with this item.

qr_code

  • mendeley

    citeulike


rss_1.0 rss_2.0 atom_1.0