论文阅读:Learning Visual Question Answering by Bootstrapping Hard Attention

Learning Visual Question Answering by Bootstrapping Hard Attention

Google DeepMind  ECCV-2018

  2018-08-05 19:24:44

 

Paperhttps://arxiv.org/abs/1808.00300 

 

论文阅读:Learning Visual Question Answering by Bootstrapping Hard Attention

 

Introduction

本文尝试仅仅用 hard attention 的方法来抠出最有用的 feature,进行 VQA 任务的学习。

Soft Attention:   

  Existing attention models [7,8,9,10] are predominantly based on soft attention, in which all information is adaptively re-weighted before being aggregated. This can improve accuracy by isolating important information and avoiding interference from unimportant information. 

Hard Attention

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

--

论文阅读:Learning Visual Question Answering by Bootstrapping Hard Attention

上一篇:半夜钱款莫名被转走!睡觉手机到底该不该关机?安全专家解读新型网络盗窃!


下一篇:注册页面手机验证码无跳转接收[html+js+ajax+php]