Meituan Technology Team
Oct 15, 2020 · Artificial Intelligence
Answer-Driven Visual State Estimator for Goal-Oriented Visual Dialogue
The paper introduces the Answer‑Driven Visual State Estimator (ADVSE), which uses answer‑driven focusing attention and conditional visual information fusion to dynamically incorporate answers into visual dialogue, overcoming static encoding limitations and achieving state‑of‑the‑art performance on the GuessWhat?! question‑generation and guessing tasks.
Attention Mechanism · goal-oriented · multimodal AI
10 min read
