Tagged articles
1 article
Alibaba Cloud Developer
Jul 19, 2018 · Artificial Intelligence

Can Generative Models Boost Visual‑Text Retrieval? Introducing GXN

This paper presents GXN, a generative cross‑modal feature learning framework that enhances image‑text retrieval by incorporating both high‑level semantic similarity and fine‑grained local matching through a three‑step Look‑Imagine‑Match process, achieving state‑of‑the‑art results on MSCOCO and Flickr30K.

Deep Learning · Generative Models · Artificial Intelligence
6 min read