DaTaobao Tech
May 27, 2022 · Artificial Intelligence
Multimodal Pretraining for Search Recall in E-commerce
The paper proposes a multimodal pre‑training framework that jointly encodes query text and item titles with images via shared and single‑stream towers, using MLM, MPM, QIC, and matching tasks, and demonstrates substantial Recall@K gains on a billion‑item e‑commerce catalog by leveraging visual cues to bridge the semantic gap.
PretrainingVector Retrievaldeep learning
0 likes · 17 min read