DaTaobao Tech
May 24, 2022 · Artificial Intelligence
GEN-VLKT: Simplify Association and Enhance Interaction Understanding for HOI Detection
GEN-VLKT introduces a Guided-Embedding Network (GEN) that uses position-guided and instance-guided embeddings to remove costly post-processing, and it applies CLIP-based Visual-Linguistic Knowledge Transfer (VLKT) to improve interaction understanding. The method achieves state-of-the-art human-object interaction (HOI) detection performance, supports zero-shot HOI detection, and is now deployed in Alibaba's Taobao services.
CLIP · Computer Vision · HOI detection