Tag

visual grounding

0 views collected around this technical thread.

DaTaobao Tech
DaTaobao Tech
Nov 25, 2024 · Artificial Intelligence

Open‑Set Object Detection and Visual Grounding: Analysis of YOLO‑World, Grounding DINO, and YOLO11

The article surveys state‑of‑the‑art open‑set object detection and visual‑grounding models—Grounding DINO, YOLO‑World, and the latest YOLO 11—detailing their architectures, training strategies, and experimental results on home‑decoration datasets, showing that open‑set detectors recognize unseen objects while YOLO 11 excels on known categories, and that integrating both approaches yields superior performance, highlighting the expanded potential of detectors for real‑world applications.

Grounding DINOYOLO-WorldYOLO11
0 likes · 15 min read
Open‑Set Object Detection and Visual Grounding: Analysis of YOLO‑World, Grounding DINO, and YOLO11
Youku Technology
Youku Technology
Aug 6, 2020 · Artificial Intelligence

Recent ACM MM Papers Accepted by Alibaba Entertainment Group

Alibaba Entertainment Group secured four ACM MM paper acceptances, presenting a probabilistic graphical model for crowdsourced visual quality assessment, an attention‑driven Siamese network with reinforcement learning for robust object tracking, a scene‑aware context‑graph method for unsupervised video anomaly detection, and a cross‑modal graph‑matching approach for visual grounding.

Graph Neural NetworksObject Trackingcomputer vision
0 likes · 6 min read
Recent ACM MM Papers Accepted by Alibaba Entertainment Group