Tagged articles

GroundingCap

1 articles · Page 1 of 1
AIWalker
AIWalker
Feb 11, 2025 · Artificial Intelligence

LLMDet: LLM‑Powered Open‑Vocabulary Detector Beats Grounding DINO

LLMDet introduces a novel training pipeline that leverages large language models to generate detailed image‑level captions and region‑level phrases, fine‑tunes an open‑vocabulary detector with the GroundingCap‑1M dataset, and achieves state‑of‑the‑art zero‑shot performance surpassing Grounding DINO across multiple benchmarks.

GroundingCapLLMDetOpen-Vocabulary Detection
0 likes · 20 min read
LLMDet: LLM‑Powered Open‑Vocabulary Detector Beats Grounding DINO