Tagged articles
3 articles
Page 1 of 1
AIWalker
AIWalker
Sep 24, 2025 · Artificial Intelligence

Top 2025 Object Detection Research Paths: From Grounding DINO 1.5 to Open‑Set Breakthroughs

The article outlines four key innovation avenues—architecture redesign, task expansion, information fusion, and paradigm shift—highlighting recent works such as Mr. DETR, Grounding DINO 1.5, SM3Det, and RoboFusion, and offers a curated list of 176 cutting‑edge object‑detection papers with code and datasets for free.

Deep LearningModel architectureobject detection
0 likes · 8 min read
Top 2025 Object Detection Research Paths: From Grounding DINO 1.5 to Open‑Set Breakthroughs
DaTaobao Tech
DaTaobao Tech
Nov 25, 2024 · Artificial Intelligence

Open‑Set Object Detection and Visual Grounding: Analysis of YOLO‑World, Grounding DINO, and YOLO11

The article surveys state‑of‑the‑art open‑set object detection and visual‑grounding models—Grounding DINO, YOLO‑World, and the latest YOLO 11—detailing their architectures, training strategies, and experimental results on home‑decoration datasets, showing that open‑set detectors recognize unseen objects while YOLO 11 excels on known categories, and that integrating both approaches yields superior performance, highlighting the expanded potential of detectors for real‑world applications.

Computer VisionDeep LearningGrounding DINO
0 likes · 15 min read
Open‑Set Object Detection and Visual Grounding: Analysis of YOLO‑World, Grounding DINO, and YOLO11
Huolala Tech
Huolala Tech
Jul 21, 2023 · Artificial Intelligence

Visual Language Models Power Open-Set Detection and Surgical Tool Segmentation

Recent advances in visual language models enable zero-shot multimodal tasks, and this article explores their application to open-set object detection, prompt learning, and promptable surgical instrument segmentation, highlighting methods like CLIP, CoOp, and the DetPro framework with experimental results across multiple benchmarks.

Computer VisionMultimodalVisual-Language Models
0 likes · 12 min read
Visual Language Models Power Open-Set Detection and Surgical Tool Segmentation