Data Party THU
Data Party THU
Sep 28, 2025 · Artificial Intelligence

How YOLO-Count Enables Precise Object Counting in Text-to-Image Generation

This article reviews the YOLO-Count model, a fully differentiable, open‑vocabulary object counting system that guides text‑to‑image generators to produce the exact number of objects specified in prompts, achieving state‑of‑the‑art results on both generic counting and controlled image synthesis tasks.

Generative AIVision-LanguageYOLO-Count
0 likes · 8 min read
How YOLO-Count Enables Precise Object Counting in Text-to-Image Generation
AI Frontier Lectures
AI Frontier Lectures
Sep 7, 2025 · Artificial Intelligence

How YOLO-Count Enables Precise Object Counting in Text-to-Image Generation

YOLO-Count introduces a fully differentiable, open‑vocabulary object counting model that guides text‑to‑image generators to produce the exact number of objects specified in prompts, achieving state‑of‑the‑art performance on both generic counting and controlled image synthesis tasks.

Generative AIYOLO-Countdifferentiable models
0 likes · 8 min read
How YOLO-Count Enables Precise Object Counting in Text-to-Image Generation
AIWalker
AIWalker
May 22, 2025 · Artificial Intelligence

VisionReasoner: RL‑Unified System Beats YOLO‑World on Detection, Segmentation, Counting

VisionReasoner introduces a reinforcement‑learning‑driven unified framework that simultaneously handles detection, segmentation, and counting tasks within a single model, achieving 29.1% higher COCO detection AP, 22.1% better ReasonSeg segmentation, and 15.3% improvement on CountBench, while requiring only 7,000 training samples and offering efficient multi‑target matching via batch computation and the Hungarian algorithm.

LVLMVisionReasonerimage segmentation
0 likes · 19 min read
VisionReasoner: RL‑Unified System Beats YOLO‑World on Detection, Segmentation, Counting