Tag

test-time computation


Architect
Feb 27, 2025 · Artificial Intelligence

Understanding Inference Large Language Models: DeepSeek‑R1 and the Rise of Test‑Time Computation

This article explains how reasoning‑oriented large language models such as DeepSeek‑R1 and OpenAI o1‑mini shift AI research from training‑time scaling to test‑time computation. It covers the underlying principles, new scaling laws, verification techniques, reinforcement‑learning pipelines, and practical methods for distilling reasoning capabilities into smaller models.

DeepSeek-R1 · Inference · Prompt Engineering
18 min read