Tag

DeepSeek-R1

0 views collected around this technical thread.

ByteDance Cloud Native
ByteDance Cloud Native
Mar 20, 2025 · Artificial Intelligence

How to Deploy DeepSeek‑R1 671B on AIBrix: Multi‑Node GPU Inference in Hours

This guide explains how to use the AIBrix distributed inference platform to deploy the massive DeepSeek‑R1 671B model across multiple GPU nodes, covering cluster setup, custom vLLM images, storage options, RDMA networking, autoscaling, request handling, and observability, turning a weeks‑long deployment into an hour‑scale process.

AIBrixDeepSeek-R1Distributed Inference
0 likes · 14 min read
How to Deploy DeepSeek‑R1 671B on AIBrix: Multi‑Node GPU Inference in Hours
Architect
Architect
Feb 27, 2025 · Artificial Intelligence

Understanding Inference Large Language Models: DeepSeek‑R1 and the Rise of Test‑Time Computation

This article explains how inference‑oriented large language models such as DeepSeek‑R1 and OpenAI o1‑mini shift AI research from training‑time scaling to test‑time computation, detailing the underlying principles, new scaling laws, verification techniques, reinforcement‑learning pipelines, and practical methods for distilling reasoning capabilities into smaller models.

DeepSeek-R1InferenceLarge Language Models
0 likes · 18 min read
Understanding Inference Large Language Models: DeepSeek‑R1 and the Rise of Test‑Time Computation
Tencent Technical Engineering
Tencent Technical Engineering
Feb 21, 2025 · Artificial Intelligence

DeepSeek-R1: Enhancing Reasoning Capabilities in LLMs via Reinforcement Learning

DeepSeek‑R1 demonstrates that large‑scale reinforcement learning, especially with the novel Group Relative Policy Optimization and a rule‑based reward scheme, can markedly boost reasoning in LLMs without heavy supervised fine‑tuning, while a brief cold‑start SFT phase, two‑stage alignment, and knowledge distillation further improve performance and efficiency, despite remaining challenges such as language mixing.

Cold StartDeepSeek-R1GRPO
0 likes · 21 min read
DeepSeek-R1: Enhancing Reasoning Capabilities in LLMs via Reinforcement Learning
Tencent Technical Engineering
Tencent Technical Engineering
Feb 19, 2025 · Artificial Intelligence

Reproduction and Analysis of DeepSeek R1/R1‑zero Reinforcement Learning Experiments

This note surveys four open‑source reproductions of DeepSeek R1/R1‑zero reinforcement‑learning pipelines, re‑implements their training on math and logic datasets using Qwen‑based models, shows that format‑plus‑accuracy rewards improve long‑chain reasoning though stability and scaling remain challenges, and outlines future directions for large‑scale RL and business deployment.

DeepSeek-R1large language modellong chain of thought
0 likes · 39 min read
Reproduction and Analysis of DeepSeek R1/R1‑zero Reinforcement Learning Experiments
Java Tech Enthusiast
Java Tech Enthusiast
Feb 15, 2025 · Artificial Intelligence

DeepSeek-R1: High-Performance AI Inference Model

DeepSeek‑R1 is a high‑performance AI inference model that leverages reinforcement‑learning techniques to boost reasoning on complex tasks, has become a Chinese‑New‑Year sensation, and requires substantial hardware resources for local deployment, especially the full‑scale 671‑billion‑parameter version.

AI deploymentAI inferenceAI model
0 likes · 4 min read
DeepSeek-R1: High-Performance AI Inference Model
Architect
Architect
Feb 6, 2025 · Artificial Intelligence

DeepSeek‑R1: Reinforcement‑Learning‑Driven Long‑Chain Reasoning for Large Language Models

The article reviews DeepSeek‑R1, detailing its reinforcement‑learning‑based training pipeline that uses minimal supervised data, cold‑start fine‑tuning, multi‑stage RL, rejection‑sampling SFT, and distillation to achieve reasoning performance comparable to OpenAI‑o1‑1217, while also discussing successful contributions and failed experiments.

AI researchDeepSeek-R1LLM reasoning
0 likes · 11 min read
DeepSeek‑R1: Reinforcement‑Learning‑Driven Long‑Chain Reasoning for Large Language Models
Code Mala Tang
Code Mala Tang
Feb 5, 2025 · Artificial Intelligence

How O3-mini Stacks Up Against DeepSeek‑R1: Speed, Coding Power, and STEM Reasoning

OpenAI's newly released O3-mini and O3-mini‑high models outperform DeepSeek‑R1 in coding benchmarks, offer faster STEM reasoning, and are accessible to free users, while DeepSeek‑R1 remains a cost‑effective open‑source alternative with strong reasoning capabilities.

AI model comparisonDeepSeek-R1O3-mini
0 likes · 8 min read
How O3-mini Stacks Up Against DeepSeek‑R1: Speed, Coding Power, and STEM Reasoning
Architect
Architect
Feb 2, 2025 · Artificial Intelligence

Deploying DeepSeek‑R1 Locally with Ollama and Accessing It via Spring Boot and Spring AI

This guide explains how to install Ollama, download and run the open‑source DeepSeek‑R1 language model locally, configure GPU acceleration, and integrate the model into a Spring Boot application using Spring AI to provide an API service for AI inference.

AI model deploymentDeepSeek-R1GPU Acceleration
0 likes · 12 min read
Deploying DeepSeek‑R1 Locally with Ollama and Accessing It via Spring Boot and Spring AI
Tencent Cloud Developer
Tencent Cloud Developer
Feb 2, 2025 · Artificial Intelligence

Deploying DeepSeek-R1 Models on Tencent Cloud HAI Platform

Deploy DeepSeek‑R1 models on Tencent Cloud HAI in just three minutes by logging in, creating an application, and accessing the model via ChatbotUI or JupyterLab, without purchasing GPUs or configuring environments, while also leveraging integrated services like Cloud Studio and Object Storage for enterprise AI solutions.

AI model deploymentChatbotUIDeepSeek-R1
0 likes · 3 min read
Deploying DeepSeek-R1 Models on Tencent Cloud HAI Platform
ZhongAn Tech Team
ZhongAn Tech Team
Jan 26, 2025 · Artificial Intelligence

Weekly AI Digest – Issue 12

This issue covers the Stargate AI initiative announced by Trump and Altman, Musk’s criticism, Tencent’s Hunyuan 3D AI creation engine launch, and DeepSeek’s release of the open‑source reasoning model R1 with its technical innovations.

3D AIAIDeepSeek-R1
0 likes · 10 min read
Weekly AI Digest – Issue 12