Tagged articles

DeepSeek-R1

21 articles

Jun 13, 2026 · Artificial Intelligence

What Is an Inference Large Language Model? A Visual Guide

The article explains reasoning‑oriented ("inference") large language models: how they differ from conventional models by breaking a question into explicit reasoning steps, why compute is shifting from training time to test time, what the scaling laws imply, how verification techniques and modified proposal distributions are used, and how DeepSeek‑R1 was trained. It also discusses failed experiments and future directions.

DeepSeek-R1 · inference models · large language models

0 likes · 20 min read

AI Explorer

Mar 27, 2026 · Artificial Intelligence

Why Tsinghua’s Multi‑Intelligence DeepSeek‑R1 Shifts AI from Depth to Width

Tsinghua University and WuWen XinQiong unveil DeepSeek‑R1, a multi‑model AI architecture that prioritizes width over depth, enabling parallel expert models to tackle complex, multi‑format data, addressing single‑model limitations while attracting significant industry investment and posing new engineering challenges.

AI Architecture · DeepSeek-R1 · Multi-Model

0 likes · 7 min read

Ubuntu

Jan 26, 2026 · Artificial Intelligence

Build a Fully Private Ubuntu AI Assistant with DeepSeek‑R1 and AnythingLLM (No Internet Needed)

This guide walks you through installing Ollama on Ubuntu, loading the open‑source DeepSeek‑R1 model, configuring AnythingLLM as a local RAG system, and testing it offline so the AI can answer questions from your private documents without any data ever leaving your machine.

AnythingLLM · DeepSeek-R1 · Ollama

0 likes · 6 min read

HyperAI Super Neural

Sep 18, 2025 · Artificial Intelligence

DeepSeek‑R1 Costs $294K to Train, Hits Nature Cover as First Peer‑Reviewed Large Model

DeepSeek‑R1, the first mainstream large language model to pass peer review in Nature, was trained for $294,000 on 512 (64×8) H800 GPUs. DeepSeek‑R1‑Zero, the variant trained purely with reinforcement learning, reached 86.7% on AIME 2024 with self‑consistency decoding, outperforming human averages across math, coding, and science tasks.

AI research · DeepSeek-R1 · Large Language Model

0 likes · 10 min read

Liangxu Linux

Apr 28, 2025 · Artificial Intelligence

Deploy DeepSeek‑R1 on Your Server in 15 Minutes with Zero Code

This guide shows how to use the lightweight OpenStation platform to install, configure, and launch the DeepSeek‑R1 large model on a personal server in under 15 minutes, covering zero‑code deployment, resource management, inference engine selection, and integration with CherryStudio.

AI model deployment · CherryStudio · DeepSeek-R1

0 likes · 7 min read

ByteDance Cloud Native

Mar 20, 2025 · Artificial Intelligence

How to Deploy DeepSeek‑R1 671B on AIBrix: Multi‑Node GPU Inference in Hours

This guide explains how to use the AIBrix distributed inference platform to deploy the massive DeepSeek‑R1 671B model across multiple GPU nodes, covering cluster setup, custom vLLM images, storage options, RDMA networking, autoscaling, request handling, and observability, turning a weeks‑long deployment into an hour‑scale process.

AIBrix · DeepSeek-R1 · Distributed Inference

0 likes · 14 min read

Meituan Technology Team

Mar 6, 2025 · Artificial Intelligence

INT8 Quantization and Inference Optimization of DeepSeek R1 Model

Meituan’s search and recommendation team converted the DeepSeek‑R1 model, which was released only in FP8, to INT8 by first casting the weights to BF16 and then applying block‑wise or channel‑wise quantization. The INT8 models preserve GSM8K and MMLU accuracy while delivering 33% to 50% higher throughput on A100‑80G GPUs. The team publicly released the SGLang‑based inference scripts and the quantized weights, enabling deployment on older NVIDIA hardware without accuracy loss.

DeepSeek-R1 · GPU deployment · INT8 Quantization

0 likes · 11 min read

Architect

Feb 27, 2025 · Artificial Intelligence

Understanding Inference Large Language Models: DeepSeek‑R1 and the Rise of Test‑Time Computation

This article explains how reasoning‑oriented ("inference") large language models such as DeepSeek‑R1 and OpenAI o1‑mini shift AI research from training‑time scaling to test‑time computation, detailing the underlying principles, new scaling laws, verification techniques, reinforcement‑learning pipelines, and practical methods for distilling reasoning capabilities into smaller models.

DeepSeek-R1 · inference · large language models

0 likes · 18 min read

AIWalker

Feb 27, 2025 · Artificial Intelligence

Step-by-Step Guide to Deploying, Testing, and Optimizing DeepSeek‑R1: A Complete Tutorial

This article provides a comprehensive, hands‑on guide for installing and configuring DeepSeek‑R1 with Ollama and vLLM, setting up multi‑node multi‑GPU environments, running performance benchmarks, optimizing runtime parameters, and even generating a full PyTorch distributed‑training script.

DeepSeek-R1 · GPU Optimization · LLM deployment

0 likes · 39 min read

Architect

Feb 22, 2025 · Artificial Intelligence

How Open‑Source Projects Reproduced DeepSeek‑R1 and Pushed LLM Limits

This article reviews the most notable open‑source reproductions of DeepSeek‑R1—including Open R1, OpenThoughts, LIMO and DeepScaleR—detailing their data pipelines, training steps, reinforcement‑learning strategies, dataset constructions, and benchmark results that demonstrate how small, high‑quality data can rival massive‑scale models.

AI research · Dataset Construction · DeepSeek-R1

0 likes · 26 min read

Tencent Technical Engineering

Feb 19, 2025 · Artificial Intelligence

Reproduction and Analysis of DeepSeek R1/R1‑zero Reinforcement Learning Experiments

This note surveys four open‑source reproductions of DeepSeek R1/R1‑zero reinforcement‑learning pipelines, re‑implements their training on math and logic datasets using Qwen‑based models, shows that format‑plus‑accuracy rewards improve long‑chain reasoning though stability and scaling remain challenges, and outlines future directions for large‑scale RL and business deployment.

DeepSeek-R1 · Large Language Model · long chain of thought

0 likes · 39 min read

AI Algorithm Path

Feb 18, 2025 · Artificial Intelligence

Build DeepSeek‑R1 from Scratch: Complete Training Process with Code Walkthrough

This article provides a step‑by‑step, code‑first guide to reproducing DeepSeek‑R1 from the ground up, covering model selection, dataset preparation, custom reward functions, GRPO reinforcement‑learning training, supervised fine‑tuning, reasoning‑oriented RL, rejection sampling, and model distillation.

DeepSeek-R1 · LLM training · Python

0 likes · 48 min read

Java Tech Enthusiast

Feb 15, 2025 · Artificial Intelligence

DeepSeek-R1: High-Performance AI Inference Model

DeepSeek‑R1 is a high‑performance AI inference model that leverages reinforcement‑learning techniques to boost reasoning on complex tasks, has become a Chinese‑New‑Year sensation, and requires substantial hardware resources for local deployment, especially the full‑scale 671‑billion‑parameter version.

AI Deployment · AI inference · AI model

0 likes · 4 min read

AI Algorithm Path

Feb 13, 2025 · Artificial Intelligence

How to Build a Local RAG Knowledge Base with DeepSeek‑R1 and Ollama

This article walks through setting up a local Retrieval‑Augmented Generation (RAG) system using the open‑source DeepSeek‑R1 model run via Ollama, covering installation, model selection, PDF ingestion with LangChain, semantic chunking, FAISS vector store creation, RetrievalQA chain construction, and a Streamlit UI for querying.

DeepSeek-R1 · FAISS · LangChain

0 likes · 8 min read

Fun with Large Models

Feb 12, 2025 · Artificial Intelligence

Build a Local DeepSeek‑R1 Large Model Service with Ollama – Intro to AI LLMs

This guide walks through installing Ollama on Windows, configuring the OLLAMA_MODELS path, downloading the 7B DeepSeek‑R1 model, running it locally, and accessing it from a browser with the Page Assist extension. It provides step‑by‑step commands, screenshots, and tips for offline setups.

AI model deployment · DeepSeek-R1 · Ollama

0 likes · 9 min read

21CTO

Feb 6, 2025 · Artificial Intelligence

Run DeepSeek R1 Locally for Free – Integrate AI into VSCode with LM Studio, Ollama, Jan

This guide shows how to set up the free, open‑source DeepSeek R1 large language model locally using LM Studio, Ollama, or Jan, choose the appropriate model size for your hardware, and integrate it into Visual Studio Code as a code‑assistant without any cost.

DeepSeek-R1 · Jan · LM Studio

0 likes · 8 min read

Architect

Feb 6, 2025 · Artificial Intelligence

DeepSeek‑R1: Reinforcement‑Learning‑Driven Long‑Chain Reasoning for Large Language Models

The article reviews DeepSeek‑R1, detailing its reinforcement‑learning‑based training pipeline that uses minimal supervised data, cold‑start fine‑tuning, multi‑stage RL, rejection‑sampling SFT, and distillation to achieve reasoning performance comparable to OpenAI‑o1‑1217, while also discussing successful contributions and failed experiments.

AI research · DeepSeek-R1 · LLM reasoning

0 likes · 11 min read

Code Mala Tang

Feb 5, 2025 · Artificial Intelligence

How O3-mini Stacks Up Against DeepSeek‑R1: Speed, Coding Power, and STEM Reasoning

OpenAI's newly released O3-mini and O3-mini‑high models outperform DeepSeek‑R1 in coding benchmarks, offer faster STEM reasoning, and are accessible to free users, while DeepSeek‑R1 remains a cost‑effective open‑source alternative with strong reasoning capabilities.

AI model comparison · DeepSeek-R1 · O3-mini

0 likes · 8 min read

Architect

Feb 2, 2025 · Artificial Intelligence

Deploying DeepSeek‑R1 Locally with Ollama and Accessing It via Spring Boot and Spring AI

This guide explains how to install Ollama, download and run the open‑source DeepSeek‑R1 language model locally, configure GPU acceleration, and integrate the model into a Spring Boot application using Spring AI to provide an API service for AI inference.

AI model deployment · DeepSeek-R1 · GPU Acceleration

0 likes · 12 min read

Tencent Cloud Developer

Feb 2, 2025 · Artificial Intelligence

Deploying DeepSeek-R1 Models on Tencent Cloud HAI Platform

Deploy DeepSeek‑R1 models on Tencent Cloud HAI in just three minutes by logging in, creating an application, and accessing the model via ChatbotUI or JupyterLab, without purchasing GPUs or configuring environments, while also leveraging integrated services like Cloud Studio and Object Storage for enterprise AI solutions.

AI model deployment · ChatbotUI · DeepSeek-R1

0 likes · 3 min read

ZhongAn Tech Team

Jan 26, 2025 · Artificial Intelligence

Weekly AI Digest – Issue 12

This issue covers the Stargate AI initiative announced by Trump and Altman, Musk’s criticism, Tencent’s Hunyuan 3D AI creation engine launch, and DeepSeek’s release of the open‑source reasoning model R1 with its technical innovations.

3D AI · AI · DeepSeek-R1

0 likes · 10 min read