Tagged articles

R1

8 articles · Page 1 of 1

Jun 4, 2025 · Artificial Intelligence

Beyond a Minor Update: DeepSeek's Coding Ability Leaps Forward

The DeepSeek‑R1 model upgrade dramatically improves reasoning depth and code‑generation performance, matching top‑tier models on benchmarks like LiveCodeBench, while industry experts warn that such advances could reshape software engineering roles and devalue pure coding skills.

AI impact on jobsAI programmingDeepSeek

0 likes · 5 min read

Beyond a Minor Update: DeepSeek's Coding Ability Leaps Forward

DataFunTalk

Mar 7, 2025 · Artificial Intelligence

DeepSeek R1 Technical Report: Insights into Reasoning Models and Their Impact

This presentation reviews the development, technical details, and societal impact of DeepSeek's R1 model, explaining its reasoning capabilities, training pipeline, comparisons with other models, and future directions for AI research and product applications.

AI researchDeepSeekR1

0 likes · 53 min read

DeepSeek R1 Technical Report: Insights into Reasoning Models and Their Impact

AI Algorithm Path

Mar 3, 2025 · Artificial Intelligence

DeepSeek‑R1 Model Performance: Comparing 32B, 70B, and R1

This article evaluates DeepSeek‑R1’s 32B and 70B distilled models alongside the original R1 on a range of reasoning and coding tasks, detailing hardware setup, test methodology, per‑task results, and a comparative analysis of their strengths and weaknesses.

32B70BDeepSeek

0 likes · 6 min read

DeepSeek‑R1 Model Performance: Comparing 32B, 70B, and R1

Architect

Feb 17, 2025 · Artificial Intelligence

Deploying DeepSeek R1 on Huawei Ascend 910B: Weight Conversion and Troubleshooting

This article details a step‑by‑step deployment of the DeepSeek R1 model on Huawei Ascend 910B NPUs, covering FP8‑to‑BF16 weight conversion, custom container image preparation, configuration of MindIE services, common pitfalls, and practical troubleshooting tips for large‑scale inference.

DeepSeekHuawei AscendMindIE

0 likes · 8 min read

Deploying DeepSeek R1 on Huawei Ascend 910B: Weight Conversion and Troubleshooting

JD Tech Talk

Feb 13, 2025 · Artificial Intelligence

DeepSeek R1: Concept Overview, Training Principles, and Practical Implementations

This article introduces the DeepSeek family of models, explains the concepts of online search and deep reasoning, details the two‑phase training pipeline with data augmentation and reinforcement learning, and showcases practical experiments and deployment examples for the R1 and distilled variants.

DeepSeekKnowledge DistillationLLM

0 likes · 10 min read

DeepSeek R1: Concept Overview, Training Principles, and Practical Implementations

AI2ML AI to Machine Learning

Feb 8, 2025 · Artificial Intelligence

Analyzing DeepSeek R1 Inference Projects: Source Code, Cold‑Start, and Scaling Techniques

This article examines DeepSeek R1’s three breakthroughs, its low‑cost optimizations that bypass CUDA, and the resulting impact on the AI ecosystem, then provides a detailed technical review of seven open‑source reproductions—Open‑R1, Tiny‑Zero, SimpleScaling‑S1, and simpleRL‑reason—covering their architectures, reinforcement‑learning pipelines, and code implementations.

DeepSeekInference ScalingLarge Language Models

0 likes · 10 min read

Analyzing DeepSeek R1 Inference Projects: Source Code, Cold‑Start, and Scaling Techniques

Java Architecture Diary

Feb 5, 2025 · Artificial Intelligence

Unlocking DeepSeek R1’s Chain‑of‑Thought: A Spring WebFlux Integration Guide

This article examines why mainstream AI frameworks like Spring AI and LangChain4j cannot fully support DeepSeek’s R1 model, explains its unique chain‑of‑thought response format and parameter constraints, and provides a complete Spring WebFlux‑based solution—including API calls, streaming handling, and response parsing—to preserve reasoning content.

Chain-of-ThoughtDeepSeekR1

0 likes · 8 min read

Unlocking DeepSeek R1’s Chain‑of‑Thought: A Spring WebFlux Integration Guide

Java Architecture Diary

Jan 21, 2025 · Artificial Intelligence

Unlocking DeepSeek R1: How to Leverage the New Reasoning Model with Spring AI

This article introduces DeepSeek R1, a breakthrough reasoning‑focused large model that visualizes its chain‑of‑thought process, matches OpenAI O1 performance, offers open‑source advantages, and provides step‑by‑step Spring AI integration guidance, including dependency setup, configuration, and code examples.

AI integrationDeepSeekR1

0 likes · 9 min read

Unlocking DeepSeek R1: How to Leverage the New Reasoning Model with Spring AI