Tagged articles

DeepSeek

623 articles · Page 2 of 7

Apr 15, 2026 · Industry Insights

How DeepSeek V4 Uses Huawei Ascend 950PR to Outperform Nvidia H20 by 2.9×

The article analyzes DeepSeek V4's migration to Huawei's Ascend 950PR chip and CANN framework, detailing three hardware‑level innovations, the CUDA‑to‑CANN transition, and the resulting 35× inference speed boost, 2.87× performance over Nvidia H20, and dramatic cost reductions for trillion‑parameter models.

AI hardware · CANN framework · DeepSeek

0 likes · 10 min read

Machine Heart

Apr 12, 2026 · Artificial Intelligence

LRT: Implicit Reasoning Chains Boost Speed and Accuracy by Removing Redundant Steps

Researchers introduce Latent Reasoning Tuning (LRT), a lightweight inference network that encodes explicit reasoning chains into fixed‑length latent vectors, eliminating thousands of decoding steps; experiments reveal substantial redundancy in traditional chains and demonstrate that LRT achieves faster, more accurate inference and outperforms existing efficient reasoning methods.

DeepSeek · Efficient Inference · Hybrid Reasoning

0 likes · 10 min read

ArcThink

Apr 11, 2026 · Artificial Intelligence

DeepSeek V4 Preview: A Sovereign Shift Beyond Benchmarks

Developers can sift through official silence and industry leaks—internal statements, Ascend 950PR supply‑chain hints, and sparse‑attention innovations—to assess DeepSeek V4’s likely technical leaps, from million‑token context to native Ascend training, and its strategic impact on the open‑source AI landscape and CUDA independence.

AI model analysis · DeepSeek · Huawei Ascend

0 likes · 27 min read

Wukong Talks Architecture

Apr 8, 2026 · Artificial Intelligence

How to Switch Claude Code to DeepSeek for Faster, Cheaper AI Coding

This step‑by‑step guide shows how to configure Claude Code to use DeepSeek’s Anthropic‑compatible API, replace the default model, optimize costs with mixed model strategies, secure your API key, and troubleshoot common connection issues, enabling a seamless, cost‑effective AI development workflow.

AI model integration · API Configuration · Claude Code

0 likes · 7 min read

Old Meng AI Explorer

Apr 3, 2026 · Artificial Intelligence

Unlock Faster, Cheaper Claude Code with Domestic LLMs: 3 Practical Solutions

Discover three practical ways to replace costly, slow Claude Code API calls with domestic large‑language models: DeepSeek, Alibaba Cloud Bailian, and third‑party relay services. The article covers lower latency, dramatically reduced fees, step‑by‑step configuration, performance benchmarks, and troubleshooting tips for developers.

AI coding · Claude Code · DeepSeek

0 likes · 8 min read

Smart Workplace Lab

Apr 1, 2026 · Artificial Intelligence

Build a Zero‑Leak Local AI Workstation for Non‑Tech Professionals

This guide explains how to set up a privacy‑preserving local AI workstation by selecting modest hardware, using open‑source inference frameworks, deploying models with a one‑click graphical interface, and isolating sensitive data through offline routing, all without requiring programming skills.

DeepSeek · GGUF · Hardware Selection

0 likes · 3 min read

Lao Guo's Learning Space

Mar 31, 2026 · Artificial Intelligence

2026 Guide to Choosing a Personal Supercomputer for Local DeepSeek (15k‑100k)

With cloud API costs soaring and privacy concerns rising, this 2026 guide compares three personal‑supercomputer options (Apple Mac Studio, NVIDIA DGX Spark, and Minisforum MS‑S1 MAX) on unified memory, memory bandwidth, and AI compute to help developers pick the right hardware for their budget and workload.

AI hardware · DeepSeek · Mac Studio

0 likes · 12 min read

Black & White Path

Mar 31, 2026 · Information Security

DeepSeek’s Early‑Year Security Fallout: A Post‑Mortem

The article dissects DeepSeek’s series of security breaches in early 2025—including an open ClickHouse database, multiple XSS flaws, model‑level attacks, and regulatory fallout—highlighting how rapid AI product rollout can outpace essential security safeguards.

AI security · ClickHouse exposure · DeepSeek

0 likes · 14 min read

Black & White Path

Mar 21, 2026 · Artificial Intelligence

Japan’s ‘Self‑Developed’ 700B AI Model: A DeepSeek Re‑skin Flop

Rakuten AI 3.0 was billed as Japan’s largest, self‑developed 700‑billion‑parameter model backed by government funds, but a quick look at its Hugging Face config reveals it merely re‑uses DeepSeek V3, prompting a broader critique of the hype, funding motives, and strategic trade‑offs behind the launch.

AI Industry Analysis · DeepSeek · Government funding

0 likes · 5 min read

AI Explorer

Mar 12, 2026 · Industry Insights

Nvidia’s $26 B Bet on Open‑Source AI Models: Redefining the Industry’s Foundations

Nvidia is committing $26 billion to open‑source AI models, shifting from a pure hardware supplier to shaping the entire AI stack—from chips and system software to frameworks and applications—while raising questions about ecosystem lock‑in, competition with newcomers like DeepSeek, and the future of AI infrastructure.

AI Ecosystem · AI Infrastructure · AI Strategy

0 likes · 7 min read

Frontend AI Walk

Mar 12, 2026 · Artificial Intelligence

Configure OpenClaw Multi‑Agent: GLM‑5, Kimi K2.5, DeepSeek & GLM‑Flash Team

This step‑by‑step tutorial shows how to integrate domestic LLM providers (GLM‑5, GLM‑4.7, GLM‑Flash, Kimi K2.5, DeepSeek, Qwen3‑Coder‑Next, BGE‑M3) into OpenClaw, define model routing, create dedicated controller, writer and coder agents, and run a complete multi‑agent workflow.

AI configuration · DeepSeek · GLM-5

0 likes · 16 min read

Frontend AI Walk

Mar 11, 2026 · Artificial Intelligence

OpenClaw Full‑Domestic Model Stack: 6 Role‑Based Selections and Strategies

This guide outlines a role‑based selection strategy for building a fully domestic OpenClaw model stack, explains common pitfalls when replacing foreign models, details why specific Chinese models fit each role, presents three balanced configurations, and offers a step‑by‑step migration plan.

BGE‑M3 · DeepSeek · GLM-5

0 likes · 15 min read

Mingyi World Elasticsearch

Mar 5, 2026 · Artificial Intelligence

Build a Natural‑Language Easysearch Assistant with LLM‑Powered Tool Use (No DSL Required)

This article shows how to create an Easysearch intelligent assistant that lets users manage indexes, write data, search and aggregate documents using Chinese natural language, by combining the DeepSeek large‑language model with OpenAI‑compatible function calling (Tool Use) and a lightweight Node.js executor.

DeepSeek · Easysearch · LLM

0 likes · 12 min read

Mingyi World Elasticsearch

Mar 5, 2026 · Backend Development

Turning the Easysearch CLI Assistant into a Web App: A Practical Upgrade Guide

This article walks through converting the Easysearch command‑line assistant into a web‑based tool by adding an Express API layer, reusing shared logic, and building a lightweight HTML/CSS/JS front‑end, while preserving the original CLI capabilities.

API · CLI · DeepSeek

0 likes · 11 min read

AI Algorithm Path

Mar 4, 2026 · Artificial Intelligence

Beginner’s Guide: Building a Pedestrian Detection Skill with NanoBot

This step‑by‑step tutorial shows how to install NanoBot, configure it with a DeepSeek API key, create a YOLO‑based pedestrian detection skill via natural‑language commands, test the generated code, and extend the output to JSON, demonstrating AI agents in Python.

AI Agent · DeepSeek · Nanobot

0 likes · 6 min read

Machine Learning Algorithms & Natural Language Processing

Mar 3, 2026 · Artificial Intelligence

Identity Constraint Beats DeepSeek mHC After 150B Tokens: A Surprising Reversal

Extensive experiments on DeepSeek's 1.7B and 8B models reveal that replacing the manifold‑constrained hyper‑connection (mHC) constraint with a simple identity matrix consistently outperforms the original mHC, improves signal‑flow stability, and avoids the collapse caused by repeated Sinkhorn‑Knopp projections.

DeepSeek · Hyper-Connection · Sinkhorn

0 likes · 12 min read

Machine Learning Algorithms & Natural Language Processing

Mar 1, 2026 · Industry Insights

DeepSeek V4 Launch Next Week Promises 50× Cheaper AI and a Shock to US Stocks

DeepSeek V4, a native multimodal model with image, video and text generation, massive token windows and deep optimization for Chinese AI chips, is set to launch next week, claiming API costs over fifty times lower than rivals and potentially rattling US tech stocks by bypassing Nvidia.

AI industry · DeepSeek · Multimodal AI

0 likes · 15 min read

Architecture & Thinking

Mar 1, 2026 · Artificial Intelligence

Why DeepSeek V4 Prioritizes Chinese Chips Over Nvidia – A Game‑Changer for AI Compute

DeepSeek’s upcoming V4 model breaks industry norms by prioritizing Huawei’s Ascend chips over Nvidia GPUs, offering over 30% performance gains, ultra‑long context windows, native multimodal abilities, and dramatically lower inference costs, signaling a shift toward autonomous AI compute in China.

AI compute · AI models · Chinese chips

0 likes · 6 min read

Machine Learning Algorithms & Natural Language Processing

Feb 28, 2026 · Artificial Intelligence

How DualPath Revives Idle Network Cards to Break Long‑Context I/O Bottlenecks in DeepSeek V4

The article analyzes the KV‑Cache storage I/O bottleneck that limits agentic LLM inference, introduces the DualPath architecture with a storage‑to‑decode data path and RDMA‑based scheduling, and shows up to 1.87× offline and 1.96× online throughput gains on large‑scale GPU clusters.

DeepSeek · DualPath · KV cache

0 likes · 13 min read

Machine Learning Algorithms & Natural Language Processing

Feb 27, 2026 · Artificial Intelligence

Can DeepSeek’s DualPath Break GPU Bottlenecks and Ignite an Agentic AI Surge?

DeepSeek’s new DualPath inference framework, co‑developed with leading Chinese universities, decouples compute from KV‑Cache memory access to eliminate I/O stalls in multi‑round agentic workloads, delivering up to nearly 2× higher throughput and dramatically reducing job‑completion time across several large‑scale LLMs.

AI Infrastructure · Agentic Inference · DeepSeek

0 likes · 13 min read

Woodpecker Software Testing

Feb 27, 2026 · Artificial Intelligence

Automating WeChat Public Account Publishing with AI (DeepSeek & Qwen)

This article walks through building a Python pipeline that uses DeepSeek and Alibaba Qwen to generate AI‑written articles, creates cover images, and automatically saves them as drafts in a WeChat public account, with detailed environment setup, client implementations, fallback strategies, and deployment tips.

AI · Automation · Content Generation

0 likes · 26 min read

PaperAgent

Feb 26, 2026 · Industry Insights

What the DeepSeek V4 Lite Leak Reveals About Its Specs and Multimodal Power

Recent reports indicate that DeepSeek's unreleased V4 Lite model, featuring a 1‑million‑token context window and native multimodal reasoning, has been leaked online, with Huawei gaining early access while Nvidia is excluded, and the model demonstrates impressive spatial reasoning in generated SVG examples.

DeepSeek · Industry insight · Large Language Model

0 likes · 3 min read

Model Perspective

Feb 26, 2026 · Artificial Intelligence

Why DeepSeek Skipped the Chinese New Year Red‑Packet Rush: A Cost‑Benefit Analysis

The article examines DeepSeek’s decision to avoid the Chinese New Year red‑packet promotion by modeling user acquisition costs, revenue, and compute constraints, showing that limited capital and the trade‑off between inference and training resources make mass user growth financially unattractive compared to larger rivals.

AI Strategy · Cost-Benefit Analysis · DeepSeek

0 likes · 8 min read

ShiZhen AI

Feb 25, 2026 · Artificial Intelligence

Anthropic Accuses Chinese AI Labs of Large-Scale Distillation Attack; Community Notes and Musk React

Anthropic's report alleges that DeepSeek, Moonshot AI, and MiniMax used 24,000 fake accounts to harvest 16 million Claude interactions for illicit model distillation, prompting Community Notes to expose Anthropic's own past data‑piracy settlements and sparking a rebuttal from Elon Musk.

AI security · Anthropic · Claude

0 likes · 10 min read

AI2ML AI to Machine Learning

Feb 24, 2026 · Artificial Intelligence

Optimizing Structured Processes in the Large‑Model Era: From Reasoning to Agentic RL

The article analyzes how large‑model development has moved from reasoning to the agentic stage, compares open‑source and closed‑source capabilities, details Reasoning RL versus Agentic RL designs, and proposes skill‑centric data and verification mechanisms to close the performance gap.

DeepSeek · GLM-5 · RL+SFT

0 likes · 10 min read

AI Insight Log

Feb 16, 2026 · Artificial Intelligence

DeepSeek V4 Benchmark Leak Fuels Talk of a New Coding King

A leaked SWE‑Bench score of 83.7% for DeepSeek V4 sparked claims it outperforms Claude Opus 4.5 and GPT‑5.2, but the data was later debunked as fabricated while official hints confirm a 1‑million‑token context model and a mid‑February 2026 release.

AI benchmarking · AI industry · DeepSeek

0 likes · 7 min read

AI Engineering

Feb 14, 2026 · Artificial Intelligence

DeepSeek‑V4‑Lite‑285B Hits 100% Recall in 256K Token Tests – A Needle‑in‑a‑Haystack Benchmark

Community testing of DeepSeek's rumored V4‑Lite‑285B model using the OpenAI MRCR 8‑needle test shows perfect 1.0000 scores on several 128K‑token samples and a 256K‑token sample, that is, 100% recall within the native 256K context, while longer contexts drop to about 60%. The article notes that the "needle‑in‑a‑haystack" method may be exploitable by DSA mechanisms.

DeepSeek · LLM · Long Context

0 likes · 3 min read

DataFunTalk

Feb 12, 2026 · Artificial Intelligence

DeepSeek’s New Model V4? Exploring 1M‑Token Context and Updated Knowledge

DeepSeek quietly launched its latest model, reportedly supporting up to 1 million tokens, extending its knowledge cutoff to May 2025, adopting a more enthusiastic response style, and still operating as a pure‑text system, while early tests showcase impressive coding and reasoning capabilities.

AI evaluation · DeepSeek · Large Language Model

0 likes · 5 min read

PaperAgent

Feb 11, 2026 · Industry Insights

Is DeepSeek’s New V4 Model Redefining the AI Landscape?

DeepSeek has quietly released a new large‑language model—likely V4—featuring a May 2025 knowledge cutoff, a 1 million‑token context window, and pure‑text capabilities, while industry trends in 2026 shift focus toward agentic AI systems that coordinate multiple specialized models.

AI models · Agentic AI · DeepSeek

0 likes · 3 min read

Machine Learning Algorithms & Natural Language Processing

Feb 10, 2026 · Artificial Intelligence

Inside GLM-5: 745B Parameters, DeepSeek‑style Sparse Attention, and a 60% Stock Surge

The GLM-5 architecture, uncovered from a GitHub PR, doubles the previous model to 745 B parameters, adopts DeepSeek‑V3 sparse attention and multi‑token prediction, features a 78‑layer MoE with 256 experts, supports a 202K‑token context window, and its rumored test model "Pony Alpha" sparked a 60% rise in Zhipu AI's stock amid a crowded AI release season.

AI Stock Impact · DeepSeek · GLM-5

0 likes · 6 min read

Old Zhang's AI Learning

Feb 9, 2026 · Artificial Intelligence

GLM-5 Emerges First, Built on DeepSeek Tech, Triggering a 40% Stock Surge

An anonymous OpenRouter model dubbed "Pony Alpha" was verified as the new 745B‑parameter GLM-5, which reuses DeepSeek‑V3 architecture, supports sparse attention and multi‑token prediction, and has already caused a near‑40% jump in Zhipu AI’s stock while hinting at upcoming integration into the Transformers library.

DeepSeek · GLM-5 · Large Language Model

0 likes · 3 min read

Old Zhang's AI Learning

Feb 9, 2026 · Artificial Intelligence

Qwen 3.5 Emerges; ByteDance and DeepSeek Set to Release Flagship LLMs for Spring Festival

The LMSYS Chatbot Arena now shows Qwen 3.5 (codenamed Karp-001/002) alongside ByteDance's Pisces‑llm models and DeepSeek‑V4, with new Transformers configs and hints of an Active‑3B MoE architecture, suggesting a fresh wave of flagship large language models arriving for the Spring Festival.

ByteDance · DeepSeek · MoE

0 likes · 4 min read

Amazon Cloud Developers

Feb 5, 2026 · Cloud Computing

How to Build a Fast, Accurate AI‑Powered Knowledge Base with Amazon OpenSearch and DeepSeek

This article walks through using Amazon OpenSearch Service’s vector search and ML connector together with the DeepSeek large language model to create a low‑cost, high‑efficiency enterprise knowledge base, covering architecture, step‑by‑step deployment, RAG pipeline configuration, and conversational search extensions.

Amazon OpenSearch · DeepSeek · Knowledge Base

0 likes · 17 min read

Mingyi World Elasticsearch

Jan 30, 2026 · Backend Development

Text2DSL: Convert Natural Language to Precise Elasticsearch/Easysearch DSL

Text2DSL lets users describe search requirements in plain language, uses DeepSeek to generate Elasticsearch DSL, validates the DSL locally with Elasticsearch/Easysearch, iteratively refines it up to five times, and achieves over 95% first‑try accuracy while cutting query‑building time by at least threefold.

DSL generation · DeepSeek · Easysearch

0 likes · 12 min read

HyperAI Super Neural

Jan 30, 2026 · Artificial Intelligence

Frontier OCR Advances: DeepSeek, Tencent, and Baidu Push From Text Recognition to Structured Document Understanding

This weekly AI paper roundup reviews five cutting‑edge OCR studies—DeepSeek‑OCR 2, LightOnOCR‑2‑1B, HunyuanOCR, PaddleOCR‑VL, and GOT—detailing their novel visual‑language architectures, training data, benchmark evaluations, and performance gains over previous models.

DeepSeek · GoT · LightOnOCR

0 likes · 9 min read

PaperAgent

Jan 27, 2026 · Artificial Intelligence

How DeepSeek-OCR 2’s Dual-Flow Attention Redefines Document Understanding

DeepSeek-OCR 2 introduces a novel dual‑stream (bidirectional + causal) attention architecture that replaces fixed raster scanning, leverages a Qwen2‑0.5B encoder, and achieves state‑of‑the‑art accuracy on OmniDocBench while reducing token budget and improving reading‑order consistency.

DeepEncoder · DeepSeek · Dual-Stream Attention

0 likes · 8 min read

Ubuntu

Jan 24, 2026 · Artificial Intelligence

Unlock Full‑Stack AI Coding on Ubuntu with Ollama and CC Switch

This step‑by‑step guide shows how to replace cloud‑based AI coding tools with a private, zero‑cost workflow on Ubuntu by installing Ollama, configuring systemd, adding DeepSeek or Qwen2.5 models, installing Claude, Codex and Gemini CLIs, and routing them through CC Switch.

AI coding · CC Switch · Claude Code

0 likes · 7 min read

Ubuntu

Jan 23, 2026 · Artificial Intelligence

Deploy DeepSeek Locally on Ubuntu: Build Your Private AI Assistant

This guide walks through why you might run a large language model locally—privacy, zero latency, and no token costs—then details hardware requirements, installs Ollama, pulls the appropriate DeepSeek‑R1 model, tests it with a coding prompt, and optionally adds a web UI via Docker.

AI assistant · DeepSeek · Ollama

0 likes · 6 min read

Data Party THU

Jan 21, 2026 · Artificial Intelligence

What DeepSeek’s Secret “Model1” Reveals About the Upcoming V4 LLM

Analyzing recent commits to DeepSeek's FlashMLA repository, the article concludes that the mysterious Model1 likely corresponds to DeepSeek‑V4, detailing architectural shifts to a 512‑dimensional head, full support for NVIDIA Blackwell GPUs, token‑level sparse MLA, and new mechanisms such as Value Vector Position Awareness and Engram.

DeepSeek · DeepSeek-V4 · GPU Optimization

0 likes · 6 min read

PaperAgent

Jan 21, 2026 · Artificial Intelligence

Inside DeepSeek’s FlashMLA Update: What’s New in the MODEL1 Architecture

DeepSeek’s recent FlashMLA update introduces the new MODEL1, featuring a tighter KV-Cache layout, an extra two-stage cache, and a fixed 512×512 head dimension, with four code changes detailed in a public GitHub commit and illustrated by comparative diagrams.

AI Architecture · DeepSeek · FlashMLA

0 likes · 3 min read

Woodpecker Software Testing

Jan 15, 2026 · Artificial Intelligence

Step-by-Step Guide to Building Your First AI Agent: Connecting Alibaba Cloud, OpenAI, Dashscope, DeepSeek, and Ollama

This article provides a detailed, hands‑on tutorial for creating an AI agent, covering registration and API key setup for Alibaba Cloud, OpenAI, Dashscope and DeepSeek, installing and using Ollama for local model deployment, configuring CherryStudio, and implementing function‑calling and MCP techniques with full code examples.

AI Agent · Alibaba Cloud · DashScope

0 likes · 26 min read

AI Insight Log

Jan 13, 2026 · Artificial Intelligence

Why Bigger LLMs Still Forget Facts – DeepSeek’s Engram Memory Module Explained

This article analyzes DeepSeek’s new Engram module, showing how conditional memory reduces the compute‑only approach of large language models, improves knowledge retrieval, reasoning, long‑context handling, and system efficiency while maintaining strict parameter and FLOP budgets.

AI Architecture · DeepSeek · Engram

0 likes · 15 min read

PaperAgent

Jan 10, 2026 · Artificial Intelligence

DeepSeek V4 Unveiled: Why Its Coding Power Beats Claude and GPT

DeepSeek's newly announced V4 model, the successor to its December 2024 V3 release, demonstrates superior coding abilities over Claude and GPT series, details its data composition, infrastructure, training costs, failed experimental attempts, expanded benchmark comparisons, and includes a comprehensive safety report.

AI model analysis · Coding performance · DeepSeek

0 likes · 4 min read

AI Insight Log

Jan 9, 2026 · Industry Insights

Anthropic Blocks xAI from Claude; DeepSeek V4 Targets Code Supremacy at New Year

Anthropic abruptly cut off xAI employees’ access to its Claude model, labeling xAI a competitor. xAI co‑founder Tony Wu views the loss as both a short‑term productivity hit and a catalyst for accelerating xAI's own coding AI. Separately, Chinese startup DeepSeek is rumored to launch V4 during the upcoming Chinese New Year, claiming code‑generation capabilities that surpass current Anthropic and OpenAI models.

AI competition · Anthropic · Claude

0 likes · 5 min read

Mingyi World Elasticsearch

Jan 3, 2026 · Artificial Intelligence

Build Your Own AI Coding Assistant in 5 Minutes: A Hands‑On Guide

The article analyzes common pain points of traditional AI coding chats—repetitive context input, lengthy prompts, and generic answers—and demonstrates how to create a persistent, expert‑level AI coding assistant using Coco AI, with step‑by‑step configuration, example prompts, and future RAG enhancements.

AI Agent · Coco AI · DeepSeek

0 likes · 9 min read

Design Hub

Jan 2, 2026 · Artificial Intelligence

DeepSeek’s “Mathematical Tight‑Fit” Tames AI: Constraints Drive Performance Gains

DeepSeek’s new mHC architecture replaces unconstrained hyper‑connections with manifold‑constrained doubly‑stochastic matrices, stabilizing large‑scale training, reducing signal explosion from 3000× to 1.6×, and delivering consistent accuracy improvements across BBH, DROP, GSM8K, and MMLU benchmarks while adding only 6.7% training overhead.

AI training stability · DeepSeek · hyper-connections

0 likes · 10 min read

AI Insight Log

Jan 1, 2026 · Artificial Intelligence

Can DeepSeek’s mHC Architecture Break ResNet’s Decade-Long Dominance?

DeepSeek’s new paper “mHC: Manifold‑Constrained Hyper‑Connections” proposes a novel architecture that replaces traditional residual connections with mathematically constrained hyper‑connections, showing on a 27B model a modest 6.7 % training‑time increase but significant stability gains and superior performance on BBH, DROP and GSM8K benchmarks.

DeepSeek · LLM training · ResNet

0 likes · 8 min read

Baidu Geek Talk

Dec 24, 2025 · Artificial Intelligence

Context Parallelism Slashes TTFT by 80% for 128K-Token LLMs

The article explains how Baidu’s Baige team integrated a Context Parallelism strategy into DeepSeek V3.2, detailing the DSA architecture, the limitations of traditional tensor and sequence parallelism, and how CP distributes computation and memory across GPUs to achieve up to an 80 % reduction in time‑to‑first‑token (TTFT) latency for ultra‑long 128K‑token contexts.

Context Parallelism · DeepSeek · LLM

0 likes · 9 min read

Baidu Intelligent Cloud Tech Hub

Dec 17, 2025 · Artificial Intelligence

How AFD Splits Attention and FFN to Boost DeepSeek‑V3 Inference by Up to 19%

The article details the Attention‑FFN Disaggregation (AFD) technique used by Baidu Baige to separate self‑attention and feed‑forward network stages in DeepSeek‑V3 models, describing multi‑stage scheduling, three‑batch overlap, communication optimizations, and performance results that achieve up to 19% throughput improvement under a 100 ms SLO.

3BO · AFD · Attention-FFN Disaggregation

0 likes · 17 min read

21CTO

Dec 11, 2025 · Artificial Intelligence

Why DeepSeek’s Founder Made Nature’s 2025 Top‑10 Scientists List

Nature’s 2025 “Nature’s 10” list highlighted DeepSeek founder Liang Wenfeng for his breakthrough in AI transparency, noting his open‑weight model’s impact on researchers, while also detailing the model’s low‑cost performance and the other distinguished scientists honored that year.

DeepSeek · Liang Wenfeng · Nature's 10

0 likes · 3 min read

Data Party THU

Dec 10, 2025 · Artificial Intelligence

How DeepSeek‑V3.2 Cuts Inference Cost and Boosts Agent Skills with Sparse Attention

DeepSeek's V3.2 release introduces a dual‑model lineup, a Sparse Attention architecture that halves long‑context inference cost, a post‑training reinforcement‑learning pipeline that exceeds 10% of pre‑training compute, and a revamped agent framework that dramatically improves tool‑use and reasoning performance across benchmarks.

Agentic AI · DeepSeek · Large Language Model

0 likes · 11 min read

Old Meng AI Explorer

Dec 7, 2025 · Artificial Intelligence

Why DeepSeek-Math-V2 Is the New Benchmark for Rigorous AI Math Reasoning

DeepSeek-Math-V2, an open‑source math reasoning model from DeepSeek, introduces a self‑verification mechanism that ensures step‑by‑step logical correctness, achieving gold‑medal scores in IMO 2025, CMO 2024 and near‑perfect results in the Putnam 2024 competition, while offering free, extensible deployment for research, training, and scientific computation.

AI Math · DeepSeek · Self‑Verification

0 likes · 13 min read

Instant Consumer Technology Team

Dec 5, 2025 · Artificial Intelligence

Transform Complex Prompts into Reusable AI Skills and Hook DeepSeek into Claude Code

This article explains how to replace cumbersome, city‑specific prompt strings with modular AI Skills, demonstrates the food‑diorama‑skill that generates 3D gourmet dioramas, and provides a step‑by‑step guide for connecting the DeepSeek V3.2 model to Claude Code using environment variables or the CC Switch GUI.

AI · Claude · DeepSeek

0 likes · 8 min read

Frontend AI Walk

Dec 5, 2025 · Artificial Intelligence

Master Prompt Engineering: From Random Chat to Precise Control with Zero-shot, Few-shot, and Chain‑of‑Thought

This article explains how to converse effectively with large language models by mastering three core prompting techniques—Zero‑shot, Few‑shot, and Chain‑of‑Thought—illustrated with front‑end analogies, code snippets, and a step‑by‑step DeepSeek JSON‑generation exercise that shows common pitfalls and best practices.

Chain-of-Thought · DeepSeek · Few-shot

0 likes · 12 min read

Fun with Large Models

Dec 5, 2025 · Artificial Intelligence

DeepSeek Math V2 & V3.2: A Plain‑Language Deep Dive into Core Innovations

This article provides a detailed, easy‑to‑understand analysis of DeepSeek‑Math‑V2’s self‑verification training method and DeepSeek‑V3.2’s GRPO framework, sparse‑attention DSA mechanism, massive agent data pipeline, and benchmark results that place both models among the world’s top open‑source large language models.

DeepSeek · GRPO · LLM

0 likes · 19 min read

PMTalk Product Manager Community

Dec 4, 2025 · Industry Insights

Three Chinese AI Giants, Three Strategies: Doubao, DeepSeek, and Qwen in 2025

In 2025 China's AI large‑model arena is sharply fragmenting, with ByteDance's Doubao leading user activity, DeepSeek dominating technical and international influence, and Alibaba's Qwen carving a unique full‑stack strategic edge, each pursuing distinct paths in technology, product and ecosystem competition.

AI · China · DeepSeek

0 likes · 11 min read

Aikesheng Open Source Community

Dec 4, 2025 · Artificial Intelligence

Gemini 3 Pro vs DeepSeek‑V3.2‑Exp: Which LLM Dominates SQL Understanding, Optimization, and Dialect Conversion?

This report evaluates the professional‑grade LLMs Gemini 3 Pro and DeepSeek‑V3.2‑Exp on three SQL‑related dimensions—understanding, optimization, and dialect conversion—using the SCALE benchmark, presenting detailed scores, strengths, weaknesses, and practical recommendations for database engineers and decision makers.

DeepSeek · Gemini · LLM

0 likes · 16 min read

PaperAgent

Dec 2, 2025 · Artificial Intelligence

How DeepSeek‑V3.2’s New Agent Architecture Bridges the Gap to Closed‑Source LLMs

DeepSeek‑V3.2 introduces a reinforced‑agent framework that combines a synthetic task factory, scaling reinforcement learning, and advanced context management, achieving the highest open‑source agent scores and narrowing the performance gap with leading closed‑source models such as Claude‑4.5‑Sonnet, GPT‑5‑High, and Gemini‑3.0‑Pro.

AI agents · DeepSeek · Scaling RL

0 likes · 7 min read

Baidu Intelligent Cloud Tech Hub

Nov 25, 2025 · Artificial Intelligence

Why DeepSeek‑V3.2‑Exp Lost Performance and How a Simple RoPE Fix Restored It

The Baidu Baige team discovered that DeepSeek‑V3.2‑Exp’s long‑context performance lagged behind the official report, traced the issue to a subtle RoPE layout mismatch in the open‑source inference demo, collaborated with DeepSeek to fix it, and verified that the model’s speed and accuracy fully recovered across multiple benchmarks.

AI Infrastructure · DeepSeek · LLM Inference

0 likes · 9 min read

BirdNest Tech Talk

Nov 17, 2025 · Artificial Intelligence

How to Parse and Use Claude Skills with Go: A Deep Dive into LLM Tool Integration

This article explains the concept of Claude Skills, walks through a Go library that parses skill packages, demonstrates a CLI inspector, shows how to run skills with DeepSeek‑V3 via an OpenAI‑compatible API, and outlines future security enhancements.

Claude · DeepSeek · Go

0 likes · 13 min read

Selected Java Interview Questions

Nov 15, 2025 · Backend Development

Why Your Spring Boot DeepSeek API Key Stays Old and How to Fix It

In Spring Boot projects, configuration values can appear unchanged due to environment variable precedence, misplaced files, missing setters, profile activation issues, or IDE caching, and this guide walks through five common pitfalls with concrete commands and code snippets to resolve them.

API key · Configuration · DeepSeek

0 likes · 9 min read

Huawei Cloud Developer Alliance

Nov 14, 2025 · Artificial Intelligence

How to Build a DeepSeek AI Chat Assistant Using Huawei Developer Space

This guide walks you through creating an AI chat assistant powered by the DeepSeek‑V3 large language model on Huawei Developer Space, covering cloud container setup, free token acquisition, MaaS model activation, environment configuration, code deployment with Gradio, and end‑to‑end testing.

AI · Chatbot · DeepSeek

0 likes · 13 min read

Tech Stroll Journey

Nov 9, 2025 · Backend Development

How to Deploy AnythingLLM Locally with Docker for Enterprise Document RAG

This guide walks through setting up an Ubuntu VM, installing Docker, pulling the AnythingLLM image, configuring storage, launching the container, and using it to ingest and query local documents with a DeepSeek‑R1 model.

AI Deployment · AnythingLLM · DeepSeek

0 likes · 6 min read

Baidu Intelligent Cloud Tech Hub

Oct 28, 2025 · Artificial Intelligence

How Baidu’s New MTP Inference Code Doubles DeepSeek‑V3.2 Throughput

Baidu Baige and the SGLang community have open‑sourced a production‑tested MTP inference engine that more than doubles DeepSeek‑V3.2 decoding speed while remaining stable, thanks to a DSA‑optimized architecture that predicts multiple tokens in a single forward pass.

AI · DSA · DeepSeek

0 likes · 4 min read

ShiZhen AI

Oct 24, 2025 · Artificial Intelligence

Why GPT‑5 Lost 72% While Chinese AI Models Gained 32% in the NOF1.AI Alpha Arena

The NOF1.AI Alpha Arena benchmark shows Chinese models such as Qwen3 Max and DeepSeek outperforming GPT‑5, returning +32.42% and +22.46% respectively while GPT‑5 lost 72.49%, and highlights the impact of trade frequency, risk control, and profit‑to‑loss ratios in AI‑driven crypto trading.

AI trading · Alpha Arena · DeepSeek

0 likes · 14 min read

Python Programming Learning Circle

Oct 23, 2025 · Artificial Intelligence

How to Integrate DeepSeek‑V3 AI into PyCharm Using the Continue Plugin

This guide walks you through obtaining a DeepSeek API key, installing the Continue plugin in PyCharm, configuring the plugin with DeepSeek‑V3 settings, and using the AI to explain or modify selected code snippets, complete with screenshots and a ready‑to‑paste JSON configuration.

AI · API integration · DeepSeek

0 likes · 4 min read

Baobao Algorithm Notes

Oct 20, 2025 · Artificial Intelligence

Can Visual Tokens Compress Text? Inside DeepSeek-OCR’s Optical Compression

DeepSeek‑OCR introduces a novel visual encoder that transforms text into images, achieving up to 10‑20× token compression while maintaining OCR accuracy, and demonstrates strong performance on OmniDocBench with a 3B‑parameter model across multilingual and multimodal tasks.

AI · DeepSeek · OCR

0 likes · 10 min read

DataFunTalk

Oct 20, 2025 · Artificial Intelligence

How DeepSeek-OCR Achieves 10× Context Compression with Vision Tokens

DeepSeek-OCR, a newly open‑sourced 3B‑parameter OCR model, uses a novel DeepEncoder and a 3B MoE decoder to compress long‑text contexts into visual tokens, achieving up to 10× compression with 97% accuracy and demonstrating strong practical performance on benchmarks and multilingual documents.

DeepSeek · Multimodal AI · OCR

0 likes · 11 min read

BirdNest Tech Talk

Oct 14, 2025 · Artificial Intelligence

How DeepSeek’s Lightning Indexer Enables Efficient Sparse Attention for Long Texts

The article explains how DeepSeek’s Lightning Indexer acts as a memory filter: it computes index scores and selects the top‑k relevant tokens, cutting the tokens each query attends to from 128K to 2048 on very long sequences, and it maps the compact scoring formula to the FP8 kernel code.

DeepSeek · FP8 · Lightning Indexer

0 likes · 7 min read

Python Programming Learning Circle

Oct 11, 2025 · Artificial Intelligence

How AI-Powered DeepSeek Can Auto‑Heal Your Playwright Tests

This article demonstrates how to use DeepSeek's coding model together with Playwright to automatically detect, analyze, and fix fragile UI automation scripts, providing AI‑driven suggestions and patch generation for more resilient test suites.

AI · DeepSeek · Python

0 likes · 5 min read

DataFunTalk

Sep 30, 2025 · Artificial Intelligence

DeepSeek‑V3.2‑Exp Unveiled: Million‑Token Context, Sparse Attention, and Cost‑Effective Inference

DeepSeek‑V3.2‑Exp, the latest experimental large‑language model, is open‑sourced with a paper, featuring a million‑token context window, a new sparse attention mechanism, GRPO‑enhanced reasoning, and detailed cost‑analysis showing up to ten‑fold inference savings.

DeepSeek · GRPO · Inference Optimization

0 likes · 5 min read

Zhihu Tech Column

Sep 23, 2025 · Backend Development

Build a High‑Performance AI Chatbot with FUST Microservices and DeepSeek

This tutorial walks through using Zhihu's open‑source FUST microservice framework together with DeepSeek's language model API to design, implement, and deploy a scalable, high‑performance intelligent Q&A system, covering architecture, data models, service layers, and deployment scripts.

AI Chatbot · DeepSeek · FUST

0 likes · 16 min read

DataFunTalk

Sep 23, 2025 · Artificial Intelligence

DeepSeek‑V3.1‑Terminus Fixes the ‘Extreme’ Bug and Outperforms Gemini 2.5 Pro

DeepSeek released the V3.1‑Terminus model, fixing the notorious “extreme” character bug, improving language consistency and Agent capabilities, and achieving notable benchmark gains that surpass Gemini 2.5 Pro, while providing download links and hinting at upcoming V4/R2 releases.

Agent · DeepSeek · Large Language Model

0 likes · 6 min read

Code Wrench

Sep 22, 2025 · Artificial Intelligence

Build a Private ChatGPT on Your Laptop with Ollama, DeepSeek‑R1 and Go MCP

This guide walks you through installing Ollama, pulling the open‑source DeepSeek‑R1:1.5B model, wrapping it with a Go‑based Model Context Protocol (MCP) server, creating a client example, and enhancing the experience with Open‑WebUI while offering performance‑tuning tips.

DeepSeek · Go · MCP

0 likes · 9 min read

Data Party THU

Sep 21, 2025 · Artificial Intelligence

Building a Mini‑DeepSeek‑V3: Transformer Block and MTP Implementation on Limited Compute

This article walks through the design and implementation of a Mini‑DeepSeek‑V3 language model, detailing how to assemble the core Transformer block, integrate Multi‑Token Prediction (MTP) modules, construct the overall architecture, and compute the combined loss—all using modest GPU resources and a single‑card or DDP training setup.

AI · DeepSeek · MTP

0 likes · 12 min read

Data Party THU

Sep 20, 2025 · Artificial Intelligence

How DeepSeek Trained a $30M LLM for Just $29.4K – Inside the R1 Model

The article reports that DeepSeek’s R1 large language model, detailed in a peer‑reviewed Nature paper, was trained for roughly $300 k (about $294 k as reported in the paper) using Nvidia H800 chips and novel pure reinforcement‑learning techniques, achieving competitive performance while remaining open‑source.

DeepSeek · Large Language Model · Nvidia H800

0 likes · 9 min read

Data Party THU

Sep 19, 2025 · Artificial Intelligence

How DeepSeek R1 Redefines AI Reasoning with Pure Reinforcement Learning

DeepSeek R1 replaces traditional supervised fine‑tuning with a pure reinforcement‑learning pipeline, introducing the GRPO algorithm and a four‑stage training regime that dramatically lowers cost, boosts reasoning and code‑generation performance, and raises important ethical, privacy, and societal considerations for large language models.

AI reasoning · DeepSeek · GRPO

0 likes · 14 min read

Data Party THU

Sep 18, 2025 · Artificial Intelligence

How DeepSeek‑R1’s Reinforcement Learning Redefined LLM Reasoning (Nature Cover Story)

DeepSeek‑R1, the first peer‑reviewed large language model, landed on Nature’s cover after a novel reinforcement‑learning‑only training pipeline that dramatically boosted reasoning performance while keeping training costs surprisingly low.

DeepSeek · GRPO · Model Training

0 likes · 14 min read

DataFunTalk

Sep 18, 2025 · Artificial Intelligence

How DeepSeek‑R1’s Reinforcement Learning Earned a Nature Cover

DeepSeek‑R1, the first peer‑reviewed large language model, leveraged a pure reinforcement‑learning framework and the novel GRPO algorithm to achieve breakthrough reasoning performance, low training cost, and widespread acclaim, culminating in a Nature magazine cover story.

AI reasoning · DeepSeek · GRPO

0 likes · 14 min read

Raymond Ops

Sep 14, 2025 · Artificial Intelligence

Create AI Videos with DeepSeek + Tongyi Wanxiang: Step-by-Step Guide

This article explains how to leverage the Chinese AI multimodal platform Tongyi Wanxiang together with DeepSeek to generate high-quality AI videos, covering AI video fundamentals, core features, application scenarios, detailed workflow, script creation, video synthesis, and Java API integration with code examples.

AI video generation · DeepSeek · Java SDK

0 likes · 25 min read

xkx's Tech General Store

Sep 14, 2025 · Artificial Intelligence

Exploring Agents 006: Tencent Youtu-Agent – A Simple, Highly Extensible General Agent

This article introduces Tencent's open‑source Youtu‑Agent, detailing its modular, configuration‑driven design, installation steps, benchmark performance on the WebWalkerQA dataset, and hands‑on test cases for SimpleAgent and Orchestra modes, while highlighting its extensibility and web UI capabilities.

AI agents · DeepSeek · LLM

0 likes · 12 min read

Aikesheng Open Source Community

Sep 4, 2025 · Artificial Intelligence

How GPT‑5, DeepSeek‑V3.1 and SQLShift Stack Up in the August 2025 SQL LLM Benchmark

The August 2025 SCALE benchmark evaluates new AI models—including the GPT‑5 family, DeepSeek‑V3.1, and the SQLShift tool—across SQL understanding, optimization, and dialect conversion, revealing distinct strengths, weaknesses, and the growing advantage of specialized tools over generic large language models.

AI · DeepSeek · GPT-5

0 likes · 15 min read

Dunmao Tech Hub

Sep 1, 2025 · Artificial Intelligence

Deploy DeepSeek‑r1 Locally with a One‑Click Ollama Script

This guide walks you through a Bash script that automatically checks for Ollama, installs it if missing, lets you choose a DeepSeek‑r1 model size, starts the Ollama service, and runs the selected model locally, complete with usage examples and a token‑cost note.

AI · DeepSeek · Model Deployment

0 likes · 7 min read

IT Services Circle

Aug 28, 2025 · Artificial Intelligence

Why DeepSeek V3.1 Keeps Spitting the ‘Extreme’ Token and How to Fix It

Developers using DeepSeek V3.1's API have reported that the model intermittently inserts the Chinese character “极” (or its variants) into generated code, a bug that spreads across multiple platforms and threatens high‑precision code generation, prompting community workarounds and speculation about its root causes.

AI model bug · DeepSeek · LLM

0 likes · 6 min read

Aikesheng Open Source Community

Aug 28, 2025 · Artificial Intelligence

How Does DeepSeek‑V3.1 Perform on Professional SQL Tasks? A Detailed Benchmark

This report objectively evaluates DeepSeek‑V3.1 on professional‑grade SQL tasks, presenting its balanced strengths in understanding, optimization, and dialect conversion, highlighting its top scores in syntax error detection and Chinese database conversion while exposing weaknesses in execution‑plan analysis and large‑SQL transformations.

DeepSeek · LLM · artificial-intelligence

0 likes · 8 min read

Efficient Ops

Aug 27, 2025 · Artificial Intelligence

Why DeepSeek V3.1 Randomly Inserts the Chinese Character “极” – Token Bug Explained

DeepSeek’s latest V3.1 model unexpectedly injects the Chinese character “极” into generated text, breaking code compilation, JSON parsing, and academic writing; users trace the bug to a mix‑up between adjacent token IDs and offer two main hypotheses, dataset contamination or a learned model shortcut.

AI safety · DeepSeek · Language Model

0 likes · 4 min read

Architects' Tech Alliance

Aug 26, 2025 · Artificial Intelligence

How DeepSeek‑V3.1’s New FP8 Precision Supercharges Domestic Chip Performance

DeepSeek‑V3.1 introduces the UE8M0 FP8 Scale precision, cutting memory usage by up to 75% and enabling next‑generation Chinese chips such as Ascend 910B to run 128K context models efficiently, while the ecosystem rapidly adopts FP8, yet challenges in IP autonomy and software maturity remain before global competitiveness is achieved.

AI hardware · DeepSeek · FP8

0 likes · 10 min read

Raymond Ops

Aug 26, 2025 · Artificial Intelligence

How to Deploy DeepSeek R1 Locally: Versions, Hardware, and UI Tools

This guide explains DeepSeek R1’s model variants, hardware requirements, local installation steps using Ollama, LM Studio or Docker, and how to add visual interfaces like Open‑WebUI and Dify for a complete on‑premise AI solution.

DeepSeek · Dify · Hardware Requirements

0 likes · 14 min read

IT Services Circle

Aug 24, 2025 · Artificial Intelligence

What Is UE8M0 FP8 and Why It’s Boosting China’s Next‑Gen AI Chips

The article explains the UE8M0 FP8 precision format, its MXFP8 origins, how it reduces bandwidth and power consumption, and why Chinese AI chip makers like Cambricon, Hygon (HaiGuang) and Moore Threads are rapidly adopting it, signaling a shift toward domestic AI hardware independence.

AI hardware · Chinese chips · DeepSeek

0 likes · 10 min read

Fun with Large Models

Aug 22, 2025 · Artificial Intelligence

Step‑by‑Step Guide: Building a PDF‑Based RAG Knowledge Base with LangChain, Streamlit, DashScope & DeepSeek

This tutorial shows how to create a lightweight Retrieval‑Augmented Generation (RAG) system that indexes multiple PDF files, stores their embeddings in a FAISS vector database, and answers user queries through a LangChain agent powered by DashScope embeddings and the DeepSeek‑Chat model, all wrapped in a Streamlit UI.

DashScope · DeepSeek · FAISS

0 likes · 13 min read

Open Source Tech Hub

Aug 21, 2025 · Artificial Intelligence

How to Connect Claude Code to DeepSeek‑V3.1 for AI‑Powered Coding

This guide explains how to install Claude Code and the ClaudeCodeRouter, configure them to use DeepSeek‑V3.1 via the Anthropic API format, run the tool, and troubleshoot common connection and Windows environment issues.

AI coding · Claude Code · DeepSeek

0 likes · 5 min read

AI Algorithm Path

Aug 20, 2025 · Artificial Intelligence

DeepSeek V3.1 Open‑Source: Unlocking a New Era of Long‑Context AI

DeepSeek V3.1, a 685‑billion‑parameter open‑source model, supports up to 128,000 tokens, delivers mixed‑architecture capabilities, matches top‑tier closed systems in benchmarks, and its rapid community adoption signals a shift toward democratized AI development and new industry dynamics.

AI performance · DeepSeek · Large Language Model

0 likes · 6 min read

Fun with Large Models

Aug 20, 2025 · Artificial Intelligence

DeepSeek V3.1 Review: 128K Context, Knowledge, Programming & Agent Skills Near Claude 4

DeepSeek V3.1, released on August 19, expands the context length to 128K tokens and moves its knowledge cutoff to July 2024. The author’s benchmarks, backed by detailed prompt examples, code‑generation demos, and performance comparisons, show its programming and agent capabilities now rival Claude 4.

Agent evaluation · Claude 4 · DeepSeek

0 likes · 9 min read

Architects' Tech Alliance

Aug 13, 2025 · Artificial Intelligence

Can DeepSeek Survive the AI Arms Race? A Deep Dive into Its Challenges

DeepSeek, a fast‑rising large‑model contender, boasts impressive NLP and code‑generation capabilities, yet faces steep hurdles—including security concerns, industry‑specific customization gaps, slowing innovation, fierce competition from OpenAI, Google, and Alibaba’s Qwen3, and fragmented open‑source ecosystems—that cast doubt on its long‑term prospects.

AI competition · DeepSeek · model evaluation

0 likes · 12 min read

JD Tech Talk

Aug 6, 2025 · Artificial Intelligence

How to Deploy JoyAgent AI Agent on JD Cloud in Four Simple Steps

This guide walks you through deploying JD Cloud’s open‑source JoyAgent AI agent using the JoyAgent‑Genie image, covering host creation, firewall configuration, model and search engine setup, and service startup, enabling you to access the Genie interface via a public IP.

DeepSeek · JD Cloud · JoyAgent

0 likes · 4 min read

IT Services Circle

Jul 21, 2025 · Artificial Intelligence

Why Is DeepSeek’s R1 Losing Users? Inside the Market Shift and Strategy

DeepSeek’s R1, once hailed as a breakthrough AI model with explosive growth, now faces a sharp decline in user traffic and market share, prompting analysis of user migration to third‑party platforms, performance bottlenecks, and contrasting strategies with rivals like Anthropic.

AI model · Anthropic · DeepSeek

0 likes · 8 min read

Tech Freedom Circle

Jul 17, 2025 · Artificial Intelligence

DeepSeek V3 Architecture Deep Dive: MoE, MLA, DualPipe, FP8 Mixed Precision & Multi‑Token Prediction

This article provides a detailed technical analysis of DeepSeek‑V3, covering its MoE architecture, the novel Multi‑head Latent Attention (MLA) mechanism, the DualPipe pipeline‑parallel algorithm, FP8 mixed‑precision training, and the Multi‑Token Prediction (MTP) inference improvements that together boost performance and efficiency.

DeepSeek · DualPipe · FP8

0 likes · 44 min read

Fun with Large Models

Jul 17, 2025 · Artificial Intelligence

How to Integrate Large Models with LangChain: A Step‑by‑Step Tutorial

This tutorial explains LangChain's core modules and three‑layer architecture, shows how to set up a Python environment, and provides concrete code examples for connecting SiliconFlow Qwen3‑8B and DeepSeek models via the init_chat_model API, including result inspection and references to official documentation.

DeepSeek · LangChain · Model integration

0 likes · 9 min read

Tencent Technical Engineering

Jul 11, 2025 · Artificial Intelligence

How DeepSeek Achieved 15,800+ Tokens/s: Full‑Stack Inference Optimizations

This article details the Angel‑HCF team's end‑to‑end DeepSeek inference optimizations—including PD separation, multi‑layer MTP, EP and DP parallelism, hardware‑aware kernels, and load‑balancing strategies—that boost throughput to over 15,800 tokens per second while keeping per‑token latency under 50 ms.

AI performance · DeepSeek · GPU Utilization

0 likes · 13 min read

ITPUB

Jul 7, 2025 · Operations

How to Build a DeepSeek AI Ops Platform: Architecture & Implementation

This article presents a comprehensive blueprint for constructing a DeepSeek-powered AI Ops platform, detailing the six‑module architecture, data collection stack, AI engine deployment options, application and interaction layers, implementation road‑map, model training, security measures, cost estimates, and risk mitigation strategies.

AI Ops · DeepSeek · Operations Automation

0 likes · 8 min read
