Tagged articles
566 articles
Page 4 of 6
Architects' Tech Alliance
Architects' Tech Alliance
Mar 1, 2025 · Artificial Intelligence

Decoding DeepSeek: A Four‑Tier Capability Framework for Multimodal AI

The article outlines DeepSeek's four‑level capability hierarchy—basic multimodal data fusion and dynamic governance, intermediate domain modeling with causal reasoning and multi‑objective optimization, advanced complex system modeling with digital twins and multi‑agent coordination, and ultimate autonomous evolution features such as concept‑space exploration and self‑programming.

DeepSeekDigital TwinModel Capability
0 likes · 5 min read
Decoding DeepSeek: A Four‑Tier Capability Framework for Multimodal AI
ITPUB
ITPUB
Mar 1, 2025 · Artificial Intelligence

Can DeepSeek AI Replace Your DBA? Real-World Database Scenarios Tested

This article examines DeepSeek, a Chinese AGI‑focused AI model, explains prompt‑engineering techniques, and evaluates its performance across database architecture, development, and operations tasks through concrete Q&A examples, SQL plan analysis, and shell‑script generation, while also discussing its broader impact on professionals, vendors and enterprises.

AIDeepSeekPrompt engineering
0 likes · 10 min read
Can DeepSeek AI Replace Your DBA? Real-World Database Scenarios Tested
Architects' Tech Alliance
Architects' Tech Alliance
Feb 28, 2025 · Artificial Intelligence

DeepSeek V3 & R1: How Their Training Costs Compare to Llama 3.1

The article analyzes DeepSeek’s latest V3 conversational model and R1 inference model, detailing their MoE architecture, training on H800 GPUs costing about $558 k, comparing compute expenses to Meta’s Llama 3.1, and showing that their API pricing is roughly one‑tenth of GPT‑4o for dialogue and one‑twentieth of OpenAI o1 for inference.

AI model analysisDeepSeekinference pricing
0 likes · 4 min read
DeepSeek V3 & R1: How Their Training Costs Compare to Llama 3.1
Alibaba Cloud Developer
Alibaba Cloud Developer
Feb 28, 2025 · Artificial Intelligence

How DeepSeek’s RL‑Powered Time Scaling Is Redefining AI Model Training

DeepSeek’s rapid rise is examined through its RL‑based Time Scaling paradigm, cost‑effective architecture, innovative training pipeline, open‑source strategy, and security challenges, highlighting how these breakthroughs disrupt traditional AI model development, lower resource demands, and influence industry dynamics.

AI model trainingDeepSeekModel architecture
0 likes · 13 min read
How DeepSeek’s RL‑Powered Time Scaling Is Redefining AI Model Training
Alibaba Cloud Native
Alibaba Cloud Native
Feb 27, 2025 · Cloud Native

Build AI-Powered Code Review with Alibaba Cloud Flow and DeepSeek

This guide walks you through creating a custom Cloud Native Flow step that calls DeepSeek to automatically review code in Alibaba Cloud Codeup, covering token creation, API key setup, step publishing, pipeline configuration, and viewing AI‑generated review comments.

Alibaba CloudDeepSeekDevOps
0 likes · 7 min read
Build AI-Powered Code Review with Alibaba Cloud Flow and DeepSeek
IT Services Circle
IT Services Circle
Feb 27, 2025 · Artificial Intelligence

DeepSeek Announces FlashMLA: An Efficient Multi‑Layer Attention Decoding Kernel for Hopper GPUs

DeepSeek’s OpenSourceWeek introduced FlashMLA, a GPU‑optimized MLA decoding kernel for Hopper GPUs that leverages FlashAttention and CUTLASS to dramatically improve large‑model inference performance, with early adoption showing up to 30% higher compute utilization and doubled speed in some scenarios.

DeepSeekFlashMLAGPU
0 likes · 3 min read
DeepSeek Announces FlashMLA: An Efficient Multi‑Layer Attention Decoding Kernel for Hopper GPUs
JavaEdge
JavaEdge
Feb 27, 2025 · Artificial Intelligence

How to Quickly Build a DeepSeek‑Powered Knowledge Base on Tencent Cloud

This guide walks through deploying the full‑feature DeepSeek V3+R1 model on Tencent Cloud, configuring a smart knowledge‑base application, importing documentation, enabling internet search, tuning retrieval parameters, and publishing the app for public use, all without writing code.

AIDeepSeekKnowledge Base
0 likes · 6 min read
How to Quickly Build a DeepSeek‑Powered Knowledge Base on Tencent Cloud
Model Perspective
Model Perspective
Feb 27, 2025 · Artificial Intelligence

Why AI Model Cost Cuts Trigger a New Wave of Nvidia Demand

The article explains how DeepSeek’s low‑cost large‑language‑model training reduces GPU price pressure, yet paradoxically fuels greater demand for Nvidia hardware by lowering entry barriers, illustrating the modern Jevons paradox and its broader economic and societal implications.

AI hardwareDeepSeekGPU demand
0 likes · 8 min read
Why AI Model Cost Cuts Trigger a New Wave of Nvidia Demand
NewBeeNLP
NewBeeNLP
Feb 27, 2025 · Industry Insights

How DeepSeek’s Open‑Source Tools Exploit China‑Specific H800 GPUs to Boost AI Performance

The article analyzes DeepSeek’s three open‑source projects—FlashMLA, DeepEP, and DeepGEMM—showing how they optimize for the China‑only NVIDIA H800 GPU, contrast this with the abundant hardware resources of Western AI firms, and highlight the growing demand for talent that masters both AI models and GPU hardware.

AI hardwareDeepEPDeepGEMM
0 likes · 7 min read
How DeepSeek’s Open‑Source Tools Exploit China‑Specific H800 GPUs to Boost AI Performance
Tencent Cloud Developer
Tencent Cloud Developer
Feb 27, 2025 · Artificial Intelligence

DeepSeek LLM Series (V1‑V3, R1) Technical Overview and Analysis

The DeepSeek technical overview details the evolution from the dense 67 B V1 model through the 236 B MoE‑based V2 and 671 B V3 with FP8 training, to the RL‑only R1 series that learns reasoning without supervision, highlighting innovations such as Grouped‑Query Attention, Multi‑Head Latent Attention, load‑balancing‑free MoE, Multi‑Token Prediction, and knowledge distillation, and reporting state‑of‑the‑art benchmark results and open‑source reproduction projects.

AI researchDeepSeekMixture of Experts
0 likes · 37 min read
DeepSeek LLM Series (V1‑V3, R1) Technical Overview and Analysis
Architects' Tech Alliance
Architects' Tech Alliance
Feb 27, 2025 · Artificial Intelligence

How Inspur Metabrain R1 Server Enables 1000+ Concurrent Users for DeepSeek 671B via SGLang Optimization

The Inspur Metabrain R1 inference server, equipped with FP8 acceleration and a 1128 GB HBM3e memory pool, has been tightly integrated with SGLang 0.4.3 to run the 671‑billion‑parameter DeepSeek R1 model, delivering over 1,000 concurrent user sessions and up to 3,976 tokens/s throughput.

AI serverDeepSeekInference Optimization
0 likes · 5 min read
How Inspur Metabrain R1 Server Enables 1000+ Concurrent Users for DeepSeek 671B via SGLang Optimization
IT Architects Alliance
IT Architects Alliance
Feb 26, 2025 · Artificial Intelligence

DeepSeek Large Model: Core Architecture, Key Technologies, and Training Strategies

The article provides an in‑depth overview of DeepSeek’s large language model, detailing its mixture‑of‑experts and Transformer foundations, novel attention mechanisms, load‑balancing, multi‑token prediction, FP8 mixed‑precision training, and various training regimes such as knowledge distillation and reinforcement learning.

DeepSeekFP8MLA
0 likes · 18 min read
DeepSeek Large Model: Core Architecture, Key Technologies, and Training Strategies
Tencent Technical Engineering
Tencent Technical Engineering
Feb 26, 2025 · Artificial Intelligence

Engineers' Perspectives on DeepSeek: Technical Innovations and Implications

Thirteen engineers praise DeepSeek’s open‑source, reinforcement‑learning‑driven architecture—using FP8 storage and SFT‑free training—to deliver GPT‑4‑level reasoning at one‑twentieth the cost, enabling single‑GPU deployment, lowering barriers for academia and startups, and prompting notable market reactions that could democratize advanced AI.

AI cost reductionDeepSeekFP8
0 likes · 9 min read
Engineers' Perspectives on DeepSeek: Technical Innovations and Implications
58UXD
58UXD
Feb 26, 2025 · Artificial Intelligence

How AI Tools Like Deepseek Transform Design Workflow

This article shows designers how to combine AI services such as Deepseek, JiMeng, Tripo, Tongyi and Jianying to accelerate 3D modeling, PPT creation and short‑video production, turning lengthy manual tasks into fast, creative processes.

3D modelingAIDeepSeek
0 likes · 5 min read
How AI Tools Like Deepseek Transform Design Workflow
Architecture Digest
Architecture Digest
Feb 26, 2025 · Artificial Intelligence

DeepSeek4j 1.4: A Java Integration Framework for DeepSeek AI Models

DeepSeek4j 1.4 introduces a Java‑native framework that fully preserves DeepSeek's chain‑of‑thought and billing features, adds reactive streaming support, and provides a Spring Boot starter for effortless integration, accompanied by quick‑start code, configuration examples, and a built‑in debugging UI.

AIAPIDeepSeek
0 likes · 5 min read
DeepSeek4j 1.4: A Java Integration Framework for DeepSeek AI Models
Architect
Architect
Feb 25, 2025 · Artificial Intelligence

DeepSeek R1: Multi‑Stage Reinforcement Learning, Reward Modeling, and Distillation for a High‑Performance LLM

DeepSeek R1 builds on the DeepSeek V3 base model using a multi‑stage reinforcement learning pipeline—including GRPO optimization, rule‑based reward modeling, supervised fine‑tuning, language‑consistency rewards, rejection sampling, and distillation—to produce a high‑performing, aligned LLM capable of accurate reasoning.

DeepSeekLLM trainingReward Modeling
0 likes · 24 min read
DeepSeek R1: Multi‑Stage Reinforcement Learning, Reward Modeling, and Distillation for a High‑Performance LLM
Architects' Tech Alliance
Architects' Tech Alliance
Feb 25, 2025 · Artificial Intelligence

What Makes DeepSeek‑R1 a Game‑Changer in AIGC? Insights from Peking University

This article summarizes a Peking University lecture on DeepSeek‑R1, detailing its core concepts, advantages, and historical significance, then explains the underlying mechanisms of large‑model AI and AIGC tools, and finally offers practical guidance for selecting and efficiently applying AI solutions.

AI model analysisAIGCDeepSeek
0 likes · 5 min read
What Makes DeepSeek‑R1 a Game‑Changer in AIGC? Insights from Peking University
Alibaba Cloud Big Data AI Platform
Alibaba Cloud Big Data AI Platform
Feb 25, 2025 · Artificial Intelligence

Accelerate DeepSeek‑V2‑Lite Deployment with FlashMLA: A Step‑by‑Step Guide

This tutorial walks users through installing FlashMLA, integrating it with the vLLM framework, downloading the DeepSeek‑V2‑Lite‑Chat model, benchmarking various MLA implementations, and running a local inference demo that shows FlashMLA’s speed advantage on long‑sequence generation.

DeepSeekFlashMLAInferenceOptimization
0 likes · 16 min read
Accelerate DeepSeek‑V2‑Lite Deployment with FlashMLA: A Step‑by‑Step Guide
Alibaba Cloud Big Data AI Platform
Alibaba Cloud Big Data AI Platform
Feb 25, 2025 · Artificial Intelligence

Build a RAG‑Powered Smart Q&A Assistant with Milvus, DeepSeek, and PAI LangStudio

This step‑by‑step guide shows how to assemble a Retrieval‑Augmented Generation (RAG) system using Alibaba Cloud Milvus vector search, the DeepSeek large language model, and PAI LangStudio, covering instance creation, data upload, model deployment, connection setup, flow design, and service invocation.

AI TutorialDeepSeekLLM
0 likes · 9 min read
Build a RAG‑Powered Smart Q&A Assistant with Milvus, DeepSeek, and PAI LangStudio
Architecture Digest
Architecture Digest
Feb 25, 2025 · Artificial Intelligence

DeepSeek Distillation Technology: Overview, Innovations, Architecture, Training, Performance, and Challenges

DeepSeek’s distillation technology combines data and model distillation to transfer knowledge from large teacher models to compact student models, detailing its definitions, principles, key innovations, architecture, training methods, performance gains, and challenges, especially in multimodal contexts.

AI researchDeepSeekknowledge distillation
0 likes · 16 min read
DeepSeek Distillation Technology: Overview, Innovations, Architecture, Training, Performance, and Challenges
Java Web Project
Java Web Project
Feb 25, 2025 · Artificial Intelligence

How DeepSeek4j 1.4 Solves Spring AI’s Chain‑of‑Thought and Streaming Gaps

The article explains why existing Java AI frameworks struggle with DeepSeek R1’s chain‑of‑thought and streaming features, introduces DeepSeek4j 1.4 as a targeted solution, details its core capabilities, and provides a step‑by‑step guide to integrate it with Spring Boot and Project Reactor.

AI integrationDeepSeekJava
0 likes · 5 min read
How DeepSeek4j 1.4 Solves Spring AI’s Chain‑of‑Thought and Streaming Gaps
Efficient Ops
Efficient Ops
Feb 25, 2025 · Artificial Intelligence

How to Deploy DeepSeek R1 Locally: A Step‑by‑Step Guide for AI Enthusiasts

This guide explains what DeepSeek R1 is, compares its full and distilled versions, details hardware requirements for Linux, Windows, and macOS, and provides step‑by‑step instructions for local deployment using Ollama, LM Studio, Docker, and visual interfaces like Open‑WebUI and Dify.

AI modelDeepSeekDify
0 likes · 9 min read
How to Deploy DeepSeek R1 Locally: A Step‑by‑Step Guide for AI Enthusiasts
Tencent Cloud Developer
Tencent Cloud Developer
Feb 25, 2025 · Artificial Intelligence

Deploy DeepSeek AI: Cloud, Local, API – Full Step‑by‑Step Guide

This guide walks developers through the full lifecycle of using DeepSeek—choosing the right deployment method (API, local machine, or private cloud), selecting model sizes based on hardware, configuring Tencent Cloud services, building AI applications, and integrating the model into development tools and mini‑programs.

AI Model DeploymentAI application developmentCloud Native
0 likes · 12 min read
Deploy DeepSeek AI: Cloud, Local, API – Full Step‑by‑Step Guide
CSS Magic
CSS Magic
Feb 25, 2025 · Artificial Intelligence

Two Simple Ways to Access DeepSeek API for Free

This guide shows how to obtain free DeepSeek API access through GitHub Models and SiliconFlow, detailing the required API base URL, key, and model name, how to register, create keys, verify usage with a web chat tool, and compare model choices and platform limits.

APIDeepSeekFree access
0 likes · 7 min read
Two Simple Ways to Access DeepSeek API for Free
DevOps
DevOps
Feb 24, 2025 · Artificial Intelligence

AI‑Powered Full‑Stack Development with DeepSeek and ClinePRO: A 12× Efficiency Case Study

During the Chinese New Year break the author used DeepSeek and AISE ClinePRO to build a complete full‑stack product in only 20 hours, demonstrating a twelve‑fold productivity boost over traditional development while showcasing AI‑driven code generation, multilingual support, automated documentation, and DevOps integration.

AI CodingClinePRODeepSeek
0 likes · 17 min read
AI‑Powered Full‑Stack Development with DeepSeek and ClinePRO: A 12× Efficiency Case Study
AI Algorithm Path
AI Algorithm Path
Feb 24, 2025 · Artificial Intelligence

Flash-MLA: Boosting LLM Inference Speed on Nvidia Hopper GPUs

Flash-MLA is an open‑source GPU kernel optimized for Nvidia Hopper GPUs that compresses the KV cache of multi‑head attention, cutting memory usage by up to 93.3% and delivering 580 TFLOPS compute, thereby dramatically accelerating large‑language‑model inference while lowering cost.

DeepSeekFlash-MLAGPU Optimization
0 likes · 8 min read
Flash-MLA: Boosting LLM Inference Speed on Nvidia Hopper GPUs
21CTO
21CTO
Feb 24, 2025 · Artificial Intelligence

From Transformers to DeepSeek-R1: Evolution of Large Language Models

Since the 2017 introduction of the Transformer architecture, this article chronicles the rapid development of large language models—including BERT, GPT series, multimodal systems, and the cost‑effective DeepSeek‑R1—highlighting key innovations, scaling trends, alignment techniques, and their transformative impact across AI research and industry.

AI evolutionDeepSeekLLM History
0 likes · 23 min read
From Transformers to DeepSeek-R1: Evolution of Large Language Models
Architects' Tech Alliance
Architects' Tech Alliance
Feb 24, 2025 · Artificial Intelligence

NSA: Hardware‑Optimized Sparse Attention Mechanism from DeepSeek, Peking University and University of Washington

The NSA mechanism introduces a three‑branch hardware‑optimized sparse attention architecture—token compression, token selection, and sliding window—combined with learnable gating to balance global and local context, dramatically improving inference speed and efficiency for long‑context large language models.

AI ArchitectureDeepSeekHardware acceleration
0 likes · 5 min read
NSA: Hardware‑Optimized Sparse Attention Mechanism from DeepSeek, Peking University and University of Washington
Huawei Cloud Developer Alliance
Huawei Cloud Developer Alliance
Feb 24, 2025 · Artificial Intelligence

Generate Game Code Instantly with DeepSeek V3 on Huawei Cloud

This tutorial walks you through configuring a Huawei Cloud host, installing the AutoGen framework, setting up DeepSeek V3 model API keys, and using the model to automatically generate Python code for a graphical two‑player battle game, complete with step‑by‑step instructions and sample commands.

AI code generationAutoGenDeepSeek
0 likes · 9 min read
Generate Game Code Instantly with DeepSeek V3 on Huawei Cloud
AI Large Model Application Practice
AI Large Model Application Practice
Feb 24, 2025 · Artificial Intelligence

How Web Agents Combine LLMs and Browser Automation to Perform Real‑World Tasks

This article explains what Web Agents are, their ReAct‑style reasoning loop, key implementation technologies such as observation parsing, multimodal models, and browser control tools like Selenium and Playwright, and demonstrates building a DeepSeek‑powered Web Agent with the Browser‑use framework, including code samples and performance insights.

Browser AutomationDeepSeekLLM
0 likes · 11 min read
How Web Agents Combine LLMs and Browser Automation to Perform Real‑World Tasks
Alibaba Cloud Big Data AI Platform
Alibaba Cloud Big Data AI Platform
Feb 24, 2025 · Artificial Intelligence

How to Distill and Fine‑Tune DeepSeek R1 with Qwen on Alibaba Cloud PAI

This guide walks you through the complete workflow of preparing instruction data, deploying the DeepSeek‑R1 teacher model, using Alibaba Cloud PAI to generate teacher responses, distilling a smaller Qwen2.5‑7B‑Instruct student model, fine‑tuning it, and deploying the final service, with performance comparisons on several math‑reasoning benchmarks.

Alibaba Cloud PAIDeepSeek
0 likes · 17 min read
How to Distill and Fine‑Tune DeepSeek R1 with Qwen on Alibaba Cloud PAI
Java Web Project
Java Web Project
Feb 23, 2025 · Artificial Intelligence

Build Your First AI Chatbot with Spring Boot and DeepSeek LLM

This guide walks you through creating a Spring Boot project, configuring DeepSeek's large language model via SiliconFlow, setting up OpenAI‑compatible parameters, and implementing a REST controller that returns weather forecasts using the model, complete with step‑by‑step code snippets, configuration files, and deployment instructions.

AIChatbotDeepSeek
0 likes · 7 min read
Build Your First AI Chatbot with Spring Boot and DeepSeek LLM
Open Source Linux
Open Source Linux
Feb 23, 2025 · Artificial Intelligence

How Chinese Universities Are Rapidly Deploying DeepSeek AI Models on Campus

After a winter break surge, DeepSeek AI models have been swiftly adopted across Chinese universities, enabling local deployments for teaching, research, and campus services, while facing bans and security concerns abroad, highlighting both rapid domestic integration and international challenges.

AI modelsChinaDeepSeek
0 likes · 13 min read
How Chinese Universities Are Rapidly Deploying DeepSeek AI Models on Campus
Su San Talks Tech
Su San Talks Tech
Feb 23, 2025 · Artificial Intelligence

How DeepSeek’s Distillation Breaks AI Model Limits: Core Principles & Performance

This article explores DeepSeek’s cutting‑edge distillation technology, detailing its definition, underlying principles, innovative data‑model fusion, architecture choices, training strategies, performance gains over large language models, and the remaining challenges in knowledge transfer and multimodal data processing.

AI OptimizationDeepSeekMultimodal Learning
0 likes · 16 min read
How DeepSeek’s Distillation Breaks AI Model Limits: Core Principles & Performance
macrozheng
macrozheng
Feb 22, 2025 · Artificial Intelligence

Choosing the Right DeepSeek‑R1 Model: Hardware Needs & Use Cases Explained

This guide compares DeepSeek‑R1’s 1.5B/7B/8B, 14B/32B, and 70B/671B versions, detailing their characteristics, typical applications, and the specific CPU, memory, and GPU specifications required for local deployment, helping you select the optimal model for your resources.

AI Model DeploymentDeepSeekHardware Requirements
0 likes · 7 min read
Choosing the Right DeepSeek‑R1 Model: Hardware Needs & Use Cases Explained
Rare Earth Juejin Tech Community
Rare Earth Juejin Tech Community
Feb 22, 2025 · Artificial Intelligence

Deploying DeepSeek Locally with Ollama, Building Personal and Organizational Knowledge Bases, and Integrating with Spring AI

This guide explains how to locally deploy the DeepSeek large‑language model using Ollama on Windows, macOS, and Linux, configure model storage and CORS, build personal and enterprise RAG knowledge bases with AnythingLLM and Open WebUI, and integrate the model into a Spring AI application via Docker and Docker‑Compose.

DeepSeekDockerKnowledge Base
0 likes · 16 min read
Deploying DeepSeek Locally with Ollama, Building Personal and Organizational Knowledge Bases, and Integrating with Spring AI
Architect
Architect
Feb 21, 2025 · Artificial Intelligence

DeepSeek Model Innovations: Architecture, Training Methods, and Performance Evaluation

This article reviews DeepSeek's recent breakthroughs, including the MLA attention redesign, GRPO alignment algorithm, MoE enhancements, multi‑stage training pipelines (SFT, RL, preference tuning, distillation), and comparative performance against GPT‑4o‑Mini and Llama 3.1, highlighting both strengths and remaining challenges.

DeepSeekMixture of ExpertsModel Evaluation
0 likes · 16 min read
DeepSeek Model Innovations: Architecture, Training Methods, and Performance Evaluation
Alibaba Cloud Developer
Alibaba Cloud Developer
Feb 21, 2025 · Artificial Intelligence

Build a Plain‑Explanation AI Agent with DeepSeek‑R1: Prompt Templates & SVG Tips

This article introduces the “Plain Explanation Expert” AI agent built on DeepSeek‑R1, explains its prompt framework—including role, skills, and output format—demonstrates usage through direct prompt copying and smart‑agent configuration in tools like Cherry Studio, and provides concrete examples, memory tricks, and SVG visualizations to help users quickly master complex concepts.

AI promptingDeepSeekSVG visualization
0 likes · 15 min read
Build a Plain‑Explanation AI Agent with DeepSeek‑R1: Prompt Templates & SVG Tips
Alibaba Cloud Infrastructure
Alibaba Cloud Infrastructure
Feb 21, 2025 · Artificial Intelligence

Deploying DeepSeek R1 Model Inference on ACK Edge with Virtual Nodes and Serverless GPU

This article explains how to use Alibaba Cloud ACK Edge to manage on‑premise GPU resources and seamlessly fall back to cloud‑based ACS Serverless GPU via virtual nodes for deploying DeepSeek R1 inference, covering environment preparation, model download, storage setup, custom scheduling, and scaling strategies.

ACK@EdgeDeepSeekGPU
0 likes · 16 min read
Deploying DeepSeek R1 Model Inference on ACK Edge with Virtual Nodes and Serverless GPU
Top Architect
Top Architect
Feb 21, 2025 · Artificial Intelligence

DeepSeek4j 1.4: Java Integration Framework for DeepSeek with Full Chain‑of‑Thought and Streaming Support

The article introduces DeepSeek4j 1.4, a Java‑based framework that overcomes Spring AI’s limitations by fully preserving DeepSeek’s chain‑of‑thought and billing features, adding reactive streaming, providing Spring Boot starter integration, and offering quick‑start code samples and configuration guidance.

AIDeepSeekJava
0 likes · 8 min read
DeepSeek4j 1.4: Java Integration Framework for DeepSeek with Full Chain‑of‑Thought and Streaming Support
Data Thinking Notes
Data Thinking Notes
Feb 20, 2025 · Artificial Intelligence

How to Deploy DeepSeek R1 671B Model Locally with Ollama: A Step‑by‑Step Guide

This article provides a comprehensive tutorial on locally deploying the 671‑billion‑parameter DeepSeek R1 model using Ollama, covering model selection, hardware requirements, dynamic quantization, detailed installation steps, performance observations, and practical recommendations for consumer‑grade hardware.

AI model optimizationDeepSeekDynamic Quantization
0 likes · 14 min read
How to Deploy DeepSeek R1 671B Model Locally with Ollama: A Step‑by‑Step Guide
dbaplus Community
dbaplus Community
Feb 20, 2025 · Artificial Intelligence

Can DeepSeek AI Replace DBA Tasks? Real-World Database Scenarios Tested

This article examines DeepSeek, a Chinese AGI‑focused AI model, and demonstrates how prompt engineering can enable it to assist database architects, development DBAs, and operations DBAs across various real‑world scenarios, while also discussing its broader impact on individuals, vendors, and enterprises.

AI for DBAsDatabase ArchitectureDeepSeek
0 likes · 10 min read
Can DeepSeek AI Replace DBA Tasks? Real-World Database Scenarios Tested
Top Architect
Top Architect
Feb 20, 2025 · Artificial Intelligence

Deploying DeepSeek R1 671B Model Locally with Ollama and Dynamic Quantization

This guide explains how to download, quantize, and run the full‑size 671‑billion‑parameter DeepSeek R1 model on local hardware using Ollama, covering model selection, hardware requirements, step‑by‑step deployment commands, optional web UI setup, performance observations, and practical recommendations.

AIDeepSeekDynamic Quantization
0 likes · 16 min read
Deploying DeepSeek R1 671B Model Locally with Ollama and Dynamic Quantization
Su San Talks Tech
Su San Talks Tech
Feb 20, 2025 · Artificial Intelligence

Generate AI‑Powered PPTs in WPS with DeepSeek – Step‑by‑Step Guide

This guide shows how to use the built‑in DeepSeek integration in the latest WPS version to create AI‑generated PowerPoint presentations, covering installation, activation, prompt design, template selection, custom template upload, and final editing, all without extra software or API keys.

AI PPTDeepSeekWPS
0 likes · 5 min read
Generate AI‑Powered PPTs in WPS with DeepSeek – Step‑by‑Step Guide
Java Architect Essentials
Java Architect Essentials
Feb 19, 2025 · Artificial Intelligence

Integrating DeepSeek AI Assistant into IntelliJ IDEA for Java Development

This guide walks Java developers through preparing the environment, installing the CodeGPT plugin, configuring DeepSeek API keys, and using the AI-powered code assistant within IntelliJ IDEA, including code completion, explanation, and question‑answer features, with usage statistics and sample code.

AI code assistantCodeGPTDeepSeek
0 likes · 9 min read
Integrating DeepSeek AI Assistant into IntelliJ IDEA for Java Development
Data Thinking Notes
Data Thinking Notes
Feb 19, 2025 · Artificial Intelligence

DeepSeek Evolution: Key Technical Highlights from V1 to R1

This article examines DeepSeek’s various versions, detailing their core modules, underlying principles, architecture diagrams, and performance metrics, while illustrating the internal logic and advantages of each model to guide enthusiasts, professionals, and practitioners toward deeper AI innovation insights.

AIDeepSeekModel architecture
0 likes · 4 min read
DeepSeek Evolution: Key Technical Highlights from V1 to R1
Architect's Alchemy Furnace
Architect's Alchemy Furnace
Feb 19, 2025 · Artificial Intelligence

How DeepSeek Beats GPT-4 with 10× Less Compute: Inside the AI Efficiency Revolution

This article examines DeepSeek's breakthrough AI techniques—including a revamped MoE architecture, aggressive data distillation, ultra‑low‑energy training, novel multi‑stage training strategies, and custom AI chips—that enable a 7B model to rival GPT‑4 while consuming a fraction of the resources.

AI efficiencyData distillationDeepSeek
0 likes · 9 min read
How DeepSeek Beats GPT-4 with 10× Less Compute: Inside the AI Efficiency Revolution
AIWalker
AIWalker
Feb 19, 2025 · Artificial Intelligence

DeepSeek’s NSA Attention Cuts Inference Time 11× – CEO Liang Co‑author

DeepSeek introduces the NSA sparse attention mechanism, combining dynamic hierarchical sparsity, coarse token compression and fine token selection to achieve up to 11.6× faster inference, lower pre‑training cost, and superior benchmark performance across general, long‑context, and chain‑of‑thought tasks.

BenchmarkDeepSeekLLM optimization
0 likes · 9 min read
DeepSeek’s NSA Attention Cuts Inference Time 11× – CEO Liang Co‑author
Architect's Alchemy Furnace
Architect's Alchemy Furnace
Feb 19, 2025 · Artificial Intelligence

DeepSeek’s Self‑Correction: Transforming AI Reliability and Safety

The article explores DeepSeek’s innovative self‑correction system—combining a Mixture‑of‑Experts architecture with reinforcement‑learning feedback—to achieve real‑time error detection, dynamic knowledge‑graph updates, and enhanced safety in high‑risk fields like autonomous driving and medical diagnostics.

AI SafetyDeepSeekMixture of Experts
0 likes · 9 min read
DeepSeek’s Self‑Correction: Transforming AI Reliability and Safety
Alibaba Cloud Native
Alibaba Cloud Native
Feb 19, 2025 · Cloud Native

Engineering Traffic Management for DeepSeek: Cloud‑Native Deployment Strategies

This article outlines practical cloud‑native deployment options for DeepSeek models, explains common engineering challenges such as traffic spikes, latency, security, quota control, and provides detailed AI‑gateway solutions—including fallback, content safety, API key management, gray‑release routing, caching, and observability—to ensure reliable large‑model applications.

DeepSeekModel Deploymenttraffic management
0 likes · 9 min read
Engineering Traffic Management for DeepSeek: Cloud‑Native Deployment Strategies
Java Tech Enthusiast
Java Tech Enthusiast
Feb 19, 2025 · Artificial Intelligence

AI Agent Development Guide: Building Intelligent Agents with Coze Platform

The guide explains how to build AI agents—digital labor forces that follow instructions, plan tasks, and use tools—using ByteDance’s no‑code Coze platform, outlining a 3‑phase, 10‑step framework, emphasizing business‑first design, tool integration, and concise, scenario‑driven development with real‑world case studies.

AI AgentAgent Development FrameworkCoze Platform
0 likes · 7 min read
AI Agent Development Guide: Building Intelligent Agents with Coze Platform
Architects' Tech Alliance
Architects' Tech Alliance
Feb 19, 2025 · Industry Insights

Why DeepSeek One‑Stop AI Machines Are Redefining Private Model Deployment

The surge in demand for private AI deployment has prompted multiple vendors to launch DeepSeek one‑stop machines—integrated hardware solutions that support the full DeepSeek model family, offering higher stability, easier setup, customization, cost savings, and data security across diverse industry scenarios.

AI InfrastructureAI hardwareDeepSeek
0 likes · 7 min read
Why DeepSeek One‑Stop AI Machines Are Redefining Private Model Deployment
Alibaba Cloud Big Data AI Platform
Alibaba Cloud Big Data AI Platform
Feb 19, 2025 · Artificial Intelligence

Build a DeepSeek AI Assistant with PAI‑RAG: Internet Search & Enterprise Knowledge Base

This guide walks you through using Alibaba Cloud's PAI‑RAG platform to deploy a DeepSeek large‑language‑model assistant that combines real‑time web search with an enterprise knowledge‑base, covering deployment, network‑search configuration, testing, and advanced enterprise features.

AI AssistantDeepSeekEnterprise Knowledge Base
0 likes · 10 min read
Build a DeepSeek AI Assistant with PAI‑RAG: Internet Search & Enterprise Knowledge Base
Tencent Cloud Developer
Tencent Cloud Developer
Feb 19, 2025 · Industry Insights

Why Every Enterprise Needs a Knowledge‑Management System in the LLM Era

The article analyzes how the shift from data‑driven to knowledge‑driven operations, powered by large language models like DeepSeek, forces companies to build dynamic knowledge‑management platforms that integrate personal and corporate knowledge, improve efficiency, and create sustainable competitive advantage.

DeepSeekDigital TransformationEnterprise AI
0 likes · 14 min read
Why Every Enterprise Needs a Knowledge‑Management System in the LLM Era
Architects' Tech Alliance
Architects' Tech Alliance
Feb 18, 2025 · Artificial Intelligence

How DeepSeek’s Latest Models Redefine AI Performance and Industry Adoption

The DeepSeek report details rapid model releases from 2024 onward, highlighting innovations such as model distillation, a 671 B MoE architecture, FP8 mixed‑precision, and the Janus‑Pro multimodal framework, while also documenting major cloud and chip providers' integration of these models into their services.

AI industry adoptionDeepSeekMoE architecture
0 likes · 10 min read
How DeepSeek’s Latest Models Redefine AI Performance and Industry Adoption
Architect
Architect
Feb 18, 2025 · Artificial Intelligence

DeepSeek‑R1: Training Innovations and Architecture for High‑Performance Reasoning LLMs

The article explains how DeepSeek‑R1 advances large language model reasoning by releasing a lightweight distilled version, sharing a complete training pipeline—including pre‑training, supervised fine‑tuning, and reinforcement learning—introducing long‑chain reasoning data, a transitional inference model, and a comprehensive RL optimization that together yield strong mathematical and logical capabilities.

AIDeepSeekModel Training
0 likes · 10 min read
DeepSeek‑R1: Training Innovations and Architecture for High‑Performance Reasoning LLMs
Alibaba Cloud Native
Alibaba Cloud Native
Feb 18, 2025 · Cloud Native

Deploy DeepSeek‑R1 on Alibaba Cloud ACK One Using ACS GPU in Minutes

This guide shows how to overcome on‑premise compute limits by registering a local Kubernetes cluster to Alibaba Cloud ACK One, provisioning ACS GPU resources, and deploying the DeepSeek‑R1 inference model with the vLLM framework through a series of concrete commands and YAML configurations.

ACK OneACS GPUDeepSeek
0 likes · 15 min read
Deploy DeepSeek‑R1 on Alibaba Cloud ACK One Using ACS GPU in Minutes
Architects' Tech Alliance
Architects' Tech Alliance
Feb 18, 2025 · Artificial Intelligence

How to Distill DeepSeek LLMs into Lightweight Models for Local Deployment

This article explains DeepSeek's knowledge‑distillation approach for compressing large language models into small, efficient student models, details step‑by‑step local deployment requirements, performance optimizations, and highlights the cost, privacy, and application benefits of running the distilled model on‑premise.

AI inferenceDeepSeekLLM
0 likes · 10 min read
How to Distill DeepSeek LLMs into Lightweight Models for Local Deployment
Architects' Tech Alliance
Architects' Tech Alliance
Feb 18, 2025 · Industry Insights

How DeepSeek V3 Is Driving a New Wave of Communication‑Hardware Demand

DeepSeek V3 cuts training to 2.788 M H800 GPU‑hours with FP8 mixed‑precision and a fully optimized framework, slashes token costs by 96% versus ChatGPT O1, and its efficient inference and model‑compression techniques are reshaping AI‑agent development, spurring demand for low‑latency, high‑bandwidth optical modules and edge‑computing infrastructure.

AICommunication IndustryDeepSeek
0 likes · 5 min read
How DeepSeek V3 Is Driving a New Wave of Communication‑Hardware Demand
Top Architecture Tech Stack
Top Architecture Tech Stack
Feb 18, 2025 · Backend Development

Step-by-Step Guide: Integrating DeepSeek API with a WeChat Public Account on a Cloud Server

This tutorial walks beginners through obtaining a DeepSeek API key, configuring a WeChat public account, setting up an Alibaba Cloud ECS instance, installing required dependencies, editing configuration files, and running the chatgpt‑on‑wechat project to enable AI‑powered replies via the WeChat platform.

DeepSeekPythonTutorial
0 likes · 13 min read
Step-by-Step Guide: Integrating DeepSeek API with a WeChat Public Account on a Cloud Server
Big Data Tech Team
Big Data Tech Team
Feb 18, 2025 · Artificial Intelligence

How DeepSeek Trains and Optimizes Its LLMs: From Pre‑training to Reasoning Models

This article breaks down DeepSeek's LLM training pipeline, explaining the massive pre‑training phase, instruction fine‑tuning, reinforcement‑learning‑from‑human‑feedback, and the distinct roles of its V3 instruction model and R1 reasoning model, while also highlighting performance metrics and current limitations.

DeepSeekLLMModel Training
0 likes · 8 min read
How DeepSeek Trains and Optimizes Its LLMs: From Pre‑training to Reasoning Models
Architecture & Thinking
Architecture & Thinking
Feb 18, 2025 · Artificial Intelligence

Why Is DeepSeek Server Overloaded? Causes and Practical Workarounds

The article investigates why DeepSeek frequently returns a “server busy” message, analyzing factors such as sudden traffic spikes, compute and bandwidth limitations, security attacks, and maintenance policies, and then offers actionable solutions including query optimization, off‑peak usage, third‑party cloud platforms, and local deployment.

AIDeepSeekModel Deployment
0 likes · 10 min read
Why Is DeepSeek Server Overloaded? Causes and Practical Workarounds
DevOps
DevOps
Feb 17, 2025 · Artificial Intelligence

Microsoft OmniParser V2.0: A Visual Agent Parsing Framework for Enhanced UI Understanding

Microsoft's OmniParser V2.0 transforms large language models such as DeepSeek‑R1, GPT‑4o, and Qwen‑2.5VL into visual AI agents by accurately detecting interactive UI elements, providing semantic descriptions, and generating structured representations that boost inference speed, reduce latency by 60%, and dramatically improve benchmark accuracy.

AI AgentComputer VisionDeepSeek
0 likes · 7 min read
Microsoft OmniParser V2.0: A Visual Agent Parsing Framework for Enhanced UI Understanding
Architect's Alchemy Furnace
Architect's Alchemy Furnace
Feb 17, 2025 · Artificial Intelligence

24 Proven Prompt Formulas to Unlock DeepSeek’s Full Potential

Discover a comprehensive collection of 24 structured prompting techniques—from basic role‑play formulas to advanced cross‑disciplinary and managerial frameworks—designed to help users of DeepSeek and other large language models craft precise, high‑impact queries that dramatically improve response quality and efficiency.

AI promptingDeepSeekPrompt engineering
0 likes · 12 min read
24 Proven Prompt Formulas to Unlock DeepSeek’s Full Potential
Architect
Architect
Feb 17, 2025 · Artificial Intelligence

Deploying DeepSeek R1 on Huawei Ascend 910B: Weight Conversion and Troubleshooting

This article details a step‑by‑step deployment of the DeepSeek R1 model on Huawei Ascend 910B NPUs, covering FP8‑to‑BF16 weight conversion, custom container image preparation, configuration of MindIE services, common pitfalls, and practical troubleshooting tips for large‑scale inference.

DeepSeekHuawei AscendMindIE
0 likes · 8 min read
Deploying DeepSeek R1 on Huawei Ascend 910B: Weight Conversion and Troubleshooting
Java One
Java One
Feb 17, 2025 · Artificial Intelligence

How to Get Free Access to DeepSeek R1 Across Major Cloud Platforms

This guide walks you through using DeepSeek R1 via the official website or popular third‑party cloud services, compares free token quotas, explains token accounting, and provides step‑by‑step instructions for configuring API access and AI clients such as Chatbox, Cherry Studio, and Dify.

AI clientAPIDeepSeek
0 likes · 11 min read
How to Get Free Access to DeepSeek R1 Across Major Cloud Platforms
AI Large Model Application Practice
AI Large Model Application Practice
Feb 17, 2025 · Artificial Intelligence

Mastering Structured Output for DeepSeek‑R1 with LangChain, LangGraph, and ReAct Agents

DeepSeek‑R1 excels at deep reasoning but lacks native structured output; this guide explains why structured output matters, outlines common API‑level techniques, and provides three practical solutions—using an auxiliary model with a LangChain chain, a LangGraph workflow, and a ReAct agent—complete with code snippets and JSON‑mode tips.

DeepSeekLLMLangChain
0 likes · 12 min read
Mastering Structured Output for DeepSeek‑R1 with LangChain, LangGraph, and ReAct Agents