Tagged articles

DeepSeek

623 articles · Page 3 of 7
DataFunTalk
Jul 6, 2025 · Artificial Intelligence

Why DeepSeek’s Low‑Cost Tokenomics Are Losing Market Share to Anthropic and OpenAI

The article analyses DeepSeek’s unconventional low‑price, high‑latency strategy, its token‑pricing and KPI trade‑offs, and compares its performance, hardware choices, and market share with Anthropic, OpenAI, Google and other AI providers, while also discussing the rise of inference‑as‑a‑service and rumors about DeepSeek R2.

AI models · DeepSeek · Tokenomics
0 likes · 14 min read
DataFunTalk
Jul 5, 2025 · Artificial Intelligence

DeepSeek R1T2 Chimera: Faster, High‑Performance LLM with Assembly of Experts

The DeepSeek R1T2 Chimera model, an open‑source LLM built with Assembly of Experts technology, delivers up to 200% faster inference than R1‑0528, surpasses R1 on GPQA‑Diamond and AIME‑24 benchmarks, and offers a 671‑billion‑parameter MoE architecture, though it lacks function‑calling support and trails the highest‑end R1‑0528 on the toughest tests.

AI · Assembly of Experts · DeepSeek
0 likes · 5 min read
Java Architecture Diary
Jun 25, 2025 · Artificial Intelligence

Build a Text‑to‑SQL Chatbot with Spring AI and DeepSeek LLM

This tutorial walks through creating a natural‑language‑to‑SQL chatbot using Spring AI, configuring a MySQL school database with Flyway, defining system prompts for a DeepSeek LLM, implementing service beans and a REST API, and interacting with the bot via curl commands.

Chatbot · DeepSeek · Java
0 likes · 15 min read
Sohu Tech Products
Jun 11, 2025 · Artificial Intelligence

How DeepSeek and TiDB AI Are Redefining Data Engines for the Large‑Model Era

This article explores DeepSeek's open‑source large‑model breakthroughs, PingCAP's AI‑enhanced database roadmap, TiDB.AI's retrieval‑augmented generation framework, the unified TiDB data engine, and practical Q&A insights on knowledge‑graph construction, vector search, and AI‑driven SQL generation.

AI · DeepSeek · Knowledge Graph
0 likes · 15 min read
DataFunTalk
Jun 11, 2025 · Backend Development

Master Modern Web Scraping: From Classic Tools to DeepSeek AI Integration

This article provides a comprehensive overview of web‑scraping technologies, compares popular tools such as requests, BeautifulSoup and Selenium, introduces AI‑assisted crawling with DeepSeek, and walks through practical steps for using BrightData’s platform to collect industry data, complete with ready‑to‑run Python code.

BrightData · DeepSeek · Python
0 likes · 13 min read
ITFLY8 Architecture Home
Jun 10, 2025 · Artificial Intelligence

DeepSeek Evolution: Technical Highlights, Architecture, and Performance Explained

This article examines DeepSeek’s various versions, detailing their core modules, underlying principles, architectural diagrams, and performance metrics, offering practical guidance for enthusiasts, professionals, and practitioners while inspiring further exploration of artificial intelligence innovations.

DeepSeek · Tech Overview · artificial-intelligence
0 likes · 2 min read
Java Architecture Diary
Jun 5, 2025 · Artificial Intelligence

Unlock AI Reasoning: How Ollama’s New ‘Thinking’ Feature Works

Version 0.9.0 of Ollama introduces a ‘thinking’ control that lets users view and manage the AI model’s reasoning process, with detailed CLI commands, REST API usage, model support list, scripting options, and advanced Modelfile configurations for models like DeepSeek R1 and Qwen 3.

AI reasoning · CLI · DeepSeek
0 likes · 6 min read
Data Thinking Notes
Jun 4, 2025 · Artificial Intelligence

How DeepSeek AI Model is Revolutionizing China’s State Enterprises – Over 100 Deployment Cases

The DeepSeek large language model has been extensively deployed across more than 100 central and local Chinese state‑owned enterprises, spanning sectors such as energy, manufacturing, transportation, finance, telecommunications, construction, and public services, driving intelligent transformation through applications like smart scheduling, risk assessment, intelligent customer service, and AI‑enhanced office automation.

AI Deployment · DeepSeek · State‑owned enterprises
0 likes · 38 min read
Java Web Project
Jun 4, 2025 · Artificial Intelligence

Why DeepSeek V3 Stands Out: Architecture, Performance, and Open‑Source Edge

The article analyzes DeepSeek's rapid adoption, detailing its seven core models, the third‑generation MoE architecture, FP8 mixed‑precision training, 128K context window, benchmark superiority on MMLU/HumanEval/CMMLU, low training cost, and fully open‑source release, while also introducing a companion guide for developers.

AI Architecture · DeepSeek · FP8 training
0 likes · 9 min read
Smart Era Software Development
Jun 4, 2025 · Artificial Intelligence

Beyond a Minor Update: DeepSeek's Coding Ability Leaps Forward

The DeepSeek‑R1 model upgrade dramatically improves reasoning depth and code‑generation performance, matching top‑tier models on benchmarks like LiveCodeBench, while industry experts warn that such advances could reshape software engineering roles and devalue pure coding skills.

AI impact on jobs · AI programming · DeepSeek
0 likes · 5 min read
ITFLY8 Architecture Home
May 30, 2025 · Artificial Intelligence

How to Use AI (DeepSeek) for Efficient Official Document Drafting

This guide explains how DeepSeek can assist in quickly generating, structuring, and polishing official documents by defining clear requirements, building logical frameworks, filling content modules, adjusting style, and performing thorough proofreading, while also providing numerous prompt templates for common government and corporate paperwork.

AI writing · DeepSeek · Government
0 likes · 13 min read
Smart Era Software Development
May 30, 2025 · Artificial Intelligence

How Tencent’s TRMT Boosted DeepSeek’s Communication: A Chinese Open‑Source Success

Tencent’s Star‑Network team partnered with DeepSeek to open‑source the DeepEP communication library, then used its self‑developed TRMT stack to overcome RoCE limitations, achieving up to 100% speedup on RoCEv2 and 30% on InfiniBand, cutting training costs and inference latency for large MoE models.

AI training · DeepEP · DeepSeek
0 likes · 8 min read
Efficient Ops
May 29, 2025 · Artificial Intelligence

DeepSeek R1 0528 Update: New Features, Performance Gains Over OpenAI o3

DeepSeek quietly launched the R1 0528 model, which early testers report matches OpenAI’s o3 in benchmarks and style while adding deeper chain‑of‑thought reasoning, better writing output, and extended thinking windows. The announcement is followed by a promotion for the GOPS Global Ops Conference.

AI performance · Chain-of-Thought · DeepSeek
0 likes · 3 min read
Alibaba Cloud Developer
May 27, 2025 · Artificial Intelligence

How to Build AI-Powered Java Apps with Spring AI and DeepSeek

This guide walks Java developers through integrating Spring AI with large‑model services such as DeepSeek, covering setup, API key configuration, code examples for synchronous and streaming calls, reactive implementation, monitoring with Actuator, and compatibility with OpenAI‑style APIs.

AI integration · DeepSeek · Java
0 likes · 9 min read
Java One
May 26, 2025 · Artificial Intelligence

Integrate ProxyAI with JetBrains IDEA for Seamless AI‑Powered Coding

This guide walks you through installing the ProxyAI plugin in JetBrains IDEA, configuring default and DeepSeek‑V3 models, obtaining API keys, and using features like chat, FIM code completion, bug detection, and code explanation to boost development efficiency.

AI code assistant · DeepSeek · IDEA
0 likes · 8 min read
IT Services Circle
May 25, 2025 · Artificial Intelligence

DeepSeek Core Technologies and Model Innovations: DeepSeek‑V3 and DeepSeek‑R1 Technical Overview

The article provides a detailed technical overview of DeepSeek's flagship large language models, DeepSeek‑V3 and DeepSeek‑R1, describing their MoE architecture, training frameworks, reinforcement‑learning based fine‑tuning, inference optimizations, and the broader impact of these innovations on the AI landscape while also promoting related books and resources.

AI · DeepSeek · Large Language Model
0 likes · 10 min read
Java Architecture Diary
May 21, 2025 · Artificial Intelligence

Spring AI 1.0 Launch: Production‑Ready Java AI Framework Unveiled

Spring AI 1.0, the first production‑grade Java AI framework, introduces ready‑to‑use APIs, seamless model integration, enterprise‑level RAG engine, smart tool calling, and three development modes, empowering developers to rapidly build, customize, and fully control AI applications with major model providers like OpenAI, Anthropic, DeepSeek.

AI Framework · DeepSeek · Java AI
0 likes · 13 min read
Architect's Guide
May 18, 2025 · Backend Development

Integrating DeepSeek AI with a WeChat Public Account: A Step‑by‑Step Backend Tutorial

This tutorial walks beginners through obtaining a DeepSeek API key, setting up an Alibaba Cloud ECS instance, configuring the server and WeChat public platform, installing required Python dependencies, editing configuration files, and finally running the chatbot so the public account can interact with the DeepSeek large‑language model.

API · Cloud · DeepSeek
0 likes · 12 min read
DataFunSummit
May 17, 2025 · Artificial Intelligence

Integrating Knowledge Graphs with DeepSeek AI for Enterprise Knowledge Management

This presentation explores how combining knowledge graphs with DeepSeek large‑model agents can revolutionize enterprise knowledge management, detailing DeepSeek’s technical strengths, the graph‑model complementarity paradigm, various knowledge types, practical frameworks, case studies, and future outlooks for AI‑enhanced intelligent systems.

DeepSeek · Enterprise Knowledge Management · Large Language Model
0 likes · 23 min read
Architects' Tech Alliance
May 16, 2025 · Industry Insights

Can DeepSeek Survive the AI Arms Race? A Deep Dive into Its Challenges and Competition

The article provides a comprehensive analysis of DeepSeek’s rise in the large‑model market, examining its technical merits, security and customization hurdles, slowing innovation, fierce competition from OpenAI, Google and Alibaba’s Qwen3, as well as the fragility of its open‑source ecosystem and data preparation, ultimately questioning its long‑term viability.

AI models · DeepSeek · Industry Analysis
0 likes · 13 min read
Data Thinking Notes
May 13, 2025 · Information Security

DeepSeek Security: Top 5 Model Threats and How to Defend

This report examines DeepSeek’s security and reliability by detailing five core model threats—DDoS attacks, unlimited inference, vulnerability exploitation, data poisoning, and jailbreak—alongside two private‑deployment risks and three external threats such as counterfeit apps, offering targeted mitigation strategies to help users safely adopt the platform.

AI security · DeepSeek · Model safety
0 likes · 8 min read
Architect's Guide
May 13, 2025 · Artificial Intelligence

DeepSeek Model Distillation Technology: Overview, Innovations, Architecture, Training, Performance, and Challenges

This article provides a comprehensive overview of DeepSeek's model distillation technology, detailing its definition, key innovations, architecture, training methods, performance gains, and the remaining challenges such as the implicit performance ceiling and multimodal data distillation.

DeepSeek · Knowledge Transfer · ai-optimization
0 likes · 14 min read
Linux Kernel Journey
May 8, 2025 · Artificial Intelligence

How Tencent’s TRMT Tech Delivered a Huge Speedup to DeepSeek’s Large‑Model Network

DeepSeek engineers highlighted Tencent’s open‑source TRMT and DeepEP contributions that boost GPU‑to‑GPU communication by up to 300%, double RoCE performance and add a further 30% gain on InfiniBand, while addressing lane‑utilization and CPU‑control bottlenecks through three targeted optimizations.

DeepEP · DeepSeek · GPU communication
0 likes · 6 min read
JD Tech
May 8, 2025 · Artificial Intelligence

The Emerging Boom of Large Model Applications and Why 2025 Will Be the Turning Point

Amid the AI wave, applications built on large language models like DeepSeek R1 are poised to take off in 2025, driven by open-source, low-cost access and superior reasoning. Successful deployment requires four key factors (domain expertise, knowledge bases, robust search, and engineered agent architectures) to unlock value beyond simple chat.

2025 · AI Applications · DeepSeek
0 likes · 10 min read
DevOps
May 5, 2025 · Artificial Intelligence

DeepSeek Releases Math‑Specialized Large Model V2 and ProverBench Evaluation Suite

DeepSeek has quietly open‑sourced a new mathematics‑focused large language model, DeepSeek‑Prover‑V2 (available in 671B and 7B variants), achieving 88.9% on MiniF2F and strong results on PutnamBench, alongside the high‑quality ProverBench dataset and a novel recursive theorem‑proving pipeline.

AI · DeepSeek · Large Language Model
0 likes · 4 min read
ITPUB
May 5, 2025 · Operations

Turn Zabbix Alerts into an AI‑Powered Personal Assistant

This guide shows how to integrate Zabbix with a locally deployed DeepSeek large language model via Webhook, enabling automatic analysis of alert causes and solutions, feeding results back to operators through dashboards or enterprise WeChat, and dramatically reducing MTTR and manual effort.

AI · Alert Automation · DeepSeek
0 likes · 5 min read
Architects' Tech Alliance
May 2, 2025 · Artificial Intelligence

DeepSeek‑Prover‑V2‑671B: A Massive AI Model for Formal Mathematical Theorem Proving

DeepSeek‑Prover‑V2‑671B, a 671 billion‑parameter AI model released on Hugging Face, dramatically advances formal mathematical theorem proving with MoE architecture, FP8 quantization, 163 k token context, superior performance over GPT‑4 Turbo and other models, and broad implications for research and industry.

AI · DeepSeek · FP8 quantization
0 likes · 11 min read
Data Thinking Notes
Apr 27, 2025 · Artificial Intelligence

Step‑by‑Step MCP Demo: Build Server and Claude/DeepSeek Clients

This guide walks developers through creating a complete MCP application, covering the workflow, server setup with Python, debugging tools, and client implementation using both Claude and DeepSeek models, complete with code snippets, environment configuration, and testing procedures to demonstrate end‑to‑end LLM tool integration.

Claude · DeepSeek · LLM
0 likes · 10 min read
Baobao Algorithm Notes
Apr 27, 2025 · Artificial Intelligence

How DeepSeek R1T‑Chimera Cuts Tokens by 40% Without Fine‑Tuning

The DeepSeek‑R1T‑Chimera model merges DeepSeek‑R1 reasoning with the V3‑0324 architecture, reusing most V3 weights and swapping in only the R1 routing experts. It achieves the same intelligence as R1 while reducing output tokens by about 40% and running faster, all without any fine‑tuning or distillation.

DeepSeek · LLM · Parameter Reuse
0 likes · 5 min read
Big Data Tech Team
Apr 23, 2025 · Industry Insights

10 Powerful Ways DeepSeek Transforms Data Warehousing

DeepSeek leverages AI to automate multi‑source integration, data cleaning, warehouse modeling, real‑time processing, governance, metadata management, reporting, cloud scaling, and decision support, offering twelve distinct use cases that boost efficiency, intelligence, and scalability of modern data warehouses.

AI · Data Warehouse · DeepSeek
0 likes · 9 min read
Data Thinking Notes
Apr 22, 2025 · Artificial Intelligence

How DeepSeek AI is Transforming Agriculture, Manufacturing, Finance, Healthcare, and Education

The Zhejiang University IT Center report highlights DeepSeek's AI technology across more than 40 real‑world cases in agriculture, manufacturing, finance, healthcare and education, demonstrating data‑driven, intelligent solutions that accelerate industry transformation and modernization.

AI Applications · Data-driven transformation · DeepSeek
0 likes · 3 min read
Big Data Technology & Architecture
Apr 22, 2025 · Artificial Intelligence

Introduction to Retrieval‑Augmented Generation (RAG) and Vector Indexing with StarRocks and DeepSeek

This article explains the fundamentals of Retrieval‑Augmented Generation, demonstrates how to create and query vector indexes using StarRocks, shows how DeepSeek provides embeddings and answer generation, and walks through a complete end‑to‑end RAG pipeline with code examples and a web UI.

AI · DeepSeek · Embedding
0 likes · 20 min read
Big Data Tech Team
Apr 21, 2025 · Industry Insights

8 Practical Ways DeepSeek Boosts Data Quality for Better Governance

This guide outlines eight concrete methods DeepSeek uses to improve data quality—including automated cleaning, validation, classification, monitoring, governance standards, anomaly detection, integration, and intelligent analysis—providing actionable steps for organizations to enhance data accuracy, completeness, consistency, and usability.

Data Integration · Data Quality · DeepSeek
0 likes · 5 min read
dbaplus Community
Apr 21, 2025 · Operations

Turn Zabbix Alerts into AI‑Powered Insights with DeepSeek

This guide shows how to integrate Zabbix with a locally deployed DeepSeek large language model via Webhook, enabling automatic analysis of alerts, generation of root‑cause explanations and remediation suggestions, and delivering results through WeChat bots, dashboards, or email to reduce MTTR and manual effort.

AI Ops · Alert Automation · DeepSeek
0 likes · 4 min read
AIWalker
Apr 17, 2025 · Artificial Intelligence

Unveiling DeepSeek’s Janus Series: Decoupled Visual Encoding for Unified Multimodal Understanding and Generation

This article provides an in‑depth analysis of DeepSeek’s Janus and Janus‑Pro models, explaining how decoupling visual encoding resolves the conflict between multimodal understanding and generation, detailing training stages, data scaling, architectural choices, and presenting extensive benchmark results that demonstrate significant performance gains.

DeepSeek · Janus · Model Scaling
0 likes · 23 min read
Big Data Tech Team
Apr 14, 2025 · Industry Insights

How DeepSeek AI is Transforming Data Warehouses: From Automation to Real‑Time Insights

DeepSeek leverages large‑model AI to automate requirement analysis, intelligent modeling, performance tuning, and value extraction in data warehouses, addressing low development efficiency, high O&M cost, latency, and lack of intelligence while showcasing concrete use‑case results across finance, e‑commerce, and manufacturing.

AI · Automation · Data Warehouse
0 likes · 9 min read
Open Source Linux
Apr 14, 2025 · Artificial Intelligence

How to Deploy DeepSeek Locally: Step‑by‑Step Guide for Offline AI

This guide compares DeepSeek’s local and online versions, outlines hardware and privacy advantages of offline deployment, and provides a detailed step‑by‑step tutorial—including Ollama installation, model selection, command execution, and UI plugin setup—to help users run DeepSeek on their own machines.

AI model · DeepSeek · Ollama
0 likes · 6 min read
Architects' Tech Alliance
Apr 13, 2025 · Artificial Intelligence

Deploying DeepSeek LLMs On-Premises: Step‑by‑Step Guide and Hardware Sizing

This article provides a comprehensive technical guide for privately deploying DeepSeek large language models, covering model and runtime parameter selection, hardware sizing calculations, software stack preparation, inference service setup, performance tuning, and security monitoring considerations.

AI hardware sizing · DeepSeek · Inference Optimization
0 likes · 14 min read
AI Algorithm Path
Apr 13, 2025 · Artificial Intelligence

Understanding GRPO: Group Relative Policy Optimization for LLM Training

The article explains GRPO, a reinforcement‑learning algorithm that extends PPO with group sampling, no value network, dual penalties and KL regularisation, showing how it improves efficiency and stability when fine‑tuning large language models such as DeepSeek‑Math and DeepSeek‑R1.

DeepSeek · GRPO · PPO
0 likes · 6 min read
Fun with Large Models
Apr 12, 2025 · Artificial Intelligence

Build a No‑Code Travel‑Planning AI Assistant with VS Code, Cline, and Gaode MCP Server

This guide walks through setting up VS Code, installing the Cline plugin, configuring a Gaode Map MCP Server API key, and using the DeepSeek model to generate a personalized park‑recommendation agent and a visual HTML page, while also explaining the stdio‑based communication between Cline and the MCP Server.

AI Agent · Cline · DeepSeek
0 likes · 15 min read
Big Data Tech Team
Apr 9, 2025 · Artificial Intelligence

12 Powerful Ways DeepSeek Transforms Data Governance

This article outlines twelve practical DeepSeek AI applications for data governance, covering automated classification, dynamic privacy masking, compliance checks, quality monitoring, intelligent integration, lineage analysis, metadata management, smart retrieval, strategy formulation, security risk handling, lifecycle control, and performance evaluation.

AI · DeepSeek · use cases
0 likes · 7 min read
Volcano Engine Developer Services
Apr 8, 2025 · Artificial Intelligence

Which Cloud Platform Delivers the Fastest DeepSeek‑R1 API? A Comprehensive Benchmark

This article aggregates multiple independent evaluations of DeepSeek‑R1 across major cloud providers, comparing accuracy on AIME math problems, token‑per‑second throughput, first‑token latency, stability under high concurrency, and overall service reliability, ultimately highlighting Volcano Engine as the top performer.

AI inference · API performance · DeepSeek
0 likes · 12 min read
Alibaba Cloud Developer
Apr 7, 2025 · Artificial Intelligence

Why Does GPU Memory Keep Growing in DeepSeek‑R1 Inference? Uncovering PyTorch’s Cache

After deploying the full‑precision DeepSeek‑R1 model on a 2×8‑GPU ACS cluster, repeated stress tests showed GPU memory usage continuously rising without release; this article details the investigation, reproduces the behavior, examines vLLM logs, Prometheus metrics, and reveals PyTorch’s caching allocator as the root cause, offering mitigation tips.

DeepSeek · GPU memory · Memory Cache
0 likes · 21 min read
Code Mala Tang
Apr 5, 2025 · Artificial Intelligence

Open-Source AI Video Models Are Redefining the Industry – China Leads the Charge

While most eyes remain on familiar AI giants, China’s Alibaba and DeepSeek are unveiling open‑source video and inference models that run on consumer GPUs, sparking a regulatory scramble and threatening the dominance of closed‑source AI, heralding a rapid, disruptive shift across the industry.

AI localization · AI regulation · AI video
0 likes · 10 min read
Alibaba Cloud Developer
Apr 3, 2025 · Artificial Intelligence

Create High‑Quality SVG Illustrations with DeepSeek‑V3 and Claude AI

This guide shows how to use DeepSeek‑V3‑0324 and Claude 3.5/3.7 to generate professional SVG graphics for articles and presentations, explains the impact of model capability and prompt quality, provides ready‑to‑use prompt templates, and demonstrates basic and advanced usage scenarios such as prototype drawing, image re‑drawing, and colorful newspaper‑style visuals.

AI image generation · Claude · DeepSeek
0 likes · 15 min read
AI Algorithm Path
Apr 2, 2025 · Artificial Intelligence

Vision‑Reasoning Model: Enabling LLMs to See and Think

The article analyzes the limitations of current visual language models and large reasoning models, proposes a combined Vision‑Reasoning Model (VRM), details its architecture using LLaVA, describes end‑to‑end fine‑tuning and reinforcement‑learning reward design, and argues that such models will become the next breakthrough in AI.

DeepSeek · LLaVA · Large Language Model
0 likes · 9 min read
Java Architect Essentials
Apr 2, 2025 · Backend Development

Integrating DeepSeek Large Language Model with Spring Boot to Build an AI Chat Application

This guide demonstrates how to create a Spring Boot backend that integrates DeepSeek's large language model via the Spring AI OpenAI starter, covering project setup, dependency configuration, API key management, and a sample controller that provides AI-powered chat responses such as weather forecasts.

AI integration · Chatbot · DeepSeek
0 likes · 8 min read
AI Algorithm Path
Apr 2, 2025 · Artificial Intelligence

Master the Three Essential LLM Training Stages for 2025

The article breaks down the three core stages of large‑language‑model training—pre‑training, supervised fine‑tuning, and RLHF—explaining their purpose, methods, and concrete examples while noting DeepSeek‑R1’s recent breakthrough and its implications for AI development.

AI training · DeepSeek · LLM
0 likes · 5 min read
Architects' Tech Alliance
Apr 1, 2025 · Artificial Intelligence

What’s New in Large Language Models? DeepSeek V3, Qwen2.5‑Omni, Gemini 2.5 Pro, and GPT‑4o Unpacked

This article reviews the latest updates from major LLM providers—DeepSeek V3’s parameter boost and longer context, Qwen2.5‑Omni’s open‑source multimodal 7B model, Google Gemini 2.5 Pro’s 1 M‑token window and multimodal prowess, and OpenAI GPT‑4o’s image generation and reduced pricing—highlighting technical specs, capabilities, and availability.

DeepSeek · GPT-4o · Gemini
0 likes · 9 min read
Data Thinking Notes
Mar 30, 2025 · Artificial Intelligence

How DeepSeek‑R1 and Kimi‑K1.5 Push the Boundaries of Strong Reasoning Models

This comprehensive analysis by the Peking University AI Alignment team dissects the technical innovations behind DeepSeek‑R1, DeepSeek‑R1‑Zero, and Kimi‑K1.5, covering reinforcement‑learning‑based post‑training, rule‑based rewards, GRPO optimization, scaling laws, multimodal extensions, safety challenges, and future research directions.

AI alignment · DeepSeek · Kimi
0 likes · 57 min read
dbaplus Community
Mar 30, 2025 · Databases

Supercharge Your SQL Workflows with DeepSeek Prompt Templates

This guide presents a comprehensive collection of DeepSeek prompt templates for MySQL, covering SQL generation, optimization, data analysis, database administration, debugging, and advanced features, enabling beginners and seasoned developers alike to craft accurate queries, improve performance, and resolve errors efficiently.

DeepSeek · database optimization · mysql
0 likes · 11 min read
Fun with Large Models
Mar 30, 2025 · Artificial Intelligence

DeepSeek‑V3‑0324 Review: Why This New Chinese LLM Beats the Competition for Agent Development

The article provides a comprehensive evaluation of DeepSeek‑V3‑0324, highlighting its superior inference, coding, and long‑text abilities, benchmark rankings that place it near GPT‑4.5, extensive code‑generation tests, and advanced Function Calling features that make it the preferred model for building AI agents.

Agent development · DeepSeek · code generation
0 likes · 8 min read
Java Tech Enthusiast
Mar 29, 2025 · Frontend Development

Building a Twitter Image Card Browser Extension with Claude Sonnet and DeepSeek

In a side‑by‑side test on the Trae platform, the author used Claude Sonnet 3.5 to create a functional Twitter‑to‑image‑card browser extension in roughly twenty minutes, while DeepSeek‑R1 required multiple prompt iterations, manual bug fixes, and still produced visual glitches, demonstrating Claude’s superior reliability for frontend plugin generation.

AI code generation · Browser Extension · Claude Sonnet
0 likes · 5 min read
MaGe Linux Operations
Mar 28, 2025 · Artificial Intelligence

How to Create AI-Generated Videos with Tongyi Wanxiang and DeepSeek: A Step‑by‑Step Guide

This article explains the fundamentals of AI video technology, details the features of Alibaba Cloud's Tongyi Wanxiang platform, demonstrates how to use DeepSeek for script generation, and provides a complete workflow—including code examples—for producing high‑quality AI‑generated videos.

AI video generation · DeepSeek · Java SDK
0 likes · 24 min read
Architects' Tech Alliance
Mar 28, 2025 · Artificial Intelligence

How DeepSeek Leverages Huawei Ascend to Boost AI Inference Efficiency

The report analyzes DeepSeek's latest V3 and R1 models, highlights their scaling‑law‑driven cost reductions, explains how Huawei Ascend optimizes inference by cutting KV‑Cache storage and improving compute efficiency, and surveys the model’s deployments across finance, government, manufacturing, and healthcare sectors.

AI efficiency · AI inference · DeepSeek
0 likes · 4 min read
Qborfy AI
Mar 27, 2025 · Artificial Intelligence

How to Deploy DeepSeek‑R1 Locally with Ollama and Dify: A Step‑by‑Step Guide

This article walks through the entire process of deploying the DeepSeek‑R1 large language model on a personal machine, covering hardware requirements, Ollama installation, model download, service startup, remote access configuration, and visual UI integration with Dify, complete with concrete commands and screenshots.

AI · DeepSeek · Docker
0 likes · 9 min read
AI Algorithm Path
Mar 26, 2025 · Artificial Intelligence

DeepSeek V3-0324 Upgrade Delivers Smarter Coding and Higher Code Quality

The DeepSeek V3-0324 model, released on March 24, 2025 with 685 billion parameters and a Mixture‑of‑Experts architecture, is fully open‑source on Hugging Face and brings notable upgrades in coding ability, structured responses, stability, generation length, and speed, while offering performance comparable to leading closed‑source models such as Claude 3.7.

AI code generation · DeepSeek · Mixture of Experts
0 likes · 10 min read
Java Architecture Diary
Mar 26, 2025 · Artificial Intelligence

How DeepSeek V3-0324 Boosts Java AI Apps with Function Calling

The article introduces DeepSeek's new V3-0324 model, highlights its performance gains and new features like function calling and standardized JSON output, demonstrates Chinese and frontend coding tests, provides Java code examples for AI integration, and concludes with a summary of its business impact.

AI · Chat2BI · DeepSeek
0 likes · 6 min read
Alibaba Cloud Developer
Mar 26, 2025 · Artificial Intelligence

Why DeepSeek Is Shaking Up the LLM Landscape: Architecture, Performance, and Cost

DeepSeek, a Chinese AI startup, offers open‑source large language models—DeepSeek‑V3 for general tasks and DeepSeek‑R1 for intensive reasoning—featuring MoE, MLA, low‑cost training, and competitive performance against OpenAI’s GPT‑4o, while providing detailed usage guidance and cost analysis.

AI inference · DeepSeek · cost efficiency
0 likes · 21 min read
Aikesheng Open Source Community
Mar 25, 2025 · Databases

ChatDBA vs DeepSeek: AI‑Driven Diagnosis of OceanBase Backup Cluster Tenant Sync Issue (Case Study)

This case study demonstrates how the AI assistant ChatDBA identifies and resolves a tenant data‑synchronization failure in an OceanBase primary‑backup cluster, detailing four interactive troubleshooting rounds, the final SQL fix, and a comparative analysis with the DeepSeek‑R1 model.

AI assistant · ChatDBA · Database Troubleshooting
0 likes · 5 min read
Java Architect Essentials
Mar 25, 2025 · Artificial Intelligence

DeepSeek4j 1.4: A Java Framework for Seamless DeepSeek AI Integration with Full Chain‑of‑Thought and Streaming Support

The article introduces DeepSeek4j 1.4, a Java‑centric framework that overcomes Spring AI’s limitations by preserving DeepSeek’s chain‑of‑thought, supporting streaming output, and offering a simple Spring Boot starter with reactive, configurable, and ready‑to‑use APIs for AI developers.

AI · DeepSeek · Java
0 likes · 5 min read
DataFunTalk
Mar 24, 2025 · Artificial Intelligence

DeepSeek R1: Open‑Source Reasoning Model and Multi‑Stage Training Insights

The interview explores DeepSeek R1's open‑source weights, its multi‑stage training pipeline—including pre‑training, supervised fine‑tuning, and RLHF—alongside innovations such as self‑consistency, chain‑of‑thought prompting, distillation, MoE architectures, and cost considerations, highlighting its impact on the future of large language models.

AI training · Chain-of-Thought · DeepSeek
0 likes · 20 min read
AsiaInfo Technology: New Tech Exploration
Mar 21, 2025 · Industry Insights

How DeepSeek’s Large Model is Revolutionizing Digital Twin Simulations

This article analyzes how DeepSeek’s multimodal large model overcomes traditional digital‑twin simulation bottlenecks through dynamic modeling, generative data augmentation, and low‑cost open‑source architecture, enabling smarter city traffic, industrial design, and water‑resource management while reshaping the industry’s AI‑driven simulation landscape.

AI · DeepSeek · Digital Twin
0 likes · 22 min read
Fun with Large Models
Mar 20, 2025 · Artificial Intelligence

Fine‑Tune DeepSeek‑R1 with Just a Few Lines of Code Using Unsloth

This guide walks through setting up an Anaconda environment, installing Unsloth, downloading the DeepSeek‑R1‑Distill‑Llama‑8B model, preparing a medical CoT dataset, configuring LoRA parameters, running a short fine‑tuning job, and evaluating the customized model with structured prompts.

DeepSeek · LoRA · Python
0 likes · 18 min read
Practical DevOps Architecture
Mar 20, 2025 · Artificial Intelligence

DeepSeek Model Integration Tutorial Series

This collection provides a step‑by‑step tutorial series of sixteen short videos demonstrating how to access, configure, and use the DeepSeek large language model across various office applications such as Word, Excel, PowerPoint, as well as platforms like WPS and online APIs.

AI · API · DeepSeek
0 likes · 5 min read
dbaplus Community
Mar 17, 2025 · Operations

Designing an AI‑Powered Ops Platform with DeepSeek: Architecture, Modules, and Implementation

This article outlines a comprehensive AI‑Ops solution built on DeepSeek, covering its technical architecture, data collection stack, AI engine deployment, key functional modules, implementation roadmap, model training, security design, cost estimates, and risk mitigation strategies for modern operations teams.

AI Ops · DeepSeek · Infrastructure Automation
0 likes · 7 min read
Infra Learning Club
Mar 17, 2025 · Artificial Intelligence

Testing OpenManus with DeepSeek: A Hands‑On Evaluation

The author walks through installing OpenManus, configuring it to use DeepSeek (and an Ollama‑based vision model), runs a sample financial data query, and reports that the system is slow, sometimes inaccurate, and still requires further optimization.

AI agents · Conda · DeepSeek
0 likes · 5 min read
Selected Java Interview Questions
Mar 15, 2025 · Artificial Intelligence

DeepSeek4j 1.4: Java Spring Boot Integration for DeepSeek with Full Chain‑of‑Thought and Streaming Support

DeepSeek4j 1.4 introduces a Java‑centric, Spring Boot‑compatible framework that fully preserves DeepSeek's chain‑of‑thought capabilities, adds reactive streaming, and provides simple one‑line API integration, addressing previous limitations in mainstream frameworks and offering ready‑to‑use configuration and code examples.

AI integration · Chain-of-Thought · DeepSeek
0 likes · 5 min read
Rare Earth Juejin Tech Community
Mar 15, 2025 · Artificial Intelligence

Integrating DeepSeek Large Model with SpringAI in Java Applications

This article provides a concise guide on using Spring AI to connect Java applications with the Chinese large language model DeepSeek, covering design philosophy, configuration, code examples for chat, streaming, structured output, security hardening, performance tuning, and production best practices.

AI integration · Backend Development · ChatClient
0 likes · 9 min read
Open Source Tech Hub
Mar 13, 2025 · Artificial Intelligence

Build a Private AI Knowledge Base with Webman AI, Redis‑Stack, and Ollama

This guide walks you through setting up a private AI knowledge base using Webman AI 5.4.0, deploying Redis‑Stack, installing the illuminate/redis component, adding Ollama with DeepSeek and other embedding models, configuring Redis, importing training data, running the training process, and configuring role prompts for accurate AI responses.

AI · DeepSeek · Ollama
0 likes · 6 min read
Alibaba Cloud Developer
Mar 13, 2025 · Artificial Intelligence

How Alibaba’s Tongyi Lingma AI Programmer Supercharges Java Development with QwQ‑32B

This article reviews Alibaba Cloud's Tongyi Lingma AI programmer, highlighting its new model selection feature—including DeepSeek V3, R1, Qwen2.5‑Max and the open‑source QwQ‑32B—its impressive benchmark performance, step‑by‑step code generation for a CMS notice module, cross‑language integration with DeepSeek‑R1, and practical developer experiences comparing version 1.0 and 2.0.

AI code generation · Cross-language programming · DeepSeek
0 likes · 23 min read
Java Tech Enthusiast
Mar 12, 2025 · Artificial Intelligence

Open-Source Enterprise Knowledge Base and Intelligent Dialogue Platform Based on Spring Boot and DeepSeek

The open‑source DeepSeek‑Flow‑AI platform combines Spring Boot 3.4 back‑end APIs with a Vue 3 front‑end to deliver an enterprise‑grade knowledge base and intelligent multi‑turn dialogue system, supporting private deployment, role‑based access, analytics, CRM/ERP integration, and easy installation via Maven and Yarn.

AI · DeepSeek · Knowledge Base
0 likes · 5 min read
Architects' Tech Alliance
Mar 12, 2025 · Artificial Intelligence

How DeepSeek Can Transform Family Education: A Practical Guide

This guide from Tsinghua University’s New Media Research Center systematically explores DeepSeek’s entry methods, interaction strategies, subject‑specific tutoring, emotional support, ethical risks, age‑specific parenting solutions, tool integration, and future educational outlook, offering parents actionable AI‑powered techniques for digital home learning.

AI in Education · DeepSeek · Educational Technology
0 likes · 5 min read
Architects' Tech Alliance
Mar 11, 2025 · Artificial Intelligence

How DeepSeek’s Breakthrough AI Models Thrive on Huawei Ascend: A Deep Dive

An in‑depth analysis reveals how DeepSeek’s V3 and R1 large‑language models achieve superior inference performance and cost efficiency on Huawei’s Ascend AI platform, detailing architectural optimizations, KV‑Cache reductions, multimodal support, real‑world deployments across finance, government, manufacturing, and the projected impact on the AI industry.

DeepSeek · Huawei Ascend · ai-optimization
0 likes · 4 min read
NewBeeNLP
Mar 11, 2025 · Artificial Intelligence

How DeepSeek’s New Architecture Redefines LLM Efficiency and Performance

This article analyzes DeepSeek’s recent breakthroughs—including the Multi‑Head Latent Attention (MLA), Group Relative Policy Optimization (GRPO), and a refined Mixture‑of‑Experts design—along with its three‑stage training pipeline, RL‑only R1‑Zero variant, and benchmark comparisons against GPT‑4o‑Mini and Llama 3.1, highlighting both gains and remaining challenges.

DeepSeek · LLM · Mixture of Experts
0 likes · 18 min read
Architect
Mar 10, 2025 · Artificial Intelligence

What Makes DeepSeek’s New Architecture a Game‑Changer? Inside MLA, GRPO, and MoE Innovations

This article analyzes DeepSeek’s latest large‑model breakthroughs, covering the MLA attention compression, GRPO alignment algorithm, MoE load‑balancing redesign, multi‑stage training pipelines, reinforcement‑learning tricks, and performance comparisons with GPT‑4o‑Mini and Llama 3.1, highlighting both strengths and remaining challenges.

AI training · DeepSeek · GRPO
0 likes · 19 min read
Baidu Geek Talk
Mar 10, 2025 · Artificial Intelligence

How Baidu Cloud’s AI+ Strategy Powers State‑Owned Enterprises with DeepSeek One‑Box Solutions

The article examines Baidu Cloud’s integration of DeepSeek large‑model hardware, detailing the Baige and Qianfan one‑box systems, their technical specs, deployment speed, and how they enable state‑owned enterprises across energy, manufacturing, and logistics to accelerate AI‑driven digital transformation.

AI · Baidu Cloud · Cloud Computing
0 likes · 6 min read