Tagged articles
566 articles
Architects' Tech Alliance
Apr 13, 2025 · Artificial Intelligence

Deploying DeepSeek LLMs On-Premises: Step‑by‑Step Guide and Hardware Sizing

This article provides a comprehensive technical guide for privately deploying DeepSeek large language models, covering model and runtime parameter selection, hardware sizing calculations, software stack preparation, inference service setup, performance tuning, and security monitoring considerations.

AI hardware sizing · DeepSeek · Inference Optimization
0 likes · 14 min read
AI Algorithm Path
Apr 13, 2025 · Artificial Intelligence

Understanding GRPO: Group Relative Policy Optimization for LLM Training

The article explains GRPO, a reinforcement‑learning algorithm that extends PPO with group sampling, no value network, dual penalties and KL regularisation, showing how it improves efficiency and stability when fine‑tuning large language models such as DeepSeek‑Math and DeepSeek‑R1.

DeepSeek · GRPO · PPO
0 likes · 6 min read
Fun with Large Models
Apr 12, 2025 · Artificial Intelligence

Build a No‑Code Travel‑Planning AI Assistant with VS Code, Cline, and Gaode MCP Server

This guide walks through setting up VS Code, installing the Cline plugin, configuring a Gaode Map MCP Server API key, and using the DeepSeek model to generate a personalized park‑recommendation agent and a visual HTML page, while also explaining the stdio‑based communication between Cline and the MCP Server.

AI Agent · Cline · DeepSeek
0 likes · 15 min read
Big Data Tech Team
Apr 9, 2025 · Artificial Intelligence

12 Powerful Ways DeepSeek Transforms Data Governance

This article outlines twelve practical DeepSeek AI applications for data governance, covering automated classification, dynamic privacy masking, compliance checks, quality monitoring, intelligent integration, lineage analysis, metadata management, smart retrieval, strategy formulation, security risk handling, lifecycle control, and performance evaluation.

AI · DeepSeek · Use Cases
0 likes · 7 min read
Volcano Engine Developer Services
Apr 8, 2025 · Artificial Intelligence

Which Cloud Platform Delivers the Fastest DeepSeek‑R1 API? A Comprehensive Benchmark

This article aggregates multiple independent evaluations of DeepSeek‑R1 across major cloud providers, comparing accuracy on AIME math problems, tokens‑per‑second throughput, first‑token latency, stability under high concurrency, and overall service reliability, ultimately highlighting Volcano Engine as the top performer.

AI inference · API performance · Benchmark
0 likes · 12 min read
Alibaba Cloud Developer
Apr 7, 2025 · Artificial Intelligence

Why Does GPU Memory Keep Growing in DeepSeek‑R1 Inference? Uncovering PyTorch’s Cache

After deploying the full‑precision DeepSeek‑R1 model on a 2×8‑GPU ACS cluster, repeated stress tests showed GPU memory usage continuously rising without release; this article details the investigation, reproduces the behavior, examines vLLM logs, Prometheus metrics, and reveals PyTorch’s caching allocator as the root cause, offering mitigation tips.

DeepSeek · GPU Memory · Memory Cache
0 likes · 21 min read
Code Mala Tang
Apr 5, 2025 · Artificial Intelligence

Open-Source AI Video Models Are Redefining the Industry – China Leads the Charge

While most eyes remain on familiar AI giants, China’s Alibaba and DeepSeek are unveiling open‑source video and inference models that run on consumer GPUs, sparking a regulatory scramble and threatening the dominance of closed‑source AI, heralding a rapid, disruptive shift across the industry.

AI localization · AI regulation · AI video
0 likes · 10 min read
Alibaba Cloud Developer
Apr 3, 2025 · Artificial Intelligence

Create High‑Quality SVG Illustrations with DeepSeek‑V3 and Claude AI

This guide shows how to use DeepSeek‑V3‑0324 and Claude 3.5/3.7 to generate professional SVG graphics for articles and presentations, explains the impact of model capability and prompt quality, provides ready‑to‑use prompt templates, and demonstrates basic and advanced usage scenarios such as prototype drawing, image re‑drawing, and colorful newspaper‑style visuals.

AI image generation · Claude · DeepSeek
0 likes · 15 min read
AI Algorithm Path
Apr 2, 2025 · Artificial Intelligence

Vision‑Reasoning Model: Enabling LLMs to See and Think

The article analyzes the limitations of current visual language models and large reasoning models, proposes a combined Vision‑Reasoning Model (VRM), details its architecture using LLaVA, describes end‑to‑end fine‑tuning and reinforcement‑learning reward design, and argues that such models will become the next breakthrough in AI.

DeepSeek · LLaVA · Vision-Language Model
0 likes · 9 min read
Java Architect Essentials
Apr 2, 2025 · Backend Development

Integrating DeepSeek Large Language Model with Spring Boot to Build an AI Chat Application

This guide demonstrates how to create a Spring Boot backend that integrates DeepSeek's large language model via the Spring AI OpenAI starter, covering project setup, dependency configuration, API key management, and a sample controller that provides AI-powered chat responses such as weather forecasts.

AI integration · Chatbot · DeepSeek
0 likes · 8 min read
AI Algorithm Path
Apr 2, 2025 · Artificial Intelligence

Master the Three Essential LLM Training Stages for 2025

The article breaks down the three core stages of large‑language‑model training—pre‑training, supervised fine‑tuning, and RLHF—explaining their purpose, methods, and concrete examples while noting DeepSeek‑R1’s recent breakthrough and its implications for AI development.

AI training · DeepSeek · LLM
0 likes · 5 min read
Architects' Tech Alliance
Apr 1, 2025 · Artificial Intelligence

What’s New in Large Language Models? DeepSeek V3, Qwen2.5‑Omni, Gemini 2.5 Pro, and GPT‑4o Unpacked

This article reviews the latest updates from major LLM providers—DeepSeek V3’s parameter boost and longer context, Qwen2.5‑Omni’s open‑source multimodal 7B model, Google Gemini 2.5 Pro’s 1 M‑token window and multimodal prowess, and OpenAI GPT‑4o’s image generation and reduced pricing—highlighting technical specs, capabilities, and availability.

DeepSeek · GPT-4o · Gemini
0 likes · 9 min read
Data Thinking Notes
Mar 30, 2025 · Artificial Intelligence

How DeepSeek‑R1 and Kimi‑K1.5 Push the Boundaries of Strong Reasoning Models

This comprehensive analysis by the Peking University AI Alignment team dissects the technical innovations behind DeepSeek‑R1, DeepSeek‑R1‑Zero, and Kimi‑K1.5, covering reinforcement‑learning‑based post‑training, rule‑based rewards, GRPO optimization, scaling laws, multimodal extensions, safety challenges, and future research directions.

AI Alignment · DeepSeek · Kimi
0 likes · 57 min read
dbaplus Community
Mar 30, 2025 · Databases

Supercharge Your SQL Workflows with DeepSeek Prompt Templates

This guide presents a comprehensive collection of DeepSeek prompt templates for MySQL, covering SQL generation, optimization, data analysis, database administration, debugging, and advanced features, enabling beginners and seasoned developers alike to craft accurate queries, improve performance, and resolve errors efficiently.

Database Optimization · DeepSeek · mysql
0 likes · 11 min read
Fun with Large Models
Mar 30, 2025 · Artificial Intelligence

DeepSeek‑V3‑0324 Review: Why This New Chinese LLM Beats the Competition for Agent Development

The article provides a comprehensive evaluation of DeepSeek‑V3‑0324, highlighting its superior inference, coding, and long‑text abilities, benchmark rankings that place it near GPT‑4.5, extensive code‑generation tests, and advanced Function Calling features that make it the preferred model for building AI agents.

Code Generation · DeepSeek · agent development
0 likes · 8 min read
Java Tech Enthusiast
Mar 29, 2025 · Frontend Development

Building a Twitter Image Card Browser Extension with Claude Sonnet and DeepSeek

In a side‑by‑side test on the Trae platform, the author used Claude Sonnet 3.5 to create a functional Twitter‑to‑image‑card browser extension in roughly twenty minutes, while DeepSeek‑R1 required multiple prompt iterations, manual bug fixes, and still produced visual glitches, demonstrating Claude’s superior reliability for frontend plugin generation.

AI code generation · Claude Sonnet · DeepSeek
0 likes · 5 min read
MaGe Linux Operations
Mar 28, 2025 · Artificial Intelligence

How to Create AI-Generated Videos with Tongyi Wanxiang and DeepSeek: A Step‑by‑Step Guide

This article explains the fundamentals of AI video technology, details the features of Alibaba Cloud's Tongyi Wanxiang platform, demonstrates how to use DeepSeek for script generation, and provides a complete workflow—including code examples—for producing high‑quality AI‑generated videos.

AI video generation · DeepSeek · Java SDK
0 likes · 24 min read
Architects' Tech Alliance
Mar 28, 2025 · Artificial Intelligence

How DeepSeek Leverages Huawei Ascend to Boost AI Inference Efficiency

The report analyzes DeepSeek's latest V3 and R1 models, highlights their scaling‑law‑driven cost reductions, explains how Huawei Ascend optimizes inference by cutting KV‑Cache storage and improving compute efficiency, and surveys the model’s deployments across finance, government, manufacturing, and healthcare sectors.

AI efficiency · AI inference · DeepSeek
0 likes · 4 min read
Qborfy AI
Mar 27, 2025 · Artificial Intelligence

How to Deploy DeepSeek‑R1 Locally with Ollama and Dify: A Step‑by‑Step Guide

This article walks through the entire process of deploying the DeepSeek‑R1 large language model on a personal machine, covering hardware requirements, Ollama installation, model download, service startup, remote access configuration, and visual UI integration with Dify, complete with concrete commands and screenshots.

AI · DeepSeek · Docker
0 likes · 9 min read
AI Algorithm Path
Mar 26, 2025 · Artificial Intelligence

DeepSeek V3-0324 Upgrade Delivers Smarter Coding and Higher Code Quality

The DeepSeek V3-0324 model, released on March 24, 2025 with 685 billion parameters and a Mixture‑of‑Experts architecture, is fully open‑source on Hugging Face and brings notable upgrades in coding ability, structured responses, stability, generation length, and speed, while offering performance comparable to leading closed‑source models such as Claude 3.7.

AI code generation · Coding AI · DeepSeek
0 likes · 10 min read
Java Architecture Diary
Mar 26, 2025 · Artificial Intelligence

How DeepSeek V3-0324 Boosts Java AI Apps with Function Calling

The article introduces DeepSeek's new V3-0324 model, highlights its performance gains and new features like function calling and standardized JSON output, demonstrates Chinese and frontend coding tests, provides Java code examples for AI integration, and concludes with a summary of its business impact.

AI · Chat2BI · DeepSeek
0 likes · 6 min read
Alibaba Cloud Developer
Mar 26, 2025 · Artificial Intelligence

Why DeepSeek Is Shaking Up the LLM Landscape: Architecture, Performance, and Cost

DeepSeek, a Chinese AI startup, offers open‑source large language models—DeepSeek‑V3 for general tasks and DeepSeek‑R1 for intensive reasoning—featuring MoE, MLA, low‑cost training, and competitive performance against OpenAI’s GPT‑4o, while providing detailed usage guidance and cost analysis.

AI inference · DeepSeek · Model architecture
0 likes · 21 min read
Java Architect Essentials
Mar 25, 2025 · Artificial Intelligence

DeepSeek4j 1.4: A Java Framework for Seamless DeepSeek AI Integration with Full Chain‑of‑Thought and Streaming Support

The article introduces DeepSeek4j 1.4, a Java‑centric framework that overcomes Spring AI’s limitations by preserving DeepSeek’s chain‑of‑thought, supporting streaming output, and offering a simple Spring Boot starter with reactive, configurable, and ready‑to‑use APIs for AI developers.

AI · DeepSeek · Java
0 likes · 5 min read
DataFunTalk
Mar 24, 2025 · Artificial Intelligence

DeepSeek R1: Open‑Source Reasoning Model and Multi‑Stage Training Insights

The interview explores DeepSeek R1's open‑source weights, its multi‑stage training pipeline—including pre‑training, supervised fine‑tuning, and RLHF—alongside innovations such as self‑consistency, chain‑of‑thought prompting, distillation, MoE architectures, and cost considerations, highlighting its impact on the future of large language models.

AI training · DeepSeek · RLHF
0 likes · 20 min read
AsiaInfo Technology: New Tech Exploration
Mar 21, 2025 · Industry Insights

How DeepSeek’s Large Model is Revolutionizing Digital Twin Simulations

This article analyzes how DeepSeek’s multimodal large model overcomes traditional digital‑twin simulation bottlenecks through dynamic modeling, generative data augmentation, and low‑cost open‑source architecture, enabling smarter city traffic, industrial design, and water‑resource management while reshaping the industry’s AI‑driven simulation landscape.

AI · DeepSeek · Digital Twin
0 likes · 22 min read
Fun with Large Models
Mar 20, 2025 · Artificial Intelligence

Fine‑Tune DeepSeek‑R1 with Just a Few Lines of Code Using Unsloth

This guide walks through setting up an Anaconda environment, installing Unsloth, downloading the DeepSeek‑R1‑Distill‑Llama‑8B model, preparing a medical CoT dataset, configuring LoRA parameters, running a short fine‑tuning job, and evaluating the customized model with structured prompts.

DeepSeek · Fine-tuning · LoRA
0 likes · 18 min read
Practical DevOps Architecture
Mar 20, 2025 · Artificial Intelligence

DeepSeek Model Integration Tutorial Series

This collection is a step‑by‑step tutorial series of sixteen short videos demonstrating how to access, configure, and use the DeepSeek large language model in office applications such as Word, Excel, and PowerPoint, as well as in WPS and through online APIs.

AI · API · DeepSeek
0 likes · 5 min read
dbaplus Community
Mar 17, 2025 · Operations

Designing an AI‑Powered Ops Platform with DeepSeek: Architecture, Modules, and Implementation

This article outlines a comprehensive AI‑Ops solution built on DeepSeek, covering its technical architecture, data collection stack, AI engine deployment, key functional modules, implementation roadmap, model training, security design, cost estimates, and risk mitigation strategies for modern operations teams.

AI Ops · DeepSeek · Infrastructure Automation
0 likes · 7 min read
Infra Learning Club
Mar 17, 2025 · Artificial Intelligence

Testing OpenManus with DeepSeek: A Hands‑On Evaluation

The author walks through installing OpenManus, configuring it to use DeepSeek (and an Ollama‑based vision model), runs a sample financial data query, and reports that the system is slow, sometimes inaccurate, and still requires further optimization.

AI agents · Conda · DeepSeek
0 likes · 5 min read
Selected Java Interview Questions
Mar 15, 2025 · Artificial Intelligence

DeepSeek4j 1.4: Java Spring Boot Integration for DeepSeek with Full Chain‑of‑Thought and Streaming Support

DeepSeek4j 1.4 introduces a Java‑centric, Spring Boot‑compatible framework that fully preserves DeepSeek's chain‑of‑thought capabilities, adds reactive streaming, and provides simple one‑line API integration, addressing previous limitations in mainstream frameworks and offering ready‑to‑use configuration and code examples.

AI integration · DeepSeek · Spring Boot
0 likes · 5 min read
Rare Earth Juejin Tech Community
Mar 15, 2025 · Artificial Intelligence

Integrating DeepSeek Large Model with SpringAI in Java Applications

This article provides a concise guide on using SpringAI to connect Java applications with DeepSeek, a Chinese large language model, covering design philosophy, configuration, code examples for chat, streaming, structured output, security hardening, performance tuning, and production best practices.

AI integration · Backend Development · ChatClient
0 likes · 9 min read
Open Source Tech Hub
Mar 13, 2025 · Artificial Intelligence

Build a Private AI Knowledge Base with Webman AI, Redis‑Stack, and Ollama

This guide walks you through setting up a private AI knowledge base using Webman AI 5.4.0, deploying Redis‑Stack, installing the illuminate/redis component, adding Ollama with DeepSeek and other embedding models, configuring Redis, importing training data, running the training process, and configuring role prompts for accurate AI responses.

AI · DeepSeek · Ollama
0 likes · 6 min read
Alibaba Cloud Developer
Mar 13, 2025 · Artificial Intelligence

How Alibaba’s Tongyi Lingma AI Programmer Supercharges Java Development with QwQ‑32B

This article reviews Alibaba Cloud's Tongyi Lingma AI programmer, highlighting its new model selection feature—including DeepSeek V3, R1, Qwen2.5‑Max and the open‑source QwQ‑32B—its impressive benchmark performance, step‑by‑step code generation for a CMS notice module, cross‑language integration with DeepSeek‑R1, and practical developer experiences comparing version 1.0 and 2.0.

AI code generation · Cross-language programming · DeepSeek
0 likes · 23 min read
Java Tech Enthusiast
Mar 12, 2025 · Artificial Intelligence

Open-Source Enterprise Knowledge Base and Intelligent Dialogue Platform Based on Spring Boot and DeepSeek

The open‑source DeepSeek‑Flow‑AI platform combines Spring Boot 3.4 back‑end APIs with a Vue 3 front‑end to deliver an enterprise‑grade knowledge base and intelligent multi‑turn dialogue system, supporting private deployment, role‑based access, analytics, CRM/ERP integration, and easy installation via Maven and Yarn.

AI · DeepSeek · Knowledge Base
0 likes · 5 min read
Architects' Tech Alliance
Mar 12, 2025 · Artificial Intelligence

How DeepSeek Can Transform Family Education: A Practical Guide

This guide from Tsinghua University’s New Media Research Center systematically explores DeepSeek’s entry methods, interaction strategies, subject‑specific tutoring, emotional support, ethical risks, age‑specific parenting solutions, tool integration, and future educational outlook, offering parents actionable AI‑powered techniques for digital home learning.

AI in Education · DeepSeek · Educational Technology
0 likes · 5 min read
Architects' Tech Alliance
Mar 11, 2025 · Artificial Intelligence

How DeepSeek’s Breakthrough AI Models Thrive on Huawei Ascend: A Deep Dive

An in‑depth analysis reveals how DeepSeek’s V3 and R1 large‑language models achieve superior inference performance and cost efficiency on Huawei’s Ascend AI platform, detailing architectural optimizations, KV‑Cache reductions, multimodal support, real‑world deployments across finance, government, manufacturing, and the projected impact on the AI industry.

AI Optimization · DeepSeek · Huawei Ascend
0 likes · 4 min read
NewBeeNLP
Mar 11, 2025 · Artificial Intelligence

How DeepSeek’s New Architecture Redefines LLM Efficiency and Performance

This article analyzes DeepSeek’s recent breakthroughs—including the Multi‑Head Latent Attention (MLA), Group Relative Policy Optimization (GRPO), and a refined Mixture‑of‑Experts design—along with its three‑stage training pipeline, RL‑only R1‑Zero variant, and benchmark comparisons against GPT‑4o‑Mini and Llama 3.1, highlighting both gains and remaining challenges.

DeepSeek · LLM · Mixture of Experts
0 likes · 18 min read
Architect
Mar 10, 2025 · Artificial Intelligence

What Makes DeepSeek’s New Architecture a Game‑Changer? Inside MLA, GRPO, and MoE Innovations

This article analyzes DeepSeek’s latest large‑model breakthroughs, covering the MLA attention compression, GRPO alignment algorithm, MoE load‑balancing redesign, multi‑stage training pipelines, reinforcement‑learning tricks, and performance comparisons with GPT‑4o‑Mini and Llama 3.1, highlighting both strengths and remaining challenges.

AI training · DeepSeek · GRPO
0 likes · 19 min read
Baidu Geek Talk
Mar 10, 2025 · Artificial Intelligence

How Baidu Cloud’s AI+ Strategy Powers State‑Owned Enterprises with DeepSeek One‑Box Solutions

The article examines Baidu Cloud’s integration of DeepSeek large‑model hardware, detailing the Baige and Qianfan one‑box systems, their technical specs, deployment speed, and how they enable state‑owned enterprises across energy, manufacturing, and logistics to accelerate AI‑driven digital transformation.

AI · Baidu Cloud · DeepSeek
0 likes · 6 min read
AI Frontier Lectures
Mar 10, 2025 · Industry Insights

Why DeepSeek’s Rise Is Shaking China’s AGI Landscape

The article analyzes how DeepSeek’s unexpected success has triggered a strategic rethink across Chinese AI firms, prompting shifts from product‑centric growth to foundational model research, reshaping talent structures at Tencent and ByteDance, and questioning where the true barriers to AGI lie.

AGI · China AI · DeepSeek
0 likes · 13 min read
Alibaba Cloud Developer
Mar 10, 2025 · Artificial Intelligence

Seamlessly Switch Between DeepSeek‑R1 and QwQ‑32B with Higress AI Gateway

Learn how to deploy the new QwQ‑32B inference model alongside DeepSeek‑R1 using the Higress AI gateway, covering environment setup, model configuration, routing, token‑level rate limiting, content safety, semantic caching, and advanced features like automatic fallback and internet‑search integration.

DeepSeek · Higress · LLM integration
0 likes · 16 min read
Baobao Algorithm Notes
Mar 10, 2025 · Artificial Intelligence

Why DeepSeek V3’s FP8 Training Beats Traditional Schemes: A Deep Dive

This article provides a detailed technical analysis of FP8 training, comparing Nvidia’s TransformerEngine approach with DeepSeek V3’s novel scheme, and examines how block‑wise scaling, high‑precision accumulation, and vector length and correlation affect quantization error and signal‑to‑noise ratio in large‑language‑model training.

DeepSeek · FP8 · LLM
0 likes · 20 min read
CSS Magic
Mar 10, 2025 · Artificial Intelligence

Three Advanced Ways to Harness DeepSeek for Everyone

The article outlines three practical approaches to get the most out of DeepSeek—using it as a conversational assistant, integrating its API to power AI tools such as the Chrome immersive‑translation plugin, and leveraging it for AI‑assisted programming—while comparing the V3 and R1 models and offering concrete configuration steps.

AI programming · AI translation · Chrome Extension
0 likes · 8 min read
Java Architect Essentials
Mar 9, 2025 · Backend Development

Building an AI-Powered Chatbot with Spring Boot and DeepSeek

This tutorial demonstrates how to create an AI-driven Spring Boot application by integrating DeepSeek's large language model, covering project setup, dependency configuration, API key management, and implementing a REST controller that provides weather forecasts via a conversational interface.

AI · Chatbot · DeepSeek
0 likes · 8 min read
Data Thinking Notes
Mar 9, 2025 · Artificial Intelligence

How DeepSeek R1 Uses Large‑Scale Reinforcement Learning to Rival OpenAI o1

DeepSeek R1, an open‑source large language model, leverages rule‑based, large‑scale reinforcement learning and mixed supervised‑fine‑tuning data to achieve deep reasoning comparable to OpenAI o1, illustrating China’s rapid AI progress, the importance of efficiency, and the democratizing impact of open AI research.

DeepSeek · model efficiency · open-source AI
0 likes · 11 min read
Architects' Tech Alliance
Mar 9, 2025 · Industry Insights

DeepSeek’s AI Ecosystem: From Core Tech to Market Impact

This article provides a comprehensive analysis of DeepSeek, covering its foundational AI research, technology stack, product offerings, and the broader upstream, midstream, and downstream AI industry landscape, including hardware, server, cloud, and market trends.

AI Infrastructure · DeepSeek · Market Trends
0 likes · 13 min read
DataFunTalk
Mar 8, 2025 · Artificial Intelligence

DeepSeek Reflection Wave and the Shifting Landscape of AGI Development in China

The article analyzes how DeepSeek's rapid rise has triggered a strategic rethink across Chinese AI startups and tech giants, prompting a shift from product‑centric growth to deep‑model research, while examining the real barriers to AGI and the importance of time‑advantage in the large‑model race.

AGI · AI · Chinese tech
0 likes · 12 min read
Fun with Large Models
Mar 8, 2025 · Artificial Intelligence

Make AI Obey: A Detailed Prompt Engineering Guide to Boost Large‑Model Logic

This tutorial explains how to enhance large language models' logical reasoning by using DeepSeek‑R1's deep‑thinking mode, few‑shot prompting, chain‑of‑thought, and zero‑shot chain‑of‑thought techniques, providing concrete examples, comparisons, and a step‑by‑step template for effective prompt design.

AI reasoning · DeepSeek · chain-of-thought
0 likes · 10 min read
Java Architect Essentials
Mar 7, 2025 · Artificial Intelligence

Introducing DeepSeek4j 1.4: A Java Spring Boot Integration for DeepSeek AI with Chain‑of‑Thought and Streaming Support

The article introduces DeepSeek4j 1.4, a Java Spring Boot library that overcomes existing framework limitations by preserving DeepSeek's chain‑of‑thought capabilities, adding full reactive streaming, and providing a simple one‑line API along with quick‑start instructions and code examples.

AI integration · DeepSeek · Java
0 likes · 5 min read
Architects' Tech Alliance
Mar 7, 2025 · Industry Insights

How DeepSeek’s V3 and R1 Are Redefining the Global AI Landscape

The 2025 DeepSeek analysis report examines the V3 and R1 models' novel Transformer‑based technologies, their performance gains, and how they are reshaping global AI competition, boosting Chinese AI valuations, and ushering in an open‑source AI breakthrough that could spark the next killer applications.

AI models · DeepSeek · model technology
0 likes · 5 min read
DevOps
Mar 6, 2025 · Artificial Intelligence

Building Multi-Model Chat Agents with Dify: Integrating DeepSeek‑R1 and Gemini

This article explains how to create a high‑performance multi‑model chat agent on the Dify platform by combining DeepSeek‑R1 for reasoning and Gemini for answer generation, covering the underlying principles, configuration steps, API integration, performance benchmarks, and practical deployment guidance.

Chatbot · DeepSeek · Dify
0 likes · 12 min read
Data Thinking Notes
Mar 6, 2025 · Artificial Intelligence

How China’s State‑Owned Giants Are Accelerating AI with DeepSeek

Amid a global digital surge, 45% of China’s central state‑owned enterprises have deployed the DeepSeek large‑model platform, rapidly integrating AI across energy, power, telecom, construction and other sectors to boost intelligent transformation and operational efficiency.

AI adoption · China · DeepSeek
0 likes · 7 min read
Model Perspective
Mar 6, 2025 · Artificial Intelligence

Can AI Boost High School Math Problem Solving? A DeepSeek Case Study

This article explores how the AI model DeepSeek can assist high‑school students in tackling challenging sequence problems from the 2024 Chinese college entrance exam, detailing its reasoning process, strengths, pitfalls, and practical tips for using AI to train mathematical thinking rather than just obtain answers.

AI · DeepSeek · high school
0 likes · 9 min read
Can AI Boost High School Math Problem Solving? A DeepSeek Case Study
Fun with Large Models
Fun with Large Models
Mar 6, 2025 · Artificial Intelligence

Master Prompt Engineering: Make AI Follow Your Commands with Simple, Effective Prompts

Prompt engineering transforms vague queries into precise, reliable AI responses by structuring prompts with clear instructions, context, input, and output specifications, and by using role‑playing and formatting tricks, enabling models from DeepSeek and OpenAI to deliver accurate, consistent results across tasks.

AI Prompt Design · DeepSeek · OpenAI
0 likes · 15 min read
Architects' Tech Alliance
Mar 5, 2025 · Industry Insights

How DeepSeek’s Open‑Source Tools Are Supercharging AI Model Performance

DeepSeek’s Open‑Source Week unveiled five high‑performance projects (FlashMLA, DeepEP, DeepGEMM, DualPipe/EPLB, and 3FS) that respectively contribute GPU kernel optimizations, communication kernels, a matrix‑multiplication library, parallelism strategies, and a distributed file system, which together accelerate large‑scale AI training and inference workloads.

AI acceleration · DeepSeek · Distributed Training
0 likes · 9 min read
Java Architect Essentials
Mar 5, 2025 · Artificial Intelligence

Step-by-Step Guide to Integrate DeepSeek AI with a WeChat Public Account Using a Cloud Server

This tutorial walks beginners through obtaining a DeepSeek API key, setting up an Alibaba Cloud ECS instance, configuring the WeChat public‑account interface, cloning and configuring the open‑source COW project, and finally deploying a Python service that connects the WeChat bot to the DeepSeek large‑language model.

DeepSeek · Python Tutorial · WeChat
0 likes · 13 min read
Architects' Tech Alliance
Mar 5, 2025 · Industry Insights

DeepSeek R1 & Kimi 1.5: Inside the Development of Near‑Strong Reasoning Models

The article analyzes DeepSeek's recent releases, the V3 dialogue model and the R1 reasoning model, detailing their launch dates, their rapid rise in popularity, and R1's reinforcement‑learning‑based design for code and math tasks, and it links to related Peking University technical reports, with promotional sales content omitted.

AI · DeepSeek · Industry analysis
0 likes · 3 min read
Model Perspective
Mar 5, 2025 · Artificial Intelligence

Can AI Really Crack NP‑Hard Problems? Inside the DeepSeek‑R1 Breakthrough

Researchers from Nanjing University of Aeronautics, Nanjing University of Technology and Oxford show that high‑instruction prompts dramatically boost large language models' mathematical reasoning, enabling DeepSeek‑R1 and Qwen2.5 to solve complex polynomial tasks and even produce a new counterexample to Hilbert's 17th problem.

AI · DeepSeek · Mathematical Reasoning
0 likes · 6 min read
Tencent Cloud Developer
Mar 5, 2025 · Artificial Intelligence

DeepSeek Series Overview: Core Technologies, Model Innovations, and Product Highlights

The article delivers a PPT‑style deep dive into the DeepSeek series—from the original LLM through DeepSeek‑MoE, Math, V2, V3 and R1—highlighting core innovations such as Multi‑Head Latent Attention, fine‑grained MoE, GRPO reinforcement learning, Multi‑Token Prediction, DualPipe parallelism and FP8 training that together achieve high performance at a fraction of traditional costs, and notes their integration into Tencent’s OlaChat intelligent assistant.

AI · DeepSeek · FP8 training
0 likes · 21 min read
Open Source Linux
Mar 5, 2025 · Artificial Intelligence

How DeepSeek‑R1 Redefines Prompt Engineering and Real‑World AI Deployment

The article analyzes DeepSeek‑R1’s low‑cost inference architecture, Chinese language optimizations, novel prompt‑engineering techniques, and the practical challenges of deploying large domestic models, offering insights into vertical AI applications and the evolving open‑source ecosystem in China.

AI deployment · DeepSeek · Model Optimization
0 likes · 8 min read
Data Thinking Notes
Mar 4, 2025 · Artificial Intelligence

Unlock AI-Powered Research: The DeepSeek‑R1 & DeepResearch Guide

Compiled by Tsinghua University experts, this guide systematically analyzes the DeepSeek‑R1 reasoning model and the DeepResearch platform, offering multi‑model comparisons, real‑world case studies, and end‑to‑end AI‑driven solutions from data collection to report generation for researchers.

AI research · Data Automation · DeepSeek
0 likes · 6 min read
Big Data Tech Team
Mar 4, 2025 · Industry Insights

100 Real-World DeepSeek Scenarios: How AI Is Reshaping Industries

The article analyzes DeepSeek's open‑source model launch, its rapid user growth, and presents a comprehensive list of 100 practical AI use cases across sectors—grouped by frequency and adoption stage—to illustrate the model's market impact and future potential.

AI applications · DeepSeek · Market analysis
0 likes · 16 min read
JD Tech Talk
Mar 4, 2025 · Artificial Intelligence

Building a Local Personal Knowledge Base with Ollama, DeepSeek‑R1, AnythingLLM and Integrating Continue into VSCode

This guide walks through setting up a local personal knowledge base using Ollama, DeepSeek‑R1, and AnythingLLM, and demonstrates how to integrate the Continue AI code assistant into VSCode, covering installation, configuration, and usage tips for efficient, secure development.

AI integration · AnythingLLM · DeepSeek
0 likes · 10 min read
Java Web Project
Mar 4, 2025 · Artificial Intelligence

How to Seamlessly Integrate DeepSeek AI into IntelliJ IDEA for Java Development

This step‑by‑step guide shows Java developers how to prepare their environment, install the CodeGPT plugin, configure DeepSeek with an API key and model settings, and then use the assistant for code generation, completion, explanation, question answering, and usage monitoring within IntelliJ IDEA.

AI code assistant · CodeGPT · DeepSeek
0 likes · 8 min read
Alibaba Cloud Big Data AI Platform
Mar 4, 2025 · Artificial Intelligence

Deploy a High‑Performance RAG Service with Hologres, DeepSeek, and PAI‑EAS

This guide walks you through building a Retrieval‑Augmented Generation (RAG) system by integrating Alibaba Cloud's Hologres vector store, the Proxima high‑performance vector engine, and DeepSeek large language models via PAI‑EAS, covering prerequisites, deployment steps, configuration, and inference verification.

AI deployment · DeepSeek · Hologres
0 likes · 12 min read
Architect
Mar 3, 2025 · Artificial Intelligence

Unlocking Reasoning LLMs: Methods, DeepSeek R1 Insights, and Cost‑Effective Strategies

This article examines how to build and improve reasoning‑capable large language models, explains the definition and use cases of reasoning models, details DeepSeek‑R1’s training pipeline, compares four key enhancement methods (inference‑time scaling, pure RL, SFT + RL, and distillation), and offers budget‑friendly advice.

AI research · DeepSeek · Inference Scaling
0 likes · 27 min read
AI Algorithm Path
Mar 3, 2025 · Artificial Intelligence

DeepSeek‑R1 Model Performance: Comparing 32B, 70B, and R1

This article evaluates DeepSeek‑R1’s 32B and 70B distilled models alongside the original R1 on a range of reasoning and coding tasks, detailing hardware setup, test methodology, per‑task results, and a comparative analysis of their strengths and weaknesses.

32B · 70B · DeepSeek
0 likes · 6 min read
DataFunSummit
Mar 3, 2025 · Artificial Intelligence

DeepSeek Open Source Week: Seven Core Technologies Reshaping Large‑Model Training

The DeepSeek open‑source week introduced seven breakthrough technologies—FlashMLA, DeepGEMM, DeepEP, DualPipe, EPLB, 3FS, and Smallpond—that together overhaul data flow, algorithmic complexity, hardware utilization, MoE communication, and resource balancing, dramatically improving large‑model training efficiency and lowering entry barriers for the AI industry.

AI hardware · DeepSeek · data pipelines
0 likes · 17 min read
Alibaba Cloud Developer
Mar 3, 2025 · Mobile Development

Build a WeChat Mini‑Program Without Writing Code Using AI

This article demonstrates how a non‑programmer can use the DeepSeek‑powered “AI Programmer” mode in Tongyi Lingma to generate, modify, and deploy a functional WeChat mini‑program entirely through natural language prompts, complete with screenshots of each step.

AI programming · DeepSeek · Mobile Development
0 likes · 5 min read
macrozheng
Mar 3, 2025 · Artificial Intelligence

Integrate DeepSeek with Spring AI: Step‑by‑Step Spring Boot Guide

This tutorial walks you through integrating DeepSeek via Spring AI into a Spring Boot project, covering Spring AI basics, obtaining an API key, adding dependencies and configuration, implementing controller endpoints, testing with Postman, and accessing the full source code.

AI integration · Chatbot · DeepSeek
0 likes · 7 min read
AI Large Model Application Practice
Mar 3, 2025 · Artificial Intelligence

Can DeepSeek‑R1 Unlock True “Deep Thinking” for Enterprise RAG?

This article examines how swapping in DeepSeek‑R1 enhances Retrieval‑Augmented Generation with deeper reasoning, outlines its benefits and pitfalls—including slower inference, higher compute costs, and hallucinations—provides a simple hallucination test, and proposes an Agentic RAG research assistant to balance accuracy and creativity.

AI reasoning · Agentic · DeepSeek
0 likes · 10 min read