Tagged articles
566 articles
Page 6 of 6
Big Data Technology Architecture
Big Data Technology Architecture
Feb 7, 2025 · Artificial Intelligence

How to Build a DeepSeek AI Assistant on DingTalk

This guide explains why DeepSeek is a valuable AI assistant, outlines the challenges of high demand, and provides step‑by‑step instructions for creating, configuring, testing, and publishing a DeepSeek AI assistant within the DingTalk platform to ensure stable access.

AI AssistantDeepSeekDingTalk
0 likes · 4 min read
How to Build a DeepSeek AI Assistant on DingTalk
Tencent Cloud Developer
Tencent Cloud Developer
Feb 7, 2025 · Artificial Intelligence

Launch DeepSeek Models in Seconds with One‑Click Cloud Development

This guide shows how to start DeepSeek large‑language models on cnb.cool in just 5‑10 seconds without downloading, using a simple three‑step process that includes forking the repository, selecting a model branch, and running Ollama or Docker commands, plus options for long‑term cloud deployment.

AICloud NativeDeepSeek
0 likes · 3 min read
Launch DeepSeek Models in Seconds with One‑Click Cloud Development
Alibaba Cloud Native
Alibaba Cloud Native
Feb 7, 2025 · Information Security

How DeepSeek’s Attack Highlights the Need for Robust Cloud‑Native Security Observability

The article examines DeepSeek’s rapid rise, the large‑scale malicious attacks it suffered, and then provides a detailed, cloud‑native security observability guide using Alibaba Cloud services such as DDoS protection, WAF, CLB, SAS, and SLS for logging, monitoring, anomaly detection, and alert response.

AI securityAlibaba CloudCloud Native
0 likes · 15 min read
How DeepSeek’s Attack Highlights the Need for Robust Cloud‑Native Security Observability
MaGe Linux Operations
MaGe Linux Operations
Feb 7, 2025 · Artificial Intelligence

How to Deploy DeepSeek R1 Locally: A Step‑by‑Step AI Model Guide

This article walks you through everything you need to know about DeepSeek R1—including its different model sizes, hardware requirements, installation tools like Ollama, LM Studio and Docker, and how to set up a visual interface with Open‑WebUI or Dify—for offline, private, and cost‑effective AI inference.

AIDeepSeekDocker
0 likes · 15 min read
How to Deploy DeepSeek R1 Locally: A Step‑by‑Step AI Model Guide
Java Architecture Diary
Java Architecture Diary
Feb 7, 2025 · Artificial Intelligence

Unlock DeepSeek R1’s Full Potential in Spring with deepseek4j

This article introduces deepseek4j, a Spring‑Boot‑compatible library that fully supports DeepSeek R1’s chain‑of‑thought and billing features, offering reactive streaming, easy configuration, and a built‑in debugging page, with step‑by‑step setup and code examples to help developers quickly integrate the model.

AI integrationDeepSeekOpenAI4J
0 likes · 6 min read
Unlock DeepSeek R1’s Full Potential in Spring with deepseek4j
Top Architect
Top Architect
Feb 7, 2025 · Artificial Intelligence

Integrating DeepSeek AI Assistant into IntelliJ IDEA for Java Development

This guide walks Java developers through preparing the environment, installing the CodeGPT plugin, configuring DeepSeek API keys, and using the DeepSeek AI assistant within IntelliJ IDEA to generate, complete, explain, and troubleshoot code, while also showing usage statistics.

AI code assistantCodeGPTDeepSeek
0 likes · 8 min read
Integrating DeepSeek AI Assistant into IntelliJ IDEA for Java Development
JD Tech Talk
JD Tech Talk
Feb 7, 2025 · Artificial Intelligence

Building a Local AI Assistant with DeepSeek and Chatbox Using Ollama

This step‑by‑step tutorial shows beginners how to install Ollama, deploy the DeepSeek large language model locally, and configure the Chatbox AI client to create a functional AI assistant on Windows, macOS, Linux, or mobile devices within ten minutes.

AI AssistantChatboxDeepSeek
0 likes · 5 min read
Building a Local AI Assistant with DeepSeek and Chatbox Using Ollama
Java Captain
Java Captain
Feb 7, 2025 · Artificial Intelligence

DeepSeek: Disruptive Innovations in Large Language Model Architecture, Efficiency, and Ecosystem

DeepSeek reshapes the AI landscape by replacing brute‑force compute scaling with algorithmic breakthroughs such as a novel MoE architecture, memory compression, active‑learning data pipelines, and open‑source tooling, delivering dramatically lower training and inference costs while enabling edge deployment and a vibrant developer ecosystem.

Algorithmic EfficiencyDeepSeekMoE
0 likes · 11 min read
DeepSeek: Disruptive Innovations in Large Language Model Architecture, Efficiency, and Ecosystem
Alibaba Cloud Developer
Alibaba Cloud Developer
Feb 7, 2025 · Artificial Intelligence

Why DeepSeek V3 Achieves Low Training Costs: Inside Its AI Innovations

This article provides a comprehensive analysis of DeepSeek's large‑language‑model technology, covering the company's background, model capabilities, remarkably low training and inference costs, and the core architectural and algorithmic innovations such as MoE, MLA attention, FP8 mixed‑precision, and the DualPipe pipeline that enable efficient large‑scale AI deployment.

AI ArchitectureDeepSeekFP8 training
0 likes · 19 min read
Why DeepSeek V3 Achieves Low Training Costs: Inside Its AI Innovations
Architect's Alchemy Furnace
Architect's Alchemy Furnace
Feb 6, 2025 · Artificial Intelligence

How Knowledge Distillation Powers Efficient Large‑Model Deployment

This article explains how knowledge distillation enables massive AI models to be compressed and deployed efficiently, covering its principles, classification dimensions, implementation steps, innovative practices at DeepSeek, real‑world applications, and future research directions.

DeepSeekartificial intelligenceknowledge distillation
0 likes · 11 min read
How Knowledge Distillation Powers Efficient Large‑Model Deployment
Architect's Alchemy Furnace
Architect's Alchemy Furnace
Feb 6, 2025 · Artificial Intelligence

DeepSeek R1 vs V3: Which Model Fits Your Needs? A Detailed Comparison

An in‑depth comparison of DeepSeek’s R1 model variants—from 1.5B to 671B—covers parameter scale, accuracy, training and inference costs, and ideal use cases, followed by a detailed contrast with the V3 version’s design goals, architecture, training methods, performance and application scenarios.

AIDeepSeekmodel comparison
0 likes · 10 min read
DeepSeek R1 vs V3: Which Model Fits Your Needs? A Detailed Comparison
Java One
Java One
Feb 6, 2025 · Artificial Intelligence

Deploy DeepSeek‑R1 Locally on Your Laptop in Just 3 Minutes

This step‑by‑step guide shows non‑technical users how to install Ollama, pull the desired DeepSeek‑R1 model version, run it from the terminal, and optionally connect the free Chatbox desktop client for a visual chat interface, all without external network dependencies.

AI modelChatboxDeepSeek
0 likes · 6 min read
Deploy DeepSeek‑R1 Locally on Your Laptop in Just 3 Minutes
Architecture and Beyond
Architecture and Beyond
Feb 6, 2025 · Operations

Analyzing DeepSeek’s Availability Issues and Applying Traditional Internet Reliability Strategies to AIGC

This article examines DeepSeek’s frequent service interruptions, contrasts the inherent reliability challenges of AIGC products with traditional internet applications, and proposes adopting proven isolation, rate‑limiting, and elastic‑scaling techniques to improve AI service availability and user experience.

AIGCAvailabilityDeepSeek
0 likes · 12 min read
Analyzing DeepSeek’s Availability Issues and Applying Traditional Internet Reliability Strategies to AIGC
Top Architect
Top Architect
Feb 6, 2025 · Artificial Intelligence

Deploying DeepSeek R1 671B Model Locally with Ollama: Quantization, Hardware Requirements, and Step‑by‑Step Guide

This article provides a comprehensive tutorial on locally deploying the full‑size DeepSeek R1 671B model using Ollama, covering dynamic quantization options, hardware specifications, detailed installation commands, configuration files, performance observations, and practical recommendations for consumer‑grade systems.

AIDeepSeekGPU
0 likes · 14 min read
Deploying DeepSeek R1 671B Model Locally with Ollama: Quantization, Hardware Requirements, and Step‑by‑Step Guide
Tencent Cloud Developer
Tencent Cloud Developer
Feb 6, 2025 · Artificial Intelligence

DeepSeek V Series: Technical Overview of Scaling Laws, Grouped Query Attention, and Mixture‑of‑Experts

The article reviews DeepSeek’s V‑series papers, explaining how scaling‑law insights, Grouped Query Attention, a depth‑first design, loss‑free load balancing, multi‑token prediction and Multi‑Head Latent Attention together enable economical mixture‑of‑experts LLMs that rival closed‑source models while cutting compute and hardware costs.

DeepSeekGrouped Query AttentionMixture of Experts
0 likes · 13 min read
DeepSeek V Series: Technical Overview of Scaling Laws, Grouped Query Attention, and Mixture‑of‑Experts
Code Mala Tang
Code Mala Tang
Feb 6, 2025 · Artificial Intelligence

Why DeepSeek’s Explosive Growth Is Redefining AI, Cloud, and Chip Ecosystems

Within just 20 days of launch, DeepSeek has amassed over 20 million daily active users, secured support from China’s four major cloud providers and leading global chip makers, and sparked a wave of AI ecosystem development that highlights both market potential and technological collaboration across the industry.

AIChinaDeepSeek
0 likes · 7 min read
Why DeepSeek’s Explosive Growth Is Redefining AI, Cloud, and Chip Ecosystems
Cognitive Technology Team
Cognitive Technology Team
Feb 6, 2025 · Artificial Intelligence

DeepSeek Model Guide: 10 Practical Tips and Usage Techniques

This article presents ten detailed techniques for effectively using DeepSeek's large language models—including mode selection, model comparisons, knowledge updates, prompt engineering, RAG, file uploads, API access, and open‑source resources—while offering concrete examples and code snippets for each feature.

AI APIDeepSeekRAG
0 likes · 12 min read
DeepSeek Model Guide: 10 Practical Tips and Usage Techniques
AI2ML AI to Machine Learning
AI2ML AI to Machine Learning
Feb 5, 2025 · Artificial Intelligence

What Optimizations Power DeepSeek’s High‑Efficiency LLMs?

The article enumerates DeepSeek’s extensive technical optimizations—including Grouped Query Attention, Multi‑head Latent Attention, Mixture‑of‑Experts, 4D parallelism, quantization, and multi‑token prediction—that together enable cheap, high‑performance large language models.

4D parallelismDeepSeekGrouped Query Attention
0 likes · 8 min read
What Optimizations Power DeepSeek’s High‑Efficiency LLMs?
Architect's Alchemy Furnace
Architect's Alchemy Furnace
Feb 5, 2025 · Artificial Intelligence

Deploy DeepSeek R1 Locally with Ollama: Step‑by‑Step Guide for Windows & Linux

This article provides a comprehensive guide to locally deploying DeepSeek R1 models using Ollama on Windows and Linux, covering model variants, hardware requirements, installation steps, command‑line operations, visual client options, usage examples, performance tuning, and best‑practice recommendations for developers and enterprises.

AI modelDeepSeekDocker
0 likes · 10 min read
Deploy DeepSeek R1 Locally with Ollama: Step‑by‑Step Guide for Windows & Linux
Architect
Architect
Feb 5, 2025 · Industry Insights

What Makes DeepSeek R1 a Game-Changer? Inside the AI Industry’s Latest Power Shift

An in‑depth recap of a five‑hour Lex Fridman podcast reveals DeepSeek’s breakthrough R1 model, its cost‑saving MoE and MLA techniques, the geopolitical chip export battle, market reactions, and broader AI industry trends, offering a comprehensive analysis of technology, economics, and future implications.

AI industryDeepSeekGeopolitics
0 likes · 14 min read
What Makes DeepSeek R1 a Game-Changer? Inside the AI Industry’s Latest Power Shift
Java Web Project
Java Web Project
Feb 5, 2025 · Big Data

Master DeepSeek: Install, Configure, and Harness Its Data Processing Power

This guide walks you through DeepSeek’s core capabilities—including installation on Windows, macOS, and Linux, configuration of storage paths, API keys, and logging levels, as well as data import, cleaning, analysis, visualization, batch processing, scheduling, and plugin extensions—providing concrete command examples and troubleshooting tips.

AutomationDeepSeekcommand-line
0 likes · 8 min read
Master DeepSeek: Install, Configure, and Harness Its Data Processing Power
IT Services Circle
IT Services Circle
Feb 5, 2025 · Backend Development

Using DeepSeek AI to Implement Task Scheduling System Sharding

The author explores how DeepSeek V3 can assist in designing a task scheduling system's sharding logic, describing system components, slot allocation, and the AI's step‑by‑step code generation, analysis, and testing process.

AI code generationBackend DevelopmentDeepSeek
0 likes · 4 min read
Using DeepSeek AI to Implement Task Scheduling System Sharding
Java Tech Enthusiast
Java Tech Enthusiast
Feb 5, 2025 · Artificial Intelligence

DeepSeek: AI Breakthrough and Recruitment Insights

DeepSeek’s open‑source R1 model shattered the prevailing belief that closed‑source giants like OpenAI dominate AI progress by introducing a pure reinforcement‑learning‑driven inference breakthrough with its GRPO algorithm, sparking global excitement, prompting political concern, and leading the company to aggressively hire engineers in Beijing and Hangzhou with competitive 14‑month salaries despite demanding top‑conference publications.

AI DevelopmentDeepSeekGRPO algorithm
0 likes · 7 min read
DeepSeek: AI Breakthrough and Recruitment Insights
Huawei Cloud Developer Alliance
Huawei Cloud Developer Alliance
Feb 5, 2025 · Artificial Intelligence

Deploy DeepSeek‑V3 on Ascend: Step‑by‑Step Guide for Fast AI Inference

This guide walks developers through obtaining the DeepSeek‑V3 model on the Ascend community, converting weights for GPU and NPU, loading the appropriate MindIE Docker image, launching the container, and configuring service‑level parameters to achieve efficient, out‑of‑the‑box AI inference on Ascend hardware.

AI inferenceAscendDeepSeek
0 likes · 4 min read
Deploy DeepSeek‑V3 on Ascend: Step‑by‑Step Guide for Fast AI Inference
Java Architecture Diary
Java Architecture Diary
Feb 5, 2025 · Artificial Intelligence

Unlocking DeepSeek R1’s Chain‑of‑Thought: A Spring WebFlux Integration Guide

This article examines why mainstream AI frameworks like Spring AI and LangChain4j cannot fully support DeepSeek’s R1 model, explains its unique chain‑of‑thought response format and parameter constraints, and provides a complete Spring WebFlux‑based solution—including API calls, streaming handling, and response parsing—to preserve reasoning content.

DeepSeekR1chain-of-thought
0 likes · 8 min read
Unlocking DeepSeek R1’s Chain‑of‑Thought: A Spring WebFlux Integration Guide
Su San Talks Tech
Su San Talks Tech
Feb 5, 2025 · Backend Development

How DeepSeek AI Supercharged My Task Scheduler Sharding Design

The author explores using DeepSeek V3 to design and implement a sharding mechanism for a three‑component task scheduling system, showcases the AI's step‑by‑step reasoning, code generation, and testing, and compares its output with that of Tongyi Qianwen, highlighting DeepSeek's superior programming assistance.

AI code generationDeepSeektask scheduling
0 likes · 4 min read
How DeepSeek AI Supercharged My Task Scheduler Sharding Design
21CTO
21CTO
Feb 4, 2025 · Artificial Intelligence

Is DeepSeek the Next Challenger to ChatGPT? A Deep Dive into Its AI Edge

This article explains what DeepSeek is, how its open‑source large language model works, its unique multilingual training, free access, the DeepSeek‑Coder variant, and compares its capabilities and goals with ChatGPT, highlighting strengths, limitations, and market impact.

AI modelsChatGPT comparisonDeepSeek
0 likes · 7 min read
Is DeepSeek the Next Challenger to ChatGPT? A Deep Dive into Its AI Edge
21CTO
21CTO
Feb 4, 2025 · Artificial Intelligence

Run DeepSeek Locally with Ollama: A Complete Step‑by‑Step Guide

This guide walks you through installing Ollama, selecting the appropriate DeepSeek model, running it locally, and exploring integration options, highlighting the benefits of offline AI such as data privacy, faster performance, and zero subscription costs.

AI TutorialDeepSeekLocal AI
0 likes · 7 min read
Run DeepSeek Locally with Ollama: A Complete Step‑by‑Step Guide
Tencent Tech
Tencent Tech
Feb 4, 2025 · Artificial Intelligence

Deploy and Test DeepSeek Large Language Models on Tencent Cloud TI in Minutes

This guide walks you through quickly deploying DeepSeek series models on the Tencent Cloud TI platform, covering model selection, resource planning, step‑by‑step service creation, free online trial, API testing via built‑in tools or curl, and managing inference services for both large and compact models.

AI inferenceDeepSeekModel Deployment
0 likes · 13 min read
Deploy and Test DeepSeek Large Language Models on Tencent Cloud TI in Minutes
Architect
Architect
Feb 3, 2025 · Artificial Intelligence

How DeepSeek‑R1 Uses Pure Reinforcement Learning to Match OpenAI’s o1

This article presents DeepSeek‑R1 and DeepSeek‑R1‑Zero, two next‑generation LLMs trained with pure reinforcement learning and multi‑stage fine‑tuning, details their GRPO training framework, model‑distillation pipeline, open‑source release, and evaluation results that rival OpenAI’s o1‑1217 across reasoning, knowledge, and coding benchmarks.

DeepSeekLLM evaluationOpenAI o1
0 likes · 10 min read
How DeepSeek‑R1 Uses Pure Reinforcement Learning to Match OpenAI’s o1
Tencent Cloud Developer
Tencent Cloud Developer
Feb 3, 2025 · Artificial Intelligence

DeepSeek's Emergence: Implications for AI, Enterprise Digital Transformation, and Future Software Development

DeepSeek’s debut marks a watershed for China’s AI, offering low‑cost, Chinese‑native reasoning that outperforms foreign models and prompting enterprises to restructure development around demand‑engineering, AI‑assisted low‑code, intelligent data stores, and a shift from “how to code” to “why to code” across a three‑phase transformation roadmap.

AI strategyDeepSeekDigital Transformation
0 likes · 15 min read
DeepSeek's Emergence: Implications for AI, Enterprise Digital Transformation, and Future Software Development
Cognitive Technology Team
Cognitive Technology Team
Feb 3, 2025 · Artificial Intelligence

DeepSeek R1 Introduces Group‑Related Policy Optimization for Advanced Reasoning in Large Language Models

DeepSeek AI’s new open‑source model DeepSeek‑R1 leverages a novel Group‑Related Policy Optimization (GRPO) reinforcement‑learning framework and multi‑stage training to dramatically boost complex reasoning performance, achieving AIME 2024 Pass@1 scores comparable to OpenAI’s o1 model.

AIDeepSeekGRPO
0 likes · 4 min read
DeepSeek R1 Introduces Group‑Related Policy Optimization for Advanced Reasoning in Large Language Models
IT Services Circle
IT Services Circle
Feb 2, 2025 · Artificial Intelligence

OpenAI and Anthropic Accuse DeepSeek of Model Distillation and IP Infringement: Industry Reactions and Technical Overview

OpenAI and Anthropic allege that DeepSeek has illegally distilled their large language models, prompting investigations, industry satire, and a detailed look at model distillation technology, its legal implications, and the broader trends shaping AI cost, scaling laws, and market dynamics.

AI ethicsDeepSeekOpenAI
0 likes · 10 min read
OpenAI and Anthropic Accuse DeepSeek of Model Distillation and IP Infringement: Industry Reactions and Technical Overview
DataFunTalk
DataFunTalk
Feb 2, 2025 · Artificial Intelligence

DeepSeek Releases Janus‑Pro‑7B Multimodal Model, Beats DALL‑E 3 and Stable Diffusion on Benchmarks

DeepSeek's newly released Janus‑Pro‑7B multimodal model, open‑sourced overnight, outperforms DALL‑E 3 and Stable Diffusion on GenEval and DPG‑Bench, showcases a unified self‑regressive architecture with SigLIP‑L visual encoder, and has sparked massive user adoption and market reactions worldwide.

AI benchmarksDeepSeek
0 likes · 9 min read
DeepSeek Releases Janus‑Pro‑7B Multimodal Model, Beats DALL‑E 3 and Stable Diffusion on Benchmarks
Code Mala Tang
Code Mala Tang
Feb 2, 2025 · Artificial Intelligence

How to Deploy DeepSeek AI Coding Assistant Locally: A Step‑by‑Step Guide

This guide walks you through the hardware and software prerequisites, Docker-based installation, environment configuration, model fine‑tuning, IDE integration, maintenance, and troubleshooting for running the DeepSeek AI programming assistant entirely on your own machine.

AI coding assistantDeepSeekDocker
0 likes · 12 min read
How to Deploy DeepSeek AI Coding Assistant Locally: A Step‑by‑Step Guide
Alibaba Cloud Big Data AI Platform
Alibaba Cloud Big Data AI Platform
Feb 1, 2025 · Artificial Intelligence

Deploy DeepSeek-V3 and R1 Models with One-Click on Alibaba Cloud PAI Model Gallery

This article introduces Alibaba Cloud's PAI Model Gallery, detailing the DeepSeek-V3 and DeepSeek‑R1 large language models, their architectures and parameters, and provides a step‑by‑step guide for one‑click deployment of these models and their distilled variants using vLLM or BladeLLM.

AI inferenceAlibaba CloudDeepSeek
0 likes · 6 min read
Deploy DeepSeek-V3 and R1 Models with One-Click on Alibaba Cloud PAI Model Gallery
Code Mala Tang
Code Mala Tang
Jan 31, 2025 · Artificial Intelligence

Master DeepSeek: 7 Prompt Engineering Tricks to Boost AI Responses

This guide presents seven practical prompt‑engineering techniques—clear goals, structured queries, domain terminology, concrete examples, scoped questions, step‑by‑step breakdowns, and multi‑turn interactions—to help users get more accurate and useful answers from DeepSeek.

AI promptsDeepSeekLanguage Model
0 likes · 6 min read
Master DeepSeek: 7 Prompt Engineering Tricks to Boost AI Responses
Architect
Architect
Jan 29, 2025 · Artificial Intelligence

How Janus‑Pro Redefines Multimodal AI with Bigger Models and New Training Strategies

DeepSeek’s newly released Janus‑Pro series (1B and 7B) advances multimodal AI by decoupling visual understanding and generation, employing optimized three‑stage training, massive data expansion, and larger LLM backbones, achieving performance that matches or exceeds leading models such as Meta, Google, OpenAI, and Stability AI.

DeepSeekJanus-ProModel Scaling
0 likes · 6 min read
How Janus‑Pro Redefines Multimodal AI with Bigger Models and New Training Strategies
Programmer DD
Programmer DD
Jan 27, 2025 · Artificial Intelligence

Run DeepSeek‑R1 Locally with Ollama and Call It from Spring Boot

Learn how to deploy the open‑source DeepSeek‑R1 model using Ollama on Linux or macOS, configure various model sizes, and integrate it into a Spring Boot application with Spring AI to build an API‑driven translation service, complete with code examples and testing.

APIDeepSeekOllama
0 likes · 9 min read
Run DeepSeek‑R1 Locally with Ollama and Call It from Spring Boot
AI Code to Success
AI Code to Success
Jan 26, 2025 · Industry Insights

How DeepSeek‑R1 Is Challenging OpenAI’s o1 and Shaping the AI Landscape

DeepSeek‑R1 achieved a 1357‑point Arena score, ranking third overall and tying OpenAI o1 for first in StyleCtrl, while its open‑source MIT‑licensed release—including distilled variants—and low‑cost API service aim to democratize advanced AI inference for developers worldwide.

AI competitionArena benchmarkDeepSeek
0 likes · 5 min read
How DeepSeek‑R1 Is Challenging OpenAI’s o1 and Shaping the AI Landscape
DevOps
DevOps
Jan 25, 2025 · Artificial Intelligence

DeepSeek R1: An Open‑Source Large Model Matching OpenAI’s o1 at a Fraction of the Cost

DeepSeek’s newly released R1 model delivers performance comparable to OpenAI’s o1 while cutting inference costs by 90‑95%, leveraging innovative MLA and MoE architectures, low‑cost hardware training, an open‑source strategy, and a youthful, flat‑structured team that challenges the AI industry’s high‑spending model.

AI startupCost‑Efficient TrainingDeepSeek
0 likes · 12 min read
DeepSeek R1: An Open‑Source Large Model Matching OpenAI’s o1 at a Fraction of the Cost
Alibaba Cloud Native
Alibaba Cloud Native
Jan 22, 2025 · Cloud Native

Seamlessly Migrate from OpenAI to DeepSeek with Higress AI Gateway

This guide explains how to install the Higress AI gateway, configure provider API keys, set up gray‑release routing between OpenAI and DeepSeek, use a Python client to call DeepSeek, and enable content security and observability features for safe, cost‑effective large‑model deployments.

AI gatewayContent SecurityDeepSeek
0 likes · 7 min read
Seamlessly Migrate from OpenAI to DeepSeek with Higress AI Gateway
Baobao Algorithm Notes
Baobao Algorithm Notes
Jan 22, 2025 · Artificial Intelligence

Can RL‑Only Training Make LLMs Beat OpenAI‑o1? Inside DeepSeek‑R1’s Architecture and Results

DeepSeek‑R1’s open‑source series demonstrates that reinforcement‑learning‑only training can match top‑tier models like OpenAI‑o1, while a small amount of SFT further improves readability; the article dissects its technical report, training pipeline, reward design, distillation strategy, benchmark outcomes, and remaining challenges.

DeepSeekSupervised Fine‑Tuninglarge language model
0 likes · 11 min read
Can RL‑Only Training Make LLMs Beat OpenAI‑o1? Inside DeepSeek‑R1’s Architecture and Results
Java Architecture Diary
Java Architecture Diary
Jan 21, 2025 · Artificial Intelligence

Unlocking DeepSeek R1: How to Leverage the New Reasoning Model with Spring AI

This article introduces DeepSeek R1, a breakthrough reasoning‑focused large model that visualizes its chain‑of‑thought process, matches OpenAI O1 performance, offers open‑source advantages, and provides step‑by‑step Spring AI integration guidance, including dependency setup, configuration, and code examples.

AI integrationDeepSeekR1
0 likes · 9 min read
Unlocking DeepSeek R1: How to Leverage the New Reasoning Model with Spring AI
Baobao Algorithm Notes
Baobao Algorithm Notes
Jan 15, 2025 · Artificial Intelligence

How Multi-Token Prediction Boosts LLM Training and Inference Efficiency

This article reviews the evolution of Multi‑Token Prediction (MTP) techniques—from early blockwise parallel decoding to Meta's and DeepSeek's implementations—explaining their architectures, training and inference workflows, and the speed‑up gains they offer for large language models.

DeepSeekInference AccelerationLLM
0 likes · 20 min read
How Multi-Token Prediction Boosts LLM Training and Inference Efficiency
Baobao Algorithm Notes
Baobao Algorithm Notes
Jan 7, 2025 · Artificial Intelligence

How Efficient Is DeepSeek V3? Calculating Its MFU Around 37%

This article derives DeepSeek V3's training Model FLOPs Utilization (MFU) using publicly available data, showing an MFU of roughly 37%—about a 60% improvement over V2—and provides detailed formulas, parameter settings, and a reproducible Python script.

AI PerformanceDeepSeekMFU
0 likes · 8 min read
How Efficient Is DeepSeek V3? Calculating Its MFU Around 37%
ShiZhen AI
ShiZhen AI
Jan 6, 2025 · Industry Insights

AI Daily Roundup: Altman's Singularity Hint, Microsoft’s $80B AI Investment, DeepSeek DeepThink, and BCI Breakthroughs

The article reviews Sam Altman's cryptic six‑word tweet about approaching the AI singularity, Microsoft’s $80 billion plan to expand AI data‑center infrastructure, DeepSeek’s DeepThink feature for step‑wise reasoning, and NeuroXess’s brain‑computer‑interface advances that let patients control AI and robots with thought.

AIDeepSeekMicrosoft
0 likes · 8 min read
AI Daily Roundup: Altman's Singularity Hint, Microsoft’s $80B AI Investment, DeepSeek DeepThink, and BCI Breakthroughs
ZhongAn Tech Team
ZhongAn Tech Team
Jan 5, 2025 · Artificial Intelligence

Weekly AI Roundup Issue 9: OpenAI Vision, LeCun Interview, ByteDance HLLM, and DeepSeek‑V3 Highlights

This issue presents a curated overview of recent AI developments, including Sam Altman's 2025 technology vision poll, LeCun's interview on future AI directions, ByteDance's hierarchical large language model for recommendation, and the performance and cost advantages of the open‑source DeepSeek‑V3 model.

AIByteDanceDeepSeek
0 likes · 10 min read
Weekly AI Roundup Issue 9: OpenAI Vision, LeCun Interview, ByteDance HLLM, and DeepSeek‑V3 Highlights
CSS Magic
CSS Magic
May 13, 2024 · Artificial Intelligence

DeepSeek: China’s New LLM Dark Horse – First Impressions and Shockingly Low Prices

The article evaluates DeepSeek v2, a 100‑billion‑parameter MoE model, highlighting its near‑GPT‑4 benchmark performance, OpenAI‑compatible API, 32k‑token context, exceptionally low pricing, a custom token‑utilization metric, and the practical drawbacks observed during hands‑on testing.

API compatibilityBenchmarkDeepSeek
0 likes · 9 min read
DeepSeek: China’s New LLM Dark Horse – First Impressions and Shockingly Low Prices
Baobao Algorithm Notes
Baobao Algorithm Notes
May 9, 2024 · Artificial Intelligence

Inside Deepseek‑V2: How Multi‑Head Latent Attention Cuts KV‑Cache and Boosts Performance

This article provides an in‑depth technical analysis of Deepseek‑V2, covering its 236B parameter size, Multi‑Head Latent Attention optimization that reduces KV‑cache memory, architectural details, training pipelines, infrastructure choices, and performance results on benchmarks such as MMLU and instruction following.

AI ArchitectureDeepSeekModel Optimization
0 likes · 17 min read
Inside Deepseek‑V2: How Multi‑Head Latent Attention Cuts KV‑Cache and Boosts Performance