Tagged articles

566 articles

Page 6 of 6

Feb 7, 2025 · Artificial Intelligence

How to Build a DeepSeek AI Assistant on DingTalk

This guide explains why DeepSeek is a valuable AI assistant, outlines the challenges of high demand, and provides step‑by‑step instructions for creating, configuring, testing, and publishing a DeepSeek AI assistant within the DingTalk platform to ensure stable access.

AI AssistantDeepSeekDingTalk

0 likes · 4 min read

How to Build a DeepSeek AI Assistant on DingTalk

Tencent Cloud Developer

Feb 7, 2025 · Artificial Intelligence

Launch DeepSeek Models in Seconds with One‑Click Cloud Development

This guide shows how to start DeepSeek large‑language models on cnb.cool in just 5‑10 seconds without downloading, using a simple three‑step process that includes forking the repository, selecting a model branch, and running Ollama or Docker commands, plus options for long‑term cloud deployment.

AICloud NativeDeepSeek

0 likes · 3 min read

Launch DeepSeek Models in Seconds with One‑Click Cloud Development

Alibaba Cloud Native

Feb 7, 2025 · Information Security

How DeepSeek’s Attack Highlights the Need for Robust Cloud‑Native Security Observability

The article examines DeepSeek’s rapid rise, the large‑scale malicious attacks it suffered, and then provides a detailed, cloud‑native security observability guide using Alibaba Cloud services such as DDoS protection, WAF, CLB, SAS, and SLS for logging, monitoring, anomaly detection, and alert response.

AI securityAlibaba CloudCloud Native

0 likes · 15 min read

How DeepSeek’s Attack Highlights the Need for Robust Cloud‑Native Security Observability

Java Architect Essentials

Feb 7, 2025 · Big Data

Master DeepSeek: From Installation to Advanced Data Analysis in One Guide

This comprehensive guide walks you through DeepSeek's features, installation on Windows, macOS, and Linux, configuration details, basic commands for data import, querying, and visualization, as well as advanced cleaning, analysis, plugin extensions, troubleshooting tips, and a handy command cheat sheet.

Big DataData visualizationDeepSeek

0 likes · 9 min read

Master DeepSeek: From Installation to Advanced Data Analysis in One Guide

MaGe Linux Operations

Feb 7, 2025 · Artificial Intelligence

How to Deploy DeepSeek R1 Locally: A Step‑by‑Step AI Model Guide

This article walks you through everything you need to know about DeepSeek R1—including its different model sizes, hardware requirements, installation tools like Ollama, LM Studio and Docker, and how to set up a visual interface with Open‑WebUI or Dify—for offline, private, and cost‑effective AI inference.

AIDeepSeekDocker

0 likes · 15 min read

How to Deploy DeepSeek R1 Locally: A Step‑by‑Step AI Model Guide

Code Ape Tech Column

Feb 7, 2025 · Artificial Intelligence

Integrating DeepSeek AI Assistant into IntelliJ IDEA for Java Development

This guide walks you through preparing the environment, installing the CodeGPT plugin, configuring DeepSeek with API keys, and using its code generation, completion, explanation, and question‑answer features inside IntelliJ IDEA, while also showing how to monitor token usage.

AI code assistantCodeGPTDeepSeek

0 likes · 8 min read

Integrating DeepSeek AI Assistant into IntelliJ IDEA for Java Development

Java Architecture Diary

Feb 7, 2025 · Artificial Intelligence

Unlock DeepSeek R1’s Full Potential in Spring with deepseek4j

This article introduces deepseek4j, a Spring‑Boot‑compatible library that fully supports DeepSeek R1’s chain‑of‑thought and billing features, offering reactive streaming, easy configuration, and a built‑in debugging page, with step‑by‑step setup and code examples to help developers quickly integrate the model.

AI integrationDeepSeekOpenAI4J

0 likes · 6 min read

Unlock DeepSeek R1’s Full Potential in Spring with deepseek4j

Top Architect

Feb 7, 2025 · Artificial Intelligence

Integrating DeepSeek AI Assistant into IntelliJ IDEA for Java Development

This guide walks Java developers through preparing the environment, installing the CodeGPT plugin, configuring DeepSeek API keys, and using the DeepSeek AI assistant within IntelliJ IDEA to generate, complete, explain, and troubleshoot code, while also showing usage statistics.

AI code assistantCodeGPTDeepSeek

0 likes · 8 min read

JD Tech Talk

Feb 7, 2025 · Artificial Intelligence

Building a Local AI Assistant with DeepSeek and Chatbox Using Ollama

This step‑by‑step tutorial shows beginners how to install Ollama, deploy the DeepSeek large language model locally, and configure the Chatbox AI client to create a functional AI assistant on Windows, macOS, Linux, or mobile devices within ten minutes.

AI AssistantChatboxDeepSeek

0 likes · 5 min read

Building a Local AI Assistant with DeepSeek and Chatbox Using Ollama

JD Cloud Developers

Feb 7, 2025 · Artificial Intelligence

Build a Local AI Assistant with DeepSeek and Ollama in 10 Minutes

This guide walks you through installing Ollama, downloading the DeepSeek model, and configuring the Chatbox AI client so you can run a powerful local AI assistant on Windows, macOS, or Linux within minutes.

AI AssistantChatboxDeepSeek

0 likes · 3 min read

Build a Local AI Assistant with DeepSeek and Ollama in 10 Minutes

Java Captain

Feb 7, 2025 · Artificial Intelligence

DeepSeek: Disruptive Innovations in Large Language Model Architecture, Efficiency, and Ecosystem

DeepSeek reshapes the AI landscape by replacing brute‑force compute scaling with algorithmic breakthroughs such as a novel MoE architecture, memory compression, active‑learning data pipelines, and open‑source tooling, delivering dramatically lower training and inference costs while enabling edge deployment and a vibrant developer ecosystem.

Algorithmic EfficiencyDeepSeekMoE

0 likes · 11 min read

DeepSeek: Disruptive Innovations in Large Language Model Architecture, Efficiency, and Ecosystem

Su San Talks Tech

Feb 7, 2025 · Artificial Intelligence

Quickly Build a Personal Knowledge Base with DeepSeek – Step‑by‑Step Guide

This tutorial explains why a personal knowledge base is essential, then walks you through installing Cherry Studio, configuring DeepSeek chat and embedding models, creating and populating the knowledge base, and finally using an AI assistant to retrieve information efficiently.

AI AssistantCherry StudioDeepSeek

0 likes · 7 min read

Quickly Build a Personal Knowledge Base with DeepSeek – Step‑by‑Step Guide

Alibaba Cloud Developer

Feb 7, 2025 · Artificial Intelligence

Why DeepSeek V3 Achieves Low Training Costs: Inside Its AI Innovations

This article provides a comprehensive analysis of DeepSeek's large‑language‑model technology, covering the company's background, model capabilities, remarkably low training and inference costs, and the core architectural and algorithmic innovations such as MoE, MLA attention, FP8 mixed‑precision, and the DualPipe pipeline that enable efficient large‑scale AI deployment.

AI ArchitectureDeepSeekFP8 training

0 likes · 19 min read

Why DeepSeek V3 Achieves Low Training Costs: Inside Its AI Innovations

Architect's Alchemy Furnace

Feb 6, 2025 · Artificial Intelligence

How Knowledge Distillation Powers Efficient Large‑Model Deployment

This article explains how knowledge distillation enables massive AI models to be compressed and deployed efficiently, covering its principles, classification dimensions, implementation steps, innovative practices at DeepSeek, real‑world applications, and future research directions.

DeepSeekartificial intelligenceknowledge distillation

0 likes · 11 min read

How Knowledge Distillation Powers Efficient Large‑Model Deployment

Architect's Alchemy Furnace

Feb 6, 2025 · Artificial Intelligence

DeepSeek R1 vs V3: Which Model Fits Your Needs? A Detailed Comparison

An in‑depth comparison of DeepSeek’s R1 model variants—from 1.5B to 671B—covers parameter scale, accuracy, training and inference costs, and ideal use cases, followed by a detailed contrast with the V3 version’s design goals, architecture, training methods, performance and application scenarios.

AIDeepSeekmodel comparison

0 likes · 10 min read

DeepSeek R1 vs V3: Which Model Fits Your Needs? A Detailed Comparison

Java One

Feb 6, 2025 · Artificial Intelligence

Deploy DeepSeek‑R1 Locally on Your Laptop in Just 3 Minutes

This step‑by‑step guide shows non‑technical users how to install Ollama, pull the desired DeepSeek‑R1 model version, run it from the terminal, and optionally connect the free Chatbox desktop client for a visual chat interface, all without external network dependencies.

AI modelChatboxDeepSeek

0 likes · 6 min read

Deploy DeepSeek‑R1 Locally on Your Laptop in Just 3 Minutes

Architecture and Beyond

Feb 6, 2025 · Operations

Analyzing DeepSeek’s Availability Issues and Applying Traditional Internet Reliability Strategies to AIGC

This article examines DeepSeek’s frequent service interruptions, contrasts the inherent reliability challenges of AIGC products with traditional internet applications, and proposes adopting proven isolation, rate‑limiting, and elastic‑scaling techniques to improve AI service availability and user experience.

AIGCAvailabilityDeepSeek

0 likes · 12 min read

Analyzing DeepSeek’s Availability Issues and Applying Traditional Internet Reliability Strategies to AIGC

Architecture Digest

Feb 6, 2025 · Artificial Intelligence

Deploying DeepSeek R1 671B Model Locally with Ollama and Dynamic Quantization

This guide explains how to deploy the full 671B DeepSeek R1 model on local hardware using Ollama, leveraging dynamic quantization to shrink model size, detailing hardware requirements, step‑by‑step installation, configuration, performance observations, and practical recommendations.

DeepSeekDynamic QuantizationGPU

0 likes · 12 min read

Deploying DeepSeek R1 671B Model Locally with Ollama and Dynamic Quantization

Top Architect

Feb 6, 2025 · Artificial Intelligence

Deploying DeepSeek R1 671B Model Locally with Ollama: Quantization, Hardware Requirements, and Step‑by‑Step Guide

This article provides a comprehensive tutorial on locally deploying the full‑size DeepSeek R1 671B model using Ollama, covering dynamic quantization options, hardware specifications, detailed installation commands, configuration files, performance observations, and practical recommendations for consumer‑grade systems.

AIDeepSeekGPU

0 likes · 14 min read

Deploying DeepSeek R1 671B Model Locally with Ollama: Quantization, Hardware Requirements, and Step‑by‑Step Guide

Tencent Cloud Developer

Feb 6, 2025 · Artificial Intelligence

DeepSeek V Series: Technical Overview of Scaling Laws, Grouped Query Attention, and Mixture‑of‑Experts

The article reviews DeepSeek’s V‑series papers, explaining how scaling‑law insights, Grouped Query Attention, a depth‑first design, loss‑free load balancing, multi‑token prediction and Multi‑Head Latent Attention together enable economical mixture‑of‑experts LLMs that rival closed‑source models while cutting compute and hardware costs.

DeepSeekGrouped Query AttentionMixture of Experts

0 likes · 13 min read

DeepSeek V Series: Technical Overview of Scaling Laws, Grouped Query Attention, and Mixture‑of‑Experts

Code Mala Tang

Feb 6, 2025 · Artificial Intelligence

Why DeepSeek’s Explosive Growth Is Redefining AI, Cloud, and Chip Ecosystems

Within just 20 days of launch, DeepSeek has amassed over 20 million daily active users, secured support from China’s four major cloud providers and leading global chip makers, and sparked a wave of AI ecosystem development that highlights both market potential and technological collaboration across the industry.

AIChinaDeepSeek

0 likes · 7 min read

Why DeepSeek’s Explosive Growth Is Redefining AI, Cloud, and Chip Ecosystems

Su San Talks Tech

Feb 6, 2025 · Artificial Intelligence

How to Integrate DeepSeek AI with IntelliJ IDEA Using CodeGPT

This step‑by‑step guide shows how to install IntelliJ IDEA, add the CodeGPT plugin, configure DeepSeek API keys and URLs, and use the DeepSeek Coder template for AI‑assisted code completion within the IDE.

AI integrationCodeGPTDeepSeek

0 likes · 4 min read

How to Integrate DeepSeek AI with IntelliJ IDEA Using CodeGPT

Cognitive Technology Team

Feb 6, 2025 · Artificial Intelligence

DeepSeek Model Guide: 10 Practical Tips and Usage Techniques

This article presents ten detailed techniques for effectively using DeepSeek's large language models—including mode selection, model comparisons, knowledge updates, prompt engineering, RAG, file uploads, API access, and open‑source resources—while offering concrete examples and code snippets for each feature.

AI APIDeepSeekRAG

0 likes · 12 min read

DeepSeek Model Guide: 10 Practical Tips and Usage Techniques

AI2ML AI to Machine Learning

Feb 5, 2025 · Artificial Intelligence

What Optimizations Power DeepSeek’s High‑Efficiency LLMs?

The article enumerates DeepSeek’s extensive technical optimizations—including Grouped Query Attention, Multi‑head Latent Attention, Mixture‑of‑Experts, 4D parallelism, quantization, and multi‑token prediction—that together enable cheap, high‑performance large language models.

4D parallelismDeepSeekGrouped Query Attention

0 likes · 8 min read

What Optimizations Power DeepSeek’s High‑Efficiency LLMs?

Architect's Alchemy Furnace

Feb 5, 2025 · Artificial Intelligence

Deploy DeepSeek R1 Locally with Ollama: Step‑by‑Step Guide for Windows & Linux

This article provides a comprehensive guide to locally deploying DeepSeek R1 models using Ollama on Windows and Linux, covering model variants, hardware requirements, installation steps, command‑line operations, visual client options, usage examples, performance tuning, and best‑practice recommendations for developers and enterprises.

AI modelDeepSeekDocker

0 likes · 10 min read

Deploy DeepSeek R1 Locally with Ollama: Step‑by‑Step Guide for Windows & Linux

Architect

Feb 5, 2025 · Industry Insights

What Makes DeepSeek R1 a Game-Changer? Inside the AI Industry’s Latest Power Shift

An in‑depth recap of a five‑hour Lex Fridman podcast reveals DeepSeek’s breakthrough R1 model, its cost‑saving MoE and MLA techniques, the geopolitical chip export battle, market reactions, and broader AI industry trends, offering a comprehensive analysis of technology, economics, and future implications.

AI industryDeepSeekGeopolitics

0 likes · 14 min read

What Makes DeepSeek R1 a Game-Changer? Inside the AI Industry’s Latest Power Shift

Architecture Digest

Feb 5, 2025 · Big Data

DeepSeek: Complete Guide to Installation, Configuration, Basic Usage, Advanced Features, and Tips

This article provides a comprehensive, step‑by‑step tutorial on DeepSeek—a versatile command‑line data processing and analysis tool—covering its core features, installation on Windows/macOS/Linux, configuration options, basic commands, advanced functions, practical tips, and troubleshooting advice.

CLI toolDeepSeekdata analysis

0 likes · 9 min read

DeepSeek: Complete Guide to Installation, Configuration, Basic Usage, Advanced Features, and Tips

Alibaba Cloud Developer

Feb 5, 2025 · Artificial Intelligence

Deploy DeepSeek Models on Alibaba Cloud PAI with One-Click: A Step-by-Step Guide

This tutorial shows how to log into Alibaba Cloud PAI, navigate to the Model Gallery, select a DeepSeek model such as the distilled DeepSeek‑R1‑Distill‑Qwen‑7B, and deploy it with a single click using vLLM or BladeLLM, providing endpoint and token details for immediate use.

AIAlibaba CloudBladeLLM

0 likes · 3 min read

Deploy DeepSeek Models on Alibaba Cloud PAI with One-Click: A Step-by-Step Guide

Java Web Project

Feb 5, 2025 · Big Data

Master DeepSeek: Install, Configure, and Harness Its Data Processing Power

This guide walks you through DeepSeek’s core capabilities—including installation on Windows, macOS, and Linux, configuration of storage paths, API keys, and logging levels, as well as data import, cleaning, analysis, visualization, batch processing, scheduling, and plugin extensions—providing concrete command examples and troubleshooting tips.

AutomationDeepSeekcommand-line

0 likes · 8 min read

Master DeepSeek: Install, Configure, and Harness Its Data Processing Power

IT Services Circle

Feb 5, 2025 · Backend Development

Using DeepSeek AI to Implement Task Scheduling System Sharding

The author explores how DeepSeek V3 can assist in designing a task scheduling system's sharding logic, describing system components, slot allocation, and the AI's step‑by‑step code generation, analysis, and testing process.

AI code generationBackend DevelopmentDeepSeek

0 likes · 4 min read

Using DeepSeek AI to Implement Task Scheduling System Sharding

Java Tech Enthusiast

Feb 5, 2025 · Artificial Intelligence

DeepSeek: AI Breakthrough and Recruitment Insights

DeepSeek’s open‑source R1 model shattered the prevailing belief that closed‑source giants like OpenAI dominate AI progress by introducing a pure reinforcement‑learning‑driven inference breakthrough with its GRPO algorithm, sparking global excitement, prompting political concern, and leading the company to aggressively hire engineers in Beijing and Hangzhou with competitive 14‑month salaries despite demanding top‑conference publications.

AI DevelopmentDeepSeekGRPO algorithm

0 likes · 7 min read

DeepSeek: AI Breakthrough and Recruitment Insights

Huawei Cloud Developer Alliance

Feb 5, 2025 · Artificial Intelligence

Deploy DeepSeek‑V3 on Ascend: Step‑by‑Step Guide for Fast AI Inference

This guide walks developers through obtaining the DeepSeek‑V3 model on the Ascend community, converting weights for GPU and NPU, loading the appropriate MindIE Docker image, launching the container, and configuring service‑level parameters to achieve efficient, out‑of‑the‑box AI inference on Ascend hardware.

AI inferenceAscendDeepSeek

0 likes · 4 min read

Deploy DeepSeek‑V3 on Ascend: Step‑by‑Step Guide for Fast AI Inference

Java Architecture Diary

Feb 5, 2025 · Artificial Intelligence

Unlocking DeepSeek R1’s Chain‑of‑Thought: A Spring WebFlux Integration Guide

This article examines why mainstream AI frameworks like Spring AI and LangChain4j cannot fully support DeepSeek’s R1 model, explains its unique chain‑of‑thought response format and parameter constraints, and provides a complete Spring WebFlux‑based solution—including API calls, streaming handling, and response parsing—to preserve reasoning content.

DeepSeekR1chain-of-thought

0 likes · 8 min read

Unlocking DeepSeek R1’s Chain‑of‑Thought: A Spring WebFlux Integration Guide

Su San Talks Tech

Feb 5, 2025 · Backend Development

How DeepSeek AI Supercharged My Task Scheduler Sharding Design

The author explores using DeepSeek V3 to design and implement a sharding mechanism for a three‑component task scheduling system, showcases the AI's step‑by‑step reasoning, code generation, and testing, and compares its output with that of Tongyi Qianwen, highlighting DeepSeek's superior programming assistance.

AI code generationDeepSeektask scheduling

0 likes · 4 min read

How DeepSeek AI Supercharged My Task Scheduler Sharding Design

Mingyi World Elasticsearch

Feb 4, 2025 · Industry Insights

What Will DeepSeek Look Like in 2035? A Future‑Facing Take on the Current AI Hype

The author reflects on DeepSeek’s explosive popularity, compares it to early ChatGPT hype, and outlines ten observations—from early‑stage entry and tool comparison to long‑term AI marathon thinking—offering a forward‑looking perspective on how technologists should navigate the AI transformation.

AI toolsAI trendsDeepSeek

0 likes · 6 min read

What Will DeepSeek Look Like in 2035? A Future‑Facing Take on the Current AI Hype

21CTO

Feb 4, 2025 · Artificial Intelligence

Is DeepSeek the Next Challenger to ChatGPT? A Deep Dive into Its AI Edge

This article explains what DeepSeek is, how its open‑source large language model works, its unique multilingual training, free access, the DeepSeek‑Coder variant, and compares its capabilities and goals with ChatGPT, highlighting strengths, limitations, and market impact.

AI modelsChatGPT comparisonDeepSeek

0 likes · 7 min read

Is DeepSeek the Next Challenger to ChatGPT? A Deep Dive into Its AI Edge

21CTO

Feb 4, 2025 · Artificial Intelligence

Run DeepSeek Locally with Ollama: A Complete Step‑by‑Step Guide

This guide walks you through installing Ollama, selecting the appropriate DeepSeek model, running it locally, and exploring integration options, highlighting the benefits of offline AI such as data privacy, faster performance, and zero subscription costs.

AI TutorialDeepSeekLocal AI

0 likes · 7 min read

Run DeepSeek Locally with Ollama: A Complete Step‑by‑Step Guide

Radish, Keep Going!

Feb 4, 2025 · Artificial Intelligence

How DeepSeek Is Redefining AI: Efficiency, Open‑Source Impact, and Future Trends

The article reviews DeepSeek's breakthrough in inference efficiency, explores the trade‑offs of model distillation, compares open‑source and closed‑source ecosystems, examines shifting compute demands, highlights Chinese engineering innovations, and outlines future directions for AI development.

AI inferenceDeepSeekMultimodal AI

0 likes · 9 min read

How DeepSeek Is Redefining AI: Efficiency, Open‑Source Impact, and Future Trends

Tencent Tech

Feb 4, 2025 · Artificial Intelligence

Deploy and Test DeepSeek Large Language Models on Tencent Cloud TI in Minutes

This guide walks you through quickly deploying DeepSeek series models on the Tencent Cloud TI platform, covering model selection, resource planning, step‑by‑step service creation, free online trial, API testing via built‑in tools or curl, and managing inference services for both large and compact models.

AI inferenceDeepSeekModel Deployment

0 likes · 13 min read

Deploy and Test DeepSeek Large Language Models on Tencent Cloud TI in Minutes

Code Mala Tang

Feb 4, 2025 · Artificial Intelligence

Why OpenAI’s Race to More Products Is a Losing Game – Lessons from SpaceX and DeepSeek

The article analyzes OpenAI’s costly strategy of launching numerous AI products, compares its financial losses to other tech giants, and highlights how cost‑cutting approaches from SpaceX and DeepSeek offer a more sustainable path for AI development.

AIBusiness ModelCost reduction

0 likes · 7 min read

Why OpenAI’s Race to More Products Is a Losing Game – Lessons from SpaceX and DeepSeek

Architect

Feb 3, 2025 · Artificial Intelligence

How DeepSeek‑R1 Uses Pure Reinforcement Learning to Match OpenAI’s o1

This article presents DeepSeek‑R1 and DeepSeek‑R1‑Zero, two next‑generation LLMs trained with pure reinforcement learning and multi‑stage fine‑tuning, details their GRPO training framework, model‑distillation pipeline, open‑source release, and evaluation results that rival OpenAI’s o1‑1217 across reasoning, knowledge, and coding benchmarks.

DeepSeekLLM evaluationOpenAI o1

0 likes · 10 min read

How DeepSeek‑R1 Uses Pure Reinforcement Learning to Match OpenAI’s o1

Tencent Cloud Developer

Feb 3, 2025 · Artificial Intelligence

DeepSeek's Emergence: Implications for AI, Enterprise Digital Transformation, and Future Software Development

DeepSeek’s debut marks a watershed for China’s AI, offering low‑cost, Chinese‑native reasoning that outperforms foreign models and prompting enterprises to restructure development around demand‑engineering, AI‑assisted low‑code, intelligent data stores, and a shift from “how to code” to “why to code” across a three‑phase transformation roadmap.

AI strategyDeepSeekDigital Transformation

0 likes · 15 min read

DeepSeek's Emergence: Implications for AI, Enterprise Digital Transformation, and Future Software Development

Su San Talks Tech

Feb 3, 2025 · Artificial Intelligence

Run DeepSeek Locally with Ollama: Step‑by‑Step AI Chat Setup

This guide walks you through downloading Ollama, selecting the DeepSeek‑r1 model, installing it via a terminal command, and configuring ChatBoxAI for seamless local AI conversations, all with clear screenshots for each step.

AI deploymentChatbotDeepSeek

0 likes · 2 min read

Run DeepSeek Locally with Ollama: Step‑by‑Step AI Chat Setup

Cognitive Technology Team

Feb 3, 2025 · Artificial Intelligence

DeepSeek R1 Introduces Group‑Related Policy Optimization for Advanced Reasoning in Large Language Models

DeepSeek AI’s new open‑source model DeepSeek‑R1 leverages a novel Group‑Related Policy Optimization (GRPO) reinforcement‑learning framework and multi‑stage training to dramatically boost complex reasoning performance, achieving AIME 2024 Pass@1 scores comparable to OpenAI’s o1 model.

AIDeepSeekGRPO

0 likes · 4 min read

DeepSeek R1 Introduces Group‑Related Policy Optimization for Advanced Reasoning in Large Language Models

IT Services Circle

Feb 2, 2025 · Artificial Intelligence

OpenAI and Anthropic Accuse DeepSeek of Model Distillation and IP Infringement: Industry Reactions and Technical Overview

OpenAI and Anthropic allege that DeepSeek has illegally distilled their large language models, prompting investigations, industry satire, and a detailed look at model distillation technology, its legal implications, and the broader trends shaping AI cost, scaling laws, and market dynamics.

AI ethicsDeepSeekOpenAI

0 likes · 10 min read

OpenAI and Anthropic Accuse DeepSeek of Model Distillation and IP Infringement: Industry Reactions and Technical Overview

DataFunTalk

Feb 2, 2025 · Artificial Intelligence

DeepSeek Releases Janus‑Pro‑7B Multimodal Model, Beats DALL‑E 3 and Stable Diffusion on Benchmarks

DeepSeek's newly released Janus‑Pro‑7B multimodal model, open‑sourced overnight, outperforms DALL‑E 3 and Stable Diffusion on GenEval and DPG‑Bench, showcases a unified self‑regressive architecture with SigLIP‑L visual encoder, and has sparked massive user adoption and market reactions worldwide.

AI benchmarksDeepSeek

0 likes · 9 min read

DeepSeek Releases Janus‑Pro‑7B Multimodal Model, Beats DALL‑E 3 and Stable Diffusion on Benchmarks

Code Mala Tang

Feb 2, 2025 · Artificial Intelligence

How to Deploy DeepSeek AI Coding Assistant Locally: A Step‑by‑Step Guide

This guide walks you through the hardware and software prerequisites, Docker-based installation, environment configuration, model fine‑tuning, IDE integration, maintenance, and troubleshooting for running the DeepSeek AI programming assistant entirely on your own machine.

AI coding assistantDeepSeekDocker

0 likes · 12 min read

How to Deploy DeepSeek AI Coding Assistant Locally: A Step‑by‑Step Guide

Alibaba Cloud Big Data AI Platform

Feb 1, 2025 · Artificial Intelligence

Deploy DeepSeek-V3 and R1 Models with One-Click on Alibaba Cloud PAI Model Gallery

This article introduces Alibaba Cloud's PAI Model Gallery, detailing the DeepSeek-V3 and DeepSeek‑R1 large language models, their architectures and parameters, and provides a step‑by‑step guide for one‑click deployment of these models and their distilled variants using vLLM or BladeLLM.

AI inferenceAlibaba CloudDeepSeek

0 likes · 6 min read

Deploy DeepSeek-V3 and R1 Models with One-Click on Alibaba Cloud PAI Model Gallery

Code Mala Tang

Jan 31, 2025 · Artificial Intelligence

Master DeepSeek: 7 Prompt Engineering Tricks to Boost AI Responses

This guide presents seven practical prompt‑engineering techniques—clear goals, structured queries, domain terminology, concrete examples, scoped questions, step‑by‑step breakdowns, and multi‑turn interactions—to help users get more accurate and useful answers from DeepSeek.

AI promptsDeepSeekLanguage Model

0 likes · 6 min read

Master DeepSeek: 7 Prompt Engineering Tricks to Boost AI Responses

Alibaba Cloud Infrastructure

Jan 31, 2025 · Cloud Computing

How to Deploy DeepSeek‑R1 on Alibaba Cloud Compute Nest in Minutes

This guide walks you through deploying the open‑source DeepSeek‑R1 inference model on Alibaba Cloud's Compute Nest platform, covering service creation, instance configuration, login procedures, and API calls with sample curl commands for text generation and chat.

AI modelAlibaba CloudCompute Nest

0 likes · 4 min read

How to Deploy DeepSeek‑R1 on Alibaba Cloud Compute Nest in Minutes

Java Web Project

Jan 29, 2025 · Industry Insights

How DeepSeek’s Low‑Cost AI Model Is Redrawing the Compute Landscape and Salary Benchmarks

DeepSeek’s ability to deliver top‑tier model performance on modest hardware sparked a US‑stock flash crash, challenged the high‑GPU demand narrative, and revealed unusually high salary tiers for AI researchers, prompting a reassessment of compute economics and talent compensation in the industry.

AI computeDeepSeekMarket Trends

0 likes · 5 min read

How DeepSeek’s Low‑Cost AI Model Is Redrawing the Compute Landscape and Salary Benchmarks

Architect

Jan 29, 2025 · Artificial Intelligence

How Janus‑Pro Redefines Multimodal AI with Bigger Models and New Training Strategies

DeepSeek’s newly released Janus‑Pro series (1B and 7B) advances multimodal AI by decoupling visual understanding and generation, employing optimized three‑stage training, massive data expansion, and larger LLM backbones, achieving performance that matches or exceeds leading models such as Meta, Google, OpenAI, and Stability AI.

DeepSeekJanus-ProModel Scaling

0 likes · 6 min read

How Janus‑Pro Redefines Multimodal AI with Bigger Models and New Training Strategies

Su San Talks Tech

Jan 28, 2025 · Artificial Intelligence

How DeepSeek Overtook ChatGPT on the App Store: Low‑Cost AI Model Shakes the Industry

DeepSeek, a Chinese AI model, surged to the top of both China and US Apple App Store free‑app charts, outpacing ChatGPT and other major generative AI services, while boasting dramatically lower training costs and an open‑source approach that has sparked worldwide attention.

AI modelApp StoreChatGPT

0 likes · 4 min read

How DeepSeek Overtook ChatGPT on the App Store: Low‑Cost AI Model Shakes the Industry

Programmer DD

Jan 27, 2025 · Artificial Intelligence

Run DeepSeek‑R1 Locally with Ollama and Call It from Spring Boot

Learn how to deploy the open‑source DeepSeek‑R1 model using Ollama on Linux or macOS, configure various model sizes, and integrate it into a Spring Boot application with Spring AI to build an API‑driven translation service, complete with code examples and testing.

APIDeepSeekOllama

0 likes · 9 min read

Run DeepSeek‑R1 Locally with Ollama and Call It from Spring Boot

AI Code to Success

Jan 26, 2025 · Industry Insights

How DeepSeek‑R1 Is Challenging OpenAI’s o1 and Shaping the AI Landscape

DeepSeek‑R1 achieved a 1357‑point Arena score, ranking third overall and tying OpenAI o1 for first in StyleCtrl, while its open‑source MIT‑licensed release—including distilled variants—and low‑cost API service aim to democratize advanced AI inference for developers worldwide.

AI competitionArena benchmarkDeepSeek

0 likes · 5 min read

How DeepSeek‑R1 Is Challenging OpenAI’s o1 and Shaping the AI Landscape

DevOps

Jan 25, 2025 · Artificial Intelligence

DeepSeek R1: An Open‑Source Large Model Matching OpenAI’s o1 at a Fraction of the Cost

DeepSeek’s newly released R1 model delivers performance comparable to OpenAI’s o1 while cutting inference costs by 90‑95%, leveraging innovative MLA and MoE architectures, low‑cost hardware training, an open‑source strategy, and a youthful, flat‑structured team that challenges the AI industry’s high‑spending model.

AI startupCost‑Efficient TrainingDeepSeek

0 likes · 12 min read

DeepSeek R1: An Open‑Source Large Model Matching OpenAI’s o1 at a Fraction of the Cost

Alibaba Cloud Native

Jan 22, 2025 · Cloud Native

Seamlessly Migrate from OpenAI to DeepSeek with Higress AI Gateway

This guide explains how to install the Higress AI gateway, configure provider API keys, set up gray‑release routing between OpenAI and DeepSeek, use a Python client to call DeepSeek, and enable content security and observability features for safe, cost‑effective large‑model deployments.

AI gatewayContent SecurityDeepSeek

0 likes · 7 min read

Seamlessly Migrate from OpenAI to DeepSeek with Higress AI Gateway

Baobao Algorithm Notes

Jan 22, 2025 · Artificial Intelligence

Can RL‑Only Training Make LLMs Beat OpenAI‑o1? Inside DeepSeek‑R1’s Architecture and Results

DeepSeek‑R1’s open‑source series demonstrates that reinforcement‑learning‑only training can match top‑tier models like OpenAI‑o1, while a small amount of SFT further improves readability; the article dissects its technical report, training pipeline, reward design, distillation strategy, benchmark outcomes, and remaining challenges.

DeepSeekSupervised Fine‑Tuninglarge language model

0 likes · 11 min read

Can RL‑Only Training Make LLMs Beat OpenAI‑o1? Inside DeepSeek‑R1’s Architecture and Results

Java Architecture Diary

Jan 21, 2025 · Artificial Intelligence

Unlocking DeepSeek R1: How to Leverage the New Reasoning Model with Spring AI

This article introduces DeepSeek R1, a breakthrough reasoning‑focused large model that visualizes its chain‑of‑thought process, matches OpenAI O1 performance, offers open‑source advantages, and provides step‑by‑step Spring AI integration guidance, including dependency setup, configuration, and code examples.

AI integrationDeepSeekR1

0 likes · 9 min read

Unlocking DeepSeek R1: How to Leverage the New Reasoning Model with Spring AI

Baobao Algorithm Notes

Jan 15, 2025 · Artificial Intelligence

How Multi-Token Prediction Boosts LLM Training and Inference Efficiency

This article reviews the evolution of Multi‑Token Prediction (MTP) techniques—from early blockwise parallel decoding to Meta's and DeepSeek's implementations—explaining their architectures, training and inference workflows, and the speed‑up gains they offer for large language models.

DeepSeekInference AccelerationLLM

0 likes · 20 min read

How Multi-Token Prediction Boosts LLM Training and Inference Efficiency

Baobao Algorithm Notes

Jan 7, 2025 · Artificial Intelligence

How Efficient Is DeepSeek V3? Calculating Its MFU Around 37%

This article derives DeepSeek V3's training Model FLOPs Utilization (MFU) using publicly available data, showing an MFU of roughly 37%—about a 60% improvement over V2—and provides detailed formulas, parameter settings, and a reproducible Python script.

AI PerformanceDeepSeekMFU

0 likes · 8 min read

How Efficient Is DeepSeek V3? Calculating Its MFU Around 37%

ShiZhen AI

Jan 6, 2025 · Industry Insights

AI Daily Roundup: Altman's Singularity Hint, Microsoft’s $80B AI Investment, DeepSeek DeepThink, and BCI Breakthroughs

The article reviews Sam Altman's cryptic six‑word tweet about approaching the AI singularity, Microsoft’s $80 billion plan to expand AI data‑center infrastructure, DeepSeek’s DeepThink feature for step‑wise reasoning, and NeuroXess’s brain‑computer‑interface advances that let patients control AI and robots with thought.

AIDeepSeekMicrosoft

0 likes · 8 min read

AI Daily Roundup: Altman's Singularity Hint, Microsoft’s $80B AI Investment, DeepSeek DeepThink, and BCI Breakthroughs

ZhongAn Tech Team

Jan 5, 2025 · Artificial Intelligence

Weekly AI Roundup Issue 9: OpenAI Vision, LeCun Interview, ByteDance HLLM, and DeepSeek‑V3 Highlights

This issue presents a curated overview of recent AI developments, including Sam Altman's 2025 technology vision poll, LeCun's interview on future AI directions, ByteDance's hierarchical large language model for recommendation, and the performance and cost advantages of the open‑source DeepSeek‑V3 model.

AIByteDanceDeepSeek

0 likes · 10 min read

Weekly AI Roundup Issue 9: OpenAI Vision, LeCun Interview, ByteDance HLLM, and DeepSeek‑V3 Highlights

ZhongAn Tech Team

Dec 1, 2024 · Artificial Intelligence

AI Weekly Digest Issue 4: Market Insights, Industry Solutions, and Emerging Technologies

The fourth AI weekly newsletter reviews recent industry news—including Jensen Huang's robot era vision and Tesla's Optimus plans—introduces Claude's new style‑customization feature, explores AI‑enhanced input methods, and evaluates DeepSeek's R1‑Lite model performance on complex reasoning tasks.

AIAI applicationsClaude

0 likes · 10 min read

AI Weekly Digest Issue 4: Market Insights, Industry Solutions, and Emerging Technologies

CSS Magic

May 13, 2024 · Artificial Intelligence

DeepSeek: China’s New LLM Dark Horse – First Impressions and Shockingly Low Prices

The article evaluates DeepSeek v2, a 100‑billion‑parameter MoE model, highlighting its near‑GPT‑4 benchmark performance, OpenAI‑compatible API, 32k‑token context, exceptionally low pricing, a custom token‑utilization metric, and the practical drawbacks observed during hands‑on testing.

API compatibilityBenchmarkDeepSeek

0 likes · 9 min read

DeepSeek: China’s New LLM Dark Horse – First Impressions and Shockingly Low Prices

Baobao Algorithm Notes

May 9, 2024 · Artificial Intelligence

Inside Deepseek‑V2: How Multi‑Head Latent Attention Cuts KV‑Cache and Boosts Performance

This article provides an in‑depth technical analysis of Deepseek‑V2, covering its 236B parameter size, Multi‑Head Latent Attention optimization that reduces KV‑cache memory, architectural details, training pipelines, infrastructure choices, and performance results on benchmarks such as MMLU and instruction following.

AI ArchitectureDeepSeekModel Optimization

0 likes · 17 min read

Inside Deepseek‑V2: How Multi‑Head Latent Attention Cuts KV‑Cache and Boosts Performance