Tagged articles
5000 articles
Page 23 of 50
Architects' Tech Alliance
Architects' Tech Alliance
Mar 5, 2025 · Industry Insights

DeepSeek R1 & Kimi 1.5: Inside the Development of Near‑Strong Reasoning Models

The article analyzes DeepSeek's recent releases—V3 dialogue model and R1 inference model—detailing their launch dates, rapid popularity surge, R1's reinforcement‑learning‑based design for code and math tasks, and provides links to related Beijing University technical reports while stripping promotional sales content.

AIDeepSeekIndustry Analysis
0 likes · 3 min read
DeepSeek R1 & Kimi 1.5: Inside the Development of Near‑Strong Reasoning Models
Model Perspective
Model Perspective
Mar 5, 2025 · Artificial Intelligence

Can AI Really Crack NP‑Hard Problems? Inside the DeepSeek‑R1 Breakthrough

Researchers from Nanjing University of Aeronautics, Nanjing University of Technology and Oxford show that high‑instruction prompts dramatically boost large language models' mathematical reasoning, enabling DeepSeek‑R1 and Qwen2.5 to solve complex polynomial tasks and even produce a new counterexample to Hilbert's 17th problem.

AIDeepSeekMathematical Reasoning
0 likes · 6 min read
Can AI Really Crack NP‑Hard Problems? Inside the DeepSeek‑R1 Breakthrough
Tencent Cloud Developer
Tencent Cloud Developer
Mar 5, 2025 · Artificial Intelligence

DeepSeek Series Overview: Core Technologies, Model Innovations, and Product Highlights

The article delivers a PPT‑style deep dive into the DeepSeek series—from the original LLM through DeepSeek‑MoE, Math, V2, V3 and R1—highlighting core innovations such as Multi‑Head Latent Attention, fine‑grained MoE, GRPO reinforcement learning, Multi‑Token Prediction, DualPipe parallelism and FP8 training that together achieve high performance at a fraction of traditional costs, and notes their integration into Tencent’s OlaChat intelligent assistant.

AIDeepSeekFP8 training
0 likes · 21 min read
DeepSeek Series Overview: Core Technologies, Model Innovations, and Product Highlights
21CTO
21CTO
Mar 4, 2025 · Artificial Intelligence

Will AI Replace Developers? Emerging Roles for Software Engineers

The article examines how generative AI will automate many coding tasks yet create new opportunities for software engineers, emphasizing the need for human supervision, ethical oversight, and advanced roles such as AI integration, system architecture, and cybersecurity in the evolving tech landscape.

AIdeveloper rolesfuture of work
0 likes · 6 min read
Will AI Replace Developers? Emerging Roles for Software Engineers
JD Retail Technology
JD Retail Technology
Mar 4, 2025 · Artificial Intelligence

JD Retail End-to-End AI Engine Compatible with GPU and Domestic NPU: Architecture, Optimization, and Applications

JD Retail’s Nine‑Number Algorithm Platform delivers an end‑to‑end AI engine that unifies GPU and domestic NPU resources across a thousand‑card cluster, offering zero‑cost model migration, optimized training and inference pipelines, support for over 40 LLM and multimodal models, and proven business‑level performance that reduces dependence on overseas chips.

AIDistributed TrainingGPU
0 likes · 19 min read
JD Retail End-to-End AI Engine Compatible with GPU and Domestic NPU: Architecture, Optimization, and Applications
58UXD
58UXD
Mar 4, 2025 · Artificial Intelligence

Can DeepSeek AI Turn User Complaints into Actionable Design Solutions?

This article explores how DeepSeek AI was fed real negative user feedback from a 58.com B‑side posting page, compares its design recommendations with those of a professional designer, and evaluates the strengths and limitations of AI‑generated UX suggestions.

AIUX designcase study
0 likes · 4 min read
Can DeepSeek AI Turn User Complaints into Actionable Design Solutions?
AIWalker
AIWalker
Mar 3, 2025 · Artificial Intelligence

ByteDance’s Diffusion Restoration Adapter Achieves State‑of‑the‑Art Real‑World Image Recovery

This paper introduces a lightweight Diffusion Restoration Adapter that integrates into pre‑trained diffusion priors such as StableDiffusion XL and StableDiffusion 3, dramatically reduces parameter overhead compared with ControNet, and delivers superior quantitative and visual results on real‑world image restoration benchmarks through a novel sampling strategy.

AIAdapterDiffusion Models
0 likes · 17 min read
ByteDance’s Diffusion Restoration Adapter Achieves State‑of‑the‑Art Real‑World Image Recovery
Code Mala Tang
Code Mala Tang
Mar 3, 2025 · Artificial Intelligence

Unlock AI’s Full Potential with Structured Prompt Decorators

Prompt Decorators are structured prefixes that standardize and enhance AI responses, addressing common challenges like vague prompts, inconsistent answers, and lack of reasoning by guiding the model to produce clear, logical, and well‑organized outputs across various use cases.

AILLMPrompt engineering
0 likes · 23 min read
Unlock AI’s Full Potential with Structured Prompt Decorators
Baidu Intelligent Cloud Tech Hub
Baidu Intelligent Cloud Tech Hub
Mar 3, 2025 · Cloud Computing

How Baidu Cloud Optimizes GPU Servers for AI Workloads

This article explains the design and implementation of GPU cloud servers, covering data processing pipelines, hardware selection, topology, interconnect technologies, virtualization, multi‑GPU communication methods, and Baidu's practical solutions for both virtualized and bare‑metal instances to boost AI inference and training performance.

AIGPUNVLink
0 likes · 29 min read
How Baidu Cloud Optimizes GPU Servers for AI Workloads
JD Tech Talk
JD Tech Talk
Mar 3, 2025 · Artificial Intelligence

AI Engine Technology Based on Domestic Chips for JD Retail

This article describes JD Retail's AI engine built on domestic NPU chips, covering challenges, heterogeneous GPU‑NPU scheduling, high‑performance training and inference engines, extensive model support, real‑world deployment cases, and future plans for large‑scale chip clusters and ecosystem development.

AIDistributed TrainingGPU
0 likes · 20 min read
AI Engine Technology Based on Domestic Chips for JD Retail
JD Cloud Developers
JD Cloud Developers
Mar 3, 2025 · Artificial Intelligence

How JD.com Leverages Domestic NPU Chips to Power Large‑Scale AI Models

This article details JD.com's challenges and solutions for deploying domestic NPU chips across heterogeneous GPU‑NPU clusters, covering architecture, scheduling, high‑performance training and inference engines, real‑world case studies, and future plans to scale AI workloads securely and efficiently.

AIDomestic ChipsInference
0 likes · 19 min read
How JD.com Leverages Domestic NPU Chips to Power Large‑Scale AI Models
DataFunTalk
DataFunTalk
Mar 3, 2025 · Artificial Intelligence

FlightVGM: FPGA-Accelerated Inference for Video Generation Models Wins Best Paper at FPGA 2025

The FlightVGM paper, awarded Best Paper at FPGA 2025, details a novel FPGA-based inference IP for video generation models that leverages time‑space activation sparsity, mixed‑precision DSP58 extensions, and adaptive scheduling to achieve up to 1.30× performance and 4.49× energy‑efficiency gains over a NVIDIA 3090 GPU while preserving model accuracy.

AIFPGAHardware acceleration
0 likes · 11 min read
FlightVGM: FPGA-Accelerated Inference for Video Generation Models Wins Best Paper at FPGA 2025
大转转FE
大转转FE
Mar 3, 2025 · Frontend Development

Zhuanzhuan Frontend Weekly – Curated Technical Articles

This issue of Zhuanzhuan Frontend Weekly curates five insightful technical articles covering React UI paradigm shifts, a Rust beginner’s journey to production, performance improvements in a mini‑program simulator, integration of the Qwen‑2.5‑VL model with Midscene.js, and Didi’s experience in managing technical debt for internationalization.

AIReactRust
0 likes · 5 min read
Zhuanzhuan Frontend Weekly – Curated Technical Articles
Java Architecture Diary
Java Architecture Diary
Mar 3, 2025 · Frontend Development

Boost Real-Time AI Streams in the Browser with fetch-event-source

This article explains how Server‑Sent Events (SSE) work, outlines the limitations of the native EventSource API, and demonstrates how the fetch‑event‑source library enhances SSE with POST support, custom headers, retry strategies, and visibility handling, enabling efficient real‑time AI data streams in modern web front‑ends.

AIJavaScriptReal-time Streaming
0 likes · 6 min read
Boost Real-Time AI Streams in the Browser with fetch-event-source
Big Data Technology & Architecture
Big Data Technology & Architecture
Mar 3, 2025 · Big Data

The Turning Point for Data Development: From Traditional Data Engineering to AI Data Engineering

The article analyzes how the rapid rise of open‑source large‑model AI in 2025 is reshaping the data development profession, urging developers to transition from specialized data‑engineer roles to full‑stack AI data engineering skills such as distributed computing, lake‑house architectures, and model tuning.

AIBig DataFlink
0 likes · 7 min read
The Turning Point for Data Development: From Traditional Data Engineering to AI Data Engineering
Data Thinking Notes
Data Thinking Notes
Mar 2, 2025 · Artificial Intelligence

How DeepSeek’s Open‑Source Week Accelerates AI with Cutting‑Edge GPU and Storage Innovations

During DeepSeek’s Open‑Source Week (Feb 24‑28), five production‑tested projects were released, spanning GPU‑optimized MLA kernels, MoE communication libraries, high‑performance FP8 GEMM, dual‑pipeline parallelism, and a AI‑focused distributed file system, each delivering significant performance and efficiency gains for large‑scale AI workloads.

AIDistributed TrainingGPU Optimization
0 likes · 13 min read
How DeepSeek’s Open‑Source Week Accelerates AI with Cutting‑Edge GPU and Storage Innovations
AI Algorithm Path
AI Algorithm Path
Mar 2, 2025 · Artificial Intelligence

Exploring Flux Labs AI’s New Virtual Try‑On Feature

The article reviews Flux Labs AI’s newly added virtual try‑on tool, explaining how AI, machine‑learning and computer‑vision enable seamless clothing overlays, outlining its main applications, providing a step‑by‑step usage guide, detailing pricing plans, and sharing the author’s positive performance impressions.

AIFlux LabsImage Generation
0 likes · 5 min read
Exploring Flux Labs AI’s New Virtual Try‑On Feature
DataFunTalk
DataFunTalk
Mar 2, 2025 · Artificial Intelligence

Implementing GRPO from Scratch with Distributed Reinforcement Learning on Qwen2.5-1.5B-Instruct

This tutorial explains how to build a distributed reinforcement‑learning pipeline using the GRPO algorithm, covering data preparation, evaluation and reward functions, multi‑GPU DataParallel implementation, and full fine‑tuning of the Qwen2.5‑1.5B‑Instruct model with PyTorch, FlashAttention2 and Weights & Biases.

AIDistributed TrainingGRPO
0 likes · 10 min read
Implementing GRPO from Scratch with Distributed Reinforcement Learning on Qwen2.5-1.5B-Instruct
DataFunTalk
DataFunTalk
Mar 2, 2025 · Artificial Intelligence

Top 10 AI Research Papers of 2024: Summaries, Contributions, and Practical Uses

This article presents a curated selection of ten groundbreaking 2024 AI research papers, detailing each model’s abstract, key contributions, and practical application scenarios across computer vision, multimodal learning, NLP, and efficient inference, offering readers inspiration and actionable insights for real‑world projects.

2024 researchAINLP
0 likes · 18 min read
Top 10 AI Research Papers of 2024: Summaries, Contributions, and Practical Uses
JD Retail Technology
JD Retail Technology
Mar 1, 2025 · Industry Insights

How JD Retail’s AI Assistant Uses Multimodal LLMs to Boost E‑Commerce

JD Retail’s AI assistant combines a Master‑Sub agent framework, ReAct paradigm, multimodal integration and MoE architecture to improve sales forecasting, pricing, and recommendation accuracy, while the team’s collaborative culture and open talent pathways illustrate how cutting‑edge AI is applied in real‑world e‑commerce.

AIJD RetailLLM
0 likes · 8 min read
How JD Retail’s AI Assistant Uses Multimodal LLMs to Boost E‑Commerce
ITPUB
ITPUB
Mar 1, 2025 · Artificial Intelligence

Can DeepSeek AI Replace Your DBA? Real-World Database Scenarios Tested

This article examines DeepSeek, a Chinese AGI‑focused AI model, explains prompt‑engineering techniques, and evaluates its performance across database architecture, development, and operations tasks through concrete Q&A examples, SQL plan analysis, and shell‑script generation, while also discussing its broader impact on professionals, vendors and enterprises.

AIDeepSeekPrompt engineering
0 likes · 10 min read
Can DeepSeek AI Replace Your DBA? Real-World Database Scenarios Tested
IT Architects Alliance
IT Architects Alliance
Feb 28, 2025 · Industry Insights

How AIGC Is Redefining Full‑Stack Development in 2025

In 2025, AIGC technology is transforming every stage of full‑stack development—from precise AI‑driven requirement analysis and automated UI design to code generation and intelligent testing—while also raising technical, ethical, and talent challenges that developers must address.

AIAIGCFull-Stack Development
0 likes · 22 min read
How AIGC Is Redefining Full‑Stack Development in 2025
Code Mala Tang
Code Mala Tang
Feb 28, 2025 · Fundamentals

Why AI Code Generation Needs Test‑Driven Development: Avoid Hidden Bugs

This article explains how AI‑generated code can be fast but unreliable, and demonstrates how applying Test‑Driven Development (TDD) with concrete Python examples catches errors like stack overflows, edge‑case failures, and security issues, ensuring robust, maintainable software.

AIPythonSoftware Testing
0 likes · 13 min read
Why AI Code Generation Needs Test‑Driven Development: Avoid Hidden Bugs
AI Product Manager Community
AI Product Manager Community
Feb 28, 2025 · Artificial Intelligence

What’s Inside DeepSeek’s Open‑Source Week? DualPipe, EPLB, 3FS and More Explained

DeepSeek’s recent Open‑Source Week unveiled a suite of AI‑focused tools—including the DualPipe pipeline parallelism algorithm, the EPLB expert load balancer, detailed training‑inference framework data, the high‑performance 3FS parallel file system, and the Smallpond data‑processing framework—each with GitHub links and performance highlights.

AIDistributed Trainingfile system
0 likes · 7 min read
What’s Inside DeepSeek’s Open‑Source Week? DualPipe, EPLB, 3FS and More Explained
AI Large Model Application Practice
AI Large Model Application Practice
Feb 28, 2025 · Artificial Intelligence

How Self-Attention Powers LLMs: A Step‑by‑Step Deep Dive

This article explains the self‑attention mechanism behind large language models, detailing why static word importance fails, how queries, keys, and values are generated, how attention scores are computed, scaled, softmaxed, and used to produce context‑aware word vectors, while noting computational costs.

AILLMSelf-Attention
0 likes · 9 min read
How Self-Attention Powers LLMs: A Step‑by‑Step Deep Dive
Java Tech Enthusiast
Java Tech Enthusiast
Feb 27, 2025 · Artificial Intelligence

Navigating the AI Era: Insights for Senior Engineers and R&D Leaders

A senior technical leader, reflecting on twelve years at a large tech firm, warns that while AI can triple a junior’s output in tasks like refactoring, it cannot replace deep business insight, strategic decision‑making, or mentorship, and urges engineers to treat AI as a helper, focus on high‑level architecture, and expand horizontally into business domains to stay indispensable.

AICareer DevelopmentSoftware Architecture
0 likes · 5 min read
Navigating the AI Era: Insights for Senior Engineers and R&D Leaders
JavaEdge
JavaEdge
Feb 27, 2025 · Artificial Intelligence

How to Quickly Build a DeepSeek‑Powered Knowledge Base on Tencent Cloud

This guide walks through deploying the full‑feature DeepSeek V3+R1 model on Tencent Cloud, configuring a smart knowledge‑base application, importing documentation, enabling internet search, tuning retrieval parameters, and publishing the app for public use, all without writing code.

AIDeepSeekKnowledge Base
0 likes · 6 min read
How to Quickly Build a DeepSeek‑Powered Knowledge Base on Tencent Cloud
Python Programming Learning Circle
Python Programming Learning Circle
Feb 26, 2025 · Artificial Intelligence

Key Python 3.13 Features Boosting Machine Learning and AI Performance

Python 3.13 introduces experimental free‑threading, a JIT compiler, enhanced type‑system utilities, asyncio improvements, and standard‑library updates that together aim to reduce the Global Interpreter Lock bottleneck, accelerate compute‑intensive workloads, and simplify deployment of AI and ML applications across diverse platforms.

AIJITML
0 likes · 25 min read
Key Python 3.13 Features Boosting Machine Learning and AI Performance
58UXD
58UXD
Feb 26, 2025 · Artificial Intelligence

How AI Tools Like Deepseek Transform Design Workflow

This article shows designers how to combine AI services such as Deepseek, JiMeng, Tripo, Tongyi and Jianying to accelerate 3D modeling, PPT creation and short‑video production, turning lengthy manual tasks into fast, creative processes.

3D modelingAIDeepSeek
0 likes · 5 min read
How AI Tools Like Deepseek Transform Design Workflow
Architecture Digest
Architecture Digest
Feb 26, 2025 · Artificial Intelligence

DeepSeek4j 1.4: A Java Integration Framework for DeepSeek AI Models

DeepSeek4j 1.4 introduces a Java‑native framework that fully preserves DeepSeek's chain‑of‑thought and billing features, adds reactive streaming support, and provides a Spring Boot starter for effortless integration, accompanied by quick‑start code, configuration examples, and a built‑in debugging UI.

AIAPIDeepSeek
0 likes · 5 min read
DeepSeek4j 1.4: A Java Integration Framework for DeepSeek AI Models
macrozheng
macrozheng
Feb 26, 2025 · Databases

Boost Your SQL Workflow with Chat2DB’s AI‑Powered Database Management

This article introduces Chat2DB, an AI‑enhanced SQL client and reporting tool, walks through its key features, Docker‑based installation, practical usage with a SpringBoot‑Vue e‑commerce project, and demonstrates how its built‑in AI can generate SQL queries automatically.

AIChat2DBDatabase Management
0 likes · 4 min read
Boost Your SQL Workflow with Chat2DB’s AI‑Powered Database Management
Model Perspective
Model Perspective
Feb 26, 2025 · Artificial Intelligence

How Do Large Language Models Compress Massive Data? Limits and Techniques

This article explains how large language models act like a super‑library by compressing vast amounts of text using information‑theoretic concepts, probability‑based coding, autoregressive neural networks, and arithmetic coding, while discussing accuracy, compression ratios, and theoretical limits.

AIarithmetic codingautoregressive networks
0 likes · 8 min read
How Do Large Language Models Compress Massive Data? Limits and Techniques
21CTO
21CTO
Feb 25, 2025 · Artificial Intelligence

How Alibaba’s Qwen 2.5‑Max Challenges GPT‑4o and Redefines China’s AI Race

Chinese tech giants Huawei and Alibaba respond to President Xi’s call for stronger innovation, with Huawei showcasing its HarmonyOS and server‑grade Arm processor while Alibaba unveils the Qwen 2.5‑Max large language model that outperforms leading Western AI systems on multiple benchmarks, highlighting China’s accelerating AI ambitions.

AIAlibabaChina
0 likes · 5 min read
How Alibaba’s Qwen 2.5‑Max Challenges GPT‑4o and Redefines China’s AI Race
DataFunSummit
DataFunSummit
Feb 25, 2025 · Artificial Intelligence

Collecting High-Quality LLM Training Data and Custom Model Training Guide

This article explains what constitutes high‑quality LLM training data, why large datasets are essential, outlines the step‑by‑step process for collecting, preprocessing, and fine‑tuning models, and highlights the best data sources—including web content, books, code repositories, and news—while noting available free datasets.

AILLMWeb Scraping
0 likes · 9 min read
Collecting High-Quality LLM Training Data and Custom Model Training Guide
AsiaInfo Technology: New Tech Exploration
AsiaInfo Technology: New Tech Exploration
Feb 24, 2025 · Artificial Intelligence

Can Multi‑Teacher Distillation Overcome Catastrophic Forgetting in Continual Learning?

This paper proposes a multi‑teacher distillation framework for continual learning that combines active data rehearsal with feature‑decoupled distillation, demonstrating superior performance on PASCAL VOC and COCO benchmarks while mitigating catastrophic forgetting and balancing stability‑plasticity trade‑offs.

AICatastrophic Forgettingactive rehearsal
0 likes · 12 min read
Can Multi‑Teacher Distillation Overcome Catastrophic Forgetting in Continual Learning?
Java Web Project
Java Web Project
Feb 23, 2025 · Artificial Intelligence

Build Your First AI Chatbot with Spring Boot and DeepSeek LLM

This guide walks you through creating a Spring Boot project, configuring DeepSeek's large language model via SiliconFlow, setting up OpenAI‑compatible parameters, and implementing a REST controller that returns weather forecasts using the model, complete with step‑by‑step code snippets, configuration files, and deployment instructions.

AIChatbotDeepSeek
0 likes · 7 min read
Build Your First AI Chatbot with Spring Boot and DeepSeek LLM
DataFunTalk
DataFunTalk
Feb 23, 2025 · Artificial Intelligence

Insights from Snowflake CEO Sridhar Ramaswamy on AI Competition, Business Strategy, and Leadership

In this extensive interview, Snowflake CEO Sridhar Ramaswamy shares his perspectives on the AI arms race, the sustainable value of data platforms, competition with rivals like Databricks and DeepSeek, the challenges of scaling a public company, and personal leadership lessons drawn from his career and family life.

AIArtificial IntelligenceBusiness strategy
0 likes · 35 min read
Insights from Snowflake CEO Sridhar Ramaswamy on AI Competition, Business Strategy, and Leadership
ZhongAn Tech Team
ZhongAn Tech Team
Feb 22, 2025 · Artificial Intelligence

How SkyReels, DeepSeek NSA, Grok‑3, and KG²RAG Are Shaping the Next AI Wave

This issue reviews China's first open‑source short‑film model SkyReels‑V1, DeepSeek's Native Sparse Attention breakthrough, xAI's massive Grok‑3 deployment on 200k H100 GPUs, and a knowledge‑graph‑guided RAG framework, highlighting their performance gains, architectural innovations, and industry impact.

AIRAGindustry trends
0 likes · 15 min read
How SkyReels, DeepSeek NSA, Grok‑3, and KG²RAG Are Shaping the Next AI Wave
Java Tech Enthusiast
Java Tech Enthusiast
Feb 22, 2025 · Artificial Intelligence

Grok‑3 Evaluation Controversy and Community Reactions

Three days after Grok‑3’s launch, OpenAI was accused of inflating its benchmark scores by using a “cons@64” method that aggregates 64 answers, a practice critics say unfairly skews comparisons with single‑shot models like o3‑mini, while developers have already begun experimenting with the model in simple games.

AIGrok-3Model Evaluation
0 likes · 5 min read
Grok‑3 Evaluation Controversy and Community Reactions
21CTO
21CTO
Feb 22, 2025 · Artificial Intelligence

Are AI Coding Assistants Undermining Deep Learning for Developers?

The article argues that while AI tools like Copilot and GPT speed up simple coding tasks, they risk eroding developers' fundamental understanding and critical thinking, citing research that frequent AI use correlates with weaker cognitive skills and urging a balanced, verification‑first approach.

AISoftware Developmentcoding assistants
0 likes · 6 min read
Are AI Coding Assistants Undermining Deep Learning for Developers?
Architecture and Beyond
Architecture and Beyond
Feb 22, 2025 · Artificial Intelligence

Understanding Retrieval‑Augmented Generation (RAG) and Its Role in Enhancing Large Language Models

The article explains how the inherent knowledge‑staleness, hallucination, lack of private data, non‑traceable output, limited long‑text handling, and data‑security concerns of large language models can be mitigated by Retrieval‑Augmented Generation, which combines external retrieval, augmentation, and generation to provide up‑to‑date, reliable, and secure AI responses.

AIKnowledge augmentationLLM
0 likes · 15 min read
Understanding Retrieval‑Augmented Generation (RAG) and Its Role in Enhancing Large Language Models
Infra Learning Club
Infra Learning Club
Feb 21, 2025 · Artificial Intelligence

5 Must‑Try Open‑Source AI Projects You Can Start Using Today

This article introduces five open‑source AI tools—a PPT generator, an LLM app development platform, a cloud‑agnostic AI runner, a curated collection of LLM applications, and a one‑click HD video creator—detailing their key features, usage links, and sample configurations.

AIDifyLLM
0 likes · 8 min read
5 Must‑Try Open‑Source AI Projects You Can Start Using Today
Top Architect
Top Architect
Feb 21, 2025 · Artificial Intelligence

DeepSeek4j 1.4: Java Integration Framework for DeepSeek with Full Chain‑of‑Thought and Streaming Support

The article introduces DeepSeek4j 1.4, a Java‑based framework that overcomes Spring AI’s limitations by fully preserving DeepSeek’s chain‑of‑thought and billing features, adding reactive streaming, providing Spring Boot starter integration, and offering quick‑start code samples and configuration guidance.

AIChain-of-ThoughtDeepSeek
0 likes · 8 min read
DeepSeek4j 1.4: Java Integration Framework for DeepSeek with Full Chain‑of‑Thought and Streaming Support
Ma Wei Says
Ma Wei Says
Feb 21, 2025 · Artificial Intelligence

How PIKE‑RAG Boosts Retrieval‑Augmented Generation for Industrial AI

PIKE‑RAG, a Retrieval‑Augmented Generation framework from Microsoft Research, tackles knowledge source diversity, one‑size‑fits‑all limitations, and LLMs' lack of domain expertise by building multi‑layer heterogeneous graphs, task‑driven modular pipelines, and a staged L0‑L4 system for more accurate industrial AI responses.

AIKnowledgeGraphLLM
0 likes · 5 min read
How PIKE‑RAG Boosts Retrieval‑Augmented Generation for Industrial AI
AI Algorithm Path
AI Algorithm Path
Feb 20, 2025 · Artificial Intelligence

What Is Perplexity in Large Language Models?

The article explains perplexity as a metric for evaluating large language models, walks through a step‑by‑step probability calculation for a sample sentence, shows how to normalize by sentence length using the geometric mean, and demonstrates that lower perplexity indicates a more accurate and less uncertain model.

AIEvaluationLanguage Model
0 likes · 6 min read
What Is Perplexity in Large Language Models?
Top Architect
Top Architect
Feb 20, 2025 · Artificial Intelligence

Deploying DeepSeek R1 671B Model Locally with Ollama and Dynamic Quantization

This guide explains how to download, quantize, and run the full‑size 671‑billion‑parameter DeepSeek R1 model on local hardware using Ollama, covering model selection, hardware requirements, step‑by‑step deployment commands, optional web UI setup, performance observations, and practical recommendations.

AIDeepSeekDynamic Quantization
0 likes · 16 min read
Deploying DeepSeek R1 671B Model Locally with Ollama and Dynamic Quantization
Practical DevOps Architecture
Practical DevOps Architecture
Feb 20, 2025 · Artificial Intelligence

Training MiniDeepSeek V3+R1 from Scratch: Full-Scale Large Model Technical Practice for 2025

This tutorial series provides a step‑by‑step technical guide to training, deploying, and fine‑tuning the MiniDeepSeek V3+R1 large language model, covering model performance, open‑source details, API usage, parameter explanation, multi‑turn chatbot construction, function calling, integration with Open WebUI, GraphRAG, Swarm, and various deployment and optimization techniques.

AIMiniDeepSeekTraining
0 likes · 4 min read
Training MiniDeepSeek V3+R1 from Scratch: Full-Scale Large Model Technical Practice for 2025
Architecture Breakthrough
Architecture Breakthrough
Feb 20, 2025 · Artificial Intelligence

Can AI Really Replace You? Deepseek vs ChatGPT and How to Stay Ahead

The article analyzes Deepseek’s rapid rise, compares its strengths and limitations to ChatGPT, examines AI’s fundamental weaknesses, and offers practical strategies for individuals to build a “professional + AI” skill set that keeps them indispensable in the evolving AI landscape.

AIArtificial IntelligenceCareer Development
0 likes · 8 min read
Can AI Really Replace You? Deepseek vs ChatGPT and How to Stay Ahead
Data Thinking Notes
Data Thinking Notes
Feb 19, 2025 · Artificial Intelligence

DeepSeek Evolution: Key Technical Highlights from V1 to R1

This article examines DeepSeek’s various versions, detailing their core modules, underlying principles, architecture diagrams, and performance metrics, while illustrating the internal logic and advantages of each model to guide enthusiasts, professionals, and practitioners toward deeper AI innovation insights.

AIDeepSeekModel architecture
0 likes · 4 min read
DeepSeek Evolution: Key Technical Highlights from V1 to R1
Java Tech Enthusiast
Java Tech Enthusiast
Feb 19, 2025 · Artificial Intelligence

xAI's Grok 3 Model: Benchmarks, Reasoning, and Industry Reactions

Elon Musk’s xAI introduced the Grok 3 family—trained on roughly 200,000 GPUs and offered in standard, mini and Reasoning versions—that claims top‑slot performance on math, science and coding benchmarks, outpacing Google Gemini, DeepSeek V3, Claude and OpenAI GPT‑4o, while pricing starts at $30 per month and drawing both praise for its speed and criticism for lingering hallucinations and ethical sensitivities.

AIDeepSearchGrok3
0 likes · 16 min read
xAI's Grok 3 Model: Benchmarks, Reasoning, and Industry Reactions
Architect
Architect
Feb 18, 2025 · Artificial Intelligence

DeepSeek‑R1: Training Innovations and Architecture for High‑Performance Reasoning LLMs

The article explains how DeepSeek‑R1 advances large language model reasoning by releasing a lightweight distilled version, sharing a complete training pipeline—including pre‑training, supervised fine‑tuning, and reinforcement learning—introducing long‑chain reasoning data, a transitional inference model, and a comprehensive RL optimization that together yield strong mathematical and logical capabilities.

AIDeepSeekModel Training
0 likes · 10 min read
DeepSeek‑R1: Training Innovations and Architecture for High‑Performance Reasoning LLMs
DevOps Cloud Academy
DevOps Cloud Academy
Feb 18, 2025 · Operations

How AI Is Transforming DevOps: 10 Key Benefits

AI is reshaping DevOps by enhancing automation, enabling predictive analytics, optimizing CI/CD pipelines, managing resources intelligently, strengthening security, accelerating incident response, driving data-driven decisions, scaling infrastructure, fostering collaboration, and promoting continuous learning, thereby boosting flexibility, scalability, and reliability of software delivery.

AIDevOpsResource Management
0 likes · 8 min read
How AI Is Transforming DevOps: 10 Key Benefits
Architects' Tech Alliance
Architects' Tech Alliance
Feb 18, 2025 · Industry Insights

How DeepSeek V3 Is Driving a New Wave of Communication‑Hardware Demand

DeepSeek V3 cuts training to 2.788 M H800 GPU‑hours with FP8 mixed‑precision and a fully optimized framework, slashes token costs by 96% versus ChatGPT O1, and its efficient inference and model‑compression techniques are reshaping AI‑agent development, spurring demand for low‑latency, high‑bandwidth optical modules and edge‑computing infrastructure.

AICommunication IndustryDeepSeek
0 likes · 5 min read
How DeepSeek V3 Is Driving a New Wave of Communication‑Hardware Demand
Bilibili Tech
Bilibili Tech
Feb 18, 2025 · Artificial Intelligence

Algorithmic Empowerment of Bilibili Streaming: VOD Transcoding Decision, Resource Estimation, and Live Comment Semantic Analysis

The article details how Bilibili leverages AI algorithms—including XGBoost, statistical rules, XDeepFM, and fine‑tuned SBERT—to optimize VOD transcoding decisions, estimate compute resources and processing time, and analyze live comments, thereby boosting streaming efficiency, utilization, and user experience.

AITranscoding OptimizationXGBoost
0 likes · 19 min read
Algorithmic Empowerment of Bilibili Streaming: VOD Transcoding Decision, Resource Estimation, and Live Comment Semantic Analysis
JD Retail Technology
JD Retail Technology
Feb 18, 2025 · Artificial Intelligence

Engineering Practices of JD Advertising Agent: JDZunTong Intelligent Assistant

JD’s advertising R&D team created the JDZunTong Intelligent Assistant by engineering a modular Agent platform that combines advanced Retrieval‑Augmented Generation (RAG 1.0 → 2.0) and Function‑Call capabilities, a visual designer, custom tool registration, and a native Python workflow engine to deliver intelligent customer service, data queries, and ad creation for merchants.

AIAgentJD Advertising
0 likes · 18 min read
Engineering Practices of JD Advertising Agent: JDZunTong Intelligent Assistant
Architecture & Thinking
Architecture & Thinking
Feb 18, 2025 · Artificial Intelligence

Why Is DeepSeek Server Overloaded? Causes and Practical Workarounds

The article investigates why DeepSeek frequently returns a “server busy” message, analyzing factors such as sudden traffic spikes, compute and bandwidth limitations, security attacks, and maintenance policies, and then offers actionable solutions including query optimization, off‑peak usage, third‑party cloud platforms, and local deployment.

AIDeepSeekModel Deployment
0 likes · 10 min read
Why Is DeepSeek Server Overloaded? Causes and Practical Workarounds
DevOps Cloud Academy
DevOps Cloud Academy
Feb 17, 2025 · Operations

Top 10 AI Tools Transforming DevOps Engineering

This article reviews ten AI‑powered tools—including Jenkins, Ansible, Puppet, Dynatrace, Splunk, GitHub Copilot, New Relic, Azure DevOps, Prometheus, and Chef—that enhance DevOps workflows through predictive analytics, automated rollback, intelligent monitoring, and code assistance, helping teams achieve faster, more reliable software delivery.

AIDevOpsTooling
0 likes · 14 min read
Top 10 AI Tools Transforming DevOps Engineering
DeWu Technology
DeWu Technology
Feb 17, 2025 · Artificial Intelligence

Optimizing Large Model Inference: High‑Performance Frameworks and Techniques

The article reviews high‑performance inference strategies for large language models such as Deepseek‑R1, detailing CPU‑GPU process separation, Paged and Radix Attention, Chunked Prefill, output‑length reduction, tensor‑parallel multi‑GPU scaling, and speculative decoding, each shown to markedly boost throughput and cut latency in real deployments.

AIDistributed inferenceGPU Acceleration
0 likes · 22 min read
Optimizing Large Model Inference: High‑Performance Frameworks and Techniques
Tencent Technical Engineering
Tencent Technical Engineering
Feb 17, 2025 · Artificial Intelligence

Prompt Engineering: Definitions, Frameworks, Principles, and Advanced Techniques

The guide defines prompts as structured queries that unlock large‑language‑model abilities, outlines five core frameworks (RTF, Chain‑of‑Thought, RISEN, RODES, Density‑Chain), presents two key principles—clear, delimited instructions and explicit reasoning steps—to reduce hallucinations, and surveys advanced techniques such as zero‑shot, few‑shot, RAG, Tree‑of‑Thought and automatic prompt engineering.

AIChain-of-ThoughtRetrieval Augmented Generation
0 likes · 29 min read
Prompt Engineering: Definitions, Frameworks, Principles, and Advanced Techniques
macrozheng
macrozheng
Feb 17, 2025 · Artificial Intelligence

Unlock DeepSeek4j 1.4: Build a Private AI Knowledge Base with Spring Boot

This guide explains why DeepSeek4j is needed, its core features, and provides step‑by‑step instructions—including dependency setup, configuration, code examples, and a complete RAG pipeline using Milvus—to help developers quickly create a private AI knowledge base with Spring Boot.

AIDeepSeek4jMilvus
0 likes · 12 min read
Unlock DeepSeek4j 1.4: Build a Private AI Knowledge Base with Spring Boot
AI Product Manager Community
AI Product Manager Community
Feb 17, 2025 · Product Management

How AI Can Transform Your Product Roadmap into a Real‑Time Strategic Tool

In today’s fast‑changing market, traditional product planning falls short, so this article explains how AI‑powered data integration, predictive analytics, and dynamic feedback loops can create a real‑time, data‑driven product roadmap, detailing three implementation phases—data unification, intelligent analysis, and continuous adjustment—with practical steps for product managers.

AIData IntegrationRoadmap
0 likes · 8 min read
How AI Can Transform Your Product Roadmap into a Real‑Time Strategic Tool
Ops Development & AI Practice
Ops Development & AI Practice
Feb 15, 2025 · Artificial Intelligence

How to Efficiently Fine‑Tune Llama 3 on a Free Colab T4 GPU with Unsloth

This article provides a step‑by‑step, code‑rich tutorial for fine‑tuning the open‑source Llama 3 1B and 3B models on Google Colab using the Unsloth library and LoRA, covering environment setup, model loading, adapter insertion, dataset preparation, training configuration, inference, and model saving, all while keeping GPU memory usage low.

AIColabFine-tuning
0 likes · 13 min read
How to Efficiently Fine‑Tune Llama 3 on a Free Colab T4 GPU with Unsloth
dbaplus Community
dbaplus Community
Feb 14, 2025 · Databases

How AI Tools Are Transforming the Role of Database Administrators

The article argues that, despite common fears, AI and modern management tools like OEM and DB Console empower DBAs to work more efficiently, improve performance, and stay relevant, while highlighting real-world stories of tool adoption and the challenges of AI hallucinations.

AIDBAperformance optimization
0 likes · 7 min read
How AI Tools Are Transforming the Role of Database Administrators
Architect's Alchemy Furnace
Architect's Alchemy Furnace
Feb 14, 2025 · Artificial Intelligence

AI-Driven Power Trading: Key Technologies, Architecture, and Future Trends

This article examines how artificial intelligence transforms power trading platforms by addressing challenges of renewable integration, introducing advanced forecasting, autonomous decision engines, market clearing optimization, and innovative architectures, while also analyzing international case studies, regulatory considerations, and future trends such as quantum machine learning and digital twins.

AIDigital TwinMarket Optimization
0 likes · 18 min read
AI-Driven Power Trading: Key Technologies, Architecture, and Future Trends
Tencent Technical Engineering
Tencent Technical Engineering
Feb 14, 2025 · Artificial Intelligence

Technical Overview of DeepSeek Series Models and Innovations

The DeepSeek series introduces a refined Mixture‑of‑Experts architecture with fine‑grained expert partitioning, shared experts, and learnable load‑balancing, alongside innovations such as Group Relative Policy Optimization, Multi‑Head Latent Attention, Multi‑Token Prediction, mixed‑precision FP8 training, and the R1/R1‑Zero models that use Long‑CoT reasoning, reinforcement‑learning pipelines, and distillation to achieve OpenAI‑comparable performance at lower cost.

AIDeepSeekMixture of Experts
0 likes · 25 min read
Technical Overview of DeepSeek Series Models and Innovations
Java Tech Enthusiast
Java Tech Enthusiast
Feb 14, 2025 · Artificial Intelligence

Apple Partners with Alibaba to Develop AI Features for iPhone Users

Apple’s new Apple Intelligence platform, unveiled at WWDC24, will incorporate Alibaba’s Qwen 2.5 Max model to create China‑specific AI features for iPhone users, with a custom dataset and regulatory submission, marking a shift from overseas ChatGPT reliance to a domestic partnership.

AIAlibabaApple
0 likes · 3 min read
Apple Partners with Alibaba to Develop AI Features for iPhone Users
Huolala Tech
Huolala Tech
Feb 14, 2025 · Artificial Intelligence

How AI‑Driven Loss Prevention Transforms Risk Management Across the Software Lifecycle

This article explains a comprehensive AI‑powered loss‑prevention framework that automatically identifies financial‑risk scenarios in both existing and new code, integrates model‑based detection into product, development, testing, and release stages, and continuously refines coverage through intelligent monitoring and rule enforcement.

AIModel Trainingloss prevention
0 likes · 11 min read
How AI‑Driven Loss Prevention Transforms Risk Management Across the Software Lifecycle
Code Ape Tech Column
Code Ape Tech Column
Feb 14, 2025 · Artificial Intelligence

Integrating DeepSeek Large Model with Spring AI: A Step‑by‑Step Guide

This article explains how to integrate DeepSeek's large language models—both the chat‑oriented deepseek‑chat and the reasoning‑focused deepseek‑reasoner—into a Spring AI application, covering API key setup, base‑URL configuration, model selection, and providing full code examples for dependency, configuration, and a simple chat controller.

AIChatbotDeepSeek
0 likes · 6 min read
Integrating DeepSeek Large Model with Spring AI: A Step‑by‑Step Guide
Architect
Architect
Feb 13, 2025 · Artificial Intelligence

How to Build a Mini ChatGPT on a Single GPU with MiniMind

This article provides a comprehensive, step‑by‑step guide to training and fine‑tuning a miniature large‑language model called MiniMind, covering lightweight model design, open‑source training pipelines, required datasets, tokenizer options, and deployment via a web UI, all using PyTorch on modest hardware.

AILLMMiniMind
0 likes · 11 min read
How to Build a Mini ChatGPT on a Single GPU with MiniMind
AIWalker
AIWalker
Feb 13, 2025 · Artificial Intelligence

How FlashVideo Turns Low‑Res Clips into 4K Video with Minimal Compute

FlashVideo introduces a two‑stage framework that first generates low‑resolution videos with strong prompt fidelity and then uses flow‑matching ODE trajectories to upscale to 4K quality in just four function evaluations, achieving top VBench‑Long scores while cutting generation time by up to five‑fold.

AIFlashVideoVideo Generation
0 likes · 26 min read
How FlashVideo Turns Low‑Res Clips into 4K Video with Minimal Compute
ByteDance Cloud Native
ByteDance Cloud Native
Feb 13, 2025 · Cloud Computing

Deploy the Full‑Size DeepSeek‑R1 Model on Volcengine Cloud with Terraform and Kubernetes

This guide walks you through two practical solutions for deploying the massive DeepSeek‑R1 model on Volcengine Cloud—one using Terraform for a quick two‑node GPU setup and another leveraging cloud‑native multi‑node distributed inference with Kubernetes, covering resource sizing, environment preparation, model download, monitoring, autoscaling, and storage acceleration.

AIKubernetesModel Deployment
0 likes · 22 min read
Deploy the Full‑Size DeepSeek‑R1 Model on Volcengine Cloud with Terraform and Kubernetes
Radish, Keep Going!
Radish, Keep Going!
Feb 13, 2025 · Cloud Native

How Wise Is Building a Scalable 2025 Tech Stack with Kubernetes and AI

Wise’s 2025 tech stack overhaul details how its 850‑engineer team leverages cloud‑native tools like Kubernetes, Terraform, and AWS, modernizes frontend with Next.js and Storybook, accelerates mobile builds via Swift Package Manager and Gradle, and integrates AI, data pipelines, and observability to support 12.8 million active customers worldwide.

AIDevOpsMobile Development
0 likes · 20 min read
How Wise Is Building a Scalable 2025 Tech Stack with Kubernetes and AI