Tagged articles
106 articles
Page 1 of 2
IT Services Circle
IT Services Circle
May 19, 2026 · Artificial Intelligence

Peter Steinberger’s $1.3 M Monthly Token Bill: OpenAI’s Subsidy Powers a 100‑Agent OpenClaw

Peter Steinberger revealed that his OpenAI API usage cost $1.3 million in the past 30 days, consuming 6 030 billion tokens across 7.6 million requests, most of which power a cloud‑run fleet of about 100 Codex agents that automate OpenClaw development, prompting a debate on AI‑driven software costs.

AI EngineeringCodexMulti-Agent
0 likes · 7 min read
Peter Steinberger’s $1.3 M Monthly Token Bill: OpenAI’s Subsidy Powers a 100‑Agent OpenClaw
Architect
Architect
May 17, 2026 · Artificial Intelligence

Agent Skills Survey: How Process Knowledge Becomes Technical Debt

The recent arXiv survey on Agent Skills maps the full lifecycle of skills—representation, acquisition, retrieval, and evolution—and warns that unchecked growth can turn a valuable process asset into technical debt, urging teams to enforce admission quality, robust routing, versioning, testing, and retirement mechanisms.

AI EngineeringAgent SkillsProcess Assets
0 likes · 26 min read
Agent Skills Survey: How Process Knowledge Becomes Technical Debt
James' Growth Diary
James' Growth Diary
May 17, 2026 · Artificial Intelligence

Deep Dive into Claude Code Hooks: Stop Hooks and the Self‑Validation Loop

The article dissects Claude Code's Hooks system, detailing its 27 lifecycle events, four hook types, the special behavior of Stop Hooks with exit code 2, the self‑validation loop, practical patterns like the Ralph Loop, and the design trade‑offs and mitigation strategies.

AI EngineeringClaude CodeSelf-Validation
0 likes · 15 min read
Deep Dive into Claude Code Hooks: Stop Hooks and the Self‑Validation Loop
AI Architecture Hub
AI Architecture Hub
May 10, 2026 · Artificial Intelligence

2026 AI Engineer Roadmap: Master Agent Engineering and Scheduling

This guide outlines a six‑stage, 17‑week roadmap for becoming a production‑ready AI agent engineer by 2026, detailing essential skills such as LangGraph orchestration, Claude Agent SDK scheduling, context‑engineering primitives, evaluation pipelines, and curated free resources while warning against over‑hyped frameworks.

AI EngineeringClaude Agent SDKContext Engineering
0 likes · 18 min read
2026 AI Engineer Roadmap: Master Agent Engineering and Scheduling
ITPUB
ITPUB
May 8, 2026 · Artificial Intelligence

How Oracle Skills Open Source Signals the Rise of the AI Skill Era

Oracle has open‑sourced its Skills repository on GitHub, providing over 100 curated, version‑compatible guides for Oracle Database, OCI, GraalVM, Fusion and APEX, and defining a new AI‑centric “Skill” abstraction that lets agents safely generate and execute database operations, heralding a Skill‑driven AI engineering era.

AI EngineeringAI agentsDatabase Skills
0 likes · 16 min read
How Oracle Skills Open Source Signals the Rise of the AI Skill Era
Architect
Architect
May 5, 2026 · Artificial Intelligence

From Anthropic to Google: Agent Skills Enter the Design‑Pattern Era

Google Cloud Tech’s recent article outlines five Agent Skill design patterns, building on Anthropic’s earlier work that standardized Skill format and loading, and shows how the community is shifting from merely defining Skill syntax to engineering robust, reusable workflow structures for AI agents.

AI EngineeringAgent SkillsDesign Patterns
0 likes · 25 min read
From Anthropic to Google: Agent Skills Enter the Design‑Pattern Era
AI Tech Publishing
AI Tech Publishing
May 1, 2026 · Artificial Intelligence

5 Counterintuitive Design Principles for Prompt Caching in Claude Code

The article details five counterintuitive design principles for Claude Code's prompt caching—optimizing prompt layout, using message‑based updates, never switching models or tools mid‑conversation, safely compressing context, and monitoring cache health—backed by concrete examples and up to 90% cost savings.

AI EngineeringClaude CodeLLM agents
0 likes · 10 min read
5 Counterintuitive Design Principles for Prompt Caching in Claude Code
Architect
Architect
May 1, 2026 · Artificial Intelligence

From Vibe Coding to Agentic Engineering: How AI Is Redefining the Engineer‑Architect Boundary

Karpathy’s 2026 Sequoia AI Ascent interview shows that while Vibe Coding lowers the barrier for rapid prototyping, the emerging Agentic Engineering paradigm pushes AI agents into the full software‑development lifecycle, demanding new control planes, verification, context handling and blurring the line between senior engineers and architects.

AI EngineeringAgentic EngineeringControl Plane
0 likes · 34 min read
From Vibe Coding to Agentic Engineering: How AI Is Redefining the Engineer‑Architect Boundary
ZhiKe AI
ZhiKe AI
May 1, 2026 · Artificial Intelligence

From Chatbot to Action: How Large‑Model Agents Turn Queries into Real‑World Tasks

The article explains that large‑model agents differ from traditional chatbots by perceiving goals, planning steps, invoking tools, and executing actions autonomously, covering their definition, core modules, ReAct reasoning‑acting loop, single‑ versus multi‑agent systems, current industry trends, and the reliability, safety, observability, and cost challenges they face.

AI AgentAI EngineeringAgent Architecture
0 likes · 18 min read
From Chatbot to Action: How Large‑Model Agents Turn Queries into Real‑World Tasks
ZhiKe AI
ZhiKe AI
Apr 30, 2026 · R&D Management

Why Martin Fowler Says Determinism Is Over in Software Engineering

Martin Fowler argues that software engineering has moved from a deterministic world to a nondeterministic one driven by LLMs, outlining how this paradigm shift reshapes development practices, introduces new risks, and demands a harness‑based engineering approach to manage uncertainty.

AI EngineeringAgentic EngineeringDeterminism
0 likes · 15 min read
Why Martin Fowler Says Determinism Is Over in Software Engineering
Wu Shixiong's Large Model Academy
Wu Shixiong's Large Model Academy
Apr 29, 2026 · Interview Experience

ByteDance Interviewer Asks: What Rank r Do You Use for LoRA? I Said 64—He Said I'm Wasting GPU Memory

The article examines a common interview scenario where candidates are asked about LoRA rank selection, outlines two typical mistakes—guessing or staying silent—and presents a three‑step strategy of honest boundary setting, logical derivation, and asking a focused question, illustrating the approach with concrete LoRA calculations and a vLLM case study.

AI EngineeringLoRAinterview strategy
0 likes · 13 min read
ByteDance Interviewer Asks: What Rank r Do You Use for LoRA? I Said 64—He Said I'm Wasting GPU Memory
Java Web Project
Java Web Project
Apr 25, 2026 · Artificial Intelligence

Why GPT-5.5’s Silent Release Signals Real Engineering Power

OpenAI’s April 23, 2026 launch of GPT-5.5 delivers record‑high scores on SWE‑Bench Pro (58.6%) and Terminal‑Bench 2.0 (82.7%), adds persistent multi‑file context, dynamic reasoning time, and token efficiency, while real‑world case studies show substantial productivity gains across engineering teams.

AI EngineeringCodexGPT-5.5
0 likes · 13 min read
Why GPT-5.5’s Silent Release Signals Real Engineering Power
Architecture and Beyond
Architecture and Beyond
Apr 25, 2026 · Artificial Intelligence

Practical Insights on Recent AI Engineering Deployments

The article examines how large language models function as probabilistic components within deterministic software, discusses fault‑tolerance limits for viable AI use cases, and offers detailed engineering guidance on RAG pipelines, tool‑calling determinism, agent fragility, testing, monitoring, and privacy‑conscious deployment in finance.

AI EngineeringAgent ArchitectureLLM
0 likes · 14 min read
Practical Insights on Recent AI Engineering Deployments
Huawei Cloud Developer Alliance
Huawei Cloud Developer Alliance
Apr 23, 2026 · Artificial Intelligence

Why Agent Harness Is Central to AI Engineering: OfficeClaw Design & Implementation

The article explains how Agent Harness, defined by six core components (Execution Loop, Tool Registry, Context Manager, State Store, Lifecycle Hooks, Evaluation Interface), forms the operating system for AI agents, and details Huawei Cloud OfficeClaw’s layered architecture and real‑world deployment that boosts task reliability and efficiency.

AI EngineeringAgent HarnessOfficeClaw
0 likes · 11 min read
Why Agent Harness Is Central to AI Engineering: OfficeClaw Design & Implementation
AI Architecture Hub
AI Architecture Hub
Apr 23, 2026 · Artificial Intelligence

Why Prompt Caching Is Critical: Lessons from Building Claude Code

Prompt caching, a prefix‑matching technique that reuses prior LLM interactions, proved essential for Claude Code’s low latency and cost, and the article details counter‑intuitive practices such as arranging static prompts first, updating info via messages, avoiding mid‑session model or tool changes, and ensuring cache‑safe context forks.

AI EngineeringClaude CodeLLM agents
0 likes · 10 min read
Why Prompt Caching Is Critical: Lessons from Building Claude Code
ZhiKe AI
ZhiKe AI
Apr 22, 2026 · Artificial Intelligence

Why Harness Engineering Is the Hottest AI Engineering Paradigm in 2026

The article explains how the emerging "Harness Engineering" paradigm—highlighted by OpenAI, Stripe and Anthropic—shifts AI development from prompt tweaking to building full control systems, promising ten‑fold efficiency gains, new architectural components, and both opportunities and risks for developers.

AI EngineeringHarness EngineeringPrompt engineering
0 likes · 9 min read
Why Harness Engineering Is the Hottest AI Engineering Paradigm in 2026
Wu Shixiong's Large Model Academy
Wu Shixiong's Large Model Academy
Apr 21, 2026 · Artificial Intelligence

When Should an LLM Agent Extract Memory? A Deep Dive into Trigger Strategies

The article analyzes why memory extraction in LLM‑driven agents incurs cost, compares four frameworks—Claude Code, Generative Agents, MemGPT, and Mem0—detailing their trigger mechanisms, concurrency handling, and trade‑offs, and offers practical guidance for choosing the right strategy in real‑time, social, or batch‑processing scenarios.

AI EngineeringAgent DesignLLM
0 likes · 18 min read
When Should an LLM Agent Extract Memory? A Deep Dive into Trigger Strategies
Wu Shixiong's Large Model Academy
Wu Shixiong's Large Model Academy
Apr 20, 2026 · Artificial Intelligence

Why Java Skills Alone Won’t Cut It for LLM Application Engineering

The article debunks the myth that Java developers only need a bit of AI knowledge to succeed in LLM application roles, explaining the full engineering stack—from retrieval and prompt design to deployment and performance tuning—through real‑world examples, metrics, and interview‑ready advice.

AI EngineeringBackendInterview Preparation
0 likes · 13 min read
Why Java Skills Alone Won’t Cut It for LLM Application Engineering
Baobao Algorithm Notes
Baobao Algorithm Notes
Apr 20, 2026 · Industry Insights

From Prompt Writer to Harness Architect: Redefining the Algorithm Engineer in the LLM Era

The article analyzes how the rise of foundation models shifts algorithm engineers from hand‑crafting models to building robust Harness environments, detailing OpenAI’s agent‑first experiments, the new "Model + Harness" formula, and practical steps for staying valuable in a prompt‑centric world.

AI EngineeringHarness architectureLLM
0 likes · 9 min read
From Prompt Writer to Harness Architect: Redefining the Algorithm Engineer in the LLM Era
MeowKitty Programming
MeowKitty Programming
Apr 19, 2026 · Artificial Intelligence

Why Java Developers Can Now Treat AI as a Full Engineering Stack

The article explains how recent releases like Java 26 and Spring AI 2.0 have turned Java‑AI from a hobbyist demo into a mature, production‑ready engineering stack, outlining the practical steps Java teams should follow to integrate AI into existing systems.

AIAI EngineeringBackend
0 likes · 8 min read
Why Java Developers Can Now Treat AI as a Full Engineering Stack
Su San Talks Tech
Su San Talks Tech
Apr 19, 2026 · Artificial Intelligence

Boost Enterprise RAG: Data Pipeline Tricks, Hybrid Search & Rerank

To make Retrieval‑Augmented Generation reliable in production, the article outlines five key engineering tactics—semantic chunking with metadata, hybrid vector‑keyword search, two‑stage retrieval with reranking, query rewriting and expansion, and dynamic result evaluation—each illustrated with concrete examples and code snippets.

AI EngineeringHybrid SearchQuery Rewriting
0 likes · 10 min read
Boost Enterprise RAG: Data Pipeline Tricks, Hybrid Search & Rerank
Qborfy AI
Qborfy AI
Apr 19, 2026 · Artificial Intelligence

Boosting Claude’s Front‑End Development with a GAN‑Inspired Multi‑Agent Harness

The article details how a GAN‑inspired multi‑agent harness—combining a generator, an evaluator, and a planner—overcomes context‑window anxiety and self‑evaluation bias, enabling Claude to produce higher‑quality front‑end designs and full‑stack applications through iterative scoring, sprint contracts, and extensive cost‑benefit experiments.

AI EngineeringFull-Stack DevelopmentGAN
0 likes · 19 min read
Boosting Claude’s Front‑End Development with a GAN‑Inspired Multi‑Agent Harness
DevOps in Software Development
DevOps in Software Development
Apr 17, 2026 · Artificial Intelligence

Designing a Control System for AI Code Generators: The Harness Engineering Framework

This article breaks down Birgitta Böckeler's Harness Engineering framework, explaining its 2×2 control matrix, the distinction between computational and inferential controls, three regulation dimensions, and new concepts like Harnessability and Harness Templates, while offering concrete actions for engineering leaders.

AI EngineeringAI code generationHarness Engineering
0 likes · 10 min read
Designing a Control System for AI Code Generators: The Harness Engineering Framework
AI Waka
AI Waka
Apr 16, 2026 · Artificial Intelligence

Why Modern AI Systems Should Compile Knowledge Instead of Just Retrieving It

Traditional RAG pipelines forget everything after each query, but the LLM Wiki mode proposed by Andrej Karpathy compiles source material into a version‑controlled, cross‑referenced Markdown wiki, enabling knowledge to compound over time, reduce query costs, and provide a transparent, human‑readable knowledge base for AI engineers.

AI EngineeringLLMPrompt engineering
0 likes · 23 min read
Why Modern AI Systems Should Compile Knowledge Instead of Just Retrieving It
AI Tech Publishing
AI Tech Publishing
Apr 15, 2026 · Artificial Intelligence

8 Critical Harness Design Issues That Threaten Long‑Running Agent Accuracy

The article systematically breaks down why autonomous agents lose control during long‑running engineering tasks—missing context, short‑sighted planning, context anxiety, and plan drift—and shows how a well‑designed harness layer can preempt these problems without changing the underlying model.

AI EngineeringAutonomous AgentsHarness
0 likes · 11 min read
8 Critical Harness Design Issues That Threaten Long‑Running Agent Accuracy
Design Hub
Design Hub
Apr 15, 2026 · Artificial Intelligence

Why Your Company’s “AI‑First” Strategy Might Not Be Real AI‑First

The article dissects CREAO’s AI‑first engineering system, contrasting true AI‑driven workflows with superficial AI assistance, and explains how a unified monorepo, automated CI/CD pipelines, self‑healing loops, and specialized roles enable a 25‑person team to outperform competitors by a factor of 100.

AI EngineeringAI-firstAgent Platform
0 likes · 15 min read
Why Your Company’s “AI‑First” Strategy Might Not Be Real AI‑First
Tech Freedom Circle
Tech Freedom Circle
Apr 12, 2026 · Artificial Intelligence

What Is Harness Agent? A Deep Dive into the New AI Engineering Framework

Harness Agent is an AI engineering framework that combines a large language model with a runtime control system—called the Harness—to provide task planning, sandboxed execution, tool integration, memory management, safety guardrails, and observability, turning raw model capabilities into reliable, production‑grade agents.

AI EngineeringAgent ArchitectureDeerFlow
0 likes · 26 min read
What Is Harness Agent? A Deep Dive into the New AI Engineering Framework
Qborfy AI
Qborfy AI
Apr 11, 2026 · Industry Insights

Why AI Agents Need Harness Engineering: Insights from OpenAI, LangChain, and Anthropic

This article explains how AI agents often stall, repeat mistakes, or diverge on complex tasks, argues that the missing piece is a well‑designed harness, and demonstrates with real‑world case studies from OpenAI, LangChain, and Anthropic how a six‑component harness can boost performance by over 13 percentage points and enable million‑line code generation.

AI EngineeringAgent HarnessAnthropic
0 likes · 12 min read
Why AI Agents Need Harness Engineering: Insights from OpenAI, LangChain, and Anthropic
AI Large-Model Wave and Transformation Guide
AI Large-Model Wave and Transformation Guide
Apr 7, 2026 · Artificial Intelligence

Why Harness Engineering Is the New AI Competitive Edge in 2026

The article argues that as large‑model capabilities converge, the decisive factor in 2026 AI competition shifts from raw model power to the ability to engineer a full‑stack Harness system that multiplies performance tenfold through standardized adapters, dynamic prompt registries, multi‑agent orchestration, context compression, and observability.

AI EngineeringHarnessMulti-Agent
0 likes · 14 min read
Why Harness Engineering Is the New AI Competitive Edge in 2026
ArcThink
ArcThink
Apr 6, 2026 · Artificial Intelligence

How Harness Engineering Let a 3‑Person Team Write 1 Million Lines of Code in 5 Months

Harness Engineering combines systematic prompts, context management, and robust validation loops to turn powerful LLMs into reliable agents, enabling a three‑engineer team to produce about one million lines of production code in five months and boosting LangChain’s benchmark ranking by 25 places, proving that well‑designed harnesses outweigh model improvements by an order of magnitude.

AI EngineeringAgent SystemsContext Engineering
0 likes · 25 min read
How Harness Engineering Let a 3‑Person Team Write 1 Million Lines of Code in 5 Months
Architecture Musings
Architecture Musings
Apr 4, 2026 · Industry Insights

Exploring the Harness Architecture Concept: A First Look

This article examines the emerging "Harness Architecture" idea, arguing that constraints should be applied before AI code generation by leveraging stepwise refinement, modular contracts, and living design documents to improve precision, reduce token usage, and prevent architectural drift in large software projects.

AI EngineeringHarness architectureSpec‑Driven Development
0 likes · 8 min read
Exploring the Harness Architecture Concept: A First Look
PaperAgent
PaperAgent
Apr 2, 2026 · Artificial Intelligence

Can an LLM Build a Full‑Stack Knowledge Graph System in Under 3 Hours?

Using the GLM‑5.1 large language model, the author automated the end‑to‑end development of an ontology‑based knowledge‑graph extraction and visualization platform—covering backend, frontend, and graph database—in just 2 hours 47 minutes, consuming 747 k tokens and self‑correcting multiple issues.

AI EngineeringFull-Stack DevelopmentGLM-5.1
0 likes · 12 min read
Can an LLM Build a Full‑Stack Knowledge Graph System in Under 3 Hours?
dbaplus Community
dbaplus Community
Apr 1, 2026 · Artificial Intelligence

What the Claude Code Leak Reveals About AI Engineering Practices

A massive accidental release of Claude Code's 512,000-line TypeScript source, exposed via a source‑map file, lets anyone reconstruct the entire codebase and offers a stark, real‑world case study of high‑performance AI tooling, architectural trade‑offs, and the hidden costs of rapid development.

AI EngineeringAnthropicClaude Code
0 likes · 10 min read
What the Claude Code Leak Reveals About AI Engineering Practices
AsiaInfo Technology: New Tech Exploration
AsiaInfo Technology: New Tech Exploration
Apr 1, 2026 · Industry Insights

How Harness Engineering Is Redefining Industrial AI Agents

This article analyzes the emergence of Harness Engineering as the third‑generation AI engineering paradigm, explains its three‑layer Industrial Harness architecture, identifies three failure modes of long‑running industrial agents, and validates the approach with quantitative case studies and a roadmap for Physical AI OS deployment.

AI EngineeringHarness EngineeringIndustrial Agents
0 likes · 28 min read
How Harness Engineering Is Redefining Industrial AI Agents
ArcThink
ArcThink
Apr 1, 2026 · Artificial Intelligence

Inside Claude Code: 1,900‑File Source Dive Reveals Six‑Layer Architecture

After a source‑map leak exposed Claude Code’s 1,900 TypeScript files, this analysis dissects its six‑layer architecture, dynamic prompt assembly, four‑level caching, 60+ tool governance pipeline, six built‑in agents, five context‑compression strategies, and the real engineering trade‑offs hidden beneath the product.

AI EngineeringAgent SystemsPrompt engineering
0 likes · 31 min read
Inside Claude Code: 1,900‑File Source Dive Reveals Six‑Layer Architecture
Lao Guo's Learning Space
Lao Guo's Learning Space
Mar 31, 2026 · Operations

Harness Engineering Best Practices: Real‑World AI Ops Lessons from 4 Companies

This article explains Harness Engineering—a methodology that lets AI agents work reliably by steering humans and automating execution—through core principles, a performance boost demonstrated by OpenAI, and detailed case studies from OpenAI, Citi, Ancestry, and Ulta Beauty, followed by a step‑by‑step adoption roadmap.

AI EngineeringArchitecture ConstraintsContext Engineering
0 likes · 11 min read
Harness Engineering Best Practices: Real‑World AI Ops Lessons from 4 Companies
Java Architect Essentials
Java Architect Essentials
Mar 31, 2026 · Artificial Intelligence

What the Claude Code Leak Reveals: 510k Lines of AI Engine Exposed

A massive leak of Anthropic’s Claude Code exposed over 1,900 files and 510,000 lines of TypeScript, revealing its React‑Ink UI, Bun runtime, extensive toolset, hidden Kairos mode, electronic pet system, and covert Undercover features, sparking worldwide developer frenzy and security concerns.

AI EngineeringBun runtimeClaude
0 likes · 7 min read
What the Claude Code Leak Reveals: 510k Lines of AI Engine Exposed
Data Party THU
Data Party THU
Mar 30, 2026 · Artificial Intelligence

Why AI Needs a ‘Harness’: Building Environments for Persistent Agents

The article analyzes the emerging concept of Harness Engineering—combining AI models with structured environments, standards, and feedback loops—to enable agents that can work continuously, illustrated by OpenAI and Anthropic case studies, practical design guidelines, and a three‑week adoption plan.

AI EngineeringAgent DesignHarness Engineering
0 likes · 10 min read
Why AI Needs a ‘Harness’: Building Environments for Persistent Agents
AI Large Model Application Practice
AI Large Model Application Practice
Mar 30, 2026 · Artificial Intelligence

Why Agent Harnesses Are the Key to Production‑Ready AI Agents

The article analyzes the emerging concept of Agent Harnesses, explaining how they transform unruly large‑model agents into controllable, production‑grade systems by addressing long‑running tasks, legacy code complexity, execution‑delivery gaps, and safety concerns through systematic engineering practices.

AI EngineeringAgent HarnessLarge Language Models
0 likes · 18 min read
Why Agent Harnesses Are the Key to Production‑Ready AI Agents
Architect
Architect
Mar 28, 2026 · Artificial Intelligence

Why AI Agents Need a Harness: From Model Power to System Reliability

The article analyzes how the growing strength of large language models shifts engineering bottlenecks from model capabilities to system stability, introducing the concept of a "Harness" that integrates models into real‑world workflows through state management, constraints, feedback loops, and verification mechanisms.

AI EngineeringAI OpsAgent Harness
0 likes · 18 min read
Why AI Agents Need a Harness: From Model Power to System Reliability
Nightwalker Tech
Nightwalker Tech
Mar 27, 2026 · Artificial Intelligence

Why AI Needs a Harness Engineering Framework to Tackle Long‑Term Complex Tasks

The article explains that AI struggles with extended, complex tasks not because models lack intelligence but due to missing systematic engineering practices, and proposes a Harness Engineering framework that introduces external memory, task decomposition, fixed SOP loops, and test‑driven safeguards to turn AI agents into reliable, production‑grade collaborators.

AI EngineeringHarness frameworkSystematic AI
0 likes · 4 min read
Why AI Needs a Harness Engineering Framework to Tackle Long‑Term Complex Tasks
DataFunTalk
DataFunTalk
Mar 27, 2026 · Artificial Intelligence

Building a Production‑Ready RAG Engine: Architecture, Challenges & Solutions

This article examines the practical challenges of deploying Retrieval‑Augmented Generation in enterprise settings, outlines a layered RAG architecture with offline document processing and online query handling, and details the hybrid retrieval, multi‑stage ranking, knowledge filtering, and generation techniques that improve accuracy and reduce hallucinations.

AI EngineeringHybrid RetrievalKnowledge Filtering
0 likes · 22 min read
Building a Production‑Ready RAG Engine: Architecture, Challenges & Solutions
Tencent TDS Service
Tencent TDS Service
Mar 27, 2026 · Artificial Intelligence

How Kuikly’s AI Engineering Boosted Cross‑Platform Development Efficiency

The article details how the Kuikly cross‑platform framework team tackled AI coding challenges by redesigning their architecture, building precise AI context documents, standardizing requirement flows with Spec‑Kit, and integrating a suite of AI tools, resulting in significant productivity gains and higher code quality.

AI EngineeringCross‑platform developmentKuikly
0 likes · 15 min read
How Kuikly’s AI Engineering Boosted Cross‑Platform Development Efficiency
SuanNi
SuanNi
Mar 25, 2026 · Artificial Intelligence

Can Harness Engineering Enable AI Agents to Master Complex Long‑Running Tasks?

This article analyses the concept of Harness engineering introduced by OpenAI and Anthropic, explains how multi‑agent architectures decompose and manage long‑running AI tasks, examines practical experiments such as a retro game maker and a web‑audio workstation, and distills lessons for future AI system design.

AI EngineeringAnthropicClaude
0 likes · 16 min read
Can Harness Engineering Enable AI Agents to Master Complex Long‑Running Tasks?
Architect's Journey
Architect's Journey
Mar 25, 2026 · Artificial Intelligence

Why SKILL Makes AI Development Surprisingly Simple

The article introduces the SKILL framework, explains its file‑based structure and LLM‑driven entry point, compares it with traditional API‑centric backends, outlines its suitable use cases and limitations, and argues that mastering SKILL will become a core productivity skill for developers.

AI EngineeringLLMSKILL framework
0 likes · 8 min read
Why SKILL Makes AI Development Surprisingly Simple
Coder Circle
Coder Circle
Mar 17, 2026 · Industry Insights

After a Decade of Java, Why the Programmer Era Is Shifting

The article analyzes how AI is now writing code, compressing development cycles from half a day to minutes, reshaping programmer roles through three historical value shifts, highlighting new AI‑centric responsibilities, and offering a concrete learning path for Java developers to thrive in the AI era.

AI EngineeringAI PlatformsArtificial Intelligence
0 likes · 8 min read
After a Decade of Java, Why the Programmer Era Is Shifting
AI Engineer Programming
AI Engineer Programming
Mar 16, 2026 · Artificial Intelligence

Why “Agent Development” Misleads: Framework vs. Harness in LLM Agents

The article explains that the term “Agent development” hides a fundamental split between Agent Frameworks, which give developers building blocks to assemble their own agents, and Agent Harnesses, which provide ready‑to‑run agents, and shows how this distinction affects decisions, maintenance, and troubleshooting.

AI EngineeringClaude CodeFramework
0 likes · 10 min read
Why “Agent Development” Misleads: Framework vs. Harness in LLM Agents
AI Architecture Hub
AI Architecture Hub
Mar 10, 2026 · Industry Insights

How OpenAI Built a Million‑Line Codebase Without Human Typing – Lessons for AI‑Driven Software Engineering

OpenAI’s five‑month "Harness Engineering" experiment showed that a three‑person team could generate a million‑line software product entirely with Codex and GPT‑5, achieving ten‑fold productivity, redefining engineering roles, workflow loops, and offering five practical guidelines for AI‑augmented development while highlighting unresolved challenges.

AI EngineeringAI productivityHarness Engineering
0 likes · 18 min read
How OpenAI Built a Million‑Line Codebase Without Human Typing – Lessons for AI‑Driven Software Engineering
AI Architecture Hub
AI Architecture Hub
Mar 6, 2026 · Artificial Intelligence

How Agent Skills Solve AI Prompt Drift and Enable Scalable AI Workflows

This article analyzes the prompt‑drift problem in AI workflows, explains the open‑format Agent Skills standard, dissects a concrete Python code‑audit skill design, compares Anthropic and OpenAI ecosystems, and provides practical guidelines for building high‑availability, version‑controlled Agent Skills.

AI EngineeringAgent SkillsAnthropic
0 likes · 15 min read
How Agent Skills Solve AI Prompt Drift and Enable Scalable AI Workflows
Architect
Architect
Feb 28, 2026 · Artificial Intelligence

Designing Agent Tools: Key Lessons from Claude Code’s Action Space

This article distills the Claude Code team's hard‑won insights on building effective AI agents, highlighting why action‑space design outweighs model size, how structured questioning improves bandwidth, when to replace Todos with Tasks, and a repeatable seven‑step loop for evolving toolsets.

AI EngineeringAction SpacePrompt engineering
0 likes · 20 min read
Designing Agent Tools: Key Lessons from Claude Code’s Action Space
AI Large Model Application Practice
AI Large Model Application Practice
Feb 19, 2026 · Artificial Intelligence

When Should You Add a Knowledge Graph? 6 Practical Decision Criteria

This article outlines six concrete criteria—relationship‑centric data, reproducible reasoning, evolving schemas, multi‑hop queries, explainable decisions, and cross‑system data integration—to help engineers decide whether a knowledge graph is the right solution or if a relational database will suffice.

AI EngineeringData Integrationexplainability
0 likes · 15 min read
When Should You Add a Knowledge Graph? 6 Practical Decision Criteria
Yunqi AI+
Yunqi AI+
Feb 13, 2026 · Artificial Intelligence

AI Engineering: Methodology and Practice for Turning Generative AI into Production Systems

The article outlines a comprehensive AI engineering methodology—including the TPMR framework, an AI‑driven development lifecycle, talent transformation from co‑pilot to AI pilot, and a practical enterprise adoption roadmap—to move generative AI and large models from experimental prototypes to production‑grade systems.

AI EngineeringAI LifecycleLLMOps
0 likes · 5 min read
AI Engineering: Methodology and Practice for Turning Generative AI into Production Systems
Baobao Algorithm Notes
Baobao Algorithm Notes
Feb 3, 2026 · Industry Insights

How One Developer Merges 600 Commits a Day with AI: Inside the Clawdbot Workflow

In this in‑depth interview, Peter Steinberger explains how AI agents let him submit and merge hundreds of commits daily, replace traditional code reviews with prompt‑driven requests, and redesign his development workflow around a closed‑loop validation system that reshapes modern software engineering.

AI EngineeringCode reviewagent workflow
0 likes · 85 min read
How One Developer Merges 600 Commits a Day with AI: Inside the Clawdbot Workflow
High Availability Architecture
High Availability Architecture
Feb 1, 2026 · Artificial Intelligence

10 Proven Strategies to Become an AI Engineer with Claude Code

This guide shares ten practical techniques—from parallel git worktrees and plan‑mode prompting to custom Skills, sub‑agents, and terminal tweaks—that let developers treat Claude as a fully orchestrated AI engineering system, boosting productivity and reducing manual coding overhead.

AI EngineeringClaude Codeautomation
0 likes · 12 min read
10 Proven Strategies to Become an AI Engineer with Claude Code
Architecture and Beyond
Architecture and Beyond
Jan 17, 2026 · Artificial Intelligence

Progressive Disclosure & Dynamic Context: Making LLM Agents Reliable Execution Systems

This article explains how progressive disclosure and dynamic context management address the three core bottlenecks of complex LLM agents—context explosion, tool overload, and uncontrolled execution—by structuring context, tools, and SOPs into layered, token‑efficient, and verifiable workflows.

AI EngineeringLLM agentsProgressive Disclosure
0 likes · 15 min read
Progressive Disclosure & Dynamic Context: Making LLM Agents Reliable Execution Systems
Tencent Technical Engineering
Tencent Technical Engineering
Jan 9, 2026 · Artificial Intelligence

Why Traditional Spec‑Driven Tools Fail and How AI‑Powered Context Engineering Can Supercharge Development

This article analyzes the shortcomings of spec‑driven tools like speckit and openspec in complex enterprise environments, introduces context engineering and compound engineering concepts, and presents a practical AI‑engineered workflow that reduces marginal cost, captures knowledge automatically, and makes AI tools feel invisible to developers.

AI EngineeringContext Engineeringcompound-engineering
0 likes · 48 min read
Why Traditional Spec‑Driven Tools Fail and How AI‑Powered Context Engineering Can Supercharge Development
DaTaobao Tech
DaTaobao Tech
Jan 5, 2026 · Artificial Intelligence

Why AI Engineering Isn’t a Reinvention of Software Architecture – Insights from AI Search

The article examines how AI engineering builds on, rather than discards, traditional software engineering principles, using the evolution of AI‑driven search at Alibaba to illustrate architectural upgrades that manage uncertainty, integrate context engineering, and combine classic design patterns with new AI‑specific tools.

AI EngineeringContext EngineeringMulti-Agent
0 likes · 21 min read
Why AI Engineering Isn’t a Reinvention of Software Architecture – Insights from AI Search
Wu Shixiong's Large Model Academy
Wu Shixiong's Large Model Academy
Nov 22, 2025 · Artificial Intelligence

Why Your RAG System Slows Down Over Time and How to Fix It

The article explains why a production Retrieval‑Augmented Generation (RAG) system becomes slower as it runs—due to growing embedding costs, expanding vector databases, heavier re‑ranking, and larger prompts—and provides concrete engineering optimizations such as batching, async concurrency, caching, partitioned retrieval, HNSW tuning, replica scaling, answer caching, and prompt sparsification to keep performance stable.

AI EngineeringRAGembedding cache
0 likes · 10 min read
Why Your RAG System Slows Down Over Time and How to Fix It
Wu Shixiong's Large Model Academy
Wu Shixiong's Large Model Academy
Nov 21, 2025 · Artificial Intelligence

How to Build a Multi‑Layer Cache for Dynamic RAG Systems

This article explains why dynamic Retrieval‑Augmented Generation (RAG) requires a layered caching strategy rather than simple result caching, details a four‑level cache architecture—including embedding, search, answer, and pipeline caches—provides practical key‑generation and TTL guidelines, and outlines dirty‑data defenses to keep caches consistent and performant.

AI EngineeringLLMRAG
0 likes · 10 min read
How to Build a Multi‑Layer Cache for Dynamic RAG Systems
JD Tech
JD Tech
Nov 6, 2025 · Artificial Intelligence

LLMs Revolutionize Recommendation Systems: From Generative Models to Production

This article surveys the evolution of generative recommendation systems powered by large language models, detailing their technical foundations, engineering challenges, recent breakthroughs, and future research directions, while highlighting why the paradigm shift is occurring now.

AI EngineeringGenerative RecommendationLLM
0 likes · 30 min read
LLMs Revolutionize Recommendation Systems: From Generative Models to Production
JD Tech Talk
JD Tech Talk
Oct 27, 2025 · Artificial Intelligence

How Large Language Models Are Revolutionizing Generative Recommendation Systems

Over the past year, generative recommendation has made substantial progress by leveraging large language models' powerful sequence modeling and reasoning abilities, introducing a new paradigm that replaces complex handcrafted features, addresses traditional recommendation bottlenecks, and outlines the evolution, core technologies, engineering challenges, and future directions of LLM‑based recommendation systems.

AI EngineeringEncoder-DecoderLLM
0 likes · 29 min read
How Large Language Models Are Revolutionizing Generative Recommendation Systems
21CTO
21CTO
Oct 6, 2025 · Artificial Intelligence

How to Become an AI Engineer: Skills, Workflow, and Career Path

This guide explains what AI engineering entails, outlines the end‑to‑end workflow from problem definition and data preparation through model development, deployment, and monitoring, and highlights the essential programming, cloud, and MLOps skills, career tracks, emerging trends, and salary outlook for aspiring AI engineers.

AI EngineeringMLOpsModel Deployment
0 likes · 11 min read
How to Become an AI Engineer: Skills, Workflow, and Career Path
Instant Consumer Technology Team
Instant Consumer Technology Team
Sep 30, 2025 · Artificial Intelligence

What Makes Youtu-GraphRAG’s Engineering Stand Out? Inside the AI Blueprint

This article dissects the engineering of Tencent's Youtu-GraphRAG, covering its architectural challenges, real‑time FastAPI/WebSocket design, security measures, iterative retrieval chains, parallel processing, intelligent caching, schema‑driven knowledge handling, and performance tweaks, offering practical insights for AI system builders.

AI EngineeringFastAPIGraphRAG
0 likes · 7 min read
What Makes Youtu-GraphRAG’s Engineering Stand Out? Inside the AI Blueprint
AI2ML AI to Machine Learning
AI2ML AI to Machine Learning
Sep 28, 2025 · Artificial Intelligence

Core Metrics for Enterprise Large‑Model Engineering

The article outlines the five essential engineering domains—application, model, compute, knowledge, and data—in the era of large models, and details concrete scale, efficiency, service, value, quality, and security metrics that enterprises should track to drive intelligent outcomes.

AI Engineeringbusiness valuecompute metrics
0 likes · 7 min read
Core Metrics for Enterprise Large‑Model Engineering
21CTO
21CTO
Sep 21, 2025 · Artificial Intelligence

Andrew Ng Explains the AI Skills Gap and How to Become an In‑Demand AI Engineer

Andrew Ng highlights a growing mismatch in the tech job market: while companies scramble for AI‑savvy developers, many new CS graduates struggle to find work, and he outlines the core AI‑driven capabilities—prompt engineering, RAG, model evaluation, and rapid prototyping—that separate successful AI engineers from the rest.

AIAI Engineeringjob market
0 likes · 6 min read
Andrew Ng Explains the AI Skills Gap and How to Become an In‑Demand AI Engineer
Alibaba Cloud Developer
Alibaba Cloud Developer
Sep 15, 2025 · Artificial Intelligence

Why MCP Isn't a Magic AI Upgrade: Deep Dive into Its Architecture, Host Role, and Real Costs

This article debunks common misconceptions about the Model Context Protocol (MCP), explains its client‑host‑server (CHS) architecture, shows how the Host drives AI decisions while Server and Client remain model‑agnostic, compares MCP with Function Calling, analyzes SDK source code, evaluates practical trade‑offs, and outlines the true engineering value and costs of using MCP in AI applications.

AI EngineeringFunction CallingLLM
0 likes · 35 min read
Why MCP Isn't a Magic AI Upgrade: Deep Dive into Its Architecture, Host Role, and Real Costs
ShiZhen AI
ShiZhen AI
Sep 5, 2025 · Artificial Intelligence

Andrew Ng Highlights Core AI Engineer Skills Amidst Major AI Industry Updates

The article reports that ChatGPT now supports branch conversations, Anthropic restricts service use in certain regions, Andrew Ng outlines essential AI engineer capabilities such as AI‑assisted software building, prompting and agentic workflows, and highlights the market demand, while also covering the Kimi K2 model upgrade, Hugging Face’s FineVision dataset release, and Google’s AI‑driven Deep Loop Shaping method published in *Science*.

AI EngineeringAI SafetyAI for astronomy
0 likes · 8 min read
Andrew Ng Highlights Core AI Engineer Skills Amidst Major AI Industry Updates
Alibaba Cloud Big Data AI Platform
Alibaba Cloud Big Data AI Platform
Aug 8, 2025 · Artificial Intelligence

Unlocking Big Data Ops with Large Models: Opportunities, Challenges, Design

This article summarizes a Cloud Summit talk where Alibaba Cloud’s AI expert Zhang Yingying explains how large language models can enhance big‑data intelligent operations, covering opportunities, challenges, RAG‑based Q&A, multi‑agent diagnostics, and the engineering architecture needed for reliable, scalable deployment.

AI EngineeringBig Data OperationsLarge Language Models
0 likes · 20 min read
Unlocking Big Data Ops with Large Models: Opportunities, Challenges, Design
Alibaba Cloud Big Data AI Platform
Alibaba Cloud Big Data AI Platform
Aug 8, 2025 · Artificial Intelligence

From Chain‑of‑Thought to Self‑Evolving Agents: Lessons from Alibaba’s Intelligent Ops

This article traces the evolution of Alibaba’s intelligent agents from the initial chain‑of‑thought design through instantiation, structuring, self‑evolution, and middleware integration, highlighting practical challenges, architectural refinements, and open‑source tools for large‑scale AI operations.

AI Engineeringagentmiddleware
0 likes · 16 min read
From Chain‑of‑Thought to Self‑Evolving Agents: Lessons from Alibaba’s Intelligent Ops
Architect's Alchemy Furnace
Architect's Alchemy Furnace
Jun 4, 2025 · Artificial Intelligence

What Is an AI Engineer? Roles, Skills, and the Future of LLM‑Powered Systems

This article examines the evolving role of the AI engineer, contrasting it with AI researchers, ML engineers, and software engineers, outlines essential skills such as prompt engineering, MLOps, and data integration, and predicts how AI engineering will become a pivotal, high‑demand discipline in the coming years.

AI EngineeringAI systemsAgentic RAG
0 likes · 17 min read
What Is an AI Engineer? Roles, Skills, and the Future of LLM‑Powered Systems
Java Architecture Diary
Java Architecture Diary
May 19, 2025 · Artificial Intelligence

How Ollama 0.7 Unlocks Local Multimodal AI with One Command

Ollama 0.7 introduces a fully re‑engineered core that brings seamless multimodal model support, lists top visual models, showcases OCR and image analysis capabilities, explains technical breakthroughs, and provides a quick three‑step guide to deploy powerful local AI vision.

AI EngineeringAI modelsOllama
0 likes · 7 min read
How Ollama 0.7 Unlocks Local Multimodal AI with One Command
DevOps
DevOps
Apr 22, 2025 · Artificial Intelligence

How to Think About Agent Frameworks: A Critical Review of Design Patterns, Challenges, and LangGraph

This article critically examines popular agent frameworks, compares OpenAI and Anthropic definitions, highlights the core difficulty of maintaining proper context for reliable agents, and presents LangGraph’s declarative and imperative features along with practical guidance for building production‑grade agent systems.

AI EngineeringAgent FrameworksAgent Systems
0 likes · 24 min read
How to Think About Agent Frameworks: A Critical Review of Design Patterns, Challenges, and LangGraph
DaTaobao Tech
DaTaobao Tech
Feb 21, 2025 · Artificial Intelligence

AI-Powered Face Swapping for the Spring Festival Gala: System Design and Deployment

The paper details the design and deployment of an AI‑driven face‑swap platform for the 2025 CCTV Spring Festival Gala, featuring a dual‑model SDXL pipeline with ControlNet and LoRA fine‑tuning, optimized preprocessing and GPU‑specific acceleration to achieve sub‑3‑second latency at over 10 k QPS, supporting scaling, throttling, and multi‑region load balancing, and ultimately serving ten million users and generating hundreds of millions of personalized gala images.

AI EngineeringAIGCModel Optimization
0 likes · 28 min read
AI-Powered Face Swapping for the Spring Festival Gala: System Design and Deployment
21CTO
21CTO
Dec 18, 2024 · Artificial Intelligence

5 AI Engineering Trends Shaping 2024: Agents, Coding Tools, and the Rise of Small Models

The 2024 AI engineering landscape is defined by mature AI coding assistants, the surge of AI agents like LangChain and LlamaIndex, the emergence of small, locally‑hosted language models, the solidifying role of AI engineers, and heated debates over what truly counts as open‑source AI.

AI EngineeringAI coding toolssmall language models
0 likes · 9 min read
5 AI Engineering Trends Shaping 2024: Agents, Coding Tools, and the Rise of Small Models
Zhihu Tech Column
Zhihu Tech Column
Dec 9, 2024 · Artificial Intelligence

Large Model Application Engineering: ZhiLight Inference Framework and Zhihu Direct Answer System

The article details Zhihu's technical salon on large‑model engineering, covering the RAG‑based Zhihu Direct Answer system, the open‑source ZhiLight inference framework, prompt engineering, agent research, and future plans for integrating AI into product and community workflows.

AI EngineeringInference FrameworkPrompt engineering
0 likes · 8 min read
Large Model Application Engineering: ZhiLight Inference Framework and Zhihu Direct Answer System
Baobao Algorithm Notes
Baobao Algorithm Notes
Aug 29, 2024 · Industry Insights

Why Pretraining Boosts New Engineers More Than SFT: A Practical Guide

The answer argues that fresh graduates should join pre‑training teams because the required engineering tasks—large‑scale data crawling, Hadoop/Spark pipelines, torch and CUDA setup, megatron code debugging, and scaling‑law experiments—rapidly sharpen coding skills, while SFT work focuses mainly on data labeling and offers slower technical growth.

AI EngineeringSFTSkill development
0 likes · 7 min read
Why Pretraining Boosts New Engineers More Than SFT: A Practical Guide
Baobao Algorithm Notes
Baobao Algorithm Notes
Aug 26, 2024 · Artificial Intelligence

Master Essential LLM Engineering Skills: Transform, Model, and Infer with Custom Scripts

This guide presents a hands‑on curriculum of core large‑model engineering tasks—including model conversion scripts, custom modeling wrappers, multi‑model inference utilities, and channel‑aware loss tracking—to help practitioners build practical, reusable tools without deep theoretical overhead.

AI EngineeringInference OptimizationPython scripting
0 likes · 8 min read
Master Essential LLM Engineering Skills: Transform, Model, and Infer with Custom Scripts
JD Tech
JD Tech
May 31, 2024 · Artificial Intelligence

Understanding Large Language Models, Retrieval‑Augmented Generation, and AI Agents: Concepts, Engineering Practices, and Applications

This article explains the fundamentals and engineering practices of large language models (LLM), retrieval‑augmented generation (RAG) and AI agents, compares small and large embedding models, provides Python code for vector‑database RAG with Chroma, and discusses integration, use cases, and future challenges in AI development.

AI EngineeringAI agentsLLM
0 likes · 41 min read
Understanding Large Language Models, Retrieval‑Augmented Generation, and AI Agents: Concepts, Engineering Practices, and Applications
G7 EasyFlow Tech Circle
G7 EasyFlow Tech Circle
May 29, 2024 · Artificial Intelligence

Engineering Large Model Enterprise Applications: Best Practices

This article outlines the key characteristics of large‑model enterprise applications, compares them with consumer use cases, and presents a comprehensive engineering roadmap—including model selection, knowledge‑base integration, tool implementation, intent recognition, output control, high‑availability deployment, and ongoing optimization—to help practitioners effectively harness AI models in real‑world business environments.

AI EngineeringLarge ModelRAG
0 likes · 12 min read
Engineering Large Model Enterprise Applications: Best Practices
DevOps
DevOps
Apr 17, 2024 · Artificial Intelligence

Engineering Capabilities for Enterprise Large Model Applications: Prompt Engineering, RAG, and Fine‑Tuning

The article explores how enterprises can build and improve large‑model applications by combining prompt engineering, retrieval‑augmented generation (RAG), and fine‑tuning, discusses their relationships, optimization dimensions, testing challenges, and provides practical guidance for SE4AI implementation.

AI EngineeringEnterprise AIFine-tuning
0 likes · 20 min read
Engineering Capabilities for Enterprise Large Model Applications: Prompt Engineering, RAG, and Fine‑Tuning
HelloTech
HelloTech
Apr 10, 2024 · Artificial Intelligence

An Overview of LangChain: Architecture, Core Components, and Code Examples

LangChain is an open‑source framework that provides Python and JavaScript SDKs, templates, and services such as LangServe and LangSmith to compose models, embeddings, prompts, indexes, memory, chains, and agents via a concise expression language, enabling rapid prototyping, debugging, and deployment of LLM‑driven applications.

AI EngineeringJavaScriptLLM
0 likes · 19 min read
An Overview of LangChain: Architecture, Core Components, and Code Examples
Alibaba Cloud Developer
Alibaba Cloud Developer
Apr 10, 2024 · Artificial Intelligence

Master LangChain in 10 Minutes: From Basics to Advanced AI Engineering

This guide walks AI engineers through a rapid 10‑minute boot‑strap of LangChain, explaining its purpose, core concepts, design questions, environment setup, and step‑by‑step code examples that cover APIs, chains, memory, retrieval‑augmented generation, tools, agents, and the overall architecture.

AI EngineeringLLMLangChain
0 likes · 28 min read
Master LangChain in 10 Minutes: From Basics to Advanced AI Engineering
NewBeeNLP
NewBeeNLP
Mar 21, 2024 · Artificial Intelligence

Mastering Large Language Model Training: Key Challenges and Optimization Strategies

This article examines the resource and efficiency challenges of scaling large language model training, explains data, model, pipeline, and tensor parallelism, and provides practical I/O, communication, and stability optimization techniques—including high‑availability storage, RDMA networking, NCCL tuning, and fault‑tolerant recovery—to improve throughput and reliability.

AI EngineeringDistributed TrainingI/O optimization
0 likes · 15 min read
Mastering Large Language Model Training: Key Challenges and Optimization Strategies
DataFunTalk
DataFunTalk
Mar 20, 2024 · Artificial Intelligence

Challenges and Optimization Techniques for Large Language Model Training

The article outlines the resource and efficiency challenges of scaling large language models, explains data and model parallelism strategies, and details practical I/O, communication, and stability optimizations—including high‑availability storage, RDMA networking, and fault‑tolerance measures—to improve training throughput and reliability.

AI EngineeringI/O optimizationLarge Language Models
0 likes · 13 min read
Challenges and Optimization Techniques for Large Language Model Training
Architecture & Thinking
Architecture & Thinking
Jan 14, 2024 · Artificial Intelligence

How Baidu Scales Content Understanding to Trillions of Pages with AI Engineering

This article explains how Baidu processes internet‑scale content by applying deep AI‑driven understanding, detailing cost‑optimization, efficiency improvements, model‑service frameworks, resource‑scheduling systems, and batch‑compute platforms that together enable trillion‑level indexing and feature extraction.

AI EngineeringBatch ComputingHTAP storage
0 likes · 16 min read
How Baidu Scales Content Understanding to Trillions of Pages with AI Engineering
21CTO
21CTO
Dec 15, 2023 · Artificial Intelligence

Why 2024 Will Be the Year of AI Engineers and LLM‑Driven Apps

The article outlines five major AI engineering trends for 2024—including the rise of AI engineers, evolving LLM tech stacks, open‑source large models, vector databases, and AI agents—highlighting how these shifts will reshape application development and industry competition.

2024 trendsAI EngineeringAI agents
0 likes · 9 min read
Why 2024 Will Be the Year of AI Engineers and LLM‑Driven Apps
Alibaba Cloud Big Data AI Platform
Alibaba Cloud Big Data AI Platform
Nov 8, 2023 · Big Data

How Big Data and AI Converge: Insights from Alibaba Cloud’s 2023 Conference

The talk outlines the evolution from model‑centric to data‑centric AI development, explains Alibaba Cloud’s integrated big data‑AI platform, showcases real‑world use cases like knowledge‑base QA and personalized recommendation, and details the underlying cloud‑native services that enable seamless data and AI collaboration.

AI EngineeringModel Training
0 likes · 16 min read
How Big Data and AI Converge: Insights from Alibaba Cloud’s 2023 Conference
DataFunSummit
DataFunSummit
Oct 7, 2023 · Artificial Intelligence

MLOps Implementation in Network Intelligence: Jiutian Platform Overview

This article presents the Jiutian Network Intelligence platform’s MLOps implementation at China Mobile, detailing its AI engineering workflow, platform functional and technical architecture, technology selections, model deployment, monitoring, and operational challenges, and shares insights on scaling AI services across 31 provinces.

AI EngineeringMLOpsNetwork Intelligence
0 likes · 20 min read
MLOps Implementation in Network Intelligence: Jiutian Platform Overview
Xiaohongshu Tech REDtech
Xiaohongshu Tech REDtech
Aug 25, 2023 · Artificial Intelligence

DataFunSummit 2023: Recommendation Systems Online Summit

The DataFunSummit 2023 online summit (August 26‑27) will explore eight recommendation‑system topics—including core and engineering architecture, model training/inference, large models, graphs, cold start, and multi‑task scenarios—featuring Xiaohongshu leaders who will present on graph‑based business architecture, integrated training‑inference pipelines, and user/content cold‑start strategies.

AI EngineeringRecommendation Systemsarchitecture
0 likes · 6 min read
DataFunSummit 2023: Recommendation Systems Online Summit
DataFunSummit
DataFunSummit
May 5, 2023 · Artificial Intelligence

Advances in Virtual Humans, Multimodal Technology, and General AI – Insights from OPPO

The article presents OPPO's latest research on virtual human audio‑lip and RGB driving, multimodal learning breakthroughs such as CETNETs and cross‑modal matching, and a reflective discussion on the challenges and future directions of general artificial intelligence, highlighting the interconnections among these three domains.

AI EngineeringGeneral AIMultimodal Learning
0 likes · 9 min read
Advances in Virtual Humans, Multimodal Technology, and General AI – Insights from OPPO
Efficient Ops
Efficient Ops
Jan 16, 2023 · Artificial Intelligence

How MLOps Is Transforming AI Production in China: Trends, Tools, and Standards

This report examines how MLOps is accelerating AI production in China, highlighting industry adoption across sectors, the booming tool ecosystem, the rise of feature platforms, enhanced observability, performance needs for large models, AI asset management, and the emerging national standards and evaluation results.

AI EngineeringAI StandardsFeatureOps
0 likes · 8 min read
How MLOps Is Transforming AI Production in China: Trends, Tools, and Standards
GuanYuan Data Tech Team
GuanYuan Data Tech Team
Dec 1, 2022 · Artificial Intelligence

Why MLOps Is the Key to Scalable AI Projects

This article explains the concept, significance, and practical case studies of MLOps—showing how integrating DevOps principles with data and machine learning creates reliable, automated pipelines for data quality, model monitoring, error analysis, and continuous integration, ultimately accelerating AI delivery.

AI EngineeringMLOpsMachine Learning Operations
0 likes · 15 min read
Why MLOps Is the Key to Scalable AI Projects
Alibaba Cloud Big Data AI Platform
Alibaba Cloud Big Data AI Platform
Nov 4, 2022 · Artificial Intelligence

How AI Platforms Turn Dreams into Reality: Scaling, Efficiency, and Usability

In this talk from the 2022 Yunqi Conference, Jia Yangqing explains how Alibaba's AI platform addresses efficiency, scale, and usability challenges by moving the Damo Academy to the cloud, open‑sourcing ModelScope, and delivering large‑model training, deployment, and inference services at massive scale.

AI EngineeringModel Scalingefficiency
0 likes · 10 min read
How AI Platforms Turn Dreams into Reality: Scaling, Efficiency, and Usability
Efficient Ops
Efficient Ops
Oct 26, 2022 · Artificial Intelligence

Unveiling China’s AI Model Delivery Standard: Boosting MLOps and AI Engineering

China’s 14th Five-Year Plan and 2035 Vision prioritize AI, prompting a shift from proof‑of‑concept to product deployment; the newly released Model Delivery standard, part of the Model/MLOps maturity model, defines five maturity levels and a reusable pipeline to boost AI engineering across industries.

AIAI EngineeringChina
0 likes · 5 min read
Unveiling China’s AI Model Delivery Standard: Boosting MLOps and AI Engineering
Efficient Ops
Efficient Ops
Apr 24, 2022 · Artificial Intelligence

How ModelOps and MLOps Accelerate AI Project Development

ModelOps and MLOps are transforming AI engineering by introducing continuous training, integration, and deployment, which streamline development cycles, standardize model management, and enable ongoing monitoring to enhance inference accuracy and maximize the business value generated by AI models.

AI EngineeringContinuous DeploymentMLOps
0 likes · 1 min read
How ModelOps and MLOps Accelerate AI Project Development