Tagged articles
48 articles
Page 1 of 1
Architect
Architect
May 20, 2026 · Artificial Intelligence

How to Turn a Single Hermes Agent into a Fully Operable System

The article walks through converting a chat‑based Hermes Agent into a maintainable, hand‑off‑ready system by building a control room, defining clear runtime and management files, applying security safeguards, and following a step‑by‑step production pipeline.

AI OpsAgent Control RoomAutomation
0 likes · 22 min read
How to Turn a Single Hermes Agent into a Fully Operable System
DataFunSummit
DataFunSummit
May 16, 2026 · Industry Insights

What Powers Palantir’s 137% Revenue Surge? Inside Its Ontology‑Based Enterprise AI Platform

Palantir’s Q4 2025 revenue jumped 70% to $14.07 billion, with U.S. commercial revenue soaring 137%, driven not merely by AI hype but by its Ontology‑centric approach that tightly integrates data, business logic, actions, and security, locking large enterprises into a deeply embedded decision‑making stack.

AI OpsCase StudiesData Integration
0 likes · 9 min read
What Powers Palantir’s 137% Revenue Surge? Inside Its Ontology‑Based Enterprise AI Platform
ITPUB
ITPUB
Apr 16, 2026 · Industry Insights

Why Harness Engineering Is Redefining AI Agent Development in 2026

The article traces the rapid rise of AI variants such as OpenClaw, Hermes, and Harness, explains how the industry shifted from model competitions to engineering deployment, outlines a 2022‑2026 timeline of breakthroughs, and argues that Harness is the essential “harness” that turns powerful models into reliable, productive agents.

AI OpsAgentHarness
0 likes · 11 min read
Why Harness Engineering Is Redefining AI Agent Development in 2026
Alibaba Cloud Native
Alibaba Cloud Native
Apr 8, 2026 · Operations

How HiClaw Transforms SRE with Multi‑Agent Collaboration in Cloud‑Native Environments

The article details how the HiClaw distributed multi‑agent platform is built and organized for SRE teams, explains the roles of human users and digital bots, describes permission design, showcases fault‑diagnosis and release scenarios, and evaluates the efficiency and innovation gains of this cloud‑native automation approach.

AI OpsAutomationCloud Native
0 likes · 14 min read
How HiClaw Transforms SRE with Multi‑Agent Collaboration in Cloud‑Native Environments
Architect
Architect
Mar 28, 2026 · Artificial Intelligence

Why AI Agents Need a Harness: From Model Power to System Reliability

The article analyzes how the growing strength of large language models shifts engineering bottlenecks from model capabilities to system stability, introducing the concept of a "Harness" that integrates models into real‑world workflows through state management, constraints, feedback loops, and verification mechanisms.

AI EngineeringAI OpsAgent Harness
0 likes · 18 min read
Why AI Agents Need a Harness: From Model Power to System Reliability
DevOps Coach
DevOps Coach
Mar 27, 2026 · Operations

Can Four LLM‑Powered Agents Build a Real Kubernetes Cluster Without Human Help?

An experiment with four LLM‑driven autonomous agents—Architect, Builder, Security Sentinel, and QA Tester—attempted to provision a Proxmox‑based HA Kubernetes cluster using real hardware, revealing costly context drift, emergent coordination failures, and stark differences between Gemini and Claude in diagnosing infrastructure‑as‑code errors.

AI OpsAnsibleAutonomous SRE
0 likes · 14 min read
Can Four LLM‑Powered Agents Build a Real Kubernetes Cluster Without Human Help?
AI Programming Lab
AI Programming Lab
Mar 26, 2026 · Artificial Intelligence

LLMs to the Left, Harness Engineering to the Right: Bridging the Gap

The article argues that the real bottleneck for LLM‑driven agents is not model capability but the surrounding control system—Harness Engineering—which can dramatically boost success rates, reduce failure cascades, and become the lasting moat for AI productivity.

AI OpsAgent HarnessContext Engineering
0 likes · 14 min read
LLMs to the Left, Harness Engineering to the Right: Bridging the Gap
Alibaba Cloud Native
Alibaba Cloud Native
Mar 22, 2026 · Artificial Intelligence

Revolutionizing AI‑Driven Operation Intelligence with AutoDA‑Timeseries, SemanticLog, and LogBase

The article outlines three core challenges—semantic gaps, poor generalization, and industrial usability—in operation intelligence and presents three academic breakthroughs—AutoDA‑Timeseries, SemanticLog, and LogBase—that together advance AI‑powered monitoring, log parsing, and large‑scale benchmarking for smarter, more efficient cloud operations.

AI OpsAutoDABenchmark
0 likes · 9 min read
Revolutionizing AI‑Driven Operation Intelligence with AutoDA‑Timeseries, SemanticLog, and LogBase
Architect
Architect
Mar 13, 2026 · Artificial Intelligence

Why Claude Code Fails Without Proper Governance and How to Build a Stable Agentic Coding System

The article explains that Claude Code’s core challenges lie not in prompts but in treating it as a verifiable, governed, layered agent system, and provides a detailed six‑layer architecture, practical governance tips, and step‑by‑step guidance for teams to achieve stable, production‑grade AI‑assisted coding.

AI OpsAgentic AIClaude Code
0 likes · 30 min read
Why Claude Code Fails Without Proper Governance and How to Build a Stable Agentic Coding System
AI Architecture Hub
AI Architecture Hub
Feb 27, 2026 · Artificial Intelligence

Mastering AI Agents in 2026: A Four‑Layer Blueprint for Stable Deployment

This article breaks down Anthropic's four‑layer AI Agent architecture, explains when multi‑Agent setups are worthwhile, details how to design reusable Skills and a standardized MCP connection protocol, and provides a practical checklist and a ready‑to‑use Skill template for immediate implementation.

AI OpsAgent ArchitectureModel Context Protocol
0 likes · 16 min read
Mastering AI Agents in 2026: A Four‑Layer Blueprint for Stable Deployment
大转转FE
大转转FE
Feb 2, 2026 · Artificial Intelligence

Inside Moltbot’s Core Architecture, AI Memory Systems, and ToolRL Advances

This edition of the ZuanZuan Frontend Weekly curates five in‑depth articles covering Moltbot’s underlying gateway architecture, the explosive growth of Moltbook AI agents, practical integration of Alibaba Cloud RDS AI assistants, the design of short‑ and long‑term AI Agent memory systems, and a two‑stage ToolRL approach that dramatically improves AI‑driven recommendation performance.

AI ArchitectureAI OpsAgent Memory
0 likes · 7 min read
Inside Moltbot’s Core Architecture, AI Memory Systems, and ToolRL Advances
DevOps Engineer
DevOps Engineer
Nov 25, 2025 · Operations

Why DevOps Will Still Be Essential in 2026 and Beyond

The article explains that despite evolving titles and tools, DevOps remains a vital culture and engineering practice through 2026, highlighting its core principles, the rise of platform engineering, essential skills, the impact of AI, and the growing importance of security roles.

AI Opscloud computingplatform engineering
0 likes · 7 min read
Why DevOps Will Still Be Essential in 2026 and Beyond
Efficient Ops
Efficient Ops
Oct 27, 2025 · Operations

How AI is Revolutionizing Observability and Intelligent Operations

At the GOPS Global Operations Conference in Shanghai, experts from finance, technology and energy sectors examined the challenges of observability, AIOps and intelligent agents, proposing metric standardization, digital‑twin fault simulation, and AI‑driven DevOps as key steps toward scalable, business‑value‑focused intelligent operations.

AI OpsDigital TwinIntelligent Operations
0 likes · 6 min read
How AI is Revolutionizing Observability and Intelligent Operations
Architects Research Society
Architects Research Society
Sep 6, 2025 · Artificial Intelligence

From Hype to Engineered AI: The Core Architecture Behind Modern AI Apps

This article breaks down the essential components of production‑grade AI applications, covering the intelligent core (model, orchestration, memory), enterprise‑level supporting infrastructure, and critical governance, security, and data‑integrity measures required for reliable AI systems.

AI ArchitectureAI OpsLLM Orchestration
0 likes · 4 min read
From Hype to Engineered AI: The Core Architecture Behind Modern AI Apps
Alibaba Cloud Infrastructure
Alibaba Cloud Infrastructure
Aug 22, 2025 · Artificial Intelligence

Building Scalable AI Infrastructure: Insights from Alibaba Cloud’s AI Tech Day

The AI Infra Solutions and Best Practices salon held by Alibaba Cloud in Beijing gathered technical leaders from leading AI companies to share comprehensive strategies on network, compute, and storage architectures that enable high‑efficiency, low‑latency, and elastic AI infrastructure for modern enterprise workloads.

AI InfrastructureAI OpsStorage Solutions
0 likes · 7 min read
Building Scalable AI Infrastructure: Insights from Alibaba Cloud’s AI Tech Day
Efficient Ops
Efficient Ops
Jul 21, 2025 · Operations

30 Must‑Have DevOps Skills to Boost Your Resume in 2025

This article outlines 30 essential DevOps competencies—from foundational infrastructure and cloud/container orchestration to automation, monitoring, security, and AI‑driven operations—detailing key technologies, real‑world scenarios, and measurable impact, helping professionals craft a standout resume in the evolving operations landscape.

AI OpsAutomationDevOps
0 likes · 8 min read
30 Must‑Have DevOps Skills to Boost Your Resume in 2025
dbaplus Community
dbaplus Community
Jul 17, 2025 · Operations

How AI Agents Are Replacing DevOps Engineers at AWS – Real Metrics & Tools

A senior AWS solutions architect revealed that after automating about 90% of its infrastructure, AI agents now handle Terraform fixes, predictive Kubernetes scaling, and even cloud‑cost negotiations, prompting a month‑long investigation that uncovered striking internal metrics, open‑source tools, and practical guidance for engineers.

AI OpsAWSKubeGPT
0 likes · 6 min read
How AI Agents Are Replacing DevOps Engineers at AWS – Real Metrics & Tools
ITPUB
ITPUB
Jul 7, 2025 · Operations

How to Build a DeepSeek AI Ops Platform: Architecture & Implementation

This article presents a comprehensive blueprint for constructing a DeepSeek-powered AI Ops platform, detailing the six‑module architecture, data collection stack, AI engine deployment options, application and interaction layers, implementation road‑map, model training, security measures, cost estimates, and risk mitigation strategies.

AI OpsDeepSeekInfrastructure as Code
0 likes · 8 min read
How to Build a DeepSeek AI Ops Platform: Architecture & Implementation
Efficient Ops
Efficient Ops
Jul 1, 2025 · Operations

Inside Lenovo CloudOps: AI‑Driven Ops, LLMOps & FinOps Insights

The Lenovo Smart Cloud CloudOps session at the 26th GOPS Global Operations Conference showcased five deep‑dive topics—including large‑model‑powered intelligent operations, enterprise LLMOps, FinOps‑driven cost governance, cross‑region distributed ops, and SAP global ops—offering practical pathways for enterprises to accelerate their intelligent transformation.

AI OpsDistributed OperationsFinOps
0 likes · 8 min read
Inside Lenovo CloudOps: AI‑Driven Ops, LLMOps & FinOps Insights
360 Zhihui Cloud Developer
360 Zhihui Cloud Developer
Jun 27, 2025 · Operations

How AI‑Powered Ops‑Nexus Transforms Intelligent Operations for 100k+ Servers

This article details the design, technology choices, functional modules, core implementation, performance optimizations, and future roadmap of Ops‑Nexus, an AI‑driven intelligent operations platform that streamlines alarm analysis, log processing, and host health checks for large‑scale monitoring environments.

AI OpsIntelligent OperationsLLM
0 likes · 12 min read
How AI‑Powered Ops‑Nexus Transforms Intelligent Operations for 100k+ Servers
dbaplus Community
dbaplus Community
Jun 26, 2025 · Operations

How AI Can Transform Kubernetes Operations: 10 Smart Use Cases

This article explores ten practical AI‑driven scenarios for Kubernetes operations—including intelligent monitoring, automated scaling, log analysis, fault repair, resource optimization, CI/CD automation, security checks, knowledge‑base assistance, capacity planning, and an ops assistant—detailing methods, tools, and implementation tips.

AI OpsAutomationKubernetes
0 likes · 12 min read
How AI Can Transform Kubernetes Operations: 10 Smart Use Cases
Efficient Ops
Efficient Ops
Apr 22, 2025 · Operations

How AI Agents Are Transforming IT Operations and Fault Management

This article explores how AI agents powered by large models can predict failures, perform root‑cause analysis, enhance knowledge‑based Q&A, automate change releases, and enable intelligent decision‑making, dramatically improving efficiency and reliability in modern IT operations.

AI OpsRoot Cause Analysisfault prediction
0 likes · 7 min read
How AI Agents Are Transforming IT Operations and Fault Management
dbaplus Community
dbaplus Community
Apr 21, 2025 · Operations

Turn Zabbix Alerts into AI‑Powered Insights with DeepSeek

This guide shows how to integrate Zabbix with a locally deployed DeepSeek large language model via Webhook, enabling automatic analysis of alerts, generation of root‑cause explanations and remediation suggestions, and delivering results through WeChat bots, dashboards, or email to reduce MTTR and manual effort.

AI OpsAlert AutomationDeepSeek
0 likes · 4 min read
Turn Zabbix Alerts into AI‑Powered Insights with DeepSeek
dbaplus Community
dbaplus Community
Mar 17, 2025 · Operations

Designing an AI‑Powered Ops Platform with DeepSeek: Architecture, Modules, and Implementation

This article outlines a comprehensive AI‑Ops solution built on DeepSeek, covering its technical architecture, data collection stack, AI engine deployment, key functional modules, implementation roadmap, model training, security design, cost estimates, and risk mitigation strategies for modern operations teams.

AI OpsDeepSeekInfrastructure Automation
0 likes · 7 min read
Designing an AI‑Powered Ops Platform with DeepSeek: Architecture, Modules, and Implementation
phodal
phodal
Jun 27, 2023 · Artificial Intelligence

Designing an LLM‑Powered Architecture: The ArchGuard Co‑mate Reference Model

This article presents a detailed reference architecture for building LLM‑driven applications, using the ArchGuard Co‑mate project to illustrate layered design, local model integration, DSL‑based orchestration, and streaming LLM interfaces, complete with code examples and practical implementation notes.

AI OpsKotlinLLM
0 likes · 10 min read
Designing an LLM‑Powered Architecture: The ArchGuard Co‑mate Reference Model
Architect
Architect
Feb 25, 2023 · Cloud Native

Deploying a K8s ChatGPT Bot with Robusta: A Step‑by‑Step Guide

This article walks through installing Robusta, configuring Slack integration, adding Helm repositories, deploying the Robusta platform on a Kubernetes cluster, creating a crash‑loop pod to trigger alerts, and interacting with a ChatGPT bot to automatically troubleshoot Prometheus alerts, providing complete code snippets and screenshots for each step.

AI OpsChatGPTKubernetes
0 likes · 12 min read
Deploying a K8s ChatGPT Bot with Robusta: A Step‑by‑Step Guide
Tencent Cloud Developer
Tencent Cloud Developer
Aug 12, 2020 · Databases

How Autonomous Databases Evolve: From Stone Age to AI‑Driven Self‑Healing

This article traces the evolution of database autonomy from manual, knowledge‑driven operations through tool‑assisted and expert‑level stages to cloud‑native intelligent services, and details Tencent's DBbrain platform, its architecture, performance‑optimization, security, monitoring, cost‑based analysis, and future self‑healing capabilities.

AI OpsCloud DatabasesDBbrain
0 likes · 29 min read
How Autonomous Databases Evolve: From Stone Age to AI‑Driven Self‑Healing
dbaplus Community
dbaplus Community
Mar 9, 2020 · Artificial Intelligence

How LSTM‑Powered Real‑Time Alerting with Spark Streaming Boosts Ops Efficiency

This article details a deep‑learning‑driven, real‑time alert system that combines TensorFlow LSTM time‑series forecasting with Spark Streaming to achieve high‑coverage, low‑latency anomaly detection for large‑scale data‑ops environments, including data preprocessing, metric classification, model training, and deployment pipelines.

AI OpsLSTMSpark Streaming
0 likes · 18 min read
How LSTM‑Powered Real‑Time Alerting with Spark Streaming Boosts Ops Efficiency
Efficient Ops
Efficient Ops
Mar 14, 2019 · Cloud Native

How Alibaba Automates Cloud‑Native Operations at Massive Scale

This article explains Alibaba's intelligent, automated approach to managing large‑scale cloud‑native applications, covering challenges of scale, safety, and efficiency, and how AI‑driven decision making improves stability while reducing operational costs.

AI OpsAlibabacloud automation
0 likes · 8 min read
How Alibaba Automates Cloud‑Native Operations at Massive Scale
58 Tech
58 Tech
Feb 21, 2019 · Artificial Intelligence

Threshold‑Free Business Metric Monitoring Using Machine Learning

This article describes how a machine‑learning‑driven monitoring system replaces fixed thresholds with personalized, anomaly‑based detection for business‑level metrics such as network traffic and access volume, detailing the architecture, sample labeling, model training, alarm grading, and operational benefits.

AI Opsalarm gradinganomaly detection
0 likes · 8 min read
Threshold‑Free Business Metric Monitoring Using Machine Learning
Alibaba Cloud Infrastructure
Alibaba Cloud Infrastructure
Sep 21, 2018 · Operations

Intelligent Operations Sessions at the 2018 Hangzhou Yunqi Conference

The 2018 Hangzhou Yunqi Conference featured a series of expert talks on intelligent operations, covering Alibaba's AI‑driven maintenance systems, robust supply‑chain optimization, data‑center automation, MSP transformation, and AI‑Ops practices, providing actionable insights for large‑scale infrastructure management.

AI OpsAlibabaData center
0 likes · 12 min read
Intelligent Operations Sessions at the 2018 Hangzhou Yunqi Conference
Efficient Ops
Efficient Ops
Apr 26, 2018 · Operations

How 360 Detects Network Anomalies with AI‑Powered Time‑Series Algorithms

This article explains how 360’s network operations team uses time‑series analysis, statistical thresholds, EWMA, dynamic limits, and machine‑learning models such as K‑Means and Isolation Forest to automatically detect, locate, and remediate traffic anomalies across massive data‑center exits.

AI OpsNetwork MonitoringTime Series
0 likes · 15 min read
How 360 Detects Network Anomalies with AI‑Powered Time‑Series Algorithms
Tencent Cloud Developer
Tencent Cloud Developer
Mar 29, 2018 · Artificial Intelligence

How AI Powers a Smart Ops Bot for Seamless Dev‑Ops Collaboration

This article explains the motivation behind the growing gap between developers and operations, introduces Tencent Cloud's AI‑driven intelligent operations robot, outlines its core features, typical use cases, and dives into the retrieval‑based dialogue system and matching models that enable natural‑language interactions.

AI OpsChatbotDevOps
0 likes · 13 min read
How AI Powers a Smart Ops Bot for Seamless Dev‑Ops Collaboration
Alibaba Cloud Developer
Alibaba Cloud Developer
Dec 27, 2017 · Databases

How Alibaba’s Next‑Gen Database Powered Double 11: Elasticity, Cloud & AI

Alibaba’s database team explains how their next‑generation X‑DB system achieved extreme elasticity, high performance, and cost efficiency during the Double 11 shopping festival by leveraging cloud‑native hybrid deployment, containerization, storage‑compute separation, Paxos‑based consistency, and AI‑driven self‑optimizing DBA tools, while outlining key challenges and solutions.

AI OpsDistributed SystemsPerformance Optimization
0 likes · 12 min read
How Alibaba’s Next‑Gen Database Powered Double 11: Elasticity, Cloud & AI