Alibaba Cloud Developer
Author

Alibaba Cloud Developer

Alibaba's official tech channel, featuring all of its technology innovations.

1.9k
Articles
0
Likes
665
Views
0
Comments
Recent Articles

Latest from Alibaba Cloud Developer

100 recent articles max
Alibaba Cloud Developer
Alibaba Cloud Developer
Dec 29, 2025 · Artificial Intelligence

How Alibaba’s Tair KVCache Manager Revolutionizes Enterprise‑Level LLM Cache Management

This article details the architecture and implementation of Tair KVCache Manager, an enterprise‑grade service that centralises KVCache metadata, decouples inference engines from storage, provides elastic scaling, multi‑tenant isolation, high availability, and performance‑optimised cache management for large‑scale LLM inference workloads.

Cache ManagementKVCacheLLM
0 likes · 28 min read
How Alibaba’s Tair KVCache Manager Revolutionizes Enterprise‑Level LLM Cache Management
Alibaba Cloud Developer
Alibaba Cloud Developer
Dec 26, 2025 · Artificial Intelligence

How AutoContextMemory Cuts LLM Costs by 70% in Long Conversations

This article explains the challenges of token explosion in long‑running AI agent dialogues and introduces AutoContextMemory, a Java component that automatically compresses, offloads, and summarizes conversation history to dramatically reduce token usage, speed up responses, and preserve critical information.

AgentScopeContext ManagementJava
0 likes · 12 min read
How AutoContextMemory Cuts LLM Costs by 70% in Long Conversations
Alibaba Cloud Developer
Alibaba Cloud Developer
Dec 26, 2025 · Artificial Intelligence

How to Build a Fully Automated Knowledge‑Extraction Pipeline for AI Agents with Python

This article presents a complete end‑to‑end pipeline that automatically extracts, generalizes, incrementally updates, and vector‑syncs knowledge from diverse sources such as tickets, documents, and SQL code, turning the traditionally labor‑intensive knowledge‑base construction for agents into a low‑effort, continuously maintainable Python‑driven solution.

Knowledge ExtractionLLMPython
0 likes · 15 min read
How to Build a Fully Automated Knowledge‑Extraction Pipeline for AI Agents with Python
Alibaba Cloud Developer
Alibaba Cloud Developer
Dec 24, 2025 · Artificial Intelligence

Boosting LLM Inference: RoleBasedGroup & Mooncake for Stable, High‑Performance Service

Large language model inference faces memory pressure, but by externalizing KVCache with Mooncake and orchestrating roles via the Kubernetes‑native RoleBasedGroup (RBG), developers can achieve stable, high‑throughput, cost‑effective serving with seamless in‑place upgrades and topology‑aware performance.

AI infrastructureKVCacheKubernetes
0 likes · 21 min read
Boosting LLM Inference: RoleBasedGroup & Mooncake for Stable, High‑Performance Service
Alibaba Cloud Developer
Alibaba Cloud Developer
Dec 24, 2025 · Artificial Intelligence

Why CLI AI Tools Matter: Aesthetics, Single vs Multi-Agent Design, Tips

This article explores the resurgence of command‑line AI coding tools, examining their product aesthetics rooted in Unix philosophy, the technical contrast between single‑agent and multi‑agent architectures, practical usage scenarios, context‑engineering techniques, and why a lightweight single‑agent core often outperforms more complex multi‑agent systems.

Agent architectureContext Engineeringsoftware development
0 likes · 17 min read
Why CLI AI Tools Matter: Aesthetics, Single vs Multi-Agent Design, Tips
Alibaba Cloud Developer
Alibaba Cloud Developer
Dec 23, 2025 · Artificial Intelligence

How Hybrid Transformer‑Mamba Architectures Overcome KVCache Challenges in Large‑Model Inference

This article explains how SGLang’s hybrid model design combines Transformer attention with Mamba state‑space layers, introduces a dual‑pool memory architecture and elastic allocation, and presents specialized prefix‑cache and speculative‑decoding techniques that together enable efficient, scalable inference for long‑context large language models.

KVCacheSGLanghybrid models
0 likes · 22 min read
How Hybrid Transformer‑Mamba Architectures Overcome KVCache Challenges in Large‑Model Inference
Alibaba Cloud Developer
Alibaba Cloud Developer
Dec 23, 2025 · Backend Development

How We Leveraged JVM Agents and JaCoCo to Clean Up Legacy Java Code

This article explains how a long‑standing Java backend service was instrumented with JVM agents and JaCoCo to collect execution coverage, visualize results in an IntelliJ IDEA plugin, and systematically remove dead code, improving maintainability while minimizing impact on production performance.

IDEA pluginJVM agentJaCoCo
0 likes · 24 min read
How We Leveraged JVM Agents and JaCoCo to Clean Up Legacy Java Code
Alibaba Cloud Developer
Alibaba Cloud Developer
Dec 22, 2025 · Artificial Intelligence

Deploy Multi‑Agent AI Apps with AgentScope on Alibaba Cloud Kubernetes

This guide explains how to use Alibaba Cloud's AgentScope framework and Container Service to build, orchestrate, and deploy enterprise‑grade AI agents, covering background, core features, step‑by‑step deployment, sandbox integration, and best‑practice recommendations for cloud‑native AI workloads.

AI AgentAgentScopeAlibaba Cloud
0 likes · 20 min read
Deploy Multi‑Agent AI Apps with AgentScope on Alibaba Cloud Kubernetes
Alibaba Cloud Developer
Alibaba Cloud Developer
Dec 22, 2025 · Artificial Intelligence

Turning Real‑Time Hotspot Detection into AI‑Powered E‑Commerce Recommendations

Traditional recommendation systems lag behind fast‑moving external trends, missing the freshness and surprise users crave. This article details an end‑to‑end AI pipeline that perceives, understands, and reacts to hotspots within hours, automatically generating high‑quality product selections and continuously optimizing through feedback loops.

AI recommendationAutomationLLM
0 likes · 25 min read
Turning Real‑Time Hotspot Detection into AI‑Powered E‑Commerce Recommendations