Author

Alibaba Cloud Developer

Alibaba's official tech channel, featuring all of its technology innovations.

1.9k

Articles

Likes

4.1k

Views

Comments

Latest from Alibaba Cloud Developer

100 recent articles max

Alibaba Cloud Developer

Jan 28, 2026 · Cloud Computing

Deploy Moltbot on Alibaba Cloud Lightweight Server in Minutes

This guide walks you through installing the Moltbot AI assistant on Alibaba Cloud's lightweight application server, configuring Baidu's Bailei API key, opening required ports, generating access tokens, and accessing the chat interface, plus a quick overview of the pre‑installed Shadow Cloud Computer image.

Alibaba CloudLightweight ServerMoltbot

0 likes · 3 min read

Deploy Moltbot on Alibaba Cloud Lightweight Server in Minutes

Alibaba Cloud Developer

Jan 28, 2026 · Artificial Intelligence

How We Built a High‑Performance AI Rental Advisor with One‑Model Tool‑Use and Reinforcement Learning

This article details the design, challenges, and performance gains of an AI‑driven rental recommendation system that replaces a multi‑agent architecture with a single LLM using dynamic tool‑use, introduces a two‑stage reinforcement‑learning pipeline, and achieves sub‑second latency and higher accuracy for complex rental scenarios.

AI recommendationTool Uselarge language model

0 likes · 19 min read

How We Built a High‑Performance AI Rental Advisor with One‑Model Tool‑Use and Reinforcement Learning

Alibaba Cloud Developer

Jan 27, 2026 · Artificial Intelligence

Why AI Coding Assistants Quit Early and How Ralph Loop Keeps Them Working

The article examines common pitfalls of AI programming assistants—premature exits, fragile single‑prompt workflows, costly re‑prompting, and context loss—explains that these stem from unreliable LLM self‑evaluation, and introduces the Ralph Loop (also called Ralph Wiggum Loop) as a robust, stop‑hook‑driven solution that forces continuous iteration until explicit, verifiable completion criteria are met.

AI codingAgent LoopAutomation

0 likes · 28 min read

Why AI Coding Assistants Quit Early and How Ralph Loop Keeps Them Working

Alibaba Cloud Developer

Jan 26, 2026 · Artificial Intelligence

How We Scaled a 3.5B MoE LLM for Real‑Time Search Relevance

This article details the engineering challenges and solutions for deploying a 3.5 billion‑parameter MoE LLM in Taobao's search relevance pipeline, covering large‑batch scheduling, dynamic load balancing, intra‑batch KV‑Cache reuse, and MoE kernel tuning to meet sub‑second latency requirements.

KV cacheLLMMoE

0 likes · 15 min read

How We Scaled a 3.5B MoE LLM for Real‑Time Search Relevance

Alibaba Cloud Developer

Jan 22, 2026 · Artificial Intelligence

How to Build a Prompt‑Optimizer Skill for Claude Using AI Coding Tools

This article explains the common difficulties of writing effective prompts, introduces the open‑source prompt‑optimizer skill that automatically selects and refines professional prompt frameworks, and provides step‑by‑step instructions for preparing materials, packaging the skill for Claude and Chrome, and installing it via GitHub repositories.

AI codingClaudeGitHub

0 likes · 10 min read

How to Build a Prompt‑Optimizer Skill for Claude Using AI Coding Tools

Alibaba Cloud Developer

Jan 19, 2026 · Artificial Intelligence

Can Meta Ray‑Ban Glasses Become an AI Traffic Cop with Alibaba Cloud Serverless?

This article walks through turning a Meta Ray‑Ban AR headset into an AI‑powered traffic enforcement prototype by combining a lightweight iOS client, Alibaba Cloud Function Compute AgentRun, prompt engineering, and serverless tools to perform OCR, rule checking, and real‑time feedback.

AIAgentRunAlibaba Cloud

0 likes · 16 min read

Can Meta Ray‑Ban Glasses Become an AI Traffic Cop with Alibaba Cloud Serverless?

Alibaba Cloud Developer

Jan 19, 2026 · Backend Development

Mastering OpenSpec: A CLI Guide to Spec‑Driven Development and AI Integration

This article explains the history, core concepts, and practical CLI commands of OpenSpec, a spec‑driven development tool that integrates with AI assistants, detailing project initialization, workflow steps, quality control, directory structure, common pitfalls, and comparisons with other specification frameworks to help developers adopt a more efficient, documentation‑first workflow.

AICLIOpenSpec

0 likes · 30 min read

Mastering OpenSpec: A CLI Guide to Spec‑Driven Development and AI Integration

Alibaba Cloud Developer

Jan 16, 2026 · Artificial Intelligence

How Quest Achieves Autonomous Programming with Agentic Architecture

Quest redesigns long‑running task execution by combining model capability, context management, dynamic reminders, and a minimal Bash‑centric toolset into a closed‑loop Agent architecture that lets AI generate, verify, and deliver complete software artifacts without constant human intervention.

AIautonomous programmingcoding automation

0 likes · 12 min read

How Quest Achieves Autonomous Programming with Agentic Architecture

Alibaba Cloud Developer

Jan 16, 2026 · Artificial Intelligence

Mastering Anthropic Skills: Fast, High‑Quality Skill Development Guide

This article explains what Anthropic Skills are, compares them with MCP, and provides a step‑by‑step, best‑practice workflow—including repository setup, OpenSkills commands, naming conventions, and executable skill patterns—to help developers quickly create robust, efficient Skills for Claude and other AI agents.

AI AgentsAnthropicOpenSkills

0 likes · 15 min read

Mastering Anthropic Skills: Fast, High‑Quality Skill Development Guide

Alibaba Cloud Developer

Jan 15, 2026 · Artificial Intelligence

How Hierarchical Sparse Attention Breaks KVCache Limits for Ultra‑Long Context LLMs

This article explains how a hierarchical sparse‑attention framework redesigns KVCache storage across GPU, CPU, and remote memory, eliminates bandwidth and capacity bottlenecks, and enables efficient inference for 128K‑token and larger contexts with dramatically reduced GPU memory usage and higher throughput.

Dynamic Sparse AttentionGPU memory optimizationHierarchical Storage

0 likes · 20 min read

How Hierarchical Sparse Attention Breaks KVCache Limits for Ultra‑Long Context LLMs