Tagged articles
16 articles
Page 1 of 1
AI Explorer
AI Explorer
Mar 26, 2026 · Artificial Intelligence

LiteLLM: A Unified Gateway for Over 100 LLM APIs

LiteLLM provides a single Python SDK and proxy server that let developers call more than 100 large‑language‑model APIs with a uniform OpenAI‑style interface, handling cost tracking, load balancing, rate limiting, and detailed logging to simplify multi‑model experimentation and production deployments.

AI gatewayLLM integrationLiteLLM
0 likes · 6 min read
LiteLLM: A Unified Gateway for Over 100 LLM APIs
Su San Talks Tech
Su San Talks Tech
Feb 28, 2026 · Artificial Intelligence

How to Install and Configure OpenClaw: A Self‑Hosted AI Assistant Gateway

This guide walks you through the overview, system requirements, step‑by‑step installation, custom model configuration, verification methods, core file locations, environment variables, hot‑reload options, common commands, and troubleshooting for OpenClaw, a self‑hosted AI assistant gateway.

AI gatewayCLIConfiguration
0 likes · 10 min read
How to Install and Configure OpenClaw: A Self‑Hosted AI Assistant Gateway
Alibaba Cloud Native
Alibaba Cloud Native
Nov 25, 2025 · Artificial Intelligence

AI‑Native Architecture Insights: Highlights from AgentX 2025 SECon

The AgentX 2025 SECon AI‑native application track, co‑hosted by Alibaba Cloud and the Institute of Information, delivered deep technical insights on AI‑native architecture, the AgentScope 1.0 framework, AI gateway capabilities, and observability‑driven reliability for long‑cycle agents, summarised here for practitioners.

AI gatewayAI-nativeAgentScope
0 likes · 7 min read
AI‑Native Architecture Insights: Highlights from AgentX 2025 SECon
Alibaba Cloud Native
Alibaba Cloud Native
Oct 28, 2025 · Artificial Intelligence

How SOFA AI Gateway Transforms Cloud‑Native AI Service Management

The article explains how the SOFA AI Gateway, built on the open‑source Higress kernel, evolves traditional API gateways into specialized AI gateways by adding intelligent routing, model proxy, agent proxy, and MCP market features to meet the unique latency, resource, and security demands of AI workloads.

AI gatewayCloud NativeHigress
0 likes · 12 min read
How SOFA AI Gateway Transforms Cloud‑Native AI Service Management
Alibaba Cloud Native
Alibaba Cloud Native
Aug 19, 2025 · Artificial Intelligence

Boost Dify AI App Performance with Higress AI Gateway: A Full-Scale High‑Availability Guide

This guide explains why Dify’s system components and model services become performance bottlenecks at scale, and how integrating the Higress AI gateway can provide protocol standardization, observability, security, and stability features to achieve full‑stack high availability for AI applications.

AI gatewayCloud NativeDify
0 likes · 16 min read
Boost Dify AI App Performance with Higress AI Gateway: A Full-Scale High‑Availability Guide
Alibaba Cloud Developer
Alibaba Cloud Developer
May 29, 2025 · Cloud Native

API Gateway vs API Management: Evolution, Differences, and AI Gateway Rise

This article traces the evolution of API gateways and API management from early traffic and microservice gateways to cloud-native and AI-focused solutions, compares their core responsibilities, roles, and technical foundations, and outlines how they can be integrated and what future trends, such as AI gateways and MCP servers, may bring to modern software architectures.

AI gatewayAPI ManagementCloud Native
0 likes · 16 min read
API Gateway vs API Management: Evolution, Differences, and AI Gateway Rise
Bilibili Tech
Bilibili Tech
May 9, 2025 · Artificial Intelligence

How an AI Gateway Scales LLM Services: Architecture, Auth, Quotas, and Load Balancing

This article explains the design of an AI gateway that centralizes LLM access, detailing its background, overall architecture, authentication, quota management, multi‑model routing, load‑balancing strategies, multi‑tenant isolation, observability features, and the supported API protocols for enterprise integration.

AI gatewayAuthenticationLLM
0 likes · 17 min read
How an AI Gateway Scales LLM Services: Architecture, Auth, Quotas, and Load Balancing
Alibaba Cloud Developer
Alibaba Cloud Developer
May 9, 2025 · Backend Development

How to Convert Alibaba HSF Services to MCP Servers Without Code Changes

This article explains how Alibaba engineers use the Higress AI gateway to offload MCP protocol and transform large‑scale HSF RPC services into MCP servers without modifying application code, covering challenges, required components, step‑by‑step migration, and practical insights for AI‑native infrastructure.

AI gatewayHSFHigress
0 likes · 9 min read
How to Convert Alibaba HSF Services to MCP Servers Without Code Changes
Alibaba Cloud Observability
Alibaba Cloud Observability
Mar 17, 2025 · Cloud Native

How to Master LLM Observability in Cloud‑Native Environments

This article explains the unique observability challenges of large language model (LLM) applications, outlines essential performance, cost, and safety metrics, and presents a comprehensive cloud‑native solution—including trace, metric, and log collection, domain‑specific dashboards, and step‑by‑step integration with Alibaba Cloud's Python Agent—to ensure reliable, efficient LLM deployments.

AI gatewayCloud NativeLLM Observability
0 likes · 18 min read
How to Master LLM Observability in Cloud‑Native Environments
Alibaba Cloud Native
Alibaba Cloud Native
Mar 7, 2025 · Artificial Intelligence

8 Real-World AI Gateway Use Cases Every Enterprise Should Know

This article outlines eight practical AI gateway scenarios—from multi‑model services and consumer authentication to token rate limiting, content safety, semantic caching, and observability—explaining the business needs behind each and how Alibaba Cloud's cloud‑native API gateway provides concrete technical solutions.

AI gatewayCloud NativeContent Safety
0 likes · 15 min read
8 Real-World AI Gateway Use Cases Every Enterprise Should Know
Alibaba Cloud Native
Alibaba Cloud Native
Jan 22, 2025 · Cloud Native

Seamlessly Migrate from OpenAI to DeepSeek with Higress AI Gateway

This guide explains how to install the Higress AI gateway, configure provider API keys, set up gray‑release routing between OpenAI and DeepSeek, use a Python client to call DeepSeek, and enable content security and observability features for safe, cost‑effective large‑model deployments.

AI gatewayContent SecurityDeepSeek
0 likes · 7 min read
Seamlessly Migrate from OpenAI to DeepSeek with Higress AI Gateway
Ops Development Stories
Ops Development Stories
Sep 19, 2024 · Artificial Intelligence

How to Connect Qwen LLMs with Higress AI Gateway: A Hands‑On Guide

This tutorial walks through setting up a local k3d cluster, installing Higress, and using its AI plugins—including AI Proxy, AI JSON formatter, AI Agent, and AI Statistics—to integrate and observe Alibaba Cloud's Qwen large language models across various use cases such as weather and flight queries.

AI gatewayAI pluginsHigress
0 likes · 30 min read
How to Connect Qwen LLMs with Higress AI Gateway: A Hands‑On Guide
21CTO
21CTO
Jun 7, 2024 · Artificial Intelligence

Why AI Gateways Are the Next Evolution of API Gateways

AI gateways have emerged as essential infrastructure for modern AI applications, offering specialized security, load balancing, cost management, and observability that go beyond traditional API gateways, and understanding their differences and deployment considerations is crucial for developers and ops teams.

AI InfrastructureAI gatewayCost Management
0 likes · 10 min read
Why AI Gateways Are the Next Evolution of API Gateways
Alibaba Cloud Native
Alibaba Cloud Native
May 15, 2024 · Cloud Native

Build a Cloud‑Native Playground to Compare GPT‑4o and Qwen‑2.5 with NextChat and Higress

This article walks through setting up a cloud‑native test environment using the open‑source NextChat UI and Higress API gateway to let Qwen‑2.5 masquerade as GPT‑4o, enabling a side‑by‑side comparison of their responses while showcasing Higress’s streaming, hot‑update, and security features for AI workloads.

AI gatewayDockerGPT-4o
0 likes · 8 min read
Build a Cloud‑Native Playground to Compare GPT‑4o and Qwen‑2.5 with NextChat and Higress