Alibaba Cloud Developer
Mar 10, 2025 · Artificial Intelligence

Seamlessly Switch Between DeepSeek‑R1 and QwQ‑32B with Higress AI Gateway

Learn how to deploy the new QwQ‑32B reasoning model alongside DeepSeek‑R1 using the Higress AI gateway. The guide covers environment setup, model configuration, routing, token‑level rate limiting, content safety, semantic caching, and advanced features such as automatic fallback and internet‑search integration.

DeepSeek · Higress · LLM integration
16 min read
Alibaba Cloud Native
Mar 7, 2025 · Artificial Intelligence

8 Real-World AI Gateway Use Cases Every Enterprise Should Know

This article outlines eight practical AI gateway scenarios—from multi‑model services and consumer authentication to token rate limiting, content safety, semantic caching, and observability—explaining the business needs behind each and how Alibaba Cloud's cloud‑native API gateway provides concrete technical solutions.

AI gateway · Content Safety · Model Management
15 min read