Baidu Tech Salon
Mar 29, 2023 · Artificial Intelligence
Punica System: Enhancing AI Inference Service Efficiency Through FaaS Architecture
The Punica system unifies AI inference development, testing, deployment, and maintenance on a FaaS‑based one‑stop platform that automates resource scheduling, self‑healing, and monitoring, supporting multiple frameworks and GPUs, thereby doubling onboarding speed, quintuple scaling efficiency, and reclaiming hundreds of GPU cards.
AI inferenceCloud NativeContainer Framework
0 likes · 13 min read