Tag

VKE

0 views collected around this technical thread.

ByteDance Cloud Native
ByteDance Cloud Native
Mar 7, 2025 · Artificial Intelligence

How to Deploy the QwQ-32B Large Language Model on Volcengine Cloud in Minutes

This guide walks you through the end‑to‑end process of deploying the open‑source QwQ‑32B inference model on Volcengine's cloud platform, covering GPU ECS selection, VKE cluster creation, continuous delivery CP setup, vLLM service launch, and API gateway exposure.

GPU ECSQwQ-32BVKE
0 likes · 8 min read
How to Deploy the QwQ-32B Large Language Model on Volcengine Cloud in Minutes
ByteDance Cloud Native
ByteDance Cloud Native
Aug 12, 2024 · Cloud Native

How to Deploy NVIDIA NIM AI Models on Volcengine VKE in Minutes

This guide walks you through deploying large language models with NVIDIA NIM on Volcengine's Kubernetes Engine (VKE), covering environment setup, model optimization, Helm chart deployment, monitoring integration, and the key advantages of using NIM as a cloud‑native AI micro‑service.

AI deploymentGPUHelm
0 likes · 12 min read
How to Deploy NVIDIA NIM AI Models on Volcengine VKE in Minutes