Tagged articles
29 articles
Page 1 of 1
Raymond Ops
Raymond Ops
Nov 4, 2025 · Artificial Intelligence

How to Deploy GPUStack with Docker for Scalable AI Model Serving

This guide walks you through installing NVIDIA drivers and Docker, configuring the NVIDIA Container Toolkit, and deploying GPUStack in Docker to manage heterogeneous GPU resources, run large language, multimodal, diffusion, and embedding models, and scale from a single node to a multi‑node GPU cluster.

AI Model DeploymentDockerGPU cluster
0 likes · 15 min read
How to Deploy GPUStack with Docker for Scalable AI Model Serving
Alibaba Cloud Big Data AI Platform
Alibaba Cloud Big Data AI Platform
Sep 25, 2025 · Artificial Intelligence

Unlocking Trillion‑Parameter MoE Models: Expert Parallelism and Alibaba Cloud PAI‑EAS Deployment Guide

This article explains the opportunities and challenges of Mixture of Experts (MoE) models, introduces expert parallelism as a solution to scaling and deployment bottlenecks, and provides a step‑by‑step guide for deploying MoE models with Alibaba Cloud PAI‑EAS, including configuration tips and code examples.

AI Model DeploymentExpert ParallelismMoE
0 likes · 11 min read
Unlocking Trillion‑Parameter MoE Models: Expert Parallelism and Alibaba Cloud PAI‑EAS Deployment Guide
Alibaba Cloud Infrastructure
Alibaba Cloud Infrastructure
Sep 5, 2025 · Cloud Native

How OCI‑Based ModelDistribution Simplifies AI Model Deployment Across Regions

This article explains how Alibaba Cloud ACK One's ModelDistribution leverages OCI images to standardize, version, and efficiently distribute large AI models across multiple Kubernetes clusters worldwide, addressing challenges of storage, deployment speed, and pre‑warming for rapid inference services.

AI Model DeploymentKubernetesModelDistribution
0 likes · 9 min read
How OCI‑Based ModelDistribution Simplifies AI Model Deployment Across Regions
Ops Development Stories
Ops Development Stories
Jun 12, 2025 · Cloud Native

One-Click GPU-Enabled Kind Cluster Setup for Running Large AI Models

This tutorial walks you through using a one‑click script to create a GPU‑enabled Kind Kubernetes cluster, evenly distribute GPU resources across nodes with nvkind, install necessary drivers and toolkits, deploy a vLLM‑served large language model, and verify its operation, all on a local or cloud environment.

AI Model DeploymentDockerGPU
0 likes · 23 min read
One-Click GPU-Enabled Kind Cluster Setup for Running Large AI Models
MaGe Linux Operations
MaGe Linux Operations
Jun 3, 2025 · Artificial Intelligence

How to Deploy GPUStack with Docker for Scalable AI Model Serving

This guide walks you through installing NVIDIA drivers, Docker, and the NVIDIA Container Toolkit, then shows step‑by‑step how to run GPUStack in Docker, expand a GPU cluster, and serve large language, multimodal, diffusion, and embedding models with OpenAI‑compatible APIs.

AI Model DeploymentDockerGPU cluster
0 likes · 15 min read
How to Deploy GPUStack with Docker for Scalable AI Model Serving
MaGe Linux Operations
MaGe Linux Operations
May 16, 2025 · Artificial Intelligence

Deploying Massive AI Models with Docker: A Complete From‑Zero‑to‑Production Guide

Learn how to efficiently package, build, and run large AI models in Docker containers—from preparing the model and API code, creating Dockerfiles, building and testing images, to scaling in production with Kubernetes and GPU support—complete with step‑by‑step commands and best‑practice tips.

AI Model DeploymentDockerFastAPI
0 likes · 10 min read
Deploying Massive AI Models with Docker: A Complete From‑Zero‑to‑Production Guide
Liangxu Linux
Liangxu Linux
Apr 28, 2025 · Artificial Intelligence

Deploy DeepSeek‑R1 on Your Server in 15 Minutes with Zero Code

This guide shows how to use the lightweight OpenStation platform to install, configure, and launch the DeepSeek‑R1 large‑model on a personal server in under 15 minutes, covering zero‑code deployment, resource management, inference engine selection, and integration with CherryStudio.

AI Model DeploymentCherryStudioDeepSeek-R1
0 likes · 7 min read
Deploy DeepSeek‑R1 on Your Server in 15 Minutes with Zero Code
Eric Tech Circle
Eric Tech Circle
Mar 28, 2025 · Artificial Intelligence

How to Build a High‑Performance Local Text‑to‑Image Service with Flux and Cursor IDE

Learn step‑by‑step how to set up a stable, high‑efficiency local text‑to‑image generation service using the Flux model series on Alibaba Cloud’s Baile​n platform, integrate it with Cursor IDE’s MCP tool, configure environments, manage API keys, and run the service with sample code and results.

AI Model DeploymentCursor IDEFlux
0 likes · 13 min read
How to Build a High‑Performance Local Text‑to‑Image Service with Flux and Cursor IDE
Tencent Cloud Developer
Tencent Cloud Developer
Feb 25, 2025 · Artificial Intelligence

Deploy DeepSeek AI: Cloud, Local, API – Full Step‑by‑Step Guide

This guide walks developers through the full lifecycle of using DeepSeek—choosing the right deployment method (API, local machine, or private cloud), selecting model sizes based on hardware, configuring Tencent Cloud services, building AI applications, and integrating the model into development tools and mini‑programs.

AI Model DeploymentAI application developmentCloud Native
0 likes · 12 min read
Deploy DeepSeek AI: Cloud, Local, API – Full Step‑by‑Step Guide
macrozheng
macrozheng
Feb 22, 2025 · Artificial Intelligence

Choosing the Right DeepSeek‑R1 Model: Hardware Needs & Use Cases Explained

This guide compares DeepSeek‑R1’s 1.5B/7B/8B, 14B/32B, and 70B/671B versions, detailing their characteristics, typical applications, and the specific CPU, memory, and GPU specifications required for local deployment, helping you select the optimal model for your resources.

AI Model DeploymentDeepSeekHardware Requirements
0 likes · 7 min read
Choosing the Right DeepSeek‑R1 Model: Hardware Needs & Use Cases Explained
Alibaba Cloud Big Data AI Platform
Alibaba Cloud Big Data AI Platform
Feb 18, 2025 · Artificial Intelligence

One-Click Deployment of Cutting-Edge Text-to-Video and Voice Interaction Models

This article introduces the state‑of‑the‑art Step‑Video‑T2V text‑to‑video model and the Step‑Audio‑Chat voice interaction model, outlines their technical specifications and benchmark results, and provides a detailed step‑by‑step guide for deploying both models with a single click using Alibaba Cloud's PAI Model Gallery.

AI Model DeploymentPAI Model Gallerystate-of-the-art
0 likes · 9 min read
One-Click Deployment of Cutting-Edge Text-to-Video and Voice Interaction Models
Baidu Geek Talk
Baidu Geek Talk
Feb 12, 2025 · Artificial Intelligence

Deploy DeepSeek, Llama, Qwen Models Fast on Baidu Baige AI Heterogeneous Platform

This guide walks you through creating a lightweight compute instance, adding it to Baidu Baige AI heterogeneous computing platform, deploying the vLLM tool, loading and serving small‑scale dense models such as DeepSeek, Llama and Qwen, and provides recommended configuration lists to achieve low‑cost, high‑performance inference.

AI Model DeploymentBaidu BaigeCloud AI
0 likes · 3 min read
Deploy DeepSeek, Llama, Qwen Models Fast on Baidu Baige AI Heterogeneous Platform
JD Tech Talk
JD Tech Talk
Feb 11, 2025 · Artificial Intelligence

Step-by-Step Guide to Deploying DeepSeek Locally with Cherry Studio

This guide walks you through registering on SiliconFlow, selecting DeepSeek models, installing Cherry Studio, configuring API keys, setting up the environment, and testing the AI assistant, enabling a full‑feature local deployment without high‑end hardware.

AI Model DeploymentCherry StudioDeepSeek
0 likes · 6 min read
Step-by-Step Guide to Deploying DeepSeek Locally with Cherry Studio
Tencent Cloud Developer
Tencent Cloud Developer
Feb 2, 2025 · Artificial Intelligence

Deploying DeepSeek-R1 Models on Tencent Cloud HAI Platform

Deploy DeepSeek‑R1 models on Tencent Cloud HAI in just three minutes by logging in, creating an application, and accessing the model via ChatbotUI or JupyterLab, without purchasing GPUs or configuring environments, while also leveraging integrated services like Cloud Studio and Object Storage for enterprise AI solutions.

AI Model DeploymentChatbotUIDeepSeek-R1
0 likes · 3 min read
Deploying DeepSeek-R1 Models on Tencent Cloud HAI Platform
360 Smart Cloud
360 Smart Cloud
Apr 15, 2024 · Artificial Intelligence

Fine‑Tuning Qwen‑14B Large Language Model: A Complete Guide

This article provides a comprehensive tutorial on fine‑tuning the Qwen‑14B large language model, covering the motivation, fine‑tuning concepts, step‑by‑step workflow, required code, DeepSpeed training parameters, testing scripts, and deployment using FastChat and the 360AI platform.

AI Model DeploymentDeepSpeedFastChat
0 likes · 9 min read
Fine‑Tuning Qwen‑14B Large Language Model: A Complete Guide
Alibaba Cloud Native
Alibaba Cloud Native
Apr 18, 2023 · Artificial Intelligence

How to Deploy a CPU‑Based Stable Diffusion Service on Alibaba Cloud ACK

This guide walks you through the prerequisites, step‑by‑step console and kubectl procedures, YAML configuration, and post‑deployment verification needed to run a CPU‑only Stable Diffusion model on Alibaba Cloud Container Service (ACK) and optionally switch to a GPU‑enabled version.

ACKAI Model DeploymentCPU
0 likes · 7 min read
How to Deploy a CPU‑Based Stable Diffusion Service on Alibaba Cloud ACK