Tagged articles

29 articles

Page 1 of 1

Apr 21, 2026 · Artificial Intelligence

Run Claude Code Locally or in the Cloud in 5 Minutes with Ollama, LM Studio, llama.cpp, and OpenRouter

This guide shows how to configure Claude Code to run on local or cloud models within five minutes, covering hardware requirements, recommended models, step‑by‑step installation for Ollama, llama.cpp, LM Studio, and cloud‑based options, plus performance and cost comparisons.

AI Model DeploymentClaude CodeLM Studio

0 likes · 12 min read

Run Claude Code Locally or in the Cloud in 5 Minutes with Ollama, LM Studio, llama.cpp, and OpenRouter

Tencent Cloud Developer

Mar 5, 2026 · Artificial Intelligence

Deploy Qwen3-4B on FlagOS with OpenClaw: A Complete Step‑by‑Step Guide

This guide walks you through deploying the Qwen3-4B-hygon-flagos model on the open‑source FlagOS stack, pulling the Docker image from Tencent Cloud HAI, configuring OpenClaw, and connecting the model to a QQ bot, while highlighting performance trends and practical considerations.

AI Model DeploymentDockerFlagOS

0 likes · 8 min read

Deploy Qwen3-4B on FlagOS with OpenClaw: A Complete Step‑by‑Step Guide

Raymond Ops

Nov 4, 2025 · Artificial Intelligence

How to Deploy GPUStack with Docker for Scalable AI Model Serving

This guide walks you through installing NVIDIA drivers and Docker, configuring the NVIDIA Container Toolkit, and deploying GPUStack in Docker to manage heterogeneous GPU resources, run large language, multimodal, diffusion, and embedding models, and scale from a single node to a multi‑node GPU cluster.

AI Model DeploymentDockerGPU cluster

0 likes · 15 min read

How to Deploy GPUStack with Docker for Scalable AI Model Serving

Alibaba Cloud Big Data AI Platform

Sep 25, 2025 · Artificial Intelligence

Unlocking Trillion‑Parameter MoE Models: Expert Parallelism and Alibaba Cloud PAI‑EAS Deployment Guide

This article explains the opportunities and challenges of Mixture of Experts (MoE) models, introduces expert parallelism as a solution to scaling and deployment bottlenecks, and provides a step‑by‑step guide for deploying MoE models with Alibaba Cloud PAI‑EAS, including configuration tips and code examples.

AI Model DeploymentExpert ParallelismMoE

0 likes · 11 min read

Unlocking Trillion‑Parameter MoE Models: Expert Parallelism and Alibaba Cloud PAI‑EAS Deployment Guide

Alibaba Cloud Infrastructure

Sep 5, 2025 · Cloud Native

How OCI‑Based ModelDistribution Simplifies AI Model Deployment Across Regions

This article explains how Alibaba Cloud ACK One's ModelDistribution leverages OCI images to standardize, version, and efficiently distribute large AI models across multiple Kubernetes clusters worldwide, addressing challenges of storage, deployment speed, and pre‑warming for rapid inference services.

AI Model DeploymentKubernetesModelDistribution

0 likes · 9 min read

How OCI‑Based ModelDistribution Simplifies AI Model Deployment Across Regions

Ops Development Stories

Jun 12, 2025 · Cloud Native

One-Click GPU-Enabled Kind Cluster Setup for Running Large AI Models

This tutorial walks you through using a one‑click script to create a GPU‑enabled Kind Kubernetes cluster, evenly distribute GPU resources across nodes with nvkind, install necessary drivers and toolkits, deploy a vLLM‑served large language model, and verify its operation, all on a local or cloud environment.

AI Model DeploymentDockerGPU

0 likes · 23 min read

One-Click GPU-Enabled Kind Cluster Setup for Running Large AI Models

MaGe Linux Operations

Jun 3, 2025 · Artificial Intelligence

How to Deploy GPUStack with Docker for Scalable AI Model Serving

This guide walks you through installing NVIDIA drivers, Docker, and the NVIDIA Container Toolkit, then shows step‑by‑step how to run GPUStack in Docker, expand a GPU cluster, and serve large language, multimodal, diffusion, and embedding models with OpenAI‑compatible APIs.

AI Model DeploymentDockerGPU cluster

0 likes · 15 min read

MaGe Linux Operations

May 16, 2025 · Artificial Intelligence

Deploying Massive AI Models with Docker: A Complete From‑Zero‑to‑Production Guide

Learn how to efficiently package, build, and run large AI models in Docker containers—from preparing the model and API code, creating Dockerfiles, building and testing images, to scaling in production with Kubernetes and GPU support—complete with step‑by‑step commands and best‑practice tips.

AI Model DeploymentDockerFastAPI

0 likes · 10 min read

Deploying Massive AI Models with Docker: A Complete From‑Zero‑to‑Production Guide

Liangxu Linux

Apr 28, 2025 · Artificial Intelligence

Deploy DeepSeek‑R1 on Your Server in 15 Minutes with Zero Code

This guide shows how to use the lightweight OpenStation platform to install, configure, and launch the DeepSeek‑R1 large‑model on a personal server in under 15 minutes, covering zero‑code deployment, resource management, inference engine selection, and integration with CherryStudio.

AI Model DeploymentCherryStudioDeepSeek-R1

0 likes · 7 min read

Deploy DeepSeek‑R1 on Your Server in 15 Minutes with Zero Code

Eric Tech Circle

Mar 28, 2025 · Artificial Intelligence

How to Build a High‑Performance Local Text‑to‑Image Service with Flux and Cursor IDE

Learn step‑by‑step how to set up a stable, high‑efficiency local text‑to‑image generation service using the Flux model series on Alibaba Cloud’s Bailen platform, integrate it with Cursor IDE’s MCP tool, configure environments, manage API keys, and run the service with sample code and results.

AI Model DeploymentCursor IDEFlux

0 likes · 13 min read

How to Build a High‑Performance Local Text‑to‑Image Service with Flux and Cursor IDE

Alibaba Cloud Developer

Mar 11, 2025 · Artificial Intelligence

How to Deploy the Open‑Source QwQ‑32B Inference Model on Alibaba Cloud CAP

This guide walks you through deploying the open‑source QwQ‑32B inference model using Alibaba Cloud's Serverless AI platform CAP, covering benchmark highlights, preparation steps, two deployment methods (application template and model service), verification, and project cleanup.

AI Model DeploymentAlibaba CloudCAP

0 likes · 7 min read

How to Deploy the Open‑Source QwQ‑32B Inference Model on Alibaba Cloud CAP

Baidu Intelligent Cloud Tech Hub

Feb 27, 2025 · Artificial Intelligence

Deploy and Extend Baidu DeepSeek Enterprise Suite in Minutes

This guide walks you through quickly deploying Baidu's DeepSeek‑R1 model, accessing its WebUI, and enabling key enterprise extensions such as web search, file upload, OCR, and content moderation to integrate AI capabilities into production workflows.

AI Model DeploymentAI extensionsDeepSeek

0 likes · 6 min read

Deploy and Extend Baidu DeepSeek Enterprise Suite in Minutes

Tencent Cloud Developer

Feb 25, 2025 · Artificial Intelligence

Deploy DeepSeek AI: Cloud, Local, API – Full Step‑by‑Step Guide

This guide walks developers through the full lifecycle of using DeepSeek—choosing the right deployment method (API, local machine, or private cloud), selecting model sizes based on hardware, configuring Tencent Cloud services, building AI applications, and integrating the model into development tools and mini‑programs.

AI Model DeploymentAI application developmentCloud Native

0 likes · 12 min read

Deploy DeepSeek AI: Cloud, Local, API – Full Step‑by‑Step Guide

Full-Stack Internet Architecture

Feb 24, 2025 · Artificial Intelligence

Deploying the DeepSeek Large Language Model Locally with Ollama on Windows

This guide explains how to install Ollama on a Windows machine, configure its environment, and use it to download and run the DeepSeek‑R1 1.5B large language model locally, enabling offline AI interactions without relying on remote servers.

AI Model DeploymentDeepSeekLocal-LLM

0 likes · 4 min read

Deploying the DeepSeek Large Language Model Locally with Ollama on Windows

macrozheng

Feb 22, 2025 · Artificial Intelligence

Choosing the Right DeepSeek‑R1 Model: Hardware Needs & Use Cases Explained

This guide compares DeepSeek‑R1’s 1.5B/7B/8B, 14B/32B, and 70B/671B versions, detailing their characteristics, typical applications, and the specific CPU, memory, and GPU specifications required for local deployment, helping you select the optimal model for your resources.

AI Model DeploymentDeepSeekHardware Requirements

0 likes · 7 min read

Choosing the Right DeepSeek‑R1 Model: Hardware Needs & Use Cases Explained

ByteDance Cloud Native

Feb 21, 2025 · Artificial Intelligence

Deploy DeepSeek‑R1‑Distill on Volcengine CPU Cloud for Low‑Cost AI Inference

This guide walks you through deploying the DeepSeek‑R1‑Distill model on Volcengine CPU ECS instances, covering use‑case scenarios, recommended server types, Docker setup, environment configuration, and verification steps to achieve cost‑effective, high‑compatibility AI inference.

AI Model DeploymentCPU inferenceDeepSeek

0 likes · 6 min read

Deploy DeepSeek‑R1‑Distill on Volcengine CPU Cloud for Low‑Cost AI Inference

Alibaba Cloud Big Data AI Platform

Feb 18, 2025 · Artificial Intelligence

One-Click Deployment of Cutting-Edge Text-to-Video and Voice Interaction Models

This article introduces the state‑of‑the‑art Step‑Video‑T2V text‑to‑video model and the Step‑Audio‑Chat voice interaction model, outlines their technical specifications and benchmark results, and provides a detailed step‑by‑step guide for deploying both models with a single click using Alibaba Cloud's PAI Model Gallery.

AI Model DeploymentPAI Model Gallerystate-of-the-art

0 likes · 9 min read

One-Click Deployment of Cutting-Edge Text-to-Video and Voice Interaction Models

Fun with Large Models

Feb 12, 2025 · Artificial Intelligence

Build a Local DeepSeek‑R1 Large Model Service with Ollama – Intro to AI LLMs

This guide walks through installing Ollama on Windows, configuring the OLLAMA_MODELS path, downloading the 7‑b DeepSeek‑R1 model, running it locally, and accessing it via a browser using the Page Assist extension, providing step‑by‑step commands, screenshots, and tips for offline setups.

AI Model DeploymentDeepSeek-R1Local-LLM

0 likes · 9 min read

Build a Local DeepSeek‑R1 Large Model Service with Ollama – Intro to AI LLMs

Baidu Geek Talk

Feb 12, 2025 · Artificial Intelligence

Deploy DeepSeek, Llama, Qwen Models Fast on Baidu Baige AI Heterogeneous Platform

This guide walks you through creating a lightweight compute instance, adding it to Baidu Baige AI heterogeneous computing platform, deploying the vLLM tool, loading and serving small‑scale dense models such as DeepSeek, Llama and Qwen, and provides recommended configuration lists to achieve low‑cost, high‑performance inference.

AI Model DeploymentBaidu BaigeCloud AI

0 likes · 3 min read

Deploy DeepSeek, Llama, Qwen Models Fast on Baidu Baige AI Heterogeneous Platform

JD Tech Talk

Feb 11, 2025 · Artificial Intelligence

Step-by-Step Guide to Deploying DeepSeek Locally with Cherry Studio

This guide walks you through registering on SiliconFlow, selecting DeepSeek models, installing Cherry Studio, configuring API keys, setting up the environment, and testing the AI assistant, enabling a full‑feature local deployment without high‑end hardware.

AI Model DeploymentCherry StudioDeepSeek

0 likes · 6 min read

Step-by-Step Guide to Deploying DeepSeek Locally with Cherry Studio

Open Source Tech Hub

Feb 11, 2025 · Artificial Intelligence

Deploy DeepSeek‑R1 Locally with Ollama and Connect It to Webman AI

This guide walks you through installing Ollama, selecting the appropriate DeepSeek‑R1 model size based on GPU memory, running the model locally, and optionally integrating it with Webman AI for a richer user experience.

AI Model DeploymentDeepSeekLocal-LLM

0 likes · 5 min read

Deploy DeepSeek‑R1 Locally with Ollama and Connect It to Webman AI

Full-Stack DevOps & Kubernetes

Feb 8, 2025 · Artificial Intelligence

Deploy DeepSeek‑R1 on Tencent Cloud with Ollama: A Complete Step‑by‑Step Guide

This guide walks you through preparing a Tencent Cloud account, creating a Cloud Studio workspace, installing Ollama, downloading and running the DeepSeek‑R1 large language model, interacting via terminal or API, and managing resources and model versions.

AI Model DeploymentAPIDeepSeek

0 likes · 8 min read

Deploy DeepSeek‑R1 on Tencent Cloud with Ollama: A Complete Step‑by‑Step Guide

Architect

Feb 2, 2025 · Artificial Intelligence

Deploying DeepSeek‑R1 Locally with Ollama and Accessing It via Spring Boot and Spring AI

This guide explains how to install Ollama, download and run the open‑source DeepSeek‑R1 language model locally, configure GPU acceleration, and integrate the model into a Spring Boot application using Spring AI to provide an API service for AI inference.

AI Model DeploymentDeepSeek-R1GPU Acceleration

0 likes · 12 min read

Deploying DeepSeek‑R1 Locally with Ollama and Accessing It via Spring Boot and Spring AI

Tencent Cloud Developer

Feb 2, 2025 · Artificial Intelligence

Deploying DeepSeek-R1 Models on Tencent Cloud HAI Platform

Deploy DeepSeek‑R1 models on Tencent Cloud HAI in just three minutes by logging in, creating an application, and accessing the model via ChatbotUI or JupyterLab, without purchasing GPUs or configuring environments, while also leveraging integrated services like Cloud Studio and Object Storage for enterprise AI solutions.

AI Model DeploymentChatbotUIDeepSeek-R1

0 likes · 3 min read

Deploying DeepSeek-R1 Models on Tencent Cloud HAI Platform

Alibaba Cloud Native

Dec 26, 2024 · Cloud Computing

Deploy Qwen2.5 LLM on Alibaba Cloud Function Compute: A Step‑by‑Step Guide

This guide explains how to deploy the Qwen2.5 large language model on Alibaba Cloud Function Compute using Ollama and Open WebUI, covering model selection, resource configuration, deployment steps, interface setup, multilingual capabilities, and automatic scaling for high‑concurrency workloads.

AI Model DeploymentFunction ComputeOllama

0 likes · 10 min read

Deploy Qwen2.5 LLM on Alibaba Cloud Function Compute: A Step‑by‑Step Guide

360 Smart Cloud

Apr 15, 2024 · Artificial Intelligence

Fine‑Tuning Qwen‑14B Large Language Model: A Complete Guide

This article provides a comprehensive tutorial on fine‑tuning the Qwen‑14B large language model, covering the motivation, fine‑tuning concepts, step‑by‑step workflow, required code, DeepSpeed training parameters, testing scripts, and deployment using FastChat and the 360AI platform.

AI Model DeploymentDeepSpeedFastChat

0 likes · 9 min read

Fine‑Tuning Qwen‑14B Large Language Model: A Complete Guide

Alibaba Cloud Big Data AI Platform

May 9, 2023 · Artificial Intelligence

Deploy Stable Diffusion WebUI on Alibaba Cloud PAI‑EAS in Minutes

This guide walks you through using Alibaba Cloud's PAI‑EAS to quickly deploy the open‑source Stable Diffusion text‑to‑image model with a low‑code WebUI, covering preparation, service configuration, inference testing, and common troubleshooting tips for performance and file management.

AI Model DeploymentAIGCAlibaba Cloud

0 likes · 9 min read

Deploy Stable Diffusion WebUI on Alibaba Cloud PAI‑EAS in Minutes

Alibaba Cloud Native

Apr 18, 2023 · Artificial Intelligence

How to Deploy a CPU‑Based Stable Diffusion Service on Alibaba Cloud ACK

This guide walks you through the prerequisites, step‑by‑step console and kubectl procedures, YAML configuration, and post‑deployment verification needed to run a CPU‑only Stable Diffusion model on Alibaba Cloud Container Service (ACK) and optionally switch to a GPU‑enabled version.

ACKAI Model DeploymentCPU

0 likes · 7 min read

How to Deploy a CPU‑Based Stable Diffusion Service on Alibaba Cloud ACK

Alibaba Cloud Native

Mar 13, 2023 · Cloud Native

How Knative and ECI Virtual Nodes Supercharged AI Model Deployment on Alibaba Cloud

Shuhe Technology leveraged Alibaba Cloud Container Service with Knative and ECI virtual nodes to achieve auto‑scaling, multi‑version management, and up to 60% cost reduction for AI model services, dramatically improving resource efficiency and stability under burst traffic.

AI Model DeploymentAlibaba CloudCloud Native

0 likes · 7 min read

How Knative and ECI Virtual Nodes Supercharged AI Model Deployment on Alibaba Cloud