Tagged articles

Ollama

168 articles · Page 1 of 2

Jul 5, 2026 · Artificial Intelligence

In‑Depth Evaluation of the 1.5B‑Parameter Security‑SLM‑1.5B Model Running Locally on CPU

The article provides a detailed technical assessment of the open‑source 1.5 billion‑parameter security‑SLM‑1.5B language model, covering its architecture, quantized GGUF format, blue‑team and red‑team capabilities, training metrics, performance improvements, and step‑by‑step deployment on CPU via Ollama, llama.cpp, and Python.

MITRE ATT&CK mapping · Ollama · blue team

0 likes · 9 min read

SpringMeng

Jun 20, 2026 · Artificial Intelligence

Building a Local AI Knowledge Base in 2 Months for 75k: My Development Journey

In two months and on a budget of 75,000 CNY, I built a secure on‑premise AI knowledge base for a research institute using Spring Boot, Python, DeepSeek‑v4, RAGFlow, and a custom GPU‑rich server, and documented every step from hardware selection to Docker deployment.

AI · DeepSeek · Docker

0 likes · 11 min read

The Dominant Programmer

Jun 3, 2026 · Backend Development

Building a Minimal Spring AI Tool Chain for Multi-Tool Calls

This tutorial demonstrates how to integrate Spring AI with Ollama, define @Tool‑annotated weather and translation utilities, register them for automatic chaining, and let a large language model answer queries like “fetch Beijing weather and reply in English” using a concise end‑to‑end example.

Java · Ollama · Spring AI

0 likes · 8 min read

Data STUDIO

Jun 1, 2026 · Artificial Intelligence

Build a CLI AI Agent in Just 250 Python Lines

This tutorial walks through seven incremental stages to create a fully functional CLI AI Agent using Ollama and the local qwen3.5 model without a GPU or API keys: a simple while‑True loop, then tool‑calling, dynamic skill loading, slash commands, JSON persistence, automatic context compression, and a background timed loop.

AI Agent · CLI · Ollama

0 likes · 15 min read

The Dominant Programmer

May 28, 2026 · Artificial Intelligence

Spring AI RAG: Concepts, Hands‑On Implementation, and Full Code

This article explains the limitations of large language models, introduces Retrieval‑Augmented Generation (RAG) and its four‑step workflow, details Spring AI's RAG components and vector‑store options, and provides complete, runnable Java code—including Maven, configuration, and service classes—to build a local knowledge‑base Q&A system.

Embedding · Java · Ollama

0 likes · 18 min read

The Dominant Programmer

May 25, 2026 · Artificial Intelligence

Mastering Structured Output in Spring AI: Getting Precise JSON from Large Language Models

This article walks through using Spring AI with Ollama to enforce JSON‑schema‑based structured output for agents, showing why structured responses matter, how Spring AI generates schemas from Java beans, and providing complete runnable code for both basic and advanced tool‑calling scenarios.

Agent · Function Calling · JSON schema

0 likes · 11 min read

The Dominant Programmer

May 24, 2026 · Artificial Intelligence

Integrating Spring AI with Ollama for Tool Calling: A Complete Beginner‑to‑Practice Guide

This article walks through setting up Spring AI with Ollama, explains the tool‑calling workflow, shows two ways to define tools, provides full Maven and YAML configurations, presents runnable Java code for services, chat client, and controller, and addresses common compatibility and dependency issues.

AI integration · Java · Ollama

0 likes · 12 min read

The Dominant Programmer

May 23, 2026 · Backend Development

Switching Spring AI from DashScope to Ollama: Multi‑MCP Server Calls and Targeted Server Example

This guide walks through replacing DashScope with a local Ollama model in a Spring AI project, showing how to configure multiple MCP servers, adjust Maven and YAML settings, make the switch with zero code changes, and troubleshoot common issues.

Configuration · Java · LLM

0 likes · 9 min read

Old Zhang's AI Learning

May 15, 2026 · Artificial Intelligence

How to Use Codex on Your Phone for Free – A Step‑by‑Step Guide

This guide shows how OpenAI has embedded Codex into the ChatGPT mobile app, turning your phone into a remote control panel for a desktop‑running Codex instance, and walks through the free setup, Ollama installation, model switching, and connection workflow.

AI coding · ChatGPT · Codex

0 likes · 6 min read

Old Zhang's AI Learning

May 11, 2026 · Information Security

Critical CVE-2026-7482 'Bleeding Llama' in Ollama: Why You Must Upgrade Now

Ollama versions before 0.17.1 suffer from a CVSS 9.1 heap out‑of‑bounds read vulnerability (CVE‑2026‑7482) that lets attackers upload malicious GGUF files, read server memory (including env vars and API keys), and exfiltrate data. With over 300,000 publicly exposed servers affected, immediate upgrade and hardening are essential.

API vulnerability · Bleeding Llama · CVE-2026-7482

0 likes · 5 min read

Black & White Path

May 9, 2026 · Information Security

Ollama ‘Bleeding Llama’ Vulnerability Puts 300K Servers at Risk of Sensitive Data Exposure

A critical CVE‑2026‑7482 flaw in Ollama’s model quantization pipeline, dubbed “Bleeding Llama,” allows unauthenticated attackers to craft GGUF files that read beyond buffer limits, potentially leaking prompts, API keys and other confidential data from over 300,000 internet‑exposed servers, with mitigation requiring an upgrade to version 0.17.1 and stricter network controls.

AI security · Bleeding Llama · CVE-2026-7482

0 likes · 5 min read

Java Web Project

Apr 29, 2026 · Backend Development

Run Claude Code in VS Code for Free with a One‑Time Proxy Setup

This guide shows how to bypass Claude Code's paid Anthropic API by installing a local proxy that forwards requests to free models such as DeepSeek, Ollama, or NVIDIA NIM, covering all required tools, configuration steps, and troubleshooting tips.

Claude Code · DeepSeek · Free AI

0 likes · 10 min read

The Dominant Programmer

Apr 28, 2026 · Backend Development

Spring Boot, LangChain4j & Ollama: Chain for Intent Recognition and Task Dispatch

The article demonstrates how to construct a Spring Boot application that orchestrates multiple AI services using LangChain4j and Ollama, defining intent‑classification and tool‑based assistants, registering them as beans, and routing user requests through a controller to achieve multi‑step intent recognition and task dispatch in a simulated intelligent customer‑service workflow.

AI orchestration · LangChain4j · Ollama

0 likes · 13 min read

The Dominant Programmer

Apr 27, 2026 · Artificial Intelligence

Build and Integrate a Local LLM with Spring Boot, LangChain4j, and Ollama

This guide walks through installing Ollama on Windows, downloading a Qwen2.5‑7B model, configuring Spring Boot with LangChain4j dependencies, setting up application.yml, defining AI service interfaces, adding conversation memory, creating REST and streaming controllers, and testing the end‑to‑end local LLM workflow.

AI · Chatbot · LLM

0 likes · 12 min read

The Dominant Programmer

Apr 27, 2026 · Artificial Intelligence

Building a Smart Customer Service with Spring Boot, LangChain4j, and Ollama Function Calling

This guide walks through setting up a local LLM with Ollama, configuring Spring Boot and LangChain4j, defining function‑calling tools for weather, order status, logistics and coupons, creating AI service beans, exposing REST controllers, and troubleshooting common integration issues.

AI integration · Function Calling · Java

0 likes · 14 min read

The Dominant Programmer

Apr 27, 2026 · Artificial Intelligence

Building a Private Document Vector Search with Spring Boot, LangChain4j, and Ollama RAG

This guide covers why Retrieval‑Augmented Generation (RAG) is needed for large language models, explains the three‑step indexing and query workflow, details LangChain4j’s core components, and provides a complete Spring Boot example (including Maven setup, configuration, service code, and troubleshooting) to create a private document‑vector search system powered by Ollama.

Embedding · LangChain4j · Ollama

0 likes · 13 min read

Old Meng AI Explorer

Apr 26, 2026 · Artificial Intelligence

How to Integrate Codex with Domestic LLMs in 10 Minutes and Cut Costs by 90%

This guide shows developers how to replace costly OpenAI APIs by configuring Codex to use Chinese large‑language models such as DeepSeek, GLM‑4.7, and Qwen, detailing three setup methods, benchmark results, cost savings of up to 90%, and best‑practice tips for optimal performance.

Codex · LLM · Model integration

0 likes · 18 min read

Old Zhang's AI Learning

Apr 24, 2026 · Artificial Intelligence

DeepSeek V4 Surge: Technical Specs, Quantization Details, Deployment Costs, and Market Impact

The article compiles key information on DeepSeek V4, covering Ollama's one‑click launch, the model's FP4/FP8 mixed‑precision quantization, size reductions, high local deployment costs, recent benchmark rankings, and the accompanying stock price movements in both China and the US.

AI benchmarks · DeepSeek-V4 · FP4

0 likes · 5 min read

DevOps Coach

Apr 23, 2026 · Artificial Intelligence

Can Gemma 4 on a MacBook Pro or NVIDIA Blackwell Replace Cloud LLMs? A Hands‑On Performance Study

The author benchmarks Gemma 4 locally on a 24 GB M4 Pro MacBook Pro (llama.cpp) and on a Dell GB10 with an NVIDIA Blackwell GPU (Ollama), comparing token speed, tool‑call reliability, and task completion against cloud GPT‑5.4. The Mac runs faster per token, but the Blackwell system achieves higher first‑pass success with fewer retries, and the jump from Gemma 3 to Gemma 4 dramatically improves agentic coding viability.

Benchmark · Gemma 4 · MacBook Pro

0 likes · 15 min read

AI Algorithm Path

Apr 21, 2026 · Artificial Intelligence

Run Claude Code Locally or in the Cloud in 5 Minutes with Ollama, LM Studio, llama.cpp, and OpenRouter

This guide shows how to configure Claude Code to run on local or cloud models within five minutes, covering hardware requirements, recommended models, step‑by‑step installation for Ollama, llama.cpp, LM Studio, and cloud‑based options, plus performance and cost comparisons.

AI model deployment · Claude Code · LM Studio

0 likes · 12 min read

Coder Trainee

Apr 20, 2026 · Artificial Intelligence

How to Install and Configure Ollama Locally for a CRM AI Engine

This guide walks through installing Ollama on Windows 10, downloading a Chinese‑friendly LLM such as Qwen2, configuring a CRM’s application‑dev.yml to point to the local Ollama service, restarting the backend, and handling optional CORS settings, highlighting zero‑cost, privacy, and stability benefits.

AI Deployment · CRM integration · Ollama

0 likes · 4 min read

Test Development Learning Exchange

Apr 19, 2026 · Artificial Intelligence

Master Ollama on macOS: Install, Run, and Optimize Large Language Models

This step‑by‑step guide shows how to install Ollama on macOS, verify the installation, manage and run open‑source LLMs, create custom models, enable the OpenAI‑compatible API, integrate with Open WebUI, and troubleshoot performance issues across different Apple silicon chips.

AI · Installation · LLM

0 likes · 9 min read

IT Services Circle

Apr 19, 2026 · Artificial Intelligence

How to Seamlessly Add AI Coding Assistants to IntelliJ IDEA

This guide walks you through configuring IntelliJ IDEA to use AI coding assistants like Claude, Codex, OpenAI‑compatible APIs, and local models via Ollama, covering plugin installation, provider setup, API key entry, and usage tips with screenshots.

AI assistant · Claude · Codex

0 likes · 6 min read

TonyBai

Apr 18, 2026 · Industry Insights

Why Ollama Fell From Open‑Source Hero to Community Villain

The article revisits Ollama’s rise as a user‑friendly local LLM runner, then details the community backlash over its omission of llama.cpp credit, the introduction of a private model format, performance regressions, and a VC‑driven commercialization pattern, while presenting open‑source alternatives.

Ollama · Open-source · VC trap

0 likes · 9 min read

James' Growth Diary

Apr 13, 2026 · Frontend Development

Local Inference & Edge AI: Why Front‑End AI Is the Next Battlefield

Edge AI runs AI models directly in browsers or on devices, offering zero latency, zero API cost, and full privacy. The article explains the three technical breakthroughs that make it possible, compares WebLLM, Transformers.js and Ollama, and provides a hybrid architecture with concrete engineering challenges and solutions that can cut total AI costs by 40‑55% for typical front‑end applications.

Ollama · Transformers.js · WebGPU

0 likes · 20 min read

LuTiao Programming

Apr 12, 2026 · Artificial Intelligence

From Scratch to Production: Java + Spring Boot RAG Pipeline for Enterprise GenAI

This article walks through building a production‑ready Retrieval‑Augmented Generation (RAG) system using Java, Spring Boot, LangChain4j, Chroma vector store, and Ollama LLM, covering architecture, key dependencies, configuration, document ingestion, retrieval APIs, scoring, and security considerations.

Chroma · GenAI · Java

0 likes · 8 min read

Old Zhang's AI Learning

Apr 12, 2026 · Artificial Intelligence

Deploy the Open‑Source MiniMax‑M2.7 Model Locally: Step‑by‑Step Guide

MiniMax‑M2.7, the newly open‑sourced 230‑billion‑parameter MoE model, offers self‑evolution, professional software engineering and agent capabilities, and can be deployed locally using Ollama, vLLM, SGLang or Docker with 4‑8 H200 GPUs, while the article details hardware needs, performance gains and tool‑calling/Thinking features.

Deployment · GPU · LLM

0 likes · 11 min read

Machine Heart

Apr 10, 2026 · Artificial Intelligence

Run Gemma 4 with OpenClaw in Three Simple Steps – Official Google Guide

This article walks through Google’s official three‑step tutorial for connecting the Gemma 4 language model to OpenClaw using Ollama, details hardware requirements, discusses performance and security considerations, and evaluates the model’s capabilities compared to larger LLMs.

Gemma 4 · Local LLM Deployment · Mac Studio

0 likes · 5 min read

Test Development Learning Exchange

Apr 8, 2026 · Backend Development

Build an AI-Powered API Test Framework on Mac with Ollama and Python

This guide shows how to combine a locally deployed Ollama LLM with Python Requests to create an AI-driven automated API testing framework that generates test data, performs smart assertions, and produces markdown reports, dramatically reducing manual effort and improving test quality.

API testing · Automation · LLM

0 likes · 9 min read

Lao Guo's Learning Space

Apr 8, 2026 · Artificial Intelligence

Unlock Private AI on Mac Studio 128GB: One‑Click Multi‑Model Deployment & Auto‑Switch

This guide shows how to leverage the 128 GB unified memory of a Mac Studio to run multiple open‑source LLMs simultaneously, using Ollama for installation and OpenClaw for automatic model routing based on task type, achieving zero‑API cost, full privacy, and optimal performance.

AI model routing · Arena AI rankings · Mac Studio

0 likes · 9 min read

Old Zhang's AI Learning

Apr 6, 2026 · Artificial Intelligence

Ollama 0.19 Boosts Apple Silicon LLM Inference with MLX Engine and NVFP4

Ollama 0.19 replaces its inference backend with Apple’s MLX framework and adopts NVIDIA’s NVFP4 4‑bit quantization, delivering up to a 93% speed increase on M5 chips while keeping accuracy comparable to cloud‑based deployments, and adds three cache upgrades for smoother agent interactions.

Apple Silicon · LLM Inference · MLX

0 likes · 10 min read

Lao Guo's Learning Space

Apr 4, 2026 · Artificial Intelligence

Which Mac Studio Config Can Run the Largest AI Models? A One-Table Guide

The article explains how the unified memory architecture and high bandwidth of Apple’s updated 2025 Mac Studio determine the size of AI models it can run, compares M4 Max and M3 Ultra configurations, maps memory to model parameters, and recommends setups for various use cases.

Large Language Models · M3 Ultra · M4 Max

0 likes · 8 min read

Old Zhang's AI Learning

Apr 4, 2026 · Artificial Intelligence

Deploy Gemma 4 Locally: Ollama, llama.cpp, MLX, vLLM + TurboQuant Optimization

The article reviews the four Gemma 4 model variants, analyzes their architecture and benchmark results versus Qwen3.5, and provides step‑by‑step instructions for local deployment using Ollama, llama.cpp, MLX and vLLM, while highlighting TurboQuant memory and weight compression techniques.

AI benchmarking · Gemma 4 · MLX

0 likes · 15 min read

Old Meng AI Explorer

Apr 2, 2026 · Artificial Intelligence

Slash Your AI Coding Costs: Connect Codex with Chinese Large Models in 10 Minutes

This guide shows how to avoid high OpenAI Codex fees by switching to domestic large language models (DeepSeek, GLM‑4.7, Qwen3.5 and others) through three practical integration methods, providing step‑by‑step commands, configuration files, performance benchmarks and cost‑saving calculations for individual developers and teams.

AI coding · Codex integration · Large Language Models

0 likes · 20 min read

Test Development Learning Exchange

Mar 24, 2026 · Artificial Intelligence

Build a Test‑Specific AI Agent to Auto‑Generate Pytest Cases and Analyze Allure Reports

This guide presents an end‑to‑end solution for creating a test‑focused AI agent that indexes project code and defect data, integrates a large language model via LangChain, generates compliant Pytest cases, parses Allure reports, and offers deployment tips for seamless PyCharm integration.

AI Agent · Allure · LangChain

0 likes · 13 min read

Advanced AI Application Practice

Mar 24, 2026 · Artificial Intelligence

Connecting OpenClaw to Ollama: Step‑by‑Step Guide and Common Pitfalls

This article explains why Ollama has become popular for local LLM deployment, outlines its core features, and provides a detailed, step‑by‑step tutorial for integrating OpenClaw with Ollama—including model selection, configuration, troubleshooting common errors, and advanced tips for customization and multi‑model switching.

AI · Model Deployment · Ollama

0 likes · 9 min read

Old Zhang's AI Learning

Mar 4, 2026 · Artificial Intelligence

How to Turn Thinking Mode On or Off for Qwen3.5 Models in Ollama, LM Studio, llama.cpp, and vLLM

This guide shows step‑by‑step how to enable or disable the thinking mode of Qwen3.5 series large language models across Ollama, LM Studio (GGUF and MLX), llama.cpp, and vLLM/SGLang using command‑line flags, custom model YAML files, and API parameters.

LM Studio · Ollama · Qwen3.5

0 likes · 4 min read

Cognitive Technology Team

Mar 2, 2026 · Artificial Intelligence

Stream Real-Time Chat with Ollama’s qwen3 Model via Async Python & LangChain

This guide walks you through installing Ollama, downloading the qwen3:4b model, and using Python’s async client to perform streaming chat requests, then shows how to integrate the same model with LangChain, including setup, initialization, and both regular and streaming output examples.

Async Python · Chatbot · LangChain

0 likes · 5 min read

AI Large-Model Wave and Transformation Guide

Feb 27, 2026 · Artificial Intelligence

How to Deploy Dify and Ollama Locally on Windows 11: A Step‑by‑Step Guide

This article walks through enabling Hyper‑V on Windows 11 Pro, configuring Docker Desktop with Chinese mirrors, adjusting storage, installing Ubuntu via WSL, cloning and setting up Dify, running Docker Compose, and linking Ollama's LLM so the AI agent runs entirely on a local machine.

AI Deployment · Dify · Docker

0 likes · 6 min read

Old Zhang's AI Learning

Feb 24, 2026 · Artificial Intelligence

Launch OpenClaw with a Single Command in Ollama 0.17 – Zero Configuration

With Ollama 0.17 you can start the powerful OpenClaw AI assistant using a single command, automatically install the software, choose cloud or local models, enable web‑search, connect to multiple messaging platforms, and keep all data private on your own machine.

AI assistant · Ollama · OpenClaw

0 likes · 10 min read

Old Zhang's AI Learning

Feb 23, 2026 · Artificial Intelligence

One-Click Tool to Determine Which Large Language Models Your PC Can Run Locally

The llmfit command‑line utility scans your CPU, RAM, GPU and VRAM, scores 157 models from over 30 providers, suggests the highest‑quality quantized version that fits, integrates with Ollama, and shows real‑world test results confirming its accuracy, though its model database is limited.

Large Language Models · Mixture of Experts · Ollama

0 likes · 6 min read

Code Mala Tang

Feb 20, 2026 · Artificial Intelligence

How to Integrate Claude Code with Ollama for Local and Cloud LLM Workflows

This guide walks you through installing Claude Code and Ollama, pulling and configuring various open‑source models, setting environment variables, and running Claude Code with both local and cloud‑hosted models, while covering context length, performance considerations, and tool‑calling examples.

Claude Code · LLM integration · Ollama

0 likes · 14 min read

Old Zhang's AI Learning

Feb 18, 2026 · Artificial Intelligence

New Ollama Features: Instant Model Switching, Subagents, and Built‑in Web Search

The latest Ollama 0.16.1 release lets users switch models and tools instantly, use Claude Code, Codex, and OpenClaw without extra configuration, and enables Subagents and built‑in web search directly via simple commands.

Claude Code · Large Language Model · Model Switching

0 likes · 3 min read

Old Zhang's AI Learning

Feb 12, 2026 · Artificial Intelligence

Testing the World's Most Powerful Open‑Source LLM: GLM‑5, Local Deployment & Free Ollama Cloud

The article evaluates GLM‑5, billed as the strongest open‑source large language model, comparing its benchmark scores to Claude Opus, Gemini and GPT, detailing its DeepSeek‑inspired architecture, quantized FP8 deployment requirements, and step‑by‑step usage of Ollama’s free cloud model with Agent, data‑analysis and document‑generation features.

AI benchmarking · Agent mode · GLM-5

0 likes · 7 min read

Old Zhang's AI Learning

Feb 3, 2026 · Artificial Intelligence

Why GLM-OCR Leads OCR Benchmarks: 0.9B Model Tops OmniDocBench

GLM-OCR, a 0.9B‑parameter multimodal OCR model from Zhipu, achieves the highest score (94.62) on OmniDocBench V1.5, offers lightweight deployment via vLLM, Ollama, API and SDK, and outperforms larger rivals like DeepSeek‑OCR and PaddleOCR in speed and accuracy.

Deployment · GLM-OCR · OCR

0 likes · 10 min read

Old Zhang's AI Learning

Feb 2, 2026 · Artificial Intelligence

The Easiest Free OpenClaw Setup with Ollama’s Cloud Model Support

This step‑by‑step guide shows how to prepare an Ubuntu (or Mac) VM, install an agent tool, set up Ollama, install OpenClaw via npm, configure the daemon, launch the GLM‑4.7:cloud model, and connect OpenClaw to Telegram for AI‑agent interactions.

AI Agent · Agent tools · Node.js

0 likes · 4 min read

SpringMeng

Jan 30, 2026 · Artificial Intelligence

Hands‑On Guide: Build AI Agent Chatbots on Windows with RAGFlow

Programmer Xiao Meng walks through a complete Windows setup for AI‑powered customer service agents using RAGFlow, covering prerequisites, Docker and Ollama installation, model download, container deployment, configuration of knowledge bases, and testing, based on five real‑world projects.

AI Chatbot · Docker · Large Language Model

0 likes · 7 min read

AI Cyberspace

Jan 29, 2026 · Artificial Intelligence

Step‑by‑Step Guide to Efficient LLM Fine‑Tuning with LoRA, QLoRA, and Llama‑Factory

This tutorial explains the concepts, methods, and practical commands for fine‑tuning large language models using efficient techniques like LoRA and QLoRA, covering model selection, resource considerations, Docker deployment, dataset preparation, training configuration, evaluation metrics, model merging, and deployment with GGUF and Ollama.

GGUF · GPU memory optimization · LLM fine-tuning

0 likes · 27 min read

Ubuntu

Jan 26, 2026 · Artificial Intelligence

Build a Fully Private Ubuntu AI Assistant with DeepSeek‑R1 and AnythingLLM (No Internet Needed)

This guide walks you through installing Ollama on Ubuntu, loading the open‑source DeepSeek‑R1 model, configuring AnythingLLM as a local RAG system, and testing it offline so the AI can answer questions from your private documents without any data ever leaving your machine.

AnythingLLM · DeepSeek-R1 · Ollama

0 likes · 6 min read

Old Zhang's AI Learning

Jan 25, 2026 · Artificial Intelligence

Ollama launch: One‑Command Tool Setup and New 5‑Hour Cloud Sessions

The article introduces Ollama's new "ollama launch" command, which lets users configure and start programming tools like Claude Code, OpenCode, Codex, and Droid with a single command, and explains quick‑start steps, recommended local and cloud models, and an extended five‑hour cloud coding session.

AI models · Ollama · cloud sessions

0 likes · 6 min read

Ubuntu

Jan 25, 2026 · Artificial Intelligence

Unlock Productivity: Create a Full‑Featured AI Coding Workflow on Ubuntu with CC Switch and Ollama

This step‑by‑step guide shows how to install Ollama on Ubuntu, download DeepSeek‑Coder‑V2 or Qwen2.5‑Coder models, set up Claude Code, Codex, and Gemini CLI clients, configure the open‑source CC Switch proxy to route their requests to the local Ollama engine, and run a test prompt that generates Python code without any external API keys.

AI coding · CC Switch · Claude Code

0 likes · 8 min read

Ubuntu

Jan 24, 2026 · Artificial Intelligence

Unlock Full‑Stack AI Coding on Ubuntu with Ollama and CC Switch

This step‑by‑step guide shows how to replace cloud‑based AI coding tools with a private, zero‑cost workflow on Ubuntu by installing Ollama, configuring systemd, adding DeepSeek or Qwen2.5 models, installing Claude, Codex and Gemini CLIs, and routing them through CC Switch.

AI coding · CC Switch · Claude Code

0 likes · 7 min read

Ubuntu

Jan 23, 2026 · Artificial Intelligence

Deploy DeepSeek Locally on Ubuntu: Build Your Private AI Assistant

This guide explains why you might run a large language model locally (privacy, zero latency, and no token costs), then details hardware requirements, installs Ollama, pulls the appropriate DeepSeek‑R1 model, tests it with a coding prompt, and optionally adds a web UI via Docker.

AI assistant · DeepSeek · Ollama

0 likes · 6 min read

AI Insight Log

Jan 20, 2026 · Artificial Intelligence

Is GLM-4.7-Flash the New 30B‑Level LLM King? Open‑Source and Ollama‑Ready

GLM‑4.7‑Flash, a 30B‑parameter MoE LLM released as fully open‑source and free, delivers 30B‑class performance across six benchmarks, runs locally with a single Ollama command, and offers a faster cloud‑hosted version with modest token‑based pricing, though hardware costs still apply.

Anthropic API · Benchmark · GLM-4.7-Flash

0 likes · 7 min read

AI Insight Log

Jan 19, 2026 · Artificial Intelligence

Run Claude Code for Free? Ollama Adds Anthropic API Compatibility

Ollama v0.14.0 now supports the Anthropic API, letting you run Claude Code locally with open‑source models like Qwen or Llama without an API key, network connection, or cost, and the article provides a step‑by‑step setup, SDK examples, and an objective assessment of the approach.

Anthropic API · Claude Code · Ollama

0 likes · 7 min read

Fun with Large Models

Jan 18, 2026 · Artificial Intelligence

Step‑by‑Step Guide to Deploying Large Language Models Locally with vLLM and Ollama

This article walks through two mainstream local deployment solutions—high‑performance vLLM for production Linux servers and lightweight Ollama for personal Windows machines—covering environment setup, model download, server launch, API testing, key configuration parameters, and the quantization technique that makes Ollama models compact.

GPU Optimization · Large Language Models · Model Quantization

0 likes · 18 min read

Woodpecker Software Testing

Jan 15, 2026 · Artificial Intelligence

Step-by-Step Guide to Building Your First AI Agent: Connecting Alibaba Cloud, OpenAI, DashScope, DeepSeek, and Ollama

This article provides a detailed, hands‑on tutorial for creating an AI agent, covering registration and API key setup for Alibaba Cloud, OpenAI, DashScope and DeepSeek, installing and using Ollama for local model deployment, configuring CherryStudio, and implementing function‑calling and MCP techniques with full code examples.

AI Agent · Alibaba Cloud · DashScope

0 likes · 26 min read

Ubuntu

Jan 12, 2026 · Artificial Intelligence

How to Deploy a Privacy‑First AI Agent Workflow on Ubuntu (No Cloud Needed)

The article explains why running AI locally on Ubuntu offers data security, zero token costs, offline capability, and millisecond response times, then provides a step‑by‑step guide to install Ollama via Snap, pull the DeepSeek Coder 6.7B model, optimize GPU drivers and memory, integrate with VS Code, and monitor resource usage in real time.

DeepSeek Coder · GPU Optimization · Ollama

0 likes · 5 min read

Raymond Ops

Dec 16, 2025 · Artificial Intelligence

Master Multi‑GPU Load Balancing for OLLAMA: From Setup to Production

This guide walks you through configuring Ollama for multi‑GPU load balancing, covering hardware checks, CUDA and Docker setup, native and containerized deployment methods, core parameter tuning, advanced sharding, dynamic monitoring, troubleshooting, production best practices, and a real‑world RTX 4090 case study.

AI inference · CUDA · GPU

0 likes · 15 min read

JakartaEE China Community

Dec 16, 2025 · Artificial Intelligence

Build a Retrieval‑Augmented Generation (RAG) System with Langchain4j and Ollama 3

This guide walks through the importance of Retrieval‑Augmented Generation, outlines the core Langchain4j and Ollama 3 components, and provides a complete Java example—including Maven setup, document ingestion, embedding creation, similarity search, prompt construction, and response generation—to demonstrate a functional RAG pipeline.

Embedding · Java · LLM

0 likes · 9 min read

Java Companion

Dec 12, 2025 · Backend Development

AI‑Powered One‑Command Git Commit Message Beautifier for Clean Project History

The article introduces the open‑source tool git‑rewrite‑commits, shows how to install Ollama, configure its cloud model, and run a single npx command that uses AI to rewrite messy Git commit messages and then force‑push the cleaned history.

AI · Git · Ollama

0 likes · 4 min read

Code Wrench

Dec 6, 2025 · Artificial Intelligence

Build a Local Go AI Agent with Ollama and DeepSeek – MVP Guide

This article walks you through creating a fully offline, extensible AI programming assistant in Go, using Ollama and DeepSeek‑R1, covering project layout, message formats, function calling, tool integration, a simple WebSocket UI, and future extension ideas.

AI Agent · Go · Ollama

0 likes · 10 min read

JakartaEE China Community

Nov 18, 2025 · Artificial Intelligence

How to Build a Retrieval‑Augmented Generation (RAG) System with Langchain4j and Ollama 3

This article explains why Retrieval‑Augmented Generation improves LLM accuracy, outlines the key Langchain4j and Ollama 3 components, and provides a step‑by‑step Java example (including Maven setup, document ingestion, embedding, similarity search, prompt creation, and response generation) to demonstrate a functional RAG pipeline.

Embedding · Java · LLM

0 likes · 8 min read

Rare Earth Juejin Tech Community

Oct 31, 2025 · Artificial Intelligence

Build a Private AI Knowledge Base with Ollama and FastGPT

This guide walks you through setting up a locally deployed AI system using Ollama and FastGPT, covering model selection, Docker deployment, configuration, knowledge‑base creation, and testing so your team can query internal documents securely and efficiently.

AI · Docker · FastGPT

0 likes · 25 min read

DevOps Engineer

Oct 3, 2025 · Artificial Intelligence

Enable Ollama Local Model in Jenkins Explain Error Plugin – A Step‑by‑Step Guide

The Explain Error Plugin for Jenkins now supports the Ollama local model, offering a community‑driven AI solution that simplifies configuration, enhances security, and speeds up build error analysis for both individual developers and enterprise teams.

AI · CI/CD · Explain Error Plugin

0 likes · 4 min read

Raymond Ops

Sep 23, 2025 · Artificial Intelligence

Install Ollama’s Local LLM on Windows and Power It with ShellGPT

This guide walks you through installing the Ollama local large‑language‑model runtime on Windows, deploying a Gemma2 model, then setting up ShellGPT on Linux to interact with the local LLM, covering configuration, basic commands, and advanced usage examples.

AI assistant · Linux · Ollama

0 likes · 6 min read

Code Wrench

Sep 22, 2025 · Artificial Intelligence

Build a Private ChatGPT on Your Laptop with Ollama, DeepSeek‑R1 and Go MCP

This guide walks you through installing Ollama, pulling the open‑source DeepSeek‑R1:1.5B model, wrapping it with a Go‑based Model Context Protocol (MCP) server, creating a client example, and enhancing the experience with Open‑WebUI while offering performance‑tuning tips.

DeepSeek · Go · MCP

0 likes · 9 min read

Instant Consumer Technology Team

Sep 12, 2025 · Cloud Native

Deploy Large Language Models on Kubernetes with Ollama and Open-WebUI

This guide walks through deploying a local LLM on Kubernetes using Ollama for model serving and Open-WebUI for a web interface, covering namespace creation, storage setup, GPU support, service exposure, validation, and model download to ensure privacy, low latency, and high availability.

GPU · Kubernetes · Large Language Model

0 likes · 9 min read

Dunmao Tech Hub

Sep 1, 2025 · Artificial Intelligence

Deploy DeepSeek‑r1 Locally with a One‑Click Ollama Script

This guide walks you through a Bash script that automatically checks for Ollama, installs it if missing, lets you choose a DeepSeek‑r1 model size, starts the Ollama service, and runs the selected model locally, complete with usage examples and a token‑cost note.

AI · DeepSeek · Model Deployment

0 likes · 7 min read

Raymond Ops

Aug 26, 2025 · Artificial Intelligence

How to Deploy DeepSeek R1 Locally: Versions, Hardware, and UI Tools

This guide explains DeepSeek R1’s model variants, hardware requirements, local installation steps using Ollama, LM Studio or Docker, and how to add visual interfaces like Open‑WebUI and Dify for a complete on‑premise AI solution.

DeepSeek · Dify · Hardware Requirements

0 likes · 14 min read

Java Architecture Diary

Aug 7, 2025 · Artificial Intelligence

Run OpenAI’s Open‑Source gpt‑oss Models Locally with Ollama – A Quick Guide

OpenAI’s new open‑source gpt‑oss models, available in 20B and 120B sizes, can be run locally via Ollama with features like agentic capabilities, configurable reasoning, fine‑tuning, and MXFP4 quantization, and the article provides step‑by‑step installation, usage, and integration instructions.

AI models · GPT-OSS · Java

0 likes · 8 min read

Mingyi World Elasticsearch

Aug 4, 2025 · Artificial Intelligence

Building Enterprise‑Grade Semantic Search with Ollama—No External APIs Required

This article walks through the complete design and implementation of a locally deployed, enterprise‑level semantic search system using Ollama for embedding generation and Easysearch for vector retrieval, covering problem analysis, architecture decisions, pipeline configuration, bulk indexing, and hybrid query execution.

Easysearch · Ollama · Search Engine

0 likes · 12 min read

Eric Tech Circle

Aug 3, 2025 · Artificial Intelligence

How to Deploy Qwen3‑Coder Locally and Boost Front‑End Development

This article explains the key improvements of Qwen3‑Coder, walks through two local deployment methods (LM Studio and Ollama), showcases front‑end coding examples, compares performance and hardware requirements, and offers practical recommendations for developers seeking an on‑premise AI coding assistant.

AI code generation · LM Studio · Ollama

0 likes · 7 min read

Full-Stack Cultivation Path

Aug 2, 2025 · Artificial Intelligence

Is There a Design Pattern for AI Workflows? Exploring Prompt Chaining

The article explains how breaking complex LLM tasks into sequential steps—known as prompt chaining—improves answer accuracy, debuggability, flexibility, and enables sophisticated AI workflows such as report generation, chatbots, and content creation using tools like n8n and Ollama.

AI workflow · Automation · Large Language Model

0 likes · 6 min read

Full-Stack Cultivation Path

Jul 26, 2025 · Artificial Intelligence

Step-by-Step Local Deployment Guide for Coze Studio: Launch Your Low-Code AI Agent Development

This article provides a comprehensive, hands‑on tutorial for installing Ollama, Docker, and the open‑source Coze Studio on a local machine, configuring various LLM services such as Qwen 3, DeepSeek‑V3, and OpenRouter, and running the platform via Docker Compose to create and test AI agents.

Coze Studio · Docker · LLM

0 likes · 7 min read

Mingyi World Elasticsearch

Jul 22, 2025 · Artificial Intelligence

Zero-Code Setup: Build a Local Document Knowledge Base with Coco AI 0.7.0

This guide walks you through a completely code‑free, step‑by‑step process to download Coco AI 0.7.0, configure the server and client, set up a local connector, link Ollama models, and verify both simple and deep‑thinking AI modes for document retrieval and intelligent Q&A.

0.7.0 · AI Search · Coco AI

0 likes · 5 min read

Code Mala Tang

Jul 22, 2025 · Artificial Intelligence

Convert Any PDF to Clean Markdown with a Local LLM (Gemma 3)

Learn how to transform any PDF—including scanned documents—into well‑structured Markdown using a local LLM (Gemma 3 via Ollama), Python, PyMuPDF and Pillow, without cloud APIs or API keys, by converting pages to images, prompting the model, and saving the output.

Gemma · LLM · Markdown

0 likes · 12 min read

21CTO

Jul 22, 2025 · Artificial Intelligence

Run Powerful LLMs Locally on <8GB RAM: Top 10 Small Models & Tools

This article explains how advanced quantization and model optimization enable running strong large language models on laptops or desktops with less than 8 GB of RAM or VRAM, outlines key technical concepts, recommends local inference tools, and lists ten compact LLMs with usage commands.

AI · LLM tools · Ollama

0 likes · 10 min read

MaGe Linux Operations

Jul 21, 2025 · Artificial Intelligence

Master Multi‑GPU Load Balancing for OLLAMA: From Zero to Production

This guide walks you through configuring Ollama for multi‑GPU load balancing, covering hardware checks, CUDA setup, native and Docker deployment methods, detailed parameter tuning, advanced sharding strategies, troubleshooting, performance optimization, and production‑grade monitoring to maximize throughput and stability of large language models.

AI Deployment · CUDA · Ollama

0 likes · 16 min read

Ops Development Stories

Jun 30, 2025 · Artificial Intelligence

Build a Private AI Knowledge Assistant with n8n: Zero‑Code RAG in 30 Minutes

This guide shows how to create a fully local Retrieval‑Augmented Generation (RAG) system using n8n, Docker, Ollama and the free Qwen3 embedding model, enabling secure, up‑to‑date AI assistants that answer enterprise questions without exposing any proprietary data.

AI assistant · Docker · Embedding

0 likes · 17 min read

Java Architecture Diary

Jun 9, 2025 · Artificial Intelligence

How Qwen3 Embedding Redefines Multilingual Vector Search Performance

This article examines the Qwen3 Embedding series released by Alibaba's Qwen team, detailing its architecture, multilingual capabilities, benchmark superiority across MTEB and C‑MTEB tests, and provides practical deployment guidance via Ollama and API integration.

AI · Benchmark · Embedding

0 likes · 8 min read

Full-Stack Internet Architecture

Jun 6, 2025 · Artificial Intelligence

How to Build a Spring AI Hello World with Ollama and DeepSeek Locally

This step‑by‑step tutorial shows how to install Ollama, pull the DeepSeek‑R1 model, create a Spring Boot project with the Spring AI Ollama starter, code a ChatController, and test a local AI "Hello World" integration, illustrating AI‑enhanced backend development.

AI integration · DeepSeek · Java

0 likes · 7 min read

Java Architecture Diary

Jun 5, 2025 · Artificial Intelligence

Unlock AI Reasoning: How Ollama’s New ‘Thinking’ Feature Works

Version 0.9.0 of Ollama introduces a ‘thinking’ control that lets users view and manage the AI model’s reasoning process, with detailed CLI commands, REST API usage, model support list, scripting options, and advanced Modelfile configurations for models like DeepSeek R1 and Qwen 3.

AI reasoning · CLI · DeepSeek

0 likes · 6 min read

Eric Tech Circle

May 22, 2025 · Artificial Intelligence

Build a Fast, Zero‑Cost Local AI Knowledge Base with Ollama, Cherry Studio, and Qwen‑3

This guide walks you through building a high‑performance local AI knowledge base using Ollama, Cherry Studio, and the Qwen‑3 model, covering RAG fundamentals, model selection, document preparation, system configuration, and step‑by‑step UI operations for non‑programmers.

AI · Knowledge Base · Ollama

0 likes · 13 min read

Java Architecture Diary

May 19, 2025 · Artificial Intelligence

How Ollama 0.7 Unlocks Local Multimodal AI with One Command

Ollama 0.7 introduces a fully re‑engineered core that brings seamless multimodal model support, lists top visual models, showcases OCR and image analysis capabilities, explains technical breakthroughs, and provides a quick three‑step guide to deploy powerful local AI vision.

AI Engineering · AI models · Ollama

0 likes · 7 min read

Architect's Alchemy Furnace

May 6, 2025 · Operations

Master Ollama Deployment: Optimize Environment Variables for Peak Performance

This guide walks you through cross‑platform environment variable configuration, Docker containerization, GPU resource strategies, concurrency tuning, and security hardening for Ollama, providing practical code snippets and best‑practice tables to unleash its full potential in development and production.

Deployment · GPU · Ollama

0 likes · 14 min read

Architect's Guide

May 2, 2025 · Artificial Intelligence

Deploying a Local High‑Performance AI Service with Spring AI, Ollama, Redis, and Docker

This tutorial walks developers through setting up a low‑cost, containerized AI service on Windows by installing Docker, deploying Redis and Ollama containers, pulling the DeepSeek‑R1 model, and integrating everything with Spring AI to enable continuous conversation support.

AI Deployment · Docker · Java

0 likes · 12 min read

Java Architecture Diary

Apr 29, 2025 · Artificial Intelligence

Why Qwen3 Is the New Powerhouse in Open‑Source AI Models

Qwen3 introduces a suite of open‑source models—from a 235B expert model to compact 0.6B versions—offering competitive performance against top proprietary models, multilingual support, flexible thinking modes, and low deployment requirements, with detailed usage instructions via Ollama and OpenRouter.

Large Language Model · Ollama · Open-source AI

0 likes · 8 min read

Open Source Linux

Apr 14, 2025 · Artificial Intelligence

How to Deploy DeepSeek Locally: Step‑by‑Step Guide for Offline AI

This guide compares DeepSeek’s local and online versions, outlines hardware and privacy advantages of offline deployment, and provides a detailed step‑by‑step tutorial—including Ollama installation, model selection, command execution, and UI plugin setup—to help users run DeepSeek on their own machines.

AI model · DeepSeek · Ollama

0 likes · 6 min read

Ops Development & AI Practice

Apr 6, 2025 · Industry Insights

How VS Code’s New Copilot Agent and Custom LLM Support Redefine AI‑Assisted Development

The VS Code v1.99 update introduces a Copilot Agent mode that deepens project‑level understanding and adds custom LLM integration—including OpenAI, Azure, Gemini, Anthropic, OpenRouter, and locally‑run Ollama—offering developers greater flexibility, cost control, privacy, and strategic advantages in the evolving AI‑IDE landscape.

AI IDE · AI trends · Custom LLM

0 likes · 8 min read

Ops Development & AI Practice

Apr 6, 2025 · Artificial Intelligence

How to Inspect Local LLM Specs with Ollama’s ‘show’ Command

This guide explains how to use the Ollama ‘show’ command to retrieve detailed specifications of locally stored large language models, covering architecture, parameters, context length, embedding size, quantization, capabilities, and licensing information for informed model selection.

AI tools · LLM · Ollama

0 likes · 4 min read

Ops Development & AI Practice

Apr 6, 2025 · Artificial Intelligence

Mastering Ollama Modelfile: Build and Customize Your Own LLM

This guide explains how to retrieve, analyze, and modify an Ollama Modelfile—using commands like `ollama show --modelfile`, dissecting key directives such as FROM, TEMPLATE, LICENSE, PARAMETER, SYSTEM, and ADAPTER—and walks through step‑by‑step creation of a custom model.

AI model · LLM customization · LoRA

0 likes · 9 min read

Qborfy AI

Mar 27, 2025 · Artificial Intelligence

How to Deploy DeepSeek‑R1 Locally with Ollama and Dify: A Step‑by‑Step Guide

This article walks through the entire process of deploying the DeepSeek‑R1 large language model on a personal machine, covering hardware requirements, Ollama installation, model download, service startup, remote access configuration, and visual UI integration with Dify, complete with concrete commands and screenshots.

AI · DeepSeek · Docker

0 likes · 9 min read

Alibaba Cloud Native

Mar 27, 2025 · Cloud Native

Deploy the QwQ‑32B LLM on Alibaba Cloud Function Compute with CAP in Minutes

This guide walks you through deploying the open‑source QwQ‑32B model on Alibaba Cloud Function Compute using the Cloud Application Platform (CAP), covering architecture, required services, account setup, step‑by‑step deployment, cost considerations, model interaction via Open WebUI and Chatbox, scaling configuration, and resource cleanup.

CAP · Function Compute · Ollama

0 likes · 8 min read

AI Algorithm Path

Mar 24, 2025 · Artificial Intelligence

How to Use Pydantic for Structured LLM Output

The article explains why LLM responses can be inconsistent, introduces Pydantic as a way to define custom output schemas, and walks through concrete examples—both with OpenAI and Ollama models—showing how to build a LangChain pipeline that parses responses into structured data.

LLM · LangChain · Ollama

0 likes · 7 min read

MaGe Linux Operations

Mar 21, 2025 · Artificial Intelligence

Step‑by‑Step Guide to Install Ollama and ShellGPT for Local LLM Use

This tutorial walks you through installing Ollama on Windows, configuring and running a local large language model, then setting up ShellGPT on Linux to communicate with Ollama, including configuration files, command examples, and REPL usage.

AI assistant · Ollama · ShellGPT

0 likes · 6 min read

Java Architecture Diary

Mar 19, 2025 · Artificial Intelligence

Unlocking Google’s Gemma 3: Multimodal Power, 128k Context & Local Deployment Guide

This article introduces Google’s open‑source Gemma 3 model, highlighting its multimodal capabilities, massive 128k token context window, multilingual support, and provides step‑by‑step instructions for installing Ollama, pulling the model, and running local tests with code examples.

AI model · Gemma 3 · Large Language Model

0 likes · 7 min read

Architect's Alchemy Furnace

Mar 18, 2025 · Artificial Intelligence

How to Build an AI Agent with Ollama: From Model Setup to Knowledge Base

This step‑by‑step guide shows how to create an AI Agent by configuring a local Ollama model, selecting an embedding model, building a knowledge base, uploading documents, and testing the agent's retrieval capabilities, providing a practical RAG workflow for developers.

AI Agent · Embedding · Ollama

0 likes · 8 min read

Open Source Tech Hub

Mar 13, 2025 · Artificial Intelligence

Build a Private AI Knowledge Base with Webman AI, Redis‑Stack, and Ollama

This guide walks you through setting up a private AI knowledge base using Webman AI 5.4.0, deploying Redis‑Stack, installing the illuminate/redis component, adding Ollama with DeepSeek and other embedding models, configuring Redis, importing training data, running the training process, and configuring role prompts for accurate AI responses.

AI · DeepSeek · Ollama

0 likes · 6 min read

Cognitive Technology Team

Mar 11, 2025 · Artificial Intelligence

Deploying DeepSeek R1:7b Model Locally with Ollama and Building AI Applications Using Dify

This tutorial explains how to set up Ollama for CPU or GPU environments, run the DeepSeek R1:7b large language model, and use the open‑source Dify platform to create and deploy a custom AI application, providing step‑by‑step commands and configuration details.

AI · DeepSeek · Dify

0 likes · 8 min read
