Old Zhang's AI Learning
Author

Old Zhang's AI Learning

AI practitioner specializing in large-model evaluation and on-premise deployment, agents, AI programming, Vibe Coding, general AI, and broader tech trends, with daily original technical articles.

141
Articles
0
Likes
3
Views
0
Comments
Recent Articles

Latest from Old Zhang's AI Learning

100 recent articles max
Old Zhang's AI Learning
Old Zhang's AI Learning
Feb 28, 2026 · Artificial Intelligence

How to Build a Private AI‑Powered RSS Reading Knowledge Base

The article details a fully automated workflow that fetches 92 top‑tech blogs via RSS, cleans the content into Markdown, uses a MiniMax‑M2.5 LLM to generate concise Chinese summaries, and delivers them through Bark and a Telegram bot, all stored for seamless integration with Obsidian.

AIAutomationBark
0 likes · 10 min read
How to Build a Private AI‑Powered RSS Reading Knowledge Base
Old Zhang's AI Learning
Old Zhang's AI Learning
Feb 27, 2026 · Backend Development

How I Built a Telegram AI Coding Bot (FakeClawBot) Using OpenCode

This article walks through creating a Telegram bot that leverages OpenCode's Server API to provide full AI coding assistance, covering setup, multi‑model integration, core architecture, common pitfalls, and extensible features, all with under 900 lines of Python code.

AI Coding AssistantAutomationLarge Language Model
0 likes · 13 min read
How I Built a Telegram AI Coding Bot (FakeClawBot) Using OpenCode
Old Zhang's AI Learning
Old Zhang's AI Learning
Feb 26, 2026 · Artificial Intelligence

How to Disable Thinking Output in Qwen3.5 Models Using LM Studio

This guide explains how to turn off the reasoning (thinking) output of Qwen3.5 series large language models in LM Studio by creating a virtual “-no‑thinking” model directory, editing a model.yaml file, and handling common pitfalls and error messages.

AI model configurationLM StudioQwen3.5
0 likes · 8 min read
How to Disable Thinking Output in Qwen3.5 Models Using LM Studio
Old Zhang's AI Learning
Old Zhang's AI Learning
Feb 26, 2026 · Artificial Intelligence

Is RAG Dead? Meet Sirchmunk – an Embedding‑Free Search Engine that Ditches Vector Databases

Sirchmunk, an open‑source search engine from Alibaba's ModelScope team, eliminates the need for embeddings and vector databases by using a multi‑stage, Monte‑Carlo‑based pipeline that builds self‑evolving knowledge clusters, offering zero‑setup indexing, real‑time freshness, and flexible integration options.

Embedding-Free RetrievalKnowledge ClustersLLM Search
0 likes · 14 min read
Is RAG Dead? Meet Sirchmunk – an Embedding‑Free Search Engine that Ditches Vector Databases
Old Zhang's AI Learning
Old Zhang's AI Learning
Feb 26, 2026 · Artificial Intelligence

Ultimate Guide to Local Deployment of Qwen3.5 Models (27B‑397B)

This guide reviews the Qwen3.5 model lineup, explains mixed‑inference and MoE architecture, presents benchmark comparisons with GPT‑5.2, Claude 4.5 and Gemini‑3 Pro, evaluates 4‑bit and 3‑bit quantization loss, outlines hardware requirements, and provides step‑by‑step deployment options using llama.cpp or llama‑server.

InferenceLarge Language ModelLocal Deployment
0 likes · 14 min read
Ultimate Guide to Local Deployment of Qwen3.5 Models (27B‑397B)
Old Zhang's AI Learning
Old Zhang's AI Learning
Feb 21, 2026 · Artificial Intelligence

Why Fine‑Tuning Large Models Is Now Ridiculously Easy

The article explains how Unsloth dramatically lowers the barrier to fine‑tuning large language models, offering one‑click installation, free Colab GPU support, extensive model coverage, impressive speed and memory gains, and detailed step‑by‑step guides that let anyone with basic Python skills train powerful models.

ColabGPULoRA
0 likes · 14 min read
Why Fine‑Tuning Large Models Is Now Ridiculously Easy