Old Zhang's AI Learning
Author

AI practitioner specializing in large-model evaluation and on-premise deployment, agents, AI programming, Vibe Coding, general AI, and broader tech trends, with daily original technical articles.

229 articles · 0 likes · 711 views · 0 comments
Recent Articles

May 14, 2026 · R&D Management

From Topic to Submission: Claude Code’s ARS Pipeline for Academic Papers

The open‑source Academic Research Skills (ARS) suite builds on Claude Code to automate the entire research‑to‑publication workflow. It offers human‑in‑the‑loop quality gates, style calibration, and citation checks at a low token cost of $4‑6 per 15k‑word paper, which makes it especially useful for graduate students and Chinese researchers aiming to publish in English.

AI agents · Academic Research · Claude Code
0 likes · 8 min read
May 14, 2026 · Artificial Intelligence

Boost Qwen3.6 with MTP: 1.5× Faster Local Deployment for Claude Code

The article explains how to enable Multi‑Token Prediction (MTP) in Qwen3.6 using a specific llama.cpp PR, achieving up to 1.5× faster local inference. It covers compilation steps, optimal parameters, memory requirements, and how to integrate the accelerated model with Claude Code while avoiding common pitfalls.

Claude Code · LLM acceleration · MTP
0 likes · 11 min read
May 13, 2026 · Frontend Development

32 Open-Source HTML Slide Templates to Stop AI from Generating Ugly PPTs

The article introduces a GitHub repository offering 32 ready‑to‑use HTML/CSS slide templates designed for AI agents, explains a six‑step workflow for selecting and customizing templates, evaluates the strengths and limitations of the approach, and argues that HTML is re‑emerging as a universal presentation format.

AI agents · Frontend · HTML
0 likes · 9 min read
May 13, 2026 · Artificial Intelligence

Why vLLM Now Leads Open‑Source LLM Inference Benchmarks

vLLM tops the Artificial Analysis ranking by delivering the highest throughput for DeepSeek V3.2, Qwen 3.5 397B, and MiniMax‑M2.5 on identical NVIDIA Blackwell Ultra hardware, thanks to extensive kernel‑fusion optimizations that remain in the main branch.

DeepSeek · LLM inference · Qwen
0 likes · 7 min read
May 11, 2026 · Artificial Intelligence

Open‑Source Qwen3.6‑35B‑A3B Runs at 162 tok/s on a Single RTX 5090

The article introduces the open‑source Qwen3.6‑35B‑A3B model, explains its MoE architecture and three‑stage LoRA fine‑tuning, and presents benchmark results in which it reaches 161.9 tok/s on an RTX 5090, 2.6× faster than a dense 27B counterpart. It also discusses deployment tips, the quantized GGUF release, and known compatibility pitfalls.

GGUF quantization · Large Language Model · LoRA fine-tuning
0 likes · 7 min read