Old Zhang's AI Learning
Author

AI practitioner specializing in large-model evaluation and on-premise deployment, agents, AI programming, Vibe Coding, general AI, and broader tech trends. Publishes original technical articles daily.

229 articles · 0 likes · 713 views · 0 comments
Recent Articles

May 11, 2026 · Information Security

Critical CVE-2026-7482 'Bleeding Llama' in Ollama: Why You Must Upgrade Now

Ollama versions before 0.17.1 contain a heap out-of-bounds read vulnerability (CVE-2026-7482, CVSS 9.1). By uploading a malicious GGUF file, an attacker can read server memory, including environment variables and API keys, and exfiltrate the data. More than 300,000 publicly exposed servers are affected, so immediate upgrade and hardening are essential.

API vulnerability · Bleeding Llama · CVE-2026-7482
0 likes · 5 min read
May 11, 2026 · Artificial Intelligence

Ling-2.6-1T: 1T‑Parameter, Fast‑Thinking, Agent‑Ready Model After DeepSeek‑V4

Ant Group's Ling-2.6-1T is a 1-trillion-parameter LLM built for token efficiency and fast thinking. It performs strongly on demanding reasoning and agentic benchmarks, can be deployed locally via vLLM or SGLang, and is available in a quantized 3.6-bit version. The article also includes practical usage tips for developers and knowledge workers.

Agentic Model · Claude Code Integration · Ling-2.6-1T
0 likes · 12 min read
May 9, 2026 · Artificial Intelligence

Claude’s Open‑Source Financial Skills: A Deep Dive

Anthropic's new claude-for-financial-services repository bundles 11 ready-to-run agents, vertical plugins, and 11 MCP data connectors that automate core Wall Street workflows, from pitch decks and earnings reviews to valuation modeling. It also offers clear installation paths and guidance for enterprise customization.

AI agents · Claude · Installation
0 likes · 13 min read
May 9, 2026 · Artificial Intelligence

Why Gemini’s Multimodal RAG with File Search Is So Compelling

The article analyzes Google Gemini’s File Search tool as a fully managed multimodal RAG solution, detailing its architecture, key features, pricing model, step‑by‑step usage, strengths, limitations, and how it compares with OpenAI Assistants File Search and Vertex AI Search.

AI Retrieval · Embedding · File Search
0 likes · 14 min read
May 8, 2026 · Artificial Intelligence

Testing RHTV: Native AI Agent Powers One‑Stop Face‑Swap, Image Refinement, and Video Production

The article evaluates RunningHub's RHTV platform, whose native AI agent integrates face-swap, product-image refinement, and video generation on a single infinite canvas. This removes the fragmented workflow of other tools and enables rapid, controllable short-form video creation, demonstrated with a toothbrush promotion example.

AI agents · AI video generation · RHTV
0 likes · 7 min read
May 7, 2026 · Artificial Intelligence

How Unsloth and NVIDIA Boost Consumer‑GPU LLM Training by ~25% with Three Simple Optimizations

Unsloth and NVIDIA identified three low-level bottlenecks in LLM fine-tuning on consumer GPUs: repeated construction of packed-sequence metadata, serialized copy and compute during gradient checkpointing, and per-expert routing overhead in MoE models. Targeted patches for all three together deliver roughly a 25% speedup without changing hardware, code, or frameworks.

GPU optimization · LLM training · Mixture of Experts
0 likes · 12 min read