Author

Su San Talks Tech

Su San, former staff at several leading tech companies, is a top creator on Juejin and a premium creator on CSDN, and runs the free coding practice site www.susan.net.cn.

939

Articles

Likes

2.8k

Views

Comments

Latest from Su San Talks Tech

100 recent articles max

Su San Talks Tech

May 13, 2026 · Artificial Intelligence

Cut Claude Code Token Costs by Up to 89% with the Open‑Source RTK CLI

RTK is a high‑performance CLI proxy that filters and compresses command output before it reaches Claude Code’s 200k‑token LLM context, reducing token consumption by 60‑90% and cutting costs up to 89%, with step‑by‑step installation and usage instructions provided.

CLIClaude CodeLLM

0 likes · 5 min read

Cut Claude Code Token Costs by Up to 89% with the Open‑Source RTK CLI

Su San Talks Tech

May 12, 2026 · Artificial Intelligence

Managing All Claude Code AI Sessions with the New Agent View

Claude Code’s new Agent View lets developers open a single terminal interface to launch, monitor, and control multiple AI coding sessions, offering commands for background tasks, a Peek preview panel, Attach deep‑dive mode, git worktree isolation, status icons, and a full shortcut reference, all without losing sessions when terminals close.

AI CodingAgent ViewCLI

0 likes · 10 min read

Managing All Claude Code AI Sessions with the New Agent View

Su San Talks Tech

May 12, 2026 · Cloud Native

How Nacos 3.2 Evolves into an Enterprise AI Governance Platform

Nacos 3.2 expands beyond a micro‑service registry to become a unified AI asset governance platform, introducing AI Registry, MCP Registry, a three‑layer Skill security sandbox, Copilot assistance, and A2A protocol integration for seamless enterprise AI adoption.

A2A protocolAI RegistryCopilot

0 likes · 11 min read

How Nacos 3.2 Evolves into an Enterprise AI Governance Platform

Su San Talks Tech

May 11, 2026 · Artificial Intelligence

How Google’s Open‑Source MCP Toolbox Secures AI Agent Database Access

The article analyzes the dangers of giving LLMs unrestricted database privileges, explains Google’s MCP Toolbox design that enforces least‑privilege, structured queries and authentication, provides a step‑by‑step Go integration guide, shares production pitfalls, and compares suitable use cases versus raw function calling.

AI AgentMCP Toolboxdatabase security

0 likes · 18 min read

How Google’s Open‑Source MCP Toolbox Secures AI Agent Database Access

Su San Talks Tech

May 11, 2026 · Artificial Intelligence

Designing a Production‑Ready LLM Gateway: Architecture, Routing, Fallback, and Observability

This article outlines a production‑grade LLM Gateway design, detailing a three‑layer architecture, capability‑, cost‑, latency‑ and semantic‑based routing strategies, multi‑level fallback mechanisms, specialized load balancing, unified API adaptation, semantic caching, observability, and compares popular open‑source implementations.

GatewayLLMObservability

0 likes · 17 min read

Designing a Production‑Ready LLM Gateway: Architecture, Routing, Fallback, and Observability

Su San Talks Tech

May 9, 2026 · Artificial Intelligence

Build a Personal AI Knowledge Base with Claude Code, Obsidian, and DeepSeek V4

This step‑by‑step tutorial shows how to install Obsidian, add the Claudian plugin, configure Claude Code with DeepSeek V4 via the CC Switch desktop tool, and enable local AI‑powered search and summarisation across your markdown notes.

AI knowledge baseClaude CodeClaudian

0 likes · 6 min read

Build a Personal AI Knowledge Base with Claude Code, Obsidian, and DeepSeek V4

Su San Talks Tech

May 9, 2026 · Databases

Why Can Redis Handle Over 100,000 QPS? A Deep Technical Breakdown

Redis can sustain over 100,000 queries per second thanks to four key pillars—memory‑first storage, highly optimized data structures like SDS and skip lists, a single‑threaded event loop with epoll multiplexing, and multi‑core I/O threading—each explained with benchmarks, code samples, and real‑world comparisons.

Data StructuresIO MultiplexingPerformance

0 likes · 10 min read

Why Can Redis Handle Over 100,000 QPS? A Deep Technical Breakdown

Su San Talks Tech

May 7, 2026 · Artificial Intelligence

DeepSeek’s New Claude‑Code‑Style Terminal Agent: An Open‑Source Rust Project

An open‑source Rust‑based terminal agent for DeepSeek V4, dubbed DeepSeek‑TUI, offers Claude‑Code‑like capabilities such as file manipulation, shell execution, git management, parallel sub‑task scheduling, side‑git rollback, and LSP diagnostics, and has quickly attracted thousands of stars and active community contributions.

AI CodingDeepSeekLSP

0 likes · 5 min read

DeepSeek’s New Claude‑Code‑Style Terminal Agent: An Open‑Source Rust Project

Su San Talks Tech

May 6, 2026 · Backend Development

11 Essential Redis Use Cases Every Backend Engineer Should Know

This article walks through eleven practical Redis scenarios—from classic caching and distributed locks to rate limiting, leaderboards, timelines, social graph operations, lightweight queues, Bloom filters, hash‑based object storage, unique‑counting, and delayed tasks—providing code samples, advantages, drawbacks, and when to apply each pattern.

CachingRate LimitingSorted Set

0 likes · 15 min read

11 Essential Redis Use Cases Every Backend Engineer Should Know

Su San Talks Tech

May 6, 2026 · Information Security

What Is Prompt Injection? Attack Vectors and Defense Strategies

The article explains that Prompt injection is a new LLM security threat where attackers blur the line between instruction and data, outlines direct and indirect injection techniques—including command overriding, role‑play jailbreaks, encoding obfuscation, and multi‑turn attacks—and proposes a defense‑in‑depth framework with input filtering, prompt design, output validation, least‑privilege architecture, and specialized safeguards for RAG and agent scenarios.

AI safetyDefense in DepthLLM security

0 likes · 15 min read

What Is Prompt Injection? Attack Vectors and Defense Strategies