Su San Talks Tech
Author

Su San, former staff at several leading tech companies, is a top creator on Juejin and a premium creator on CSDN, and runs the free coding practice site www.susan.net.cn.

939 articles · 0 likes · 2.8k views · 0 comments
Recent Articles

Latest from Su San Talks Tech

100 recent articles max
May 13, 2026 · Artificial Intelligence

Cut Claude Code Token Costs by Up to 89% with the Open‑Source RTK CLI

RTK is a high‑performance CLI proxy that filters and compresses command output before it reaches Claude Code’s 200k‑token context window. The article reports a 60‑90% reduction in token consumption and cost savings of up to 89%, and includes step‑by‑step installation and usage instructions.

CLI · Claude Code · LLM
0 likes · 5 min read
May 12, 2026 · Artificial Intelligence

Managing All Claude Code AI Sessions with the New Agent View

Claude Code’s new Agent View gives developers a single terminal interface for launching, monitoring, and controlling multiple AI coding sessions. The article covers commands for background tasks, the Peek preview panel, the Attach deep‑dive mode, git worktree isolation, status icons, and a full shortcut reference, and notes that sessions are not lost when terminals close.

AI Coding · Agent View · CLI
0 likes · 10 min read
May 12, 2026 · Cloud Native

How Nacos 3.2 Evolves into an Enterprise AI Governance Platform

Nacos 3.2 expands beyond a micro‑service registry to become a unified AI asset governance platform, introducing AI Registry, MCP Registry, a three‑layer Skill security sandbox, Copilot assistance, and A2A protocol integration for seamless enterprise AI adoption.

A2A protocol · AI Registry · Copilot
0 likes · 11 min read
May 11, 2026 · Artificial Intelligence

How Google’s Open‑Source MCP Toolbox Secures AI Agent Database Access

The article analyzes the dangers of giving LLMs unrestricted database privileges and explains how Google’s MCP Toolbox design enforces least privilege, structured queries, and authentication. It also provides a step‑by‑step Go integration guide, shares production pitfalls, and compares the use cases that suit the Toolbox with those that suit raw function calling.

AI Agent · MCP Toolbox · database security
0 likes · 18 min read
May 11, 2026 · Artificial Intelligence

Designing a Production‑Ready LLM Gateway: Architecture, Routing, Fallback, and Observability

This article outlines a production‑grade LLM Gateway design. It details a three‑layer architecture; capability‑, cost‑, latency‑, and semantic‑based routing strategies; multi‑level fallback mechanisms; specialized load balancing; unified API adaptation; semantic caching; and observability, and it compares popular open‑source implementations.

Gateway · LLM · Observability
0 likes · 17 min read
May 9, 2026 · Databases

Why Can Redis Handle Over 100,000 QPS? A Deep Technical Breakdown

Redis can sustain over 100,000 queries per second thanks to four key pillars: memory‑first storage, highly optimized data structures such as SDS and skip lists, a single‑threaded event loop with epoll multiplexing, and multi‑core I/O threading. Each is explained with benchmarks, code samples, and real‑world comparisons.

Data Structures · IO Multiplexing · Performance
0 likes · 10 min read
May 7, 2026 · Artificial Intelligence

DeepSeek’s New Claude‑Code‑Style Terminal Agent: An Open‑Source Rust Project

An open‑source Rust‑based terminal agent for DeepSeek V4, dubbed DeepSeek‑TUI, offers Claude‑Code‑like capabilities such as file manipulation, shell execution, git management, parallel sub‑task scheduling, side‑git rollback, and LSP diagnostics, and has quickly attracted thousands of stars and active community contributions.

AI Coding · DeepSeek · LSP
0 likes · 5 min read
May 6, 2026 · Backend Development

11 Essential Redis Use Cases Every Backend Engineer Should Know

This article walks through eleven practical Redis scenarios—from classic caching and distributed locks to rate limiting, leaderboards, timelines, social graph operations, lightweight queues, Bloom filters, hash‑based object storage, unique‑counting, and delayed tasks—providing code samples, advantages, drawbacks, and when to apply each pattern.

Caching · Rate Limiting · Sorted Set
0 likes · 15 min read
May 6, 2026 · Information Security

What Is Prompt Injection? Attack Vectors and Defense Strategies

The article explains prompt injection, a new LLM security threat in which attackers blur the line between instructions and data. It outlines direct and indirect injection techniques, including command overriding, role‑play jailbreaks, encoding obfuscation, and multi‑turn attacks, and proposes a defense‑in‑depth framework of input filtering, prompt design, output validation, least‑privilege architecture, and specialized safeguards for RAG and agent scenarios.

AI safety · Defense in Depth · LLM security
0 likes · 15 min read