ShiZhen AI
ShiZhen AI
Apr 2, 2026 · Artificial Intelligence

How KV Cache Works and Why Large Model Outputs Cost Five Times More Than Inputs

The article explains the KV Cache mechanism that stores previously computed key/value vectors to avoid redundant Transformer calculations, delivering roughly a 5× speedup, while also detailing why generating output tokens is far more expensive than processing input tokens due to serial generation and memory trade‑offs.

KV cacheLLM inferencePrefill
0 likes · 9 min read
How KV Cache Works and Why Large Model Outputs Cost Five Times More Than Inputs
IT Services Circle
IT Services Circle
Aug 28, 2025 · Artificial Intelligence

Why DeepSeek V3.1 Keeps Spitting the ‘Extreme’ Token and How to Fix It

Developers using DeepSeek V3.1's API have reported that the model intermittently inserts the Chinese character “极” (or its variants) into generated code, a bug that spreads across multiple platforms and threatens high‑precision code generation, prompting community workarounds and speculation about its root causes.

AI model bugDeepSeekLLM
0 likes · 6 min read
Why DeepSeek V3.1 Keeps Spitting the ‘Extreme’ Token and How to Fix It
Java Architecture Diary
Java Architecture Diary
Jun 13, 2022 · Information Security

Step-by-Step Guide to OAuth2 Token Generation and Authentication Flow

This article walks through the complete OAuth2 token generation process, covering gateway pre‑processing, client authentication, request handling, authentication object assembly, password validation, token creation, storage options, and response handling with code examples and diagrams.

AuthenticationSpring SecurityToken Generation
0 likes · 5 min read
Step-by-Step Guide to OAuth2 Token Generation and Authentication Flow
Programmer DD
Programmer DD
Mar 26, 2022 · Information Security

What’s New in Spring Authorization Server 0.2.3? Explore Key Features and Code

Spring Authorization Server 0.2.3 introduces default client settings for public clients, splits OAuth2 client authentication providers, optimizes the in‑memory authorization service, adds federated‑identity demos, unifies token generation via OAuth2TokenGenerator, and upgrades core dependencies, with code examples illustrating each enhancement.

JavaOAuth2Spring Authorization Server
0 likes · 7 min read
What’s New in Spring Authorization Server 0.2.3? Explore Key Features and Code
Aotu Lab
Aotu Lab
Aug 19, 2020 · Frontend Development

How Design Tokens Streamline Frontend‑Designer Collaboration Across Platforms

This article explains the challenges of translating design specifications into code, introduces Design Tokens as a solution, and provides step‑by‑step guidance—including variable generation with Theo, custom iOS/Android implementations, mixin creation, and deployment—to improve consistency and efficiency between designers and developers.

Design TokensTheoToken Generation
0 likes · 12 min read
How Design Tokens Streamline Frontend‑Designer Collaboration Across Platforms