Tagged articles
8 articles
Page 1 of 1
Java Tech Enthusiast
Java Tech Enthusiast
Jun 8, 2026 · Artificial Intelligence

How Claude Code, Codex, and OpenCode Can Cut Token Usage by Up to 80%

The article breaks down token billing, shows that input tokens account for 70‑90% of cost, and provides concrete techniques—file filtering, context compression, doc‑driven prompts, memory caching, plan mode, output trimming, and model switching—across Claude Code, Codex, and OpenCode, culminating in a 10‑step checklist and a comparison table that demonstrate up to 80% token savings.

AI codingClaude CodeCodex
0 likes · 11 min read
How Claude Code, Codex, and OpenCode Can Cut Token Usage by Up to 80%
Architect's Tech Stack
Architect's Tech Stack
Jun 4, 2026 · Artificial Intelligence

How TencentDB Agent Memory Cuts Token Usage by 61% and Boosts Task Success

TencentDB Agent Memory, an open‑source hierarchical memory system for long‑running AI agents, offloads tool calls, structures short‑term and four‑layer long‑term memories, and reduces token consumption by 61% while raising task success rate 51% and persona accuracy from 48% to 76%, all running locally with SQLite and no API keys.

AI agentsOpenClawSQLite
0 likes · 4 min read
How TencentDB Agent Memory Cuts Token Usage by 61% and Boosts Task Success
AI Architecture Path
AI Architecture Path
Jun 3, 2026 · Artificial Intelligence

How Headroom Cuts Claude Code Token Usage by Up to 95% Without Losing Accuracy

Headroom is a locally run, reversible context‑compression layer for Claude Code that reduces input tokens by 60‑95 % without sacrificing precision, eliminates context‑limit errors, cuts token costs, protects privacy, and enables seamless memory sharing across multiple AI coding agents, as demonstrated by real‑world benchmarks.

AI codingClaude CodeHeadroom
0 likes · 15 min read
How Headroom Cuts Claude Code Token Usage by Up to 95% Without Losing Accuracy
AI Engineering
AI Engineering
May 16, 2026 · Backend Development

Cut 92% of Claude Code Tool Calls for Large Codebases with CodeGraph

CodeGraph builds a semantic knowledge graph of a codebase so Claude Code can query the graph instead of scanning files, reducing tool calls by an average of 92% and speeding up exploration by 71% across multiple large, multi‑language projects.

AI code assistanceClaude CodeToken Reduction
0 likes · 6 min read
Cut 92% of Claude Code Tool Calls for Large Codebases with CodeGraph
Machine Heart
Machine Heart
May 12, 2026 · Artificial Intelligence

DECS Cuts Overthinking in Models: Halve Inference Tokens and Raise Accuracy

DECS, a novel training framework introduced by researchers from Fudan, Shanghai Jiao Tong, and the Shanghai AI Lab, theoretically exposes the flaws of length‑penalty rewards and, through token‑level reward decoupling and dynamic batch scheduling, reduces inference token counts by over 50% while improving accuracy across multiple benchmarks.

DECSToken Reductionbenchmark evaluation
0 likes · 9 min read
DECS Cuts Overthinking in Models: Halve Inference Tokens and Raise Accuracy
Java Web Project
Java Web Project
May 1, 2026 · Artificial Intelligence

How a Single Command Cuts AI Coding Token Usage from 210K to 23K

The article explains why AI coding tools waste hundreds of thousands of tokens on noisy terminal output, presents official data showing a typical two‑hour session generating 210,000 useless tokens, and demonstrates how the open‑source Rust Token Killer (RTK) filters output to save up to 80% of tokens with a single command.

AI codingCLIProductivity
0 likes · 4 min read
How a Single Command Cuts AI Coding Token Usage from 210K to 23K
AI Engineering
AI Engineering
Apr 2, 2026 · Artificial Intelligence

Cut Claude Code’s Fluff with 8 Lines: Slash Output Tokens by 63%

By adding an eight‑line CLAUDE.md file that suppresses polite openings, repetitions, and unnecessary explanations, developers reduced Claude Code’s output token count by 63% without losing information, achieving up to 75% shorter code reviews and 64% shorter concept explanations, as verified by independent benchmarks.

AutomationClaudeGitHub
0 likes · 4 min read
Cut Claude Code’s Fluff with 8 Lines: Slash Output Tokens by 63%
AI Open-Source Efficiency Guide
AI Open-Source Efficiency Guide
Mar 26, 2026 · Artificial Intelligence

OpenSpace: HKU’s Open‑Source AI Agent Engine Cuts Tokens by 46% and Boosts ROI 4.2×

OpenSpace is an open‑source, self‑evolving AI agent engine that supports major agent frameworks, reduces token consumption by 46%, achieves a 4.2‑fold return on 50 professional tasks across six industries using the Qwen 3.5‑Plus model, and provides auto‑fix, auto‑improve, and auto‑learn capabilities for collective intelligence.

AI AgentOpenSourceSelf‑Evolution
0 likes · 9 min read
OpenSpace: HKU’s Open‑Source AI Agent Engine Cuts Tokens by 46% and Boosts ROI 4.2×