AsiaInfo Technology: New Tech Exploration
AsiaInfo Technology: New Tech Exploration
Apr 9, 2026 · Artificial Intelligence

How OAG Shrinks a Million‑Token Ontology to 11% While Keeping LLM Reasoning Power

This article presents the OAG (Ontology‑Augmented Generation) architecture, which uses a three‑stage pipeline of semantic filtering, graph‑based path pruning, and format conversion to compress enterprise‑scale ontologies by up to 89% of tokens while limiting inference accuracy loss to around 3% and adding only ~240 ms latency.

AI agentsLLMToken Optimization
0 likes · 21 min read
How OAG Shrinks a Million‑Token Ontology to 11% While Keeping LLM Reasoning Power
Qborfy AI
Qborfy AI
Mar 8, 2026 · Artificial Intelligence

How to Make AI Forget‑Proof: Master Context Compression for Better Answers

This guide explains why AI models hit a "context window" limit, how that leads to selective forgetting and information overload, and provides a step‑by‑step method—extracting key facts, verifying deletions, and re‑using the compressed summary—to keep AI focused on large documents.

AIcontext windowknowledge compression
0 likes · 8 min read
How to Make AI Forget‑Proof: Master Context Compression for Better Answers