AsiaInfo Technology: New Tech Exploration
Apr 9, 2026 · Artificial Intelligence
How OAG Shrinks a Million‑Token Ontology to 11% While Keeping LLM Reasoning Power
This article presents the OAG (Ontology‑Augmented Generation) architecture, which uses a three‑stage pipeline of semantic filtering, graph‑based path pruning, and format conversion to shrink enterprise‑scale ontologies to roughly 11% of their original token count (a reduction of up to 89%), while limiting inference accuracy loss to around 3% and adding only ~240 ms of latency.
AI agents · LLM · Token Optimization
21 min read
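Before diving in, here is a minimal sketch of what such a three‑stage pipeline could look like in code. Everything in it is an illustrative assumption: the function names (semantic_filter, prune_paths, to_compact_format), the triple‑based Ontology representation, and the keyword‑overlap filter are placeholders, not OAG's actual implementation, which the article details below.

```python
from dataclasses import dataclass

@dataclass
class Ontology:
    # Each entry is a (subject, relation, object) triple; real ontologies
    # (OWL, RDF) carry far richer structure, flattened here for brevity.
    triples: list[tuple[str, str, str]]

def semantic_filter(onto: Ontology, query: str) -> Ontology:
    """Stage 1: keep only triples whose terms overlap the query.
    A production system would use embedding similarity; plain word
    overlap stands in for it here."""
    terms = set(query.lower().split())
    kept = [t for t in onto.triples
            if terms & {w.lower() for part in t for w in part.split()}]
    return Ontology(kept)

def prune_paths(onto: Ontology, roots: set[str], max_hops: int = 2) -> Ontology:
    """Stage 2: graph-based path pruning. Keep only triples reachable
    from the filtered roots within max_hops edges."""
    frontier, seen = set(roots), set()
    for _ in range(max_hops):
        next_frontier = set()
        for triple in onto.triples:
            subj, _, obj = triple
            if subj in frontier and triple not in seen:
                seen.add(triple)
                next_frontier.add(obj)
        frontier = next_frontier
    # Preserve the original triple ordering when rebuilding.
    return Ontology([t for t in onto.triples if t in seen])

def to_compact_format(onto: Ontology) -> str:
    """Stage 3: format conversion. Serialize triples in a terse,
    prompt-friendly notation instead of verbose XML/OWL markup."""
    return "\n".join(f"{s} -{r}-> {o}" for s, r, o in onto.triples)

def compress_ontology(onto: Ontology, query: str) -> str:
    """Compose the three stages into one prompt-ready context string."""
    stage1 = semantic_filter(onto, query)
    roots = {s for s, _, _ in stage1.triples}
    return to_compact_format(prune_paths(stage1, roots))

if __name__ == "__main__":
    # Hypothetical telecom-flavored mini-ontology for demonstration only.
    onto = Ontology([
        ("Customer", "subscribes_to", "Plan"),
        ("Plan", "includes", "DataQuota"),
        ("Invoice", "billed_to", "Customer"),
    ])
    print(compress_ontology(onto, "customer plan"))
```

The composition mirrors the ordering the abstract describes: cheap semantic filtering first shrinks the candidate set, graph pruning then bounds the structural context around what survives, and the terse serialization strips format overhead before anything reaches the prompt.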
