Author

AI Engineer Programming

In the AI era, defining problems is often more important than solving them; here we explore AI's contradictions, boundaries, and possibilities.

Articles

Likes

223

Views

Comments

Latest from AI Engineer Programming

95 recent articles

AI Engineer Programming

Jun 23, 2026 · Artificial Intelligence

Why Data Lineage Is the Final Piece of RAG Governance

The article explains how data lineage in Retrieval‑Augmented Generation systems links data quality, ingestion, and incremental sync into a traceable whole, detailing the five lineage nodes, schema trade‑offs, storage choices, and how lineage supports debugging, impact analysis, and version control.

RAGdata governancedata lineage

0 likes · 15 min read

Why Data Lineage Is the Final Piece of RAG Governance

AI Engineer Programming

Jun 22, 2026 · Artificial Intelligence

Ensuring Consistent Incremental Sync in RAG Systems (Part 2)

The article examines how incremental synchronization, index stability, shadow‑index atomic switching, checkpointing, idempotency, backpressure handling, batch‑vs‑streaming trade‑offs, and multi‑layer validation (count reconciliation, content sampling, and retrieval regression) together keep vector‑based RAG knowledge bases reliable and up‑to‑date.

RAGdata governanceincremental sync

0 likes · 13 min read

Ensuring Consistent Incremental Sync in RAG Systems (Part 2)

AI Engineer Programming

Jun 21, 2026 · Artificial Intelligence

RAG Data Governance: Incremental Sync and Consistency (Part 1)

The article explains how additions, updates, and deletions affect a vector store differently, outlines three layers of incremental synchronization—change detection, change handling, and service stability—and compares timestamp polling, content‑hash diffing, and CDC while discussing consistency models and conflict resolution in distributed vector databases.

CDCRAGconsistency

0 likes · 16 min read

RAG Data Governance: Incremental Sync and Consistency (Part 1)

AI Engineer Programming

Jun 20, 2026 · Artificial Intelligence

RAG Data Ingestion: Managing Heterogeneous Sources and Unified Metadata

The article analyzes common pitfalls in RAG data ingestion—connection failures and incomplete records—advocates defining required metadata fields before integration, and provides source‑specific guidelines for databases, APIs, object storage, web crawlers, and manual uploads to ensure reliable downstream governance.

AIETLRAG

0 likes · 17 min read

RAG Data Ingestion: Managing Heterogeneous Sources and Unified Metadata

AI Engineer Programming

Jun 19, 2026 · Artificial Intelligence

RAG Data Quality: Old Problems in a New Bottle

Even with meticulous cleaning, residual noise, redundant legal clauses, and approximate duplicates can degrade retrieval and generation in RAG systems, while privacy risks from embedding inversion and the need for continuous, metric‑driven governance make data quality the ultimate ceiling for performance.

Data QualityEmbedding InversionLLM Retrieval

0 likes · 8 min read

RAG Data Quality: Old Problems in a New Bottle

AI Engineer Programming

Jun 18, 2026 · Artificial Intelligence

RAG Data Governance: Pre‑Ingestion Data Quality Challenges (Part 1)

The article analyzes how RAG systems inherit classic data‑quality problems, explains why clean input is essential for retrieval and generation, outlines historical GIGO lessons, highlights new risks introduced by vectorization and LLMs, and reviews practical chunking and governance strategies to mitigate hidden failures.

ChunkingData QualityLLM

0 likes · 18 min read

RAG Data Governance: Pre‑Ingestion Data Quality Challenges (Part 1)

AI Engineer Programming

Jun 17, 2026 · Artificial Intelligence

Local LLMs Viable: Sparse Attention, MoE, KV Compression, Multi‑Token Prediction

In early 2026, open‑source local large language models become practical alternatives thanks to sparse attention, MoE routing, latent KV compression, multi‑token prediction, and 4‑bit quantization, while hardware memory shortages and benchmark gaps with closed‑source models shape their deployment choices.

4-bit quantizationKV compressionMixture of Experts

0 likes · 13 min read

Local LLMs Viable: Sparse Attention, MoE, KV Compression, Multi‑Token Prediction

AI Engineer Programming

Jun 16, 2026 · Artificial Intelligence

Why AI Agents Enhance, Not Replace, Code Review Workflows

The article analyzes how AI agents improve code review by using multi‑step reasoning, context engineering, graph‑based code understanding, hybrid LLM‑static analysis, and multi‑agent orchestrator‑worker architectures, while discussing design challenges, open‑source implementations, and inherent limitations.

AI agentsContext engineeringLLM

0 likes · 14 min read

Why AI Agents Enhance, Not Replace, Code Review Workflows

AI Engineer Programming

Jun 15, 2026 · R&D Management

Why and How to Conduct Code Reviews: From Traditional Practices to AI Agents (Part 1)

The article explains why code review matters, how it should be performed, and how the rise of AI‑generated code reshapes review practices, introducing a five‑level review taxonomy and a methodology that combines atomic pull requests, layered reading, comment grading, and measurable SLAs.

AI agentsLLMcode review

0 likes · 13 min read

Why and How to Conduct Code Reviews: From Traditional Practices to AI Agents (Part 1)

AI Engineer Programming

Jun 14, 2026 · Artificial Intelligence

10 RAG Architectures Every AI Engineer Should Master

The article debunks the claim that Retrieval‑Augmented Generation is obsolete, explains why huge context windows are impractical, and systematically presents ten RAG patterns—from basic Naïve RAG to advanced Graph and Multimodal RAG—detailing their trade‑offs, costs, and suitable use cases.

AI architectureEmbedding ModelsRAG

0 likes · 16 min read

10 RAG Architectures Every AI Engineer Should Master