Author

AI Engineer Programming

In the AI era, defining problems is often more important than solving them; here we explore AI's contradictions, boundaries, and possibilities.

Articles

Likes

223

Views

Comments

Latest from AI Engineer Programming

95 recent articles

AI Engineer Programming

Jun 13, 2026 · Artificial Intelligence

8 Prompt Templates to Structure AI Reasoning, Review, and Creative Output

These eight prompt templates guide AI through chain-of-thought reasoning, self-review iteration, role-and-constraint framing, parallel solution generation, code-performance analysis, multi-style title creation, meta-prompt nesting, and Socratic questioning, helping users craft structured, reliable, and creative interactions.

AI InteractionChain-of-ThoughtMeta-Prompt

0 likes · 5 min read

8 Prompt Templates to Structure AI Reasoning, Review, and Creative Output

AI Engineer Programming

Jun 11, 2026 · Artificial Intelligence

Understanding LLM Generation Parameters: Temperature, Top‑k, Top‑p, Penalties, and Max Tokens

The article explains how logits are transformed into probabilities via softmax and how generation parameters such as temperature, top‑k, top‑p, frequency‑penalty, presence‑penalty, and max_tokens intervene in the logits‑to‑sampling pipeline, detailing their mechanisms, common misconceptions, and practical limitations.

LLMfrequency_penaltygeneration parameters

0 likes · 15 min read

Understanding LLM Generation Parameters: Temperature, Top‑k, Top‑p, Penalties, and Max Tokens

AI Engineer Programming

Jun 11, 2026 · Artificial Intelligence

How to Build Truly Effective LLM-as-a-Judge Evaluators

The article explains how to construct reliable LLM-as-a-Judge evaluators by combining deterministic code checks for syntactic validation, designing clear semantic evaluation rubrics, choosing appropriate output formats, calibrating with human‑labeled data, mitigating known model biases, and integrating trace‑based monitoring into production workflows.

AI safetyLLM evaluationLLM-as-a-Judge

0 likes · 15 min read

How to Build Truly Effective LLM-as-a-Judge Evaluators

AI Engineer Programming

Jun 9, 2026 · Artificial Intelligence

How Pi Works: Agent Architecture, Tools, Interactive UI, and Skills

The article breaks down Pi, a minimalist programming agent, explaining its two‑layer architecture, the iterative agent loop, a four‑tool set, extensible extensions, layered context construction, and reusable Skills, showing why a clear design, not tool count, determines an agent’s capability.

AI agentContext LayeringExtensions

0 likes · 6 min read

How Pi Works: Agent Architecture, Tools, Interactive UI, and Skills

AI Engineer Programming

Jun 8, 2026 · Artificial Intelligence

Parse vs Extract: When to Use Full Document Parsing vs Targeted Data Extraction for AI

The article explains the fundamental difference between parsing—converting documents into AI‑friendly formats that preserve structure and context—and extraction—pulling predefined fields into structured outputs—while offering concrete scenarios, decision criteria, and example implementations with LlamaParse and LlamaExtract.

AILLMLlamaExtract

0 likes · 10 min read

Parse vs Extract: When to Use Full Document Parsing vs Targeted Data Extraction for AI

AI Engineer Programming

Jun 8, 2026 · Artificial Intelligence

When to Use Small Models: A System Design Perspective

Small models are chosen based on deployment constraints rather than absolute parameter counts; the article outlines how resource limits, latency, cost, privacy, and task characteristics define their suitability, compares their strengths and weaknesses to large models, and offers system‑level design patterns for effective use.

LLM deploymentRAGinference optimization

0 likes · 20 min read

When to Use Small Models: A System Design Perspective

AI Engineer Programming

Jun 7, 2026 · Artificial Intelligence

Why Intent Recognition Is the Decision Hub of Agentic AI Systems

The article explains how intent recognition has evolved from simple keyword matching to a central decision hub in Agentic AI, covering basic concepts, LLM and small‑model solutions, hybrid architectures, clarification and out‑of‑scope handling, multi‑turn challenges, routing, evaluation methods, and best‑practice recommendations.

ClarificationHybrid ArchitectureIntent Recognition

0 likes · 14 min read

Why Intent Recognition Is the Decision Hub of Agentic AI Systems

AI Engineer Programming

Jun 6, 2026 · Artificial Intelligence

How Query Rewriting Boosts Retrieval in RAG Systems

In RAG applications, ambiguous user queries often hinder retrieval effectiveness, so rewriting queries before search—through normalization, synonym expansion, linguistic rules, LLM‑based generation, query decomposition, and multi‑view strategies—can improve relevance, but must avoid over‑expansion, semantic drift, and added latency.

Information RetrievalLLMNatural Language Processing

0 likes · 11 min read

How Query Rewriting Boosts Retrieval in RAG Systems

AI Engineer Programming

Jun 5, 2026 · Artificial Intelligence

Multi‑Hop Reasoning vs Document Parsing: Comparing GraphRAG, LightRAG, AgenticRAG and RAGFlow

The article analyzes the classic vector RAG pipeline, highlights its shortcomings for multi‑hop reasoning and global theme inference, and then systematically compares four open‑source frameworks—GraphRAG, LightRAG, AgenticRAG and RAGFlow—detailing their design choices, processing stages, trade‑offs, limitations, and practical selection guidance for production use.

AgenticRAGGraphRAGLightRAG

0 likes · 17 min read

Multi‑Hop Reasoning vs Document Parsing: Comparing GraphRAG, LightRAG, AgenticRAG and RAGFlow

AI Engineer Programming

Jun 3, 2026 · Artificial Intelligence

Production-Grade Agent Memory: Compaction, Decay, and the Observation Engine

The article presents a comprehensive architecture for production‑grade autonomous agents, detailing failure modes, four distinct memory types, a nightly observation engine that turns patterns into procedural rules, tier‑aware decay scoring, context budgeting, GDPR‑compliant deletion, and a step‑by‑step maintenance pipeline.

Agent MemoryCompactionGDPR compliance

0 likes · 31 min read

Production-Grade Agent Memory: Compaction, Decay, and the Observation Engine