Author

AI Engineering

Focused on cutting‑edge product and technology information and practical experience sharing in the AI field (large models, MLOps/LLMOps, AI application development, AI infrastructure).

240

Articles

Likes

768

Views

Comments

Latest from AI Engineering

100 recent articles max

AI Engineering

Jul 28, 2026 · Artificial Intelligence

Why Veteran Developers Skip Reading Agent-Generated Code

A seasoned programmer explains how imposing strict testing constraints on AI code agents—through a TDD‑style workflow, a multi‑stage gauntlet, and reproducible evidence—lets him trust generated code without manually reviewing each line.

AI agents · TDD · code generation

0 likes · 7 min read


AI Engineering

Jul 27, 2026 · Artificial Intelligence

Low‑Cost Game Development with OpenCode AI: How Inference Routing Drives a Godot Shootout

The article details how OpenCode AI, combined with DigitalOcean's inference router, built a full‑featured Godot 4 penalty‑shootout game in a few hours. Routing 596 tasks across cheap open‑source models cut token costs from $123 to $8.25, and the run also shows how the models compare on performance and latency and when to prefer frontier models.

AI coding · Cost Optimization · Godot

0 likes · 22 min read


AI Engineering

Jul 26, 2026 · Industry Insights

FDE Roles Jump 321% in Six Months – What AI Companies Need Most Now

From February to July 2026, Forward Deployed Engineer positions surged from 28 to 118, a 321% increase that outpaces the overall AI engineering market growth of 134%, highlighting a rapid shift toward customer‑facing AI integration roles.

AI engineering · AI job market · Forward Deployed Engineer

0 likes · 6 min read


AI Engineering

Jul 25, 2026 · Artificial Intelligence

Claude Opus 5 Beats Fable 5 at Half the Price with Superior Coding

Claude Opus 5 launches at the same $5 / M input and $25 / M output pricing as Opus 4.8. It outperforms Fable 5 in coding benchmarks, achieves higher intelligence scores, and costs less per task, though it is slower and shows higher hallucination rates.

AI benchmarking · Claude Opus 5 · Fable 5

0 likes · 7 min read


AI Engineering

Jul 24, 2026 · Artificial Intelligence

Andrew Ng’s OpenWorker: An Out‑of‑the‑Box AI Agent Built for Getting Real Work Done

OpenWorker, the newly open‑sourced AI agent announced by Andrew Ng, lets users specify desired outcomes and automatically breaks tasks into steps, invokes selected LLMs and tools, and delivers completed results—supporting 25+ integrations, local data handling, model‑agnostic operation, and a safety‑first approval workflow.

AI Agent · Andrew Ng · Automation

0 likes · 4 min read


AI Engineering

Jul 23, 2026 · Artificial Intelligence

Is Graph Engineering Really New? Why LangChain Says It’s Not

The article explains that Graph Engineering isn’t a brand‑new concept but an evolution of Prompt, Loop, and Harness engineering, detailing how LangGraph has been used for three years, the core components of graph‑based agents, practical patterns, pitfalls, and when to choose graphs over other approaches.

AI Workflow · Agent Graphs · Dynamic Routing

0 likes · 13 min read


AI Engineering

Jul 21, 2026 · Artificial Intelligence

Google DeepMind Unveils Three New Gemini Models: Lower Token Use, Higher Quality, Same Cost

Google DeepMind released three Gemini models—3.6 Flash, 3.5 Flash‑Lite and 3.5 Flash Cyber—offering up to 65% token savings, nearly double generation speed, improved benchmark scores and enhanced security while keeping pricing stable, with the flagship 3.5 Pro still delayed.

AI model · DeepMind · Gemini

0 likes · 9 min read


AI Engineering

Jul 21, 2026 · Artificial Intelligence

Unsloth Adds AMD Support: Train LLMs on 3 GB VRAM GPUs

Unsloth now supports AMD GPUs with custom ROCm‑optimized Triton kernels, delivering up to double the training speed and 70% lower memory usage, enabling over 500 LLMs to be trained on as little as 3 GB VRAM and providing detailed performance benchmarks on MI300X.

AMD · GPU · LLM

0 likes · 5 min read


AI Engineering

Jul 20, 2026 · Artificial Intelligence

wigolo: Add a Local, API‑Key‑Free Search Brain to Your AI Agent

wigolo is an open‑source tool that bundles search, web crawling, extraction, caching, research and autonomous agent capabilities into a single local MCP server. It eliminates API keys and per‑call fees and preserves data privacy, and the article also covers its configurable design principles and performance comparisons.

AI Agent · MCP · Node.js

0 likes · 7 min read


AI Engineering

Jul 19, 2026 · Artificial Intelligence

Why Most Tokens Are Needlessly Recomputed and How LMCache Goes Beyond KV‑Cache

The article analyzes how up to 62% of tokens in AI agents are redundantly recomputed, explains the limits of prefix caching, and shows how LMCache’s separate‑process architecture and CacheBlend technique dramatically improve KV‑cache hit rates and inference performance.

AI inference · KV cache · LMCache

0 likes · 9 min read
