Tagged articles

agent planning

5 articles · Page 1 of 1

Jun 11, 2026 · Artificial Intelligence

How Code Serves as the Harness for AI Agents: Insights from UIUC, Meta, and Stanford

The article analyzes how code—broadly defined as any executable or machine‑checkable artifact—acts as the core harness that connects large language models to the real world, detailing its roles in reasoning, acting, environment modeling, planning, memory, tool use, multi‑agent collaboration, and the safety challenges that arise.

AI agentsLLMMemory Management

0 likes · 11 min read

How Code Serves as the Harness for AI Agents: Insights from UIUC, Meta, and Stanford

AI Insight Log

Feb 18, 2026 · Artificial Intelligence

Claude Sonnet 4.6 Launches on Chinese New Year with Opus-Level Coding Power

Anthropic unveiled Claude Sonnet 4.6 on February 18, touting Opus-level coding ability, a 1 million-token context window, and unchanged pricing; benchmarks show a SWE-bench score of 79.6% (up from 77.2%), OSWorld 72.5% (vs 61.4%), and GPQA Diamond 89.9%, while industry leaders praise its reduced laziness, stronger instruction following, and strategic long-term planning.

AI CodingAnthropicClaude Sonnet 4.6

0 likes · 7 min read

Claude Sonnet 4.6 Launches on Chinese New Year with Opus-Level Coding Power

ShiZhen AI

Feb 17, 2026 · Artificial Intelligence

Sonnet 4.6 Nears Opus Performance While Retaining Sonnet Pricing

Anthropic released Sonnet 4.6 just 12 days after Opus 4.6, delivering near‑Opus capabilities across coding, computer use, long‑context reasoning, and agent planning with a 1 M‑token window, while keeping the lower Sonnet price, prompting mixed community debate and rapid ecosystem adoption.

AI benchmarksAnthropicComputer Use

0 likes · 12 min read

Sonnet 4.6 Nears Opus Performance While Retaining Sonnet Pricing

Architect

Jan 20, 2026 · Artificial Intelligence

Turning AI Agents into Reliable Team Members: Practical Engineering Practices

This guide explains how architects can treat AI agents as controllable teammates by establishing clear plans, managing context, creating verification loops, versioning assets, leveraging parallelism, and applying multi‑layer risk governance to make agent‑driven development safe and efficient.

Engineering PracticesSoftware Architectureagent planning

0 likes · 13 min read

Turning AI Agents into Reliable Team Members: Practical Engineering Practices

Alibaba Cloud Developer

May 22, 2025 · Artificial Intelligence

Why Planning Boosts Multi‑Tool Agent Performance and How to Implement It

This article explains the importance of planning for multi‑tool AI agents, compares OpenAI and Anthropic approaches, presents experimental results, and provides practical guidance on tool design, prompt configuration, model selection, and parallel versus serial tool calls to improve efficiency and effectiveness.

AI agentsAnthropicOpenAI

0 likes · 16 min read

Why Planning Boosts Multi‑Tool Agent Performance and How to Implement It