Tagged articles
47 articles
Page 1 of 1
Data Party THU
Data Party THU
Apr 28, 2026 · Artificial Intelligence

How MiniMax Drives Joint Evolution of Models and Harnesses

The article analyzes MiniMax’s strategy of co‑evolving large language models with a Harness framework, contrasting product philosophies, detailing a live MaxHermes demo that creates and refines reusable Skills, and explaining how this dual evolution reshapes the competitive focus from single‑turn Q&A to sustained, self‑improving agent workflows.

AI agentsHermesMiniMax
0 likes · 14 min read
How MiniMax Drives Joint Evolution of Models and Harnesses
CodeTrend
CodeTrend
Apr 21, 2026 · Artificial Intelligence

AI Agents for Beginners: A Zero‑Prerequisite Course Overview

This article breaks down Microsoft’s open‑source AI‑Agent learning repository, explaining core concepts, five design patterns, production deployment considerations, and emerging protocols, while offering practical engineering guidance for building reliable multi‑agent systems from scratch.

AI agentsAgentic RAGMetacognition
0 likes · 10 min read
AI Agents for Beginners: A Zero‑Prerequisite Course Overview
Baidu Geek Talk
Baidu Geek Talk
Apr 15, 2026 · Artificial Intelligence

Unveiling Claude Code: How Rules, MCP, and Skills Power the Coding Agent

This article dissects the leaked Claude Code v2.1.88 source to reveal how the three core concepts—Rules, MCP, and Skills—are implemented, where they are injected in the Anthropic LLM API request, and when developers should prefer each mechanism for reliable, secure, and token‑efficient coding agent workflows.

Claude CodeLLM agentsMCP
0 likes · 25 min read
Unveiling Claude Code: How Rules, MCP, and Skills Power the Coding Agent
Machine Learning Algorithms & Natural Language Processing
Machine Learning Algorithms & Natural Language Processing
Apr 8, 2026 · Artificial Intelligence

Understanding OpenClaw: Inside the AI Agent Framework Explained by Prof. Li Hongyi

In this detailed lecture, Prof. Li Hongyi of National Taiwan University dissects the OpenClaw AI Agent, explaining its system prompts, tool usage, memory handling, sub‑agents, security risks like prompt injection, and practical safeguards for deploying autonomous agents on personal computers.

AI AgentContext EngineeringOpenClaw
0 likes · 35 min read
Understanding OpenClaw: Inside the AI Agent Framework Explained by Prof. Li Hongyi
AI Open-Source Efficiency Guide
AI Open-Source Efficiency Guide
Apr 1, 2026 · Artificial Intelligence

Build an AI Agent Harness from Scratch: Deep Dive into Claude Code Architecture

This article walks developers through the learn-claude-code project, teaching them how to construct a Claude‑style AI Agent Harness by covering twelve progressive lessons, core concepts such as agents, harnesses, sub‑agents, context compression, task management, and providing runnable Python examples and architectural diagrams.

AI AgentAgent HarnessClaude Code
0 likes · 13 min read
Build an AI Agent Harness from Scratch: Deep Dive into Claude Code Architecture
Su San Talks Tech
Su San Talks Tech
Mar 26, 2026 · Artificial Intelligence

Unlocking AI Agents: How OpenClaw Turns Language Models into Actionable Bots

This article explains how OpenClaw functions as an AI Agent framework that connects chat applications to large language models, manages multi‑turn dialogues, executes tool commands, handles memory and security, and demonstrates advanced features such as sub‑agents, cron jobs, and context compression.

AI AgentMemory ManagementOpenClaw
0 likes · 19 min read
Unlocking AI Agents: How OpenClaw Turns Language Models into Actionable Bots
AI Engineer Programming
AI Engineer Programming
Mar 25, 2026 · Artificial Intelligence

What Is an AI Agent? Definition, Core Capabilities, and Architecture

The article explains AI agents as autonomous systems that perceive environments, plan, use tools, iterate through action loops, and self‑reflect, contrasting them with traditional chatbots and workflows, and outlines their core abilities, memory types, tool‑use mechanisms, and single‑ versus multi‑agent architectures.

AI AgentMemoryMulti-Agent
0 likes · 8 min read
What Is an AI Agent? Definition, Core Capabilities, and Architecture
Full-Stack Cultivation Path
Full-Stack Cultivation Path
Mar 25, 2026 · Artificial Intelligence

Understanding Tool Use in LLMs: How Models Leverage Tool Calls

This article explains why large language models need tool use, defines the concepts of Tool Use, Tool Call, and Function Calling, compares them, walks through a complete tool‑use workflow, and discusses architectural, safety, and design considerations for building reliable LLM agents.

AgentLLMPrompt engineering
0 likes · 17 min read
Understanding Tool Use in LLMs: How Models Leverage Tool Calls
Data STUDIO
Data STUDIO
Mar 24, 2026 · Artificial Intelligence

Turn LLMs into Real Assistants: Build a Tool‑Using Agent in Minutes

This article explains why large language models alone can hallucinate, introduces the tool‑using agent architecture, and provides a step‑by‑step Python tutorial using LangChain, LangGraph, and Tavily to create, run, and evaluate a real‑time web‑search capable AI assistant.

AgentLLMLangChain
0 likes · 16 min read
Turn LLMs into Real Assistants: Build a Tool‑Using Agent in Minutes
Java One
Java One
Mar 23, 2026 · Artificial Intelligence

Master Claude Code: From Installation to Advanced AI‑Powered Development Workflows

This comprehensive guide walks you through the Claude Certified Architect exam structure, explains how Claude Code turns a language model into a full‑featured programming assistant, details installation on macOS, Linux and Windows, demonstrates tool‑use mechanics, context management, custom commands, and extending functionality with MCP servers such as Playwright.

AIClaudeCoding Assistant
0 likes · 27 min read
Master Claude Code: From Installation to Advanced AI‑Powered Development Workflows
Machine Learning Algorithms & Natural Language Processing
Machine Learning Algorithms & Natural Language Processing
Mar 6, 2026 · Artificial Intelligence

Why Reasoning and Tool-Use Clash in Agentic RL—and How DART Solves It

Recent studies reveal that in Agentic RL, jointly training reasoning and tool-use on shared parameters creates a persistent negative interaction, with gradients nearly orthogonal, limiting performance; a disentangled tuning approach (DART) using separate LoRA adapters isolates the two abilities and restores gains across benchmarks.

DARTGradient InterferenceLoRA
0 likes · 12 min read
Why Reasoning and Tool-Use Clash in Agentic RL—and How DART Solves It
Mingyi World Elasticsearch
Mingyi World Elasticsearch
Mar 5, 2026 · Artificial Intelligence

Build a Natural‑Language Easysearch Assistant with LLM‑Powered Tool Use (No DSL Required)

This article shows how to create an Easysearch intelligent assistant that lets users manage indexes, write data, search and aggregate documents using Chinese natural language, by combining the DeepSeek large‑language model with OpenAI‑compatible function calling (Tool Use) and a lightweight Node.js executor.

DeepSeekEasysearchLLM
0 likes · 12 min read
Build a Natural‑Language Easysearch Assistant with LLM‑Powered Tool Use (No DSL Required)
AI Tech Publishing
AI Tech Publishing
Feb 15, 2026 · Artificial Intelligence

Mastering Agent Tool Use: Adding Search, Time, and Calculator Functions

This tutorial extends a minimal LLM Agent loop by introducing Tool Use (function calling) to give the agent actionable capabilities—searching the web, retrieving the current datetime, and performing mathematical calculations—while explaining the BaseTool architecture, registration process, system‑prompt adjustments, and practical execution examples.

AI AgentBaseToolFunction Calling
0 likes · 15 min read
Mastering Agent Tool Use: Adding Search, Time, and Calculator Functions
Alibaba Cloud Developer
Alibaba Cloud Developer
Jan 28, 2026 · Artificial Intelligence

How We Built a High‑Performance AI Rental Advisor with One‑Model Tool‑Use and Reinforcement Learning

This article details the design, challenges, and performance gains of an AI‑driven rental recommendation system that replaces a multi‑agent architecture with a single LLM using dynamic tool‑use, introduces a two‑stage reinforcement‑learning pipeline, and achieves sub‑second latency and higher accuracy for complex rental scenarios.

AI recommendationSystem ArchitectureTool Use
0 likes · 19 min read
How We Built a High‑Performance AI Rental Advisor with One‑Model Tool‑Use and Reinforcement Learning
AI Engineering
AI Engineering
Jan 10, 2026 · Artificial Intelligence

Teaching LLMs to Manage Memory Autonomously, Dropping Manual Rules

Alibaba's new AgeMem framework turns long‑term and short‑term memory management for large language model agents into a learnable reinforcement‑learning task, replacing handcrafted rules with a three‑stage training process and achieving significant benchmark gains.

AgeMemBenchmarkGRPO
0 likes · 9 min read
Teaching LLMs to Manage Memory Autonomously, Dropping Manual Rules
Frontend AI Walk
Frontend AI Walk
Dec 14, 2025 · Artificial Intelligence

Advanced AI Agent Skills: Behind the Scenes and a Developer’s Guide

This article explains the origins of AI Agent Skills, walks through the standard tool‑use loop, provides step‑by‑step code for defining and invoking Skills with the OpenAI API, compares Skills with the emerging Model Context Protocol (MCP), and offers practical guidance for developers and architects building AI‑enabled systems.

AI Agent SkillsFunction CallingMCP
0 likes · 11 min read
Advanced AI Agent Skills: Behind the Scenes and a Developer’s Guide
AI Info Trend
AI Info Trend
Nov 25, 2025 · Artificial Intelligence

Why Claude Opus 4.5 Is the New Powerhouse for Enterprise AI Agents

Claude Opus 4.5, Anthropic’s latest flagship LLM, dramatically upgrades reasoning, tool use, and multi‑step automation, targeting high‑intensity enterprise scenarios, offering stronger coding, longer context handling, and better cost‑effectiveness, while still requiring careful prompt engineering and budgeting for token usage.

Claude Opus 4.5Coding AutomationEnterprise AI
0 likes · 7 min read
Why Claude Opus 4.5 Is the New Powerhouse for Enterprise AI Agents
AI Tech Publishing
AI Tech Publishing
Nov 17, 2025 · Artificial Intelligence

Frontier AI Models in RL Environments Reveal an Agent Capability Hierarchy

The article evaluates nine cutting‑edge AI models on 150 simulated workplace tasks, showing that even the strongest models complete fewer than 40% of tasks, and uses these results to propose a hierarchical framework of agentic capabilities ranging from tool use to common‑sense reasoning.

AI model evaluationTool Useagentic capabilities
0 likes · 19 min read
Frontier AI Models in RL Environments Reveal an Agent Capability Hierarchy
Meituan Technology Team
Meituan Technology Team
Nov 3, 2025 · Artificial Intelligence

Introducing VitaBench: A Real-World Agent Benchmark That Reveals a 30% Success Gap

VitaBench, a new open‑source benchmark from Meituan’s LongCat team, evaluates LLM‑driven agents across three realistic life‑service scenarios—food ordering, restaurant dining, and travel planning—using 66 tools and quantifying reasoning, tool, and interaction complexities, exposing a mere 30% success rate on complex cross‑scene tasks.

AIAgentBenchmark
0 likes · 14 min read
Introducing VitaBench: A Real-World Agent Benchmark That Reveals a 30% Success Gap
Instant Consumer Technology Team
Instant Consumer Technology Team
Oct 28, 2025 · Artificial Intelligence

How 7B AgentFlow Beats 200B GPT-4o: Small Models, Big Wins

AgentFlow, a Stanford-led multi‑agent system built on a 7B model, outperforms massive models like GPT‑4o across ten benchmarks by leveraging modular agents, on‑policy learning, and a novel Flow‑GRPO training engine that solves sparse‑reward, long‑horizon challenges.

AgentFlowSmall Model PerformanceTool Use
0 likes · 12 min read
How 7B AgentFlow Beats 200B GPT-4o: Small Models, Big Wins
DataFunTalk
DataFunTalk
Oct 22, 2025 · Artificial Intelligence

Introducing VitaBench: A Real-World Benchmark for Complex LLM Agents

VitaBench is a newly released, highly realistic benchmark that evaluates large‑language‑model agents across three everyday scenarios—food ordering, restaurant dining, and travel planning—by quantifying reasoning, tool‑use, and interaction complexities, revealing a significant performance gap in current models.

AI EvaluationBenchmarkLLM agents
0 likes · 13 min read
Introducing VitaBench: A Real-World Benchmark for Complex LLM Agents
Tech Stroll Journey
Tech Stroll Journey
Oct 15, 2025 · Artificial Intelligence

From Tools to Autonomous Employees: Understanding AI Agents

This article explains AI Agents by contrasting them with traditional AI tools, detailing their official definition, core components—planning, tool use, memory, action—illustrating a travel‑planning example, outlining agent types, and highlighting their significance for AGI and real‑world applications.

AI AgentAutonomous AIMemory
0 likes · 7 min read
From Tools to Autonomous Employees: Understanding AI Agents
Baobao Algorithm Notes
Baobao Algorithm Notes
Sep 23, 2025 · Artificial Intelligence

How LongCat-Flash-Thinking Sets New SOTA in Open‑Source AI Inference

LongCat-Flash-Thinking, the latest open‑source model from Meituan, introduces domain‑parallel RL training, a high‑throughput DORA infra, and a dual‑path inference framework that together achieve state‑of‑the‑art performance on logical, mathematical, coding, and agentic tasks while maintaining top‑tier speed.

BenchmarkInferenceLongCat
0 likes · 10 min read
How LongCat-Flash-Thinking Sets New SOTA in Open‑Source AI Inference
Meituan Technology Team
Meituan Technology Team
Sep 22, 2025 · Artificial Intelligence

LongCat-Flash-Thinking: The New SOTA Open-Source LLM for Deep Reasoning and Tool Use

Meituan’s LongCat team unveiled LongCat-Flash-Thinking, an open‑source large language model that combines deep logical reasoning with tool‑calling capabilities, achieving state‑of‑the‑art performance across logic, mathematics, code, and agentic tasks, and introducing novel training frameworks such as domain‑parallel RL and DORA.

AIBenchmarkTool Use
0 likes · 7 min read
LongCat-Flash-Thinking: The New SOTA Open-Source LLM for Deep Reasoning and Tool Use
AI Algorithm Path
AI Algorithm Path
Jul 14, 2025 · Artificial Intelligence

The Most Powerful Open‑Source Agent Model: Kimi K2

Kimi K2, an open‑source trillion‑parameter AI model released by Moonshot AI, offers Base and Instruct variants, achieves leading scores on benchmarks such as SWE‑bench, LiveCodeBench and AceBench, and introduces a novel post‑training autonomous‑exploration stage with MuonClip optimization to enable robust tool use and reinforcement‑learning‑driven self‑improvement.

Autonomous AgentsKimi K2Tool Use
0 likes · 8 min read
The Most Powerful Open‑Source Agent Model: Kimi K2
Instant Consumer Technology Team
Instant Consumer Technology Team
Jun 23, 2025 · Artificial Intelligence

What Are AI Agents? Architecture, Applications, and Future Trends

AI Agents, autonomous intelligent programs that perceive, reason, and act, are reshaping industries from healthcare to autonomous driving; this article explains their core components, differences from large language models, planning techniques, memory mechanisms, tool use, real‑world applications, current challenges, and future directions.

AI AgentApplicationsAutonomous AI
0 likes · 35 min read
What Are AI Agents? Architecture, Applications, and Future Trends
Tencent Technical Engineering
Tencent Technical Engineering
Jun 16, 2025 · Artificial Intelligence

Mastering RAG and AI Agents: Practical Tips, Code Samples, and Evaluation Strategies

This comprehensive guide walks you through the fundamentals of Retrieval‑Augmented Generation (RAG) and AI agents, explains their inner workings, shares optimization tricks, provides ready‑to‑run code snippets, and demonstrates how to evaluate performance with metrics such as recall, faithfulness, and answer relevance.

AI agentsLLMPrompt engineering
0 likes · 36 min read
Mastering RAG and AI Agents: Practical Tips, Code Samples, and Evaluation Strategies
Alibaba Cloud Developer
Alibaba Cloud Developer
May 22, 2025 · Artificial Intelligence

Why Planning Boosts Multi‑Tool Agent Performance and How to Implement It

This article explains the importance of planning for multi‑tool AI agents, compares OpenAI and Anthropic approaches, presents experimental results, and provides practical guidance on tool design, prompt configuration, model selection, and parallel versus serial tool calls to improve efficiency and effectiveness.

AI agentsAgent planningAnthropic
0 likes · 16 min read
Why Planning Boosts Multi‑Tool Agent Performance and How to Implement It
Tencent Cloud Developer
Tencent Cloud Developer
May 8, 2025 · Artificial Intelligence

Advances and Future of AI Agents: Capabilities, Trends, and Applications

AI agents are rapidly evolving toward a 2025 breakthrough in perception, autonomous planning, tool use and memory, driven by multimodal models, neural‑symbolic reasoning and embodied intelligence, with $27 billion investment forecasts, exemplified by general‑purpose agents like Manus and emerging applications in code generation, research, healthcare, and risk analysis.

AI AgentAgent FrameworkAutonomous Planning
0 likes · 12 min read
Advances and Future of AI Agents: Capabilities, Trends, and Applications
Data Thinking Notes
Data Thinking Notes
Apr 15, 2025 · Artificial Intelligence

Understanding AI Agents: From Reinforcement Learning to LLM-Powered Planning

Professor Li Hongyi’s lecture provides a comprehensive, step‑by‑step exploration of AI agents, covering their definitions, reinforcement‑learning roots, LLM integration, memory mechanisms, tool usage, planning strategies, benchmarks, and practical examples, offering a valuable resource for anyone studying modern artificial intelligence.

AI agentsBenchmarkMemory
0 likes · 67 min read
Understanding AI Agents: From Reinforcement Learning to LLM-Powered Planning
AI Algorithm Path
AI Algorithm Path
Mar 11, 2025 · Artificial Intelligence

AI Agents Overview: Foundations, Core Components, and When to Use Them

This article provides a comprehensive overview of AI Agents, tracing their evolution from traditional chatbots to LLM‑driven agents, explaining core components such as perception, reasoning, action, knowledge bases, learning and communication interfaces, and discussing practical use cases, interaction cycles, and future prospects.

AI agentsRetrieval Augmented GenerationTool Use
0 likes · 15 min read
AI Agents Overview: Foundations, Core Components, and When to Use Them
Fun with Large Models
Fun with Large Models
Mar 2, 2025 · Artificial Intelligence

Why 2025 Is the Year of AI Agents: Definitions, Types, and Real‑World Examples

The article explains what AI Agents are, how they differ from single large‑model systems, outlines four agent architectures—Reflection, Tool Use, Planning, and Multi‑Agent—and cites concrete examples from Grammarly, VS Code plugins, Image Describer X, ChatDev, as well as initiatives by Tencent and Google, highlighting the 2025 AI Agent boom.

ChatDevMulti-AgentPlanning
0 likes · 9 min read
Why 2025 Is the Year of AI Agents: Definitions, Types, and Real‑World Examples
DevOps
DevOps
Dec 12, 2024 · Artificial Intelligence

The Future of Large Language Models: From Consumer Q&A to Agentic Workflows

Andrew Ng highlights that large language models are shifting from optimizing simple question‑answering for consumers to supporting complex agentic workflows, including tool usage, computer interaction, and multi‑agent collaboration, signaling a major evolution in AI capabilities.

AI agentsAI trendsAgentic AI
0 likes · 8 min read
The Future of Large Language Models: From Consumer Q&A to Agentic Workflows
Architect's Alchemy Furnace
Architect's Alchemy Furnace
Jul 18, 2024 · Artificial Intelligence

Why AI Agents Are the Next Frontier of Intelligent Systems

This article surveys the rapid rise of AI agents powered by large language models, explaining their core perception‑planning‑action loop, memory architectures, tool‑use mechanisms, self‑reflection techniques, and real‑world case studies while highlighting current challenges and future prospects for autonomous intelligent systems.

AI AgentLLMMemory
0 likes · 29 min read
Why AI Agents Are the Next Frontier of Intelligent Systems
Tencent Cloud Developer
Tencent Cloud Developer
May 28, 2024 · Artificial Intelligence

AI Agents: Concepts, Key Components, and Development Frameworks

AI agents extend large language models with planning, short‑term and long‑term memory, and tool‑use capabilities, enabling autonomous task decomposition, external API interaction, and persistent knowledge retrieval; frameworks such as MetaGPT, LangChain, and CrewAI simplify building agents like a researcher that gather information, browse web content, and generate reports, heralding broader AI‑enhanced productivity.

AI agentsFunction CallingPlanning
0 likes · 20 min read
AI Agents: Concepts, Key Components, and Development Frameworks
Architect
Architect
Nov 8, 2023 · Artificial Intelligence

AI Agents Unleashed: From Assistants API to Multi‑Agent Frameworks

The article dissects the rise of AI agents—from OpenAI's Assistants API and multimodal perception‑brain‑action pipelines to retrieval‑augmented generation, tool‑use strategies, single‑ and multi‑agent deployments, and emerging frameworks like AutoGen—while highlighting concrete examples, benchmark results, and current limitations.

AI agentsAssistants APIEmbodied AI
0 likes · 38 min read
AI Agents Unleashed: From Assistants API to Multi‑Agent Frameworks
Baidu Geek Talk
Baidu Geek Talk
May 8, 2023 · Artificial Intelligence

Augmented Language Models: Reasoning and External Tool Utilization

The survey shows that once language models exceed roughly ten billion parameters they spontaneously acquire two complementary abilities—step‑by‑step reasoning, often elicited by chain‑of‑thought prompts or scratch‑pad training, and the capacity to invoke external tools such as search engines, calculators, or robots—enabling them to retrieve up‑to‑date information, perform complex computations, and act in the world, thereby advancing toward general artificial intelligence.

AIPrompt engineeringTool Use
0 likes · 20 min read
Augmented Language Models: Reasoning and External Tool Utilization