Author

PaperAgent

Daily updates, analyzing cutting-edge AI research papers


Latest from PaperAgent


PaperAgent

Jun 5, 2026 · Artificial Intelligence

The Most Systematic 102‑Page Review of Agent Harnesses

This article provides a comprehensive overview of the "Code as Agent Harness" paradigm, detailing its three‑layer architecture, the roles of code in reasoning, acting, and environment modeling, the mechanisms that enable reliable long‑term execution, and how multi‑agent systems scale the harness through shared code and feedback loops.

Agent Harness · Code as Agent · LLM

0 likes · 10 min read


PaperAgent

Jun 4, 2026 · Artificial Intelligence

SkillOpt: Enabling Self‑Evolving Agent Skills via Text‑Space Optimization

SkillOpt reframes LLM agent skills as trainable external state, applying a deep‑learning‑style optimizer to systematically improve skill documents, and demonstrates across six benchmarks, seven models, and three execution modes that this approach yields consistent, large gains and robust transferability.

Agent skills · Self‑Evolving Agents · SkillOpt

0 likes · 12 min read


PaperAgent

Jun 4, 2026 · Artificial Intelligence

127 Curated Large‑Model Papers Across 17 Research Directions – From CVPR to Nature

This free collection gathers 127 top‑conference papers covering 17 large‑model research directions, from perception and decision to safety, and provides PDFs, GitHub links, and a web interface to help AI engineers, researchers, and students stay up to date.

AI research · Large Models · Synthetic Data

0 likes · 5 min read


PaperAgent

Jun 1, 2026 · Artificial Intelligence

Bengio’s New Parallel Multi‑Trajectory Reasoning Paradigm

The article introduces GRAM (Generative Recursive Reasoning Models), a parallel multi‑trajectory inference framework that replaces deterministic single‑track recursion with stochastic latent transitions and width scaling, achieving state‑of‑the‑art results on Sudoku‑Extreme, ARC‑AGI, N‑Queens and Graph Coloring benchmarks.

GRAM · Generative Recursive Reasoning · Yoshua Bengio

0 likes · 9 min read


PaperAgent

May 30, 2026 · Artificial Intelligence

DeepSeek Researcher Co‑authors Two New Papers on Autonomous AI Research and Continual Learning

The article summarizes two recent DeepSeek papers—one presenting an L1–L5 taxonomy and four architecture patterns for autonomous research agents, the other proposing a three‑dimensional taxonomy for continual learning, detailing method families, a self‑improvement phase diagram, experimental comparisons, an impossibility theorem, and the production statistics of the Deli AutoResearch framework.

AI research · Autonomous Agents · LLM taxonomy

0 likes · 12 min read


PaperAgent

May 29, 2026 · Artificial Intelligence

Why Claude Opus 4.8’s Real Breakthrough Is Its Dynamic Workflows

Anthropic’s Claude Opus 4.8 upgrades agentic reliability and honesty, while its new Dynamic Workflows turn hundreds of agents into a hierarchical, parallel, verifiable pipeline that can orchestrate large‑scale code migrations such as React‑to‑Solid.js or a 750k‑line Rust rewrite in days.

AI orchestration · Claude · Code Migration

0 likes · 7 min read


PaperAgent

May 28, 2026 · Artificial Intelligence

AgenticRAG Delivers 5.9× Recall Boost in Enterprise Retrieval – Real‑World Pre‑Production Results

The article analyzes Microsoft’s AgenticRAG, a tool‑based RAG framework that lets LLMs control retrieval, showing up to a 5.9× recall improvement over standard methods, reduced need for fine‑tuning, and practical design insights from pre‑production deployment.

AgenticRAG · Claude · GPT-5-mini

0 likes · 12 min read


PaperAgent

May 28, 2026 · Artificial Intelligence

How a Desktop AI Agent Turns My PC into a One‑Person Capability Hub

The author reviews SenseTime's Office Raccoon desktop 2.0 agent (商汤办公小浣熊桌面端 2.0), showing how it moves beyond chat‑only assistants to directly manipulate local files, browsers, and enterprise tools, automating a weekly competitive‑analysis report and embodying the OPC (One Person Capability) concept.

AI Agent · OPC · Productivity

0 likes · 10 min read


PaperAgent

May 26, 2026 · Artificial Intelligence

Why External Retrieval in RAG Is Redundant: Insights from NVIDIA’s INTRA Paper

The INTRA paper shows that using a decoder’s cross‑attention as an internal retrieval mechanism eliminates the need for a separate retriever, achieving state‑of‑the‑art multihop QA performance with only 164 K trainable parameters and shared pre‑encoded representations.

INTRA · RAG · Retrieval

0 likes · 8 min read


PaperAgent

May 25, 2026 · Artificial Intelligence

DeepSeek’s Harness: How Agent Harness Engineering Is Shaping the Next LLM Agent Era

The article surveys DeepSeek’s Harness initiative, presenting the Binding‑Constraint Thesis, three‑stage evolution from prompt to harness engineering, the ETCLOVG seven‑layer architecture, and concrete benchmark evidence that harness‑only improvements far outweigh model upgrades, while detailing security, observability, and governance considerations for reliable LLM agents.

AI Architecture · Agent Evaluation · Agent Harness Engineering

0 likes · 12 min read
