PaperAgent
Author

Daily updates, analyzing cutting-edge AI research papers

170 articles · 0 likes · 18 views · 0 comments
Recent Articles

Apr 28, 2026 · Artificial Intelligence

MiniCPM‑o 4.5 Achieves Full‑Duplex Multimodal AI That DeepSeek V4 Missed

MiniCPM‑o 4.5 is billed as the world’s first end‑to‑end full‑duplex multimodal model, with 9 billion parameters and built on the Omni‑Flow framework. It runs on a single consumer‑grade GPU with 12 GB of memory, delivers benchmark results that match or surpass Gemini 2.5 Flash, and ships with open‑source demos, APIs, and a Windows/macOS installer.

AI · MiniCPM-o · benchmark
0 likes · 13 min read
Apr 27, 2026 · Artificial Intelligence

A Comprehensive Review of Modern LLM Agent Memory Frameworks

The article surveys recent LLM‑based agent memory research, presenting a unified framework that breaks memory systems into four components, detailing their design choices, experimental evaluation on LOCOMO and LONGMEMEVAL, key findings, and a new low‑token SOTA architecture.

Agent Memory · Evaluation · Information Retrieval
0 likes · 8 min read
Apr 26, 2026 · Artificial Intelligence

ICLR 2026 Outstanding Papers Reveal the Real Test for LLMs

The ICLR 2026 Outstanding Paper awards spotlight two studies—one proving Transformers are mathematically succinct and another showing that all major LLMs lose about 39% performance in multi‑turn conversations, exposing a reliability gap missed by single‑turn benchmarks.

AI benchmarks · ICLR 2026 · LLM evaluation
0 likes · 7 min read
Apr 25, 2026 · Artificial Intelligence

86K‑Star Repo Turns Karpathy’s Coding Wisdom into Practical AI‑Coding Rules

The article shares four concrete principles distilled from Andrej Karpathy’s experience—captured in the 86.1k‑star "andrej‑karpathy‑skills" repository—to help developers steer large language models toward reliable, concise, and goal‑driven code changes, with installation tips for Claude Code and other AI assistants.

AI coding · Claude Code · Karpathy
0 likes · 7 min read
Apr 24, 2026 · Artificial Intelligence

DeepSeek‑V4 Open‑Sources Its Million‑Token Architecture and Calls Out Claude Opus 4.6

DeepSeek‑V4’s open‑source report reveals a hybrid CSA/HCA attention design, manifold‑constrained residuals and the Muon optimizer that cut per‑token FLOPs to 27 % and KV‑Cache to 10 % at 1 M tokens, while benchmark results show it outperforms Claude Opus 4.6 on most tasks yet still lags on complex instruction following and multi‑turn dialogue.

AI Architecture · Claude Opus · DeepSeek V4
0 likes · 11 min read
Apr 24, 2026 · Artificial Intelligence

Agent Skills Practical Guide: From Concept to Actionable AI Agents

The article explains Anthropic’s 2025 Agent Skills standard, how it enables AI to perform actions such as database queries and API calls, and provides a detailed guide covering its definition, modular design, industry adoption, and practical usage scenarios.

AI agents · Agent Skills · Anthropic
0 likes · 3 min read
Apr 23, 2026 · Artificial Intelligence

Stop RAG, Navigate Enterprise Knowledge Directly with CORPUS2SKILL

The article critiques traditional RAG’s blind spots and introduces CORPUS2SKILL, a two‑stage architecture (offline compile, online navigate) that builds a hierarchical topic tree and progressive‑disclosure skill files. WixQA benchmarks show the approach outperforming dense retrieval and Agentic RAG on F1, factuality, and recall; the article also highlights trade‑offs in cost and hierarchy quality.

Agentic AI · Hierarchical Clustering · RAG
0 likes · 7 min read
Apr 22, 2026 · Artificial Intelligence

How SkillClaw Enables Collective Evolution of Agent Skills in Real-World Use

SkillClaw introduces a centralized evolution framework that transforms user interactions into structured evidence, allowing LLM agents to refine, create, or skip skills based on aggregated success and failure patterns, with nightly validation ensuring only proven improvements are deployed, resulting in consistent performance gains across diverse tasks.

AI workflow · LLM agents · Skill Evolution
0 likes · 13 min read
Apr 22, 2026 · Artificial Intelligence

Alibaba Unveils Four New Open‑Source Qwen3.6 Models: 27B Dense and 35B‑A3B MoE

Alibaba has added four new open‑weight releases to its Qwen3.6 series, featuring the 27‑billion‑parameter dense multimodal model Qwen3.6‑27B and the 35‑billion‑parameter sparse mixture‑of‑experts model Qwen3.6‑35B‑A3B. Both are designed for stable, real‑world coding tasks and outperform their Qwen3.5 predecessors.

AI agents · Alibaba · Dense Model
0 likes · 4 min read
Apr 21, 2026 · Artificial Intelligence

How to Understand Agents: From Resource‑Constrained Decisions to Contextual Cognition

This survey clarifies the essence of AI agents as resource‑limited sequential decision‑making and contextual‑cognition systems, introduces a formal definition, outlines a five‑stage evolution of large models, presents a four‑loop architecture, and illustrates the concepts with the OpenClaw agent case study.

AI Survey · Agent architecture · Agentic AI
0 likes · 11 min read