PaperAgent
Author

PaperAgent

Daily updates, analyzing cutting-edge AI research papers

216
Articles
1
Likes
412
Views
0
Comments
Recent Articles

Latest from PaperAgent

100 recent articles max
PaperAgent
PaperAgent
Apr 24, 2026 · Artificial Intelligence

DeepSeek‑V4 Open‑Sources Its Million‑Token Architecture and Calls Out Claude Opus 4.6

DeepSeek‑V4’s open‑source report reveals a hybrid CSA/HCA attention design, manifold‑constrained residuals and the Muon optimizer that cut per‑token FLOPs to 27 % and KV‑Cache to 10 % at 1 M tokens, while benchmark results show it outperforms Claude Opus 4.6 on most tasks yet still lags on complex instruction following and multi‑turn dialogue.

AI ArchitectureClaude OpusDeepSeek V4
0 likes · 11 min read
DeepSeek‑V4 Open‑Sources Its Million‑Token Architecture and Calls Out Claude Opus 4.6
PaperAgent
PaperAgent
Apr 24, 2026 · Artificial Intelligence

Agent Skills Practical Guide: From Concept to Actionable AI Agents

The article explains Anthropic’s 2025 Agent Skills standard, how it enables AI to perform actions such as database queries and API calls, and provides a detailed guide covering its definition, modular design, industry adoption, and practical usage scenarios.

AI agentsAgent skillsAnthropic
0 likes · 3 min read
Agent Skills Practical Guide: From Concept to Actionable AI Agents
PaperAgent
PaperAgent
Apr 23, 2026 · Artificial Intelligence

Stop RAG, Navigate Enterprise Knowledge Directly with CORPUS2SKILL

The article critiques traditional RAG’s blind spots, introduces CORPUS2SKILL’s offline‑compile, online‑navigate two‑stage architecture that builds a hierarchical topic tree and progressive‑disclosure skill files, and shows through WixQA benchmarks that this approach outperforms dense retrieval and Agentic RAG on F1, factuality and recall while highlighting cost and hierarchy quality trade‑offs.

Hierarchical ClusteringPrompt EngineeringRAG
0 likes · 7 min read
Stop RAG, Navigate Enterprise Knowledge Directly with CORPUS2SKILL
PaperAgent
PaperAgent
Apr 22, 2026 · Artificial Intelligence

How SkillClaw Enables Collective Evolution of Agent Skills in Real-World Use

SkillClaw introduces a centralized evolution framework that transforms user interactions into structured evidence, allowing LLM agents to refine, create, or skip skills based on aggregated success and failure patterns, with nightly validation ensuring only proven improvements are deployed, resulting in consistent performance gains across diverse tasks.

AI workflowLLM AgentsSkill Evolution
0 likes · 13 min read
How SkillClaw Enables Collective Evolution of Agent Skills in Real-World Use
PaperAgent
PaperAgent
Apr 22, 2026 · Artificial Intelligence

Alibaba Unveils Four New Open‑Source Qwen3.6 Models: 27B Dense and 35B‑A3B MoE

Alibaba has added four new open‑source weight versions to its Qwen3.6 series, featuring the 27‑billion‑parameter dense multimodal model Qwen3.6‑27B and the 35‑billion‑parameter sparse expert model Qwen3.6‑35B‑A3B, both designed for stable, real‑world coding tasks and outperforming their Qwen3.5 predecessors.

AI agentsAlibabaDense Model
0 likes · 4 min read
Alibaba Unveils Four New Open‑Source Qwen3.6 Models: 27B Dense and 35B‑A3B MoE
PaperAgent
PaperAgent
Apr 21, 2026 · Artificial Intelligence

How to Understand Agents: From Resource‑Constrained Decisions to Contextual Cognition

This survey clarifies the essence of AI agents as resource‑limited sequential decision‑making and contextual‑cognition systems, introduces a formal definition, outlines a five‑stage evolution of large models, presents a four‑loop architecture, and illustrates the concepts with the OpenClaw agent case study.

AI SurveyContextual Cognitionagent architecture
0 likes · 11 min read
How to Understand Agents: From Resource‑Constrained Decisions to Contextual Cognition
PaperAgent
PaperAgent
Apr 21, 2026 · Artificial Intelligence

OpenMythos: Rebuilding Claude Mythos with Recursive Transformers and MoE

OpenMythos is an open‑source PyTorch reimplementation of Anthropic's Claude Mythos that uses a mixed‑expert routed recurrent Transformer, introduces Recursive Depth Transformers, Multi‑Latent Attention, and several stability mechanisms, and demonstrates parameter‑efficient scaling backed by empirical studies.

AI ArchitectureClaude MythosMoE
0 likes · 6 min read
OpenMythos: Rebuilding Claude Mythos with Recursive Transformers and MoE
PaperAgent
PaperAgent
Apr 20, 2026 · Artificial Intelligence

How 9 Parallel Claude Agents Surpassed Human Researchers in Weak‑to‑Strong Supervision

Anthropic’s Automated Weak‑to‑Strong Researcher (AAR) system uses nine parallel Claude Opus agents to replace human researchers, achieving a Performance Gap Recovered (PGR) of 0.97 in five days at a cost of about $18,000, demonstrating that AI‑driven automation can outperform humans on well‑defined alignment tasks.

AARAI alignmentClaude
0 likes · 9 min read
How 9 Parallel Claude Agents Surpassed Human Researchers in Weak‑to‑Strong Supervision
PaperAgent
PaperAgent
Apr 17, 2026 · Artificial Intelligence

How Automated Harnesses Are Revolutionizing LLM Agents: Memory and Action Constraints

This article reviews two recent papers that introduce automated harness methods—M⋆ for task‑specific memory programs and AutoHarness for code‑level action constraints—detailing their designs, reflective evolution processes, experimental evaluations across diverse benchmarks, and the broader shift toward harness‑centric LLM agent research.

AgentAutoHarnessLLM
0 likes · 10 min read
How Automated Harnesses Are Revolutionizing LLM Agents: Memory and Action Constraints
PaperAgent
PaperAgent
Apr 16, 2026 · Artificial Intelligence

Do LLMs Learn Hidden Preferences? Inside the Subliminal Learning Phenomenon

A recent Nature paper by Anthropic reveals that large language models can covertly transmit preferences and misaligned behaviors through unrelated data, demonstrating a "subliminal learning" effect that spans numbers, code, and chain‑of‑thought tasks and is driven by shared model initialization.

AnthropicLLMModel Alignment
0 likes · 10 min read
Do LLMs Learn Hidden Preferences? Inside the Subliminal Learning Phenomenon