How Agent Skills and MCP Are Redefining Enterprise AI in 2025
The report analyzes the rapid emergence of Claude Skills and OpenAI's Agentic Commerce Protocol, detailing their technical architectures, benchmark performance, cross‑platform compatibility, enterprise adoption metrics, security challenges, and strategic implications for businesses entering the agentic AI era.
Chapter 1: Claude Skills Architecture and Open Standard Evolution
Anthropic introduced the concept of Agent Skills in October 2025 and released a full open standard on December 18 via agentskills.io. The move signals a shift from pure model‑centric competition to infrastructure building.
1.1 Technical Dissection and Progressive Disclosure
Claude Skills adopt a “Progressive Disclosure” philosophy to mitigate LLM context‑window limits. A compliant Skill is a modular directory containing mandatory metadata, a body with step‑by‑step instructions, optional executable scripts, reference documents, and static assets. Only the minimal metadata (≈100 tokens) is loaded at startup; full instructions and scripts are fetched on demand, preventing context exhaustion even when thousands of Skills are deployed.
Metadata file (mandatory) : SKILL.md with YAML front‑matter defining name and description.
Instruction body (mandatory) : SKILL.md (Body) providing detailed steps, examples, and edge‑case handling.
Executable scripts (optional) : /scripts containing deterministic Python, Bash, or JavaScript code.
Reference documents (optional) : /references such as REFERENCE.md or FORMS.md.
Static assets (optional) : /assets for templates, charts, or data files.
1.2 Deterministic Logic Integration
Unlike prompt‑based plugins, the December 2025 Skills standard emphasizes deterministic logic. Embedded Python scripts can handle high‑compute or high‑precision tasks—e.g., PDF form filling or large‑scale data sorting—without loading the raw data into the LLM, dramatically improving reliability and efficiency.
1.3 Enterprise Management and Partner Ecosystem
Anthropic also launched a centralized Skill‑management console for Team and Enterprise plans, enabling bulk distribution and updates. A partner catalog featuring Notion, Canva, Figma, Atlassian, Stripe, and Zapier provides pre‑built Skills that can be invoked directly from Claude, such as applying brand guidelines in Figma or creating Jira tickets.
Chapter 2: OpenAI’s Agentic Ecosystem
In parallel, OpenAI reinforced its developer and enterprise position with a series of December 2025 releases focused on long‑horizon task handling and autonomous commerce.
2.1 GPT‑5.2‑Codex Performance Leap
On December 12, OpenAI launched GPT‑5.2‑Codex and its “Max” variant. Benchmarks such as SWE‑Bench Pro and Terminal‑Bench 2.0 show breakthroughs in long‑horizon work and context compression, allowing agents to refactor tens of thousands of lines of code without frequent human intervention.
2.2 Agentic Commerce Protocol (ACP)
In collaboration with Stripe, OpenAI open‑sourced the Agentic Commerce Protocol (ACP) under Apache 2.0. ACP defines a universal transaction language for agents, covering three modules:
Product Feed : Structured product catalogs (JSON/CSV) with signature‑based secure endpoints.
Agentic Checkout : Mapping of checkout flows to API actions, adhering to a minimal data‑exposure principle.
Delegated Payment : Generation of shared payment tokens so agents never see raw credit‑card data.
ACP enables a single user authorization to power purchases across any merchant supporting the protocol (e.g., Etsy, Shopify) while preserving the merchant‑of‑record status.
2.3 AGENTS.md Specification
To address multi‑agent context inconsistency, OpenAI contributed the lightweight AGENTS.md markdown spec, which defines project‑level agent guidelines, build steps, and test requirements. By December 2025, over 60 000 open‑source projects adopted it, making it a core repository file alongside README.md.
Chapter 3: Global Standardization Organizations
December 9, 2025 saw the Linux Foundation launch the Autonomous Agent AI Foundation (AAIF), a neutral body ensuring transparent, collaborative governance of agent infrastructure. Founding platinum members include Amazon (AWS), Anthropic, Block, Bloomberg, Cloudflare, Google, Microsoft, and OpenAI.
Model Context Protocol (MCP) : Originating from Anthropic, now the de‑facto standard for connecting models to external data and tools.
goose : Block’s open‑source, local‑first agent framework built on MCP.
AGENTS.md : OpenAI’s project‑level agent specification.
AAIF’s neutral governance requires any major protocol change to undergo public technical‑committee review, mitigating vendor lock‑in risk.
3.2 Cross‑Platform Compatibility of Skills
Within the final two weeks of December, OpenAI silently added Skills support to its Codex documentation, integrating the standard into the ChatGPT web UI and Codex CLI. Major development tools—Cursor, OpenCode, VS Code, GitHub Copilot—announced compatibility with agentskills.io, allowing a single Skill (e.g., an OWASP code‑review Skill) to run across Claude Code and Copilot without rewrites.
Chapter 4: Enterprise Adoption and Productivity Evidence
OpenAI’s “2025 Enterprise AI State Report” shows ChatGPT Enterprise seats grew nine‑fold in the past year, surpassing 7 million seats. Productivity gains reported per function include:
IT Operations : Faster issue resolution via automated fault diagnosis and ticket routing.
Marketing : Accelerated campaign execution through multi‑channel content generation and data feedback loops.
Human Resources : Improved employee engagement via automated onboarding and Q&A.
Software Engineering : Increased code delivery speed as agents handle refactoring, test generation, and CI/CD fixes.
Non‑technical staff also began sending code‑related messages, indicating agents lower the technical barrier for data analysis and lightweight automation.
4.2 Autonomy Gains from Anthropic Data
Anthropic’s internal R&D survey reveals Claude Code’s autonomous tool‑call count rose from 9.8 to 21.2 calls per task over six months—a 115% increase—while human interaction rounds per task dropped dramatically, shifting humans from “driver” to “supervisor.”
Chapter 5: MCP vs. Skills Collaboration
The community agrees that MCP provides the “connectivity” layer (how models reach data sources), while Skills supply the “procedural knowledge” layer (how models execute tasks). A side‑by‑side comparison:
Core Value : MCP = connectivity; Skills = procedural knowledge.
Metaphor : MCP is like a USB‑C port; Skills resemble an employee onboarding handbook.
Resource Load : MCP carries heavier API definitions and architecture diagrams; Skills are lightweight, metadata‑driven.
Typical Use : MCP connects to Google Drive, Slack, SQL; Skills handle PDF form filling, brand‑style validation, complex data logic.
5.2 Benchmark and Token Cost Optimization
CData Connect AI performed a head‑to‑head test. Pure MCP data discovery consumed 2,069 tokens for a cross‑system query, whereas combining MCP discovery with a pre‑built Skill reduced token usage to 871 tokens, demonstrating substantial cost savings.
Chapter 6: Security Governance, Identity Management, and Production‑Risk
With agents gaining “action rights,” security discussions moved from theory to practice. AAIF’s research team identified novel attack vectors such as tool poisoning, silent definition tampering, and tool shadowing, where malicious Skills embed backdoors or masquerade as legitimate API calls.
6.2 OBO Flow and Delegated Authentication
Solo.io’s Agentgateway (now a Linux Foundation project) introduces an OAuth 2.0 “On‑Behalf‑Of” (OBO) flow, enforcing:
Agents inherit the identity of the human user rather than possessing privileged accounts.
All actions must stay within a predefined “blast radius.”
High‑risk operations (e.g., large transfers or code deployments) trigger a step‑up authentication requiring secondary human confirmation.
Chapter 7: Key Industry Progress (Last Two Weeks of December 2025)
7.1 Cloud Providers and Infrastructure
Google Cloud : Launched managed MCP services covering Maps, BigQuery, and GKE, with Cloud API Registry converting existing APIs into MCP‑compatible endpoints.
Amazon (AWS) : Updated Bedrock AgentCore with isolated sandbox environments and cross‑cloud policy‑execution gateways.
NVIDIA : Reported soaring demand for Blackwell chips to support long‑term reasoning agents, and is optimizing distributed inference to cut latency.
7.2 Vertical Breakthroughs
Healthcare : Microsoft released an agent‑driven cancer‑map AI that autonomously pulls pathology data across labs and generates multi‑dimensional diagnostic graphs.
Finance : Accenture announced a “Flagship AI Program” leveraging OpenAI’s autonomous commerce capabilities to fully automate month‑end financial close.
Chapter 8: Conclusions and 2026 Outlook
Standardization ends fragmentation: Anthropic’s Skills, OpenAI’s silent follow‑up, and AAIF’s neutral governance converge on a common protocol, lowering migration costs.
Commerce enters an autonomous era: ACP turns AI assistants from recommendation engines into fully capable personal shopping managers.
Development shifts to “Vibe Coding” and Skill composition: Developers now prefer assembling modular Skills over writing extensive system prompts.
8.2 Strategic Recommendations for 2026
Enterprises should immediately codify private workflows, brand guidelines, and compliance rules into the standard SKILL.md format, expose them via MCP, and build an identity‑centric security framework as a prerequisite for production deployment.
The agentic AI wave is reshaping the global digital foundation at unprecedented speed; open standards and interoperable protocols will ensure this transformation proceeds within an auditable, open, and secure ecosystem.
BirdNest Tech Talk
Author of the rpcx microservice framework, original book author, and chair of Baidu's Go CMC committee.
How this landed with the community
Was this worth your time?
0 Comments
Thoughtful readers leave field notes, pushback, and hard-won operational detail here.
