Artificial Intelligence 12 min read

How Google’s Open‑Source Agent Skills Turn AI Coding from Prototype to Production

Agent Skills, an open‑source project by Google engineer Addy Osmani, breaks the software development lifecycle into six stages with 24 structured skills, anti‑rationalization checks, doubt‑driven development, and context engineering, enabling AI‑generated code to meet production‑grade quality standards.

Old Zhang's AI Learning

Jun 15, 2026

How Google’s Open‑Source Agent Skills Turn AI Coding from Prototype to Production

Overview

Agent Skills is an open‑source framework created by Google Chrome engineering manager Addy Osmani. It encodes senior engineers’ disciplined workflow into a set of 24 structured Skills that cover the entire software development lifecycle from requirement definition to production deployment.

Lifecycle Stages

DEFINE   PLAN   BUILD   VERIFY   REVIEW   SHIP
┌──────┐ ┌──────┐ ┌──────┐ ┌──────┐ ┌──────┐ ┌──────┐
│ Idea │─▶│ Spec │─▶│ Code │─▶│ Test │─▶│ QA │─▶│ Go │
│Refine│ │ PRD │ │ Impl │ │Debug│ │Gate │ │Live│
└──────┘ └──────┘ └──────┘ └──────┘ └──────┘ └──────┘
/spec   /plan   /build   /test   /review   /ship

Each stage is implemented as a separate Skill with its own command (e.g., /spec, /plan, /build, /test, /review, /ship).

Core Features

24 Structured Skills : Every Skill is a complete workflow with steps, checkpoints, and exit criteria, covering everything from interview‑style requirement gathering to shipping and launch.

Anti‑rationalization : Each Skill contains a “excuse vs. rebuttal” table that blocks the Agent from skipping essential steps such as testing or security review.

7 Slash Commands : One‑click triggers for each stage (e.g., /spec writes specifications, /plan breaks tasks, /build implements code, /test runs tests, /review performs code review, /ship deploys).

4 Expert Personas : Staff engineer for code review, QA expert for testing strategy, security engineer for vulnerability scanning, performance engineer for Web Vitals audit.

Broad Tool Compatibility : Works with Claude Code, Cursor, Gemini CLI, Windsurf, GitHub Copilot, Kiro IDE, Antigravity CLI, and any agent that can read Markdown.

Why It Matters

The project draws directly from Google’s engineering culture, referencing concepts from Software Engineering at Google such as Hyrum’s Law, the Beyoncé Rule, Chesterton’s Fence, trunk‑based development, and shift‑left testing. These principles are embedded in the Skills rather than being abstract guidelines.

Two particularly novel ideas are highlighted:

“Doubt‑Driven Development” : A five‑step cycle (CLAIM → EXTRACT → DOUBT → RECONCILE → STOP) that forces an adversarial reviewer to challenge confident AI answers, especially for high‑risk decisions.

Context Engineering : A five‑layer context hierarchy (global rules, spec/architecture docs, relevant source files, error/test output, conversation history) that ensures the Agent receives the right amount of information.

Skill Structure

SKILL.md
├── Frontmatter   → name + description (trigger conditions)
├── Overview      → what the Skill does
├── When to Use   → activation criteria
├── Core Process  → step‑by‑step execution (essential!)
├── Rationalizations → excuse vs. rebuttal table
├── Red Flags     → signs of misuse
└── Verification  → evidence‑based exit criteria

The design follows four key decisions: “Process not prose”, “Anti‑rationalization”, “Verification non‑negotiable”, and “Progressive disclosure”.

Installation

For Claude Code users (recommended):

/plugin marketplace add addyosmani/agent-skills
/plugin install agent-skills@addy-agent-skills

Local development:

git clone https://github.com/addyosmani/agent-skills.git
claude --plugin-dir /path/to/agent-skills

Gemini CLI:

gemini skills install https://github.com/addyosmani/agent-skills.git --path skills

Other tools (Cursor, Windsurf, Copilot, Kiro, Codex) have analogous integration steps.

One‑Click Automation

The /build auto command generates a full plan from a completed spec and executes each task automatically, preserving TDD checkpoints and pausing on high‑risk steps.

Author’s Assessment

Pros

Most systematic AI‑coding engineering guideline available.

Backed by Addy Osmani’s decade‑long experience at Google.

Anti‑rationalization table injects critical thinking into AI.

End‑to‑end coverage from requirements to launch.

Pure Markdown implementation, zero runtime dependencies.

Limitations

Loading all 24 Skills at once consumes a large context window; selective loading is recommended.

Primarily targets web development (frontend + Node.js backend); other stacks may need extensions.

Documentation is mainly in English, requiring proficiency for non‑English speakers.

For rapid prototyping the full workflow may feel heavyweight.

Intended Audience

Developers using AI agents for production‑grade projects.

Technical leads seeking to standardize AI‑assisted coding practices.

Individual developers wanting to move from “it runs” to “production ready”.

Engineers interested in designing their own AI‑coding Skills.

Conclusion

Agent Skills resolves the core tension in AI‑assisted development: speed versus reliability. By codifying senior engineers’ discipline into structured workflows, it lets AI retain its speed advantage while enforcing the critical steps that make code production‑ready. For serious AI‑driven development, loading at least the three core Skills— spec-driven-development, test-driven-development, and code-review-and-quality —covers the most important quality gaps.

Original Source

Signed-in readers can open the original source through BestHub's protected redirect.

Republication Notice

This article has been distilled and summarized from source material, then republished for learning and reference. If you believe it infringes your rights, please contactand we will review it promptly.

automation prompt engineering Software Engineering code quality AI Programming Agent Skills Addy Osmani

Written by

Old Zhang's AI Learning

AI practitioner specializing in large-model evaluation and on-premise deployment, agents, AI programming, Vibe Coding, general AI, and broader tech trends, with daily original technical articles.

0 followers

Reader feedback

How this landed with the community

Rate this article

Was this worth your time?

Discussion

0 Comments

Thoughtful readers leave field notes, pushback, and hard-won operational detail here.