How to Cut Dynamic Workflows Costs from $62,000 to $129 with AI Agents
The article details how separating planning to Claude Opus 4.8 and execution to Kimi K2.6 Agent Swarm reduces monthly AI workflow expenses from $62,000 to $129, outlines routing logic, provides fifteen prompt templates, cost‑tracking scripts, and a reusable skill framework for large‑scale parallel agent orchestration.
A developer visualized 300 Kimi K2.6 agents running in parallel, coordinating 4,000 steps, with each moving line representing a real‑time signal, not a conceptual diagram.
Running all tasks on Claude Opus 4.8 alone cost $62,000 per month in API fees, because Opus, while excellent at reasoning, is not designed for bulk execution.
Anthropic’s Dynamic Workflows feature is meant to let Opus manage complex tasks and orchestrate hundreds of parallel sub‑agents, not act as every sub‑agent itself.
By assigning Opus 4.8 as the strategic planner and Kimi K2.6 Agent Swarm as the executor, output quality and scale remained unchanged while monthly cost dropped to $129, saving $54,200 per month or $652,000 annually.
The division of labor is simple:
Opus 4.8 acts as the brain: it handles planning, reasoning, judgment, and quality control, processing only tasks that require high‑level cognition.
Kimi K2.6 acts as the hands: it runs 300 specialized sub‑agents in parallel, coordinates up to 4,000 steps, produces real files, achieves a SWE‑Bench Pro score of 58.6 %, and costs only $0.6 per million tokens.
The core architecture separates the "brain" from the "hands".
Part 1: Five Prompts for Opus Orchestration
Opus 4.8 includes three features rarely used by teams: Dynamic Workflows for managing hundreds of parallel agents, Effort Controls for automatic compute allocation, and Extended Independence for longer autonomous runs. These features only require Opus to produce clear plans.
Prompt 1: Project Planning
I need to execute the following project using multiple AI agents:
Project: [Describe the desired final result]
Scale: [Number of tasks/files/projects]
Quality standard: [What a qualifying output looks like]
Constraints: [Deadline, format, restrictions]
Your job is to build the execution plan, not execute it.
Produce:
1. The breakdown of work into parallel streams
2. The dependencies between streams (what must happen before what)
3. The quality checkpoints where output must be reviewed before continuing
4. The exact brief for each execution stream (what each agent needs to know to do its part)
5. The final assembly instructions (how the outputs combine into the finished deliverable)
Do not write any of the actual content. Only the plan.Prompt 2: Dynamic Workflow Task Classification
I have the following tasks to complete this week:
[List all your tasks]
Sort them into two groups:
Group A — Opus handles (reasoning‑heavy, judgment calls, ambiguous, high‑stakes):
[Opus‑level criteria: architecture decisions, quality review, strategic analysis, any task where errors cause downstream problems]
Group B — Kimi Agent Swarm handles (execution‑heavy, batch processing, repetitive at scale, clear output spec):
[Kimi‑level criteria: research N sources, generate N files, transform N inputs, any task with a clear, parallelizable output]
For each Group B task, write the exact project brief that Kimi Agent Swarm will receive.
For each Group A task, work through it directly.Prompt 3: Define Quality Standards Before Execution
Kimi Agent Swarm is about to execute the following project:
[Describe the project]
Before it runs, I need a quality rubric.
Define:
1. What a passing output looks like for each deliverable type
2. What a failing output looks like (be specific — not "low quality" but "missing citations," "under 500 words," "wrong format")
3. The three most common failure modes for this type of task
4. How to catch each failure mode in the output before it reaches the final assembly stage
This rubric will be used to review Agent Swarm outputs before they are accepted.Prompt 4: Opus Reviews Kimi Output
Kimi Agent Swarm completed the following project:
Project brief: [Project requirements]
Output: [Paste or summarize output]
Review against the quality rubric:
[Insert quality rubric from Prompt 3]
For each deliverable:
- Pass or fail against each rubric criterion
- Specific issues that need to be fixed (quote the exact problem)
- Whether the issue requires a full re‑run or a targeted fix
Produce a revision brief: exactly what Kimi needs to fix, in the same project brief format, so it can be sent directly without re‑explanation.Prompt 5: Opus Assembles Final Result
Kimi Agent Swarm produced the following outputs:
[List or paste all outputs]
These are the components of: [Describe the final deliverable]
Your job is to assemble them into a coherent final output.
Rules:
- Do not rewrite what works. Connect and integrate.
- Identify and fix inconsistencies between sections
- Ensure the final output reads as one unified piece, not a compilation
- Flag anything that requires my review before finalizing
Final format: [Accurately describe what the assembled output should look like]Part 2: Five Prompts for Kimi Execution
Kimi K2.6 Agent Swarm can run 300 domain‑specific sub‑agents simultaneously, coordinate up to 4,000 steps, and directly generate PDFs, spreadsheets, websites, datasets, or code files.
Cost examples:
Research 50 competitor landing pages: outsourcing costs $25,000, Kimi costs only $4–6 in token fees.
Batch write 100 customized outreach emails: copywriting costs $2,000–$5,000, Kimi completes them in one run.
Technical audit of 30 codebases: consulting costs $15,000–$40,000, Kimi runs Opus‑defined rules and Opus reviews the summary.
Prompt 6: Convert Opus Plan into Kimi Brief
Take the following execution plan produced by Claude Opus:
[Paste Opus plan from Prompt 1]
Rewrite it as a Kimi Agent Swarm project brief.
Format:
Project: [One‑sentence summary]
Input: [Attached files, URLs, data]
Output: [File type/quantity/naming rule/format]
Phase 1: [First execution stream – what the agent does, what it produces]
Phase 2: [Second execution stream – dependencies]
Phase 3: [Assembly – how outputs are merged]
Quality note: [Minimum standards each output must meet]
The brief should be complete enough that Agent Swarm can execute without clarification.Prompt 7: Batch Execution with Output Specification
Project: [Describe batch task]
Input: [N projects – attached files or list]
Output spec: [Exact format, file type, naming rule, one output per input]
For each input:
- [Step 1]
- [Step 2]
- [Step 3]
- Output: [Exact format of deliverable]
Quality standard: [Minimum requirements – word count, citation format, structure, etc.]
Run all [N] inputs in parallel. Deliver as [file format] named [naming rule].Prompt 8: Research‑to‑Delivery in One Step
Research phase:
Search for [topic/competitor/object] across [N] sources.
For each source extract: [List specific data points]
Output: structured dataset with one row per source.
Analysis phase:
Using the research dataset, identify: [patterns / gaps / opportunities / rankings]
Flag any source where the data is unclear or contradictory.
Deliverable phase:
Produce [final output format] using the research and analysis.
Format: [Exact specification]
Length: [Word count or pages]
Citations: [Format]
Total output: one [file type], one supporting dataset.Prompt 9: Save Workflow as Reusable Skill
We just completed the following workflow:
Opus defined: [What Opus planned]
Kimi executed: [What Kimi ran]
Output: [What was produced]
Save this as a reusable Skill called [Name].
Document:
- The trigger (what kind of request activates this skill)
- The Opus orchestration prompt (what to send Opus to generate the brief)
- The Kimi execution brief template (what gets sent to Agent Swarm)
- The Opus review checklist (what Opus checks before accepting output)
- Expected inputs and outputs
Next time we run this workflow, start from the Skill instead of from scratch.Prompt 10: Workflow Cost Tracking
This workflow just ran:
Opus 4.8 tasks:
[List each Opus task, estimated token count]
Kimi K2.6 tasks:
[List each Kimi task, estimated token count]
Calculate:
- Opus cost: [tokens] × $0.015 per 1K tokens
- Kimi cost: [tokens] × $0.0006 per 1K tokens
- Total actual cost
- What this workflow would have cost running entirely on Opus
- Savings this run
Log this to WORKFLOW_COSTS.md with date, workflow name, and breakdown.Part 3: Routing Classification System
The dramatic cost drop from $62,000 to $129 is not simply swapping Kimi for Opus; it adds a routing layer that first classifies each incoming task as judgment‑heavy or execution‑heavy and routes it to the appropriate model.
Routing rule: if a clear, machine‑scorable rule can be written, assign the task to Kimi; otherwise, assign it to Opus.
Prompt 11: Weekly Workflow Audit
Review the following workflows we ran this week:
[List weekly workflows]
For each workflow, classify every task:
- Opus‑only: judgment‑heavy, no clear rubric
- Kimi‑only: execution‑heavy, clear output spec, parallel‑friendly
- Hybrid: Opus plans + Kimi executes + Opus reviews
For each Hybrid workflow, write the handoff points:
- What Opus produces before handoff (the brief)
- What Kimi receives (the execution spec)
- What Opus reviews after (the quality rubric)
Flag any task currently on Opus that should move to Kimi.Prompt 12: Generate Routing Decision Tree
I need a routing framework for our recurring workflow types.
For each workflow type below, define:
1. Which model handles each stage (Opus / Kimi / both)
2. The trigger that routes it to the right model
3. The handoff format between models
4. The cost estimate per run
Workflow types:
- Content research + production
- Competitive analysis
- Personalized outreach email generation
- Code review + refactoring
Output: a routing table I can use to classify any incoming request in 30 seconds.Prompt 13: Monthly Cost Optimization Review
Last month we ran the following volume through our AI workflow:
[Describe volume – e.g., 200 research briefs, 500 outreach emails, 30 code audits]
Current setup: [Describe current Opus/Kimi allocation]
Current monthly cost: [Amount]
Analyze:
1. Which workflows are over‑allocated to Opus that Kimi could handle
2. Which workflows are on Kimi that Opus should handle for quality
3. What the optimal split would look like
4. Projected cost at the optimal split
Produce a routing change recommendation with estimated monthly savings.Prompt 14: Handoff Template
Create a standard handoff template for the following workflow:
When Opus completes [task type], it produces [output format].
Kimi Agent Swarm receives this and executes [execution task].
Opus then reviews [review standards].
Write:
1. The Opus output format that serves as the Kimi brief (structured, unambiguous)
2. The Kimi execution brief template (slots for Opus to fill in)
3. The Opus review checklist (five criteria Kimi's output is graded against)
4. The revision loop: if Kimi output fails, what gets sent back and in what format
This template becomes the permanent interface between the two models for this workflow type.Prompt 15: ROI Report for Stakeholders
Produce a monthly ROI report for our Opus 4.8 + Kimi K2.6 workflow setup.
Include:
1. Total workflows run this month: [N]
2. Volume processed: [N items / files / tasks]
3. Cost breakdown:
- Opus 4.8 spend: $[amount]
- Kimi K2.6 spend: $[amount]
- Total: $[amount]
4. Equivalent cost running all workflows on Opus 4.8 alone: $[calculated amount]
5. Monthly saving: $[amount]
6. Annualized saving: $[amount × 12]
7. Quality incidents (workflows that failed review and required re‑run): [N]
8. Quality incident rate: [%]
Format as an executive summary. One page. Numbers first.Conclusion
After May 28 2026, the logic solidified: Anthropic’s Opus 4.8 Dynamic Workflows is designed to orchestrate hundreds of parallel agents, not to act as every agent; Kimi K2.6 Agent Swarm performs large‑scale parallel execution at $0.6 per million tokens.
They are not competitors but complementary layers of the same system. Teams that adopted this split achieved the original $62,000‑per‑month performance for only $129, with unchanged quality and speed, proving that cost control, not task difficulty, is the decisive advantage when AI can handle any request.
Signed-in readers can open the original source through BestHub's protected redirect.
This article has been distilled and summarized from source material, then republished for learning and reference. If you believe it infringes your rights, please contactand we will review it promptly.
AI Engineering
Focused on cutting‑edge product and technology information and practical experience sharing in the AI field (large models, MLOps/LLMOps, AI application development, AI infrastructure).
How this landed with the community
Was this worth your time?
0 Comments
Thoughtful readers leave field notes, pushback, and hard-won operational detail here.
