How Amazon’s New Bedrock and SageMaker Features Cut AI Agent Costs and Speed Up Customization

The article explains how Amazon Bedrock’s reinforced fine‑tuning and SageMaker AI’s new serverless model‑customization dramatically lower the cost and latency of AI agents, delivering up to a 73% accuracy boost and shrinking model‑building cycles from months to days for enterprises of any size.

Amazon Cloud Developers
Amazon Cloud Developers
Amazon Cloud Developers
How Amazon’s New Bedrock and SageMaker Features Cut AI Agent Costs and Speed Up Customization

Efficiency is a core challenge when deploying AI agents at scale; large‑model inference is costly and resource‑intensive, yet agents spend most of their time on routine tasks such as schedule lookup and document search that do not require the most powerful models.

Amazon addresses this gap with two new customization capabilities announced at re:Invent 2025: the reinforced fine‑tuning feature in Amazon Bedrock and a serverless model‑customization capability in Amazon SageMaker AI. Both aim to let developers replace heavyweight foundation models with smaller, task‑specific models, reducing cost, improving response speed, and increasing accuracy.

Amazon Bedrock reinforced fine‑tuning

Bedrock’s reinforced fine‑tuning (RFT) automates the entire fine‑tuning pipeline, requiring no PhD‑level expertise. Developers select a base model, point it to logs or upload a dataset, choose a reward function (AI‑based, rule‑based, or a preset template), and the service runs the fine‑tuning end‑to‑end.

According to benchmark tests, fine‑tuned models achieve an average 66% accuracy improvement over the original base model, and in a Salesforce case study the accuracy gain reached 73%.

When the feature launched, it supported the Amazon Nova 2 Lite model, with plans to add more models later.

Amazon SageMaker AI serverless model‑customization

SageMaker AI now compresses the model‑customization cycle from months to days by offering two modes:

Agent‑guided mode: developers describe requirements in natural language; an AI agent orchestrates data synthesis, training, and evaluation.

Autonomous mode: developers retain full control, managing parameters and techniques without handling infrastructure.

The service supports a suite of advanced customization techniques, including Reinforcement Learning from AI Feedback (RLAIF), Reinforcement Learning with Verifiable Rewards, Supervised Fine‑Tuning, and Direct Preference Optimization. It also works with open‑weight models such as Llama, Qwen, DeepSeek, and GPT‑OSS.

Customer examples illustrate the impact: Salesforce agents saw both accuracy and efficiency rise sharply after applying Bedrock RFT; Collinear AI reported weeks saved in their model‑optimization pipeline by using SageMaker AI’s serverless workflow.

Quotes and resources

"The reinforced fine‑tuning feature lets us achieve up to a 73% accuracy boost without the overhead of massive models," says Phil Mui, Senior Vice President of Software Engineering at Salesforce Agentforce.

"With SageMaker AI’s serverless customization, we shortened our experiment cycle from weeks to days, allowing us to focus on high‑quality training data rather than infrastructure," adds Soumyadeep Bakshi, Co‑Founder of Collinear AI.

For more details, see the Amazon Bedrock customization page (https://aws.amazon.com/bedrock/customize/) and the Amazon SageMaker AI model‑customization page (https://aws.amazon.com/sagemaker/ai/model-customization).

Original Source

Signed-in readers can open the original source through BestHub's protected redirect.

Sign in to view source
Republication Notice

This article has been distilled and summarized from source material, then republished for learning and reference. If you believe it infringes your rights, please contactadmin@besthub.devand we will review it promptly.

model fine-tuningAI Agentreinforcement learningAmazon BedrockAmazon SageMakerserverless customization
Amazon Cloud Developers
Written by

Amazon Cloud Developers

Official technical community of Amazon Cloud. Shares practical AI/ML, big data, database, modern app development, IoT content, offers comprehensive learning resources, hosts regular developer events, and continuously empowers developers.

0 followers
Reader feedback

How this landed with the community

Sign in to like

Rate this article

Was this worth your time?

Sign in to rate
Discussion

0 Comments

Thoughtful readers leave field notes, pushback, and hard-won operational detail here.