Blog

Amazon SageMaker AI now supports optimized generative AI inference recommendations

Amazon SageMaker AI now supports optimized generative AI inference recommendations

Organizations are racing to deploy generative AI models into production to power intelligent assistants, code generation tools, content engines, and customer-facing applications. But deploying these models to production remains a weeks-long process of navigating GPU configurations, optimization techniques, and manual benchmarking, delaying the value these models are built to deliver. Today, Amazon SageMaker AI  supports …

Amazon SageMaker AI now supports optimized generative AI inference recommendations Read More »

Get to your first working agent in minutes: Announcing new features in Amazon Bedrock AgentCore

Get to your first working agent in minutes: Announcing new features in Amazon Bedrock AgentCore

Getting an agent running has always meant solving a long list of infrastructure problems before you can test whether the agent itself is any good. You wire up frameworks, storage, authentication, and deployment pipelines, and by the time your agent handles its first real task, you’ve spent days on infrastructure instead of agent logic. We …

Get to your first working agent in minutes: Announcing new features in Amazon Bedrock AgentCore Read More »

Company-wise memory in Amazon Bedrock with Amazon Neptune and Mem0

Company-wise memory in Amazon Bedrock with Amazon Neptune and Mem0

This post is cowritten by Shawn Tsai from TrendMicro. Delivering relevant, context-aware responses is important for customer satisfaction. For enterprise-grade AI chatbots, understanding not only the current query but also the organizational context behind it is key. Company-wise memory in Amazon Bedrock, powered by Amazon Neptune and Mem0, provides AI agents with persistent, company-specific context—enabling …

Company-wise memory in Amazon Bedrock with Amazon Neptune and Mem0 Read More »

From developer desks to the whole organization: Running Claude Cowork in Amazon Bedrock

From developer desks to the whole organization: Running Claude Cowork in Amazon Bedrock

Today, we’re excited to announce Claude Cowork in Amazon Bedrock. You can now run Cowork and Claude Code Desktop through Amazon Bedrock, directly or using an LLM gateway. From startups to global enterprises across every industry, organizations build with Claude Code in Amazon Bedrock to boost developer productivity and accelerate delivery. With Amazon Bedrock you …

From developer desks to the whole organization: Running Claude Cowork in Amazon Bedrock Read More »

End-to-end lineage with DVC and Amazon SageMaker AI MLflow apps

End-to-end lineage with DVC and Amazon SageMaker AI MLflow apps

Production machine learning (ML) teams struggle to trace the full lineage of a model through the data and the code that trained it, the exact dataset version it consumed, and the experiment metrics that justified its deployment. Without this traceability, questions like “which data trained the model currently in production?” or “can we reproduce the …

End-to-end lineage with DVC and Amazon SageMaker AI MLflow apps Read More »

Accelerate Generative AI Inference on Amazon SageMaker AI with G7e Instances

Accelerate Generative AI Inference on Amazon SageMaker AI with G7e Instances

As the demand for generative AI continues to grow, developers and enterprises seek more flexible, cost-effective, and powerful accelerators to meet their needs. Today, we are thrilled to announce the availability of G7e instances powered by NVIDIA RTX PRO 6000 Blackwell Server Edition GPUs on Amazon SageMaker AI. You can provision nodes with 1, 2, …

Accelerate Generative AI Inference on Amazon SageMaker AI with G7e Instances Read More »

ToolSimulator: scalable tool testing for AI agents

ToolSimulator: scalable tool testing for AI agents

You can use ToolSimulator, an LLM-powered tool simulation framework within Strands Evals, to thoroughly and safely test AI agents that rely on external tools, at scale. Instead of risking live API calls that expose personally identifiable information (PII), trigger unintended actions, or settling for static mocks that break with multi-turn workflows, you can use ToolSimulator’s large …

ToolSimulator: scalable tool testing for AI agents Read More »

Omnichannel ordering with Amazon Bedrock AgentCore and Amazon Nova 2 Sonic

Omnichannel ordering with Amazon Bedrock AgentCore and Amazon Nova 2 Sonic

Introduction Building a voice-enabled ordering system that works across mobile apps, websites, and voice interfaces (an omnichannel approach) presents real challenges. You need to process bidirectional audio streams, maintain conversation context across multiple turns, integrate backend services without tight coupling, and scale to handle peak traffic. In this post, we’ll show you how to build …

Omnichannel ordering with Amazon Bedrock AgentCore and Amazon Nova 2 Sonic Read More »

Sign In

Register

Reset Password

Please enter your username or email address, you will receive a link to create a new password via email.

Scroll to Top