Blog

Secure AI agents with Policy in Amazon Bedrock AgentCore

Secure AI agents with Policy in Amazon Bedrock AgentCore

Deploying AI agents safely in regulated industries is challenging. Without proper boundaries, agents that access sensitive data or execute transactions can pose significant security risks. Unlike traditional software, an AI agent chooses actions to achieve a goal by invoking tools, accessing data, and adapting its reasoning using data from its environment and users. This autonomy …

Secure AI agents with Policy in Amazon Bedrock AgentCore Read More »

Multimodal embeddings at scale: AI data lake for media and entertainment workloads

Multimodal embeddings at scale: AI data lake for media and entertainment workloads

This post shows you how to build a scalable multimodal video search system that enables natural language search across large video datasets using Amazon Nova models and Amazon OpenSearch Service. You will learn how to move beyond manual tagging and keyword-based searches to enable semantic search that captures the full richness of video content. We …

Multimodal embeddings at scale: AI data lake for media and entertainment workloads Read More »

Fine-tuning NVIDIA Nemotron Speech ASR on Amazon EC2 for domain adaptation

Fine-tuning NVIDIA Nemotron Speech ASR on Amazon EC2 for domain adaptation

This post is a collaboration between AWS, NVIDIA and Heidi.  Automatic speech recognition (ASR), often called speech-to-text (STT) is becoming increasingly critical across industries like healthcare, customer service, and media production. While pre-trained models offer strong capabilities for general speech, fine-tuning for specific domains and use cases can enhance accuracy and performance. In this post, …

Fine-tuning NVIDIA Nemotron Speech ASR on Amazon EC2 for domain adaptation Read More »

Accelerate custom LLM deployment: Fine-tune with Oumi and deploy to Amazon Bedrock

Accelerate custom LLM deployment: Fine-tune with Oumi and deploy to Amazon Bedrock

This post is cowritten by David Stewart and Matthew Persons from Oumi. Fine-tuning open source large language models (LLMs) often stalls between experimentation and production. Training configurations, artifact management, and scalable deployment each require different tools, creating friction when moving from rapid experimentation to secure, enterprise-grade environments. In this post, we show how to fine-tune …

Accelerate custom LLM deployment: Fine-tune with Oumi and deploy to Amazon Bedrock Read More »

Run NVIDIA Nemotron 3 Nano as a fully managed serverless model on Amazon Bedrock

Run NVIDIA Nemotron 3 Nano as a fully managed serverless model on Amazon Bedrock

This post is cowritten with Abdullahi Olaoye, Curtice Lockhart, Nirmal Kumar Juluru from NVIDIA. We are excited to announce that NVIDIA’s Nemotron 3 Nano is now available as a fully managed and serverless model in Amazon Bedrock. This follows our earlier announcement at AWS re:Invent supporting NVIDIA Nemotron 2 Nano 9B and NVIDIA Nemotron 2 …

Run NVIDIA Nemotron 3 Nano as a fully managed serverless model on Amazon Bedrock Read More »

Access Anthropic Claude models in India on Amazon Bedrock with Global cross-Region inference

Access Anthropic Claude models in India on Amazon Bedrock with Global cross-Region inference

The adoption and implementation of generative AI inference has increased with organizations building more operational workloads that use AI capabilities in production at scale. To help customers achieve the scale of their generative AI applications, Amazon Bedrock offers cross-Region inference (CRIS) profiles. CRIS is a powerful feature that organizations can use to seamlessly distribute inference …

Access Anthropic Claude models in India on Amazon Bedrock with Global cross-Region inference Read More »

Sign In

Register

Reset Password

Please enter your username or email address, you will receive a link to create a new password via email.

Scroll to Top