Blog_dumb

Apply fine-grained access control with Bedrock AgentCore Gateway interceptors

Apply fine-grained access control with Bedrock AgentCore Gateway interceptors

As enterprises rapidly adopt AI agents to automate workflows and enhance productivity, they face a critical scaling challenge: managing secure access to thousands of tools across their organization. Modern AI deployments no longer involve a handful of agents calling a few APIs—instead, enterprises are building unified AI platforms where hundreds of agents, consumer AI applications, …

Apply fine-grained access control with Bedrock AgentCore Gateway interceptors Read More »

How Condé Nast accelerated contract processing and rights analysis with Amazon Bedrock

How Condé Nast accelerated contract processing and rights analysis with Amazon Bedrock

This post is co-written with Bob Boiko, Christopher Donnellan, and Sarat Tatavarthi from Condé Nast. For over a century, Condé Nast has stood at the forefront of global media, shaping culture and conversation through its prestigious portfolio of brands. Founded in 1909, the company has evolved from a traditional publisher into a modern media powerhouse. …

How Condé Nast accelerated contract processing and rights analysis with Amazon Bedrock Read More »

Building AI-Powered Voice Applications: Amazon Nova Sonic Telephony Integration Guide

Building AI-Powered Voice Applications: Amazon Nova Sonic Telephony Integration Guide

Organizations are increasingly seeking to enhance customer experiences through natural, responsive voice interactions across their telephony systems. Amazon Nova Sonic addresses this need as a speech-to-speech generative AI model that delivers real-time voice conversations with low latency and natural turn-taking. It understands speech across different accents and speaking styles, responds with expressive voices in multiple …

Building AI-Powered Voice Applications: Amazon Nova Sonic Telephony Integration Guide Read More »

University of California Los Angeles delivers an immersive theater experience with AWS generative AI services

University of California Los Angeles delivers an immersive theater experience with AWS generative AI services

This post was co-written with Andrew Browning, Anthony Doolan, Jerome Ronquillo, Jeff Burke, Chiheb Boussema, and Naisha Agarwal from UCLA. The University of California, Los Angeles (UCLA) is home to 16 Nobel Laureates and has been ranked the #1 public university in the United States for 8 consecutive years. The Office of Advanced Research Computing …

University of California Los Angeles delivers an immersive theater experience with AWS generative AI services Read More »

Optimizing Mobileye’s REM™ with AWS Graviton: A focus on ML inference and Triton integration

Optimizing Mobileye’s REM™ with AWS Graviton: A focus on ML inference and Triton integration

This post is written by Chaim Rand, Principal Engineer, Pini Reisman, Software Senior Principal Engineer, and Eliyah Weinberg, Performance and Technology Innovation Engineer, at Mobileye. The Mobileye team would like to thank Sunita Nadampalli and Guy Almog from AWS for their contributions to this solution and this post. Mobileye is driving the global evolution toward …

Optimizing Mobileye’s REM™ with AWS Graviton: A focus on ML inference and Triton integration Read More »

Evaluate models with the Amazon Nova evaluation container using Amazon SageMaker AI

Evaluate models with the Amazon Nova evaluation container using Amazon SageMaker AI

This blog post introduces the new Amazon Nova model evaluation features in Amazon SageMaker AI. This release adds custom metrics support, LLM-based preference testing, log probability capture, metadata analysis, and multi-node scaling for large evaluations. The new features include: Custom metrics use the bring your own metrics (BYOM) functions to control evaluation criteria for your …

Evaluate models with the Amazon Nova evaluation container using Amazon SageMaker AI Read More »

Beyond the technology: Workforce changes for AI

Beyond the technology: Workforce changes for AI

Workplaces are increasingly integrating AI tools into daily operations, with AI assistants supporting teams, predictive analytics informing strategies, and automation streamlining workflows. AI has moved from experimental technology to standard business practice, changing how work gets done. Organizations need to understand what AI can do and how it affects their workforce to implement it successfully. …

Beyond the technology: Workforce changes for AI Read More »

Enhanced performance for Amazon Bedrock Custom Model Import

Enhanced performance for Amazon Bedrock Custom Model Import

You can now achieve significant performance improvements when using Amazon Bedrock Custom Model Import, with reduced end-to-end latency, faster time-to-first-token, and improved throughput through advanced PyTorch compilation and CUDA graph optimizations. With Amazon Bedrock Custom Model Import you can to bring your own foundation models to Amazon Bedrock for deployment and inference at scale. These …

Enhanced performance for Amazon Bedrock Custom Model Import Read More »

Amazon SageMaker AI introduces EAGLE based adaptive speculative decoding to accelerate generative AI inference

Amazon SageMaker AI introduces EAGLE based adaptive speculative decoding to accelerate generative AI inference

Generative AI models continue to expand in scale and capability, increasing the demand for faster and more efficient inference. Applications need low latency and consistent performance without compromising output quality. Amazon SageMaker AI introduces new enhancements to its inference optimization toolkit that bring EAGLE based adaptive speculative decoding to more model architectures. These updates make …

Amazon SageMaker AI introduces EAGLE based adaptive speculative decoding to accelerate generative AI inference Read More »

Sign In

Register

Reset Password

Please enter your username or email address, you will receive a link to create a new password via email.

Scroll to Top