Blog_dumb

Warner Bros. Discovery achieves 60% cost savings and faster ML inference with AWS Graviton

Warner Bros. Discovery achieves 60% cost savings and faster ML inference with AWS Graviton

This post is written by Nukul Sharma, Machine Learning Engineering Manager, and Karthik Dasani, Staff Machine Learning Engineer, at Warner Bros. Discovery. Warner Bros. Discovery (WBD) is a leading global media and entertainment company that creates and distributes the world’s most differentiated and complete portfolio of content and brands across television, film and streaming. With iconic …

Warner Bros. Discovery achieves 60% cost savings and faster ML inference with AWS Graviton Read More »

Physical AI in practice: Technical foundations that fuel human-machine interactions

Physical AI in practice: Technical foundations that fuel human-machine interactions

In our previous post, Transforming the physical world with AI: the next frontier in intelligent automation, we explored how the field of physical AI is redefining a wide range of industries including construction, manufacturing, healthcare, and agriculture. Now, we turn our attention to the complete development lifecycle behind this technology – the process of creating intelligent …

Physical AI in practice: Technical foundations that fuel human-machine interactions Read More »

HyperPod now supports Multi-Instance GPU to maximize GPU utilization for generative AI tasks

HyperPod now supports Multi-Instance GPU to maximize GPU utilization for generative AI tasks

We are excited to announce the general availability of GPU partitioning with Amazon SageMaker HyperPod, using NVIDIA Multi-Instance GPU (MIG). With this capability you can run multiple tasks concurrently on a single GPU, minimizing wasted compute and memory resources that result from dedicating entire hardware (for example, entire GPUs) to tasks that can under-utilize the resources. By …

HyperPod now supports Multi-Instance GPU to maximize GPU utilization for generative AI tasks Read More »

Accelerate generative AI innovation in Canada with Amazon Bedrock cross-Region inference

Accelerate generative AI innovation in Canada with Amazon Bedrock cross-Region inference

Generative AI has created unprecedented opportunities for Canadian organizations to transform their operations and customer experiences. We are excited to announce that customers in Canada can now access advanced foundation models including Anthropic’s Claude Sonnet 4.5 and Claude Haiku 4.5 on Amazon Bedrock through cross-Region inference (CRIS). This post explores how Canadian organizations can use …

Accelerate generative AI innovation in Canada with Amazon Bedrock cross-Region inference Read More »

Power up your ML workflows with interactive IDEs on SageMaker HyperPod

Power up your ML workflows with interactive IDEs on SageMaker HyperPod

Amazon SageMaker HyperPod clusters with Amazon Elastic Kubernetes Service (EKS) orchestration now support creating and managing interactive development environments such as JupyterLab and open source Visual Studio Code, streamlining the ML development lifecycle by providing managed environments for familiar tools to data scientists. This feature introduces a new add-on called Amazon SageMaker Spaces for AI developers to create and …

Power up your ML workflows with interactive IDEs on SageMaker HyperPod Read More »

Deploy GPT-OSS models with Amazon Bedrock Custom Model Import

Deploy GPT-OSS models with Amazon Bedrock Custom Model Import

Amazon Bedrock Custom Model Import now supports OpenAI models with open weights, including GPT-OSS variants with 20-billion and 120-billion parameters. GPT-OSS models offer reasoning capabilities and can be used with OpenAI Chat Completions API. By preserving full OpenAI API compatibility, organizations can migrate their existing applications to AWS, gaining enterprise-grade security, scaling, and cost control. …

Deploy GPT-OSS models with Amazon Bedrock Custom Model Import Read More »

Streamline AI operations with the Multi-Provider Generative AI Gateway reference architecture

Streamline AI operations with the Multi-Provider Generative AI Gateway reference architecture

As organizations increasingly adopt AI capabilities across their applications, the need for centralized management, security, and cost control of AI model access is a required step in scaling AI solutions. The Generative AI Gateway on AWS guidance addresses these challenges by providing guidance for a unified gateway that supports multiple AI providers while offering comprehensive …

Streamline AI operations with the Multi-Provider Generative AI Gateway reference architecture Read More »

Deploy geospatial agents with Foursquare Spatial H3 Hub and Amazon SageMaker AI

Deploy geospatial agents with Foursquare Spatial H3 Hub and Amazon SageMaker AI

Organizations have used geospatial machine learning (ML) for property risk assessment, disaster response, and infrastructure planning. These systems worked well but couldn’t scale beyond specialized use cases. Each question required multiple geospatial datasets, each with its own model and often its own workflow, limiting these capabilities to a handful of high-value use cases at the …

Deploy geospatial agents with Foursquare Spatial H3 Hub and Amazon SageMaker AI Read More »

Sign In

Register

Reset Password

Please enter your username or email address, you will receive a link to create a new password via email.

Scroll to Top