Blog

Fine-tune and deploy Llama 2 models cost-effectively in Amazon SageMaker JumpStart with AWS Inferentia and AWS Trainium

Fine-tune and deploy Llama 2 models cost-effectively in Amazon SageMaker JumpStart with AWS Inferentia and AWS Trainium

Today, we’re excited to announce the availability of Llama 2 inference and fine-tuning support on AWS Trainium and AWS Inferentia instances in Amazon SageMaker JumpStart. Using AWS Trainium and Inferentia based instances, through SageMaker, can help users lower fine-tuning costs by up to 50%, and lower deployment costs by 4.7x, while lowering per token latency. …

Fine-tune and deploy Llama 2 models cost-effectively in Amazon SageMaker JumpStart with AWS Inferentia and AWS Trainium Read More »

Use mobility data to derive insights using Amazon SageMaker geospatial capabilities

Use mobility data to derive insights using Amazon SageMaker geospatial capabilities

Geospatial data is data about specific locations on the earth’s surface. It can represent a geographical area as a whole or it can represent an event associated with a geographical area. Analysis of geospatial data is sought after in a few industries. It involves understanding where the data exists from a spatial perspective and why …

Use mobility data to derive insights using Amazon SageMaker geospatial capabilities Read More »

Dynatrace and the Microsoft commercial marketplace: AI-powered cloud transformation

Dynatrace and the Microsoft commercial marketplace: AI-powered cloud transformation

In today’s digital landscape, the cloud has become a cornerstone for over 90%1 of organizations. This widespread adoption brings a growing complexity to cloud portfolios as companies navigate the challenges of migrating on-premises environments, managing hybrid workloads, and modernizing their cloud estate—all while operating under current business imperatives of speed and scale. To overcome this, …

Dynatrace and the Microsoft commercial marketplace: AI-powered cloud transformation Read More »

Host the Whisper Model on Amazon SageMaker: exploring inference options

Host the Whisper Model on Amazon SageMaker: exploring inference options

OpenAI Whisper is an advanced automatic speech recognition (ASR) model with an MIT license. ASR technology finds utility in transcription services, voice assistants, and enhancing accessibility for individuals with hearing impairments. This state-of-the-art model is trained on a vast and diverse dataset of multilingual and multitask supervised data collected from the web. Its high accuracy …

Host the Whisper Model on Amazon SageMaker: exploring inference options Read More »

Scroll to Top