Blog_dumb

Reduce inference time for BERT models using neural architecture search and SageMaker Automated Model Tuning

Reduce inference time for BERT models using neural architecture search and SageMaker Automated Model Tuning

In this post, we demonstrate how to use neural architecture search (NAS) based structural pruning to compress a fine-tuned BERT model to improve model performance and reduce inference times. Pre-trained language models (PLMs) are undergoing rapid commercial and enterprise adoption in the areas of productivity tools, customer service, search and recommendations, business process automation, and …

Reduce inference time for BERT models using neural architecture search and SageMaker Automated Model Tuning Read More »

Microsoft named a Leader in the 2023 Gartner® Magic Quadrant™ for Container Management

Microsoft named a Leader in the 2023 Gartner® Magic Quadrant™ for Container Management

Cloud-native technologies like containers and Kubernetes are the future of application development. That’s why we’re honored to announce that Microsoft has been named a Leader in the 2023 Gartner® Magic Quadrant™ for Container Management*. We believe that this recognition validates our end-to-end approach for developing and deploying enterprise-grade, cloud-native apps that run on Azure, in …

Microsoft named a Leader in the 2023 Gartner® Magic Quadrant™ for Container Management Read More »

Fine-tune and deploy Llama 2 models cost-effectively in Amazon SageMaker JumpStart with AWS Inferentia and AWS Trainium

Fine-tune and deploy Llama 2 models cost-effectively in Amazon SageMaker JumpStart with AWS Inferentia and AWS Trainium

Today, we’re excited to announce the availability of Llama 2 inference and fine-tuning support on AWS Trainium and AWS Inferentia instances in Amazon SageMaker JumpStart. Using AWS Trainium and Inferentia based instances, through SageMaker, can help users lower fine-tuning costs by up to 50%, and lower deployment costs by 4.7x, while lowering per token latency. …

Fine-tune and deploy Llama 2 models cost-effectively in Amazon SageMaker JumpStart with AWS Inferentia and AWS Trainium Read More »

Use mobility data to derive insights using Amazon SageMaker geospatial capabilities

Use mobility data to derive insights using Amazon SageMaker geospatial capabilities

Geospatial data is data about specific locations on the earth’s surface. It can represent a geographical area as a whole or it can represent an event associated with a geographical area. Analysis of geospatial data is sought after in a few industries. It involves understanding where the data exists from a spatial perspective and why …

Use mobility data to derive insights using Amazon SageMaker geospatial capabilities Read More »

Dynatrace and the Microsoft commercial marketplace: AI-powered cloud transformation

Dynatrace and the Microsoft commercial marketplace: AI-powered cloud transformation

In today’s digital landscape, the cloud has become a cornerstone for over 90%1 of organizations. This widespread adoption brings a growing complexity to cloud portfolios as companies navigate the challenges of migrating on-premises environments, managing hybrid workloads, and modernizing their cloud estate—all while operating under current business imperatives of speed and scale. To overcome this, …

Dynatrace and the Microsoft commercial marketplace: AI-powered cloud transformation Read More »

Scroll to Top