Blog

Elevate Azure expertise with new AI and optimization video episodes

The Azure Enablement Show is a library of fun, bite-sized videos that highlight valuable resources for anyone looking to learn more about cloud computing and related Azure tools. In this blog post, we announce two new video series: Azure Optimization Skilling, which explores best practices and resources to maximize the value of your cloud investment, and …


Benchmark and optimize endpoint deployment in Amazon SageMaker JumpStart 


When deploying a large language model (LLM), machine learning (ML) practitioners typically care about two measurements for model serving performance: latency, defined by the time it takes to generate a single token, and throughput, defined by the number of tokens generated per second. Although a single request to the deployed endpoint would exhibit a throughput …


Architect defense-in-depth security for generative AI applications using the OWASP Top 10 for LLMs


Generative artificial intelligence (AI) applications built around large language models (LLMs) have demonstrated the potential to create and accelerate economic value for businesses. Examples of applications include conversational search, customer support agent assistance, customer support analytics, self-service virtual assistants, chatbots, rich media generation, content moderation, coding companions to accelerate secure, high-performance software development, deeper insights …


Deploy a Microsoft Teams gateway for Amazon Q, your business expert


Amazon Q is a new generative AI-powered application that helps users get work done. Amazon Q can become your tailored business expert and let you discover content, brainstorm ideas, or create summaries using your company’s data safely and securely. You can use Amazon Q to have conversations, solve problems, generate content, gain insights, and take …


Build enterprise-ready generative AI solutions with Cohere foundation models in Amazon Bedrock and Weaviate vector database on AWS Marketplace


Generative AI solutions have the potential to transform businesses by boosting productivity and improving customer experiences, and using large language models (LLMs) with these solutions has become increasingly popular. Building proofs of concept is relatively straightforward because cutting-edge foundation models are available from specialized providers through a simple API call. Therefore, organizations of various sizes …


Build a vaccination verification solution using the Queries feature in Amazon Textract


Amazon Textract is a machine learning (ML) service that enables automatic extraction of text, handwriting, and data from scanned documents, surpassing traditional optical character recognition (OCR). It can identify, understand, and extract data from tables and forms with remarkable accuracy. Presently, several companies rely on manual extraction methods or basic OCR software, which is tedious …


Reduce inference time for BERT models using neural architecture search and SageMaker Automated Model Tuning


In this post, we demonstrate how to use neural architecture search (NAS) based structural pruning to compress a fine-tuned BERT model to improve model performance and reduce inference times. Pre-trained language models (PLMs) are undergoing rapid commercial and enterprise adoption in the areas of productivity tools, customer service, search and recommendations, business process automation, and …


Microsoft named a Leader in the 2023 Gartner® Magic Quadrant™ for Container Management


Cloud-native technologies like containers and Kubernetes are the future of application development. That’s why we’re honored to announce that Microsoft has been named a Leader in the 2023 Gartner® Magic Quadrant™ for Container Management*. We believe that this recognition validates our end-to-end approach for developing and deploying enterprise-grade, cloud-native apps that run on Azure, in …

