Blog

Efficiently build and tune custom log anomaly detection models with Amazon SageMaker

Efficiently build and tune custom log anomaly detection models with Amazon SageMaker

In this post, we walk you through the process to build an automated mechanism using Amazon SageMaker to process your log data, run training iterations over it to obtain the best-performing anomaly detection model, and register it with the Amazon SageMaker Model Registry for your customers to use it. Log-based anomaly detection involves identifying anomalous …

Efficiently build and tune custom log anomaly detection models with Amazon SageMaker Read More »

Explore the business case for responsible AI in new IDC whitepaper

I am pleased to introduce Microsoft’s commissioned whitepaper with IDC: The Business Case for Responsible AI. This whitepaper, based on IDC’s Worldwide Responsible AI Survey sponsored by Microsoft, offers guidance to business and technology leaders on how to systematically build trustworthy AI. In today’s rapidly evolving technological landscape, AI has emerged as a transformative force, …

Explore the business case for responsible AI in new IDC whitepaper Read More »

Optimizing costs of generative AI applications on AWS

Optimizing costs of generative AI applications on AWS

The report The economic potential of generative AI: The next productivity frontier, published by McKinsey & Company, estimates that generative AI could add an equivalent of $2.6 trillion to $4.4 trillion in value to the global economy. The largest value will be added across four areas: customer operations, marketing and sales, software engineering, and R&D. …

Optimizing costs of generative AI applications on AWS Read More »

PEFT fine tuning of Llama 3 on SageMaker HyperPod with AWS Trainium

PEFT fine tuning of Llama 3 on SageMaker HyperPod with AWS Trainium

Training large language models (LLMs) models has become a significant expense for businesses. For many use cases, companies are looking to use LLM foundation models (FM) with their domain-specific data. However, companies are discovering that performing full fine tuning for these models with their data isn’t cost effective. To reduce costs while continuing to use the …

PEFT fine tuning of Llama 3 on SageMaker HyperPod with AWS Trainium Read More »

Using transcription confidence scores to improve slot filling in Amazon Lex

Using transcription confidence scores to improve slot filling in Amazon Lex

When building voice-enabled chatbots with Amazon Lex, one of the biggest challenges is accurately capturing user speech input for slot values. For example, when a user needs to provide their account number or confirmation code, speech recognition accuracy becomes crucial. This is where transcription confidence scores come in to help ensure reliable slot filling. What …

Using transcription confidence scores to improve slot filling in Amazon Lex Read More »

Improving Retrieval Augmented Generation accuracy with GraphRAG

Improving Retrieval Augmented Generation accuracy with GraphRAG

Customers need better accuracy to take generative AI applications into production. In a world where decisions are increasingly data-driven, the integrity and reliability of information are paramount. To address this, customers often begin by enhancing generative AI accuracy through vector-based retrieval systems and the Retrieval Augmented Generation (RAG) architectural pattern, which integrates dense embeddings to …

Improving Retrieval Augmented Generation accuracy with GraphRAG Read More »

Achieving AI-powered success: Learnings from Cloud Cultures Season 3 

Achieving AI-powered success: Learnings from Cloud Cultures Season 3 

Cloud technology plays an important role in how organizations around the world are modernizing and innovating. Equally important is the vision that business and technology leaders bring to the process. Shaped by culture, traditions, and values, it’s our lived experience that often forms the foundation for innovation. Technology helps bring the vision to life. In …

Achieving AI-powered success: Learnings from Cloud Cultures Season 3  Read More »

Add a generative AI experience to your website or web application with Amazon Q embedded

Add a generative AI experience to your website or web application with Amazon Q embedded

Generative AI offers many benefits for both you, as a software provider, and your end-users. AI assistants can help users generate insights, get help, and find information that may be hard to surface using traditional means. In addition, they can help your employees reduce repetitive tasks and focus on high-value work. However, adding generative AI …

Add a generative AI experience to your website or web application with Amazon Q embedded Read More »

Scroll to Top