Blog_dumb

Evaluate healthcare generative AI applications using LLM-as-a-judge on AWS

Evaluate healthcare generative AI applications using LLM-as-a-judge on AWS

In our previous blog posts, we explored various techniques such as fine-tuning large language models (LLMs), prompt engineering, and Retrieval Augmented Generation (RAG) using Amazon Bedrock to generate impressions from the findings section in radiology reports using generative AI. Part 1 focused on model fine-tuning. Part 2 introduced RAG, which combines LLMs with external knowledge …

Evaluate healthcare generative AI applications using LLM-as-a-judge on AWS Read More »

AWS DeepRacer: Closing time at AWS re:Invent 2024 –How did that physical racing go?

AWS DeepRacer: Closing time at AWS re:Invent 2024 –How did that physical racing go?

Having spent the last years studying the art of AWS DeepRacer in the physical world, the author went to AWS re:Invent 2024. How did it go? In AWS DeepRacer: How to master physical racing?, I wrote in detail about some aspects relevant to racing AWS DeepRacer in the physical world. We looked at the differences …

AWS DeepRacer: Closing time at AWS re:Invent 2024 –How did that physical racing go? Read More »

Empowering innovation: The next generation of the Phi family

Empowering innovation: The next generation of the Phi family

We are excited to announce Phi-4-multimodal and Phi-4-mini, the newest models in Microsoft’s Phi family of small language models (SLMs). These models are designed to empower developers with advanced AI capabilities. Phi-4-multimodal, with its ability to process speech, vision, and text simultaneously, opens new possibilities for creating innovative and context-aware applications. Phi-4-mini, on the other …

Empowering innovation: The next generation of the Phi family Read More »

Azure NetApp Files: Revolutionizing silicon design for high-performance computing

Azure NetApp Files: Revolutionizing silicon design for high-performance computing

High-performance computing (HPC) workloads place significant demands on cloud infrastructure, requiring robust and scalable resources to handle complex and intensive computational tasks. These workloads often necessitate high levels of parallel processing power, typically provided by clusters of central processing unit (CPU) or graphics processing unit (GPU)-based virtual machines. Additionally, HPC applications demand substantial data storage …

Azure NetApp Files: Revolutionizing silicon design for high-performance computing Read More »

How Pattern PXM’s Content Brief is driving conversion on ecommerce marketplaces using AI

How Pattern PXM’s Content Brief is driving conversion on ecommerce marketplaces using AI

Brands today are juggling a million things, and keeping product content up-to-date is at the top of the list. Between decoding the endless requirements of different marketplaces, wrangling inventory across channels, adjusting product listings to catch a customer’s eye, and trying to outpace shifting trends and fierce competition, it’s a lot. And let’s face it—staying …

How Pattern PXM’s Content Brief is driving conversion on ecommerce marketplaces using AI Read More »

How to configure cross-account model deployment using Amazon Bedrock Custom Model Import

How to configure cross-account model deployment using Amazon Bedrock Custom Model Import

In enterprise environments, organizations often divide their AI operations into two specialized teams: an AI research team and a model hosting team. The research team is dedicated to developing and enhancing AI models using model training and fine-tuning techniques. Meanwhile, a separate hosting team is responsible for deploying these models across their own development, staging, …

How to configure cross-account model deployment using Amazon Bedrock Custom Model Import Read More »

ByteDance processes billions of daily videos using their multimodal video understanding models on AWS Inferentia2

ByteDance processes billions of daily videos using their multimodal video understanding models on AWS Inferentia2

This is a guest post authored by the team at ByteDance. ByteDance is a technology company that operates a range of content platforms to inform, educate, entertain, and inspire people across languages, cultures, and geographies. Users trust and enjoy our content platforms because of the rich, intuitive, and safe experiences they provide. These experiences are …

ByteDance processes billions of daily videos using their multimodal video understanding models on AWS Inferentia2 Read More »

How IDIADA optimized its intelligent chatbot with Amazon Bedrock

How IDIADA optimized its intelligent chatbot with Amazon Bedrock

This post is co-written with Xavier Vizcaino, Diego Martín Montoro, and Jordi Sánchez Ferrer from Applus+ Idiada. In 2021, Applus+ IDIADA, a global partner to the automotive industry with over 30 years of experience supporting customers in product development activities through design, engineering, testing, and homologation services, established the Digital Solutions department. This strategic move …

How IDIADA optimized its intelligent chatbot with Amazon Bedrock Read More »

Accelerate IaC troubleshooting with Amazon Bedrock Agents

Accelerate IaC troubleshooting with Amazon Bedrock Agents

Troubleshooting infrastructure as code (IaC) errors often consumes valuable time and resources. Developers can spend multiple cycles searching for solutions across forums, troubleshooting repetitive issues, or trying to identify the root cause. These delays can lead to missed security errors or compliance violations, especially in complex, multi-account environments. This post demonstrates how you can use …

Accelerate IaC troubleshooting with Amazon Bedrock Agents Read More »

Scroll to Top