Blog_dumb

Whiteboard to cloud in minutes using Amazon Q, Amazon Bedrock Data Automation, and Model Context Protocol

Whiteboard to cloud in minutes using Amazon Q, Amazon Bedrock Data Automation, and Model Context Protocol

Upgrading legacy systems has become increasingly important to stay competitive in today’s market as outdated infrastructure can cost organizations time, money, and market position. However, modernization efforts face challenges like time-consuming architecture reviews, complex migrations, and fragmented systems. These delays not only impact engineering teams but have broader impacts including lost market opportunities, reduced competitiveness, …

Whiteboard to cloud in minutes using Amazon Q, Amazon Bedrock Data Automation, and Model Context Protocol Read More »

Bringing agentic Retrieval Augmented Generation to Amazon Q Business

Bringing agentic Retrieval Augmented Generation to Amazon Q Business

Amazon Q Business is a generative AI-powered enterprise assistant that helps organizations unlock value from their data. By connecting to enterprise data sources, employees can use Amazon Q Business to quickly find answers, generate content, and automate tasks—from accessing HR policies to streamlining IT support workflows, all while respecting existing permissions and providing clear citations. …

Bringing agentic Retrieval Augmented Generation to Amazon Q Business Read More »

Empowering students with disabilities: University Startups’ generative AI solution for personalized student pathways

Empowering students with disabilities: University Startups’ generative AI solution for personalized student pathways

This post was co-authored with Laura Lee Williams and John Jabara from University Startups. University Startups, headquartered in Bethesda, MD, was founded in 2020 to empower high school students to expand their education beyond a traditional curriculum. University Startups is focused on special education and related services in school districts throughout the US. After students …

Empowering students with disabilities: University Startups’ generative AI solution for personalized student pathways Read More »

Citations with Amazon Nova understanding models

Citations with Amazon Nova understanding models

Large language models (LLMs) have become increasingly prevalent across both consumer and enterprise applications. However, their tendency to “hallucinate” information and deliver incorrect answers with seeming confidence has created a trust problem. Think of LLMs as you would a human expert: we typically trust experts who can back up their claims with references and walk …

Citations with Amazon Nova understanding models Read More »

Securely launch and scale your agents and tools on Amazon Bedrock AgentCore Runtime

Securely launch and scale your agents and tools on Amazon Bedrock AgentCore Runtime

Organizations are increasingly excited about the potential of AI agents, but many find themselves stuck in what we call “proof of concept purgatory”—where promising agent prototypes struggle to make the leap to production deployment. In our conversations with customers, we’ve heard consistent challenges that block the path from experimentation to enterprise-grade deployment: “Our developers want …

Securely launch and scale your agents and tools on Amazon Bedrock AgentCore Runtime Read More »

PwC and AWS Build Responsible AI with Automated Reasoning on Amazon Bedrock

PwC and AWS Build Responsible AI with Automated Reasoning on Amazon Bedrock

This is a guest post co-written with Scott Likens, Ambuj Gupta, Adam Hood, Chantal Hudson, Priyanka Mukhopadhyay, Deniz Konak Ozturk, and Kevin Paul from PwC Organizations are deploying generative AI solutions while balancing accuracy, security, and compliance. In this globally competitive environment, scale matters less, speed matters more, and innovation matters most of all, according …

PwC and AWS Build Responsible AI with Automated Reasoning on Amazon Bedrock Read More »

How Amazon scaled Rufus by building multi-node inference using AWS Trainium chips and vLLM

How Amazon scaled Rufus by building multi-node inference using AWS Trainium chips and vLLM

At Amazon, our team builds Rufus, a generative AI-powered shopping assistant that serves millions of customers at immense scale. However, deploying Rufus at scale introduces significant challenges that must be carefully navigated. Rufus is powered by a custom-built large language model (LLM). As the model’s complexity increased, we prioritized developing scalable multi-node inference capabilities that …

How Amazon scaled Rufus by building multi-node inference using AWS Trainium chips and vLLM Read More »

Scroll to Top