Blog_dumb

Create rich, custom tooltips in Amazon Quick Sight

Create rich, custom tooltips in Amazon Quick Sight

Amazon Quick Sight, the business intelligence (BI) capability of Amazon Quick, is a unified BI service. It provides modern interactive dashboards, natural language querying, pixel-perfect reports, machine learning (ML) insights, and embedded analytics at scale. Amazon Quick brings together AI agents for business insights, research, and automation in one integrated experience, helping you work smarter …

Create rich, custom tooltips in Amazon Quick Sight Read More »

Accelerating decode-heavy LLM inference with speculative decoding on AWS Trainium and vLLM

Accelerating decode-heavy LLM inference with speculative decoding on AWS Trainium and vLLM

Practical benchmarks showing faster inter-token latency when deploying Qwen3 models with vLLM, Kubernetes, and AWS AI Chips. Speculative decoding on AWS Trainium can accelerate token generation by up to 3x for decode-heavy workloads, helping reduce the cost per output token and improving throughput without sacrificing output quality. If you build AI writing assistants, coding agents, …

Accelerating decode-heavy LLM inference with speculative decoding on AWS Trainium and vLLM Read More »

Rede Mater Dei de Saúde: Monitoring AI agents in the revenue cycle with Amazon Bedrock AgentCore

Rede Mater Dei de Saúde: Monitoring AI agents in the revenue cycle with Amazon Bedrock AgentCore

This post is cowritten by Renata Salvador Grande, Gabriel Bueno and Paulo Laurentys at Rede Mater Dei de Saúde. The growing adoption of multi-agent AI systems is redefining critical operations in healthcare. In large hospital networks, where thousands of decisions directly impact cash flow, service delivery times, and the risk of claim denials, the ability …

Rede Mater Dei de Saúde: Monitoring AI agents in the revenue cycle with Amazon Bedrock AgentCore Read More »

Navigating the generative AI journey: The Path-to-Value framework from AWS

Navigating the generative AI journey: The Path-to-Value framework from AWS

Generative AI is reshaping how organizations approach productivity, customer experiences, and operational capabilities. Across industries, teams are experimenting with generative AI to unlock new ways of working. Many of these efforts produce compelling proofs of concept (POC) that demonstrate technical feasibility. The real challenge begins after those early wins. Although POCs frequently demonstrate technical feasibility, …

Navigating the generative AI journey: The Path-to-Value framework from AWS Read More »

Use-case based deployments on SageMaker JumpStart

Use-case based deployments on SageMaker JumpStart

Amazon SageMaker JumpStart provides pretrained models for a wide range of problem types to help you get started with AI workloads. SageMaker JumpStart offers access to solutions for top use cases that can be deployed to SageMaker AI Managed Inference endpoints or SageMaker HyperPod clusters. Through pre-set deployment options, customers can quickly move from model …

Use-case based deployments on SageMaker JumpStart Read More »

Best practices to run inference on Amazon SageMaker HyperPod

Best practices to run inference on Amazon SageMaker HyperPod

Deploying and scaling foundation models for generative AI inference presents challenges for organizations. Teams often struggle with complex infrastructure setup, unpredictable traffic patterns that lead to over-provisioning or performance bottlenecks, and the operational overhead of managing GPU resources efficiently. These pain points result in delayed time-to-market, suboptimal model performance, and inflated costs that can make …

Best practices to run inference on Amazon SageMaker HyperPod Read More »

How Guidesly built AI-generated trip reports for outdoor guides on AWS

How Guidesly built AI-generated trip reports for outdoor guides on AWS

This is guest post by David Lord, Taylor Lord, Shiva Prasad, Anup Banasavalli Hiriyanagowda, Nikhil Chandra from Guidesly. Guidesly is reshaping how outdoor recreation is booked, run, and experienced. Founded in 2019, it began as a way to connect anglers, hunters, divers, and outdoor recreation enthusiasts with trusted guides, dive shops, and charters. It has …

How Guidesly built AI-generated trip reports for outdoor guides on AWS Read More »

Spring AI SDK for Amazon Bedrock AgentCore is now Generally Available

Spring AI SDK for Amazon Bedrock AgentCore is now Generally Available

Agentic AI is transforming how organizations use generative AI, moving beyond prompt-response interactions to autonomous systems that can plan, execute, and complete complex multi-step tasks. While early proof of concepts in Agentic AI spaces excite business stakeholders, scaling them to production requires addressing scalability, governance, and security challenges. Amazon Bedrock AgentCore is an Agentic AI …

Spring AI SDK for Amazon Bedrock AgentCore is now Generally Available Read More »

Sign In

Register

Reset Password

Please enter your username or email address, you will receive a link to create a new password via email.

Scroll to Top