Blog_dumb

How GoDaddy built a category generation system at scale with batch inference for Amazon Bedrock

How GoDaddy built a category generation system at scale with batch inference for Amazon Bedrock

This post was co-written with Vishal Singh, Data Engineering Leader at Data & Analytics team of GoDaddy Generative AI solutions have the potential to transform businesses by boosting productivity and improving customer experiences, and using large language models (LLMs) in these solutions has become increasingly popular. However, inference of LLMs as single model invocations or …

How GoDaddy built a category generation system at scale with batch inference for Amazon Bedrock Read More »

Benchmarking customized models on Amazon Bedrock using LLMPerf and LiteLLM

Benchmarking customized models on Amazon Bedrock using LLMPerf and LiteLLM

Open foundation models (FMs) allow organizations to build customized AI applications by fine-tuning for their specific domains or tasks, while retaining control over costs and deployments. However, deployment can be a significant portion of the effort, often requiring 30% of project time because engineers must carefully optimize instance types and configure serving parameters through careful …

Benchmarking customized models on Amazon Bedrock using LLMPerf and LiteLLM Read More »

Creating asynchronous AI agents with Amazon Bedrock

Creating asynchronous AI agents with Amazon Bedrock

The integration of generative AI agents into business processes is poised to accelerate as organizations recognize the untapped potential of these technologies. Advancements in multimodal artificial intelligence (AI), where agents can understand and generate not just text but also images, audio, and video, will further broaden their applications. This post will discuss agentic AI driven …

Creating asynchronous AI agents with Amazon Bedrock Read More »

How to run Qwen 2.5 on AWS AI chips using Hugging Face libraries

How to run Qwen 2.5 on AWS AI chips using Hugging Face libraries

The Qwen 2.5 multilingual large language models (LLMs) are a collection of pre-trained and instruction tuned generative models in 0.5B, 1.5B, 3B, 7B, 14B, 32B, and 72B (text in/text out and code out). The Qwen 2.5 fine tuned text-only models are optimized for multilingual dialogue use cases and outperform both previous generations of Qwen models, and …

How to run Qwen 2.5 on AWS AI chips using Hugging Face libraries Read More »

Revolutionizing customer service: MaestroQA’s integration with Amazon Bedrock for actionable insight

Revolutionizing customer service: MaestroQA’s integration with Amazon Bedrock for actionable insight

This post is cowritten with Harrison Hunter is the CTO and co-founder of MaestroQA. MaestroQA augments call center operations by empowering the quality assurance (QA) process and customer feedback analysis to increase customer satisfaction and drive operational efficiencies. They assist with operations such as QA reporting, coaching, workflow automations, and root cause analysis. In this …

Revolutionizing customer service: MaestroQA’s integration with Amazon Bedrock for actionable insight Read More »

Optimize hosting DeepSeek-R1 distilled models with Hugging Face TGI on Amazon SageMaker AI

Optimize hosting DeepSeek-R1 distilled models with Hugging Face TGI on Amazon SageMaker AI

DeepSeek-R1, developed by AI startup DeepSeek AI, is an advanced large language model (LLM) distinguished by its innovative, multi-stage training process. Instead of relying solely on traditional pre-training and fine-tuning, DeepSeek-R1 integrates reinforcement learning to achieve more refined outputs. The model employs a chain-of-thought (CoT) approach that systematically breaks down complex queries into clear, logical …

Optimize hosting DeepSeek-R1 distilled models with Hugging Face TGI on Amazon SageMaker AI Read More »

Exploring creative possibilities: A visual guide to Amazon Nova Canvas

Exploring creative possibilities: A visual guide to Amazon Nova Canvas

Compelling AI-generated images start with well-crafted prompts. In this follow-up to our Amazon Nova Canvas Prompt Engineering Guide, we showcase a curated gallery of visuals generated by Nova Canvas—categorized by real-world use cases—from marketing and product visualization to concept art and design exploration. Each image is paired with the prompt and parameters that generated it, …

Exploring creative possibilities: A visual guide to Amazon Nova Canvas Read More »

Microsoft Cost Management updates—March 2025

Microsoft Cost Management updates—March 2025

In this article Optimizing AKS (Azure Kubernetes Service) costs AWS (Azure Web Services) connector deprecationExchange of Azure OpenAI Service Provisioned ReservationsHelp shape the future of cost reportingDocumentation updatesWhat’s next for Cost Management? Whether you’re a new student, a thriving startup, or the largest enterprise, you have financial constraints, and you need to know what you’re spending, …

Microsoft Cost Management updates—March 2025 Read More »

Announcing the Responses API and Computer-Using Agent in Azure AI Foundry

Announcing the Responses API and Computer-Using Agent in Azure AI Foundry

AI agents are transforming industries by automating workflows, enhancing productivity, and enabling intelligent decision-making. Businesses are leveraging AI agents to process insurance claims, manage IT service desks, optimize supply chain logistics, and even assist healthcare professionals in analyzing medical records. The potential is vast, and we’re excited to introduce two powerful innovations in Azure AI …

Announcing the Responses API and Computer-Using Agent in Azure AI Foundry Read More »

Benchmarking Amazon Nova and GPT-4o models with FloTorch

Benchmarking Amazon Nova and GPT-4o models with FloTorch

Based on original post by Dr. Hemant Joshi, CTO, FloTorch.ai A recent evaluation conducted by FloTorch compared the performance of Amazon Nova models with OpenAI’s GPT-4o. Amazon Nova is a new generation of state-of-the-art foundation models (FMs) that deliver frontier intelligence and industry-leading price-performance. The Amazon Nova family of models includes Amazon Nova Micro, Amazon …

Benchmarking Amazon Nova and GPT-4o models with FloTorch Read More »

Scroll to Top