Blog_dumb - Page 31 of 251 - HKU SPACE AI Hub

Cost-effective multilingual audio transcription at scale with Parakeet-TDT and AWS Batch

Many organizations are archiving large media libraries, analyzing contact center recordings, preparing training data for AI, or processing on-demand video for subtitles. When data volumes grow significantly, managed automatic speech recognition (ASR) service costs can quickly become the primary constraint on scalability. To address this cost-scalability challenge, we use the NVIDIA Parakeet-TDT-0.6B-v3 model, deployed through […]

Amazon SageMaker AI now supports optimized generative AI inference recommendations

Leave a Comment / Blog / admin

Organizations are racing to deploy generative AI models into production to power intelligent assistants, code generation tools, content engines, and customer-facing applications. But deploying these models to production remains a weeks-long process of navigating GPU configurations, optimization techniques, and manual benchmarking, delaying the value these models are built to deliver. Today, Amazon SageMaker AI supports

Get to your first working agent in minutes: Announcing new features in Amazon Bedrock AgentCore

Getting an agent running has always meant solving a long list of infrastructure problems before you can test whether the agent itself is any good. You wire up frameworks, storage, authentication, and deployment pipelines, and by the time your agent handles its first real task, you’ve spent days on infrastructure instead of agent logic. We

Company-wise memory in Amazon Bedrock with Amazon Neptune and Mem0

This post is cowritten by Shawn Tsai from TrendMicro. Delivering relevant, context-aware responses is important for customer satisfaction. For enterprise-grade AI chatbots, understanding not only the current query but also the organizational context behind it is key. Company-wise memory in Amazon Bedrock, powered by Amazon Neptune and Mem0, provides AI agents with persistent, company-specific context—enabling

We’re launching two specialized TPUs for the agentic era.

The eighth generation of Google’s TPU includes two specialized chips that will power the future of AI.

From developer desks to the whole organization: Running Claude Cowork in Amazon Bedrock

Today, we’re excited to announce Claude Cowork in Amazon Bedrock. You can now run Cowork and Claude Code Desktop through Amazon Bedrock, directly or using an LLM gateway. From startups to global enterprises across every industry, organizations build with Claude Code in Amazon Bedrock to boost developer productivity and accelerate delivery. With Amazon Bedrock you

End-to-end lineage with DVC and Amazon SageMaker AI MLflow apps

Production machine learning (ML) teams struggle to trace the full lineage of a model through the data and the code that trained it, the exact dataset version it consumed, and the experiment metrics that justified its deployment. Without this traceability, questions like “which data trained the model currently in production?” or “can we reproduce the

3 new ways Ads Advisor is making Google Ads safer and faster

Three new agentic safety and policy features integrated into Ads Advisor will help protect and streamline your Google Ads account.

Accelerate Generative AI Inference on Amazon SageMaker AI with G7e Instances

As the demand for generative AI continues to grow, developers and enterprises seek more flexible, cost-effective, and powerful accelerators to meet their needs. Today, we are thrilled to announce the availability of G7e instances powered by NVIDIA RTX PRO 6000 Blackwell Server Edition GPUs on Amazon SageMaker AI. You can provision nodes with 1, 2,

ToolSimulator: scalable tool testing for AI agents

You can use ToolSimulator, an LLM-powered tool simulation framework within Strands Evals, to thoroughly and safely test AI agents that rely on external tools, at scale. Instead of risking live API calls that expose personally identifiable information (PII) or trigger unintended actions, or settling for static mocks that break with multi-turn workflows, you can use ToolSimulator’s large
