Blog_dumb

Solar models from Upstage are now available in Amazon SageMaker JumpStart

Solar models from Upstage are now available in Amazon SageMaker JumpStart

This blog post is co-written with Hwalsuk Lee at Upstage. Today, we’re excited to announce that the Solar foundation model developed by Upstage is now available for customers using Amazon SageMaker JumpStart. Solar is a large language model (LLM) 100% pre-trained with Amazon SageMaker that outperforms and uses its compact size and powerful track records …

Solar models from Upstage are now available in Amazon SageMaker JumpStart Read More »

Scale LLMs with PyTorch 2.0 FSDP on Amazon EKS – Part 2

Scale LLMs with PyTorch 2.0 FSDP on Amazon EKS – Part 2

This is a guest post co-written with Meta’s PyTorch team and is a continuation of Part 1 of this series, where we demonstrate the performance and ease of running PyTorch 2.0 on AWS. Machine learning (ML) research has proven that large language models (LLMs) trained with significantly large datasets result in better model quality. In …

Scale LLMs with PyTorch 2.0 FSDP on Amazon EKS – Part 2 Read More »

New infrastructure for the era of AI: Emerging technology and trends in 2024

New infrastructure for the era of AI: Emerging technology and trends in 2024

This is part of a larger series on the new infrastructure of the era of AI, highlighting emerging technology and trends in large-scale compute. This month, we’re sharing the 2024 edition of the State of AI Infrastructure report to help businesses harness the power of AI now.  The era of AI is upon us. You’ve …

New infrastructure for the era of AI: Emerging technology and trends in 2024 Read More »

Microsoft Azure AI celebrates Women’s History Month through our customers

Microsoft Azure AI celebrates Women’s History Month through our customers

As a young girl growing up, the tech world seemed like a distant reality, predominantly occupied by men. It wasn’t until my older sister pursued a degree in informatics in college that I realized the potential for women in technology. This revelation was not just about inspiring young girls like me but about fostering an …

Microsoft Azure AI celebrates Women’s History Month through our customers Read More »

Provide live agent assistance for your chatbot users with Amazon Lex and Talkdesk cloud contact center

Provide live agent assistance for your chatbot users with Amazon Lex and Talkdesk cloud contact center

Amazon Lex provides advanced conversational artificial intelligence (AI) capabilities to enable self-service support for your organization’s contact center. With Amazon Lex, you can implement an omnichannel strategy where customers engage via phone, websites, and messaging platforms. The bots can answer FAQs, provide self-service experiences, or triage customer requests before transferring to a human agent. Amazon Lex integrates …

Provide live agent assistance for your chatbot users with Amazon Lex and Talkdesk cloud contact center Read More »

Advanced RAG patterns on Amazon SageMaker

Advanced RAG patterns on Amazon SageMaker

Today, customers of all industries—whether it’s financial services, healthcare and life sciences, travel and hospitality, media and entertainment, telecommunications, software as a service (SaaS), and even proprietary model providers—are using large language models (LLMs) to build applications like question and answering (QnA) chatbots, search engines, and knowledge bases. These generative AI applications are not only …

Advanced RAG patterns on Amazon SageMaker Read More »

Efficient continual pre-training LLMs for financial domains

Efficient continual pre-training LLMs for financial domains

Large language models (LLMs) are generally trained on large publicly available datasets that are domain agnostic. For example, Meta’s Llama models are trained on datasets such as CommonCrawl, C4, Wikipedia, and ArXiv. These datasets encompass a broad range of topics and domains. Although the resulting models yield amazingly good results for general tasks, such as …

Efficient continual pre-training LLMs for financial domains Read More »

Announcing new tools in Azure AI to help you build more secure and trustworthy generative AI applications

Announcing new tools in Azure AI to help you build more secure and trustworthy generative AI applications

In the rapidly evolving landscape of generative AI, business leaders are trying to strike the right balance between innovation and risk management. Prompt injection attacks have emerged as a significant challenge, where malicious actors try to manipulate an AI system into doing something outside its intended purpose, such as producing harmful content or exfiltrating confidential …

Announcing new tools in Azure AI to help you build more secure and trustworthy generative AI applications Read More »

Scroll to Top