SANTA CLARA — At its annual AWS re:Invent conference, Amazon Web Services (AWS) announced an expanded collaboration with NVIDIA to enhance the deployment of generative AI applications. NVIDIA NIM microservices, optimized for high-performance inference, are now available through AWS Marketplace, Amazon Bedrock Marketplace, and Amazon SageMaker JumpStart. The integration is designed to give enterprises faster, lower-latency inference and cost-effective scaling as they adopt increasingly complex AI models.
NVIDIA NIM, part of the NVIDIA AI Enterprise software platform, provides developers with secure, enterprise-grade microservices for deploying AI model inference across cloud, data center, and workstation environments. Built on robust inference engines like NVIDIA Triton Inference Server and NVIDIA TensorRT, these prebuilt containers support a wide range of AI models, from open-source community models to proprietary NVIDIA AI Foundation models.
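As a rough illustration of working with one of these prebuilt containers, the sketch below polls a running NIM microservice until it reports ready and then lists the model it serves. The localhost address, port 8000, and the `/v1/health/ready` and `/v1/models` routes reflect common NIM conventions and are assumptions here, not guaranteed values for every container.

```python
import time
import requests

# Assumed local NIM endpoint; NIM containers commonly listen on port 8000.
BASE_URL = "http://localhost:8000"


def wait_until_ready(timeout_s: int = 300) -> None:
    """Poll the readiness route (an assumed NIM convention) until the service is up."""
    deadline = time.time() + timeout_s
    while time.time() < deadline:
        try:
            if requests.get(f"{BASE_URL}/v1/health/ready", timeout=5).status_code == 200:
                print("NIM microservice is ready")
                return
        except requests.ConnectionError:
            pass  # the container may still be starting
        time.sleep(5)
    raise TimeoutError("NIM microservice did not become ready in time")


def list_models() -> None:
    """List the served model(s) via the OpenAI-compatible /v1/models route."""
    resp = requests.get(f"{BASE_URL}/v1/models", timeout=5)
    resp.raise_for_status()
    for model in resp.json().get("data", []):
        print(model["id"])


if __name__ == "__main__":
    wait_until_ready()
    list_models()
```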
Key Models and Applications
AWS services such as Amazon EC2, Amazon EKS, and Amazon SageMaker can now host over 100 NVIDIA NIM microservices, including the following (a brief invocation sketch follows the list):
- NVIDIA Nemotron-4: Available on multiple AWS platforms, this cutting-edge model generates synthetic data to enhance custom LLMs.
- Llama 3.1 8B-Instruct and 70B-Instruct: Multilingual large language models optimized for text generation and dialogue.
- Mixtral 8x7B Instruct v0.1: A sparse mixture of experts model designed for versatile text generation and task completion.
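To show how one of these models is consumed once deployed, the minimal sketch below sends a chat request to a NIM microservice through its OpenAI-compatible API. The endpoint URL is a placeholder for wherever the microservice runs (an EC2 instance, an EKS service, or a self-managed host), and the model identifier follows the catalog naming convention but should be verified against the deployed container with a `/v1/models` request.

```python
import requests

# Placeholder endpoint for a deployed NIM microservice (assumption).
NIM_URL = "http://your-nim-host:8000/v1/chat/completions"

payload = {
    # Model identifier as exposed by the container; verify with GET /v1/models.
    "model": "meta/llama-3.1-8b-instruct",
    "messages": [
        {"role": "user", "content": "Summarize the benefits of GPU-accelerated inference."}
    ],
    "max_tokens": 256,
    "temperature": 0.2,
}

resp = requests.post(NIM_URL, json=payload, timeout=60)
resp.raise_for_status()
print(resp.json()["choices"][0]["message"]["content"])
```

Because the interface mirrors the OpenAI chat completions schema, existing clients that speak that protocol can generally be pointed at a NIM endpoint with little more than a base-URL change.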
Industry Adoption and Use Cases
Organizations across industries are using NIM microservices to streamline AI application deployment while maintaining security and controlling costs. SoftServe, a digital services provider, has developed six generative AI solutions on AWS powered by NVIDIA NIM. These include applications for drug discovery, industrial assistance, and speech recognition, all based on NVIDIA AI Blueprints for rapid deployment.
Getting Started
Developers can explore NVIDIA NIM microservices on the NVIDIA API catalog, which offers over 100 optimized models, and deploy them through AWS Marketplace, Amazon Bedrock Marketplace, and Amazon SageMaker JumpStart. A developer license or a 90-day NVIDIA AI Enterprise trial is available for evaluating NIM microservices against specific needs.
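For teams starting from SageMaker JumpStart, the sketch below shows the general shape of programmatic deployment with the SageMaker Python SDK. The model ID, instance type, and request payload are placeholders; the exact values depend on the specific NIM listing and are documented on its model card in the JumpStart catalog.

```python
# Sketch only: model_id, instance type, and payload shape are assumptions that
# depend on the specific NIM listing in SageMaker JumpStart.
from sagemaker.jumpstart.model import JumpStartModel

model = JumpStartModel(model_id="your-nim-model-id")  # placeholder ID

# Deploy a real-time endpoint; the GPU instance type depends on the model's requirements.
predictor = model.deploy(
    instance_type="ml.g5.2xlarge",
    accept_eula=True,  # some listings require accepting the model EULA
)

# Request format follows the listing's documentation; this mirrors a chat-style schema.
response = predictor.predict({
    "messages": [{"role": "user", "content": "Hello!"}],
    "max_tokens": 128,
})
print(response)

# Clean up the endpoint when finished to avoid ongoing charges.
predictor.delete_endpoint()
```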
This collaboration between AWS and NVIDIA highlights their commitment to advancing generative AI by providing scalable, secure, and high-performance solutions for enterprises.