SANTA CLARA — On March 18, 2024, Google Cloud and NVIDIA announced a significant expansion of their partnership aimed at empowering the machine learning (ML) community with enhanced technology to accelerate the development, scaling, and management of generative AI applications. The move represents a concerted effort to foster more accessible and open AI infrastructure.
Key elements of this partnership include Google’s adoption of NVIDIA’s latest Grace Blackwell AI computing platform and the integration of the NVIDIA DGX Cloud service into Google Cloud infrastructure. Additionally, Google will bring DGX Cloud platforms powered by NVIDIA H100 GPUs to its offerings, providing developers with powerful tools to train and deploy AI models using their preferred frameworks.
Thomas Kurian, CEO of Google Cloud, emphasized the comprehensive nature of the collaboration, spanning from hardware integration to software ecosystems. He highlighted the commitment to providing an accessible and open AI platform for ML developers.
Jensen Huang, founder and CEO of NVIDIA, stressed the importance of offering solutions that enable enterprises to harness generative AI efficiently. He underlined the significance of expanded infrastructure offerings and integrations in providing customers with scalable AI applications.
The integration efforts between NVIDIA and Google Cloud build upon their longstanding commitment to providing leading capabilities across the AI stack. Key components of the expansion include:
- Adoption of NVIDIA Grace Blackwell: This platform enables real-time inference on trillion-parameter large language models (LLMs). Google’s adoption of Grace Blackwell for internal deployments signifies a step toward offering Blackwell-powered instances to cloud customers.
- Grace Blackwell-powered DGX Cloud on Google Cloud: Google will introduce NVIDIA GB200 NVL72 systems to its cloud infrastructure, providing energy-efficient training and inference capabilities for LLMs. The availability of DGX Cloud on Google Cloud A3 VM instances, which are powered by NVIDIA H100 GPUs, further enhances the serverless experience for enterprise developers.
- Support for JAX on GPUs: Collaboration between Google Cloud and NVIDIA facilitates the use of JAX, a high-performance machine learning framework, on NVIDIA H100 GPUs. This widens access to large-scale LLM training within the ML community.
- Integration of NVIDIA NIM on Google Kubernetes Engine (GKE): NIM inference microservices, part of the NVIDIA AI Enterprise software platform, will be integrated into GKE, streamlining generative AI deployment in enterprises.
- Support for NVIDIA NeMo: Google Cloud’s support for the NVIDIA NeMo framework, via GKE and the Google Cloud HPC Toolkit, simplifies the deployment and scaling of generative AI models.
- Expansion of Vertex AI and Dataflow support for NVIDIA GPUs: Vertex AI now supports Google Cloud A3 VMs powered by NVIDIA H100 GPUs, providing MLOps teams with scalable infrastructure for AI applications. Dataflow has expanded support for accelerated data processing on NVIDIA GPUs.
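The JAX point above is about framework portability: the same JAX program compiles via XLA for whichever backend is present, whether an NVIDIA H100 GPU on a Google Cloud A3 VM or a local CPU. The sketch below is illustrative only (the function and array shapes are not from the announcement):

```python
# Minimal illustrative sketch: JAX picks up whatever accelerator backend is
# available (e.g. an NVIDIA GPU on a Google Cloud A3 VM) and falls back to
# CPU otherwise; the code below runs unchanged on either.
import jax
import jax.numpy as jnp

# Report the backend JAX selected ("gpu" on a GPU instance, "cpu" locally).
print(jax.default_backend())

@jax.jit  # XLA-compiles the function for the active backend
def matmul(a, b):
    return jnp.dot(a, b)

a = jnp.ones((2, 2))
b = jnp.ones((2, 2))
result = matmul(a, b)  # [[2., 2.], [2., 2.]]
```

Because the compilation target is chosen at runtime, ML teams can develop on CPU and scale the identical code to H100-backed instances.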
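For the Vertex AI item above, a training job on an H100-powered A3 VM is described by a worker pool specification. The sketch below builds such a spec as a plain dictionary; the machine and accelerator identifiers follow Google Cloud's published naming for A3 VMs, but the project, image, and exact values are assumptions to verify against the current Vertex AI documentation:

```python
# Illustrative worker pool spec for a Vertex AI custom training job on an
# A3 VM with NVIDIA H100 GPUs. Identifiers ("a3-highgpu-8g",
# "NVIDIA_H100_80GB") and the image URI are assumptions for illustration.
worker_pool_spec = {
    "machine_spec": {
        "machine_type": "a3-highgpu-8g",       # A3 VM class with 8x H100
        "accelerator_type": "NVIDIA_H100_80GB",
        "accelerator_count": 8,
    },
    "replica_count": 1,
    "container_spec": {
        # Hypothetical training container; supply your own image.
        "image_uri": "us-docker.pkg.dev/my-project/train/llm:latest",
    },
}

# In practice, this spec would be handed to the Vertex AI SDK, e.g.:
#   from google.cloud import aiplatform
#   job = aiplatform.CustomJob(display_name="h100-training",
#                              worker_pool_specs=[worker_pool_spec])
#   job.run()
```

The spec-as-data shape is what lets MLOps teams scale the same job definition from one replica to many without changing training code.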
The holistic partnership between Google Cloud and NVIDIA enables AI researchers, scientists, and developers to train, fine-tune, and serve sophisticated AI models with optimized tools and frameworks. Testimonials from companies like Runway, Palo Alto Networks, and Writer underscore the tangible benefits of this collaboration in enhancing model performance and lowering hosting costs.
The collaboration between Google Cloud and NVIDIA will be further showcased at GTC, the global AI conference, from March 18 to 21, providing an opportunity for industry professionals to delve deeper into the advancements in AI infrastructure.