NVIDIA, a multinational AI technology company and Nebius Group N.V., an AI cloud company, have formed a broader partnership to invest US$ 2 billion to build large-scale cloud infrastructure aimed at the rapidly growing AI market. The collaboration will extend across several parts of the AI technology stack, including data center architecture, computing hardware, software and operational systems.
Demand for high-performance computing for artificial intelligence has surged in recent years, particularly as newer forms of AI systems require significantly more computing power. The company’s partnership will help accelerate the buildout of Nebius’s AI-focused cloud platform.
According to a press release, the partnership will focus on developing hyperscale cloud systems designed specifically for AI workloads used by startups, developers and enterprises. The investment reflects NVIDIA’s support for Nebius’s engineering approach to building AI infrastructure across hardware, data centers and software.
Nebius has already been deploying NVIDIA infrastructure across its global platform, including several gigawatt-scale AI data centers in the United States. Under the new partnership, the companies aim to expand that capacity significantly. Nebius plans to deploy more than 5 GW of computing capacity by 2030, with NVIDIA supporting early adoption of its newest accelerated computing systems.
Jensen Huang, founder and CEO, NVIDIA, said, “AI is at another inflection point agentic AI, driving incredible compute demand and accelerating infrastructure buildout, Nebius is building an AI cloud designed for the agentic era, fully integrated from silicon to software and powered by NVIDIA’s next-generation accelerated compute. Together, we are scaling the cloud to meet the surging global demand for intelligence.”
Arkady Volozh, CEO, Nebius, said, “Nebius has been built for AI since day one not adapted from a general-purpose cloud, but designed for what developers actually need, now with NVIDIA, we are extending that throughout the stack from gigawatt-scale AI factories to inference and software as we build one of the first and largest clouds for all AI builders everywhere.”
Nebius plans to deploy multiple generations of NVIDIA infrastructure across its platform, including early adoption of technologies such as the NVIDIA Rubin platform, NVIDIA Vera CPUs and NVIDIA BlueField storage systems.
The companies will also work together on AI inference and so-called agentic AI systems software designed to allow AI models to act more autonomously. The goal is to build an inference and AI software stack for developers and enterprises using NVIDIA’s software frameworks, models and libraries.
In addition, NVIDIA will provide design resources and technical support for building large-scale AI data centers often described as “AI factories” including system design guidance, hardware samples, system software and engineering support. The partnership will also include operational tools aimed at managing large fleets of GPUs. Nebius plans to use NVIDIA monitoring systems and software recommendations to track hardware health and optimize performance across its infrastructure.

