SoftBank to launch AI data center GPU cloud in Japan

May 27, 2026 at 1:18 PM GMT+8

SoftBank Corp. has announced that it will launch an AI Data Center GPU Cloud, as part of its neocloud business in October 2026. Through this initiative, SoftBank aims to provide integrated AI computing infrastructure, and software, that can be securely used within Japan.

In a press release, SoftBank said that this will allow their customers to leverage advanced GPU-accelerated AI computing infrastructure, including NVIDIA GB200 NVL72 deployed in SoftBank’s Japan-based data centers, and execute a wide range of AI workloads, from model training and inference to data processing, while ensuring secure data management and operations within Japan.

Ahead of the launch, SoftBank began offering a beta version, and started using the service internally across its group companies.

“AI Data Center GPU Cloud” is a cloud service that combines SoftBank’s AI computing infrastructure with “Infrinia AI Cloud OS,” an AI data center software stack that provides Kubernetes as a Service (KaaS) for multi-tenant environments and Inference as a Service (Inf-aaS) for Large Language Model inference via APIs.

Junichi Miyakawa, President & CEO, SoftBank Corp., said, “As AI becomes more deeply integrated into society, the source of competitiveness is expanding beyond AI itself to include the computing power and operational software that support it. Under our new growth strategy, ‘Activate AI for Society,’ SoftBank will provide integrated computing infrastructure and software that can be securely used within Japan as a neocloud provider. ‘Infrinia AI Cloud OS’ and ‘AI Data Center GPU Cloud’ will serve as core services in this initiative, strongly supporting customers’ AI development and real-world deployment.”

Charlie Boyle, Vice President, DGX systems at NVIDIA, said, “The transformation of telecommunications into an AI-native architecture requires a new foundation of AI infrastructure capable of handling the most complex sovereign AI workloads. SoftBank’s deployment of the NVIDIA GB200 NVL72 and ‘Infrinia AI Cloud OS’ gives Japanese enterprises a high-performance, secure, and scalable platform to accelerate their industries.”

In addition, the service provides centralized and automated management of GPU resources, Kubernetes-based operations, and AI workload execution, enabling optimized processing for each workload. The company aims to reduce the effort required to set up development environments and manage compute resources, lowering operational burdens and costs while providing a stable platform that can flexibly adapt to evolving requirements.

Going forward, based on its “Telco AI Cloud” initiative to build next-generation social infrastructure for the AI era by leveraging its telecommunications foundation, SoftBank aims to optimize AI processing from training to inference by integrating “AI Data Center GPU Cloud” with AI-RAN, while building a sovereign, distributed AI infrastructure that delivers low latency and high reliability.