Developers access regional GPU compute for on-demand or long-term use, supporting strategic AI needs.
NVIDIA introduced DGX Cloud Lepton, an AI platform and compute marketplace that connects developers of agentic and physical AI applications with GPUs available through a global network of cloud providers.
To support growing AI needs, NVIDIA Cloud Partners (NCPs) such as CoreWeave, Crusoe, Firmus, Foxconn, GMI Cloud, Lambda, Nebius, Nscale, SoftBank Corp., and Yotta Data Services will provide NVIDIA Blackwell and other NVIDIA architecture GPUs through the DGX Cloud Lepton marketplace.
Developers can tap into GPU compute capacity in specific regions for on-demand and long-term computing, supporting strategic and sovereign AI operational requirements. Additional cloud service providers and GPU marketplaces are expected to join the DGX Cloud Lepton marketplace.
DGX Cloud Lepton helps address the critical challenge of securing reliable, high-performance GPU resources by unifying access to cloud AI services and GPU capacity across the NVIDIA compute ecosystem. The platform integrates with the NVIDIA software stack, including NVIDIA NIM and NeMo microservices, NVIDIA Blueprints and NVIDIA Cloud Functions, to accelerate and simplify the development and deployment of AI applications.
For cloud providers, DGX Cloud Lepton provides management software that delivers real-time GPU health diagnostics and automates root-cause analysis, eliminating manual operations and reducing downtime.
Key benefits of the platform include:
- Improved productivity and flexibility: Offers a unified experience across development, training and inference, helping boost productivity. Developers can purchase GPU capacity directly from participating cloud providers through the marketplace or bring their own compute clusters, giving them greater flexibility and control.
- Frictionless deployment: Enables deployment of AI applications across multi-cloud and hybrid environments with minimal operational burden, using integrated services for inference, testing and training workloads.
- Agility and sovereignty: Gives developers quick access to GPU resources in specific regions, enabling compliance with data sovereignty regulations and meeting low-latency requirements for sensitive workloads.
- Predictable performance: Provides participating cloud providers with enterprise-grade performance, reliability and security, ensuring a consistent user experience.
A new bar for AI cloud performance
NVIDIA also announced NVIDIA Exemplar Clouds to help NCPs enhance security, usability, performance and resiliency, using NVIDIA’s expertise, reference hardware and software, and operational tools.
NVIDIA Exemplar Clouds tap into NVIDIA DGX Cloud Benchmarking, a comprehensive suite of tools and recipes for optimizing workload performance on AI platforms and quantifying the relationship between cost and performance.
Yotta Data Services is the first NCP in the Asia-Pacific region to join the NVIDIA Exemplar Cloud initiative.
For more information, visit nvidia.com.