
Global cloud and AI service providers are under mounting pressure to redesign their infrastructure as the scale and complexity of artificial intelligence workloads accelerate. Training trillion-parameter foundation models, serving inference at massive throughput, and supporting disaggregated, multi-tenant architectures are stretching conventional data center designs beyond their limits.
Against this backdrop, NVIDIA has unveiled BlueField Astra, a new secure control architecture built on its BlueField-4 data processing unit and tightly integrated with the Vera Rubin NVL72 AI platform, positioning it as a foundational technology for the next phase of large-scale AI infrastructure.
The shift toward bare-metal computing has become increasingly pronounced as organizations seek to extract maximum performance from GPU-accelerated systems.
Bare-Metal AI Clusters
Unlike virtualized environments, bare-metal AI clusters require strict isolation between tenants and a trusted mechanism to control access to networking and storage resources. These requirements are complicated by the fact that AI data centers operate across two fundamentally different networking domains: north–south traffic connecting users and applications to AI clusters, and east–west traffic linking GPUs within the compute fabric itself.
While service providers have already deployed BlueField DPUs to manage and secure north–south traffic, the east–west domain has historically prioritized performance over control, creating a gap in consistent security and governance.
BlueField Astra is designed to close that gap. Announced alongside NVIDIA’s Rubin platform roadmap, Astra introduces a system-level architecture that extends trusted control into the AI compute fabric without sacrificing throughput. At the heart of the design is a direct, dedicated connection between the BlueField-4 DPU and NVIDIA ConnectX-9 SuperNICs inside the Vera Rubin NVL72 compute tray. This arrangement allows the DPU to manage all network input and output for a compute node, unifying policy enforcement across both front-end and backend networks.
Cloud Service Providers
A central innovation of Astra is its isolation of the SuperNIC control plane from the host operating system. In traditional designs, host-based software configures both network adapters and fabric connectivity, creating opportunities for misconfiguration or interference by tenant workloads. Under Astra, all provisioning and policy instructions are delivered out-of-band via the DPU’s embedded Arm cores, ensuring that tenants – even those running bare metal – cannot view or alter network settings. Data movement remains fast and direct through the SuperNICs, but management functions are entirely removed from the tenant trust boundary.
For cloud service providers, this architecture promises a combination of stronger isolation, operational consistency, and reduced complexity. The same DPU that already governs north–south traffic can now apply identical security postures and workflows to the east–west fabric. Policies are enforced directly in SuperNIC hardware, limiting lateral movement and configuration drift while preserving the low latency and congestion control required by dense GPU clusters.
BlueField Astra also builds on NVIDIA’s DOCA software platform, enabling infrastructure services to run directly on the DPU rather than the host CPU. Networking, security, storage, and lifecycle management functions can be deployed as DOCA microservices anchored to BlueField-4, allowing existing operational practices to extend naturally into bare-metal AI environments. This approach supports tenant-aware networking, infrastructure-level telemetry, secure storage offload, and centralized device management without introducing new dependencies on host software.
Stronger Compliance for Regulated Industries
The implications extend beyond performance and security. By keeping policies and configurations on the DPU, service providers gain clearer audit trails and a compliance posture better aligned with regulated industries such as finance, healthcare, and government.
As AI infrastructure becomes critical to business operations, the ability to demonstrate consistent control and isolation across massive, multi-tenant systems is increasingly viewed as a competitive differentiator rather than an optional feature.
As the industry moves toward ever-larger AI deployments, BlueField Astra would signal a broader architectural shift. Instead of treating security as an add-on layered atop high-performance fabrics, NVIDIA is embedding trust and governance directly into the operating fabric of AI systems. For providers aiming to deliver bare-metal GPU performance at scale while maintaining cloud-grade security, Astra represents a new blueprint for how AI infrastructure can be built, managed, and secured.
Executive Insights FAQ
What problem is BlueField Astra designed to solve?
It addresses the lack of consistent security and control across both front-end and backend networks in large-scale, bare-metal AI environments.
How does Astra differ from traditional host-based control models?
It isolates the network control plane from the host OS, placing all provisioning and policy enforcement on the DPU.
Why is this important for multi-tenant AI clusters?
It prevents tenants from accessing or modifying network configurations, reducing risk and improving isolation.
Does Astra impact AI performance?
The architecture preserves high-throughput, low-latency data movement by enforcing policies directly in SuperNIC hardware.
Who stands to benefit most from this approach?
Cloud service providers and enterprises deploying large, regulated, or multi-tenant AI infrastructure at scale.


