NVIDIA unveils Vera Rubin at GTC 2026, the brain behind the next era of AI

NVIDIA has unveiled its next AI platform, Vera Rubin, designed for the next phase of artificial intelligence: reasoning, planning, and automation.

Jak Connor

Tech and Science Editor

Published Mar 17, 2026 1:44 AM CDT

TL;DR: NVIDIA's Vera Rubin AI platform, unveiled by CEO Jensen Huang, integrates seven chips and six racks to optimize AI inference and agent-based workloads. Featuring the Vera CPU and Rubin GPU with 288GB HBM4 memory and 50 PFLOPS performance, it aims to reduce inference costs by up to 10x and enhance autonomous AI capabilities.

NVIDIA CEO Jensen Huang has officially unveiled the company's next-generation AI platform, which includes seven chips and six individual racks.

Vera Rubin was unveiled at GTC 2026, where Huang broke down the components that make up NVIDIA's new flagship AI platform. NVIDIA has positioned Vera Rubin as the foundation of a new era of AI infrastructure, with the platform designed specifically around AI inference and agent-based workloads. At its core, Vera Rubin is a full-stack AI supercomputing platform that combines a set of tightly integrated components, including the Vera CPU, Rubin GPU, NVLink interconnect, networking, and data processing units.

Huang explained during the GTC keynote that AI is moving toward continuous generation of responses, decision-making, and action-taking. The NVIDIA CEO added that Vera Rubin has been designed specifically to support this new direction of AI inference. Notably, NVIDIA says the platform is built to power AI systems that reason, plan, and act autonomously. As for performance, NVIDIA has said Vera Rubin will cut inference token costs by up to 10x and reduce the number of GPUs required for complex models.

Furthermore, each NVIDIA Rubin GPU features 288GB of HBM4 memory, providing up to 22 TB/s of total bandwidth, along with 50 PFLOPS of NVFP4 compute performance. At the transistor level, each NVIDIA Rubin GPU has 336 billion transistors, with an additional 2.5 trillion transistors in HBM4 memory. As for the CPU, the Vera CPU is NVIDIA's first fully custom, next-generation Arm-based CPU designed solely for AI data centers. Its purpose is to keep the massive GPU cluster running at maximum efficiency 24/7.
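To put the quoted Rubin GPU figures in perspective, here is a rough back-of-the-envelope sketch in Python. It uses only the numbers above and assumes decimal units (1 GB = 10^9 bytes, 1 TB/s = 10^12 bytes per second) and peak rates, which real workloads rarely sustain. The function names are illustrative and do not come from NVIDIA.

```python
# Figures quoted in the article for a single Rubin GPU.
HBM4_CAPACITY_GB = 288      # GB of HBM4 memory
HBM4_BANDWIDTH_TBPS = 22    # TB/s, quoted as "up to"
NVFP4_PFLOPS = 50           # PFLOPS of NVFP4 compute


def full_memory_sweep_ms(capacity_gb: float, bandwidth_tbps: float) -> float:
    """Milliseconds needed to read the whole memory once at peak bandwidth.

    Assumes decimal units: 1 GB = 1e9 bytes, 1 TB/s = 1e12 bytes/s.
    """
    seconds = (capacity_gb * 1e9) / (bandwidth_tbps * 1e12)
    return seconds * 1e3


def flops_per_byte(pflops: float, bandwidth_tbps: float) -> float:
    """Peak operations available for every byte moved from memory."""
    return (pflops * 1e15) / (bandwidth_tbps * 1e12)


sweep_ms = full_memory_sweep_ms(HBM4_CAPACITY_GB, HBM4_BANDWIDTH_TBPS)
ratio = flops_per_byte(NVFP4_PFLOPS, HBM4_BANDWIDTH_TBPS)

print(f"Full memory sweep: {sweep_ms:.1f} ms")           # Full memory sweep: 13.1 ms
print(f"Peak compute per byte: {ratio:.0f} FLOP/byte")   # Peak compute per byte: 2273 FLOP/byte
```

On these assumptions, a single GPU could in theory stream its entire 288GB of memory in roughly 13 milliseconds, and it has about 2,300 NVFP4 operations available for each byte it reads.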
