NVIDIA Rubin CPX GPU to feature 128GB GDDR7 memory, launches end of 2026

NVIDIA's new Rubin CPX GPU will pack 128GB of GDDR7 memory. Built for long-context AI inferencing and AI agent workloads, it will launch at the end of 2026.

Gaming Editor
TL;DR: NVIDIA's Rubin CPX GPU, launching in late 2026, delivers 30 PetaFLOPS of NVFP4 compute with 128GB of GDDR7 memory, and is optimized for massive-context AI models and long-format video processing. The Vera Rubin NVL144 CPX platform that integrates it offers 8 exaflops of AI performance, 100TB of fast memory, and 1.7PB/sec of memory bandwidth for next-gen AI inference.

NVIDIA has announced its upcoming Rubin CPX GPU, a new specialized accelerator from the next-gen Rubin family of AI chips, made specifically for massive-context AI models, sporting a huge 128GB of GDDR7 memory.


The new Rubin CPX GPU delivers 30 PetaFLOPS of NVFP4 compute performance on a single monolithic die. That marks a shift away from the dual-GPU packages NVIDIA uses on its current Blackwell and Blackwell Ultra AI GPUs, and from the design path that the rest of the Rubin family will follow.

Rubin CPX works hand in hand with NVIDIA Vera CPUs and Rubin GPUs inside the new NVIDIA Vera Rubin NVL144 CPX platform. The integrated NVIDIA MGX system features 8 exaflops of AI compute, which provides 7.5x more AI performance than the NVIDIA GB300 NVL72 AI system, along with 100TB of fast memory and 1.7PB/sec of memory bandwidth in a single rack.
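For a rough sense of scale, the rack-level figures above can be turned into two back-of-the-envelope numbers. The sketch below is illustrative arithmetic only, not NVIDIA's methodology. It assumes "7.5x more" means a simple 7.5:1 ratio at comparable precision, and that the 1.7PB/sec figure is an aggregate that could in principle be applied across all 100TB at once, which real workloads will not achieve. The function names are made up for this example.

```python
# Figures quoted in the article for one Vera Rubin NVL144 CPX rack.
RACK_AI_EXAFLOPS = 8.0       # "8 exaflops of AI compute"
SPEEDUP_VS_GB300 = 7.5       # "7.5x more AI performance" than GB300 NVL72
RACK_MEMORY_TB = 100.0       # "100TB of fast memory"
RACK_BANDWIDTH_PB_S = 1.7    # "1.7PB/sec of memory bandwidth"


def implied_baseline_exaflops(rack_exaflops: float, speedup: float) -> float:
    """GB300 NVL72 figure implied by the quoted ratio (assumption: plain ratio)."""
    return rack_exaflops / speedup


def seconds_to_sweep_memory(memory_tb: float, bandwidth_pb_s: float) -> float:
    """Idealized time to read all rack memory once at the aggregate bandwidth."""
    bandwidth_tb_s = bandwidth_pb_s * 1000.0  # 1 PB = 1000 TB
    return memory_tb / bandwidth_tb_s


if __name__ == "__main__":
    baseline = implied_baseline_exaflops(RACK_AI_EXAFLOPS, SPEEDUP_VS_GB300)
    sweep = seconds_to_sweep_memory(RACK_MEMORY_TB, RACK_BANDWIDTH_PB_S)
    print(f"Implied GB300 NVL72 baseline: {baseline:.2f} exaflops")
    print(f"Idealized full-memory sweep: {sweep * 1000:.0f} ms")
```

Under those assumptions, the quoted ratio implies a GB300 NVL72 baseline of a little over 1 exaflop, and a single pass over all rack memory would take on the order of tens of milliseconds.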

Jensen Huang, founder and CEO of NVIDIA, said: "The Vera Rubin platform will mark another leap in the frontier of AI computing - introducing both the next-generation Rubin GPU and a new category of processors called CPX. Just as RTX revolutionized graphics and physical AI, Rubin CPX is the first CUDA GPU purpose-built for massive-context AI, where models reason across millions of tokens of knowledge at once".

NVIDIA's new Rubin CPX enables the highest performance and token revenue for long-context processing -- far beyond what today's systems were designed to handle. This transforms AI coding assistants from simple code-generation tools into sophisticated systems that are capable of comprehending and optimizing large-scale software projects.

The company explains that, to process video, AI models can "take up to 1 million tokens for an hour of content, pushing the limits of traditional GPU compute. Rubin CPX integrates video decoder and encoders, as well as long-context inference processing, in a single chip for unprecedented capabilities in long-format applications such as video search and high-quality generative video".
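To put the "1 million tokens for an hour of content" figure in context, here is a minimal sketch that scales it to other clip lengths. It treats the quoted number as an upper bound that grows linearly with duration, which is an assumption for illustration; actual token counts depend on the model and how it tokenizes video. The helper names are hypothetical.

```python
# Upper bound quoted by NVIDIA: up to 1 million tokens per hour of video.
MAX_TOKENS_PER_HOUR = 1_000_000


def max_tokens_for_video(minutes: float,
                         tokens_per_hour: int = MAX_TOKENS_PER_HOUR) -> int:
    """Upper-bound token count for a clip, assuming linear scaling with length."""
    return round(minutes * tokens_per_hour / 60)


def max_tokens_per_second(tokens_per_hour: int = MAX_TOKENS_PER_HOUR) -> float:
    """Upper-bound tokens generated per second of video content."""
    return tokens_per_hour / 3600


if __name__ == "__main__":
    for minutes in (10, 60, 90):
        print(f"{minutes:>3} min of video: up to {max_tokens_for_video(minutes):,} tokens")
    print(f"Roughly {max_tokens_per_second():.0f} tokens per second of content")
```

On this linear assumption, a 90-minute video could need a context of up to 1.5 million tokens, which is the kind of workload the article describes as massive-context.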

NVIDIA adds: "Built on the NVIDIA Rubin architecture, the Rubin CPX GPU uses a cost‑efficient, monolithic die design packed with powerful NVFP4 computing resources and is optimized to deliver extremely high performance and energy efficiency for AI inference tasks".

NVIDIA's new Rubin CPX is expected to be available at the end of 2026.



Anthony joined TweakTown in 2010 and has since reviewed hundreds of tech products. Anthony is a long-time PC enthusiast with a passionate hatred for games built around consoles. An FPS gamer since the pre-Quake days, when you were insulted if you used a mouse to aim, he has been addicted to gaming and hardware ever since. Working in IT retail for 10 years gave him great experience with custom-built PCs. His addiction to GPU tech is unwavering, and he has recently taken a keen interest in artificial intelligence (AI) hardware.
