NVIDIA Ampere rumor: NVCache speeds up load times, optimize VRAM usage

NVCache will act like HBCC, using some system RAM and SSD to speed up load times and optimize VRAM usage.

Gaming Editor

Published May 9, 2020 11:21 PM CDT
Updated Nov 3, 2020 11:43 AM CST

4-minute read time

Voice: DefaultSpeed

0:00 / --:--

We are beginning to learn much more about NVIDIA's next-gen Ampere GPU architecture, with some super-juicy rumors that -- as always -- require a considerable amount of salt.

The latest on Ampere is from YouTube channel Moore's Law is Dead, with Tom explaining that has has some "exclusive indsider info" on Ampere. What is talked about in the video above is that Ampere is not just a die shrink (down from 14nm to 7nm) with additional RT cores -- but instead it is a full evolution of the Turing architecture.

One of the new nuggets of information is that NVIDIA will introduce something called NVCache with Ampere, which will act similar to AMD's own technology that debuted with Vega in HBCC (High Bandwidth Cache Controller). Like HBCC, NVCache will use some of the system RAM and SSD of your system, and super-speed game load times and optimize VRAM usage.

Read more: GeForce RTX 2080 Ti prototype appears with 'GeForce GTX' branding and beefier specs

NVIDIA Ampere rumor: NVCache speeds up load times, optimize VRAM usage 01

VIEW GALLERY - 4 IMAGES

NVIDIA Ampere rumor: NVCache speeds up load times, optimize VRAM usage 02

The same sources have said something similar to what I reported not too long ago: that the next-gen GeForce RTX 3060 as a "mid-range" card will have the performance of the now-flagship GeForce RTX 2080 Ti. Except, Tom reports that it is in ray tracing -- which seems to make sense. We don't know about the actual performance of this card obviously, which is where the salt is required.

We should hear all about Ampere (or at least as much as NVIDIA will share) during its GTC 2020 keynote in a few days. We should be introduced to the inner workings of the new Ampere architecture, with the GeForce RTX 3000 series cards to be unveiled somewhere around September.

(New) NVIDIA Ampere GA100 Specs

8192 CUDA cores @ 1750MHz boost
1024 Tensor Cores
256 RT Cores
Unknown amount of GDDR6 @ 16Gbps
Unknown TDP
7nm

NVIDIA Ampere GPUs

GA102 - 84 SMs / 5376 CUDA cores / 12GB GDDR6 / 384-bit bus - 40% faster than RTX 2080 Ti
GA103 - 60 SMs / 3840 CUDA cores / 10GB GDDR6 / 320-bit bus - 10% faster than RTX 2080 Ti
GA104 - 48 SMs / 3072 CUDA cores / 8GB GDDR6 / 256-bit bus - 5% slower than RTX 2080 Ti

Older NVIDIA Ampere GA100 Specs

8192 CUDA cores @ 2GHz (2.2GHz boost)
1024 Tensor Cores
130 RT Cores
48GB of HBM2e memory @ 1.2GHz
300W TDP
TSMC 7nm+
36 TFLOPs peak output

I've already written about rumors that NVIDIA's next-gen Ampere GPU architecture would be up to 75% faster than current-gen GPUs such as the Turing architecture, right after rumors that Ampere would offer 50% more performance at half the power of Turing. This is pretty crazy stuff right there.

Not only that, but we've got some rumored specs on the purported GeForce RTX 3080 and GeForce RTX 3070 graphics cards, which will both be powered by NVIDIA's new Ampere GPU architecture.

We've already heard that Ampere would offer 50% more performance at half the power of Turing, which sent the hairs on my neck standing up. Better yet, you can read about the leaked specs on the purported Ampere-based GeForce RTX 3080 and GeForce RTX 3070 right here.

NVIDIA Ampere rumor: NVCache speeds up load times, optimize VRAM usage 05

GA103 (GeForce RTX 3080)

10/20GB GDDR6
320-bit memory interface
60 SMs
3480 CUDA cores

GA104 (GeForce RTX 3070)

8/16GB GDDR6
256-bit memory interface
48 SMs
3072 CUDA cores

AMD also just revealed its next-gen RDNA 2 / Navi 2X plans, something we referred to as Big Navi until just recently. Those specs are monstrous, but they pale in comparison to what NVIDIA has planned for its flagship HPC destined Ampere GA100 beast.