Stable Diffusion 3.5 VRAM requirement reduced by 40% to run on more GeForce RTX GPUs

Stable Diffusion 3.5 is out now and with the large model requiring 18GB of VRAM, NVIDIA has collaborated by Stability AI to dramatically reduce this.

Stable Diffusion 3.5 VRAM requirement reduced by 40% to run on more GeForce RTX GPUs
Comment IconFacebook IconX IconReddit Icon
Senior Editor
Published
1 minute & 15 seconds read time
TL;DR: By 2025, over 8GB of VRAM will be essential for high-end 1440p and 4K gaming and local AI workloads. NVIDIA and Stability AI optimized Stable Diffusion 3.5 with FP8 quantization and TensorRT, reducing VRAM needs by 40% and boosting performance, enabling broader GeForce RTX 50 Series GPU support.

The VRAM capacity debate is currently being discussed in the PC gaming space. The consensus is that in 2025, you will need more than 8GB of VRAM for high-end 1440p and 4K gaming. VRAM is also a key and crucial component for running local AI, and the demand for more memory is growing alongside the arrival of more complex models.

Stable Diffusion 3.5 VRAM requirement reduced by 40% to run on more GeForce RTX GPUs 2

The powerful Stable Diffusion 3.5 large language model for creating images from text models uses 18GB of VRAM. This limits the use of the GeForce RTX 50 Series to the flagship GeForce RTX 5090. Well, not anymore, as NVIDIA has collaborated with Stability AI to quantify the model to FP8, reducing the VRAM requirement by 40% to 11GB.

Alongside optimizations with TensorRT to double performance, this now means that five GeForce RTX 50 Series GPUs (the RTX 5060 Ti 16GB, RTX 5070, RTX 5070 Ti, RTX 5080, and RTX 5090) can run the model locally.

According to NVIDIA, the TensorRT optimizations for Stable Diffusion 3.5 allow the model to fully take advantage of the Tensor Cores inside GeForce RTX hardware. The FP8 TensorRT version delivers a 2.3X performance boost to running the model in BF16 PyTorch while utilizing 40% less memory.

The new optimized models are available via Stability AI's Hugging Face portal. In July, NVIDIA and Stability AI plan to release the model as an NVIDIA NIM microservice for creators looking to deploy it in a wide range of apps.

Clever optimizations like this for new large AI models are fantastic to see. With the GeForce RTX 50 Series supporting FP4, we'll probably see more of these types of updates that will allow powerful generative AI tools to run locally on more GPUs, especially with NVIDIA announcing that there are now over 100 million RTX AI PCs worldwide.

Photo of the MSI RTX 5070 Ti Graphics Card
Best Deals: MSI RTX 5070 Ti Graphics Card
Today7 days ago30 days ago
$999.99 USD$999.99 USD
$999.99 USD$999.99 USD
--
$1399.99 CAD$1469.99 CAD
£949-
$999.99 USD$999.99 USD
$1899$1899
* Prices last scanned 5/1/2026 at 4:22 am CDT - prices may be inaccurate. As an Amazon Associate, we earn from qualifying purchases. We earn affiliate commission from any Newegg or PCCG sales.
News Source:blogs.nvidia.com

Senior Editor

Email IconX IconLinkedIn Icon

Kosta is a veteran gaming journalist that cut his teeth on well-respected Aussie publications like PC PowerPlay and HYPER back when articles were printed on paper. A lifelong gamer since the 8-bit Nintendo era, it was the CD-ROM-powered 90s that cemented his love for all things games and technology. From point-and-click adventure games to RTS games with full-motion video cut-scenes and FPS titles referred to as Doom clones. Genres he still loves to this day. Kosta is also a musician, releasing dreamy electronic jams under the name Kbit.

Follow TweakTown on Google News
Newsletter Subscription