RTX 5090 and RTX PRO 6000 GPU have a new bug: need a full system reboot after virtualization

NVIDIA's higher-end GeForce RTX 5090 and RTX PRO 6000 Blackwell GPUs become unresponsive after a few days of VM use, need a full system reboot.

VIEW GALLERY - 2

Anthony Garreffa

Gaming Editor

Published Sep 8, 2025 4:19 PM CDT

1 minute & 15 seconds read time

TL;DR: NVIDIA's GeForce RTX 5090 and RTX PRO 6000 GPUs face a critical virtualization bug causing system crashes and unresponsiveness after days of VM use, requiring full reboots. NVIDIA acknowledges the issue, affecting AI workloads, and is actively developing a fix while offering a $1000 bug bounty for solutions.

NVIDIA's higher-end GeForce RTX 5090 and RTX PRO 6000 cards have hit a new bug after running virtualization for a few days, which requires a full system reboot to get them back online again.

RTX 5090 and RTX PRO 6000 GPU have a new bug: need a full system reboot after virtualization 27

VIEW GALLERY - 2 IMAGES

CloudRift is a GPU cloud for developers, reporting crashing issues with both the RTX 5090 and RTX PRO 6000 cards saying that after a "few days" of VM usage, the cards were completely unresponsive. The GPUs can no longer be accessed unless the node system is rebooted, but thankfully it's only happening to the RTX 5090 and RTX PRO 6000, as the RTX 4090, Hopper H100, and Blackwell B200 aren't affected, for now.

What's happening exactly? The GPU gets assigned to a VM environment using the device driver VFIO, and after the Functional Level Reset (FLR), the GPU is completely unresponsive. After the GPU becomes unresponsive, it results in a kernel "soft lock" which puts the host and client environments under a deadlock. In order to get out of that deadlock the machine has to be rebooted, which isn't an easy thing to do for CloudRift, as they have a big volume of guest machines.

Read more: This $10,000 NVIDIA RTX PRO 6000 turns into a brick over an irreplaceable broken PCIe connector

CloudRift isn't the only company affected either, with a user at Proxmox reporting something similar, where he witnessed a complete host crash after shutting down a Windows client. He said that NVIDIA has responded to the problem, claiming that the company has been able to reproduce the issue, and is currently working on a fix for the issue.

Better yet, CloudRift has a $1000 bug bounty for those who can fix, or mitigate the issue, but NVIDIA is at work on that fix which shouldn't be too far away, especially when it's having a negative effect on AI workloads.

	Today	7 days ago	30 days ago
	$3440 USD	$3249.95 USD	$3449.99 USD	Buy
	$3799 USD	$3845 USD	$3765 USD	Buy
	$5999 CAD	$5999 CAD	$5580 CAD	Buy
	$5998.99 CAD	$5998.99 CAD	-	Buy
	$3440 USD	$3249.95 USD	$3449.99 USD	Buy
	$3440 USD	$3249.95 USD	$3449.99 USD	Buy
	Check Price	Check Price	Check Price	Buy
	Check Price	Check Price	Check Price	Buy
	Check Price	Check Price	Check Price	Buy
* Prices last scanned 4/29/2026 at 11:32 am CDT - prices may be inaccurate. As an Amazon Associate, we earn from qualifying purchases. We earn affiliate commission from any Newegg or PCCG sales.

Today

7 days ago

30 days ago

* Prices last scanned 4/29/2026 at 11:32 am CDT - prices may be inaccurate. As an Amazon Associate, we earn from qualifying purchases. We earn affiliate commission from any Newegg or PCCG sales.