NVIDIA's new B30 AI GPU for China expected to have significant demand, 75% as fast as the H20

NVIDIA's upcoming China-specific B30 AI GPU has estimated 75% of the performance of H20, with demand for B30 'significant' and orders already placed.

NVIDIA's new B30 AI GPU for China expected to have significant demand, 75% as fast as the H20
Comment IconFacebook IconX IconReddit Icon
Gaming Editor
Published
3 minutes & 15 seconds read time
TL;DR: NVIDIA's China-specific B30 AI GPU delivers about 75% of H20's performance, with strong demand from Chinese tech firms ordering hundreds of thousands of units. Optimized for small to medium AI models and cloud services, the B30 offers cost-effective, energy-efficient inference and seamless CUDA-X compatibility for easy framework migration.

NVIDIA's new China-specific B30 AI GPU has performance of around 75% of the H20 AI GPU, while demand for the new B30 is "significant" according to the latest reports.

In a new post on X by insider @Jukanrosleve, we're hearing from China's major internet companies that estimates that the performance of NVIDIA's new B30 AI GPU is "approximately 75% that of the H20". Chinese tech companies have reportedly placed orders for hundreds of thousands of units -- orders of over $1 billion -- in late-June, with deliveries expected in August.

Another large Chinese tech company reportedly plans to increase its Q3 2025 capital expenditure and intends to order 300,000 orders of NVIDIA's new B30 AI GPU, with a delivery schedule for September.

NVIDIA's new B30 AI GPU is expected to address two major pain points for China, which will see the new AI chip becoming the preferred solution for inference in small and medium-sized models, "fully aligning with the arrival of the inference era". Secondly, the B30 will act as a low-cost computing power pool for cloud services.

NVIDIA's upcoming China-specific B30 AI GPU has its own disadvantages in single-card energy efficiency, but that's mitigated in scenarios with low memory bandwidth requirements, including intelligent customer service, text generation, and image recognition.

For instance, when processing a 4096-long text, the H20's throughput reaches 961 tokens/s, while the new B30 is only capable of 60% of that, but when expanded into a larger 8-card AI GPU cluster, the new B30 can increase the effective bandwidth to 1.2TB/sec through dynamic compression technology, meeting medium concurrency demands.

The post from Jukan continues, saying that its deep compatibility with the CUDA-X ecosystem which allows enterprises to seamlessly migrate frameworks like PyTorch, which saves technical reconstruction costs.

Additionally, when it comes to the B30's role in a low-cost computing power pool for cloud services, NVIDIA's upcoming B30's cluster solution is highly cost-effective for small and medium-sized enterprises and academic institutions. In tests run by a major company, it shows that a computing power pool constructed with 100 of the new B30 AI GPUs can support lightweight training of billion-parameter models while reducing procurement costs by 40% and unit power consumption by close to 30% compared to H20.

Lastly, domestic-made AI chips from the likes of Huawei, might slightly pass the B30 in single-card FP16 computing power with around 200 TFLOPS of power, B30 maintains an advantage in mainstream model deployment efficiency due to the 'stickiness' of the CUDA ecosystem.

Photo of the AMD Ryzen 7 9800X3D
Best Deals: AMD Ryzen 7 9800X3D
Today7 days ago30 days ago
$419.95 USD-
$464 USD$466.99 USD
$629.98 CAD-
$629.99 CAD$629.99 CAD
$419.95 USD-
$419.95 USD-
$719$725
* Prices last scanned 3/31/2026 at 11:09 pm CDT - prices may be inaccurate. As an Amazon Associate, we earn from qualifying purchases. We earn affiliate commission from any Newegg or PCCG sales.

Gaming Editor

Email IconX IconLinkedIn Icon

Anthony joined TweakTown in 2010 and has since reviewed 100s of tech products. Anthony is a long time PC enthusiast with a passion of hate for games built around consoles. FPS gaming since the pre-Quake days, where you were insulted if you used a mouse to aim, he has been addicted to gaming and hardware ever since. Working in IT retail for 10 years gave him great experience with custom-built PCs. His addiction to GPU tech is unwavering and has recently taken a keen interest in artificial intelligence (AI) hardware.

Follow TweakTown on Google News
Newsletter Subscription