Nvidia claims software and hardware upgrades allow Blackwell Ultra GB300 to dominate MLPerf benchmarks — touts 45% DeepSeek R1 inference throughput increase over GB200

Nvidia has broken its own records in MLPerf benchmarks using its latest-generation Blackwell Ultra GB300 NVL72 rack-scale system, delivering what it claims is a 45% increase in inference performance over the Blackwell-based GB200 platform in DeepSeek R1 tests. By combining hardware improvements with software optimizations, Nvidia claims the top spot across a range of models, and suggests this should be a primary consideration for developers building out “AI factories,” since higher inference throughput can translate into greater revenue potential.

Nvidia’s Blackwell architecture is at the heart of its latest-generation RTX 50-series graphics cards, which offer the best performance for gaming, even if AMD’s RX 9000-series arguably offers better bang for buck. But it’s also what’s under the hood of the company’s big AI-focused GPU systems like the GB200 platform, which is being built into data centers all over the world to power next-generation AI applications. Blackwell Ultra, designated GB300, is the enhanced version of that platform with even more performance, and Nvidia has now backed it up with some impressive MLPerf records.

The latest version of the MLPerf benchmark suite includes inference performance testing using the DeepSeek R1, Llama 3.1 405B, Llama 3.1 8B, and Whisper models, and the GB300 NVL72 stole the show in all of them. Nvidia claims a 45% increase in performance over GB200 when running the DeepSeek R1 model, and up to five times the performance of older Hopper GPUs, although Nvidia does note those comparative results came from unverified third parties.
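For context, comparisons like these boil down to measured inference throughput, typically reported as tokens generated per second, and a relative gain like the quoted 45% is simply the ratio between two such measurements. The short Python sketch below illustrates that arithmetic; the baseline and new throughput figures in it are hypothetical placeholders, not Nvidia's published MLPerf numbers.

```python
# Illustrative only: how a relative inference-throughput gain (like the quoted
# 45% figure) is derived from two measured tokens-per-second values.
# The numbers below are hypothetical placeholders, not Nvidia's MLPerf results.

def relative_gain(new_tps: float, baseline_tps: float) -> float:
    """Percentage throughput improvement of a new system over a baseline."""
    return (new_tps / baseline_tps - 1.0) * 100.0

if __name__ == "__main__":
    gb200_tps = 1_000.0   # hypothetical baseline throughput (tokens/s per GPU)
    gb300_tps = 1_450.0   # hypothetical Blackwell Ultra throughput (tokens/s per GPU)
    print(f"Throughput gain: {relative_gain(gb300_tps, gb200_tps):.1f}%")  # -> 45.0%
```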

Part of these performance gains comes from the more capable Tensor Cores in Blackwell Ultra, with Nvidia claiming “2X the attention-layer acceleration and 1.5X more AI compute FLOPS.” However, the results were also made possible by a range of important software improvements and optimizations.
