License No. 4978/GP-TTĐT issued by the Hanoi Department of Information and Communications on 14 October 2019 / Amended and supplemented ICP License No. 2107/GP-TTĐT issued by the Hanoi Department of Information and Communications on 13 July 2022.
© 2026 Index.vn
In MLPerf Inference v6.0, NVIDIA was the only manufacturer to submit results for DeepSeek-R1, and it recorded a nine-fold lead over its nearest competitor.
MLPerf Inference v6.0, developed by MLCommons, adds support for advanced inference and mixture-of-experts (MoE) models, including DeepSeek-R1, GPT-OSS-120B, and Mixtral 8x7B. The benchmark suite also broadens to dense large language models, generative-recommendation systems, and vision-language models, reflecting enterprise use cases. CEO Jensen Huang has described MLPerf as one of the most stringent benchmarks available.
The most notable comparisons come from the GB300 NVL72 configuration, setting its v5.1 results against v6.0.
For the DeepSeek-R1 task in Server mode, throughput increased from 2,907 to 8,064 tokens per second per GPU, a 2.77x improvement.
In Offline mode, throughput rose from 5,842 to 9,821 tokens per second per GPU, a 1.68x increase.
For the Llama 3.1 405B model, Server speed increased from 170 to 259 tokens per second per GPU (1.52x). Offline performance reached 271 tokens per second per GPU versus 224 tokens per second per GPU in the previous generation (1.21x).
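The multipliers above follow directly from the quoted per-GPU throughput figures. As a sanity check (a minimal sketch, not NVIDIA's or MLCommons' tooling), the script below recomputes each speedup from the numbers in the article:

```python
# Recompute the v5.1 -> v6.0 speedups from the per-GPU throughput
# figures quoted above (tokens per second per GPU).
results = {
    "DeepSeek-R1 (Server)":     (2907, 8064),
    "DeepSeek-R1 (Offline)":    (5842, 9821),
    "Llama 3.1 405B (Server)":  (170, 259),
    "Llama 3.1 405B (Offline)": (224, 271),
}

for name, (v51, v60) in results.items():
    print(f"{name}: {v60 / v51:.2f}x")
# DeepSeek-R1 (Server): 2.77x
# DeepSeek-R1 (Offline): 1.68x
# Llama 3.1 405B (Server): 1.52x
# Llama 3.1 405B (Offline): 1.21x
```

Each ratio matches the rounded multiplier reported in the article.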
NVIDIA said the majority of the gains come from software optimizations rather than hardware changes: since its first DeepSeek-R1 submission a few months earlier, the company has improved token throughput by 2.7x through software updates alone.
On the hardware side, the GB300 NVL72 configuration delivers up to 2.77x higher throughput than the GB200 NVL72, reflecting year-on-year improvement.
NVIDIA noted that it was the only vendor to submit DeepSeek-R1 results in last year's MLPerf Inference. In v6.0, the company said this advantage remains, pointing to limited participation from other chip makers, including AMD.
NVIDIA attributed its inference performance to what it described as an extremely tight co-design across the chip, system architecture, data-center design, and software. The company also said the MLPerf Inference v6.0 results are used to demonstrate token/USD and total cost of ownership (TCO) competitiveness in large-scale deployments.
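MLPerf itself does not publish cost figures, but a token/USD metric of the kind NVIDIA refers to can be derived from measured throughput and an assumed price per GPU-hour. The sketch below is purely illustrative; the $10/GPU-hour rate is a hypothetical placeholder, not published NVIDIA or cloud pricing:

```python
# Illustrative tokens-per-dollar calculation. The hourly rate used in the
# example is a hypothetical placeholder, not a published price.
def tokens_per_dollar(tokens_per_sec_per_gpu: float, usd_per_gpu_hour: float) -> float:
    """Tokens produced per dollar of GPU time."""
    tokens_per_hour = tokens_per_sec_per_gpu * 3600
    return tokens_per_hour / usd_per_gpu_hour

# Example: the 8,064 tok/s/GPU DeepSeek-R1 Server figure at an assumed $10/GPU-hour.
print(f"{tokens_per_dollar(8064, 10.0):,.0f} tokens per dollar")
# 2,903,040 tokens per dollar
```

Because the denominator is fixed for a given deployment, software gains like the 2.77x Server-mode improvement translate one-for-one into tokens per dollar.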
