Research Note: MLPerf Inference 4.0 Results

Steve McDowell
April 3, 2024

MLCommons released the results of its MLPerf Inference v4.0 benchmarks, which introduced two new workloads, Llama 2 and Stable Diffusion XL. The benchmarks offer detailed comparisons across a variety of system configurations for specific use cases.

This Research Note takes a look at the results.

2024-04-03-RN-MLPerf-4.0-Inference Download

Related Research

Qualcomm Dragonfly

Qualcomm’s Dragonfly Data Center Portfolio

June 29, 2026

Deal

Qualcomm Acquires Modular for its Hardware-Agnostic AI Software Layer

June 25, 2026

AI Abstract

HPE Discover: Agentic Governance, Vera CPU, and Confidential Computing added to AI Factory w/ NVIDIA

June 23, 2026

Qualcomm Dragonwing IQ10

Qualcomm’s Dragonwing IQ10 Robotics Reference Design

June 9, 2026

Steve McDowell

Steve McDowell is Principal Analyst and founder of NAND Research. Steve covers all things enterprise infrastructure, with a particular emphasis on data and storage .

Disclosure: The author is an industry analyst, and NAND Research an industry analyst firm, that engages in, or has engaged in, research, analysis, and advisory services with many technology companies, which may include those mentioned in this article. The author does not hold any equity positions with any company mentioned in this article.