🔍 MLPerf Configuration Finder (ongoing preliminary work)

Find the optimal configurations for your AI workloads by specifying your model and constraints. Results are ranked by performance and include both real benchmark data and AI-generated predictions.

All configurations include a ±10% tolerance for continuous features like model size, memory capacity, etc.

Ready to search. Enter your criteria and click 'Search Configurations'.

Architecture

Model architecture type

Weight Data Type

Precision format for model weights

Model Size (billions of parameters)

Number of parameters in billions

6 671


NVIDIA B200/GB200	3.5


AMD MI300X	3.5
AMD MI325X	4.5
NVIDIA B200/GB200	7
NVIDIA H100	3
NVIDIA H200	4
NVIDIA Jetson AGX	0.3
NVIDIA L40S	1.8
NVIDIA RTX 4090	1.2

When enabled, AI will predict performance for configurations not in the benchmark database

Include AI-generated predictions

Optimization Target

Choose whether to optimize for highest performance or lowest cost per token

performance cost

Enter your requirements and click 'Search Configurations' to find suitable hardware.

Configuration Details

Configuration Details

Model Performance Metrics

Mean Absolute Percentage Error (MAPE)	0


Root Mean Squared Error (RMSE)	0
Mean Absolute Error (MAE)	0
R² Score	0
Mean Absolute Percentage Error (MAPE)	0

Top Configurations Comparison

Number of configurations to show

Adjust to see more or fewer configurations in the chart

1 100

Plot

Authors: Daniel Altunay and Grigori Fursin (FCS Labs)

🔍 MLPerf Configuration Finder (ongoing preliminary work)

Configure Device Hourly Costs

Model Performance Analysis

Top Configurations Comparison