-
- Open Source Continuous Inference Benchmark trusted by Operators of Trillion Dollar
- GigaWatt Scale Token Factories
-
-
- As the world progresses exponentially towards AGI, software development and model
- releases move at the speed of light. Existing benchmarks rapidly become obsolete
- due to their static nature, and participants often submit software images
- purpose-built for the benchmark itself which do not reflect real world
- performance.
-
-
- InferenceX™ (formerly InferenceMAX) is our independent,
- vendor neutral, reproducible benchmark which addresses these issues by
- continuously benchmarking inference software across a wide range of AI
- accelerators that are actually available to the ML community.
-
-
- Our open data & insights are widely adopted by the ML community, capacity planning
- strategy teams at trillion dollar token factories & AI Labs & at multiple billion
- dollar NeoClouds. Learn more in our articles:{' '}
-
- v1
-
- ,{' '}
-
- v2
-
- .
-
-
-
- [
- 'OpenAI',
- 'Microsoft',
- 'Together AI',
- 'vLLM',
- 'GPU Mode',
- 'PyTorch Foundation',
- 'Oracle',
- 'CoreWeave',
- 'Nebius',
- 'Crusoe',
- 'TensorWave',
- 'SGLang',
- 'WEKA',
- ].includes(q.org),
- )}
- overrides={{
- order: ['OpenAI'],
- labels: {
- 'Together AI': 'Tri Dao',
- 'PyTorch Foundation': 'PyTorch',
- },
- }}
- moreHref="/quotes"
- />
-
-
-