LLM Performance Leaderboard
Compare large language model (LLM) performance on NVIDIA DGX Spark hardware. Real benchmarks, real hardware, real results.
Real Performance
Benchmarks from actual llama-benchy runs on DGX Spark hardware
Multiple Runtimes
Compare results across the vLLM, SGLang, TensorRT-LLM, and llama.cpp runtimes
Full Transparency
View complete recipes, configurations, and detailed benchmark results
Built on Open Source
This leaderboard is powered by community-developed tools. Support the projects that make this possible!