Spark Arena - LLM Leaderboard

LLM Performance Leaderboard

Compare Large Language Model performance on NVIDIA DGX Spark infrastructure. Real benchmarks, real hardware, real results.

Benchmarks from actual llama-benchy runs on DGX Spark hardware

Compare vLLM, SGLang, TensorRT-LLM, and llama.cpp implementations

View complete recipes, configurations, and detailed benchmark results

Built on Open Source

This leaderboard is powered by community-developed tools. Support the projects that make this possible!