As you can see all LLM benchmarks are normalized between 0 and 100. The models are already near 100. What's next? Answer: performance growth: reaching 1Ktok/s barrier and growing further, decreasing token cost. All this will lead to specialized processors for LLM and a new n, as it was with the PC revolution
