Settings

Theme

Ask HN: How to easily benchmark LLM without elaborate setup?

3 points by mrg3_2013 2 years ago · 1 comment · 1 min read

Reader

Is there a service that allows for various LLMs (I am mainly interested in completion APIs), open sourced or paid, to be called on a per token pricing (i.e no infra setup) ? Anyone looking into providing a wrapper for all LLMs essentially ?

agcat 2 years ago

You can also find the same if you know the tokens/sec for different input and tokens variation.

In case you are interested to see results for speed ( tokens/second)

I Ran some tests between LLama2 7Bn, Gemma 7Bn, Mistral 7Bn to compare tokens/second on 6 different libraries with 5 different input tokens range (20 to 5000) and three different output tokens (100,200 and 500) on A100.

These are the results : https://inferless.com/learn/exploring-llms-speed-benchmarks-...

Keyboard Shortcuts

j
Next item
k
Previous item
o / Enter
Open selected item
?
Show this help
Esc
Close modal / clear selection