Settings

Theme

cerebras: 450 tokens/sec llama 3.1 70B

theregister.com

7 points by davidfiala a year ago · 2 comments

Reader

IronWolve a year ago

Cerebras fails the "how many r's in strawberry" test. Grok is the only one who passed that test.

Going to be interesting to see the speed and accuracy keep increasing, cant imagine how fast/accurate things will be in a decade. Cant wait.

davidfialaOP a year ago

- 1,800tps on llama 3.1 8B

- 450tps on llama 3.1 70B

free chat interface is at: https://inference.cerebras.ai (requires login)

Keyboard Shortcuts

j
Next item
k
Previous item
o / Enter
Open selected item
?
Show this help
Esc
Close modal / clear selection