Llama3 on Groq

35 points by matanyall 2 years ago · 8 comments

Oras 2 years ago

That's impressive. I asked it to summarise an article in 5 bullet points, and the output speed was 812.81 T/s on Llama 3 8B.

frozenport 2 years ago

Llama 3 looks particularly good at tool calling

Groq's low latency is particularly good for tool calling

Seems like two techs that will make coding obsolete :-)

Alifatisk 2 years ago

Is the Python lib open-source? I could only find the JS lib for Groq.

WhatsName 2 years ago

What is the cost per million tokens for Llama 3 70B on Groq?

jacooper 2 years ago

When is Mixtral 8x22b coming?
