Llama3 on Groq
groq.comThat's impressive. I asked to summarise an article in 5 bullet points, and the output was 812.81 T/s on Llama 3 8B.
LLama3 looks particularly good at tool calling
Groq's low latency is particularly good for tool calling
Seems like two techs that will make coding obsolete :-)
Is the python lib open-source? I could only find the ja lib for Groq.
What is tbe cost per Mio. Token for llama3 70b on groq?
$0.59 input, $0.79 output (https://wow.groq.com/)
When is Mixtral 8x22b coming?